1
|
Yue Y, Fan H, Zhao J, Xia J. Protein language model-based prediction for plant miRNA encoded peptides. PeerJ Comput Sci 2025; 11:e2733. [PMID: 40134870 PMCID: PMC11935769 DOI: 10.7717/peerj-cs.2733] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2024] [Accepted: 02/05/2025] [Indexed: 03/27/2025]
Abstract
Plant miRNA encoded peptides (miPEPs), which are short peptides derived from small open reading frames within primary miRNAs, play a crucial role in regulating diverse plant traits. Plant miPEPs identification is challenging due to limitations in the available number of known miPEPs for training. Existing prediction methods rely on manually encoded features, including miPEPPred-FRL, to infer plant miPEPs. Recent advances in deep learning modeling of protein sequences provide an opportunity to improve the representation of key features, leveraging large datasets of protein sequences. In this study, we propose an accurate prediction model, called pLM4PEP, which integrates ESM2 peptide embedding with machine learning methods. Our model not only demonstrates precise identification capabilities for plant miPEPs, but also achieves remarkable results across diverse datasets that include other bioactive peptides. The source codes, datasets of pLM4PEP are available at https://github.com/xialab-ahu/pLM4PEP.
Collapse
Affiliation(s)
- Yishan Yue
- College of Mathematics and System Science, Xinjiang University, Urumqi, Xinjiang, China
| | - Henghui Fan
- Institutes of Physical Science and Information Technology, Anhui University, Hefei, Anhui, China
| | - Jianping Zhao
- College of Mathematics and System Science, Xinjiang University, Urumqi, Xinjiang, China
| | - Junfeng Xia
- Institutes of Physical Science and Information Technology, Anhui University, Hefei, Anhui, China
| |
Collapse
|
2
|
Zhou J, Zhang R, Han Q, Yang H, Wang W, Wang Y, Zheng X, Luo F, Cai G, Zhang Y. Identification of multiple miRNA-encoded peptide reveals OsmiPEP162a putatively stabilizes OsMIR162 in rice. PLANT CELL REPORTS 2024; 44:9. [PMID: 39708129 DOI: 10.1007/s00299-024-03380-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/25/2024] [Accepted: 11/13/2024] [Indexed: 12/23/2024]
Abstract
KEY MESSAGE MiPEPs regulate growth, development and stress response. Identification of rice miPEPs plays a crucial role in elucidation of molecular functions of rice miPEPs and rice genetic improvement. MicroRNAs (miRNAs) are derivatives of primary miRNAs (pri-miRNAs) and govern the expression of target genes. Plant pri-miRNAs encode regulatory peptides known as miPEPs, which specifically boost the transcription of their originating pri-miRNA. Although there are hundreds of pri-miRNAs in rice, research on whether they encode functional peptides is limited. In this study, we identified 10 miPEPs using a transient protoplast expression system. Among these, we focused our attention on OsmiPEP162a, which influences growth. OsmiPEP162a-edited plants exhibited reduced plant height, similar to mature OsmiR162-edited plants. Transcriptome-focused molecular analysis unveiled significant alterations in transcription profiles following the depletion of OsmiPEP162a. In addition, knocking out OsmiPEP162a led to decreased expression levels of mature OsMIR162a and OsMIR162b. This study suggests that OsmiPEP162a potentially plays a crucial role in stabilizing mature OsMIR162.
Collapse
Affiliation(s)
- Jianping Zhou
- Department of Biotechnology, School of Life Sciences and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China.
| | - Rui Zhang
- Department of Biotechnology, School of Life Sciences and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Qinqin Han
- Department of Biotechnology, School of Life Sciences and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Hongjun Yang
- Department of Biotechnology, School of Life Sciences and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Wei Wang
- Department of Biotechnology, School of Life Sciences and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Yibo Wang
- Department of Biotechnology, School of Life Sciences and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Xuelian Zheng
- Department of Biotechnology, School of Life Sciences and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
- Integrative Science Center of Germplasm Creation in Western China (Chongqing) Science City, Chongqing Key Laboratory of Plant Resource Conservation and Germplasm Innovation, School of Life Sciences, Southwest University, Chongqing, 400715, China
| | - Fan Luo
- Xichang University, Xichang, 615013, Sichuan, China
| | - Guangze Cai
- Xichang University, Xichang, 615013, Sichuan, China
| | - Yong Zhang
- Department of Biotechnology, School of Life Sciences and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China.
- Integrative Science Center of Germplasm Creation in Western China (Chongqing) Science City, Chongqing Key Laboratory of Plant Resource Conservation and Germplasm Innovation, School of Life Sciences, Southwest University, Chongqing, 400715, China.
| |
Collapse
|
3
|
Guclu TF, Atilgan AR, Atilgan C. Deciphering GB1's Single Mutational Landscape: Insights from MuMi Analysis. J Phys Chem B 2024; 128:7987-7996. [PMID: 39115184 PMCID: PMC11671028 DOI: 10.1021/acs.jpcb.4c04916] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2024] [Revised: 08/02/2024] [Accepted: 08/02/2024] [Indexed: 08/23/2024]
Abstract
Mutational changes that affect the binding of the C2 fragment of Streptococcal protein G (GB1) to the Fc domain of human IgG (IgG-Fc) have been extensively studied using deep mutational scanning (DMS), and the binding affinity of all single mutations has been measured experimentally in the literature. To investigate the underlying molecular basis, we perform in silico mutational scanning for all possible single mutations, along with 2 μs-long molecular dynamics (WT-MD) of the wild-type (WT) GB1 in both unbound and IgG-Fc bound forms. We compute the hydrogen bonds between GB1 and IgG-Fc in WT-MD to identify the dominant hydrogen bonds for binding, which we then assess in conformations produced by Mutation and Minimization (MuMi) to explain the fitness landscape of GB1 and IgG-Fc binding. Furthermore, we analyze MuMi and WT-MD to investigate the dynamics of binding, focusing on the relative solvent accessibility of residues and the probability of residues being located at the binding interface. With these analyses, we explain the interactions between GB1 and IgG-Fc and display the structural features of binding. In sum, our findings highlight the potential of MuMi as a reliable and computationally efficient tool for predicting protein fitness landscapes, offering significant advantages over traditional methods. The methodologies and results presented in this study pave the way for improved predictive accuracy in protein stability and interaction studies, which are crucial for advancements in drug design and synthetic biology.
Collapse
Affiliation(s)
- Tandac F. Guclu
- Faculty of Natural Sciences
and Engineering, Sabanci University, Tuzla, Istanbul 34956, Turkey
| | - Ali Rana Atilgan
- Faculty of Natural Sciences
and Engineering, Sabanci University, Tuzla, Istanbul 34956, Turkey
| | - Canan Atilgan
- Faculty of Natural Sciences
and Engineering, Sabanci University, Tuzla, Istanbul 34956, Turkey
| |
Collapse
|
4
|
Merz KM, Wei GW, Zhu F. Editorial: Machine Learning in Bio-cheminformatics. J Chem Inf Model 2024; 64:2125-2128. [PMID: 38587006 DOI: 10.1021/acs.jcim.4c00444] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/09/2024]
Affiliation(s)
- Kenneth M Merz
- Department of Chemistry, Michigan State University, Lansing 48824, Michigan, United States
| | - Guo-Wei Wei
- Department of Mathematics, Michigan State University, Lansing 48824, Michigan, United States
| | - Feng Zhu
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China
| |
Collapse
|