1
|
Gu X, Li L, Li S, Shi W, Zhong X, Su Y, Wang T. Adaptive evolution and co-evolution of chloroplast genomes in Pteridaceae species occupying different habitats: overlapping residues are always highly mutated. BMC PLANT BIOLOGY 2023; 23:511. [PMID: 37880608 PMCID: PMC10598918 DOI: 10.1186/s12870-023-04523-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/11/2023] [Accepted: 10/13/2023] [Indexed: 10/27/2023]
Abstract
BACKGROUND The evolution of protein residues depends on the mutation rates of their encoding nucleotides, but it may also be affected by co-evolution with other residues. Chloroplasts function as environmental sensors, transforming fluctuating environmental signals into different physiological responses. We reasoned that habitat diversity may affect their rate and mode of evolution, which might be evidenced in the chloroplast genome. The Pteridaceae family of ferns occupy an unusually broad range of ecological niches, which provides an ideal system for analysis. RESULTS We conducted adaptive evolution and intra-molecular co-evolution analyses of Pteridaceae chloroplast DNAs (cpDNAs). The results indicate that the residues undergoing adaptive evolution and co-evolution were mostly independent, with only a few residues being simultaneously involved in both processes, and these overlapping residues tend to exhibit high mutations. Additionally, our data showed that Pteridaceae chloroplast genes are under purifying selection. Regardless of whether we grouped species by lineage (which corresponded with ecological niches), we determined that positively selected residues mainly target photosynthetic genes. CONCLUSIONS Our work provides evidence for the adaptive evolution of Pteridaceae cpDNAs, especially photosynthetic genes, to different habitats and sheds light on the adaptive evolution and co-evolution of proteins.
Collapse
Affiliation(s)
- Xiaolin Gu
- College of Life Sciences, South China Agricultural University, Guangzhou, 510642, China
| | - Lingling Li
- College of Life Sciences, South China Agricultural University, Guangzhou, 510642, China
| | - Sicong Li
- College of Natural Resources and Environment, South China Agricultural University, Guangzhou, 510642, China
| | - Wanxin Shi
- College of Life Sciences, South China Agricultural University, Guangzhou, 510642, China
| | - Xiaona Zhong
- College of Life Sciences, South China Agricultural University, Guangzhou, 510642, China
| | - Yingjuan Su
- School of Life Sciences, Sun Yat-sen University, Guangzhou, 510275, China.
- Research Institute of Sun Yat-sen University in Shenzhen, Shenzhen, 518057, China.
| | - Ting Wang
- College of Life Sciences, South China Agricultural University, Guangzhou, 510642, China.
| |
Collapse
|
2
|
Karamanos TK. Chasing long-range evolutionary couplings in the AlphaFold era. Biopolymers 2023; 114:e23530. [PMID: 36752285 PMCID: PMC10909459 DOI: 10.1002/bip.23530] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2022] [Revised: 01/26/2023] [Accepted: 01/27/2023] [Indexed: 02/09/2023]
Abstract
Coevolution between protein residues is normally interpreted as direct contact. However, the evolutionary record of a protein sequence contains rich information that may include long-range functional couplings, couplings that report on homo-oligomeric states or even conformational changes. Due to the complexity of the sequence space and the lack of structural information on various members of a protein family, it has been difficult to effectively mine the additional information encoded in a multiple sequence alignment (MSA). Here, taking advantage of the recent release of the AlphaFold (AF) database we attempt to identify coevolutionary couplings that cannot be explained simply by spatial proximity. We propose a simple computational method that performs direct coupling analysis on a MSA and searches for couplings that are not satisfied in any of the AF models of members of the identified protein family. Application of this method on 2012 protein families suggests that ~12% of the total identified coevolving residue pairs are spatially distant and more likely to be disordered than their contacting counterparts. We expect that this analysis will help improve the quality of coevolutionary distance restraints used for structure determination and will be useful in identifying potentially functional/allosteric cross-talk between distant residues.
Collapse
|
3
|
Mukherjee I, Chakrabarti S. Co-evolutionary landscape at the interface and non-interface regions of protein-protein interaction complexes. Comput Struct Biotechnol J 2021; 19:3779-3795. [PMID: 34285778 PMCID: PMC8271121 DOI: 10.1016/j.csbj.2021.06.039] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2021] [Revised: 06/22/2021] [Accepted: 06/22/2021] [Indexed: 11/16/2022] Open
Abstract
Proteins involved in interactions throughout the course of evolution tend to co-evolve and compensatory changes may occur in interacting proteins to maintain or refine such interactions. However, certain residue pair alterations may prove to be detrimental for functional interactions. Hence, determining co-evolutionary pairings that could be structurally or functionally relevant for maintaining the conservation of an inter-protein interaction is important. Inter-protein co-evolution analysis in several complexes utilizing multiple existing methodologies suggested that co-evolutionary pairings can occur in spatially proximal and distant regions in inter-protein interactions. Subsequently, the Co-Var (Correlated Variation) method based on mutual information and Bhattacharyya coefficient was developed, validated, and found to perform relatively better than CAPS and EV-complex. Interestingly, while applying the Co-Var measure and EV-complex program on a set of protein-protein interaction complexes, co-evolutionary pairings were obtained in interface and non-interface regions in protein complexes. The Co-Var approach involves determining high degree co-evolutionary pairings that include multiple co-evolutionary connections between particular co-evolved residue positions in one protein with multiple residue positions in the binding partner. Detailed analyses of high degree co-evolutionary pairings in protein-protein complexes involved in cancer metastasis suggested that most of the residue positions forming such co-evolutionary connections mainly occurred within functional domains of constituent proteins and substitution mutations were also common among these positions. The physiological relevance of these predictions suggested that Co-Var can predict residues that could be crucial for preserving functional protein-protein interactions. Finally, Co-Var web server (http://www.hpppi.iicb.res.in/ishi/covar/index.html) that implements this methodology identifies co-evolutionary pairings in intra and inter-protein interactions.
Collapse
Affiliation(s)
- Ishita Mukherjee
- Structural Biology and Bioinformatics Division, Council for Scientific and Industrial Research (CSIR) - Indian Institute of Chemical Biology (IICB), Kolkata, West Bengal 700032, India
| | - Saikat Chakrabarti
- Structural Biology and Bioinformatics Division, Council for Scientific and Industrial Research (CSIR) - Indian Institute of Chemical Biology (IICB), Kolkata, West Bengal 700032, India
| |
Collapse
|
4
|
Wu Z, Liu H, Xu L, Chen HF, Feng Y. Algorithm-based coevolution network identification reveals key functional residues of the α/β hydrolase subfamilies. FASEB J 2020; 34:1983-1995. [PMID: 31907985 DOI: 10.1096/fj.201900948rr] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2019] [Revised: 10/02/2019] [Accepted: 10/21/2019] [Indexed: 11/11/2022]
Abstract
Covariant residues identified by computational algorithms have provided new insights into enzyme evolutionary routes. However, the reliability and accuracy of routine statistical coupling analysis (SCA) are unable to satisfy the needs of protein engineering because SCA depends only on sequence information. Here, we set up a new SCA algorithm, SCA.SIM, by integrating structure information and MD simulation data. The more reliable covariant residues with high-quality scores are obtained from sequence alignment weighted by residual movement for eight related subfamilies, belonging to α/β hydrolase family, with Candida antarctica lipase B (CALB). The 38 predicted covariant residues are tested for function by high-throughput quantitative evaluation in combination with activity and thermostability assays of a mutant library and deep sequencing. Based on the landscapes of both activity and thermostability, most mutants play key roles in catalysis, and some mutants gain 2.4- to 6-fold increase in half-life at 50°C and 9- to 12-fold improvement in catalytic efficiency. The activity of double mutants for A225F/T103A is higher than those of A225F and T103A which means that SCA.SIM method might be useful for identifying the allosteric coupling. The SCA.SIM algorithm can be used for protein coevolution and enzyme engineering research.
Collapse
Affiliation(s)
- Zhiyun Wu
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic and Developmental Sciences, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| | - Hao Liu
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic and Developmental Sciences, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| | - Lishi Xu
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic and Developmental Sciences, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| | - Hai-Feng Chen
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic and Developmental Sciences, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China.,Shanghai Center for Bioinformation Technology, Shanghai, China
| | - Yan Feng
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic and Developmental Sciences, National Experimental Teaching Center for Life Sciences and Biotechnology, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| |
Collapse
|
5
|
Yang Y, Gao J, Wang J, Heffernan R, Hanson J, Paliwal K, Zhou Y. Sixty-five years of the long march in protein secondary structure prediction: the final stretch? Brief Bioinform 2018; 19:482-494. [PMID: 28040746 PMCID: PMC5952956 DOI: 10.1093/bib/bbw129] [Citation(s) in RCA: 89] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2016] [Revised: 11/15/2016] [Indexed: 11/13/2022] Open
Abstract
Protein secondary structure prediction began in 1951 when Pauling and Corey predicted helical and sheet conformations for protein polypeptide backbone even before the first protein structure was determined. Sixty-five years later, powerful new methods breathe new life into this field. The highest three-state accuracy without relying on structure templates is now at 82-84%, a number unthinkable just a few years ago. These improvements came from increasingly larger databases of protein sequences and structures for training, the use of template secondary structure information and more powerful deep learning techniques. As we are approaching to the theoretical limit of three-state prediction (88-90%), alternative to secondary structure prediction (prediction of backbone torsion angles and Cα-atom-based angles and torsion angles) not only has more room for further improvement but also allows direct prediction of three-dimensional fragment structures with constantly improved accuracy. About 20% of all 40-residue fragments in a database of 1199 non-redundant proteins have <6 Å root-mean-squared distance from the native conformations by SPIDER2. More powerful deep learning methods with improved capability of capturing long-range interactions begin to emerge as the next generation of techniques for secondary structure prediction. The time has come to finish off the final stretch of the long march towards protein secondary structure prediction.
Collapse
Affiliation(s)
- Yuedong Yang
- Insitute for Glycomics and School of Information and Communication Technology, Griffith University, Parklands Drive, Southport, QLD, Australia
| | - Jianzhao Gao
- School of Mathematical Sciences and LPMC, Nankai University, Tianjin, China
| | - Jihua Wang
- Shandong Provincial Key Laboratory of Biophysics, Institute of Biophysics, Dezhou University, Dezhou, China
| | - Rhys Heffernan
- Signal Processing Laboratory, Griffith University, Brisbane, Australia
| | - Jack Hanson
- Signal Processing Laboratory, Griffith University, Brisbane, Australia
| | - Kuldip Paliwal
- Signal Processing Laboratory, Griffith University, Brisbane, Australia
| | - Yaoqi Zhou
- Insitute for Glycomics and School of Information and Communication Technology, Griffith University, Parklands Drive, Southport, QLD, Australia
- Shandong Provincial Key Laboratory of Biophysics, Institute of Biophysics, Dezhou University, Dezhou, China
| |
Collapse
|
6
|
Origins of coevolution between residues distant in protein 3D structures. Proc Natl Acad Sci U S A 2017; 114:9122-9127. [PMID: 28784799 DOI: 10.1073/pnas.1702664114] [Citation(s) in RCA: 129] [Impact Index Per Article: 16.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Residue pairs that directly coevolve in protein families are generally close in protein 3D structures. Here we study the exceptions to this general trend-directly coevolving residue pairs that are distant in protein structures-to determine the origins of evolutionary pressure on spatially distant residues and to understand the sources of error in contact-based structure prediction. Over a set of 4,000 protein families, we find that 25% of directly coevolving residue pairs are separated by more than 5 Å in protein structures and 3% by more than 15 Å. The majority (91%) of directly coevolving residue pairs in the 5-15 Å range are found to be in contact in at least one homologous structure-these exceptions arise from structural variation in the family in the region containing the residues. Thirty-five percent of the exceptions greater than 15 Å are at homo-oligomeric interfaces, 19% arise from family structural variation, and 27% are in repeat proteins likely reflecting alignment errors. Of the remaining long-range exceptions (<1% of the total number of coupled pairs), many can be attributed to close interactions in an oligomeric state. Overall, the results suggest that directly coevolving residue pairs not in repeat proteins are spatially proximal in at least one biologically relevant protein conformation within the family; we find little evidence for direct coupling between residues at spatially separated allosteric and functional sites or for increased direct coupling between residue pairs on putative allosteric pathways connecting them.
Collapse
|
7
|
Evolution acting on the same target, but at multiple levels: Proteins as the test case. J Biosci 2017; 42:1-3. [DOI: 10.1007/s12038-017-9672-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
8
|
Khwaja A, Galilee M, Marx A, Alian A. Structure of FIV capsid C-terminal domain demonstrates lentiviral evasion of genetic fragility by coevolved substitutions. Sci Rep 2016; 6:24957. [PMID: 27102180 PMCID: PMC4840305 DOI: 10.1038/srep24957] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2015] [Accepted: 04/08/2016] [Indexed: 12/22/2022] Open
Abstract
Viruses use a strategy of high mutational rates to adapt to environmental and therapeutic pressures, circumventing the deleterious effects of random single-point mutations by coevolved compensatory mutations, which restore protein fold, function or interactions damaged by initial ones. This mechanism has been identified as contributing to drug resistance in the HIV-1 Gag polyprotein and especially its capsid proteolytic product, which forms the viral capsid core and plays multifaceted roles in the viral life cycle. Here, we determined the X-ray crystal structure of C-terminal domain of the feline immunodeficiency virus (FIV) capsid and through interspecies analysis elucidate the structural basis of co-evolutionarily and spatially correlated substitutions in capsid sequences, which when otherwise uncoupled and individually substituted into HIV-1 capsid impair virion assembly and infectivity. The ability to circumvent the deleterious effects of single amino acid substitutions by cooperative secondary substitutions allows mutational flexibility that may afford viruses an important survival advantage. The potential of such interspecies structural analysis for preempting viral resistance by identifying such alternative but functionally equivalent patterns is discussed.
Collapse
Affiliation(s)
- Aya Khwaja
- Faculty of Biology, Technion - Israel Institute of Technology, Haifa 320003, Israel
| | - Meytal Galilee
- Faculty of Biology, Technion - Israel Institute of Technology, Haifa 320003, Israel
| | - Ailie Marx
- Faculty of Biology, Technion - Israel Institute of Technology, Haifa 320003, Israel
| | - Akram Alian
- Faculty of Biology, Technion - Israel Institute of Technology, Haifa 320003, Israel
| |
Collapse
|