Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Saraswathi S, Fernández-Martínez JL, Kolinski A, Jernigan RL, Kloczkowski A. Fast learning optimized prediction methodology (FLOPRED) for protein secondary structure prediction. J Mol Model 2012;18:4275-89. [PMID: 22562230 PMCID: PMC3694724 DOI: 10.1007/s00894-012-1410-7] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2012] [Accepted: 03/19/2012] [Indexed: 10/28/2022]

For:	Saraswathi S, Fernández-Martínez JL, Kolinski A, Jernigan RL, Kloczkowski A. Fast learning optimized prediction methodology (FLOPRED) for protein secondary structure prediction. J Mol Model 2012;18:4275-89. [PMID: 22562230 PMCID: PMC3694724 DOI: 10.1007/s00894-012-1410-7] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2012] [Accepted: 03/19/2012] [Indexed: 10/28/2022]

Number

Cited by Other Article(s)

Rashid S, Sundaram S, Kwoh CK. Empirical Study of Protein Feature Representation on Deep Belief Networks Trained With Small Data for Secondary Structure Prediction. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023;20:955-966. [PMID: 35439138 DOI: 10.1109/tcbb.2022.3168676] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]

Akbar S, Pardasani KR, Panda NR. PSO Based Neuro-fuzzy Model for Secondary Structure Prediction of Protein. Neural Process Lett 2021. [DOI: 10.1007/s11063-021-10615-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]

Krieger S, Kececioglu J. Boosting the accuracy of protein secondary structure prediction through nearest neighbor search and method hybridization. Bioinformatics 2021;36:i317-i325. [PMID: 32657384 PMCID: PMC7355242 DOI: 10.1093/bioinformatics/btaa336] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

Agrawal S, Ransom RF, Saraswathi S, Garcia-Gonzalo E, Webb A, Fernandez-Martinez JL, Popovic M, Guess AJ, Kloczkowski A, Benndorf R, Sadee W, Smoyer WE, on behalf of the Pediatric Nephrology Research Consortium (PNRC). Sulfatase 2 Is Associated with Steroid Resistance in Childhood Nephrotic Syndrome. J Clin Med 2021;10:523. [PMID: 33540508 PMCID: PMC7867139 DOI: 10.3390/jcm10030523] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2020] [Revised: 01/20/2021] [Accepted: 01/23/2021] [Indexed: 01/17/2023] Open

Affiliation(s)

Shipra Agrawal Center for Clinical and Translational Research, Abigail Wexner Research Institute at Nationwide Children’s Hospital, Columbus, OH 43205, USA; (R.F.R.); (M.P.); (A.J.G.); (R.B.) Department of Pediatrics, The Ohio State University College of Medicine, Columbus, OH 43210, USA;
Richard F. Ransom Center for Clinical and Translational Research, Abigail Wexner Research Institute at Nationwide Children’s Hospital, Columbus, OH 43205, USA; (R.F.R.); (M.P.); (A.J.G.); (R.B.) Department of Pediatrics, The Ohio State University College of Medicine, Columbus, OH 43210, USA;
Saras Saraswathi Battelle Center for Mathematical Medicine at Abigail Wexner Research Institute at Nationwide Children’s Hospital, Columbus, OH 43205, USA;
Esperanza Garcia-Gonzalo Department of Mathematics, University of Oviedo, 33033 Oviedo, Spain; (E.G.-G.); (J.L.F.-M.)
Amy Webb Department of Biomedical Informatics, The Ohio State University College of Medicine, Columbus, OH 43210, USA;
Juan L. Fernandez-Martinez Department of Mathematics, University of Oviedo, 33033 Oviedo, Spain; (E.G.-G.); (J.L.F.-M.)
Milan Popovic Center for Clinical and Translational Research, Abigail Wexner Research Institute at Nationwide Children’s Hospital, Columbus, OH 43205, USA; (R.F.R.); (M.P.); (A.J.G.); (R.B.)
Adam J. Guess Center for Clinical and Translational Research, Abigail Wexner Research Institute at Nationwide Children’s Hospital, Columbus, OH 43205, USA; (R.F.R.); (M.P.); (A.J.G.); (R.B.)
Andrzej Kloczkowski Department of Pediatrics, The Ohio State University College of Medicine, Columbus, OH 43210, USA; Battelle Center for Mathematical Medicine at Abigail Wexner Research Institute at Nationwide Children’s Hospital, Columbus, OH 43205, USA;
Rainer Benndorf Center for Clinical and Translational Research, Abigail Wexner Research Institute at Nationwide Children’s Hospital, Columbus, OH 43205, USA; (R.F.R.); (M.P.); (A.J.G.); (R.B.) Department of Pediatrics, The Ohio State University College of Medicine, Columbus, OH 43210, USA;
Wolfgang Sadee Department of Cancer Biology and Genetics, Center for Pharmacogenomics, The Ohio State University College of Medicine, Columbus, OH 43210, USA;
William E. Smoyer Center for Clinical and Translational Research, Abigail Wexner Research Institute at Nationwide Children’s Hospital, Columbus, OH 43205, USA; (R.F.R.); (M.P.); (A.J.G.); (R.B.) Department of Pediatrics, The Ohio State University College of Medicine, Columbus, OH 43210, USA;
on behalf of the Pediatric Nephrology Research Consortium (PNRC)

Collapse

Prediction of Protein Tertiary Structure via Regularized Template Classification Techniques. Molecules 2020;25:molecules25112467. [PMID: 32466409 PMCID: PMC7321371 DOI: 10.3390/molecules25112467] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2020] [Revised: 05/21/2020] [Accepted: 05/22/2020] [Indexed: 11/24/2022] Open

Álvarez Ó, Fernández-Martínez JL, Corbeanu AC, Fernández-Muñiz Z, Kloczkowski A. Predicting protein tertiary structure and its uncertainty analysis via particle swarm sampling. J Mol Model 2019;25:79. [PMID: 30810816 PMCID: PMC7586042 DOI: 10.1007/s00894-019-3956-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2018] [Accepted: 02/05/2019] [Indexed: 10/27/2022]

Yang Y, Gao J, Wang J, Heffernan R, Hanson J, Paliwal K, Zhou Y. Sixty-five years of the long march in protein secondary structure prediction: the final stretch? Brief Bioinform 2018;19:482-494. [PMID: 28040746 PMCID: PMC5952956 DOI: 10.1093/bib/bbw129] [Citation(s) in RCA: 89] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2016] [Revised: 11/15/2016] [Indexed: 11/13/2022] Open

Álvarez Ó, Fernández-Martínez JL, Fernández-Brillet C, Cernea A, Fernández-Muñiz Z, Kloczkowski A. Principal component analysis in protein tertiary structure prediction. J Bioinform Comput Biol 2018;16:1850005. [PMID: 29566640 DOI: 10.1142/s0219720018500051] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]

Abstract

We discuss applicability of principal component analysis (PCA) for protein tertiary structure prediction from amino acid sequence. The algorithm presented in this paper belongs to the category of protein refinement models and involves establishing a low-dimensional space where the sampling (and optimization) is carried out via particle swarm optimizer (PSO). The reduced space is found via PCA performed for a set of low-energy protein models previously found using different optimization techniques. A high frequency term is added into this expansion by projecting the best decoy into the PCA basis set and calculating the residual model. This term is aimed at providing high frequency details in the energy optimization. The goal of this research is to analyze how the dimensionality reduction affects the prediction capability of the PSO procedure. For that purpose, different proteins from the Critical Assessment of Techniques for Protein Structure Prediction experiments were modeled. In all the cases, both the energy of the best decoy and the distance to the native structure have decreased. Our analysis also shows how the predicted backbone structure of native conformation and of alternative low energy states varies with respect to the PCA dimensionality. Generally speaking, the reconstruction can be successfully achieved with 10 principal components and the high frequency term. We also provide a computational analysis of protein energy landscape for the inverse problem of reconstructing structure from the reduced number of principal components, showing that the dimensionality reduction alleviates the ill-posed character of this high-dimensional energy optimization problem. The procedure explained in this paper is very fast and allows testing different PCA expansions. Our results show that PSO improves the energy of the best decoy used in the PCA when the adequate number of PCA terms is considered.

Collapse

Protein secondary structure prediction: A survey of the state of the art. J Mol Graph Model 2017;76:379-402. [DOI: 10.1016/j.jmgm.2017.07.015] [Citation(s) in RCA: 50] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2017] [Revised: 07/14/2017] [Accepted: 07/17/2017] [Indexed: 11/21/2022]

Rashid S, Saraswathi S, Kloczkowski A, Sundaram S, Kolinski A. Protein secondary structure prediction using a small training set (compact model) combined with a Complex-valued neural network approach. BMC Bioinformatics 2016;17:362. [PMID: 27618812 PMCID: PMC5020447 DOI: 10.1186/s12859-016-1209-0] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2015] [Accepted: 08/25/2016] [Indexed: 11/17/2022] Open

Abstract

BACKGROUND

Protein secondary structure prediction (SSP) has been an area of intense research interest. Despite advances in recent methods conducted on large datasets, the estimated upper limit accuracy is yet to be reached. Since the predictions of SSP methods are applied as input to higher-level structure prediction pipelines, even small errors may have large perturbations in final models. Previous works relied on cross validation as an estimate of classifier accuracy. However, training on large numbers of protein chains compromises the classifier ability to generalize to new sequences. This prompts a novel approach to training and an investigation into the possible structural factors that lead to poor predictions. Here, a small group of 55 proteins termed the compact model is selected from the CB513 dataset using a heuristics-based approach. In a prior work, all sequences were represented as probability matrices of residues adopting each of Helix, Sheet and Coil states, based on energy calculations using the C-Alpha, C-Beta, Side-chain (CABS) algorithm. The functional relationship between the conformational energies computed with CABS force-field and residue states is approximated using a classifier termed the Fully Complex-valued Relaxation Network (FCRN). The FCRN is trained with the compact model proteins.

RESULTS

The performance of the compact model is compared with traditional cross-validated accuracies and blind-tested on a dataset of G Switch proteins, obtaining accuracies of ∼81 %. The model demonstrates better results when compared to several techniques in the literature. A comparative case study of the worst performing chain identifies hydrogen bond contacts that lead to Coil ⇔ Sheet misclassifications. Overall, mispredicted Coil residues have a higher propensity to participate in backbone hydrogen bonding than correctly predicted Coils.

CONCLUSIONS

The implications of these findings are: (i) the choice of training proteins is important in preserving the generalization of a classifier to predict new sequences accurately and (ii) SSP techniques sensitive in distinguishing between backbone hydrogen bonding and side-chain or water-mediated hydrogen bonding might be needed in the reduction of Coil ⇔ Sheet misclassifications.

Collapse

Patel MS, Mazumdar HS. Knowledge base and neural network approach for protein secondary structure prediction. J Theor Biol 2014;361:182-9. [DOI: 10.1016/j.jtbi.2014.08.005] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2013] [Revised: 08/01/2014] [Accepted: 08/04/2014] [Indexed: 10/24/2022]

Huang G, Huang GB, Song S, You K. Trends in extreme learning machines: a review. Neural Netw 2014;61:32-48. [PMID: 25462632 DOI: 10.1016/j.neunet.2014.10.001] [Citation(s) in RCA: 487] [Impact Index Per Article: 44.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2014] [Revised: 08/25/2014] [Accepted: 10/02/2014] [Indexed: 01/29/2023]

Cartwright H, Curteanu S. Neural Networks Applied in Chemistry. II. Neuro-Evolutionary Techniques in Process Modeling and Optimization. Ind Eng Chem Res 2013. [DOI: 10.1021/ie4000954] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Saraswathi S, Fernández-Martínez JL, Koliński A, Jernigan RL, Kloczkowski A. Distributions of amino acids suggest that certain residue types more effectively determine protein secondary structure. J Mol Model 2013;19:4337-48. [PMID: 23907551 DOI: 10.1007/s00894-013-1911-z] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2013] [Accepted: 06/05/2013] [Indexed: 11/27/2022]

Abstract

Exponential growth in the number of available protein sequences is unmatched by the slower growth in the number of structures. As a result, the development of efficient and fast protein secondary structure prediction methods is essential for the broad comprehension of protein structures. Computational methods that can efficiently determine secondary structure can in turn facilitate protein tertiary structure prediction, since most methods rely initially on secondary structure predictions. Recently, we have developed a fast learning optimized prediction methodology (FLOPRED) for predicting protein secondary structure (Saraswathi et al. in JMM 18:4275, 2012). Data are generated by using knowledge-based potentials combined with structure information from the CATH database. A neural network-based extreme learning machine (ELM) and advanced particle swarm optimization (PSO) are used with this data to obtain better and faster convergence to more accurate secondary structure predicted results. A five-fold cross-validated testing accuracy of 83.8 % and a segment overlap (SOV) score of 78.3 % are obtained in this study. Secondary structure predictions and their accuracy are usually presented for three secondary structure elements: α-helix, β-strand and coil but rarely have the results been analyzed with respect to their constituent amino acids. In this paper, we use the results obtained with FLOPRED to provide detailed behaviors for different amino acid types in the secondary structure prediction. We investigate the influence of the composition, physico-chemical properties and position specific occurrence preferences of amino acids within secondary structure elements. In addition, we identify the correlation between these properties and prediction accuracy. The present detailed results suggest several important ways that secondary structure predictions can be improved in the future that might lead to improved protein design and engineering.

Collapse

Zhou C, Hou C, Zhang Q, Wei X. Enhanced hybrid search algorithm for protein structure prediction using the 3D-HP lattice model. J Mol Model 2013;19:3883-91. [PMID: 23824509 DOI: 10.1007/s00894-013-1907-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2012] [Accepted: 05/30/2013] [Indexed: 10/26/2022]