Journal of Biomedical Science and Engineering

Volume 6, Issue 12 (December 2013)

ISSN Print: 1937-6871   ISSN Online: 1937-688X

Google-based Impact Factor: 0.66  Citations  h5-index & Ranking

PFP-RFSM: Protein fold prediction by using random forests and sequence motifs

HTML  Download Download as PDF (Size: 254KB)  PP. 1161-1170  
DOI: 10.4236/jbise.2013.612145    3,072 Downloads   5,175 Views  Citations

ABSTRACT

Protein tertiary structure is indispensible in revealing the biological functions of proteins. De novo perdition of protein tertiary structure is dependent on protein fold recognition. This study proposes a novel method for prediction of protein fold types which takes primary sequence as input. The proposed method, PFP-RFSM, employs a random forest classifier and a comprehensive feature representation, including both sequence and predicted structure descriptors. Particularly, we propose a method for generation of features based on sequence motifs and those features are firstly employed in protein fold prediction. PFP-RFSM and ten representative protein fold predictors are validated in a benchmark dataset consisting of 27 fold types. Experiments demonstrate that PFP-RFSM outperforms all existing protein fold predictors and improves the success rates by 2%-14%. The results suggest sequence motifs are effective in classification and analysis of protein sequences.

 

Share and Cite:

Li, J. , Wu, J. and Chen, K. (2013) PFP-RFSM: Protein fold prediction by using random forests and sequence motifs. Journal of Biomedical Science and Engineering, 6, 1161-1170. doi: 10.4236/jbise.2013.612145.

Cited by

[1] Recent Advances in the Prediction of Protein Structural Classes: Feature Descriptors and Machine Learning Algorithms
2021
[2] AFP-CMBPred: Computational identification of antifreeze proteins by extending consensus sequences into multi-blocks evolutionary information
Computers in Biology …, 2021
[3] Improving the classification of protein sequence functions by reducing the heterogeneity of datasets
2021
[4] Artificial Intelligence for Drug Development, Precision Medicine, and Healthcare
2020
[5] DeepFrag-k: a fragment-based deep learning approach for protein fold recognition
2020
[6] Identification of antioxidant proteins using a discriminative intelligent model of k-space amino acid pairs based descriptors incorporating with ensemble feature …
2020
[7] Recent Trends in Machine Learning-based Protein Fold Recognition Methods
2020
[8] Machine Learning Methods for the Protein Fold Recognition Problem
Machine Learning Paradigms, 2019
[9] A Mathematical Model Quantifying Sequence Alignment for Constructing Phylogenetic Trees and Ant-Minor Protein Structure Classification.
2019
[10] Relevance of Machine Learning Techniques and Various Protein Features in Protein Fold Classification: A Review
2019
[11] Highly Accurate Fragment Library for Protein Fold Recognition
2019
[12] An Overview on Protein Fold Classification via Machine Learning Approach
Current Proteomics, 2018
[13] ProFold: Protein Fold Classification with Additional Structural Features and a Novel Ensemble Classifier
BioMed Research International, 2016
[14] Recent Progress in Machine Learning-Based Methods for Protein Fold Recognition
International Journal of Molecular Sciences, 2016
[15] Enhanced Feature Extraction from Evolutionary Profiles for Protein Fold Recognition
2015
[16] The recognition of multi-class protein folds by adding average chemical shifts of secondary structure elements
Saudi Journal of Biological Sciences, 2015
[17] Recognition of 27-Class Protein Folds by Adding the Interaction of Segments and Motif Information
BioMed research international, 2014
[18] Review and Research Analysis of Computational Target Methods Using BioRuby and In silico Screening of Herbal Lead Compounds Against Pancreatic Cancer Using R Programming
Current drug metabolism, 2014
[19] Review and research analysis of computational target methods using BioRuby and in silico screening of herbal lead compounds against pancreatic cancer using R …
Current Drug Metabolism, 2014

Copyright © 2024 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.