Natural Science

Volume 9, Issue 9 (September 2017)

ISSN Print: 2150-4091   ISSN Online: 2150-4105

Google-based Impact Factor: 0.74  Citations  h5-index & Ranking

pLoc-mGpos: Incorporate Key Gene Ontology Information into General PseAAC for Predicting Subcellular Localization of Gram-Positive Bacterial Proteins

HTML  XML Download Download as PDF (Size: 1467KB)  PP. 330-349  
DOI: 10.4236/ns.2017.99032    1,341 Downloads   2,748 Views  Citations
Author(s)

ABSTRACT

The basic unit in life is cell. It contains many protein molecules located at its different organelles. The growth and reproduction of a cell as well as most of its other biological functions are performed via these proteins. But proteins in different organelles or subcellular locations have different functions. Facing the avalanche of protein sequences generated in the postgenomic age, we are challenged to develop high throughput tools for identifying the subcellular localization of proteins based on their sequence information alone. Although considerable efforts have been made in this regard, the problem is far apart from being solved yet. Most existing methods can be used to deal with single-location proteins only. Actually, proteins with multi-locations may have some special biological functions that are particularly important for drug targets. Using the ML-GKR (Multi-Label Gaussian Kernel Regression) method, we developed a new predictor called “pLoc-mGpos” by in-depth extracting the key information from GO (Gene Ontology) into the Chou’s general PseAAC (Pseudo Amino Acid Composition) for predicting the subcellular localization of Gram-positive bacterial proteins with both single and multiple location sites. Rigorous cross-validation on a same stringent benchmark dataset indicated that the proposed pLoc-mGpos predictor is remarkably superior to “iLoc-Gpos”, the state-of-the-art predictor for the same purpose. To maximize the convenience of most experimental scientists, a user-friendly web-server for the new powerful predictor has been established at http://www.jci-bioinfo.cn/pLoc-mGpos/, by which users can easily get their desired results without the need to go through the complicated mathematics involved.

Share and Cite:

Xiao, X. , Cheng, X. , Su, S. , Mao, Q. and Chou, K. (2017) pLoc-mGpos: Incorporate Key Gene Ontology Information into General PseAAC for Predicting Subcellular Localization of Gram-Positive Bacterial Proteins. Natural Science, 9, 330-349. doi: 10.4236/ns.2017.99032.

Cited by

[1] The Development and Progress in Machine Learning for Protein Subcellular Localization Prediction
The Open Bioinformatics Journal, 2022
[2] Protein Subcellular Localization Based on Evolutionary Information and Segmented Distribution
Mathematical Problems in Engineering, 2021
[3] The significant and profound impacts of Gordon life science institute
2021
[4] 4mC-RF: Improving the prediction of 4mC sites using composition and position relative features and statistical moment
Analytical Biochemistry, 2021
[5] The Remarkable impacts of Gordon life science institute
2021
[6] NPalmitoylDeep-PseAAC: A Predictor of N-Palmitoylation Sites in Proteins Using Deep Representations of Proteins and PseAAC via Modified 5-Steps Rule
2021
[7] IAmideV-deep: Valine amidation site prediction in proteins using deep learning and pseudo amino acid compositions
2021
[8] iPhosD-PseAAC: Identification of phosphoaspartate sites in proteins using statistical moments and PseAAC
2021
[9] iPhosS (Deep)-PseAAC: Identification of Phosphoserine Sites in Proteins Using Deep Learning on General Pseudo Amino Acid Compositions
IEEE/ACM Transactions …, 2020
[10] The divination of things by things
Extended Abstracts of the 2020 CHI Conference on …, 2020
[11] RF-MaloSite and DL-MaloSite: Two independent computational methods based on Random Forest (RF) and Deep Learning (DL) to predict malonylation sites
2020
[12] Distorted key theory and its implication for drug development
2020
[13] Showcase to Illustrate How the Web-Server pLoc_Deep-mGneg Is Working
2020
[14] pLoc_Deep-mEuk: Predict Subcellular Localization of Eukaryotic Proteins by Deep Learning
2020
[15] Predicting gram-positive bacterial protein subcellular location by using combined features
2020
[16] Analysis of domain feature of gram-positive bacterial protein.
2020
[17] ML-RBF: Predict protein subcellular locations in a multi-label system using evolutionary features
2020
[18] Recent Progresses for Computationally Identifying N 6-methyladenosine Sites in Saccharomyces cerevisiae
2020
[19] GP4: an integrated Gram-Positive Protein Prediction Pipeline for subcellular localization mimicking bacterial sorting
2020
[20] The End of Our Earth Is Certainly to Come:“When”? and “Why”?
2020
[21] RF-MaloSite and DL-Malosite: Methods based on random forest and deep learning to identify malonylation sites
2020
[22] Enhancing Segmentation Approaches from Oaam to Fuzzy KC-Means
2020
[23] Use Chou's 5-Steps Rule to Predict Remote Homology Proteins by Merging Grey Incidence Analysis and Domain Similarity Analysis
2020
[24] Gordon Life Science Institute and Its Impacts on Computational Biology and Drug Development
2020
[25] Showcase to Illustrate How the Web-server iPTM-mLys is working
2020
[26] How the artificial intelligence tool iRNA-2methyl is working for RNA 2'-O-methylation sites
2020
[27] An insightful 20-year recollection since the birth of pseudo amino acid components
2020
[28] Showcase to illustrate how the web-server iKcr-PseEns is working
2020
[29] Predicting lncRNA subcellular localization using unbalanced pseudo-k nucleotide compositions
2020
[30] Sequence-based identification of arginine amidation sites in proteins using deep representations of proteins and PseAAC
2020
[31] An Effective Multi-Label Protein Sub-Chloroplast Localization Prediction by Skipped-Grams of Evolutionary Profiles Using Deep Neural Network
IEEE/ACM Transactions on Computational …, 2020
[32] cACP: Classifying anticancer peptides using discriminative intelligent model via Chou's 5-step rules and general pseudo components
2020
[33] Feature selection and classification for gene expression data using novel correlation based overlapping score method via Chou's 5-steps rule
2020
[34] Some illuminating remarks on molecular genetics and genomics as well as drug development
2020
[35] Gordon Life Science Institute, Boston, Massachusetts 02478, United States of America. Showcase To Illustrate How the Web-Server Initro-Tyr Is Working
Journal of Clinical Cancer Research, 2019
[36] The impact of statin therapy on the survival of patients with gastrointestinal cancer
2019
[37] A Study for Therapeutic Treatment against Parkinson's Disease via Chou's 5-steps Rule
2019
[38] Inhibition of α-amylase Activity by Zn2+: Insights from Spectroscopy and Molecular Dynamics Simulations
2019
[39] A Possible Modulation Mechanism of Intramolecular and Intermolecular Interactions for NCAM Polysialylation and Cell Migration
2019
[40] RAACBook: a web server of reduced amino acid alphabet for sequence-dependent inference by using Chou's five-step rule
2019
[41] Bayesian Polytrees with Learned Deep Features for Multi-Class Cell Segmentation
2019
[42] Prediction of Apoptosis Protein Subcellular Localization with Multilayer Sparse Coding and Oversampling Approach
2019
[43] Identifying N6-methyladenosine sites using extreme gradient boosting system optimized by particle swarm optimizer
2019
[44] LAIPT: Lysine Acetylation Site Identification with Polynomial Tree
2019
[45] MULTiPly: a novel multi-layer predictor for discovering general and specific types of promoters
2019
[46] MFSC: Multi-voting based feature selection for classification of Golgi proteins by adopting the general form of Chou's PseAAC components
2019
[47] pSSbond-PseAAC: Prediction of disulfide bonding sites by integration of PseAAC and statistical moments
2019
[48] SPalmitoylC-PseAAC: A sequence-based model developed via Chou's 5-steps rule and general PseAAC for identifying S-palmitoylation sites in proteins
2019
[49] MsDBP: Exploring DNA-Binding Proteins by Integrating Multiscale Sequence Information via Chou's Five-Step Rule
2019
[50] iSulfoTyr-PseAAC: Identify Tyrosine Sulfation Sites by Incorporating Statistical Moments via Chou's 5-steps Rule and Pseudo Components
2019
[51] In Silico Design and Synthesis of Targeted Curcumin Derivatives as Xanthine Oxidase Inhibitors
2019
[52] Established and In-trial GPCR Families in Clinical Trials: A Review for Target Selection
2019
[53] iPhosH-PseAAC: Identify phosphohistidine sites in proteins by blending statistical moments and position relative features according to the Chou's 5-step rule and …
2019
[54] The preliminary efficacy evaluation of the CTLA-4-Ig treatment against Lupus nephritis through in-silico analyses
2019
[55] MsDBP: Exploring DNA-binding Proteins by Integrating Multi-scale Sequence Information via Chou's 5-steps Rule
Journal of Proteome Research, 2019
[56] iRSpot-SPI: Deep learning-based recombination spots prediction by incorporating secondary sequence information coupled with physio-chemical properties via …
2019
[57] Identifying DNase I hypersensitive sites using multi-features fusion and F-score features selection via Chou's 5-steps rule
2019
[58] Metabolism of Oxalate in Humans: A Potential Role Kynurenine Aminotransferase/Glutamine Transaminase/Cysteine Conjugate Betalyase Plays in Hyperoxaluria
2019
[59] iMethylK-PseAAC: Improving Accuracy of Lysine Methylation Sites Identification by Incorporating Statistical Moments and Position Relative Features into General …
2019
[60] Identification and characterization of WD40 superfamily genes in peach
2019
[61] Bioimage-based Prediction of Protein Subcellular Location in Human Tissue with Ensemble Features and Deep Networks
2019
[62] Application of machine learning approaches for the design and study of anticancer drugs
2019
[63] Glioma stages prediction based on machine learning algorithm combined with protein-protein interaction networks
2019
[64] Prediction of lysine formylation sites using the composition of k-spaced amino acid pairs via Chou's 5-steps rule and general pseudo components
2019
[65] Quantitative Structure-activity Relationship of Acetylcholinesterase Inhibitors based on mRMR Combined with Support Vector Regression
2019
[66] Advance in Predicting Subcellular Localization of Multi-label Proteins and its Implication for Developing Multi-target Drugs
2019
[67] Proposing Pseudo Amino Acid Components is an Important Milestone for Proteome and Genome Analyses
2019
[68] Progresses in predicting post-translational modification
2019
[69] iHyd-PseAAC (EPSV): Identifying Hydroxylation Sites in Proteins by Extracting Enhanced Position and Sequence Variant Feature via Chou's 5-Step Rule and …
2019
[70] Advances in Predicting Subcellular Localization of Multi-label Proteins and its Implication for Developing Multi-target Drugs
2019
[71] pLoc_bal-mVirus: predict subcellular localization of multi-label virus proteins by Chou's general PseAAC and IHTS treatment to balance training dataset
2019
[72] pLoc_bal-mEuk: predict subcellular localization of eukaryotic proteins by general PseAAC and quasi-balancing training dataset
2019
[73] Critical evaluation of web-based prediction tools for human protein subcellular localization
2019
[74] Showcase to Illustrate How the Web-Server Idna6ma-Pseknc is Working
Journal of Pathology Research Reviews & Reports, 2019
[75] How the artificial intelligence tool iSNO-PseAAC is working in predicting the cysteine S-nitrosylation sites in proteins
Journal of Stem Cell Research and Medicine, 2019
[76] Artificial intelligence (AI) tools constructed via the 5-steps rule for predicting post-translational modifications
2019
[77] An insightful recollection for predicting protein subcellular locations in multi-label systems
2019
[78] An insightful 10-year recollection since the emergence of the 5-steps rule.
2019
[79] iHyd-PseAAC (EPSV): identifying hydroxylation sites in proteins by extracting enhanced position and sequence variant feature via chou's 5-step rule and general …
2019
[80] Recent progresses in predicting protein subcellular localization with artificial intelligence (AI) tools developed via the 5-steps rule
2019
[81] Identify Lysine Neddylation Sites Using Bi-Profile Bayes Feature Extraction via the Chou's 5-Steps Rule and General Pseudo Components
2019
[82] Biological Production of (S)-acetoin: A State-of-the-Art Review
2019
[83] Impacts of pseudo amino acid components and 5-steps rule to proteomics and proteome analysis
2019
[84] The multiple applications and possible mechanisms of the hyperbaric oxygenation therapy
2019
[85] Gordon life science institute: its philosophy, achievements, and perspective
2019
[86] An insightful recollection since the birth of Gordon Life Science Institute about 17 years ago
2019
[87] Recent progresses in predicting protein subcellular localization with artificial intelligence (AI) tools developed via the 5‐steps rule
2019
[88] 19F-NMR in Target-based Drug Discovery
2019
[89] Intriguing Story about the Birth of Gordon Life Science Institute and its Development and Driving Force
2019
[90] Efficient computational model for classification of protein localization images using extended threshold adjacency statistics and support vector machines
Computer methods and …, 2018
[91] iRSpot-DTS: Predict recombination spots by incorporating the dinucleotide-based spare-cross covariance information into Chou's pseudo components
2018
[92] pLoc-mGneg: Predict subcellular localization of Gram-negative bacterial proteins by deep gene ontology learning via general PseAAC
Genomics, 2018
[93] iKcr-PseEns: Identify lysine crotonylation sites in histone proteins with pseudo components and ensemble classifier
Genomics, 2018
[94] iDNA6mA-PseKNC: Identifying DNA N6-methyladenosine sites by incorporating nucleotide physicochemical properties into PseKNC
Genomics, 2018
[95] pLoc_bal-mGpos: Predict subcellular localization of Gram-positive bacterial proteins by quasi-balancing training dataset and PseAAC
Genomics, 2018
[96] iProt-Sub: a comprehensive package for accurately mapping and predicting protease-specific substrates and cleavage sites
Briefings in Bioinformatics, 2018
[97] iRNA-3typeA: Identifying Three Types of Modification at RNA's Adenosine Sites
Molecular Therapy - Nucleic Acids, 2018
[98] pLoc_bal-mAnimal: predict subcellular localization of animal proteins by balancing training dataset and PseAAC
Bioinformatics, 2018
[99] Implications of newly identified brain eQTL genes and their interactors in Schizophrenia
Molecular Therapy - Nucleic Acids, 2018
[100] Using Chou's general PseAAC to analyze the evolutionary relationship of receptor associated proteins (RAP) with various folding patterns of protein domains
Journal of Theoretical Biology, 2018
[101] iMem-2LSAAC: A two-level model for discrimination of membrane proteins and their types by extending the notion of SAAC into chou's pseudo amino acid …
Journal of Theoretical Biology, 2018
[102] BlaPred: Predicting and classifying β-lactamase using a 3-tier prediction system via Chou's general PseAAC
Journal of Theoretical Biology, 2018
[103] iRO-3wPseKNC: Identify DNA replication origins by three-window-based PseKNC.
Bioinformatics, 2018
[104] EvoStruct-Sub: An accurate Gram-positive protein subcellular localization predictor using evolutionary and structural features
Journal of Theoretical Biology, 2018
[105] iPromoter-FSEn: Identification of bacterial σ70 promoter sequences using feature subspace based ensemble classifier
Genomics, 2018
[106] iRNA (m6A)-PseDNC: identifying N6-methyladenosine sites using pseudo dinucleotide composition
Analytical Biochemistry, 2018
[107] Analysis and prediction of ion channel inhibitors by using feature selection and Chou's general pseudo amino acid composition
Journal of Theoretical Biology, 2018
[108] pLoc_bal-mGneg: predict subcellular localization of Gram-negative bacterial proteins by quasi-balancing training dataset and general PseAAC
Journal of Theoretical Biology, 2018
[109] Predicting subcellular localization of multi-label proteins by incorporating the sequence features into Chou's PseAAC
Genomics, 2018
[110] Prediction of citrullination sites by incorporating k-spaced amino acid pairs into Chou's general pseudo amino acid composition
Gene, 2018
[111] pLoc_bal-mHum: Predict subcellular localization of human proteins by PseAAC and quasi-balancing training dataset
Genomics, 2018
[112] Identifying 5-methylcytosine sites in RNA sequence using composite encoding feature into Chou's PseKNC
Journal of Theoretical Biology, 2018
[113] Improved DNA-binding protein identification by incorporating evolutionary information into the Chou's PseAAC
2018
[114] iRSpot-ADPM: Identify recombination spots by incorporating the associated dinucleotide product model into Chou's pseudo components
Journal of Theoretical Biology, 2018
[115] Predicting protein submitochondrial locations by incorporating the pseudo-position specific scoring matrix into the general Chou's pseudo-amino acid composition
Journal of Theoretical Biology, 2018
[116] Identify Gram-negative bacterial secreted protein types by incorporating different modes of PSSM into Chou's general PseAAC via Kullback-Leibler divergence
Journal of Theoretical Biology, 2018
[117] A proteome-wide systems toxicological approach deciphers the interaction network of chemotherapeutic drugs in the cardiovascular milieu
RSC Advances, 2018
[118] Predicting apoptosis protein subcellular localization by integrating auto-cross correlation and PSSM into Chou's PseAAC
Journal of Theoretical Biology, 2018
[119] Characterization of proteins in different subcellular localizations for Escherichia coli K12
Genomics, 2018
[120] Improving succinylation prediction accuracy by incorporating the secondary structure via helix, strand and coil, and evolutionary information from profile …
PLOS ONE, 2018
[121] Large-scale frequent stem pattern mining in RNA families
Journal of Theoretical Biology, 2018
[122] Characterize the difference between TMPRSS2-ERG and non-TMPRSS2-ERG fusion patients by clinical and biological characteristics in prostate cancer
Gene, 2018
[123] mLysPTMpred: Multiple Lysine PTM Site Prediction Using Combination of SVM with Resolving Data Imbalance Issue
2018
[124] iRSpot-PDI: Identification of recombination spots by incorporating dinucleotide property diversity information into Chou's pseudo components
Genomics, 2018
[125] Prediction and functional analysis of prokaryote lysine acetylation site by incorporating six types of features into Chou's general PseAAC
Journal of Theoretical Biology, 2018
[126] iNuc-ext-PseTNC: an efficient ensemble model for identification of nucleosome positioning by extending the concept of Chou's PseAAC to pseudo-tri-nucleotide …
Molecular Genetics and Genomics, 2018
[127] Predicting membrane proteins and their types by extracting various sequence features into Chou's general PseAAC
Molecular Biology Reports, 2018
[128] Analysis and prediction of presynaptic and postsynaptic neurotoxins by Chou's general pseudo amino acid composition and motif features
Journal of Theoretical Biology, 2018
[129] Accelerated search for perovskite materials with higher Curie temperature based on the machine learning methods
Computational Materials Science, 2018
[130] iEnhancer-EL: Identifying enhancers and their strength with ensemble learning approach
Bioinformatics, 2018
[131] NucPosPred: Predicting species-specific genomic nucleosome positioning via four different modes of general PseKNC
Journal of Theoretical Biology, 2018
[132] LQTA-R: A new 3D-QSAR methodology applied to a set of DGAT1 inhibitors
Computational Biology and Chemistry, 2018
[133] iPPI-PseAAC (CGR): Identify protein-protein interactions by incorporating chaos game representation into PseAAC
Journal of Theoretical Biology, 2018
[134] iLoc-lncRNA: predict the subcellular location of lncRNAs by incorporating octamer composition into general PseKNC
Bioinformatics, 2018
[135] Heterodimer binding scaffolds recognition via the analysis of kinetically hot residues
Pharmaceuticals, 2018
[136] Analysis and prediction of animal toxins by various Chou's pseudo components and reduced amino acid compositions
Journal of Theoretical Biology, 2018
[137] UbiSitePred: A novel method for improving the accuracy of ubiquitination sites prediction by using LASSO to select the optimal Chou's pseudo components
Chemometrics and Intelligent Laboratory Systems, 2018
[138] Identification of protein subcellular localization via integrating evolutionary and physicochemical information into Chou's general PseAAC
Journal of Theoretical Biology, 2018
[139] iRO-3wPseKNC: Identify DNA replication origins by three-window-based PseKNC
2018
[140] pNitro-Tyr-PseAAC: Predict Nitrotyrosine Sites in Proteins by Incorporating Five Features into Chou's General PseAAC
2018
[141] Simulated protein thermal detection (SPTD) for enzyme thermostability study and an application example for pullulanase from Bacillus deramificans
2018
[142] pLoc_bal-mPlant: Predict Subcellular Localization of Plant Proteins by General PseAAC and Balancing Training Dataset
2018
[143] iPSW (2L)-PseKNC: A two-layer predictor for identifying promoters and their strength by hybrid features via pseudo K-tuple nucleotide composition
2018
[144] pLoc-mHum: predict subcellular localization of multi-location human proteins via general PseAAC to winnow out the crucial GO information
Bioinformatics, 2017
[145] Intriguing Story about the Birth of Gordon Life Science Institute and its Development and Driving Force. J Retro Virol Anti Retro Virol 2019, 1 (1): 180002 …

Copyright © 2024 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.