American Journal of Molecular Biology

Volume 1, Issue 2 (July 2011)

ISSN Print: 2161-6620   ISSN Online: 2161-6663

Google-based Impact Factor: 0.47  Citations  

A reduced computational load protein coding predictor using equivalent amino acid sequence of DNA string with period-3 based time and frequency domain analysis

HTML  Download Download as PDF (Size: 435KB)  PP. 79-86  
DOI: 10.4236/ajmb.2011.12010    4,713 Downloads   10,647 Views  Citations

Affiliation(s)

.

ABSTRACT

Development of efficient gene prediction algorithms is one of the fundamental efforts in gene prediction study in the area of genomics. In genomic signal processing the basic step of the identification of protein coding regions in DNA sequences is based on the period-3 property exhibited by nucleotides in exons. Several approaches based on signal processing tools and numerical representations have been applied to solve this problem, trying to achieve more accurate predictions. This paper presents a new indicator sequence based on amino acid sequence, called as aminoacid indicator sequence, derived from DNA string that uses the existing signal processing based time-domain and frequency domain methods to predict these regions within the billions long DNA sequence of eukaryotic cells which reduces the computational load by one-third. It is known that each triplet of bases, called as codon, instructs the cell machinery to synthesize an amino acid. The codon sequence therefore uniquely identifies an amino acid sequence which defines a protein. Thus the protein coding region is attributed by the codons in amino acid sequence. This property is used for detection of period-3 regions using amino acid sequence. Physico-chemical properties of amino acids are used for numerical representation. Various accuracy measures such as exonic peaks, discriminating factor, sensitivity, specificity, miss rate, wrong rate and approximate correlation are used to demonstrate the efficacy of the proposed predictor. The proposed method is validated on various organisms using the standard data-set HMR195, Burset and Guigo and KEGG. The simulation result shows that the proposed method is an effective approach for protein coding prediction.

Share and Cite:

Meher, J. , Dash, G. , Meher, P. and Raval, M. (2011) A reduced computational load protein coding predictor using equivalent amino acid sequence of DNA string with period-3 based time and frequency domain analysis. American Journal of Molecular Biology, 1, 79-86. doi: 10.4236/ajmb.2011.12010.

Cited by

[1] Single-Walled Carbon Nanohorns as Boosting Surface for the Analysis of Low-Molecular-Weight Compounds by SALDI-MS
International Journal of …, 2022
[2] DNA numerical encoding schemes for exon prediction: a recent history
Nucleosides, Nucleotides & …, 2021
[3] A tri-nucleotide mapping scheme based on residual volume of amino acids for short length exon prediction using sliding window DFT method.
2020
[4] Unbinding events of amino acids and peptides from water–pyrite interfaces: A case study of life's origin on mineral surfaces
2020
[5] Selectivity of Bedaquiline reacting with different polypeptide chains. Theoretical approach
2020
[6] A tri-nucleotide mapping scheme based on residual volume of amino acids for short length exon prediction using sliding window DFT method
2020
[7] КОНЪЮГАТЫ БОРФТОРИДНЫХ КОМПЛЕКСОВ ДИПИРРОМЕТЕНА С АМИНОКИСЛОТАМИ: ПОЛУЧЕНИЕ И ФИЗИКО-ХИМИЧЕСКИЕ СВОЙСТВА
2020
[8] USING DIT-FFT ALGORITHM FOR IDENTIFICATION OF PROTEIN CODING REGION IN EUKARYOTIC GENE
2018
[9] Application of BT and PC-BT in Homo sapiens gene prediction
Microsystem Technologies, 2015
[10] REALIZATION OF AN EVD MODEL IN LABVIEW ENVIRONMENT FOR IDENTIFICATION OF CANCER AND HEALTHY HOMO SAPIENS GENES.
Annals of the Faculty of Engineering Hunedoara-International Journal of Engineering, 2015
[11] REALIZATION OF AN EVD MODEL IN LABVIEW ENVIRONMENT FOR IDENTIFICATION OF CANCER AND HEALTHY HOMO SAPIENS GENES
Annals of the Faculty of Engineering Hunedoara, 2015
[12] 基于全相位滤波理论的基因预测
上海交通大学学报, 2013

Copyright © 2024 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.