TITLE:
Similarity/dissimilarity analysis of protein sequences using the spatial median as a descriptor
AUTHORS:
Mervat M. Abo-Elkhier
KEYWORDS:
Right Cone; Non Equal Proteins; Spatial Median; Similarity/Dissimilarity; Linear Correlation and Significance Analysis
JOURNAL NAME:
Journal of Biophysical Chemistry,
Vol.3 No.2,
May 29, 2012
ABSTRACT: A novel 3-D graphical representation of protein sequences is introduced. A right cone of unit base and unit height is used to represent protein sequences on its surface. The twenty amino acids are represented by 20 circles, and all of a protein's residues are represented by n lines on the cone's surface. All the spots that represent the protein's residues are shown in the cone's top view. The spatial median of these spots is used as a new descriptor of a protein sequence. The approach is applied to two short protein segments from the yeast Saccharomyces cerevisiae. An examination of the similarities/dissimilarities among the eight ND5 proteins and among the six β-globin proteins illustrates the utility of the approach. A linear correlation and a significance analysis are provided to compare our results with the percentage sequence alignment identity.
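The descriptor named in the abstract, the spatial median, is the point that minimizes the sum of Euclidean distances to a set of points. The following is a minimal sketch of how such a point could be approximated, assuming Weiszfeld's iteration as the solver (the abstract does not say which method the author used). The function name `spatial_median`, its parameters, and the toy coordinates are hypothetical and are not taken from the paper; the mapping of residues to spots on the cone is not reproduced here.

```python
import math


def spatial_median(points, tol=1e-9, max_iter=1000):
    """Approximate the spatial (geometric) median of a list of points.

    Assumption: Weiszfeld's iteration is used as the solver; the source
    abstract does not specify one.
    """
    dim = len(points[0])
    # Start from the centroid of the points.
    current = [sum(p[k] for p in points) / len(points) for k in range(dim)]
    for _ in range(max_iter):
        weighted = [0.0] * dim
        weight_sum = 0.0
        for p in points:
            d = math.dist(p, current)
            if d < 1e-12:
                # Simplification: skip a point that coincides with the
                # current estimate to avoid dividing by zero.
                continue
            w = 1.0 / d
            weight_sum += w
            for k in range(dim):
                weighted[k] += w * p[k]
        if weight_sum == 0.0:
            # Every point coincides with the current estimate.
            break
        new = [c / weight_sum for c in weighted]
        if math.dist(new, current) < tol:
            return new
        current = new
    return current


# Hypothetical top-view spots (x, y); not data from the paper.
spots = [(0.0, 0.0), (2.0, 0.0), (0.0, 2.0), (2.0, 2.0)]
median = spatial_median(spots)
print(median)
```

Under this sketch, two sequences could be compared by the Euclidean distance between their spatial medians; that comparison rule is also an assumption here, since the abstract does not state the distance measure.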