Bioinformatics of Profilin in Wheat

Profilin is a small actin-binding protein that is essential in all organisms. While the wheat genome sequence database is not currently available, the exploration of bioinformatics is very important. Therefore, this article predicted the structure and function of profilin in wheat by bioinformatics methods. The amino acid sequence of profilin was searched in GeneBack. While its signal peptides were analyzed through CBS prediction server, the hydrophobicities were analyzed by bioedit software, we used EMBnet server and DNAstar to analyse the transmembrane domains and the B-cell epitopes of the profilin respectively. Finally, the tertiary structure of the protein was predicted through Swiss-Model. This information will help develop rational strategies to improve the component-resolving diagnosis and immunotherapy of profilin allergy.


Introduction
Profilins are ubiquitous and abundant cytosolic proteins of 12 -15 kDa, found in all eukaryotic cells [1] [2] [3] and virus [4].They are key regulators of actin cytoskeleton dynamics through their interaction to monomeric actin (G-actin), and to a plethora of actin-binding proteins, which involve poly-L-proline (PLP) stretches [5].Moreover, interaction with phosphatid ylinositol-4,5-bisphosphate, a major component of cellsignaling transduction pathways, is essential for the integration of stress responses through cytoskeleton rearrangement, in addition to processes such as cell movement and cytokinesis driven by actin polymerization dynamics [6].Profilin was firstly recognized as an allergen (called Bet v 2) in birch pollen [7] and later described as allergen in plant foods and latex [8].The wide spread cross-reactivity of profilins has led to the designation of profilins as "pan-allergens" [9].The sensitization to these allergens has been considered as a risk factor for allergic reactions [10].
Wheat (Triticum aestivum L.) is one of the major food grains consumed by humans [11].The economic and social significance of wheat is derived from its broad production and superior ability to generate diverse foods .However, ingestion of wheat can induce immune-mediated adverse food reactions such as celiac disease or IgE mediated Type 1 allergies in predisposed individuals.The prevalence of wheat induced food allergy is low in adults.However, in young children wheat is one of the major causes of food allergy.Respiratory allergy to wheat flour, affecting particularly persons working at bakeries and therefore named baker's asthma, is one of the most common causes of occupational asthma [12].Recently, wheat profilin, designated Tri a 12, has been found to be recognized by specific IgE antibodies in patients suffering from baker's asthma.
In the present study, we predicted the structure and studied its influence over the structure of the profilin in wheat by bioinformatics methods.We have carried out an extensive analysis of the B-cell epitopes, to unravel profilin-specific epitopes, providing a comprehensive understanding of the broad cross-reactivity and specific allergy reactions to profilin.The knowledge provided in this study will help develop rational strategies to improve the component-resolving diagnosis and immunotherapy of profilin allergy.

Method
The amino acid sequence of the wheat profilin protein can be found in the NCBI website (http://www.ncbi.nlm.nih.gov/protein/CAA61945.2); the signal peptide is analyzed by CBS prediction server login (http://www.cbs.dtu.dk/services/,select signal P method) and the hydrophobic protein is predicted by bioedit Sequence Alignment Editer.Besides, we use EMBnet server (http://www.ch.embnet.org/,select TMPRED method) and DNAstar (http://www.dnastar.com/)to analyse the transmembrane domains and the B-cell epitopes of the protein respectively.
We predict the secondary structure of the wheat profilin on espasy proteomics website (http://www.expasy.org/tools/,select SOPMA method); use Swiss-Model to predict the tertiary structure of the protein (http://www.swissmodel.expasy.org/,select Automatic Modelling mode).

Analysis of Gene Function
Amino acid sequence of profilin protein searched in the NCBI gene bank as follows: MSWKAYVDDHLCCEIDGQNLTSAAILGHDGSVWAQSPNFPQFKPEENAGIVKDF EEPGHLAPTGLFLGGTKYMVIQGEPGVVIRGKKGTGGITIKKTGMALILGIYDEPM TPGQCNLVVERLGDYLIDQGY.

Prediction of the Signal Peptide
The active site of the protein was predicted by ScanProsite method .We predict that 93 and 111 are threonine phosphorylation sites; 6 and 12 are tyrosine phosphorylation sites (Figure 1).The glycosylation sites are 19 -21.The predictions of a signal peptide is shown in Figure 2, in which we can find the score in 35 is the highest, so 1 -35 are predicting to be the signal peptide of profilin.

Analysis of Protein Hydrophobicity and Transmembrane Domain
Protein transmembrane region is the region about 20 hydrophobic amino acid residues and generally shows α-helix structure.Using Bioedit software to predict protein's hydrophobicity, the score is more than 0, with the increasing of the value, the representative of hydrophobic is better.As shown in Figure 3, the profilin protein's hydrophobicity is poor, and there is no a hydrophobic space of more than consecutive 20 amino  acids, which indicates that the protein does not exist transmembrane structure area.Figure 4 is using EMBnet server to predict transmembrane region.In general, if the score of the sequence of amino acids is positive, the region will be the transmembrane area of the protein.So Figure 4 shows that the wheat profiln protein does not exist transmembrane region.

Analysis of the Secondary Structure
Predicting the secondary structure of profilin protein in wheat is using SOPMA method.The result is shown in Figure 5.The relative content of α-helix is 19.8%; β-fold relative content is 30.53%.The relative content of β-fold is higher than α-helix.

Analysis of the Tertiary Structure
We find a Profilin-5 protein of which homology can reach 83.21% of wheat profiling  protein by PDB structure database (http://www.rcsb.org/pdb/home/home.do), the number of the PDB is 5fef.1.A. Meanwhile wheat profilin protein is predicted by Swiss-Model, as shown in Figure 6, we find wheat profilin protein has three helical domains that is all at the edge of the protein structure, and in the middle region of the protein is as a whole β-sheet structure area, which highly consistent with the predicted secondary structure in relative content of α helices and β fold.

Analysis of the Profilin Epitope of B-Cell
As shown in Figure 7, Wheat profilin proteinʼs hydrophilic, surface accessibility and antigen index are predicted by DNAstar software.There are three lines in the figure, which respectively is hydrophilic, antigenic index and surface accessibility from top to bottom.Upper and lower of horizontal lines represent the range of values that are important factors to determine the epitope of protein epitopes.When the surface accessibility is more than 1, antigenic index is more than 0, and surface accessibility is more than 1; amino acid residue region of the protein may form epitopes.The region of amino acids 40 -47, 53 -57 and 108 -112 can be inferred to be antigen region of profilin protein through the various parameters and all of them are located on the surface, among which the number of the 40 -47 is located on the first α helice, the region of

Conclusions
In this paper, the signal peptide of profilin was predicted by CBS prediction software and we found the region of 1 -35 is a signal peptide of profilin protein.Meanwhile, the profilins have a glycosylation site and a number of phosphorylation sites.Then, the hydrophobicities and transmembrane domains of profilin were analyzed by bioedit and EMBnet software.We had known that the hydrophobicities of profilin are poor, and there are no transmembrane domains.The content of secondary structure of profilin is calculated by establishing the model of secondary structure.The result shows that the profilins contains a lot of beta folding structure.Besides, the relative content of the alpha helix and beta folding are more than 15%.Finally, the segments of 40 -47, 53 -57, and 108 -112 may be a region of B cell epitope by using DNAstar software and the homology of profilin protein and Arah5 protein that have 3 segments of α helix domain can reach 79.07%by homology comparison.The results of this study are similar to those of Peanut profilin that was analysed by Xiao Jie in 2011 [13].
Allergic disease is one of the major problems in the current world of hygiene.Plant pollen and food play an important role in allergic reactions.The wheat can also cause severe allergic reactions.Recent studies have found that wheat could induce the patient's allergies to food and gas through the profiling-Tri a12 which have been identified by specific IgE antibody on the surface of bread workers [14].Profilins are one of the most prevalent plant protein families causing IgE mediated reactions [15].How to change the structure of wheat profilin protein or amino acid residues that can reduce the binding force of IgE and affect the immune response so as to reduce the allergy, is the fundamental method to solve the problem of wheat allergy.So, this article analyzes the structure and function of profilin using bioinformatics methods, which is the basis for further study on the allergic reaction of wheat cross reactivity and the sensitivity of profilin protein.
Bioinformatics aims at understanding the biological meaning of data, using biological tools to quickly predict the structure and functional properties of proteins.Profilin is a kind of actin binding protein which can regulate microfilaments and has a great important role in the biological growth and development, so there has certain theoretical significance and practical value of the application of bioinformatics analysis of the structure and function of profilin in wheat [16].

Figure 1 .
Figure 1.Prediction of the phosphorylation sites for profilin protein.

Figure 2 .
Figure 2. Prediction of the signal peptide for profilin protein.

Figure 3 .
Figure 3. Prediction of the hydrophobicity for profilin protein.

Figure 4 .
Figure 4. Prediction of the transmembrane domains in profilin protein.

Figure 5 .
Figure 5. c.h.e and t refer to coil.helix.extendstrand and beta turn respectively.

Figure 6 .
Figure 6.Swiss-model prediction result of tertiary structure of profilin protein.