A two-dimensional electrophoresis reference map of the earthworm Eisenia fetida

The molecular mechanisms underlying innate immunity in the earthworm E. fetida remain unclear. For the recognition of innate immunity in the earthworm E. fetida, a detailed knowledge of this proteome is a prerequisite. The absence of a high-resolution E. fetida proteome map prompted us to determine E. fetida protein spots that can be visualised on 2-D protein gels. In this study, we present a preliminary description of the whole earthworm E. fetida proteome. A highly detailed two-dimensional gel electrophoresis (2-DE) map of the E. fetida proteome was established and approximately 1500 protein spots were detected from the earthworm sample when applying a 500 μg protein 2-DE in the pH range 3.0 10.0. We present a 2-DE proteome map of E. fetida, identifying 76 different proteins by matrix-assisted laser desorption/ionization-tandem time of flight mass spectrometry (MALDI-TOF/TOFMS) analysis. These identified proteins, including heat shock protein 90 (Hsp90), chaperonine protein HSP60, caspase-8, fibrinolytic protease 0, gelsolin-like protein, lombricine kinase, coelomic cytolytic factor1 (CCF 1), manganous superoxide dismutase (MnSOD), triosephosphate isomerase, extracellular globin-4, lysenin, and intermediate filament protein, glyceraldehyde-3phosphate dehydrogenase, et al., are involved in several processes, including transcripttion, translation, the tricarboxylic acid cycle, the cellular amino acid metabolic process, protein amino acid phosphorylation, glycolysis, and the glucose metabolic process. These 2-DE data will enhance future comparisons of immunity, toxicology, biotic processes and other challenges, thereby allowing for further study of the molecular mechanisms in response to environmental stressors, and it will be useful to investigate environmental proteomics in invertebrate earthworms.


INTRODUCTION
As a key representative of soil fauna, earthworms are essential in maintaining soil fertility through their burrowing, ingestion, and excretion [1].Earthworms are increasingly recognized as indicators of agroecosystem health and an ecotoxicological sentinel species because they are constantly exposed to soil contaminants [2].Working with earthworms presents some common challenge: none has a fully sequenced genome; they are often so small as to preclude ready dissection of internal organs or tissues [3].Modern genomic approaches offer a potential opportunity to circumvent some of these drawbacks [4][5][6][7][8][9].This study aims at creating a proteome reference map for to be used for the study of its ecotoxicoproteomics.
Proteomics is a large-scale study of the proteins that gene transcription and translation, which ultimately provides direct measurement of protein expression levels and insight into the activity state of all relevant proteins [10].Like in many other ecologically important species, proteomics research in earthworms lags far behind other model species, such as Mus musculus and Caenorhabditis elegans.There are only five research papers about proteomics of earthworm until now [11][12][13][14][15].The key technologies of the proteome analysis are protein separation by 2-DE and mass spectrometric protein identification, which result in the efficient analysis of hundreds to thousands of proteins expressed in cells at one time [16].Protein identification of earthworm is difficulty for most of proteins in earthworm have unknown function.The proteomics map of the earthworm E. fetida will provide information on earthworm proteomics that will be useful in our knowledge of the innate immunity, ecotoxicology, environmental science, et al.

The Experimental Organism and Preparation of an Artificial Soil
Adult earthworm species E. fetida (weighing between 300 and 400 mg) reared in composted cattle feces were purchased from an earthworm farm of the China Agricultural University and kept in test substrates for a week before the experiment.Artificial substrates that used in laboratory and could maintain the same conditions were prepared as described in the OECD (1984) guidelines [17].Briefly, the substrate was composed (percentages refer to dry weight) of 10% sphagnum, 20% kaolinite clay, and 70% quartz sand.The pH was adjusted to 6.0 ± 0.5 with CaCO 3 .The artificial climate chamber was maintained at 23˚C ± 1˚C and a lighting of about 1000 lx with a 12/12-h photo-period; water was supplemented to ensure appropriate water content of the substrate by weighing the containers.Two days before the experiments, the earthworms were transferred onto filter paper soaked with isotonic Lumbricus balanced salt solution12 at 22˚C ± 1˚C to void the gut load.Whole E. fetida were stored at -80˚C for protein extraction.The treatment was repeated three times for each sample (three biological replicates).

Protein Extraction
Proteins were extracted according to the TCA-A procedure described previously [13].Briefly, fresh whole E. fetida samples (2 g) collected from three biologically replications, three protein samples, and each sample made from three worms were pulverized to a fine powder with liquid nitrogen and a mortar and pestle.The powder was resuspended and the homogenate was precipitated overnight with 10 ml protein extraction buffer (acetone/20% TCA/0.1%DTT)(-20˚C).The protein pellets were washed with ice-cold methanol followed by multiple ice-cold acetone washes, dried in a vacuum freeze-drying machine, and redissolved in IEF buffer (7 M urea, 2 M thiourea, 2% CHAPS, 65 mM DTT, and 0.5% w/v carrier ampholytes (nonlinear pH 3 -10; Amersham Biosciences, Piscataway, NJ, USA).Protein concentrations were quantified using the 2D-Quant kit protein assay (GE Healthcare Biosciences, Milwaukee, WI, USA).The supernatants were stored in aliquots at -80˚C or directly loaded for isoelectric focusing.

IEF and SDS-PAGE
IPG strips (24 cm, nonlinear pH 3 -10, GE Healthcare Biosciences Immobiline DryStrips) were rehydrated overnight with 450 µl IEF buffer containing an estimated 500 µg protein and focused using a Ettan IPGphor Multiphor III (GE Healthcare Biosciences) at 20˚C, applying the following program: 30 V for 6 h, 60 V for 6 h, 100 V for 1 h, 200 V for 1 h, 500 V gradient for 1 h, 1,000 V gradient for 30 min, 5,000 V gradient for 2 h, 10,000 V for 1 h, and 10,000 V for 60,000 V•h.After focusing, the proteins reduction were in equilibration buffer (6 M urea, 29.3% w/v glycerol, 2% SDS, and 1.5 M Tris-HCl, pH 8.8) + 1% w/v DTT for 15 min and alkylation followed this in a separate incubation for an additional 15 min in equilibration buffer + 2.5% w/v IAA.The strips were then transferred to 12.5% SDS-PAGE gels for 2-DE using Amersham's Ettan DALTsix gel system with SDS electrophoresis buffer [250 mM Tris (pH 8.3), 1.92 M glycine, and 1% w/v SDS)] and a 2 W/gel for 20 min and a 16.7 W/gel for 5 h.At least three gels were run for each sample.

Image Analysis
Protein spots were detected by Coomassie Blue G-250 staining [18].After the 2-DE run, gel cassettes were opened and the gels carefully removed and placed in fixing solution (10% v/v acetic acid, 40% ethanol) for 1.5 h.Gels were then stained using Coomassie solution (0.1% w/v CBB G-250, 1.6% w/v o-phosphoric acid, 10% (NH 4 ) 2 SO 4 , and 20% ethanol) overnight.The gels were then washed with deionized water until the gel background was clear.Stained gels were scanned and calibrated using LabScan 5 software (Amersham Biosciences).Image analysis was performed using Image-Master TM 2D Platinum 7 (GE Healthcare Amersham Biosciences).Detection and matching of the protein spots was facilitated by the use of software and re-evaluated by visual inspection.Three well separated gels of each sample were used to create "replicate groups".

Protein Identification and Database Search
Protein spots of interest were manually excised from the 2-D gels.The gel plugs were washed three times with ultrapure water, destained twice with 100 mM NH 4 HCO 3 in 50% acetonitrile, reduced with 10 mM DTT in 100 mM NH 4 HCO 3 for 1 h at 56˚C and incubated with 40 mM IAA in 100 mM NH 4 HCO 3 for 45 min., dried twice with 100% acetonitrile, and digested overnight at 37˚C with 0.1 mg/ml trypsin (Promega, Madison, WI, USA) for 1:40 (w/w) (trypsin:protein) according to manufacturer's instructions.The resulting peptides were subjected to sequential extraction (3 times at 37˚C) with 8 µl each of 5% trifluoroacetic acid (TFA) for 1 h, 2.5% TFA in 50% ACN for 1 h, and 100% ACN for 1 h.Extracted peptides were dried in a vacuum centrifugation and stored at -80˚C.The digests were mixed with a saturation solution of α-cyano-4-hydroxycinnamic acid in 50% acetonitrile/0.1% trifluoroacetic acid and spotted on a MALDI target plate.
Peptide masses were measured using a MALDI-TOF/ TOF ultraflex mass spectrometer (Bruker Daltonics, Billerica, MA, USA).Mass spectra were recorded in positive reflector mode at a maximum accelerating potential of 25 kV in both manual and automatic modes.Internal calibration of the standard spectra was performed after every 10 consecutive spectra using Pepmix peptide calibration standards (Bruker Daltonics).The spectra were analyzed and a peak list was generated using XTOF/ XMASS spectrum analysis software, prior to database searching using MASCOT software (Matrix Science, London, UK).The peptide mass data were analyzed for corresponding protein matching in the NCBInr database with settings of 100 ppm mass accuracy and 0 to 2 peptide cleavage sites in the MASCOT search engine.Positive identification was considered only with a high MASCOT score, maximum peptide coverage, and additional experimental confirmation of the Mr and pI of the protein based on the 2-DE studies.The database was set to NCBInr (released on September 20, 2011; 9363125 sequences; 3207829265 residues).Search parameters included: 1) MS/MS Ion Search, as the type of search; 2) Trypsin, as the enzyme of protein digestion; 3) Carbamidomethyl (C) and Oxidation (M), as Variable modifications; 4) MONOISOTOPIC, as Mass value; 5) Unrestricted, as Protein Mass; 6) ±50 ppm, as Peptide Mass Tolerance; 7) 0.25 Da, as Fragment Mass Tolerance; 8) 1, as Max Missed Cleavages; and 9) MALDI-TOF-TOF, as Instrument type.The identification was considered with MASCOT score, total ion C.I.%.The minimum MASCOT score was 45, the minimum total ion C.I.% for positive identification was 95%, the minimum number of unique peptides was 1.The experimental Mr of the protein spots was calculated using ladder.The experimental pI of the protein spots was calculated from their position on the IEF strips as mentioned by the manufacturer's specification (GE Healthcare).
Functional characterization of the identified proteins was determined by gene ontology tool available at http://www.uniprot.org.Due probably to the lack of resolution sometimes associated with 2-DE, the MASCOT searches for some protein spots occasionally gave several confident identifications [19].Only the top scoring proteins identified within each sample are reported in the Results section.

RESULTS
The proteome of the whole earthworms E. fetida was profiled using 2-DE.Multiple 2-DE gels were acquired, and we present a 2-DE map of the earthworms E. fetida (Figure 1).Analytical and biological variances were calculated for triplicate 2-DE gels obtained for similar but independent culture conditions.MALDI-TOF/TOF-MS allowed the identification of the proteins from all indicated spots, corresponding to 76 different proteins.The identified proteins are listed in Table 1 according to their functional categories.These data have allowed us to establish a earthworm E. fetida database, available worldwide, that contains information on protein species identified in the gels.
The resulting master image, representing all reproduceble features, contains 1500 protein spots in the range pH 3 -10.We excised 230 spots from gels and a total of 76 spots were identified after tryptic digestion by MALDI-TOF/TOF-MS.Representative MALDI-TOF/TOF-MS data in Figure 2, and a database search result are shown in Figure 3.However, it cannot be ruled out that the multiple protein identifications due to streaky protein spots or due to diffusion of protein spots in close proximity.Some proteins were present as multiple spots, including lysenin, lombricine kinase, extracellular globin-4, intermediate filament protein, etc.According to the database, the identified proteins were classified by Gene Ontology Tool (http://www.uniprot.org).
Spot 1560 (Predicted protein XP_001503943) corresponded to an MS/MS sequences (i.e.K.AAESPPHPLEESEVLVEK.S) indicating similarity to the cardiomyopathy associated 5, which is expressed in the Equus caballus.Spot 1553 corresponded to an MS/MS sequence of R.NVSNVNYVSPVR.N.Homology searches revealed that this sequence corresponds to hypothetical protein TRIADDRAFT_28083, Trichoplax adhaerens.Thus, we have determined the most inclusive 2-DE map of the earthworms E. fetida proteins published to date.
The individual protein spots, the putative names of proteins, the organism from which the protein was identified, the MASCOT score together with the sequence coverage, the values for experimental and theoretical pI, and the molecular mass.The identified proteins could be classified into five major groups according to their biological function as shown in Figure 3: 1) Metabolism, 32 spots: seven are associated with transporter activity/ATP synthesis coupled proton transport/transferase/microtubule-based movement/synaptic transmission /regulation of GTPase activity, three other spots are involved in catalytic activity, nine spots are associated with zinc ion-binding/calcium ion-binding/ATP-binding/sugar binding/actin binding, three spots are associated with proteolysis, nine spots are associated with the tricarboxylic acid cycle/cellular amino acid metabolic process/protein amino acid phosphorylation/nucleobase, nucleoside, nucleotide and nucleic acid metabolic process/glycolysis/ glucose metabolic process, and one spot is associated with oxidation reduction; 2) Stress response related proteins, 12 spots in total: one spot is associated with electron transport, two spots are associated with oxidation by MALDI-TOF/TOF-MS analysis.We found that these spots could be classified into eight major categories according to their putative biological role (Figure 4): metabolism, stress, defense, transcription, translation, signal transduction, predicted and hypothetical, and other.Several proteins produced multiple spots, which was most likely due to posttranslational modifications (PTMs).In most cases, the shift in position is horizontal, suggesting that the modification affects only the pI without changes in molecular mass [20].Their functional significance is discussed below.reduction, one spot is associated with positive regulation of apoptosis, one spot is associated with protein folding, two spots are associated with oxygen transport, two spots are associated with response to stress and oxidative stress, one spot is associated with kinase activity, and two spots are associated with actin-binding; 3) Defense-related, 4 spots: two spots are associated with lysenin, whose function is a defense response to bacteria, one spot is associated with antimicrobial peptide lumbricin 1, and one spot is associated with coelomic cytolytic factor 1, whose function is a defense response to microbial stimulation; 4) Transcription, two spots are associated with the regulation of transcription, and one spot is associated with transcription; 5) Translation, one spot is associated with the regulation of translation; 6) Signal transduction, one spot is associated with signal transduction.
Of the identified proteins, the most interesting group plays a role in invertebrate immunity function.Members of this group include caspase-8, manganese superoxide dismutase (MnSOD), chaperonine protein HSP60, Extracellular globin-4, gelsolin-like proteins, lysenin, coelomic cytolytic factor 1 (CCF 1).HSP60 is known to be up-regulated in the early period after Cd exposure [21].A study reported that the potential functions of hemoglobin may include anti-oxidative defense [22].Gelsolins all likely play an important role in altering the contractile function and motility of vascular smooth muscle [23].Lysenin, which has been shown to cause

DISCUSSION
The pattern of protein expression in the earthworms E. fetida was studied by 2-DE and Coomassie staining method.We were able to visualize and quantify around 1500 protein spots.76 different proteins were identified contraction of rat vascular smooth muscles [24].The cytotoxic effects of lysenin seem to be due to membrane damage that is caused by the binding of lysenin to SM in the plasma membrane [25].Lysenin produced two spots (232 and 212) most likely due to PTMs.In annelids, CCF is a defense molecule with functional analogies to mammalian tumor necrosis factor (TNF), which plays an important role in regulating immune responses [26,27].
In invertebrates, the orthodenticle gene is involved in regulating morphogens by activating transcription [28].One spot, corresponding to a predicted mitochondrial aldehyde dehydrogenase 2 family protein.Moreover, aldehyde dehydrogenase was reported as inducible in response to various environmental stress conditions in mice [29].McKim et al. found that apo-MT was degraded by cathepsins.Another predicted protein spot, identified as cinema1 belonging to the immunity-related GTPase family [30].The powerful biological activity of the immunity-related GTPases (IRG) against a range of pathogens has been extensively reviewed [31].Djabali et al. found that the intracellular state of alphaB-crystallin seemed to correlate directly with the remodeling of the intermediate filament network in response to stress [32].One identified protein lombricine kinase which plays key roles in the coupling of energy production and utilization in animals [33].Fibrinolytic proteases dissolve blood fibrin clots, and are important chemotherapeutic agents [34].One spot (spot 919), identified as caspase-8, correspond to enzymes that has been demonstrated to play a key role in mediating Fas-induced apoptosis [35].Apoptosis is executed by caspases, a family of proteases that disassemble a cell [36].Caspase-8 is an initiator of the downstream apoptotic process that includes the activetion of Casp3, -6, and -7 and mitochondrial damage [37].One spot (spot 155) corresponding to catalase which is important for peroxide and radical detoxification [38].Catalase activity was studied as possible biomarkers of toxic stress in earthworms [39].One spot (spot 406), identified as HSP90, Hsp90 is known to be involved in the inhibition of stress-induced apoptotic cell death and is a ubiquitous molecular chaperone found in eubacteria and all branches of Eukarya [40].It plays a central role in cellular signaling because it is essential for maintaining the activity of key signaling factors, including steroid hormone receptors and protein kinases [41].One spot (spot 155) corresponding to antimicrobial peptide lumbricin 1. Antimicrobial peptides have been considered to be a universal host defense tool used by earthworms against microbial infection.Lumbricin 1 has been isolated and characterized from the earthworm L. rubellus, which showed antimicrobial activity in vitro against a broad spectrum of microorganisms without hemolytic activity [42].
Analyzing gene expression at the protein level is po-tentially more informative than mRNA analysis.In fact, proteins have a number of important features that reveal their actual, rather than potential, function.Consequently, a series of proteome database projects have been undertaken to identify all proteins in an organized cell and to explain the role of each protein using published biological data.Furthermore, in contrast to the predominantly static genome, a proteome is exceedingly dynamic: Mechanisms such as cell activation, disease, infection, and differentiation can considerably transform the proteome [43].Our data support a resource for the proteomics profile of the whole earthworm E. fetida.We used a gel image to identify and localize 76 distinct proteins.To our knowledge, this is the first description of a 2-DE reference map of the whole earthworm E. fetida proteins.These 2-DE data will enhance future comparisons of immunity, toxicology, biotic processes and other challenges, thereby allowing for further study of the molecular mechanisms of innate immunity.

Figure 2 .
Figure 2. Identification of spot 23 via MALDI-TOF/TOF-MS.The protein excised from the gels was digested with trypsin and the resulting peptides were analyzed by a 4800 MALDI-TOF/TOF Proteomics Analyzer [Applied Biosystems, Foster City, CA].[A] The MS spectra.The 975.52 ion marked with an asterisk was analyzed by MS/MS.[B] MS/MS spectra of ion 975.52.

Figure 3 .
Figure 3. Database search result of spot 23.

Figure 4 .
Figure 4. Summary of the classification of the biological functions with gene ontology analysis of identified proteins.

Table 1 .
List of identified proteins grouped in functional categories.
a SC is the abbreviation for sequence coverage.Experimental pI and molecular mass [kDa] from 2-DE gels and NCBI accession numbers are listed.Proteins identified are listed with the mascot MOWSE score as a measure of confidence for MS/MS-based identification.Spot numbers are identical to spot numbers in Figure1.