Analysis of Peanut Seed Oil by NIR

Near infrared reflectance spectroscopy (NIRS) was used to collect spectra from Arachis hypogaea seed samples, and the spectra were used in predictive models to rapidly identify varieties with high oleic acid content. The method was developed for shelled peanut seeds with intact testa. Spectra were evaluated initially by principal component analysis (PCA) followed by partial least squares (PLS). In PCA, one principal component accounted for 97% of the variability with full spectra and 99% with reduced spectra. The PLS model generated from first derivative spectra provided a standard error of prediction (SEP) of 7.204808% oleic acid. This technique provides a non-destructive method to rapidly identify high oleic peanut seeds to support the selection and cultivation of high oleic acid peanut varieties. The method can also be useful at peanut processing facilities for screening and quality assessments.


Introduction
Near infrared reflectance spectroscopy (NIRS) is widely used by the food industry for routine oil, protein, and moisture analysis. It is expanding into agricultural applications with the development of precision farming techniques. This includes both crop production and post-harvest applications. The infrared instrumentation developed for laboratory use is now commercially available in smaller and more rugged designs suitable for use in field and processing facility applications. These new devices are versatile and may be incorporated with remote sensing equipment or integrated with process control systems.
Interest in high oleic peanut varieties has increased recently due to the health benefits associated with unsaturated fatty acids [1] [2]. The analysis of oilseed lipids by spectroscopic methods demonstrated great success and was accepted as the basis for AOCS standard methods [3]. The standard method focused on the measurement of total oil, although the spectroscopic approach was also applied to determine fatty acid profiles and lipid oxidation products [4] [5]. The technique proved useful for quality assessments in food and feed formulations supplemented with polyunsaturated fatty acids such as docosahexaenoic acid and linolenic acid. These compounds are particularly susceptible to oxidation and the formation of malodorous degradation products [6] [7]. Advantages of infrared spectroscopic analysis include minimal sample preparation and reduced analysis time compared to other methods [8] [9].
The combination of spectroscopic and chemometric methods provides a powerful technique to detect, interpret, and model changes in spectra correlated with sample composition. These results may be used for both descriptive and predictive purposes. Spectra are collected and subjected to various pre-processing transformations to facilitate analysis and model development. Different spectral regions carry more or less information relevant to a particular problem, so a systematic approach to testing and validating a predictive model is necessary. This investigation used two of the most common chemometric techniques, principal component analysis (PCA) and partial least squares (PLS), to analyze a set of near infrared spectra obtained from shelled peanuts with varying amounts of oleic acid.

Materials and Methods
Peanut seed samples of normal and high oleic varieties were provided by Prof. Naveen Puppala, New Mexico State University, NM, USA. Peanuts were shelled and stored in a -5 °C freezer until measurements were taken. Extraction and analysis of peanut lipids were performed by gas chromatography following the AOCS standard method [10].

Spectroscopy
Near infrared reflectance spectra were collected with the SeedMeister model 709A spectrophotometer manufactured by the Brimrose Corp. of America, Sparks, MD, USA. Samples were scanned from 1200 to 2000 nm. Spectral analyses were performed with Unscrambler X, Camo Software, Oslo, Norway. Spectra were transformed by multiplicative scatter correction (MSC) and a Norris gap first derivative. Spectra were examined initially by principal component analysis (PCA) to identify clusters of high oleic and normal oleic content seeds. PCA is useful for constructing models that describe data. It transforms the original variables into new variables that are linear combinations of the original variables, calculated so that a small number of them captures most of the variability in the data [11] [12].
After the descriptive PCA models were evaluated, the predictive partial least squares (PLS) models were developed [13]. PLS seeks a model that minimizes the residual error. The resulting model may be used to describe spectral data; in this case, however, it was used to predict values of oleic acid from spectral data. The error between measured and predicted values is described by either the root mean square error of prediction (RMSEP) or the standard error of prediction (SEP) [14]. PLS models were generated from full spectra, 1200 to 2000 nm, and reduced spectra, 1600 to 1800 nm. Analyses were performed with cross-validation using 8 segments and 1225 calibration samples. Model performance was reported in terms of SEP.
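The two pre-processing transformations named above can be sketched in a few lines of NumPy. This is an illustrative sketch only, not the Unscrambler X implementation: the function names `msc` and `gap_first_derivative`, the gap size, and the synthetic spectra are all assumptions made for the example, and the derivative shown is a plain symmetric gap difference without any segment smoothing.

```python
import numpy as np


def msc(spectra, reference=None):
    """Multiplicative scatter correction (illustrative sketch).

    Each spectrum is regressed on a reference spectrum (the mean spectrum
    by default) and the fitted offset and slope are removed.
    """
    spectra = np.asarray(spectra, dtype=float)
    ref = spectra.mean(axis=0) if reference is None else np.asarray(reference, dtype=float)
    corrected = np.empty_like(spectra)
    for i, row in enumerate(spectra):
        # Least-squares fit: row ≈ intercept + slope * ref
        slope, intercept = np.polyfit(ref, row, 1)
        corrected[i] = (row - intercept) / slope
    return corrected


def gap_first_derivative(spectra, gap=3):
    """Symmetric gap first derivative (a simplified Norris-style gap).

    The value at each channel is the difference between the points `gap`
    channels to its right and left, divided by the span of 2 * gap channels.
    The output has 2 * gap fewer columns than the input.
    """
    spectra = np.asarray(spectra, dtype=float)
    if gap < 1 or spectra.shape[1] <= 2 * gap:
        raise ValueError("gap must be >= 1 and smaller than half the spectrum length")
    return (spectra[:, 2 * gap:] - spectra[:, :-2 * gap]) / (2 * gap)


# Hypothetical data: one band shape seen with different offsets and gains,
# mimicking scatter effects. These are NOT the measured peanut spectra.
wavelengths = np.linspace(1200, 2000, 81)
base = np.exp(-((wavelengths - 1725) / 60.0) ** 2)
offsets = np.array([0.00, 0.05, 0.10, 0.15, 0.20])
gains = np.array([0.8, 0.9, 1.0, 1.1, 1.2])
raw = offsets[:, None] + gains[:, None] * base

processed = gap_first_derivative(msc(raw), gap=3)
```

Because the synthetic spectra differ only by an additive offset and a multiplicative gain, MSC collapses them onto the mean spectrum, which is the behavior the correction is designed to produce. Real spectra also differ in chemistry, and that difference is what remains after correction.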

Results and Discussion
The infrared spectra from normal and high oleic peanut seeds are shown in Figure 1. These curves are averages of fifteen first derivative spectra. The spectral features appear very similar and require chemometric analysis to identify differences that may correlate to changes in seed oil composition. Evaluation of the spectral data was performed by principal component analysis (PCA) to visualize groups or clusters of data and reduce the number of variables, i.e., wavelengths, required to develop a predictive model. The application of PCA provided a descriptive model to explore the spectral data and estimate the likely success of a predictive model. If PCA had not shown two groups that correlated to normal and high oleic content oil seeds, it is unlikely that a useful predictive model could have been developed.
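The exploratory use of PCA described above can be illustrated with a short NumPy sketch. It computes scores and the fraction of variability explained per component from a singular value decomposition of the mean-centered spectra. The function name `pca`, the two synthetic groups, and all numeric settings are assumptions for illustration; they are not the measured data or the Unscrambler X procedure.

```python
import numpy as np


def pca(spectra, n_components=2):
    """PCA by singular value decomposition of mean-centered data.

    Returns (scores, loadings, explained), where `explained` holds the
    fraction of total variance captured by each retained component.
    """
    X = np.asarray(spectra, dtype=float)
    Xc = X - X.mean(axis=0)                      # center each wavelength
    U, s, Vt = np.linalg.svd(Xc, full_matrices=False)
    explained = s ** 2 / np.sum(s ** 2)          # variance fraction per component
    scores = U[:, :n_components] * s[:n_components]
    return scores, Vt[:n_components], explained[:n_components]


# Hypothetical data: two groups of 15 spectra that differ in one band.
rng = np.random.default_rng(0)
channels = np.arange(50)
band = np.exp(-((channels - 25) / 5.0) ** 2)
baseline = 0.2 + 0.01 * channels
normal_group = baseline + rng.normal(0.0, 0.01, size=(15, 50))
high_group = baseline + 0.5 * band + rng.normal(0.0, 0.01, size=(15, 50))
X = np.vstack([normal_group, high_group])

scores, loadings, explained = pca(X)

# The first-component scores of the two groups should not overlap.
pc1_normal, pc1_high = scores[:15, 0], scores[15:, 0]
separated = pc1_normal.max() < pc1_high.min() or pc1_normal.min() > pc1_high.max()
```

In this toy case the band difference dominates the noise, so the first component carries most of the variance and its scores split cleanly into two clusters. This is the pattern one looks for in a score plot before attempting a regression model.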
Initial results from PCA are presented in Figure 2 and Figure 3 using full and reduced spectra, respectively. These plots show two types of peanut seeds, i.e., normal oleic and high oleic content. In Figure 2 the full spectral range of 1200 to 2000 nm was used and the first principal component accounted for 97% of the variability in the measured values. In Figure 3 the PCA results obtained for the reduced spectral region of 1600 to 1800 nm accounted for 99% of the variability with one principal component. In both cases there is separation into two groups corresponding to normal oleic and high oleic oilseeds. These results show the potential for successfully developing models to predict oleic acid content of shelled peanut seeds from infrared spectra. At this point the descriptive PCA model indicated a good likelihood of constructing a predictive model using a regression technique such as partial least squares (PLS).
Subsequent analysis by PLS was performed with the first derivative spectra and provided the results shown in Figure 4 and Figure 5 for predictive models based on the full and reduced spectral regions. Figure 4 displays the score plot for the full spectra case, with 64% of the variability in the predicted value accounted for by the first factor. In the score plot for the model developed from the reduced spectral region, Figure 5, the first factor accounted for 98% of the variability in the predicted value. Models for both cases are able to distinguish normal and high oleic peanut seeds. The performance of the two models is described in more detail by the standard error of prediction (SEP) values. Results for the two models described by the first and second factors showed the SEP value decreased from 12.08858 for the full model to 7.204808 for the reduced model. This represents a significant improvement in the ability of the reduced model to accurately predict oleic acid in peanuts. The SEP carries the units of the predicted variable, % oleic acid, and may be interpreted accordingly. For example, if both models predicted 70% oleic acid content, the standard error in this value would be 12.08858% oleic acid for the full model and 7.204808% oleic acid for the reduced model.
Further refinement of the model by reduction of the spectral region used is possible using techniques such as two-dimensional correlation spectroscopy and multivariate curve resolution [15] [16]. These methods provide a systematic approach to select spectral regions, reduce the number of calculations, and improve the predictive capability of the model [17] [18]. However, the current model is adequate for distinguishing normal and high oleic peanut seeds without additional modifications.
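To make the SEP figures concrete, the following sketch fits a two-factor PLS model with 8-segment cross-validation and reports SEP, RMSEP, and bias in the units of the predicted variable. It is a minimal illustration under stated assumptions: the NIPALS-style `pls1_fit`, the helper names, and the synthetic spectra are invented for the example and do not reproduce the Unscrambler X models or the values reported here. SEP is taken as the bias-corrected standard deviation of the prediction residuals, one common definition.

```python
import numpy as np


def pls1_fit(X, y, n_factors):
    """PLS1 regression (NIPALS-style) for a single response variable."""
    x_mean, y_mean = X.mean(axis=0), y.mean()
    E, f = X - x_mean, y - y_mean               # centered residual matrices
    W, P, q = [], [], []
    for _ in range(n_factors):
        w = E.T @ f
        w = w / np.linalg.norm(w)               # weight vector
        t = E @ w                               # scores
        tt = t @ t
        p = E.T @ t / tt                        # X loadings
        c = (f @ t) / tt                        # y loading
        E = E - np.outer(t, p)                  # deflate X
        f = f - c * t                           # deflate y
        W.append(w); P.append(p); q.append(c)
    W, P, q = np.array(W).T, np.array(P).T, np.array(q)
    coef = W @ np.linalg.solve(P.T @ W, q)      # regression vector
    return coef, y_mean - x_mean @ coef         # coefficients, intercept


def cross_validated_errors(X, y, n_factors=2, n_segments=8, seed=0):
    """Segmented cross-validation returning SEP, RMSEP and bias."""
    X, y = np.asarray(X, dtype=float), np.asarray(y, dtype=float)
    order = np.random.default_rng(seed).permutation(len(y))
    predicted = np.empty_like(y)
    for held_out in np.array_split(order, n_segments):
        train = np.setdiff1d(order, held_out)
        coef, intercept = pls1_fit(X[train], y[train], n_factors)
        predicted[held_out] = X[held_out] @ coef + intercept
    residuals = predicted - y
    bias = residuals.mean()
    sep = np.sqrt(np.sum((residuals - bias) ** 2) / (len(y) - 1))
    rmsep = np.sqrt(np.mean(residuals ** 2))
    return {"sep": sep, "rmsep": rmsep, "bias": bias}


def make_synthetic_spectra(n_samples=80, n_channels=40, seed=1):
    """Hypothetical spectra whose band height tracks % oleic acid."""
    rng = np.random.default_rng(seed)
    oleic = rng.uniform(40.0, 85.0, n_samples)  # invented % oleic acid values
    band = np.exp(-((np.arange(n_channels) - 20) / 4.0) ** 2)
    spectra = np.outer(oleic / 100.0, band)
    spectra += rng.normal(0.0, 0.001, spectra.shape)
    return spectra, oleic


X, y = make_synthetic_spectra()
errors = cross_validated_errors(X, y, n_factors=2, n_segments=8)
```

Because `errors["sep"]` is in % oleic acid, it reads the same way as the values in the text: a smaller SEP means a tighter spread of predictions around the measured values. Restricting the columns of `X` to a sub-range before calling `cross_validated_errors` would mimic the comparison of full and reduced spectral regions.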

Conclusion
The selection and cultivation of high oleic peanut varieties is motivated by the health benefits associated with seed oil composed of stable unsaturated fatty acids such as oleic acid. NIRS provides a tool capable of rapidly and non-destructively measuring the amount of oleic acid in shelled peanut seeds. The implementation of predictive models used with modern infrared instrumentation supports this objective and finds applications in the field, the laboratory, and peanut processing facilities.