Experimental Investigation and Modeling of Activity Coefficient at Infinite Dilution of Solutes Using Dicationic Solvent Based on Pyrrolidinium as a New Stationary Phase in Gas Chromatography

Activity coefficients at infinite dilution, γi∞, were determined for 12 organic solutes: linear alcohols (methanol, ethanol, propanol), linear alkanes (heptane, octane), benzene, toluene, cyclohexane, 1,2-dichloroethane, trichloroethylene, acetonitrile and carbon tetrachloride. The values of γi∞ were determined at different temperatures by both thermodynamic and artificial neural network (ANN) modelling. A comparison of the results from the two methods showed that the experimental and predicted values are in close agreement. The accuracy of the predicted results indicates that the model is applicable to a wide range of solutes and can be used as an alternative to conventional gas-liquid chromatography for the measurement of activity coefficients at infinite dilution.


Introduction
The measurement of activity coefficients at infinite dilution (γ∞) is crucially important for both theoretical and applied chemistry. This parameter describes the behavior of a solute completely surrounded by solvent molecules. Activity coefficients at infinite dilution have been widely used to quantify the volatility of solutes and to provide information about the intermolecular energy between solvent and solute [1] [2] [3] [4] [5]. Values of γ∞ are decisive for calculating the limiting separation factors needed for the reliable design of distillation processes and for the selection of solvents for extraction and extractive distillation. Moreover, activity coefficients are important for characterizing the behavior of liquid mixtures, predicting the existence of azeotropes, estimating mutual solubilities and calculating Henry constants and partition coefficients.
Several methods have been developed for the measurement of γ∞, such as the dilutor technique (DT) [6] [7], inert gas stripping [6], differential ebulliometry [8], headspace [9] and dew point techniques [10]. However, each method has drawbacks in terms of time, cost and material. Because the chromatographic technique needs less than 1 gram of ionic liquid (IL), it can be considered a cost-efficient, rapid and reliable method.
It is important to have a simple method to estimate all property distributions from known bulk properties. Over the last 20 years, artificial neural networks (ANNs) have been widely applied in chemical engineering, for example in process modeling, optimization and PVT behavior. With the mathematical algorithm of an ANN, it is possible to relate input and output parameters without prior knowledge of the relationships between the process parameters [11] [12] [13] [14] [15].
In this work, values of γ∞ for 12 compounds in a pyrrolidinium-based dicationic ionic liquid with three phase loadings (10%, 15% and 20%) were determined at 308, 313, 318 and 323 K. Given the importance of the activity coefficient at infinite dilution in thermodynamics and separation processes, there is a growing need to obtain it in a simple and fast way. Therefore, an artificial neural network (ANN) model was developed to predict the values of γ∞ for an extensive range of solutes.

Solvents and Solutes
All solvents were distilled from standard drying agents before use. All ionic liquids used were synthesized in CCERI [1]. N-Methylpyrrolidine, 1,9-dibromononane, lithium bis(trifluoromethylsulfonyl)imide and phosphorus pentoxide were purchased from Sigma-Aldrich. ¹H NMR spectra (500 MHz) were recorded in deuterated acetonitrile. Since the GLC process separates the solutes from any impurities, the solutes were used without further purification.

Analysis Method
Gas chromatography experiments were performed using a Varian CP-3800 gas chromatograph equipped with a heated 1041 injector and a thermal conductivity detector (TCD). The injector and detector temperatures were kept constant at 473 K during all experiments. The flow rate of helium was adjusted to obtain adequate retention times. The dead time was determined by injecting air with each solute. Detector signals were recorded on a personal computer, and the corresponding chromatograms were obtained with Galaxie software.

Stationary Phase Preparation and Sample Injection Condition
Column packings containing 10%, 15% and 20% of stationary phase (IL) on Chromosorb W-AW (80-100 mesh) were prepared using the rotary evaporator technique. After evaporation of the dichloromethane under vacuum, the support was equilibrated at 323 K for 18 hours. The solid support with the stationary phase was filled into a stainless steel column with an inner diameter of 3 mm and a length of 1 m. The weight of the packing material was calculated from the weights of the packed and empty column. A volume of 0.1 to 0.5 µL of the headspace vapor of each sample was injected to ensure infinite dilution conditions. No differences in retention times t_r were found between injections of individual pure components and injections of their mixtures. The measurements were carried out at temperatures between 308 and 323 K. At a given temperature, each experiment was repeated at least three times to verify the reproducibility. The retention times of the three measurements were generally reproducible to within 0.01 to 0.1 min.
Under the aforementioned conditions, retention data for 12 solutes on three gas chromatography columns with different phase loadings (10%, 15% and 20%) and at different temperatures (308, 313, 318 and 323 K) were obtained and used to calculate the activity coefficients at infinite dilution.

Thermodynamic Modeling
Equation (1), suggested by Everett and Cruickshank et al. [16] [17], was used to determine the γi∞ values for a solute eluting in a carrier gas,
where n is the number of moles of the stationary phase component inside the column, R is the ideal gas constant, T is the oven temperature, V_N is the standardized retention volume of the solute, P° is the column outlet pressure (equal to atmospheric pressure), V^S is the saturated liquid molar volume of the solute at T, and V^∞ is the partial molar volume of the solute at infinite dilution in the solvent. B_11 is the second virial coefficient of the solute in the gaseous state at temperature T, B_12 is the mutual virial coefficient between the solute (1) and the carrier gas helium (2), and P^S is the probe vapor pressure at temperature T. The second and third terms in Equation (1) are correction terms that result from the non-ideality of the mobile gaseous phase. The molar volume of the solute, V^S, was determined from experimental densities, and the partial molar volumes of the solutes at infinite dilution, V^∞, were assumed to be equal to V^S. The vapor pressure values were calculated using the Antoine equation [18] [19].
The standardized retention volume, V_N, can be calculated with Equation (2), in which the adjusted retention time, t'_r, is the difference between the retention time of a solute and that of air, and U_0 is the flow rate of the carrier gas measured at room temperature. The factor J corrects for the influence of the pressure drop along the column. Its value depends on the pressures at the column inlet and outlet and is defined by Equation (3).
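For reference, Equations (1) to (3) are commonly written in the gas-liquid chromatography literature in the form sketched below, using the symbols defined above. This is a hedged reconstruction consistent with those definitions, not necessarily the exact notation of the original. The inlet pressure symbol P_i is an assumption, as is the convention for J: here J is at least 1 and divides the retention volume, whereas some authors define J as the reciprocal, in which case Equation (2) multiplies by J and the last term of Equation (1) divides by it. Any correction of the flow rate from room temperature to column temperature is omitted.

```latex
\begin{align}
\ln \gamma_i^{\infty} &= \ln\!\left(\frac{n R T}{V_N P^{S}}\right)
  - \frac{P^{S}\,\bigl(B_{11} - V^{S}\bigr)}{R T}
  + \frac{P^{\circ} J\,\bigl(2 B_{12} - V^{\infty}\bigr)}{R T} \tag{1} \\
V_N &= \frac{U_0\, t'_r}{J} \tag{2} \\
J &= \frac{2}{3}\,\frac{\bigl(P_i/P^{\circ}\bigr)^{3} - 1}{\bigl(P_i/P^{\circ}\bigr)^{2} - 1} \tag{3}
\end{align}
```

With V^∞ set equal to V^S, as assumed in this work, the last term of Equation (1) depends only on B_12, V^S and the mean column pressure P°J.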
The values of B_11 and B_12 were calculated using the McGlashan and Potter equation [20].
The critical properties of the pure components were obtained from the literature [22], and the mutual critical data T_c12 and V_c12 were calculated using the combining rule presented by Hudson and McCoubrey [23].
Activity coefficients at infinite dilution of various types of solutes were computed in the dicationic stationary phase at three phase loadings (10%, 15% and 20%) and four temperatures (308, 313, 318 and 323 K). The results for the 12 solutes are presented in Table 1.
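The calculation described in this section can be sketched numerically as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names are hypothetical, SI units are assumed throughout, the pressure-drop factor uses the convention J ≥ 1 (so the retention volume is divided by J), and the flow-rate temperature correction is omitted. The McGlashan-Potter correlation is used in its usual form, with `n_eff` the effective carbon number; B_12 is assumed to come from the same function evaluated with the mixed critical data T_c12 and V_c12.

```python
import math

R_GAS = 8.314462618  # ideal gas constant, J/(mol K)


def mcglashan_potter_b(t, tc, vc, n_eff):
    """Second virial coefficient (same units as vc) from the McGlashan-Potter
    correlation: B/Vc = 0.430 - 0.886 x - 0.694 x^2 - 0.0375 (n - 1) x^4.5,
    with x = Tc/T and n the (effective) number of carbon atoms."""
    x = tc / t
    return vc * (0.430 - 0.886 * x - 0.694 * x**2
                 - 0.0375 * (n_eff - 1) * x**4.5)


def pressure_drop_factor(p_in, p_out):
    """Factor J (assumed convention, J >= 1) from inlet and outlet pressure."""
    ratio = p_in / p_out
    return (2.0 / 3.0) * (ratio**3 - 1.0) / (ratio**2 - 1.0)


def net_retention_volume(u0, t_adj, j):
    """V_N from flow rate U_0, adjusted retention time t'_r and factor J."""
    return u0 * t_adj / j


def ln_gamma_inf(n_moles, t, v_n, p_sat, p_out, j, b11, b12, v_sat, v_inf=None):
    """Logarithm of the activity coefficient at infinite dilution.
    v_inf defaults to v_sat, the simplification adopted in the text."""
    if v_inf is None:
        v_inf = v_sat
    rt = R_GAS * t
    return (math.log(n_moles * rt / (v_n * p_sat))
            - p_sat * (b11 - v_sat) / rt
            + p_out * j * (2.0 * b12 - v_inf) / rt)
```

In use, `pressure_drop_factor` and `net_retention_volume` would be evaluated for each injection, and `math.exp(ln_gamma_inf(...))` then gives γi∞ for one solute, loading and temperature.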

Artificial Intelligence Modeling
An artificial neural network was applied to model the system in order to predict the activity coefficients at infinite dilution for a wide range of chemical compounds. In total, 144 data sets were used: 70% for training, 15% for validation and 15% for testing.
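The 70/15/15 division of the 144 data sets can be sketched as below. The random shuffling, the seed and the rounding of the subset sizes are assumptions made for illustration; the text does not state how the data sets were assigned to each subset.

```python
import random


def split_indices(n_samples, train_frac=0.70, val_frac=0.15, seed=0):
    """Randomly split sample indices into training, validation and test sets.
    The test set receives whatever remains after the first two subsets."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)  # reproducible shuffle
    n_train = round(train_frac * n_samples)
    n_val = round(val_frac * n_samples)
    train = indices[:n_train]
    val = indices[n_train:n_train + n_val]
    test = indices[n_train + n_val:]
    return train, val, test


train_idx, val_idx, test_idx = split_indices(144)
print(len(train_idx), len(val_idx), len(test_idx))  # 101 22 21
```

Because 144 is not divisible into exact 70/15/15 shares, the subset sizes are only approximately in that ratio.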
One of the most popular and commonly used networks is the multilayer perceptron (MLP). The MLP configuration has gained widespread use in static regression applications [24]-[29]. It can have one or more hidden layers. Cybenko [30] and Huang et al. [31] proved that a network with one hidden layer and a sufficient number of neurons can represent any type of multidimensional non-linear function, and additional hidden layers may result in over-fitting. Therefore, in this work one hidden layer was applied, as displayed in Figure 2. In addition, a procedure modified from our previous works [32] [33] was selected to design a relatively small yet accurate network.
DOI: 10.4236/ajac.2018.94020
The procedure flowchart is shown in Figure 3. In the first step of the procedure, a randomly chosen training method was applied to find the number of neurons in the hidden layer that minimizes the mean squared normalized error (MSE) of the network, defined by Equation (4),
where e_i is the difference between the experimental and predicted values.
In order to improve the model generalization and prevent over-fitting, the number of neurons has to be chosen so that the number of internal parameters in the network does not exceed the number of training data sets [34]. The number of internal parameters was calculated according to Equation (5) [35], where n_tot is the total number of network parameters, n_o is the number of outputs and n_hi is the number of neurons in the ith hidden layer. In this work, the maximum number of neurons that can be used in the hidden layer without over-fitting was calculated to be seven. Thus, the choice of neuron number was limited to the range of 1 to 7 neurons for the hidden layer.
In the second step, the network with the neuron number from the previous step was used to find the training method that leads to the minimum MSE of the network. If the network MSE was less than the desired MSE, the third step was started. Otherwise, the last two steps were repeated until the desired MSE value was reached.
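The parameter-count constraint described above can be sketched as follows, assuming the standard count for a fully connected network in which every neuron has one weight per incoming connection plus one bias. The exact form of the equation from [35] is not reproduced here, so this is an illustration rather than the authors' formula, and the function names are hypothetical.

```python
def count_parameters(n_inputs, hidden_layers, n_outputs):
    """Total weights and biases of a fully connected feed-forward network.
    hidden_layers is a list with the number of neurons in each hidden layer."""
    sizes = [n_inputs, *hidden_layers, n_outputs]
    # Each layer contributes (fan_in + 1) parameters per neuron (weights + bias).
    return sum((fan_in + 1) * fan_out
               for fan_in, fan_out in zip(sizes, sizes[1:]))


def fits_training_set(n_inputs, hidden_layers, n_outputs, n_train):
    """True if the parameter count does not exceed the number of training sets."""
    return count_parameters(n_inputs, hidden_layers, n_outputs) <= n_train


# Four inputs, seven hidden neurons and five outputs, as listed in the Results.
print(count_parameters(4, [7], 5))  # 75
```

With four inputs, seven hidden neurons and five outputs, this count gives 75 parameters, which is below the roughly 101 training sets that correspond to 70% of 144.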
The applied training methods include Bayesian Regularization (BR), BFGS¹, Levenberg-Marquardt (LM), Gradient Descent with Momentum (GDM) and Gradient Descent (GD).
In the third step, the selected training method was applied to train the network with each number of neurons (1 - 8). Each of these trainings was repeated 1000 times, and the mean of the MSEs over the repeated trainings was recorded.
In addition to MSE, the correlation coefficient (R) is commonly used to verify ANN models. In this work R was also applied, as defined by Equation (6),
where τ_i is the target, α_i is the network output, and τ̄ and ᾱ are the mean values of the targets and outputs.
¹ Hessian updating method of Broyden, Fletcher, Goldfarb and Shanno (BFGS).
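The two performance measures can be sketched as below. The MSE is taken as the plain mean of the squared errors e_i and R as the Pearson correlation coefficient between targets τ_i and outputs α_i, which matches the symbol definitions given for Equations (4) and (6). Any normalization of the data applied before computing the "normalized" MSE is not specified in the text and is therefore left out; the function names are hypothetical.

```python
import math


def mean_squared_error(targets, outputs):
    """Mean of the squared differences e_i = target_i - output_i."""
    errors = [t - a for t, a in zip(targets, outputs)]
    return sum(e * e for e in errors) / len(errors)


def correlation_coefficient(targets, outputs):
    """Pearson correlation coefficient R between targets and network outputs."""
    n = len(targets)
    t_mean = sum(targets) / n
    a_mean = sum(outputs) / n
    cov = sum((t - t_mean) * (a - a_mean) for t, a in zip(targets, outputs))
    t_ss = sum((t - t_mean) ** 2 for t in targets)
    a_ss = sum((a - a_mean) ** 2 for a in outputs)
    return cov / math.sqrt(t_ss * a_ss)
```

A value of R close to 1 together with a small MSE indicates that the network outputs follow the targets closely; R alone does not detect a constant offset between them.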

Results
The ANN model was also employed to predict the activity coefficients at infinite dilution of different solutes. The design procedure described above was applied to build the model. Temperature, ionization energy, molecular weight and stationary phase loading were chosen as the input data of the network, and the activity coefficient, saturated pressure, saturated volume, adjusted retention time and the correction factor (J) were chosen as the output data.
The Levenberg-Marquardt (LM) method was found to give the minimum error, as shown in Table 2. The mean squared normalized error of the ANN model is shown in Figure 4, which indicates that seven neurons give the minimum error. Therefore, a network with seven neurons in the hidden layer (144:7:1), trained by the LM method, was selected as the best network to model this system. The optimal network structure can be seen in Figure 2. The network uses 144 data sets, divided into training, test and validation data.
The results of the ANN model and the experimental data are compared in Figure 5, and the error histogram with twenty bins is shown in Figure 6. According to Figure 6, the histogram has a peak around 0.017.
In this work, the activity coefficients at infinite dilution were calculated in three different ways and the final results were compared. The first method uses the thermodynamic model with experimental data. The second method takes the activity coefficients directly from the ANN model, and the third method uses the thermodynamic model with ANN-predicted data. Table 5 presents the results obtained with these methods. In this comparison, the first method, which uses experimental data to calculate the activity coefficient, is chosen as the basis for calculating the errors.
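A comparison of this kind, with the first method as the reference, can be sketched as follows. The text does not state which error measure was used, so the mean absolute relative deviation below is an assumption, and the function name is hypothetical.

```python
def average_relative_error(reference, predicted):
    """Mean absolute relative deviation of predicted values from the reference.
    The reference values must be non-zero."""
    deviations = [abs(p - r) / abs(r) for r, p in zip(reference, predicted)]
    return sum(deviations) / len(deviations)
```

Calling the function once with the ANN outputs and once with the values from the thermodynamic model fed with ANN-predicted data, each against the reference values of the first method, gives two numbers that can be compared directly.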

Discussion
The chromatographic data were used to determine the values of activity coefficients at infinite dilution with both the thermodynamic and the ANN model.
In the thermodynamic model, the values of the activity coefficients at infinite dilution were calculated for 12 solutes at four temperatures (308, 313, 318 and 323 K) on three columns with different stationary phase loadings (10%, 15% and 20%). The results obtained from the two models are broadly consistent. As a result, the ANN can be used efficiently to estimate activity coefficients at infinite dilution at different temperatures. A great advantage of the ANN model is that the values can be obtained directly from the retention time (t_r), the saturated liquid molar volume (V^S), the probe vapor pressure (P^S) and the ionization energy (I) at temperature T, without complicated thermodynamic computations. Given the strong similarity between the results of the two models, the range of solutes can be expanded, and activity coefficients at infinite dilution can be predicted by the ANN model for an extensive range of solutes from the same quantities at the desired temperature. Because all the steps related to the calculation of physicochemical parameters can be skipped, the ANN model can be considered a time-saving and cost-efficient technique for determining activity coefficients at infinite dilution in comparison with the thermodynamic model.
As can be seen in Table 3, the training error of the ANN model is 0.087, the validation error is 0.167, the test error is 0.166 and the overall error is 0.111.

Figure 2. The selected structure for the artificial neural network (ANN).

Figure 3. The procedure to design the artificial neural network.

Figure 5, the regression plot of the ANN model and the experimental data, shows an accurate prediction for the model.

Figure 4. The variation of average relative error and R with the number of neurons in the hidden layer.

Figure 5. Comparison of the experimental data and the predicted results of the network.

Figure 6. Error histogram with 20 bins obtained using the presented model, and the number of pure compounds in each range.

Table 1. Activity coefficients of solutes at infinite dilution.

Table 2. Results of the different training methods: mean squared normalized error of the data.
Table 3 reports the errors for the training and test stages of the ANN model. The weights and biases of this network are reported in Table 4 so that the results can be reproduced and the model can be used to estimate the activity coefficients of other compounds directly, without time-consuming experiments and thermodynamic modeling.

Table 3. Statistical properties of the trained ANN.

Table 4. Weights and biases of the selected artificial neural network.

As shown in Table 5, the activity coefficient predicted directly by the ANN model has the smallest error. The average overall error of the test data for the second and third methods is ...

Table 5. Comparison of the activity coefficients of the test data calculated with three different methods: the thermodynamic model with experimental data, the ANN output, and the thermodynamic model with ANN-predicted data.