Temperature Prediction of Aluminum Alloy Work-Pieces in Aging Furnaces Based on Improved Case-Based Reasoning
1. Introduction
The aging furnace (AF) is an important piece of equipment for the thermal treatment of aluminum alloy work-pieces. The treatment improves their corrosion resistance and their mechanical properties, such as hardness and ultimate tensile strength [1]. These properties are directly influenced by the temperature of the work-pieces during the temperature holding period, which is called the aging temperature. According to the production technology [2], the work-piece temperature must often be controlled within a very narrow range of about ±1˚C. To obtain qualified, symmetrical, high-strength work-pieces, accurate real-time measurement of the work-piece temperature is therefore of great importance. At present, the temperature is measured directly: thermocouples are installed manually on the work-pieces each time before they are loaded into the furnace. Because the work-pieces oscillate and the air inside the furnace is strongly convective, the thermocouples may detach from the work-pieces or even be damaged. The resulting inaccurate measurements cause unsuitable thermal treatment and even wasted work-pieces. Therefore, establishing a reasonable temperature prediction model for aluminum alloy work-pieces is of practical significance for precise control of the work-piece temperature during the aging process. To predict the temperature of a work-piece in the furnace, Shen et al. [3] proposed a 3D multi-zone multi-phase thermal model based on a novel double-extrapolation finite element method and achieved good results in certain situations. However, the complexity and randomness of the temperature field in the aging furnace are not yet well understood, which severely restricts the accuracy of mechanism models.
To deal with these problems, the large amounts of data measured and stored in the process industry have been used to build predictive models based on artificial intelligence techniques; such models are called soft sensors (SSs) [4]. Using the SS approach, Yu and He [5] built a temperature measurement model for aging furnaces based on the mixed kernel partial least squares (KPLS) algorithm and realized online estimation of the work-piece temperature in the furnace. However, the KPLS algorithm is prone to over-fitting, which may decrease the prediction accuracy. As a branch of artificial intelligence, case-based reasoning (CBR) has received increasing attention and has developed rapidly. It has good interpretability, and over-fitting can be avoided by incremental learning. It is applicable to fields that lack accurate mathematical models but have rich experience and historical cases, such as aluminum alloy factories and the steelmaking industry. CBR extracts knowledge from past cases to solve new problems, and it is usually applied to weak-theory domains in which a mathematical model cannot be established or is difficult to establish [6]. When a new operating condition is given, the CBR system retrieves similar cases from the database of measured and stored industrial data to provide a predicted work-piece temperature. The weight distribution over the feature attributes determines the accuracy of case retrieval and thus affects the performance of the CBR model. The genetic algorithm (GA) [7] is frequently used to optimize the weights of a CBR model. However, GA suffers from premature convergence [8] and low reliability [9], which may render the weights suboptimal or even unreasonable. In recent years, Zhou et al. [10] proposed a novel optimization algorithm, the state transition algorithm (STA), which performs better than GA in both global and local search.
Thus, this paper introduces a novel weight allocation method based on STA to improve the traditional CBR system; the improved CBR system is called STCBR. To verify the effectiveness of the developed method, STCBR is applied to the temperature prediction of aluminum alloy work-pieces in the AF. The experimental results show that STCBR predicts the work-piece temperature with high accuracy and is robust across the alloys tested.
2. STCBR Algorithm
The STCBR model involves five processes: case representation, case retrieval, case reuse, case revise, and case retain. Figure 1 shows an overview of the process.
2.1. Case Representation
Typically, a case model consists of two parts: feature attributes and solution attributes. Feature attributes are the mathematical description of a problem, and solution attributes are the solution to that problem. Therefore, the general case model of a source case can be represented by $C_i$ as in Equation (1):
$$C_i = (f_{i,1}, f_{i,2}, \dots, f_{i,n};\ s_{i,1}, s_{i,2}, \dots, s_{i,m}) \tag{1}$$
where n is the number of feature attributes, m is the number of solution attributes, and $f_{i,j}$ and $s_{i,j}$ represent the values of the feature attributes and solution attributes of the ith source case.
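As an illustration of the case model in Equation (1), a source case can be held in a small record type. The sketch below is one possible layout in Python; the class name `Case`, its field names, and the sample values are assumptions made for illustration and are not taken from the paper or its data set.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class Case:
    """One source case: n feature values plus m solution values."""
    features: Tuple[float, ...]   # problem description (n values)
    solutions: Tuple[float, ...]  # known solution (m values)


# Hypothetical record with n = 6 features and m = 2 solutions.
# The numbers are invented for illustration only.
example = Case(
    features=(120.5, 121.0, 3.0, 8.0, 120.0, 1.0),
    solutions=(119.8, 120.1),
)

print(len(example.features), len(example.solutions))
```

Keeping features and solutions in separate fields mirrors the two-part case model and lets retrieval code read only the feature part.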
2.2. Case Retrieval
The purpose of case retrieval is to retrieve one or more cases with the maximum similarity to the new target case from the case base, by calculating the similarity between them. At present, the k nearest neighbor algorithm (KNN) is often used for case retrieval in the CBR system [11] . Based on KNN, there are three steps to obtain the cases with the maximum similarity to the target case from the case base.
Figure 1. Flow diagram of the improved CBR.
In the first step, the local similarity between the target case and a source case in the case base is calculated for each feature attribute, as shown in Equation (2):
(2)
where the two arguments are the value of the jth feature attribute of the ith source case and the value of the jth feature attribute of the target case. The local similarity describes the degree of similarity between the target case and the source case with respect to the same feature attribute. Since there are n feature attributes, n local similarities must be calculated by Equation (2).
After all of the local similarities between the ith source case and the target case are available, the global similarity is introduced at the second step by Equation (3):
(3)
where each feature attribute j has its own weight, and n is the number of feature attributes.
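In the third step, the source cases are sorted by global similarity according to KNN, and the cases with the maximum global similarity are chosen for case revise and reuse. The maximum global similarity is defined as Equation (4):
$$SIM_{\max} = \max(SIM_1, SIM_2, \dots, SIM_{num}) \tag{4}$$
where $SIM_i$ is the global similarity between the target case and the ith source case, and num is the number of cases in the case base.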
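The three retrieval steps can be sketched in Python as follows. This is a minimal sketch under stated assumptions, not the paper's implementation: the local similarity is assumed to be 1 - |a - b| / max(|a|, |b|), a common choice for non-negative numeric attributes, and the global similarity is assumed to be the weight-normalized sum of the local similarities. The function names are hypothetical.

```python
from typing import List, Sequence, Tuple


def local_similarity(a: float, b: float) -> float:
    """Assumed form: 1 - |a - b| / max(|a|, |b|); 1.0 when both are zero."""
    scale = max(abs(a), abs(b))
    if scale == 0.0:
        return 1.0
    return 1.0 - abs(a - b) / scale


def global_similarity(target: Sequence[float],
                      source: Sequence[float],
                      weights: Sequence[float]) -> float:
    """Assumed form: weighted sum of local similarities over the weight sum."""
    total = sum(weights)
    weighted = sum(w * local_similarity(t, s)
                   for w, t, s in zip(weights, target, source))
    return weighted / total


def retrieve(target: Sequence[float],
             case_features: Sequence[Sequence[float]],
             weights: Sequence[float],
             k: int) -> List[Tuple[int, float]]:
    """Return (index, similarity) for the k most similar source cases."""
    scored = [(i, global_similarity(target, f, weights))
              for i, f in enumerate(case_features)]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Toy case base with two feature attributes (values are invented).
base = [(100.0, 2.0), (150.0, 4.0), (101.0, 2.0)]
best = retrieve((100.0, 2.0), base, weights=(0.5, 0.5), k=2)
print(best)
```

With the toy data, the identical case and its close neighbour are returned ahead of the distant case, which is the ordering the third step relies on.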
2.3. Weights Allocation Based on STA
In similarity calculation, the weights of feature attributes have a great impact on retrieved results and the accuracy of problem solving. Thus, it is important to optimize the weights of feature attributes to improve the quality of the CBR system. The optimization problem of weight allocation can be expressed as Equation (5):
(5)
where the decision variable is the weight vector of the feature attributes, and the objective is a fitness function that estimates the prediction accuracy of CBR on the training case set for a given weight vector. The fitness function is defined as Equation (6):
(6)
where q is the size of the training case set, and an indicator for the lth training case records whether that case is matched successfully: it is 1 if the case is matched and 0 otherwise.
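A fitness function of this kind can be sketched as the fraction of training cases that are matched. The Python sketch below rests on two assumptions that the text does not state: a training case counts as "matched" when its leave-one-out prediction lies within a tolerance `tol` of the recorded value, and the fitness is the matched count divided by q. The helper `nearest_solution` is a hypothetical stand-in for the CBR prediction step.

```python
from typing import Callable, List, Sequence, Tuple

TrainingCase = Tuple[Sequence[float], float]  # (features, recorded solution)


def fitness(weights: Sequence[float],
            training: List[TrainingCase],
            predict: Callable[[Sequence[float], List[TrainingCase],
                               Sequence[float]], float],
            tol: float = 1.0) -> float:
    """Fraction of training cases whose leave-one-out prediction is within tol."""
    matched = 0
    for l, (features, actual) in enumerate(training):
        rest = training[:l] + training[l + 1:]  # leave the l-th case out
        predicted = predict(features, rest, weights)
        if abs(predicted - actual) <= tol:
            matched += 1  # indicator is 1 for this case
    return matched / len(training)


def nearest_solution(features, cases, weights):
    """Hypothetical predictor: solution of the closest case (weighted L1)."""
    def dist(case):
        return sum(w * abs(a - b) for w, a, b in zip(weights, features, case[0]))
    return min(cases, key=dist)[1]


# Invented toy data: two cases agree with their neighbour, two do not.
train = [((100.0, 1.0), 99.0), ((101.0, 1.0), 99.5),
         ((150.0, 2.0), 148.0), ((151.0, 2.0), 151.0)]
score = fitness((0.5, 0.5), train, nearest_solution)
print(score)
```

Passing the predictor as an argument keeps the fitness independent of how retrieval and reuse are implemented.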
According to STA, a solution to an optimization problem is regarded as a state, and the process of updating the current solution is regarded as a state transition [12]. Specifically, the weight vector is considered as a state x. By referring to the state space representation, on the basis of the current state $x_k$, the framework of the state transition algorithm can be outlined as Equation (7):
$$\begin{cases} x_{k+1} = A_k x_k + B_k u_k \\ y_{k+1} = f(x_{k+1}) \end{cases} \tag{7}$$
where $x_k$ stands for a state corresponding to a current solution to the optimization problem; $A_k$ and $B_k$ are state transition matrices with appropriate dimensions, which are usually regarded as transformation operators for the optimization algorithm; $u_k$ is a function of $x_k$ and historical states; and $f(\cdot)$ is the fitness function.
There are four special state transformation operators which are designed to solve the continuous optimization problems:
(1) Rotation transformation:
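$$x_{k+1} = x_k + \alpha \frac{1}{n \|x_k\|_2} R_r x_k \tag{8}$$
where $\alpha$ is a positive constant called the rotation factor; $R_r \in \mathbb{R}^{n \times n}$ is a random matrix with its entries belonging to the range of [-1, 1]; and $\|\cdot\|_2$ is the 2-norm of a vector.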
(2) Translation transformation:
$$x_{k+1} = x_k + \beta R_t \frac{x_k - x_{k-1}}{\|x_k - x_{k-1}\|_2} \tag{9}$$
where $\beta$ is a positive constant called the translation factor; $R_t$ is a random variable with its components in the range of [0,1]. The translation transformation is performed only when a better solution is found.
(3) Expansion transformation:
$$x_{k+1} = x_k + \gamma R_e x_k \tag{10}$$
where $\gamma$ is a positive constant called the expansion factor; $R_e$ is a random diagonal matrix with its entries obeying a Gaussian distribution.
(4) Axesion transformation:
$$x_{k+1} = x_k + \delta R_a x_k \tag{11}$$
where $\delta$ is a positive constant called the axesion factor; $R_a$ is a random diagonal matrix with its elements obeying a Gaussian distribution and only one random index having a nonzero value.
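The four operators can be sketched in plain Python as below. This is an illustrative sketch rather than the authors' code: the function names are hypothetical, the Gaussian entries are assumed to be standard normal, and the guards for a zero vector or a zero step are added here for numerical safety.

```python
import math
import random
from typing import List

Vector = List[float]


def rotation(x: Vector, alpha: float, rng: random.Random) -> Vector:
    """x + alpha * R_r x / (n * ||x||_2), R_r entries uniform in [-1, 1]."""
    n = len(x)
    norm = math.sqrt(sum(v * v for v in x)) or 1.0  # guard: zero vector
    out = []
    for i in range(n):
        row = sum(rng.uniform(-1.0, 1.0) * x[j] for j in range(n))  # (R_r x)_i
        out.append(x[i] + alpha * row / (n * norm))
    return out


def translation(x: Vector, x_prev: Vector, beta: float,
                rng: random.Random) -> Vector:
    """Move along the line from x_prev through x by a random step."""
    diff = [a - b for a, b in zip(x, x_prev)]
    norm = math.sqrt(sum(d * d for d in diff))
    if norm == 0.0:          # guard: no direction to follow
        return list(x)
    r = rng.random()         # scalar R_t drawn from [0, 1)
    return [a + beta * r * d / norm for a, d in zip(x, diff)]


def expansion(x: Vector, gamma: float, rng: random.Random) -> Vector:
    """Scale every component by an independent Gaussian factor."""
    return [v + gamma * rng.gauss(0.0, 1.0) * v for v in x]


def axesion(x: Vector, delta: float, rng: random.Random) -> Vector:
    """Perturb one randomly chosen component only."""
    out = list(x)
    i = rng.randrange(len(x))
    out[i] += delta * rng.gauss(0.0, 1.0) * x[i]
    return out


rng = random.Random(0)
x = [0.2, 0.3, 0.5]
print(rotation(x, 1.0, rng))
print(axesion(x, 1.0, rng))
```

Rotation keeps the new state within distance $\alpha$ of the current one, which is why lessening $\alpha$ narrows the local search, while expansion can reach the whole space and axesion searches along a single coordinate.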
The optimization flowchart of the feature weights using STA is shown in Figure 2. The detailed explanation for each phase of weights allocation based on STA is presented as follows [13] :
Figure 2. Flowchart of allocation of feature weights using STA.
Phase 1 (parameters initialization). Set the values of SE (the search enforcement, which means the number of times each transformation is applied), $\alpha$, $\beta$, $\gamma$, $\delta$ (operation factors), $f_c$ (a constant coefficient used for lessening $\alpha$), and K (number of iterations). Set the iteration counter $k = 0$ and generate the initial solution randomly.
Phase 2 (state transition operation). Perform the state transition operations on the current best state. Each operator is applied in the same way; the procedure is described in detail below, taking the expansion operator as an example:
(a) Make SE copies of the current best state and carry out an expansion operation on each copy by Equation (10); the result is a set of SE candidate states.
(b) Perform the CBR process for the training data set, calculate the fitness value of each candidate state by Equation (5), and rank the states by fitness value in descending order; take the state ranked first as the new candidate best state.
(c) If the new candidate best state is better than the current best state, update the current best state; then make SE copies of it and carry out a translation operation on each copy by Equation (9); the result is a new set of candidate states.
(d) Perform the CBR process for the training data set, calculate the fitness value of each of these candidate states by Equation (5), and rank the states by fitness value in descending order; assign the state ranked first to the current best state.
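Phase 3 (iteration or termination). If $k < K$, then set $k = k + 1$, lessen the rotation factor $\alpha$ using the constant coefficient $f_c$, and go to Phase 2. Otherwise, take the current best state as the optimal weight vector.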
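The three phases can be sketched as a compact search loop. The Python sketch below is not the authors' implementation and makes several labeled assumptions: the operators are applied in the order expansion, rotation, axesion; a candidate replaces the current best state only if its fitness is strictly higher; $\alpha$ is divided by $f_c$ after every iteration; and the objective is a toy function standing in for the CBR fitness. The name `sta_maximize` and all parameter names are hypothetical.

```python
import math
import random
from typing import Callable, List, Tuple

Vector = List[float]


def sta_maximize(fitness: Callable[[Vector], float], x0: Vector,
                 se: int = 30, iters: int = 10, alpha: float = 1.0,
                 beta: float = 1.0, gamma: float = 1.0, delta: float = 1.0,
                 fc: float = 2.0, seed: int = 0) -> Tuple[Vector, float]:
    rng = random.Random(seed)
    n = len(x0)

    def norm(v: Vector) -> float:
        return math.sqrt(sum(a * a for a in v))

    def expand(x: Vector) -> Vector:
        return [v + gamma * rng.gauss(0.0, 1.0) * v for v in x]

    def rotate(x: Vector) -> Vector:
        s = norm(x) or 1.0  # guard: zero vector
        return [x[i] + alpha * sum(rng.uniform(-1.0, 1.0) * x[j]
                                   for j in range(n)) / (n * s)
                for i in range(n)]

    def axes(x: Vector) -> Vector:
        y = list(x)
        i = rng.randrange(n)
        y[i] += delta * rng.gauss(0.0, 1.0) * x[i]
        return y

    def translate(x: Vector, prev: Vector) -> Vector:
        diff = [a - b for a, b in zip(x, prev)]
        d = norm(diff)
        if d == 0.0:
            return list(x)
        r = rng.random()
        return [a + beta * r * c / d for a, c in zip(x, diff)]

    def best_of(states: List[Vector]) -> Tuple[float, Vector]:
        scored = [(fitness(s), s) for s in states]  # one evaluation per state
        return max(scored, key=lambda pair: pair[0])

    # Phase 1: initial state and its fitness.
    best, best_fit = list(x0), fitness(list(x0))
    for _ in range(iters):
        # Phase 2: SE copies per operator, translation only after improvement.
        for op in (expand, rotate, axes):
            top_fit, top = best_of([op(best) for _ in range(se)])
            if top_fit > best_fit:
                prev, best, best_fit = best, top, top_fit
                top_fit, top = best_of([translate(best, prev)
                                        for _ in range(se)])
                if top_fit > best_fit:
                    best, best_fit = top, top_fit
        # Phase 3: lessen the rotation factor before the next iteration.
        alpha /= fc
    return best, best_fit


def toy(w: Vector) -> float:
    """Toy objective with its maximum at (0.7, 0.3); not the CBR fitness."""
    return -((w[0] - 0.7) ** 2 + (w[1] - 0.3) ** 2)


best, fit = sta_maximize(toy, [0.5, 0.5])
print(best, fit)
```

For weight allocation, `fitness` would wrap the CBR evaluation on the training set, and the candidate weights would typically be clipped to be non-negative and normalized before evaluation; that step is omitted here because the text does not specify it.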
2.4. Case Revise and Reuse
The target case and the source cases are rarely exactly the same, so the retrieved cases must be adjusted appropriately to achieve accurate results [14]. Based on the similar cases obtained in the case retrieval process, the solution to the target case is evaluated by Equation (12):
(12)
where m is the number of cases with the maximum similarity.
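One simple way to combine the retrieved cases is to average their solution attributes. The Python sketch below shows a plain mean and a similarity-weighted mean; both are assumptions offered for illustration, since the exact form of Equation (12) is not reproduced here, and the function names and sample values are hypothetical.

```python
from typing import List, Sequence


def reuse_mean(solutions: Sequence[Sequence[float]]) -> List[float]:
    """Average each solution attribute over the retrieved cases."""
    m = len(solutions)
    if m == 0:
        raise ValueError("at least one retrieved case is required")
    return [sum(column) / m for column in zip(*solutions)]


def reuse_weighted(solutions: Sequence[Sequence[float]],
                   similarities: Sequence[float]) -> List[float]:
    """Average each solution attribute, weighting cases by similarity."""
    total = sum(similarities)
    if total <= 0.0:
        raise ValueError("similarities must have a positive sum")
    return [sum(w * v for w, v in zip(similarities, column)) / total
            for column in zip(*solutions)]


# Invented solutions (s1, s2) of two retrieved cases.
retrieved = [(119.0, 120.0), (121.0, 122.0)]
print(reuse_mean(retrieved))
print(reuse_weighted(retrieved, (1.0, 3.0)))
```

The weighted variant lets the more similar case dominate the estimate, which can matter when the retrieved cases differ noticeably in similarity.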
3. Experiments and Results
To verify the validity of the improved CBR proposed in this paper, experiments were conducted on actual production data collected from an aging furnace at Southwest Aluminum Co. Ltd., China. The locations of the work-piece thermocouples and the working room thermocouples are shown in Figure 3. The temperature of the working room is measured by two thermocouples, A1 and A2, which are firmly installed in Area 1 and Area 2 of the working room wall. To obtain temperature information of the work-piece, two thermocouples, Z1 and Z2, are installed in Zone 1 and Zone 2 of the work-piece. Based on the production data from the factory, six feature attributes are chosen directly from the data records: the working room temperature of Area 1 (f1), the working room temperature of Area 2 (f2), the duration of the heating period (f3), the duration of the holding period (f4), the set temperature (f5), and the alloy state (f6). The work-piece temperature of Zone 1 (s1) and the work-piece temperature of Zone 2 (s2) are chosen as solution attributes.
Figure 3. The structure of 12 t aging furnace.
For the experiment, there are 2230 groups of data in total: 103 groups are selected as testing data, and the remaining 2127 groups serve as the case base and as training data for feature weight allocation. The testing data come from 2 batches of different products, as shown in Table 1.
In order to validate the performance of the developed CBR model, three different CBR models were established based on the same case base. The CBR model based on STA proposed in this paper is referred to as STCBR, the traditional method with equal weights as EWCBR, and the method based on the genetic algorithm as GACBR:
(1) The parameter settings of STCBR [13]: the number of weight objects in the initial population is 6, the search enforcement SE is 30, the number of iterations K is 10, the rotation factor $\alpha$ is 1, the translation factor $\beta$ is 1, the expansion factor $\gamma$ is 1, the axesion factor $\delta$ is 1, and the constant coefficient $f_c$ is 2.
(2) In GACBR [7], the number of weight objects in the initial population is 6, the crossover probability is 0.4, and the mutation probability is 0.05. Each weight object is coded using three binary numbers, and the number of iterations is 10.
(3) EWCBR allocates equal weights to all feature attributes.
Table 2 shows the results of feature weight allocation based on EWCBR, GACBR and STCBR, respectively. As can be seen from Table 2, all the feature weights are set equal in the EWCBR model, which ignores the different influences of the feature attributes on the reasoning results. For GACBR and STCBR, the weight allocation is more reasonable and credible because it is based on information in the data rather than on the experience of experts. For the GACBR model, the weights of the working room temperature of Area 2 and the duration of the rising period are 0.2917 and 0.2083, which are much larger than those of the other features. This indicates that the working room temperature of Area 2 and the duration of the rising period play more important roles in predicting the work-piece temperature for the GACBR model. For the STCBR model, however, the alloy state and the working room temperature of Area 1 are the features most strongly correlated with the work-piece temperature.
Table 1. The parameters of different aluminum alloy produced in 12 t aging furnace.
Table 2. The feature weights of STCBR, GACBR and EWCBR.
Figure 4 depicts the temperature profiles of different aluminum work-pieces; Figure 4(a) shows the measured and predicted temperatures of the 7075-T6 aluminum alloy, and Figure 4(b) shows those of the 7050-T6 aluminum alloy. Compared with the temperature profiles calculated by EWCBR, the profiles calculated by both GACBR and STCBR agree better with the measured temperature curve. In order to present the prediction errors of GACBR and STCBR in a more intuitive way, Figure 5 shows the degree of deviation between the predicted and measured temperatures. The closer the points are to the 45° line, on which the predicted value equals the measured value, the higher the method's precision is. From Figure 5, it can be observed that, when the temperature is lower than 100˚C, several temperatures calculated by GACBR deviate considerably from the measurements. In general, the prediction performance of STCBR is better than that of GACBR.
Figure 4. Prediction results of aluminum work-pieces temperature: (a) 7075-T6; (b) 7050-T6.
Figure 5. Comparisons of the computational accuracy of GACBR and STCBR: (a) 7075-T6; (b) 7050-T6.
The absolute errors of the different prediction models are shown in Figure 6. For both 7075-T6 and 7050-T6, in the rising period, the upper and lower limits of the absolute error of STCBR are slightly better than those of GACBR and EWCBR. In the holding period, where the required prediction accuracy is ±1˚C, STCBR is the only model whose prediction accuracy meets the requirement for both the 7075-T6 and 7050-T6 aluminum alloys.
To analyze the errors numerically, two indices, Mean-SE (mean absolute error) and Max-SE (maximum absolute error), are used to evaluate the performance of each model, as shown in Table 3. For both the 7075-T6 and 7050-T6 aluminum alloys, EWCBR has the largest errors among the three CBR models, because it ignores the different influences of the feature attributes on the reasoning results, which decreases the prediction accuracy of the CBR system. The errors of GACBR and STCBR are much lower than those of EWCBR, because both models take these different influences into account. STCBR has the smallest mean absolute error and the smallest maximum absolute error. The remaining prediction errors of STCBR may result from environmental disturbances or an insufficient number of cases.
Figure 6. Prediction errors of aluminum work-pieces temperature: (a) 7075-T6; (b) 7050-T6.
Table 3. Prediction errors of different CBR models.
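The two error indices can be computed as in the following Python sketch. The function names and the sample temperatures are hypothetical and are not values from Table 3; the final check against ±1˚C simply restates the holding-period requirement.

```python
from typing import Sequence


def mean_abs_error(predicted: Sequence[float],
                   measured: Sequence[float]) -> float:
    """Mean of |predicted - measured| over all samples (Mean-SE)."""
    return sum(abs(p - m) for p, m in zip(predicted, measured)) / len(measured)


def max_abs_error(predicted: Sequence[float],
                  measured: Sequence[float]) -> float:
    """Largest |predicted - measured| over all samples (Max-SE)."""
    return max(abs(p - m) for p, m in zip(predicted, measured))


# Invented holding-period temperatures in degrees Celsius.
predicted = [119.5, 120.5, 121.0]
measured = [120.0, 120.0, 120.0]

mean_se = mean_abs_error(predicted, measured)
max_se = max_abs_error(predicted, measured)
within_tolerance = max_se <= 1.0  # holding-period requirement of +/-1 degree

print(mean_se, max_se, within_tolerance)
```

Reporting the maximum alongside the mean matters here because the ±1˚C requirement constrains every sample, not only the average.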
Table 4 shows the advantages and disadvantages of the CBR models used in this paper. The results indicate that the prediction model based on STCBR not only has higher prediction accuracy but also remains robust when tested on different aluminum alloys.
4. Conclusions
Because online prediction of the work-piece temperature plays a significant role in aging furnaces, an improved CBR method was introduced to establish the temperature prediction model. To improve the problem-solving performance of the CBR model, STA, a novel optimization approach that combines global search with local search to avoid local minima, is used to optimize the feature weights. The established model was then verified on practical production data from the AF. The experimental results show the advantages of the STCBR model, which effectively improves on the prediction accuracy of traditional CBR and has practical value for application to aging furnaces in industry.
Table 4. Advantages and disadvantages of different CBR models.
Future research may be focused on enriching the existing STA models and improving the case revise strategy to realize better prediction performance.
Acknowledgements
This research was supported by the National Natural Science Foundation of China (61174132) and the Doctoral Fund of the Ministry of Education of China (20130162110067). These financial contributions are gratefully acknowledged.