Shape Retrieval Using Fourier Descriptors Applied to Industrial Process ()
1. Introduction
The principal problems on a smelting furnace in metalworking are caused by high temperatures and the gases produced by the slag. For the industry, the casting process is important because the product quality depends on it [1], if the process is slow. Then the molten steel will cool and if it is a quick casting then it will provoke turbulences, its effect produces corrosion and erosion in the refractory [2]. It exists several methods for detecting slags. There are several methods to detect and segment slags, in [3] the authors report a method to segment slag during the steelmaking process, in which they propose obtaining X-ray tomography (HRXCT) of the steel and by applying the Otsu model obtaining slag segmentation, however, the limited resolution of HRXCT [4] and the sensitivity of the Otsu method to variations in brightness [5] are the main disadvantages of this method. In the investigation reported in [6] and [7], a CCD camera is used to capture images of the process and slag is manually segmented by selecting a region of interest, this being the main limitation for its implementation in production due to the speeds that the process demands. Other developments are based on the use of a camera for thermography, taking advantage of the emissivity of objects to segment and detect slag [8], however, research using this method focuses on temperature measurement [9], measurement of slag [10], estimation of radioactive parameters in slag [11] or the determination of slag properties.
In addition, private software has been developed for the segmentation and detection of slag based on infrared images [12]. However, in the literature, an automatic method for slag segmentation has not been reported, which allows other vision algorithms to develop a more robust method, for the detection of the slag and measurement of the level of liquid metal, by which they can avoid spillage, mold damage and improve product quality. The motivation of this article stems from this need, which seeks to associate the light changes of the slag in terms of texture, to find a representative form of the slag and to spatially locate the liquid metal in the refractory. To carry out this proposal, this document is organized as follows: Section I describes the introduction, Section II describes the mathematical concepts required for the preprocessing, segmentation, features extraction and the shape retrieval. In Section III, the methodology is presented. The experiments and results are obtained by this method are shown in Section IV. Finally, the conclusions of the methodology are exposed in Section V.
2. Foundations
According to [13], the trichromatic theory, colors are separated into three parameters. They are characterized according to the color space used. Although there are several color spaces, in which, the model RGB (Red, Green and Blue components) is the most used because it does not require transformation for its display [14]. However, other alternatives like the representative model HSV (Hue, Saturation and Value components), may be provided as well for more information on its components. This model separates the image intensity, from chroma or the color information [15]. The transformation from RGB to HSV is defined by [16] like follows
(1)
(2)
(3)
where R represents the intensity value of red channel, G represents the intensity of green channel and B represents the blue channel of the RGB color model.
2.1. Fourier Transform
The Fourier Transform is applied in a lot of vision systems for transformation, image analysis, filtering, reconstruction, compression, smoothing, and sharpening images, among others [17]. The aim of this preprocessing is the noise attenuation, the abrupt intensities as texture can be distinguished. Fourier theorem states that a function can be represented like a summation of a series of sine and cosine terms as in (4).
(4)
where F is the Fourier Transform of a signal x in time domain, f is the frequency of the signal and i is a complex term.
These terms sinusoids show the image behavior in high and low frequencies but to facilitate the image analysis, it was used the Discrete Fourier transform (DFT) (5) and Inverse Discrete Fourier Transform (IDFT) (6).
(5)
where f is the distribution of grayscale image, N is the number of sample of images, ki and lj are samples of discrete space of f and X is the DFT.
(6)
where X is frequency spectrum, N is the number of sample of images, ka and lb are samples of discrete space of X and f is the IDFT.
2.2. Segmentation
Segmentation is an important part of the image processing. It allows extracting a region of interest and reduces the analysis of time. Its objective is to convert an image from grayscale to a binary one, there are several methods that according to their application can be used. These methods are classified in discontinuity that they are based on intensity changes, and they are similarly partitioned into regions with certain criteria of similarity [18]. This investigation focuses on segmentation by histogram thresholding. In [19] the following steps are defined for this method:
1) Estimate an initial threshold to segment the image for creating two sets: object and background.
2) Calculate the average value of each set.
3) Obtain a new threshold from the average values calculated:
(7)
where, µ1 and µ2 represent the average value of each set and T is the threshold calculated.
4) Repeat previous steps using the new threshold calculated until convergence has been reached.
2.3. Convex Hull Method
The convex hull method is used in several applications like interferometric measurements [20], object tracking [21], disaster forecasting [22], gesture recognition [23], rendering model [24], in others. The objective of applying this method in this stage is to generate a unique binary object. For that purpose, a quick hull is going to be used that is then based on a divided and conquered approach which is similar to a quicksort [25]. The steps to apply are shown below.
1) Determinate the points P1 and P2 with minimum and maximum coordinates.
2) Draw the line between P1 and P2 to divide the set into two subsets points.
3) Determine a point (P3) of one of the subsets with maximum distance in respect from the segment P1 and P2.
4) Draw new segments from P3 points to P2 and P1 points.
5) Eliminate the points inside of the triangle formed by P1, P2, and P3.
6) Repeat the above steps using the new segments formed by P1 and P3, P1 and P2 until no points are left.
3. Methodology
In the first step, the image is acquired by a thermographic camera in which a high rainbow palette was used, and the image was transformed from RGB color space to HSV color space (see Figure 1). In which, the gray intensities tend to be clustered in the lower and upper classes, which are translated into low and high temperatures around of 200˚C and 1500˚C respectively. The problem to use the red, and blue components of RGB model is that they reflect high temperatures in the inner and external walls of refractory or combine the liquid metal and the inner wall with minimal values of intensity.
In case of the green component, the difference between the furnace wall and the liquid metal can be seen, but when it is observed in the histogram of Figure 2. It shows minimal values in high temperatures, in comparison with the Hue
![]()
Figure 1. Channel decomposition of the RGB and HSV color spaces. Source: Own elaboration.
![]()
Figure 2. Distribution of frequencies in: (a) Green component, (b) the Hue component. Source: Own elaboration.
component of HSV. For this reason, HSV model is used, also this step is going to allow a simpler process segmentation.
In the next step, the image is processed with the Discrete Fourier Transform (DFT), in which a low-pass filter is applied to intensify the gray levels and the Inverse Discrete Fourier Transform (IDFT). This technique helps to visualize the texture of slag of the other textures, also, it helps to select an adequate threshold to segment the image (see Figure 3).
As the next step, the histogram is obtained to establish a threshold and binarize the image. The threshold selection using histogram allows to segment the slag from the liquid metal. After that, the binary large objects (blobs) are removed using mathematical morphology (MM) in this case the erosion. The resulting objects are features from slag that allow retrieval of the shape of the liquid metal. Now, these features will be processed to generate a unique binary object, in this step the convex hull is applied, although the blob got does belong to the partial area of liquid metal.
4. Experiments and Results
For validating this methodology, images were captured by a FLIR A320 thermographic camera with a resolution of 24-bits of 640 pixels × 480 pixels, from which, 180 images out of 25 casting operations were obtained.
Figure 4 shows the results after applying the methodology of this proposal, as can be seen, the recovery of the shape is carried out in the area where the hopper does not interfere with the visualization of the slag, in some cases, it is almost the complete shape of the liquid metal section has been recovered in the area of vision. Nevertheless, in others, the algorithm fails, this is mainly due to two things: 1) the first is because when slag falls on the metal, this causes gas to escape, preventing the camera from having visibility of the area under observation; 2) the size of the structuring element when small objects are removed is very
![]()
Figure 3. Distribution of frequencies after that DFT is applied. Source: Own elaboration.
![]()
Figure 4. Shape retrieval. Images on column a show features from DFT, images on column b are created after convex hull algorithm is applied and images on column c show the original images. Source: Own elaboration.
large, so the size window should be adjusted.
5. Conclusions
The motivation of this paper is due to according to literature, an automated system for slag detection using vision algorithms is not reported yet. In this research, the analysis of the topology of the liquid metal surface is presented, which was characterized through the Discrete Fourier Transform, associating changes of intensities to the frequency domain and obtaining the main features of these frequencies, these features were used to represent molten metal, building its shape by using the convex envelope. Advantages of this proposal are that through the discrete Fourier transform, the molten steel in the furnace has been spatially located, so future work can focus on:
1) Detect the percentage of slag in the refractory.
2) Measure the level of the liquid metal to avoid spillage and mold damage.
3) Develop algorithms to solve camera vision loss problems due to the gas generated by the slag.
Nevertheless, the proposal has limitations, the main is that there are a few features obtained by the Fourier descriptor, which limits the segmentation and complete shape retrieval of the slag in the metal, so it is recommended to use another texture analysis method that is sensitive to subtle changes in lighting to detect slag, but that can omit the edges of the refractory and hope to build a better representation. In addition, for future research, you can think of an algorithm that knowing the location of the slag and supported by holography and calibration of the camera, the level of molten metal can be measured. Another observation for future work is that the slag percentage can be detected, which is done at the moment when the metal begins to fall into the refractory, by means of optical flow analysis.
Acknowledgements
This work has been partially carried out thanks to the support of the scholarship granted by Consejo Nacional de Ciencia y Tecnología (CONACyT).