Soft Image Segmentation Based on the Mixture of Gaussians and the Phase-Transition Theory

In this paper, we propose a new soft multi-phase segmentation model where it is assumed that the pixel intensities are distributed as a Gaussian mixture. The model is formulated as a minimization problem through the use of the maximum likelihood estimator and phase-transition theory. The mixture coefficients, which are estimated using a spatially varying mean and variance procedure, are used for image segmentation. The experimental results indicate the effectiveness of the method.


Introduction
Image segmentation is one of the most extensively studied problems in image processing and computer vision. Many different approaches have been proposed for the partitioning of images based on a variety of criteria including brightness (intensity), color, or texture. In general the partitioning of an image or detection of edges is under the assumption that an image consists of several patterns, and each point on the image domain belongs exclusively to only one pattern. Finding boundaries separating the different patterns in this sense is called a hard segmentation. Different to hard segmentation, soft segmentation assumes that each point may belong to more than one pattern. The goal of soft segmentation is to find all the probabilities that each pixel can belong to each pattern. This probability is also called membership (or ownership) in the literature.
One of the most extensively studied approaches for hard segmentation is the variational method. Many effective variational models have been developed, for instance, the Mumford-Shah model [1], geodesic active contour [2], geodesic active region [3], and region competition [4]. Level set technique [5] has been proven to be powerful in the implementation of variational models. In two-phase segmentation the composition of the Heaviside function with the level set function is used to represent the regions of the object and background. In [6] [7], the authors extended the level set method to multiphase segmentation by using multiple level set functions, while in [8] [9], the authors proposed another means to extend the level set method by using multiple layers for each level set function. Through the act of carefully choosing the initial values, these methods can work very well. However, the non-convexity of the energy functional in the level set formulation is an inherent drawback of the level set method. As a result, many level set based variational segmentation models are sensitive to initial values and may converge to an undesirable local minimum. This problem is more difficult to deal with for multiphase segmentation.
To overcome the non-convexity problem mentioned above, one approach is to replace the composition of the heaviside function with the level set function in level set formulation by use of a weight/membership function. Through the implementation of this modification the energy is convex with respect to the membership function. For example, Chan et al. [10] and Bresson et al. [11] stated certain non-convex minimization problems for image segmentation and denosing as equivalent convex minimization problems by using membership functions to replace characteristic functions. These new models allow for the finding of global minimizers via standard convex minimization schemes. In particular in [11] efficient and fast numerical schemes to globally minimize the variational segmentation models were proposed. These algorithms are based on a dual formulation of the TV norm proposed and developed in [12]- [16].
Soft segmentation is also motivated by its applications to real world problems. In medical imaging, due to limited spacial resolution of the equipment, not all the voxels in a segmented region contain the same tissue type, especially near the boundary of two subregions. A typical example is the partial volume effect in MRI brain image segmentation. Instead of labeling each image voxel with a unique tissue type [17], partial volume segmentation aims at estimating the percentage of each voxel belonging to each tissue, which can be viewed as the probability of the voxel belonging to the tissue. Since soft segmentation allows each pixel to belong to several patterns with certain probabilities, it provides a more flexible mechanism, and thereby keeps more options available for post-processing steps.
There have been many soft segmentation methods. Mory and Ardon extended the original region competition model [4] to a fuzzy region competition method [18] [19]. The technique generalizes some existing supervised and unsupervised region-based model. The proposed functional is convex, which guarantees the global solution in the supervised case. Unfortunately, this method only applies to two-phase segmentation and is difficult to extend to multiphase segmentation. The fuzzy C-mean (FCM) [17] [20] [21] is a method developed for pattern classification and recognition. Hence it is also applicable to image segmentation. The standard FCM model into M clusters by the following objective function [22] [23] where ik u is the membership value of datum i x for class k with where m is the bias field and ( ) G m is the regularization term for the bias field m . AFCM is more robust to noise than the standard FCM. The soft segmentation model developed in [17] used a different similarity measure to that in [21]. Their objective functional reads as k N stands for the set of neighbors falling into a window around k x , and k N is its cardinality. The parameter α in the second term controls the effect of the penalty. Another class of soft segmentation is based on stochastic approaches. It considers that pixel intensities are independent samples from one or several distributions. The likelihood functions have been widely used in soft segmentation. In [24] the maximum-likelihood (ML) is used to find the optimal parameters in the joint pdf such that the likelihood function is maximized. An expectation-maximization (EM) algorithm is used to solve the problem when we are dealing with incomplete data. However, simply using likelihood to model an image is not enough since it ignored the prior knowledge of an image. In [25] the authors proposed an adaptive segmentation method that uses the knowledge of tissue properties and intensity inhomogeneities to correct and segment MR images. The EM algorithm was used to iteratively estimate the posterior tissue class probabilities when the bias field is known, and having a maximum a posteriori principle (MAP) estimator of the bias field, when tissue class probabilities are known.
In [20], a segmentation framework based on the MAP principle was proposed for partial volume (PV) segmentation of magnetic resonance brain images. A mixture of the probability density functions is considered to address the PV effect. A Markov Random Field (MRF) model is used to define the prior distribution of the mixture coefficient field imposing a smoothness on the mixture coefficients (ownerships). The fuzzy c-means model is extended to define the likelihood function of the observed image.
The phase transition theory in material sciences and fluid mechanics have inspired people to borrow ideas from contemporary material sciences, e.g., the diffuse interface model of Cahn-Hilliard [26], and its rigorous mathematical analysis in the framework of Γ -convergence approximation by Modica and Mortola [27] [28] into image segmentation. The phase field relaxation consists of approximating the perimeter of the interface using a Cahn-Hilliard type penalization functional [26], with the form is a scalar function with exactly two minimizers at 0 and 1 satisfying ( ) ( ) The second term of the penalty functional ensures that the values of the material density υ converges to 0 or 1 as 0 →  , while the first term controls the perimeter. The parameter  can be interpreted as the width of the diffused edge representation in υ . The phase field approach has been used in topological optimization problems [29]- [31]. In [32], the authors used the phase field to approximate sharp edges and a variational phase field model is derived to compute a shape average of a given number of shapes. In [30], the authors used the phase transition theory in a Cahn-Hilliard impainting model. The authors in [33] and [34] presented a models for image segmentation based upon the phase transition theory of Modica and Mortola and discussed theirs connections to the Mumford-Shah segmentation model and some related works.
In paper [35], Under the assumption that all random variables are independent, we have the following joint pdf The regularization is made using a double well potential borrowed from the phase-transition theory. By assuming that all patterns are Gaussian distributions with mean fields i u ( ) , and a fixed variance 2 σ , the pdf of the mixed Gaussian is given by defines the Gaussian probability density function, The model is to solve the following minimization problem: In this paper, we propose a new multiphase soft segmentation model that integrates phase-transition theory into a mixture of Gaussian model for image intensities. The proposed model is an extension of the paper [35].
The difference between this work and [35] lies in the facts: i) the data fidelity term in the proposed model is a Gaussian mixture model, while the model in [35] is only an approximation of Gaussian mixture model due to simplification. Although this simplification facilitates the numerical computation it does not reflect upon the real behavior of the data; ii) paper [35] assumed that all the variances of different phases are the same and fixed, while in the proposed model each phase could have a different variance which will also be optimized, making the model more flexible and more robust. This paper is organized as follows: Section 2 addresses the proposed model development. Section 3 presents the implementation details and experimental results. Finally, the conclusion is given in Section 4.

Proposed Model
In this section, we develop a soft multiphase segmentation model under the assumption that the intensity of the image is distributed as a mixture of Gaussians.
We assume the intensity ( )   (13) assuming that the mixture patterns U and the mixture rules P are independent. Since I is given, For energies ( ) E U , we use the general variational form, and assume that all pattern channels are independent to each other. For functions whose gradients are square integrable, we may consider: [ ] ( ) ( ) where 1   . Since 1   , the second term will force i p approximate either 1 or 0. The first term is the regularity condition on each i p . The advantage of this expression is that it contains the boundary information. By Γ -convergence theory, the term converges to the length of the boundary in the sense of Γ -convergence [37] [38] as  goes to zero. Now, in combination of (15), (16), (18) and (19), the final proposed segmentation model is the minimization of: where λ and α are weights balancing the effects of the three terms.

Implementation and Experimental Results
Since the energy functional contains three group parameters, in order to minimize the energy, we use the alternating iteration scheme: where V is defined as ; Considering that where p dt and dt σ are the step sizes for Equations (27) and (28), respectively, and u L , p L and L σ are given by: To start the iteration process, we need to choose the initial values for the ownership functions i p , the patterns i u and the standard deviations i σ . We also need to choose suitable parameters α and λ . The adopted procedures is: given In Figure 1, a comparison was made between Shen's model and the proposed model using synthetic images. Figure 1(a) is the original image, which is a piecewise constant image added with constant Gaussian noise. Figure 1(b) and Figure 1(c) are the reconstructed images using the proposed model after 10 iterations and 50 iterations resp., while Figure 1(d) to Figure 1(f) are the images reconstructed using Shen's model after 50 iterations, 100 iterations and 500 iterations, respectively. We see that the result was improved using the proposed model with few iterations. However, when Shen's model is used very little differences is perceived even when the number of iterations are increased.
In Figure 2, we present a comparison between variances updated and not updated. The original image Figure  2(a) is a piecewise constant image added with different Gaussian noise for different phases.  Figure 3 shows the bias correction in the proposed model when bias is evident prior to correction in an image. Figure 3(a) is the original image. This image was firstly used by X. Bresson and T.F. Chan in their non-local Chan-Vese model [6]. It is clear that the object (disk) in Figure 3(a) is biased. Figure 3 In the following experiments, we tested our model on real images. In Figure 4, we carried out the experiment on the MRI brain image.

Conclusion
In this paper, we extended the idea in paper [35] and developed a soft multiphase segmentation model. The model is a pure Gaussian mixture model. It allows for the choosing of different means and different variances, which leads to a more flexible model. The experiments show that the model is more robust to noise compared with the previous model. Moreover, with the experiment on MRI brain image, we see the advantage of soft segmentation where we can find and calculate partial volume which is very important for brain image segmentation.