Multiple Targets Recognition for Highly-Compressed Color Images in a Joint Transform Correlator

In this paper, we are proposing a compression-based multiple color target detection for practical near real-time optical pattern recognition applications. By reducing the size of the color images to its utmost compression, the speed and the storage of the system are greatly increased. We have used the power-ful Fringe-adjusted joint transform correlation technique to successfully detect compression-based multiple targets in colored images. The colored image is decomposed into three fundamental color components images (Red, Green, Blue) and they are separately processed by three-channel correlators. The outputs of the three channels are then combined into a single correlation output. To eliminate the false alarms and zero-order terms due to multiple desired and undesired targets in a scene, we have used the reference shifted phase-encoded and the reference phase-encoded techniques. The performance of the proposed compression-based technique is assessed through many computer simulation tests for images polluted by strong additive Gaussian and Salt & Pepper noises as well as reference occluded images. The robustness of the scheme is demonstrated for severely compressed images (up to 94% ratio), strong noise densities (up to 0.5), and large reference occlusion images (up to 75%).

as excellent potential architecture for target recognition and tracking applications as well as other optical information processing applications such as in optical cryptosystems [1]- [20]. Over the years, since its introduction in the sixties of the last century, the optical joint transform correlator (JTC) had gained growing interest as a near real-time optical processor over the classical optical Lugt correlator [21] [22], which suffers from poor light efficiency, large correlation sidelobes and large autocorrelation width in addition, to the need of precise alignment of the optical elements and the fabrication of complex-valued filters. Interests in real-time JTC-based applications have grown with the fast development of electrically addressed spatial light modulators (EASLM). The discrimination capability of target detection using optical JTC was greatly improved by proposing various forms of JTC techniques [5] [6] [7] [8] [9]. Among them, the reported fringe-adjusted JTC (FJTC) technique has been proven to greatly enhance the JTC correlation peaks [3]. The JTC faces false alarm detection when the input scene contains many identical targets as well as many identical nontargets objects. To alleviate these problems, several methods were reported such as Fourier plane subtraction, shifted phase-encoded, and random phase-mask.
Usually, target images captured by a CCD camera as well as reference images are uploaded to the EASLM at the input of the JTC architecture for target recognition processing. A major factor that affects the speed of processing depends on the sizes of the images for uploading. Further, variations in target images regarding rotation and scaling necessitate the storage of a large number of reference images for successful target detection. Thus, a large storage capacity is needed. Furthermore, the limited size and pixel resolution of the SLM put more constraints on the useful size of images for practical optical processing. Moreover, the constraints on the image sizes and the growing storage capacity to store the reference images appear more profound when dealing with high-resolution and colored images. Consequently, researchers and developers seek for techniques to compress the image sizes for practical image transmission, automated optical target recognition, and/or encryption applications. In this regard, JPEG (Joint Photographic Experts Group) compression algorithm has been proven to be one of the efficient techniques to compress images and it is widely used in the transmission and storage of images [23] [24] [25] [26] [27].
In this paper, we are proposing a compression-based JTC that detects multiple color targets where we have used largely compressed targets and/or reference images. We will prevent the usual false alarms correlation peaks from the output plane (a common issue when dealing with multiple targets detection) by using fringe-adjusted filter and reference phase-shifted and reference phase-encoded schemes. The proposed system detects multiple targets and references compressed up to a ratio of 94%. Many simulation experiments (with added Gaussian, Salt & Peppers noises as well as occluded reference images) are carried out to demonstrate the robustness, discrimination, and detection capability of the proposed scheme. In Section 2, we provide the theoretical discussion of the var-ious JTC schemes. Section 3 presents the computer simulation experiments on the compressed colored images. Section 4 is a short conclusion of this work.

The Joint Transform Correlator
The classical JTC is widely used in various imaging systems for accurate reconstruction of the optical field and Fourier plane filtering. Near real-time optical pattern recognition can be implemented by using JTC since it does not need a match filter. One possible JTC architecture is shown in Figure 1. In this figure, an input scene ( ) 0 s y y − , that is captured by a CCD camera or may be stored in a computer, is captured and is displayed at the spatial light modulator (SLM) side-by-side with a reference image ( ) 0 r y y + , which is stored in a computer system. Both images can be easily updated in real time. The lens performs the Fourier transform (FT) of the joint image and its intensity (called joint power spectrum (JPS)) is captured by a CCD camera and sent to the computer for processing. The processed JPS is loaded again into the SLM and it is Fourier transformed by the lens and its intensity is recorded by the CCD camera to produce the correlation output. For color images, both the reference and the scene images are separated into the three basic color components (Red, Green, and Blue). Then, we process the individual color components images either sequentially or in three separate correlators (three channels). The final correlation output is the combination of these three output correlators. In the following discussion, for convenient and simplicity, we will adopt one dimensional presentation and present the mathematical expressions for one channel. The input joint image can be expressed as:    At the focal plane of the lens, the CCD camera records the intensity of Equation (2): The first term in Equation (3a) and the first three terms in Equation (3b) represent strong zero-order DC terms. The fourth and the fifth terms demonstrate the auto-correlations between the identical targets and the identical nontargets while the sixth term is the cross-correlation between the targets and the other objects. The seventh and the eighth terms are the cross-correlations between the noise and the targets and the other objects. All these no useful correlation terms are within the input scene and they greatly degrade the detection capability of the JTC. By taking the inverse FT of Equation (3) and recording its intensity, we produce the correlation output. A typical correlation output of a classical JTC is shown in Figure 2 where the desired auto correlation peaks of the targets coexist with many other cross-correlation peaks as well as a wide and strong peak (DC value) at the center of the output, as discussed in Equation (3b). In addition, the classical JTC suffers from large correlation lobes, large correlation peak width, and low optical efficiency.
There are several techniques to enhance the targets detectability in the JTC, especially when multiple targets or identical non-target objects are present in an input scene, which cause the false alarms. Fourier plane subtraction [8] is one method that subtracts both the input-scene power spectrum ( ( )  modified JPS is expressed as: Therefore, the Fourier plane subtraction technique gets rid of all the terms in Equation (3b) as well as the ( ) 2 R v DC term. Now, Equation (4) contains only correlation terms between the reference and the objects in the input scene and the conjugates of these terms. It is worth mentioning that the subtraction scheme is a three-step process. As an alternative to this scheme, a two-step process called the reference shifted phase-encoding technique provides the same results. Hence, this technique has less processing steps compared to the previous subtraction technique, which eventually enhances the system processing speed. The first step is to display input joint image of Equation (1) at the SLM and capture its JPS, which is the same as Equation (3). The second step is to 180˚ phase-shift the reference image, combine it with the input scene, take FT and capture its JPS: Now, digital subtraction of Equation (3) and Equation (7) yields the modified joint power spectrum: Equation (8) is basically the same as Equation (4). Therefore, the modified JPS provides large improvement for target detection. However, missing the targets at the correlation output may still occur when some objects are brighter than others in the input scene [9]. To alleviate this problem, a real valued fringe-adjusted filter (FAF) is employed to get a better correlation output for both single and multiple targets detection. The FAF is expressed as: where ( ) to be a very small value such that . Now, the FAF multiplies the modified JPS and the product is displayed at the SLM plane to generate the correlation output.
On the other hand, since its introduction more than two decades ago, the ref- , which has randomly distributed phase from −π to π, and then inverse FT to produce: where "*" denotes the convolution operation. Now, the input joint images are written as: The joint power spectra for these joint images are: The modified JPS is: To obtain the correlation output, Equation (16)

Compressed Multiple Colored Target Detection
In this section, based on the previous discussions, we propose multiple targets recognition for color images when the targets and/or the reference images are greatly compressed and experienced under severe noise and occlusion conditions. Handling compressed images would make the JTC processor practically closer to near real-time processing and reduce the storage requirements as discussed in the introduction of this paper. Colored images, especially high-resolution ones, take large space (bytes) for storage and more time to upload them into the SLM at the input plane of the JTC processor. Developers seek compression techniques to facilitate transmitting and storing colored images. JPEG compression scheme, which uses Wavelet transform in its compression algorithm, is very efficient unlike other compression schemes [11] [12] [13] since it compresses the image with less storage space while keeping more details of it. Thus, we will adopt this compressing scheme to compress the colored reference images in our proposed JTC-based automatic target recognition.
A three-channel (Red, Green, Blue) correlator is used to deal with colored images, where each channel will process separately one fundamental color component. Hence, all the equations in the previous section would represent the mathematics for one out of three fundamental color components. The output correlations of the three channels (see Equation (17)) are combined to generate the final correlation output to detect the colored targets. The input joint image is divided into two halves: the left half contains the input scene while the right half has the reference target. An illustrative example of the detection capability of the proposed scheme for colored images is presented in Figure 3. Figure 3(a) presents an input scene, prepared to have multiple identical targets images, multiple identical non targets images, and other images with various colors. The size of all colored images is 32 × 32 pixels and have JPEG format. Figures 3(b)-(d) show the correlation peaks when the target is the orange, red apple, and yellow pear, respectively. Next, we present simulation results for detecting colored targets that are exposed to severe compression and strong noise (Gaussian and Salt & Peppers) conditions. In addition, we tested our proposed detection scheme for references occluded up to 75%. Finally, we repeated the same experiment for compressing high-resolution color images. Note that Figure 3   shown in Figure 4(b). Now, the correlation results of the cases {compressed target, uncompressed reference} and {compressed target, compressed reference} are shown in Figure 5(a) and Figure 5(b), respectively. The next simulation results will experiment the most challenging case of compression of both target and reference images, which are subjected to severe noise conditions. Figure 6 shows that the proposed JTC scheme successfully recognizes 94% compressed targets added to a random Gaussian noise with a maximum noise density of 0.2. Note that the random noise expectedly produces unequal correlation peaks for the targets. Further, one of the targets is at edge of not being recognized with low correlation peak value (Figure 6(a)). To support greater noise density, one must decrease the compression ratio for the color images as illustrated in Figure 6(b) and Figure 6(c) where the compression ratio is decreased to 90% and 80%, respectively. Table 1 lists the maximum correlation peaks of the targets for different Gaussian noise densities and different compression ratios. Note that the correlation peak values are degraded significantly as the density of the noise increases. This implies that a careful threshold value might be needed to pick up the targets' peaks and avoid false alarms. This experiment is repeated for added Salt &  Peppers noise. The results are shown in Figure 7 and in Table 1. It is worth mentioning that the results in Table 1 correspond to the reference (ORANGE). The correlation peak values will be different for different references. Optics and Photonics Journal  In many situations it is needed to recognize a region or a part of a target image. Consequently, the proposed JTC scheme is tested for 25%, 50%, and 75% occluded reference with noise-free 94% compressed images as shown in Figure  8. Further, 90% compressed noisy images correlated with 50% occluded reference image are displayed in Figure 9. As a matter of fact, when images are  occluded, the remaining pixels in the image are easily affected by the noise density. This is demonstrated by comparing the correlation output of the 50% occluded reference with Gaussian noise density = 0.1 of Figure 9(a) to the correlation output of Figure 6(b) with Gaussian noise density = 0.5. Likewise, is the case for the Salt & Peppers noise density of Figure 9(b) and Figure 7(b), where the noise density decreases from 0.4 to 0.3.
It is well known that JPEG compression algorithm discards pixels that are not important in human eye perception such as small color variations and/or high-frequency components in color images. In this regard, we tested our proposed color multiple targets recognition scheme for higher resolution images such as the 128 × 128 pixels color images shown in Figure 10. The targets detection capability is excellent for noise-free images that are significantly compressed to 94% ratio as illustrated in Figure 10(a). However, this detection capability is Optics and Photonics Journal   greatly degraded (and failed) once a small amount of noise (Gaussian density = 0.005 and Salt & Peppers density = 0.003) is added to the target images as demonstrated in Figure 10(b) and Figure 10(c).
In comparison, the low-resolution 32 × 32 pixels images in Figure 6(a) and Figure 7(a) afforded Gaussian and Salt & Peppers noise densities of 0.2 and 0.3, respectively. The multiple targets recognition can be significantly improved by slightly decreasing the compression ratio of images. For instance, in Figure  11(a) and Figure 11(b), we used 90% compressed images instead of 94% ones. This has resulted in increasing the noise capability of the correlator to handle Gaussian noise density increase from 0.005 to 0.1 (20 folds improvement) while the Salt & Peppers noise density changes from 0.003 to 0.125 (41 folds improvement). In addition, Figure 11(c) and Figure 11(d) show more improvement in handling severe noise densities (Gaussian and Salt & Peppers densities = 0.4) when the image compression ratio decreases to 80%.
Furthermore, the simulations show that the performance of the proposed scheme for occluded high-resolution images is excellent for noise-free and up to 94% compressed color images (see Figure 12). Again, in order to support large amount of noises, the compression ratio must be lowered. An illustrative example is shown in Figure 13.

Conclusion
In this paper, we have demonstrated a compression-based FJTC target detection for colored images. First, we have demonstrated the detection capability of the proposed scheme for the same target with different colors. Then, we have used the reference phase-shifted technique to eliminate false alarms and zero-order terms due to multiple desired and undesired multiple cross-correlation peaks which appeared at the output correlation plane. Also, we employed the random-phase mask method to avoid displaying the usual second pair of correlation peaks at the output plane of the JTC architecture. The proposed JTC scheme was tested through a large number of simulations for low-resolution as well as high-resolution colored images. Both types of images were subjected to severe compression (up to a ratio of 94%) and strong densities of Gaussian and Salt & Peppers noises (up to 0.5). Further, noise-free and noisy-occluded reference images (up to 75%) are tested. We have demonstrated that low-resolution color images can afford large amounts of compression ratios and strong noise densities. The proposed scheme successfully detects the multiple compressed targets under all the above conditions.