Wavelet Density Estimation for Censored Data and Evaluation of the Mean Integrated Squared Error, Convergence Rate, and Empirical Distribution of the Estimator

Wavelets are a rapidly developing area of modern mathematics, significant both in theory and in application, with a wide range of uses in signal and image compression, signal analysis, and engineering. In this paper we use the wavelet method to estimate the density function for censored data. We evaluate the mean integrated squared error and the convergence rate of the estimator. We also obtain the empirical distribution of the estimator and verify the conclusions with two simulation examples.


Introduction
One type of data of great interest to researchers concerns the time elapsed until a certain event, such as death, occurs. Any process that waits for a specific event produces survival data. The survival function, denoted $S(t)$, gives the proportion of individuals who have survived up to time $t$, measured from the base time at which they entered the experiment. In survival analysis, failure means the occurrence of the event being waited for, and the point from which survival is measured is called the start time. The failure time of individual $i$, denoted $T_i$ for $i = 1, 2, 3, \ldots$, is the time elapsed from the base time until the failure occurs. It is not always possible to observe the failure time of every individual; in such cases censoring occurs. The rate at which the event (failure) occurs in a short interval of time, given that no failure has occurred before it, is the concept known in survival analysis as the hazard function. The hazard function for a failure time $T$ with distribution function $F$ is as follows:

$$h(t) = \lim_{\Delta t \to 0} \frac{P(t \le T < t + \Delta t \mid T \ge t)}{\Delta t} = \frac{F'(t)}{S(t)}, \qquad S(t) = 1 - F(t).$$
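As a quick numerical check of this definition, the following minimal Python sketch evaluates the hazard of an exponential failure time in two ways: from the ratio of density to survival function, $h(t) = F'(t)/S(t)$, and from a finite-difference version of the conditional probability. The exponential model, the function names, and the step `dt` are illustrative assumptions and are not taken from the original study.

```python
import math

def exponential_hazard(t, rate):
    """Hazard h(t) = F'(t) / S(t) for an exponential failure time (assumed model)."""
    density = rate * math.exp(-rate * t)   # F'(t)
    survival = math.exp(-rate * t)         # S(t) = 1 - F(t)
    return density / survival

def numerical_hazard(t, rate, dt=1e-6):
    """Finite-difference version of P(t <= T < t + dt | T >= t) / dt."""
    def cdf(s):
        return 1.0 - math.exp(-rate * s)
    return (cdf(t + dt) - cdf(t)) / (dt * (1.0 - cdf(t)))

# The exponential distribution has a constant hazard equal to its rate.
hazards = [exponential_hazard(t, rate=0.5) for t in (0.1, 1.0, 5.0)]
approx = numerical_hazard(2.0, rate=0.5)
```

Both routes give a hazard close to the rate 0.5 at every time point, which is the constant-hazard property of the exponential distribution.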
Wavelets are well suited to the analysis of transient phenomena and of functions that sometimes change rapidly. Unlike sine waves, which extend over the whole axis, they are localized and of limited duration, so signals with abrupt changes are analyzed better. The close relationship between wavelet coefficients and certain function spaces, the orthogonality of wavelet bases, and their other useful properties simplify the computational algorithms.
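Wavelet theory goes back to Alfred Haar [1], who in 1910 showed that a continuous function $f$ on $[0, 1]$ can be approximated as follows:
$$f(x) \approx c_0\,\phi(x) + \sum_{j=0}^{J} \sum_{k=0}^{2^j - 1} d_{jk}\,\psi_{jk}(x),$$
such that
$$\psi_{jk}(x) = 2^{j/2}\,\psi(2^j x - k), \qquad j \ge 0, \quad k = 0, 1, \ldots, 2^j - 1.$$
Also, for the mother wavelet $\psi$ and the father wavelet $\phi$, the family $\{\phi_{jk}(x) = 2^{j/2}\phi(2^j x - k)\}_k$ is an orthonormal basis for $V_j$, and $V_j$ contains all piecewise constant functions on intervals whose length is exactly twice the length of the intervals of $V_{j+1}$.
A sequence of closed subspaces $\{V_j\}$ of $L^2(\mathbb{R})$ is called a multiresolution analysis with scale function $\phi$ if it satisfies the following conditions:
1- $V_j \subset V_{j+1}$ for all $j \in \mathbb{Z}$;
2- $\bigcap_j V_j = \{0\}$ and $\overline{\bigcup_j V_j} = L^2(\mathbb{R})$;
3- $f(x) \in V_j$ if and only if $f(2x) \in V_{j+1}$;
4- $\{\phi(x - k),\ k \in \mathbb{Z}\}$ is an orthonormal basis of $V_0$.
If we consider the scale function on the interval $[0, 1]$, then the projection of $f$ onto the space $V_j$ is defined as
$$P_j f(x) = \sum_{k} \langle f, \phi_{jk} \rangle\, \phi_{jk}(x),$$
which is a function with resolution $2^j$, and $P_j f$ is a good approximation of $f$ for large values of $j$. Consider the nested sequence of closed subspaces $\cdots \subset V_{-1} \subset V_0 \subset V_1 \subset \cdots$.
The term wavelets refers to a set of basis functions with a very special structure: the whole basis is generated from a scaling function $\phi$ and a mother wavelet $\psi$. Given such a wavelet basis, a function $f \in L^2(\mathbb{R})$ can be written as a formal expansion:
$$f(x) = \sum_{k} c_{j_0 k}\,\phi_{j_0 k}(x) + \sum_{j \ge j_0} \sum_{k} d_{jk}\,\psi_{jk}(x),$$
where
$$c_{j_0 k} = \int f(x)\,\phi_{j_0 k}(x)\,dx, \qquad d_{jk} = \int f(x)\,\psi_{jk}(x)\,dx.$$
As for a general orthogonal series estimator (Daubechies [2]), the density estimator can be written as:
$$\hat{f}(x) = \sum_{k} \hat{c}_{j_0 k}\,\phi_{j_0 k}(x) + \sum_{j = j_0}^{J} \sum_{k} \hat{d}_{jk}\,\psi_{jk}(x),$$
where the obvious coefficient estimators can be written:
$$\hat{c}_{j_0 k} = \frac{1}{n} \sum_{i=1}^{n} \phi_{j_0 k}(X_i), \qquad \hat{d}_{jk} = \frac{1}{n} \sum_{i=1}^{n} \psi_{jk}(X_i).$$
We divide the time axis into small intervals and record the number of events in each interval. We determine the number of events and the hazard function from the observations. We then smooth them separately by linear wavelet density estimation over the whole time range, calculate the function estimator, and evaluate its asymptotic distribution.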
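To make the orthogonal series idea concrete, the sketch below estimates a density with the Haar scaling function alone, using the empirical coefficients $\hat{c}_{jk} = n^{-1} \sum_i \phi_{jk}(X_i)$. It is a minimal illustration under stated assumptions: data supported on $[0, 1)$, a fixed level `j`, no detail (mother wavelet) terms, and a Beta(2, 2) sample chosen only for the example. All function names are hypothetical.

```python
import random

def haar_phi(x):
    """Haar scaling (father) function: indicator of [0, 1)."""
    return 1.0 if 0.0 <= x < 1.0 else 0.0

def phi_jk(x, j, k):
    """Dilated and translated scaling function 2^(j/2) * phi(2^j x - k)."""
    return 2.0 ** (j / 2.0) * haar_phi(2.0 ** j * x - k)

def scaling_coefficients(sample, j):
    """Empirical coefficients c_jk = (1/n) * sum_i phi_jk(X_i), k = 0..2^j - 1."""
    n = len(sample)
    return [sum(phi_jk(x, j, k) for x in sample) / n for k in range(2 ** j)]

def linear_wavelet_density(x, coeffs, j):
    """Linear estimator f_hat(x) = sum_k c_jk * phi_jk(x)."""
    return sum(c * phi_jk(x, j, k) for k, c in enumerate(coeffs))

random.seed(0)
sample = [random.betavariate(2, 2) for _ in range(2000)]  # illustrative data on (0, 1)
level = 3
coeffs = scaling_coefficients(sample, level)

# Evaluate on a midpoint grid and check that the estimate integrates to about 1.
grid = [(i + 0.5) / 64 for i in range(64)]
estimate = [linear_wavelet_density(x, coeffs, level) for x in grid]
integral = sum(estimate) / 64
```

With the Haar scaling function this estimator coincides with a histogram of bin width $2^{-j}$; smoother wavelets replace the indicator by a more regular function while keeping the same coefficient formula.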
In this paper we obtain a density estimator for censored data using the wavelet method, and we evaluate its mean integrated squared error, convergence rate, and empirical distribution.

Density Estimator Using the Wavelet Method
Numerous articles have been published on density function estimation with wavelets. The mathematical theory of wavelets and its application in statistics as a technique for nonparametric curve estimation have been studied by Antoniadis [3].
Afshari [4]-[6] studied the density function estimator, the estimator of the derivative of a density, and the nonparametric regression function for mixing random variables. Donoho [7], Kerkyacharian and Picard [8], Mallat [9], Meyer [10], and others have also published in this field. Hall and Patil [11] derived a formula for the mean integrated squared error of nonlinear wavelet-based density estimators. Antoniadis et al. [12] obtained estimators of the density function and the hazard function for right-censored data using wavelets. In this section we obtain an estimator of the density function for censored data using the wavelet method.
Suppose $X_1, X_2, \ldots, X_n$ are the failure times of the $n$ individuals under study. They are non-negative, independent, and identically distributed, with density function $f$ and distribution function $F$. Let $C_1, C_2, \ldots, C_n$ be the corresponding censoring times, also non-negative, independent, and identically distributed, with density function $g$ and distribution function $G$.
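Assuming that the failure times and the censoring times are independent, the observed random variable $Z_i$, the indicator $\delta_i$, and the hazard function are as follows:
$$Z_i = \min(X_i, C_i), \qquad \delta_i = I(X_i \le C_i), \qquad h(t) = \frac{f(t)}{1 - F(t)}, \quad F(t) < 1.$$
If $G(t) < 1$, then we have:
$$h(t) = \frac{f(t)\,(1 - G(t))}{(1 - F(t))\,(1 - G(t))}.$$
We also define:
$$L(t) = P(Z_i \le t), \qquad f^*(t) = f(t)\,(1 - G(t)), \qquad \text{so that} \qquad h(t) = \frac{f^*(t)}{1 - L(t)}, \quad L(t) < 1,$$
where $1 - L(t) = (1 - F(t))(1 - G(t))$ by independence and $f^*$ is the subdensity of the observed failures. To estimate $f^*$, we divide the time axis into small intervals, record the number of events (0 or 1) in each interval, and then divide these values by the length of the intervals.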
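The censoring mechanism can be sketched in Python as follows. The code assumes the usual right-censoring convention, $Z_i = \min(X_i, C_i)$ and $\delta_i = 1$ when the failure is observed, and, purely for illustration, exponential failure and censoring times with hypothetical rates. Under these assumptions the subdensity $f(t)(1 - G(t))$ integrates to the probability of observing a failure, which the simulated fraction of uncensored observations should approach.

```python
import math
import random

def simulate_censored(n, failure_rate, censor_rate, seed=0):
    """Return pairs (Z_i, delta_i) under independent exponential failure/censoring."""
    rng = random.Random(seed)
    observations = []
    for _ in range(n):
        x = rng.expovariate(failure_rate)   # failure time X_i
        c = rng.expovariate(censor_rate)    # censoring time C_i
        z = min(x, c)                       # observed time Z_i
        delta = 1 if x <= c else 0          # 1 when the failure is observed
        observations.append((z, delta))
    return observations

def subdensity(t, failure_rate, censor_rate):
    """f*(t) = f(t) * (1 - G(t)) for the assumed exponential model."""
    return failure_rate * math.exp(-failure_rate * t) * math.exp(-censor_rate * t)

data = simulate_censored(5000, failure_rate=1.0, censor_rate=0.5)
observed_fraction = sum(delta for _, delta in data) / len(data)

# Midpoint-rule integral of f* over [0, 20]; the tail beyond 20 is negligible here.
steps = 20000
mass = sum(subdensity((i + 0.5) * 20.0 / steps, 1.0, 0.5) for i in range(steps)) * 20.0 / steps
```

Because $f^*$ integrates to less than one whenever censoring occurs, it is a subdensity rather than a density, which is why the text treats it separately from $f$.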
The procedure for estimating $f^*(t)$ can be summarized as follows. Select $\Delta > 0$, collect the observed failures into $K + 1$ intervals of length $\Delta$, and apply wavelet estimation to the binned data to obtain an estimate of the subdensity. That is, we choose a decomposition level $j(n)$, compute the wavelet coefficients of the binned data at scale $j(n)$, and then estimate $f^*(t)$. The following notation is needed to give the details. We compute the estimators on a finite interval $[0, \tau]$. Suppose that $N$ is an integer that may depend on $n$, that the estimation points are $t_k$, $k = 0, 1, \ldots, K$, and that the interval $[0, \tau]$ of the time axis is divided into $K + 1$ intervals of length $\Delta$, the $k$-th of which is denoted $J_k$. We use an indicator function to count the uncensored failures in the time interval $J_k$, and let $U_k$ be the proportion of observed failures in $J_k$, in other words:
$$U_k = \frac{1}{n} \sum_{i=1}^{n} I(\delta_i = 1)\, I(Z_i \in J_k).$$
We smooth the data $U_k / \Delta$ with an appropriate wavelet smoother to obtain the estimate of $f^*$.
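The binning and smoothing steps can be sketched as follows. This is a minimal Python illustration, not the procedure as implemented in the study: it assumes intervals of the form $J_k = [k\Delta, (k+1)\Delta)$, takes $U_k$ to be the proportion of all $n$ observations that are uncensored failures in $J_k$, and replaces the regular wavelet smoother by a Haar projection, which reduces to averaging over blocks of adjacent bins. The exponential rates, the bin counts, and all names are hypothetical.

```python
import math
import random

def binned_failure_ratios(data, tau, num_bins):
    """Return (U_k, width): proportion of observations that are uncensored
    failures in J_k = [k*width, (k+1)*width), and the bin width Delta."""
    width = tau / num_bins
    counts = [0] * num_bins
    for z, delta in data:
        if delta == 1 and 0.0 <= z < tau:
            counts[min(int(z / width), num_bins - 1)] += 1
    return [c / len(data) for c in counts], width

def haar_block_smooth(values, block):
    """Haar projection onto a coarser space: average over blocks of `block` bins."""
    smoothed = []
    for start in range(0, len(values), block):
        chunk = values[start:start + block]
        smoothed.extend([sum(chunk) / len(chunk)] * len(chunk))
    return smoothed

# Illustrative censored sample: exponential failures (rate 1), censoring (rate 0.5).
rng = random.Random(1)
data = []
for _ in range(5000):
    x, c = rng.expovariate(1.0), rng.expovariate(0.5)
    data.append((min(x, c), 1 if x <= c else 0))

ratios, width = binned_failure_ratios(data, tau=2.0, num_bins=64)
raw = [u / width for u in ratios]          # U_k / Delta, the unsmoothed values
smooth = haar_block_smooth(raw, block=8)   # smoothed subdensity estimate

# Under the assumed rates the true subdensity is exp(-1.5 t); its average over
# the first block [0, 0.25) is a reference value for the first smoothed value.
reference = (1.0 - math.exp(-1.5 * 0.25)) / (1.5 * 0.25)
```

Block averaging preserves the total mass of the binned values, so the smoothed estimate integrates to the same observed-failure proportion as the raw one; a smoother wavelet would trade the piecewise-constant shape for better approximation of a regular $f^*$.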
We can write […], where […]. The multiresolution structure provides an efficient tree-structured algorithm for decomposing functions in $V_N$, but the theoretical scaling coefficients are not available, so we need initial values for the fast wavelet transform. Antoniadis [4] suggested the following initial values: […]. As a result, a reasonable estimate of the projection of $f^*$ at resolution $N$ is: […]. Here we assume that the collected values $U_k$ provide the estimators of […] and that $\phi$ is regular of degree $m$. To smooth the data at a better rate for the sample size $n$ and the sequence $j(n)$, we estimate the unknown function $f^*$ as follows: […], which is the orthogonal projection of […] onto the smoother approximation space.
Theorem 2-2: Suppose that the subdensity $f^*$ is a continuous function on $[0, \tau]$ and is $m$ times differentiable. Then, if $\Delta \to 0$ as $n \to \infty$, we have: […].
Proof: By Theorem 2-1 we can write: […]. Since […], we can write: […]. So Equation (9) can be written as follows: […]. By Equation (1) we have: […].
Since $\phi$ is regular of order $m$, we can write: […]. According to Equation (13), we can write: […], which completes the proof.

Evaluation of the Mean Integrated Squared Error and Convergence Rate
In this section we evaluate the mean integrated squared error and investigate the convergence rate.
Definition 3-1: The mean integrated squared error (MISE) of a kernel estimator $\hat{f}$ of a density function $f$ is given by
$$\mathrm{MISE}(\hat{f}) = E \int \left(\hat{f}(x) - f(x)\right)^2 dx \approx C_1 (nh)^{-1} + C_2 h^{2r}.$$
In this formula, $\approx$ denotes that the ratio of the left- and right-hand sides converges to 1 as $n \to \infty$, $n$ denotes the sample size, $h$ denotes the bandwidth of the kernel estimator, $r$ denotes the order of the kernel, and $C_1$, $C_2$ denote quantities that depend on the kernel and on the unknown density.
Theorem 3-1: Suppose that the subdensity $f^*$ is a continuous function on $[0, \tau]$ and is $m$ times differentiable. Then, if $\Delta \to 0$ as $n \to \infty$ and […], we have: […].
Proof: By Equation (15) and Theorem 2-2, for $m \ge 1$ we can write: […]. We can also write: […]. So, by Equations (16) and (17), we can write: […]. To evaluate […], we can write: […]. Also we can write: […]. By Theorem 2-1, taking the expectation of Equation (19), we can write: […]. By Theorem 2-1 we have: […]. By Equation (22) and the fact that $f^*$ is uniformly bounded, we can write: […]. The second part of Equation (20) can be written as: […], and the proof is complete.
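The kernel MISE of Definition 3-1 also shows which convergence rate serves as the benchmark. Assuming the standard two-term form $C_1 (nh)^{-1} + C_2 h^{2r}$ and treating $C_1$, $C_2$ as fixed positive constants (an assumption made only for this worked step), minimizing over the bandwidth $h$ gives:

```latex
\begin{aligned}
\frac{d}{dh}\left[\frac{C_1}{nh} + C_2 h^{2r}\right]
  &= -\frac{C_1}{n h^{2}} + 2r\, C_2\, h^{2r-1} = 0
  \;\Longrightarrow\;
  h_{\mathrm{opt}} = \left(\frac{C_1}{2r\, C_2\, n}\right)^{1/(2r+1)}, \\
\frac{C_1}{n h_{\mathrm{opt}}} + C_2 h_{\mathrm{opt}}^{2r}
  &= O\!\left(n^{-2r/(2r+1)}\right).
\end{aligned}
```

For example, a second-order kernel ($r = 2$) gives $h_{\mathrm{opt}} \propto n^{-1/5}$ and a MISE of order $n^{-4/5}$, the rate against which a wavelet estimator of a twice-differentiable density is usually compared.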

Empirical Distribution of the Proposed Estimator
In this section we investigate the empirical distribution of the estimator under certain conditions.
By Equation (23) we have: […]. By Equations (24) and (25), the terms I and III tend to zero as $n \to \infty$, and finally we have: […], such that for each fixed $k$ and $i = 1, 2, \ldots, n$, the $Y_{ik}$ form an independent and identically distributed random sample with mean: […]. By the Cauchy-Schwarz inequality: […]. So we can write: […]. Using the fact that $f^*$ is uniformly bounded and […], the expression in Equation (26) converges in $L^2$ and therefore in distribution. Also, by Theorem 2-2, we have: […]. We check the Lindeberg condition in order to prove that term II is asymptotically normal. For this purpose, we set: […]. By the Cauchy-Schwarz inequality: […]. So we can write: […], which completes the proof.

Simulation and Numerical Computation for the Proposed Estimator
In this section we simulate $\hat{f}_n(t_k)$ on data of size $n$ using the Symmlet wavelet. We examine the convergence rate of the estimator by computing its average mean squared error. We use R software and a wavelet package for the simulation.
Example 1: We generate […]. Figure 2 displays the wavelet estimator of the subdensity of the observed failures for traditional censored data. The solid line shows the subdensity estimate based on the actual data and the dotted line is the true density.
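A simulation of this kind can be sketched in a few lines. The study itself used R with a Symmlet wavelet; the Python code below is only a stand-in that assumes exponential failure and censoring times with hypothetical rates and uses a Haar (block-constant) estimator of the subdensity, so its numbers are not comparable with Tables 1 and 2. It computes the average mean squared error over repeated samples for several sample sizes.

```python
import math
import random

def simulate_censored(n, rng, failure_rate=1.0, censor_rate=0.5):
    """Pairs (Z_i, delta_i) from independent exponential failure/censoring times."""
    data = []
    for _ in range(n):
        x = rng.expovariate(failure_rate)
        c = rng.expovariate(censor_rate)
        data.append((min(x, c), 1 if x <= c else 0))
    return data

def haar_subdensity_estimate(data, tau, num_blocks):
    """Block-constant (Haar) estimate of f* on [0, tau): U_k / Delta per block."""
    width = tau / num_blocks
    counts = [0] * num_blocks
    for z, delta in data:
        if delta == 1 and 0.0 <= z < tau:
            counts[min(int(z / width), num_blocks - 1)] += 1
    return [c / (len(data) * width) for c in counts]

def true_subdensity(t, failure_rate=1.0, censor_rate=0.5):
    """f*(t) = f(t) * (1 - G(t)) for the assumed exponential model."""
    return failure_rate * math.exp(-(failure_rate + censor_rate) * t)

def average_mse(n, replications, tau=2.0, num_blocks=8, seed=0):
    """Average, over replications, of the mean squared error at block midpoints."""
    rng = random.Random(seed)
    width = tau / num_blocks
    midpoints = [(k + 0.5) * width for k in range(num_blocks)]
    total = 0.0
    for _ in range(replications):
        estimate = haar_subdensity_estimate(simulate_censored(n, rng), tau, num_blocks)
        total += sum((e - true_subdensity(t)) ** 2
                     for e, t in zip(estimate, midpoints)) / num_blocks
    return total / replications

errors = {n: average_mse(n, replications=50) for n in (100, 400, 1600)}
for n, err in errors.items():
    print(n, round(err, 5))
```

The average error shrinks as the sample size grows, which is the qualitative behavior the convergence results describe; the resolution (`num_blocks` here) would normally be increased with $n$ to balance bias and variance.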

Conclusion
In this paper we obtained a density estimator for censored data using the wavelet method and evaluated its mean integrated squared error. We showed that the convergence rate is acceptable and that, under certain conditions, the empirical distribution of the estimator is normal.

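A wavelet basis is characterized by a scaling function $\phi$ and a mother wavelet $\psi$ such that $\int \phi(x)\,dx = 1$ and $\int \psi(x)\,dx = 0$. The other wavelets in the basis are then generated by translations of the scaling function and by dilations and translations of the mother wavelet, using the relationships:
$$\phi_{j_0 k}(x) = 2^{j_0/2}\,\phi(2^{j_0} x - k), \qquad \psi_{jk}(x) = 2^{j/2}\,\psi(2^j x - k).$$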
The results in Table 1 display the average mean squared errors of the subdensity function estimator for a sample of observed failures from traditional censored data. The solid line is the density estimator and the dotted line is the true density.

Table 1. The average mean squared errors of the subdensity function estimator by the wavelet method.

Table 2. The average mean squared errors of the subdensity function estimator by the wavelet method.