^{1}

^{1}

^{*}

^{2}

Various open source software are managed by using several bug tracking systems. In particular, the open source software extends to the cloud service and edge computing. Recently, OSF Edge Computing Group is launched by OpenStack. There are big data behind the internet services such as cloud and edge computing. Then, it is important to consider the impact of big data in order to assess the reliability of open source software. Various optimal software release problems have been proposed by specific researchers. In the typical optimal software release problems, the cost parameters are defined as the known parameter. However, it is difficult to decide the cost parameter because of the uncertainty. The purpose of our research is to estimate the effort parameters included in our models. In this paper, we propose an estimation method of effort parameter by using the genetic algorithm. Then, we show the estimation method in section 3. Moreover, we analyze actual data to show numerical examples for the estimation method of effort parameter. As the research results, we found that the OSS managers would be able to comprehend the human resources required before the OSS project in advance by using our method.

Recently, the network-oriented open source software (OSS) has been used many users. In particular, the cloud and edge computings attract the extensive attention of the developers and users, because of the network revolution such as 5 G. Also, the OSS is helpful for many users to make a cost reduction, standardization, and quick delivery. In particular, several research papers in terms of cloud and edge computings such as “OpenStack” are proposed in the past [

Historically, the software reliability growth models have been applied to the system testing phase of software development [

As the research background, it is difficult for the OSS managers to comprehend the human resources required before the OSS project in advance. Especially, the environment of software development in the cloud and edge computing will be the complex situation. Also, there are several optimal release problems based on the software reliability growth models. Then, the cost parameters are given in the past researches. On the other hand, we can appropriately control the human resources, if the software managers can estimate the effort parameters in advance.

The problems and solutions in order to solve in the paper are listed as follows:

1) the estimation method of the given parameters included in the existing optimal release problems;

2) the optimal release problem with application to the effort expense optimization;

3) the optimal solution by using GA.

This paper discusses a method of OSS project management considering irregular fluctuation via the big data arose from OSS development and management under the cloud and edge computing. In particular, the effort expense optimization method based on Wiener process model in terms of effort is proposed in order to estimate the effort parameters in this paper. Then, the genetic algorithm is applied to the proposed method. Also, we analyze actual software effort expense data by showing numerical examples of OSS project optimization analysis.

In the assumptions of Wiener process modeling, we consider the following situations:

1) the noisy event of software effort expense arise from the complex management in big data environment;

2) the factor of big data is defined as “3 V model” such as Volume, Variety, and Velocity;

3) the velocity directly influence the software effort, the variety and volume indirectly influence one.

Moreover, it is important to consider the effort control, because several research papers considering the relationship between effort and fault have been proposed by in the past [

Then, we can obtain the following two dimensional stochastic differential equation with Brownian motion derived from software reliability growth modeling approach based on Wiener process [

d Ψ ( t ) = { ζ ( t ) − 1 2 ( υ 1 2 + υ 2 2 ) } { λ − Ψ ( t ) } d t + υ 1 { λ − Ψ ( t ) } d ω 1 ( t ) + υ 2 { λ − Ψ ( t ) } d ω 2 ( t ) . (1)

Then, each parameter is as follows:

Ψ ( t ) : the cumulative maintenance effort expense at up to operational time; t ( t ≥ 0 ) in the OSS development project, which takes on continuous real values;

ζ ( t ) : the increase rate of maintenance effort expense at operational time t and a non-negative function;

λ : the estimated maintenance effort required until the end of operation;

ω 1 ( t ) : 1st Wiener process considering the “Volume” factor;

ω 2 ( t ) : 2nd Wiener process considering the “Variety” factor;

υ 1 : 1st positive constant for “Volume” representing a magnitude of the irregular fluctuation;

υ 2 : 2nd positive constant for “Variety” representing a magnitude of the irregular fluctuation.

Moreover, we define the increase rate of maintenance effort expense in case of ζ ( t ) . We assume the following equations based on software reliability models [

ζ ( t ) ≐ d I ( t ) d t o − I ( t ) = p 1 + q ⋅ exp ( − p t ) . (2)

Then, the parameters in Equation (2) are given by

I ( t ) : the mean value function of the inflection S-shaped software reliability growth model based on a nonhomogeneous Poisson process (NHPP);

o: the expected number of latent faults;

p: the fault detection rate per fault;

q: is defined as 1 − r r ;

r: the impact parameter in terms of “Velocity”.

Therefore, the cumulative maintenance effort expense up to time t is obtained as follows:

Ψ ( t ) = λ [ 1 − 1 + p 1 + q ⋅ exp ( − p t ) ⋅ exp { − p t − υ 1 ω 1 ( t ) − υ 2 ω 2 ( t ) } ] . (3)

Similarly, the estimated maintenance effort expense required until the end of operation can give as follows:

Ψ r ( t ) = λ [ 1 + p 1 + q ⋅ exp ( − p t ) ⋅ exp { − p t − υ 1 ω 1 ( t ) − υ 2 ω 2 ( t ) } ] . (4)

Above mentioned model parameters can easily estimate by using the method of maximum likelihood [

We discuss the effort expense optimization based on the conventional optimal software release problems. Generally, it is interesting for the software developers to estimate the time when we should stop the testing phase in order to release software efficiently. Therefore, several researchers have discussed about the determination of software release times minimizing the expected total software cost [

According to the conventional optimal release problems, we can consider the optimal software maintenance problem with software effort expense from the relationship between cost and effort. Our research group has proposed several optimal maintenance problem based on software effort expense. Then, we define the following effort parameters:

τ 1 : the fixing effort expense per fault during the operation;

τ 2 : the effort expense per unit time during the operation;

τ 3 : the maintenance effort expense per fault after the maintenance. Then, the software effort in the operation can be formulated as:

Θ 1 ( t ) = τ 1 Ψ ( t ) + τ 2 t . (5)

Also, the software maintenance effort expense after the maintenance is represented as follows:

Θ 2 ( t ) = τ 3 Ψ r ( t ) . (6)

Consequently, from Equations (5) and (6), the total software maintenance effort expense is given by

Θ ( t ) = Θ 1 ( t ) + Θ 2 ( t ) . (7)

The optimum maintenance time t * is obtained by minimizing Θ ( t ) in Equation (7).

In the past, the effort expense parameter has been assumed as the given parameter according to the conventional optimal software release problem. We propose the estimation method of the effort parameters based on genetic algorithm. Then, we consider as the search problem minimizing the total software maintenance effort expense in Equation (7).

In the past, we have proposed several parameter estimation methods for jump diffusion process models [

Step 1. The initial values are randomly generated;

Step 2. The crossover is executed;

Step 3. The value of goodness-of-fit is calculated from the following evaluation function κ i included parameters:

min δ κ i ( δ ) , κ i = ∑ i = 0 I { Θ ( i ) − α i } 2 ( i = 1 , 2 , ⋯ , I ) . (8)

Then, the parameters included in evaluation function are as follows:

Θ ( i ) : the cumulative amount of maintenance effort expense at up to operation time i in Equation (4);

α i : i-th actual cumulative amount of maintenance effort expense;

δ : the set of effort expense parameters τ 1 , τ 2 , and τ 3 .

Step 4 and Step 2-Step 3 are continued until minimizing the value of evaluation function κ i .

Then, the fitness function based on our model is formulated as follows:

{ min τ 1 , τ 2 , τ 3 κ i ( τ 1 , τ 2 , τ 3 ) , κ i ( τ 1 , τ 2 , τ 3 ) = ∑ i = 0 I { Θ ( i ) − α i } 2 . (9)

Specifically, κ i ( τ 1 , τ 2 , τ 3 ) in Equation (9) is given by the following equation

κ i ( τ 1 , τ 2 , τ 3 ) = ∑ i = 0 I { Θ ( i ) − α i } 2 = ∑ i = 0 I { Θ 1 ( i ) + Θ 2 ( i ) − α i } 2 = ∑ i = 0 I { τ 1 Ψ ( i ) + τ 2 i + τ 3 Ψ r ( i ) − α i } 2 = ∑ i = 0 I { τ 1 λ [ 1 − 1 + p 1 + q ⋅ exp ( − p i ) ⋅ exp { − p i − υ 1 ω 1 ( i ) − υ 2 ω 2 ( i ) } ] + τ 2 i + τ 3 λ [ 1 + p 1 + q ⋅ exp ( − p i ) ⋅ exp { − p i − υ 1 ω 1 ( i ) − υ 2 ω 2 ( i ) } ] − α i } 2 . (10)

Then, the effort expense parameters τ 1 , τ 2 , and τ 3 are searched by the genetic algorithm under the flow in

We show several numerical examples by using the effort expense data of Apache

HTTP Server Project [

and τ 3 , respectively. Similarly, we show the sensitivity analysis of the sample paths for effort expense parameter τ 1 , τ 2 , and τ 3 in Figures 5-7, respectively. From Figures 5-7, we find that the noises become large in the early phase of operation.

In the past, the effort or cost parameters are the given ones. Conversely, we consider that the effort parameters can be estimated by using Equation (9).

τ 1 = 0.90915 , τ 2 = 10130.2 , τ 3 = 2.19561.

From the estimated results, we found that the effort parameters are appropriately estimated by using the combination optimization problem. Then, the software manager will be able to allocate the human resources for the software development appropriately. In other words, the OSS managers can comprehend the maintenance effort expense required in advance by using the proposed method. Therefore, the OSS managers will be able to prepare the human resources required before the OSS project.

The key significance points of the research contribution is as follows:

· There is no research paper in terms of the estimation method of cost parameters and effort ones based on the software reliability growth models. The

cost parameters and effort ones have been given as the experimental parameters of software managers in the past. Therefore, we cannot compare the proposed method with the conventional method, because there is no similar method.

· The managers of OSS project can decide the optimal software operation effort by using the estimated effort parameters τ 1 , τ 2 , and τ 3 , respectively. Thereby, the OSS managers can achieve the cost reduction and quick delivery.

Also, several research papers in terms of the optimization algorithms have been proposed in the past [

From above mentioned, it is difficult to compare the proposed method with the conventional one. Alternatively, we show the comparison results based on several GA’s. Figures 10 and Figures 11 are the results based on “L-BFGS-B” [

τ 1 = 1.0362 , τ 2 = 9617.0 , τ 3 = 2.1984.

As the comparison results based on two kinds of algorithms, the estimation results based on “Nelder-Mead” is pessimistically estimated from

Therefore, the proposed method is stable without regard to the optimization algorithms in GA.

We have discussed the optimum maintenance problem based on the genetic algorithm. In particular, we have proposed an estimation method of effort expense parameters included in our effort function. Moreover, we have concretely shown numerical examples of effort optimization. Then, we have found that the OSS managers can comprehend the human resources required before the OSS project in advance.

Finally, we have discussed the contribution of the proposed method. In particular, we have focused on two dimensional Wiener processes model for effort expense estimation. We conclude the advantage points as follows:

1st Point: The proposed stochastic differential equation model can assess OSS projects considering the influence of big data in terms of 3 V model.

2nd Point: The proposed method can search the optimum maintenance time under the complex situation such as Equation (9).

3rd Point: Before the OSS operation, the OSS managers can comprehend the required human resource in advance.

The OSS managers will be able to control the software effort expense from the stand point of 3 V model as big data by using the proposed effort expense optimization analysis.

At present, the cloud and edge computing based on OSS is attracting the most attention. It is difficult to assess the optimum maintenance time, because the cloud and edge computing have the network-based-service and many distributed node servers. As the future study, it will be important to discuss the optimum maintenance problem for the cloud and edge computing service.

This work was supported in part by the JSPS KAKENHI Grant No. 20K11799 in Japan.

The authors declare no conflicts of interest regarding the publication of this paper.

Sugisaki, K., Tamura, Y. and Yamada, S. (2021) OSS Effort Expense Optimization Based on Wiener Process Model and GA. Journal of Software Engineering and Applications, 14, 11-25. https://doi.org/10.4236/jsea.2021.141002