Energy Aware Processor Architecture for Effective Scheduling and Power Management in Cloud Using Inclusive Power-Cognizant Processor Controller

The rapid adoption of cloud technology in industry has increased the need for energy conservation and for energy-aware scheduling methods in the cloud. Power consumption is one of the foremost concerns in the cloud, because the use of cloud storage by individuals and organizations is growing rapidly. Developing an efficient power management processor architecture has therefore gained considerable attention. However, conventional power management mechanisms fail to consider task scheduling policies. This work therefore presents a novel energy-aware framework for power management, built around an Inclusive Power-Cognizant Processor Controller (IPCPC) for efficient power utilization. To evaluate the proposed method, simulation experiments were conducted with randomly generated tasks as well as tasks collected from Google Trace Logs. The experiments based on real-world Google Trace Logs indicate that the proposed framework leads to less than 9% of total power consumption per task of server, which indicates a reduction in the overall power needed.


Introduction
The pay-per-use cost model of cloud computing has enabled software, application, hardware infrastructure, and platform service providers to offer computing services on demand. This upward trend in cloud computing, combined with the demand for data storage virtualization, is driving the rapid evolution of datacenter technologies towards more cost-effective, user-driven, and energy-efficient solutions. Cloud computing is defined as "A large-scale distributed computing paradigm that is driven by economies of scale, in which a pool of abstracted, virtualized, dynamically-scalable, managed computing power, storage, platforms, and services are delivered on demand to external customers over the Internet" [1]. Power consumption is one of the prominent issues in the cloud [2]. In the cloud model, data owned by a user is managed in a distributed manner, and allocating resources to the correctly identified user process in a distributed cloud system consumes additional energy. Moreover, multiple users access the cloud at the same time, which increases the energy cost enormously; this high energy consumption produces a large amount of heat, which can in turn cause the hardware to fail [3].
In a cloud data center, workloads vary, so it is common for most servers to run at low utilization. Energy efficiency can be improved by putting idle servers to sleep, thereby reducing power consumption. Under low load, processor utilization is about 10%, yet power consumption is over 50% of the peak power [4]. In the cloud model, multiple data center applications are hosted on a common set of servers. This permits the application workloads to be consolidated onto a small number of servers, which are then better utilized. Consolidation can be problematic, however, if it packs the maximum workload onto the minimum number of servers and consequently degrades performance. Reducing the energy consumption of a cloud data center is thus a challenging task. The concept of green computing has gained much attention recently; it was developed for efficient resource utilization as well as for reduction of energy consumption.
The proposed work presents a framework for power management in the cloud. The framework calculates how much power and which configuration a server requires to process a task, such as uploading a file, and then schedules the task to the server that requires the minimum power to process it. The proposed system introduces a novel Inclusive Power-cognizant Processor Controller (IPCPC) for minimizing power utilization. IPCPC integrates three techniques: Collection of Configuration Management (CCM), Server/Task Mapping (STM), and the Anticipating Power Manager (APM). CCM estimates and sets the server configurations in the data center. STM performs scheduling and task mapping. APM estimates the current power consumption of the server by identifying three major portions of the power consumption: the power consumption of processor execution, the power consumption of the server except for processors, and the baseline power consumption of the idle processor. The output of APM is given to the energy-aware Earliest Deadline First (EDF) algorithm, which maps each task to a virtual machine of a server. Unused virtual machines of a server can be turned off, and the working frequency adjusted, to reduce power consumption and prolong the lifetime of the servers. The main objectives of the proposed work are as follows.
• Enhance the system performance by using a task scheduling algorithm.
• Minimize the power consumption.
The rest of this paper is organized as follows. Section 2 reviews previous work on power management and scheduling in the cloud. Section 3 introduces the detailed architecture of the proposed work, and Section 4 analyzes the experimental results. Conclusions are drawn in Section 5.

Related Work
Energy conservation in cloud computing is attracting wide attention in the research community and is leading to a new computing era known as green computing. Efficient scheduling techniques for reducing energy consumption in data centers have been thoroughly examined in [5]-[7]. Chase, Anderson, Thakar, Vahdat, and Doyle address the energy-efficient management of homogeneous resources in Internet hosting centers. Their method is suited to power-efficient resource allocation at the data center level, and energy consumption is reduced by switching idle servers to power-saving modes [8]. Arindam Banerjee, Prateek Agrawal, and N. Ch. S. N. Iyengar [2] investigate the areas of a typical cloud infrastructure that are responsible for a significant amount of energy consumption and propose methodologies for decreasing power utilization. Shin-ichi Kuribayashi [3] identifies the need for collaboration among the servers, the communication network, and the power network to reduce power consumption in a cloud environment. That paper proposes signaling sequences to exchange information on power consumption between the network and the servers. To realize the proposed policy, the volume of power consumed by all network devices is estimated and assigned to individual users. Luna Mingyi Zhang, Keqin Li, Dan Chia-Tien Lo, and Yanqing Zhang [4] consider several green task scheduling algorithms for heterogeneous computers with continuous speeds and discrete speeds. All these algorithms focus on minimizing energy consumption as well as determining an optimal speed for the tasks assigned to each computer. Awada Uchechukwu, Keqiu Li, and Yanming Shen [9] characterize energy consumption and performance in cloud environments by analyzing and measuring the impact of various task and system configurations. Their paper presents formulas for calculating the total energy consumption in cloud environments. Andrew J. Younge, Gregor von Laszewski, Lizhe Wang, Sonia Lopez-Alarcon, and Warren Carithers [10] present a framework for providing efficient green enhancements within a scalable cloud computing architecture. The framework derives efficient methods for VM scheduling, VM image management, and advanced data center design. The scheduling technique addresses the placement of VMs within the cloud infrastructure while minimizing the operating costs of the cloud itself. This is typically achieved by optimizing either the power of the server equipment or the overall temperature within the data center. The image management attempts to control and manipulate the size and placement of VM images in various ways to conserve power. Yan Ma, Gong B, Sugihara R, and Gupta R. [11] investigate power-aware scheduling algorithms for heterogeneous systems that meet the deadline constraints of high-performance computing applications. A pricing scheme is also presented in which the price of a task varies with its energy usage and depends on the rigidity of its deadline.
Lizhe Wang et al. [12] study the case of reducing the power consumption of parallel tasks in a cluster with the Dynamic Voltage Frequency Scaling (DVFS) technique. The paper also discusses the relationship between energy consumption and task execution time.
Robert Basmadjian, Hermann De Meer, Ricardo Lent, and Giovanni Giuliani [13] study private cloud computing environments from the perspective of energy saving. The paper presents a generic conceptual description of the ICT resources of a data center and identifies their corresponding energy-related attributes. It presents power consumption prediction models for servers, storage devices, and network equipment, and shows that by applying appropriate energy optimization policies guided by accurate power consumption prediction models, it is possible to save about 20% of energy consumption in typical single-site private cloud data centers.
Recently, a number of research works have addressed energy-efficient scheduling in data centers [14]. The conventional power reduction scheme in a cloud system relies on an automatic mechanism to control the usage of peripheral operations and the processor frequency. Such mechanisms fail to meet user requirements and do not consider the workloads and operational status of the processors in the multiple cloud servers of a data center. Moreover, multiple processors are often not required, since cloud devices are not heavily loaded most of the time. The unused idle processors can be shut down to save more power. In this paper, a novel framework is established with the aim of reducing total energy consumption in data centers. The proposed method shows that, by applying an energy consumption reduction technique and a suitable scheduling technique, it is possible to save a large amount of power in cloud data centers. The main contributions of the proposed Inclusive Power-cognizant Processor Controller to cloud storage are as follows.
• Innovative concept to reduce power consumption of server by Server/Task Mapping.
• Power management for the entire cloud storage system.

Power Aware Processor Using Inclusive Power-Cognizant Processor Controller
This section gives a detailed explanation of the energy-aware scheduler IPCPC, which is proposed to minimize the power consumption of the server and thereby enhance system performance. IPCPC collects the configuration details of the server when a task is issued or completed, based on the current status of the server and the server workload configuration. It can manage host on/off states, adjust the working frequency, and schedule the task queues of each server to achieve the best system performance and to reduce the power consumption of the server system. To achieve this objective, the mechanism schedules the tasks of the task set under some constraints. First, the tasks entering the system are sorted by deadline. Second, all possible system configurations are determined by IPCPC. Then, the tasks are scheduled to the most feasible configuration to achieve improved load balance as well as reduced power consumption. To this end, the IPCPC processor manager uses three techniques: CCM, STM, and APM. Figure 1 illustrates the conceptual organization of IPCPC. The following subsections introduce the details of these three mechanisms.

Collection of Configuration Management Technique
In a cloud system with IPCPC, a huge number of tasks are submitted to the cloud, and these tasks are maintained in a task set denoted as Ti = {T1, T2, …, Tn}. The number of available servers in the cloud is denoted as K. The data center has a large number of servers, denoted as Si = {S1, S2, …, Sk}, and each server hosts a number of virtual machines according to its capacity. The enabling status of a server is denoted as Si = 1 when the corresponding server is powered on, and Si = 0 when the server is in the shutdown/sleep state. The set of all possible combinations of the data center enabling status is denoted as DCi = {S1, S2, …, S(k-1)}, where DC1 = (1, 0, …, 0) and DC(k-1) = (1, 1, …, 1); the number of combinations of the server enabling status is 2(k-1). The set of possible working frequencies of server Si is denoted as Fi = {fij | 1 ≤ j ≤ m, fi1 < fi2 < … < fim}, where fi1 is the lowest frequency and fim is the highest frequency. All working frequencies of the server systems are therefore denoted as Freq = {F1, …, Fg, …, Fk}, Fg ∈ {f1, …, fm | f1 < … < fm}. The workload of task i and the server on which it executes are denoted as Ti.L and Ti.S, respectively. The set of all tasks is denoted as TSi = {T1.S, T2.S, …, Tn.S}; TS_cur and TS_temp represent the current task set and the temporary task set. The proposed CCM technique must be executed to evaluate a feasible server configuration. CCM is executed when a task is issued, indicated by Tissue = Begin, or when a task is completed, indicated by Tissue = Completed. CCM determines the possible system configurations that achieve the lowest virtual machine migration, excellent load balance, and the highest working frequency. From the collected configuration details, a suitable one is selected for allocation. The server system configuration is denoted as,

Config = {Power, S, freq, L, TM}    (1)

Equation (1), which is generated by CCM, consists of five components: S denotes the reasonable server system, freq refers to the working frequency of the server, Power denotes the expected power consumption of the server, calculated using the Anticipating power model, L denotes the highest working load of the server, and TM is the maximum task migration number achieved by STM. The additional functions of the IPCPC are as follows.
Offline computing evaluates the relevant parameters λ, ω, β and TS_cur, which are used by CCM and APM.
Server/Task Mapping (STM) (Ti.L, Di) schedules and assigns the tasks based on their load and deadline. The following subsection discusses the technique in detail.
Load (i, TS) estimates and returns the workload value of the task set on server i, where the task set is scheduled and reassigned in order to improve the load balance. This value is also used by the Anticipating power model to predict the power.
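The configuration tuple of Equation (1) and the auxiliary function Load (i, TS) can be sketched as follows. This is a minimal illustration rather than the implementation used in the experiments: the class and field names are hypothetical, and the order of preference in pick_config (fewest migrations, then best load balance, then highest frequency) is an assumption based on the three criteria listed for CCM.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Task:
    name: str
    load: float    # Ti.L, workload of the task
    server: int    # Ti.S, index of the server executing the task


@dataclass(frozen=True)
class Config:
    power: float      # Power, expected consumption from the power model
    servers: tuple    # S, enabling status per server (1 = on, 0 = off/sleep)
    freq: tuple       # freq, working frequency per server
    max_load: float   # L, highest working load among the servers
    migrations: int   # TM, task migrations needed to reach this configuration


def load(i, task_set):
    """Workload of the tasks in task_set that execute on server i."""
    return sum(t.load for t in task_set if t.server == i)


def pick_config(candidates):
    """Assumed preference: fewest migrations, then balance, then frequency."""
    return min(candidates, key=lambda c: (c.migrations, c.max_load, -max(c.freq)))


ts_cur = [Task("T1", 2.0, 1), Task("T2", 3.0, 1), Task("T3", 4.0, 2)]
a = Config(power=120.0, servers=(1, 1), freq=(2.0, 2.0), max_load=5.0, migrations=0)
b = Config(power=90.0, servers=(1, 0), freq=(2.4, 0.0), max_load=9.0, migrations=1)
chosen = pick_config([a, b])
```

Here load(1, ts_cur) sums the loads of T1 and T2, and pick_config selects configuration a because it needs no migration, even though b has the lower expected power; a different priority order would change that choice.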

Server Task Mapping
The power consumption of a server is notably affected by the workload of the server in the cloud. Good workload balance among the servers improves the overall performance of the datacenter. To achieve load balancing, the proposed concept uses an effective scheduling algorithm called Earliest Deadline First (EDF). The scheduling algorithm considers factors such as deadline, cost, reliability, and availability of the workflow. The performance of a job depends on the execution time (ei (Ti)) of task (Ti) on the server machine. For this reason, the execution time of a task should be calculated from the MIPS rate before the task is assigned to a server. The deadline of a task is represented as di.
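The execution-time estimate described above can be illustrated with a short sketch. It assumes that the task length is expressed in millions of instructions (MI), so that dividing by the MIPS rate gives seconds; the function names and the start_s parameter are hypothetical.

```python
def execution_time(length_mi, mips):
    """Estimated execution time ei(Ti) in seconds for a task of length_mi
    million instructions on a machine rated at mips."""
    if mips <= 0:
        raise ValueError("mips must be positive")
    return length_mi / mips


def meets_deadline(length_mi, mips, deadline_s, start_s=0.0):
    """True if a task started at start_s finishes no later than its deadline di."""
    return start_s + execution_time(length_mi, mips) <= deadline_s


# A 48,000 MI task on a 2400 MIPS virtual machine takes 20 seconds.
t = execution_time(48_000, 2400)
```

With these numbers, the task meets a 25 second deadline but misses a 15 second one, which is the check a scheduler would make before assigning the task.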

Task Arrangement
In the cloud, a large number of tasks Ti = {T1, T2, …, Tn} and servers Si = {S1, S2, …, Sk} are available. Algorithm 1 gives a detailed description of the arrangement of tasks in the queue. Initially, the queue Q, the current task set (TS_cur), and the temporary task set (TS_temp) are assumed to be empty. The current task set contains the tasks currently available for scheduling, and the temporary task set maintains the currently executing tasks. When a task Ti enters the cloud, this is indicated as Tissue = Begin. The basic idea of the proposed task arrangement algorithm is to arrange the arriving task set by deadline. The load of each task is calculated as Ti.L using the auxiliary function Load (i, TS), and its deadline di is derived from the task length, where i varies from 0 to n. The task set is maintained in a queue, and the workload of the task set is the sum of the individual task loads. The tasks are sorted in ascending order of load and deadline. The flow chart is shown in Figure 2.
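The task arrangement step can be sketched with a priority queue. In this sketch the deadline is the primary key and the load is the tie-breaker; the text says only that tasks are sorted in ascending order of load and deadline, so that priority order is an assumption, and the tuple layout is hypothetical.

```python
import heapq


def arrange(tasks):
    """Return task names in queue order.

    tasks is an iterable of (name, load, deadline) tuples. The earliest
    deadline comes first; among equal deadlines, the smaller load comes first.
    """
    heap = [(deadline, load, name) for name, load, deadline in tasks]
    heapq.heapify(heap)
    return [heapq.heappop(heap)[2] for _ in range(len(heap))]


incoming = [("T1", 5, 30), ("T2", 2, 10), ("T3", 1, 30)]
queue = arrange(incoming)
# Workload of the task set is the sum of the individual task loads.
total_load = sum(load for _, load, _ in incoming)
```

T2 has the earliest deadline and heads the queue; T3 precedes T1 because both share a deadline and T3 has the smaller load.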
2) Use CCM to gather the configuration of the server.
3) Sort all the Si in descending order.
4) If Si has a feasible configuration then
5) Choose Si with config = {Power, s, Freq, L, TM}
6) Assign task Ti to Si
7) Update Si.config = {Power, s, Freq, L, TM} after completing the allocation of the task.
The basic idea of the EDF scheduling algorithm is to use APM and CCM to balance the workload and to reduce the power consumption of the servers. Before a task is scheduled, the power of the current server system is calculated using APM, and the system configuration is measured using CCM. Based on this information, the task is scheduled to a feasible server system, and TS_temp is increased by one. When the task is fully completed, Tissue = Completed is initiated. Algorithm 2 gives a detailed description of the scheduling process. The EDF scheduling algorithm takes as parameters the load of the individual task and its deadline. The proposed work calculates the deadline by considering the task length. The process with the minimum load and the earliest deadline is sent to the head of the queue and is assigned to the enabled server i. The current task in the queue is submitted for scheduling after the tasks have been arranged in ascending order of workload and deadline. The server system configuration is identified and the status of the server is evaluated. Based on this information, it is determined whether the server has the capacity to accommodate the task; if so, the task is allocated to the server. Figure 3 shows the data flow diagram for the EDF scheduling algorithm.
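The scheduling loop described above can be sketched as follows. This is an illustration under stated assumptions, not the algorithm as evaluated: feasibility is reduced to a capacity check, the power estimate is a hypothetical linear stand-in for APM (an idle cost paid when a sleeping server is woken, plus a per-load term), and all field names are invented for the example.

```python
def added_power(server, task_load):
    """Extra power (hypothetical units) if task_load is placed on server."""
    # Waking a sleeping server costs its idle power on top of the load term.
    wake = server["idle_w"] if server["load"] == 0 else 0.0
    return wake + server["w_per_load"] * task_load


def edf_schedule(tasks, servers):
    """Map tasks to servers in deadline order.

    Returns {task name: server id, or None if no server can host the task}.
    The servers dict is updated in place as tasks are assigned.
    """
    assignment = {}
    # Earliest deadline first; the smaller load breaks ties.
    for task in sorted(tasks, key=lambda t: (t["deadline"], t["load"])):
        # Feasible servers have enough spare capacity for the task.
        feasible = [sid for sid, s in servers.items()
                    if s["load"] + task["load"] <= s["capacity"]]
        if not feasible:
            assignment[task["name"]] = None
            continue
        best = min(feasible, key=lambda sid: added_power(servers[sid], task["load"]))
        servers[best]["load"] += task["load"]  # update the server configuration
        assignment[task["name"]] = best
    return assignment


servers = {
    "S1": {"capacity": 10, "load": 0, "idle_w": 100.0, "w_per_load": 5.0},
    "S2": {"capacity": 10, "load": 0, "idle_w": 80.0, "w_per_load": 6.0},
}
tasks = [
    {"name": "A", "load": 4, "deadline": 20},
    {"name": "B", "load": 4, "deadline": 10},
    {"name": "C", "load": 4, "deadline": 30},
]
plan = edf_schedule(tasks, servers)
```

In this example B and A are consolidated on S2, because waking S1 would cost more than adding load to an already running server, and C goes to S1 only once S2 has no spare capacity. That behaviour follows from the assumed power model, not from the measured results.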

Anticipating Power Model
As mentioned earlier, IPCPC uses three major techniques to define the power-aware model of the cloud. The power level of each server cannot be measured exactly and promptly with a power meter, so this section explains how to predict the power of a server using APM. APM estimates the current power consumption of a server by identifying the three major portions of the power consumption: the power consumption of server execution, the power consumption of the other components except for the server, and the baseline power consumption of the idle server.
In Equation (2), APM denotes the predicted power, ε denotes the power consumption of the server's core processor, and β represents the power consumption of the other components of the cloud system, excluding the server processor. β can be treated as a constant when the configurations of the components in the cloud servers are the same. When the data center consists of K servers, the power consumption of a server is denoted as μ, and Equation (3) gives the total power consumption of the servers. According to the results of [15] [16], the power consumption of the server core is formulated as P = KCV²f, where K denotes a constant, C represents the capacitance of the server, V refers to the working voltage of the server, and f is the working frequency of the server processor. As the system workload increases, the power consumption of the server processor also increases. The enabling status (ON/OFF state) of the server also affects the power consumption of the server, so Equation (3) can be extended as in Equation (4), where Ph denotes the enabling status of server h: Ph = 1 refers to the situation in which server h is turned ON, and Ph = 0 refers to the situation in which server h is OFF or asleep. Fh represents the working frequency of the server, Vh denotes the working voltage of the server, and loadh refers to the workload of the server, which can be estimated from the additional IPCPC function Load (i, Ti.L) and TS_cur. Moreover, ω is a constant factor relating the workload to the power consumption of the server. Finally, the overall power consumption of the system can be represented as Equation (5), where ε, β, and ω are obtained from the Offline computing function and vary from one cloud system to another.
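The quantities named above can be combined in a short numerical sketch. Only the core term P = KCV²f is taken from the text; the way the terms are combined here (an additive workload term ω·loadh for each enabled server, plus a constant β) is an assumption made for illustration and should not be read as the exact form of Equations (2) to (5). All parameter values are hypothetical.

```python
def core_power(k, c, v, f):
    """Core power P = K * C * V^2 * f, as cited from [15] [16]."""
    return k * c * v ** 2 * f


def predicted_power(servers, k, c, omega, beta):
    """Assumed total: beta + sum over enabled servers of
    (core power + omega * load). Servers with on == 0 contribute nothing."""
    total = beta
    for s in servers:
        if s["on"]:  # Ph = 1: server is ON; Ph = 0: OFF or asleep
            total += core_power(k, c, s["v"], s["f"]) + omega * s["load"]
    return total


fleet = [
    {"on": 1, "v": 1.0, "f": 3.0, "load": 0.5},
    {"on": 0, "v": 1.0, "f": 3.0, "load": 0.0},
]
total = predicted_power(fleet, k=1.0, c=2.0, omega=10.0, beta=5.0)
```

Because the voltage enters quadratically, doubling V at a fixed frequency quadruples the core term, which is why lowering the working voltage and frequency together is an effective way to save power.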

Simulation and Experimental Results
This section presents the experimental analysis of the IPCPC defined in Section 3. Experiments are conducted to analyze the power consumption of each server. To demonstrate the performance improvements of IPCPC, the proposed algorithm is compared with EARH [17] and with existing scheduling algorithms such as Greedy-R [18], Greedy-P [18], and FCFS [18]. System performance is assessed with the following metrics. Resource utilization by task (RU) is the number of resources used by a task. Effective utilization (EU) indicates whether resources are effectively utilized as the number of tasks varies. Guarantee ratio (GR) gives the number of tasks guaranteed to meet their deadlines out of the entire task set. Total energy consumption (ΔEC total) gives the total energy consumed by the server, and power consumption per task (PCT) gives the total power consumption per accepted task.
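Two of these metrics reduce to simple ratios, sketched below. The definitions follow the descriptions above; expressing GR as a fraction of submitted tasks and the handling of empty inputs are assumptions, and the sample numbers are invented.

```python
def guarantee_ratio(accepted, submitted):
    """GR: fraction of submitted tasks guaranteed to meet their deadlines."""
    return accepted / submitted if submitted else 0.0


def power_per_task(total_power, accepted):
    """PCT: total power consumption divided by the accepted task count."""
    return total_power / accepted if accepted else float("inf")


gr = guarantee_ratio(accepted=90, submitted=100)
pct = power_per_task(total_power=450.0, accepted=90)
```

A scheduler that rejects many tasks can look efficient on total energy alone, which is why PCT is normalized by the accepted task count and read together with GR.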

Experimental Setup
The CloudSim toolkit is used as the simulation platform. A data center has been simulated comprising multiple hosts, each with CPU performance equivalent to 9600 MIPS, 40 GB of RAM, and 11 TB of storage. Each virtual machine requires up to 2400 MIPS. These VMs are needed in order to support a wide variety of hardware, software, and varying user tasks. The Xen hypervisor provides the virtualized hardware to each VM. An operating system is then needed within the VM to accomplish the task; x86 hardware running Linux is suggested for this application. This configuration is able to detect the varying load of the tasks. Running modprobe to load a single module takes only 15 seconds.
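As a quick check of the capacity figures above, the following sketch computes how many virtual machines fit on one host. It assumes that CPU capacity is the only binding constraint (RAM and storage are ignored) and that every VM requests the full 2400 MIPS; the function names are hypothetical.

```python
import math

HOST_MIPS = 9600  # CPU capacity of one simulated host
VM_MIPS = 2400    # maximum CPU demand of one virtual machine


def vms_per_host(host_mips=HOST_MIPS, vm_mips=VM_MIPS):
    """Number of full-size VMs that fit on one host by CPU capacity alone."""
    return host_mips // vm_mips


def hosts_needed(vm_count, host_mips=HOST_MIPS, vm_mips=VM_MIPS):
    """Minimum number of powered-on hosts for vm_count full-size VMs."""
    return math.ceil(vm_count / vms_per_host(host_mips, vm_mips))


per_host = vms_per_host()
```

Under these assumptions each host carries four VMs, so ten VMs need three powered-on hosts and the remaining hosts are candidates for sleep.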
The aim of this set of experiments is to validate the performance effect of the EDF scheduling algorithm. Figure 4 shows the performance of the EDF scheduling algorithm compared with Cura [19] and the other three existing algorithms; the parameter compared is resource utilization with varying deadlines.
Figure 5 shows the comparison between the proposed IPCPC and Cura; the parameter compared is resource utilization with varying task count. Resource utilization is considered for comparison because ineffective utilization of cloud resources leads to unnecessary power consumption, whereas effective utilization helps diminish it.
Figure 6(a) shows that the algorithm essentially maintains its guarantee ratio even when the task count is varied. IPCPC with EDF achieves a higher guarantee ratio than the other algorithms. Figure 6(c) compares the total energy consumption of tasks. Six different algorithms are compared, and the results indicate that the proposed IPCPC is the most efficient.

Evaluation Based on Real Data from Google Trace
The above groups of experiments show the performance of the different algorithms with randomly generated input tasks. To evaluate the proposed algorithm in practical use, experiments were carried out using data from a real-world Google trace as input. The details of the Google trace logs are given in [20]. The trace log covers 29 days. In total, 25 million tasks, grouped into 650 thousand jobs, were recorded in the trace log and processed at Google in nearly one month. Since the amount of data is massive, only the first 5 hours of day 18 [20] were chosen for testing. During these 5 hours, 200 thousand tasks were submitted to the cloud. The task count varies over time. On average, a task takes 1587 seconds to finish from the time of its submission.
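The scale of this workload can be made concrete with a short calculation from the figures above. The arrival rate follows directly from the task count and the window length; the in-flight estimate applies Little's law and therefore assumes a roughly steady arrival rate, which is only an approximation since the task count varies over time.

```python
tasks_submitted = 200_000  # tasks in the selected window
window_s = 5 * 3600        # 5 hours in seconds
mean_sojourn_s = 1587      # average time from submission to completion

# Average arrival rate in tasks per second.
arrival_rate = tasks_submitted / window_s

# Little's law: mean number of tasks in the system = rate * mean time in system.
tasks_in_flight = arrival_rate * mean_sojourn_s
```

This gives about 11 task arrivals per second and, under the steady-state assumption, on the order of 17,600 tasks in the system at any moment.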
The effective utilization of resources for varying task count is shown in Figure 7. The experiments are based on tasks collected from the Google trace log.
The total number of tasks guaranteed to meet their deadlines based on tasks collected from Google trace log is shown below in Figure 8.
Figure 9 gives the result of power consumption per task to the increasing task count based on experiments conducted from the tasks collected from Google Trace Log.
Figure 10 gives the outcome of experiments conducted for total energy consumption for varying task count.
All the above graphs show results based on real-world trace records. The analysis of these results indicates that the proposed power management framework gives more efficient results than the previous algorithms.

Conclusion
In this paper, the problem of energy conservation in the cloud is investigated. As a feasible solution, a framework for power management known as IPCPC is established. It can reduce the overall power consumption and enhance resource utilization. The experimental results indicate that IPCPC reduces power consumption more efficiently than the traditional power-aware algorithms. The experimental results show that the total power consumption per task of server in IPCPC is 9%, which indicates a reduction in the overall power needed.

Figure 1. The organization of proposed IPCPC system.

Figure 2. The data flow diagram for Task arrangement algorithm.

Else
8) If Si.Config < Si+1.Config
9) Choose Si+1.Config

Figure 3. The data flow diagram for EDF scheduling algorithm.

Equation (4) (extension of Equation (3) with the server enabling status; terms: Ph, C, Vh², Fh, loadh).

Figure 4. Deadline-based resource utilization of IPCPC.

Figure 5. Resource utilization based on number of tasks.

Figure 6. (a) Guarantee ratio; (b) power consumption per task for varying task count; (c) total energy consumption.

Figure 6(b) gives the result of power consumption per task for increasing task count. Considering the above outcomes, it is established that the proposed IPCPC has the least power consumption.

Figure 7. Effective utilization of real world Google trace.

Figure 8. Guarantee ratio of real world Google trace.

Figure 9. Power consumption per task of real world Google trace.

Figure 10. Total energy consumption of real world Google trace.

Figure 11. Resource utilization of real world Google trace.