A systematic review on effective energy utilization management strategies in cloud data centers
Journal of Cloud Computing volume 11, Article number: 95 (2022)
Data centers are becoming considerably more significant and energy-intensive due to the exponential growth of cloud computing. Cloud computing allows people to access computer resources on demand. It provides amenities on the pay-as-you-go basis across the data center locations spread over the world. Consequently, cloud data centers consume a lot of electricity and leave a proportional carbon impact on the environment. There is a need to investigate efficient energy-saving approaches to reduce the massive energy usage in cloud servers. This review paper focuses on identifying the research done in the field of energy consumption (EC) using different techniques of machine learning, heuristics, metaheuristics, and statistical methods. Host CPU utilization prediction, underload/overload detection, virtual machine selection, migration, and placement have been performed to manage the resources and achieve efficient energy utilization. In this review, energy savings achieved by different techniques are compared. Many researchers have tried various methods to reduce energy usage and service level agreement violations (SLAV) in cloud data centers. By using the heuristic approach, researchers have saved 5.4% to 90% of energy with their proposed methods compared with the existing methods. Similarly, the metaheuristic approaches reduce energy consumption from 7.68% to 97%, the machine learning methods from 1.6% to 88.5%, and the statistical methods from 5.4% to 84% when compared to the benchmark approaches for a variety of settings and parameters. So, making energy use more efficient could cut down the air pollution, greenhouse gas (GHG) emissions, and even the amount of water needed to make power. The overall outcome of this review work is to understand different methods used by researchers to save energy in cloud data centers.
Cloud Computing has become a flexible, resourceful, efficient, and prevalent computational technology that offers users reliable, customized, and dynamic computing environments. Cloud applications are hosted on high-capacity systems and storage devices in multiple locations around the world. Rapid demand for cloud-based facilities essentially requires the development of massive data centers that consume excessive amounts of electricity. Optimization of energy can be proficient by uniting resources based on current utilization, well-organized network, and the thermal position of nodes and computing equipment. Because maximizing the utilization of physical servers is essential in lowering a data center’s (DC) energy demand, virtual machines (VMs) have been effectively introduced in DCs to increase server resource utilization. A method for cost-effective VM migration based on fluctuating electricity prices cuts the energy costs of running a cloud service by a large amount.
Cloud computing is an extension of parallel computing, utility computing, cluster computing, and grid computing. It is distributed in nature, so a group of independent resources are spread in remote locations. Cloud computing is defined by NIST as “a model for enabling ubiquitous, convenient, on-demand network access to a shared pool of configurable computing resources (e.g., storage, networks, servers, services, and applications) that can be rapidly provisioned and released with minimal management effort or service provider interaction” [1, 2].
The service models of cloud computing are Software as a Service (SaaS), Platform as a Platform (PaaS), and Infrastructure as a Service (IaaS). In SaaS, the client has access to cloud services via a web browser to maintain user interaction and data in the cloud. PaaS is a service that allows customers to use the platform and tools instead of purchasing and paying for software licences for platforms such as operating systems, databases, and intermediary applications.
IaaS means the necessary environment to facilitate cloud services. It contains the pool of hardware resources related to computing, storage, networking, etc. Based on the model of deployment, clouds are categorized into four types. The term “public cloud” refers to an infrastructure that allows the general public to store and access data from any location using a client device with an internet connection. Private Cloud: A private cloud or enterprise cloud is one where the facilities and infrastructure are available for the organization or partner’s use only. A Hybrid Cloud: When a private cloud is combined with public cloud computing. Community Cloud: Resources are shared by multiple organizations that serve a particular community with common concerns [3, 4].
Today, research community’s top priorities are energy conservation and effectiveness. The issue of excessive energy utilization arises as a result of unexpected and rapid changes in the environment around the globe . The levels of carbon footprint and Green House Gases (GHG) in the environment have rapidly increased. The information and communication technology (ICT) industry has been identified as the primary emitter . The rise of sophisticated and diverse data-intensive services and applications has exacerbated energy challenges. The intensity and constant growth of ICT energy demand have necessitated not only meeting energy requirements but also developing and implementing efficient energy-savings methods. According to a 2016 survey, the total global energy consumption and CO2 emissions are expected to rise by 48% and 34%, respectively, between 2010 and 2040 . Also, the Climate Action Group found that the world released 32 gigatonnes of CO2 in 2015 .
The paper is organized as follows: First, a brief introduction of cloud computing, motivation, virtualization, energy consumption, SLAV, VM consolidation, CloudSim, workload datasets, purpose, and classification of the survey have been explained. Further next section defines the discussion, analysis, objectives, limitations, and evaluation of existing related work for heuristic, metaheuristic, machine learning, and statistical techniques with tools, performance metrics, and comparisons with their benchmark algorithms related to energy consumption. In last section, result analysis, major challenges, suggestions, and future work are elaborated. Finally, the summary and conclusion of the review paper is summarised to improve energy efficiency in cloud data centers.
The idea behind cloud computing is to provide on-demand quick access to cloud data centers and to administer the operations from a remote location. Cloud computing operates on a pay-as-you-go pricing model, allowing organizations to reduce operational costs and manage infrastructure more effectively. The motivation for conducting the survey, entitled ‘Effective Energy Utilization Management Strategies in Cloud Data Centers’ is to reduce power utilization in well-organized data centers with the help of VM consolidation. There are several proposed resource management approaches for several computing domains, but only a few addresses the issue of energy efficiency in addition to optimizing profit and service quality. Many magnificent studies have been devoted to confirming the consolidation achieved to an appreciable value, but it is still in its developing stage. Various survey papers on load balancing [9, 10], resource provisioning , resource scheduling [12, 13], resource allocation , and resource utilization  have been published. These surveys explored resource management classification and compared state-of-the-art algorithms based on many significant characteristics of cloud computing. But the classification and techniques related to effective energy utilization approaches have not been discussed in detail in the current study. As a result, there is a need for a complete and systematic assessment of existing energy-efficient strategies, as well as their limitations, to entice academics to work in this domain. This study provides an attempt to investigate the categorization of energy-efficient virtual machine consolidation thoroughly, which will be useful for future research in developing new energy-efficient algorithms or methodologies. The limitations of current approaches are emphasized to inspire future research work challenges and the development of algorithms. The following are the primary contributions of the review paper:
Investigate and analyze the various existing energy-efficient methods in cloud data centers.
Classification of VM management using heuristics, metaheuristics, machine learning, and statistical techniques.
The most important parts of each classification are explained, and a summary of future research goals is also given.
An overview of the tools and workload traces that can be used in the cloud environment to measure how well an algorithm works has been shown.
Overall, the goal is to ascertain how well computers use their resources and consume the least amount of energy possible while still meeting SLA limits for RAM, CPU, bandwidth, etc.
Virtualization technology manages massive data centers more efficiently by allowing several applications, software, and operating systems to run on a single host. It bridges the hardware resources and the operating system, dividing the cloud services into logical units called virtual machines (VMs) . Virtualization solutions such as Xen, VMware, and KVM (Kernel-based VM) are used to construct virtual environments in cloud data centers . Figure 1 displays the classification of energy management techniques.
Energy – efficient cloud computing
Cloud computing offers virtualized resources in cloud data centers for handling several requests for different tasks. A cloud data center’s infrastructure often consists of thousands of huge computing hosts with fast processing resources that use a tremendous amount of energy. So, energy-efficient cloud computing is a step forward in analyzing, and implementing global energy reductions in a system providing quality of services while lowering costs . We may conserve energy by consolidating hardware and minimising repetition. If necessary, services should be able to be virtualized and controlled within a data centre, as well as relocated to other locations. To support energy efficiency in the future, machine-readable accounting of the requirements and characteristics of applications, networks, servers, or even entire sites must be available . Energy consumption in a cloud DC organization with m nodes and n switching elements is written as follows .
PUE (Power Usage Effectiveness) is a typical efficiency indicator for data center energy usage that describes how satisfactorily a data center utilizes energy. The PUE formula is well described by eq. (2), which says that it is the ratio of the total energy used in the building to the total energy used by IT equipment in a data center:
As measured at the meter, the electricity dedicated to the data center facility is included in the total facility energy which includes all loads, such as IT equipment, lighting systems, cooling systems, and power supply components. Total IT equipment includes all the energy used by storage, computing, networking, and other control devices like KVM switches, displays, workstations, and laptops, etc.
Energy consumption and service level agreement
Cloud service providers develop an infrastructure where large numbers of high-end computers or servers are installed and interconnected. This hardware platform provides computing, storage, and different amenities to the customer via the internet. As a cloud service provider, the management of power consumption becomes a crucial task. Effective management of resources are required to optimize power utilization, quality of service, cost-effective, and maximize performance with accuracy. In addition to energy utilization and SLA violation, financial expenses and CO2 emissions from data center cooling systems have a substantial impact on the environment .
The most significant challenges in cloud computing are task scheduling, resource utilization, load balancing , SLA, quality of service (QoS), scalability, disaster recovery, safety, fault tolerance, resource management, energy efficiency, virtual machine migration, and automated service provisioning . This review work focuses on the previous study of energy efficiency or power consumption, which should be minimized. However, energy and SLAV are inversely associated, as illustrated in Fig. 2. There is a trade-off between energy consumption and performance (QoS). Performance is described in terms of SLA, which defines the standards and services with throughput, service time, delay time, and reaction time given by the deployed system. A simulation for the environment mentioned in  is performed using the LrMmt host overload detection method with various safety parameter values. The allocation strategy uses the tuning parameters to anticipate the CPU utilization by the host. For example, if the parameter is set to 1.2, the projected utilization is increased by 20%, providing the host a 20% safety buffer to enhance its consumption without violating SLAs. The results reveal that when this value drops, more VMs are packed into a host. Figure 2 shows that when the safety parameter falls, the EC drops and the number of SLA breaches grows. As a result, the parameters must be set to balance the SLAV and EC.
As a result, cloud providers must cope with the trade-off between energy-performance and reducing energy consumption while fulfilling QoS standards. Buyya et al.  showed that when the utilization threshold increases, energy usage is reduced but the percentage of SLAV is also increased. This is because a higher utilization threshold permits more aggressive VM consolidation but at the expense of an increased chance of SLAV. As a result, to save energy, aggressive VM consolidation may result in performance or QoS deterioration, resulting in SLAV. So, while reducing energy utilization, SLAV should also be considered to ensure high adherence to the SLA. To minimize EC and SLAV, the combined metric ESV that captures energy consumption and the level of SLAV is calculated for the performance parameter, as EC decreases with the increased level of SLAV.
In a cloud data center, a central node routes customer applications to the appropriate servers. This facility is known as VM scheduling. To advance the quality of services and efficient management of power consumption, VM scheduling has been done in such a way that a minimum number of hosts are in a state of running. This method is also known as Dynamic Consolidation of Virtual Machine (DCVM) . Predicting host utilization is an ongoing research effort, and a variety of solutions have been proposed. A single host can host more than one VM, and as per user request, VMs use hosts’ resources. When the request of resource host is underutilized or overutilized then VM has to be relocated. This action is known as VM migration and is a popular approach for controlling power consumption. Migration of virtual machines from underutilized and overloaded hosts is a difficult job. To shrink the quantity of VM migrations, appropriate VM selection, and VM placement methods must be developed. When a VM moves from a host that is too busy, both the source host and the new host use power without providing any services.
CloudSim [24, 25] is free, accessible software for simulating cloud computing services and frameworks. This simulator was designed by the CLOUDS (Cloud Computing and Distributed Systems) research laboratory at Melbourne University. Written entirely in Java, CloudSim is a toolkit used to prototype and imitate a cloud computing setting. It enables the modelling of virtualized environments, as well as their administration and on-demand resource management . This simulator is also enhanced to allow for energy-aware models and power models to simulate service applications with variable workloads.
As CloudSim simulator is the preferred tool for research where the workload traces of data is used to test the algorithm. Many researchers are working on PlanetLab or Bitbrains data workloads, where a file associated with one VM denotes the CPU utilization of physical machines. Some workload traces include dynamic data such as CPU, RAM, disc, and network I/O values . PlanetLab workload traces  with statistical features are given in Table 9. Bitbrains is a cloud service agency that focuses on managed hosting and enterprise business computation . Bitbrains’ dataset comprises resources that are used by 1750 VMs from a distributed cloud center. This dataset is published online in the Grid workloads archive . It is divided into fastStorage and Rnd traces. The fastStorage contains 1250 VMs, and Rnd traces have 500 VMs. The fastStorage data is divided into one file per VM, with each file comprising 30 days of data collected every 5 minutes. Bitbrains workload traces with statistical features are given in Table 10 Apart from PlanetLab or Bitbrains, some other workload traces such as Google cluster traces [30, 31], Alibaba cluster , Azure trace , microservices cluster , etc. are also used by researchers. In May 2011, Google released a 29-day cluster trace – a history of every job request, scheduling choice, and resource use statistics for all tasks in a Google Borg computing cluster. The Alibaba group publishes the Alibaba cluster trace program. Their initiative assisted researchers, students, and others interested in the subject by providing cluster traces from the real-world. This allows a better understanding of the features of current internet data centers (IDCs).
Purpose and classification of survey
The Cloud data centers that host and store data are the backbone of cloud computing, which consists of networked computers, power supply, cables, and other components. Data centers that host cloud applications require a lot of energy for resources, leading to high operational costs and carbon release. As expected from a survey, total global energy utilization and carbon dioxide emissions are expected to rise by 48% and 34%, respectively, between 2010 and 2040 . According to a McKinsey analysis , “the entire expected energy expense for cloud data centers in 2010 was $11.5 billion, and cost of energy doubles every five years in a typical data center”. So, cloud data centers are becoming very expensive and harmful to the environment. The authors of  has conducted a systematic examination of the present status of software solution that helps in reduction of energy consumption in data centers and also stated the impact of data centers on the environment. In  the use of big data, cloud, and IoT leads to higher demands for hyperscale data centers (HDCs) for data storage and processing. The analysis of 60 regions done by the researchers has predicted the overall increase in the energy consumption of HDCs, carbon emissions and electricity costs, that focus the purpose of the survey.
The main challenge is to set up a balance between system performance and energy utilization . In this detailed systematic survey, a balance between energy efficiency and performance using VM placement , VM selection, and migrations , has been analyzed for data storage and processing . This paper analyses the approaches performed by various academicians, organizations, researchers in the field of energy consumption in cloud data centers during VM scheduling. Researchers have also compared their method with the benchmark method using different algorithms of heuristics, metaheuristics, machine learning, and statistical methods. Their results show an improvement in energy-saving and thus reduces power consumption. Fig. 3 shows the detailed classification of effective energy management strategies in cloud data centers categorized into four groups i.e., heuristics, metaheuristics, machine learning and statistical. For efficient energy utilization the stages of dynamic VM consolidation is shown in Fig. 4.
In cloud computing effective energy management strategies related work has been provided by researchers. Many researchers have applied different techniques for VM management and energy-efficient strategies to reduce energy consumption in cloud data centers. Some have focused on heuristic methods as classified in Fig. 5, some on metaheuristics as classified in Fig. 6 and Fig. 7, some on machine learning as described in Fig. 8, and others on statistical methods categorized in Fig. 9. To balance the load and decrease energy usage, cloud data centers use live VM migration . VMs are dynamically distributed among the hosts during a live migration to reduce the number of low utilization hosts and maximize the number of high utilization hosts. Although dynamic VM consolidation can significantly reduce energy usage, live migration increases service level agreement violations. As a result, in order to decrease energy usage while satisfying service level agreements, cloud data centers require an effective dynamic VM consolidation solution. The dynamic VM consolidation procedure can often be divided into three parts .
Details of different techniques, algorithms, workload data, approaches, and the researchers’ work are described below. In first section, the heuristic techniques and their different approaches are used to minimize energy usage. Many researchers work on the first fit decreasing (FFD), best fit decreasing (BFD), modified best fit decreasing, power-aware BFD, etc. Next, the metaheuristic techniques including swarm intelligence, evolutionary algorithm, nature-inspired algorithm, and physics-based algorithm are used to reduce energy consumption and to satisfy service level agreement. Machine learning techniques reinforcement learning, neural network (NN), support vector machine (SVM), and k-nearest neighbor (kNN) are elaborated in further section and finally statistical techniques using mean, standard deviation, regression, PPR gear, and ARIMA are explained.
Virtual machine management using heuristic techniques
A heuristic technique is a strategy for solving the problem, that is derived from the Greek term ‘eurisko,’ which means to search, find, or discover. It is about employing a practical technique that does not have to be perfect. Heuristic approaches reduce the time required to find a satisfactory answer. In cloud computing heuristic techniques are used for VM consolidation. In this approach, different researchers use FFD, BFD, MBFD, PABFD, and other algorithms for VM allocation, migration, and placement to reduce energy consumption.
Srikantaiah et al. 2008 , the virtual machine consolidation (VMC) problem was introduced as a bin packing problem. Researchers only examined two criteria: disc and CPU use. The analysis revealed that there is energy-performance compensation for consolidation, with the existence of optimal operating conditions. They constructed a cloud setting, collected data, and developed a bin packing issue using static random threshold values. Other resources, like memory and network, should also be measured, as they may be limiting resources for particular applications.
Beloglazov et al. (2010)  For VM consolidation with random data, researchers employed single threshold (ST), minimization of migration (MM), and bin packing strategies. The authors attempted to strike an ideal balance between energy savings and desired performance. They consolidated VMs based on current resource use, network topologies employed in VMs, and thermal status. An energy-aware resource scheduling system based on heuristics for VM allocation and live migration was suggested. The authors structured it as a bin packing issue and evaluated the effort using preset thresholds using the CloudSim toolbox. The results show that dynamic VM consolidation with adaptive thresholds outperforms static thresholds. Non-power-aware (NPA), dynamic voltage frequency scaling (DVFS), and ST methods were used to test the MM algorithm. Using energy savings, the MM algorithm outperformed ST, DVFS, and NPA by 23%, 66%, and 83%, respectively, with thresholds set at 30–70%, resulting in SLA breaches of 1.1%. The MM policy resulted in 6.7% SLA breaches and 43%, 74%, and 87% higher energy savings than the ST, DVFS, and NPA policies when the threshold value was kept at 50–90%.
Anton et al. (2011)  provided a heuristic approach for resource distribution that is energy efficient. The policy allocated resources to consumer apps in an energy-efficient manner while ensuring QoS by utilizing energy-efficient mapping heuristics using the consolidation of virtual machines. For VM placement, an improved form of the best fit decreasing modified BFD (MBFD) technique was utilized, as well as three double-threshold VM selection policies, random choice policy (RCP), highest potential growth, and MM. CPU usage data were produced at random by utilizing fixed criteria. The results in the CloudSim toolbox showed that energy consumption was reduced by 77% and 53%, respectively, as compared to NPA and DVFS policies, with SLA breaches of 5.4%.
Beloglazov et al. (2012)  Researchers proposed a dynamic VM consolidation approach because fixed thresholds are not feasible in a dynamic cloud environment. The authors reported dynamic threshold values by statistically assessing four histories of CPU use. The reallocation was carried out utilizing a dynamic threshold method. The MBFD technique was utilized to place the VMs. SLA-aware metrics were also examined. The results obtained by running the algorithm on the CloudSim toolkit with a genuine PlanetLab trace demonstrated the validity of the suggested framework. But in the model only single-core CPUs were used, and only a single-core resource CPU was tested.
Arani et al. (2018)  by providing a VM placement strategy, researchers concentrated on reducing energy use (VMP-BFD). VMs were mapped to hosts using an approach centred on the best fit decreasing approach, which significantly decreased energy use and SLA violations. The developed algorithm employed the theory of learning automata, correlation coefficients, and the ensemble forecast technique for VM allocation to hosts. The method assigned a VM to a host whose VMs had the least association with the chosen VM for placement. Compared to other reference policies, the results of the simulations on the CloudSim platform showed a big improvement in lowering the energy use and the SLA violations.
Wang et al. (2018)  focused on energy-efficient dynamic virtual machine consolidation (DVMC) by introducing an approach for virtual machine placement called “Space-Aware Best Fit Decreasing” (SABFD). The authors also created a VM selection strategy called “High CPU Utilization-based Migration VM Selection” (HS). The suggested system was evaluated in several ways by utilizing the CloudSim toolkit and the Planet Lab workload. The results showed that DVMC designs with a range of SABFD and HS produced the better results.
F.F. Moges et al. (2019)  proposed the OpenStack Neat framework’s VM placement method to address the issue of consolidation. They introduced VM placement methods that modify heuristics bin-packing to account for host energy efficiency. When linked to the reference algorithms PABFD and MBFD, the proposed algorithms improve energy proficiency. Depending on the host categories and workloads, the energy proficiency improvement over MBFD can be up to 67%. They also defined an innovative bin-packing method termed a “medium-fit” to avoid unnecessary SLAV and VM migrations. The MFPED (medium-fit power-efficient decreasing) offers a lower SLAV and VM migration rate compared to other VM placement methods. SLAV and VM relocations are reduced to 78% and 46%, respectively, when compared to MBFD, depending on the cloud scenario. They used CloudSim to test the suggested algorithms’ performance in three different data-center situations: heterogeneous, homogeneous, and default. Data workloads that execute in cloud centers are derived from PlanetLab and Bitbrains cloud traces.
Bhattacherjee et al. (2019)  for large historical data sets, proposed prediction technique that was accepted and employed in the current strategy known as the minimization of migration and dynamic thresholding system instead of static thresholds. The MBFD algorithm is used in prediction-based minimization of migration (PMM) to place the VMs. Markov chain learning is applied to formulate the past data for upcoming forecasting deployments. CloudSim 3.0.3 has been used to run rigorous simulations, and the outcomes show a decrease in cloud data center energy utilization.
Xialin Liu et al. (2020)  proposed dynamic consolidation by using migration thrashing. It prioritizes VMs with high dimensions and remarkably decreases migration thrashing. The degree of relocations required maintaining service-level agreements (SLAs) by keeping VMs prone to relocation thrashing on the identical physical servers rather than migrating. Their method improves the relocation thrashing measured around 28%, the number of movements measured around 21%, and the SLAV measured around 19%. When the server is overloaded, their solution detects VMs with sufficient capacity by restricting that VMs with excessive capacity are not transferred. Imitations of a wide-ranging research setting employing a workload data set from numerous PlanetLab VMs were used to validate the suggested techniques.
Saikishor Jangiti et al. (2020)  DRR-FFD and DRR-BinFill are cutting-edge VMC algorithms based on the concepts of FFD (first-fit decreasing) and DRR (dominant residual resource) that organize VMs based on a single VM resource. Researchers proposed an energy-efficient architecture — EMC2 — for an IaaS cloud service provider. The vector bin-packing techniques VMNeAR-E and VMNeAR-D are proposed. In a python context, simulation tests were conducted utilizing a dataset acquired from the EnergyStar® API for diverse physical servers. The suggested VMNeAR-D heuristic saved up to 3.318% of energy on the average across 40 schedules.
Garg et al. (2021)  provided load-aware three-gear THReshold (LATHR) and the MBFD algorithm to reduce overall energy consumption even though they improved service quality in terms of SLA. It produces promising results when used with a dynamic workload and a flexible count of virtual machines (1–290) on each host. The results of the projected work were evaluated concerning service level agreements (SLAs), energy utilization, the number of relocations against various numbers of virtual machines (VMs), and instruction energy ratio (IER). The proposed technique reduces SLA defilements (26%, 55%, and 39%) as well as energy consumption (12%, 17%, and 6%) when related to interquartile range (IQR), median absolute deviation (MAD), and double threshold overload acknowledgement strategies, respectively.
Alharbi et al. (2021)  improved existing research that manages data center resources using two independent layers: applications allotted to VMs and VM placement to hosts; both are bin packing problems. This sequential double-layered bin packing (Consec2LBP) solves issues easily and restricts added solution quality development. This research proposes an integrated ant colony optimization strategy to deal with the layers simultaneously to overcome this issue. It converts two-layer resource management into an optimization problem known as integrated double-layer bin packing (Int2LBP). Then, to solve this optimization challenge, a combined FFD technique known as Int2LBP_FFD is derived. To improve the quality of the result, a combined ant colony system, Int2LBP_ACS, has been developed, where the result of Int2LBP_FFD is used as a preliminary solution. In simulations of nine scales of data centers based on GTC data logs, integrated double-layer Int2LBP_FFD outperforms sequential Consec2LBP_FFD. They’ve also shown that Int2LBP_ACS is better than Int2LBP_FFD concerning energy investments. The Int2LBP_ACS and Int2LBP_FFD algorithms provide scalability.
T Kaur et al. (2022)  The Power Aware Energy Efficient Virtual Machine Migration (PAEEVMM) Method has been developed to migrate virtual machines in data centres depending on the temperature threshold value. Based on temperature, this approach moves the heavily loaded virtual machine to the less loaded virtual machine. The simulation was run on CloudSim Plus, and the outcomes are assessed against first fit algorithms. The experiment demonstrates that the suggested approach performs better in terms of CPU and electricity usage.
A brief description of the above detailed literature review and algorithms developed using heuristic methods with different workload data is given in Table 1. Table 2, summarises the work, method, and comparison with their benchmark methods/ algorithm to evaluate energy consumption. Figure 10 depicts the percentage difference in energy reduction or energy savings in graphical form. The implementation of these algorithms has been tested using different settings. The authors have already talked about the host specification, virtual machine description, datasets, simulators, and other criteria for comparing the proposed method to their benchmark algorithm.
Virtual machine management using metaheuristic methods
A metaheuristic is a problem-solving strategy based on a heuristic method that is independent of the problem’s nature. A single-solution local search metaheuristic and a random search metaheuristic are the two types of metaheuristic methods. Metaheuristic approaches have been shown to produce near-optimal solutions in a reasonable amount of time and are problem-independent, allowing them to be used in a wide range of situations. It is advantageous in a cloud setting to locate a suboptimal solution quickly. Different metaheuristic techniques based on swarm intelligence, bio-inspired, physics-based, and evolutionary algorithms are used by researchers for VM consolidation to reduce energy consumption. This method was implemented for resource prediction, VM migration, VM placement, load balancing, etc.
Kousiouris et al. (2011)  worked on the analysis and performance of VM which depends on several parameters. They proposed the effects on VM performance prediction, persistent allocation proportions, VM co-placement, and instantaneous arrangement on the identical host. They applied a genetic algorithm (GA) to optimize an artificial neural network (ANN) and used linear regression to investigate degradation prediction.
Aryania et al. (2018)  proposed a technique using an ACS to resolve the VM consolidation (VMC) issue to reduce energy utilization in data centers. They took into account energy utilization through virtual machine migration. They presented an energy-aware VMC process based on an ACS to handle the VMC issue as a multi-objective optimization challenge. On the arbitrary workload in several circumstances, simulation findings showed that EVMC-ACS increased the number of sleeping hosts by 16% as related to ACS-VMC. Also, the suggested algorithm minimizes relocations by 89%, the power consumption during a migration by 91%, SLA violations by 79%, and overall energy consumption by 25% relative to ACS-VMC.
Goyal et al. (2019)  worked on PSO and CSA algorithms. The goal of optimizing energy utilization in the cloud is also addressed in the article. CloudSim simulators and common programming languages were utilized in their suggested work. Several performance measures, such as energy efficiency, response time, and execution time, were used to judge how well the work performs.
M Tarahomi et al. (2020)  approached micro-genetic method for choosing the right physical host for a virtual machine. Their simulations reveal that the micro-genetic method enhances power consumption relations. The suggested approach was tested using CloudSim and their result was related to the reference algorithms (genetic and PABFD VM provisioning algorithms) in various scenarios with the datasets of 10 working days. According to experimental results by the CloudSim framework, the micro-genetic system reduced power consumption.
Dubey et al. (2020)  suggested a virtual machine placement approach that reduces the makespan while reducing power consumption. The proposed technique was tested in the simulator CloudSim toolkit, and the findings proved that it exceeded typical work utilizing FCFS, Round-Robin, EERACC, and Random algorithms. The result shows that the recommended technique beats the other four mentioned methods regarding energy and power usage, server utilization, and makespan.
Barthwal et al. (2021)  proposed AntPu ACO meta-heuristic predicted utilization for dynamically placing VMs in the cloud data center to minimize SLAV and energy utilization (EU). In CloudSim, a simulated environment is created, and the PlanetLab dataset is chosen because of its real-world properties. The CPU usage of VMs in five-minute intervals is shown in this data set. To assess the results, extensive simulations were run, showing that the proposed approach offers a significant improvement in energy utilization and SLA compared with other methods. AntPu improves performance by satisfying SLA, QoS, EC, VM migration, and PM overloading constraints.
Mirmohseni et al. (2021)  combined the outcomes of the particle swarm genetic optimization (PSGO) process. The findings were improved and a viable solution for load balancing operations was introduced by combining the advantages of these two algorithms. Instead of arbitrarily assigning the beginning population or data set in the GA, the most acceptable outcome is obtained by giving the starting population in their proposed approach, load balancing PSGO Improve Resource Allocation (LBPSGORA). The LBPSGORA method is compared to GA, PSO, and a hybrid GA-PSO approach. This method outperformed similar methods in terms of execution cost, load balancing, and time to completion. With task changes, the hybrid GA-PSO approach performs similarly to the suggested method. The LBPSGORA technique is 7.32% more effective in makespan and 6.87% more effective in execution cost compared to the hybrid GA-PSO. LBPSGORA outperformed the hybrid GA-PSO by 8.42%, GA by 10.61%, and PSO by 11.71% in terms of load matching.
Alharbi et al. (2021)  improved existing research that manages data center resources using two independent layers: applications allotted to VMs and VM placement to hosts; both are bin packing problems. This sequential double-layered bin packing (Consec2LBP) makes easier the issue solving and restricts added solution quality development. This research proposes an integrated ant colony optimization strategy to deal with the layers simultaneously to overcome this issue. It converts two-layer resource management into an optimization problem known as integrated double-layer bin packing (Int2LBP). Then, to solve this optimization challenge, a combined FFD technique known as Int2LBP_FFD was developed. To improve the quality of the result, the combined ant colony system Int2LBP_ACS is refined further using the Int2LBP_FFD result as a preliminary solution. In simulations of data centers based on GTC data logs, Int2LBP_FFD outperforms Consec2LBP_FFD. They’ve also shown that Int2LBP_ACS is better than Int2LBP_FFD concerning energy investments. The Int2LBP_ACS and Int2LBP_FFD algorithms provide scalability.
Salami et al. (2021)  offer a virtual machine placement problem (VMPP) based on the cuckoo search (CS) algorithm. New cost and perturbation metrics have been created to increase the algorithm’s performance. Two well-known benchmark datasets were used to evaluate the suggested technique. The main objective is to organize virtual machines into actual machines to minimize the number of devices required. It beat the reordered grouping genetic algorithm and the FFD, BFD, and multiCSA, an older CS approach.
M. H. Sayadnavard et al. (2022)  approached a technique for dynamic VMC, which included a prediction model based on DTMC, a VM selection algorithm, and e-MOABC-based VM placement. Using this model in conjunction with the dependability model of PMs results in a more exact classification of PMs depending on their condition. Then, a multi-objective VM placement approach is proposed using the e-dominance-based multi-objective artificial bee colony algorithm to find the optimum VMs to PMs mapping, which can efficiently manage overall energy consumption, resource usage, and system performance to meet SLA and QoS requirements. By completing a performance assessment study with the CloudSim toolkit and PlanetLab workload traces, the proposed system is proved to be effective. The suggested technique greatly decreases energy usage while avoiding excessive VM migrations, according to a competitive analysis of the experimental findings. The investigation of various parameters reveals that the suggested approach outperforms other algorithms. MOABC-VMC decreases energy consumption by 11.35% and 35.25%, respectively, when compared to RE-VMC and LR-MMT.
S. Malik et al. (2022) , proposed Evolutionary Algorithms and Machine Learning Methods to Predict Resource Utilization in cloud data centers. The primary goal was to resolve the over-and under-provisioning problems. Over-provisioning of resources results in higher expenses and increased energy use. However, under-provisioning results in SLA violations and a decline in quality of service (QoS). The research focuses on functional link neural networks (FLNN) using hybrid Genetic Algorithms (GA) and Particle Swarm Optimization (PSO) for multi-resource usage prediction. The suggested model produces improved accuracy when compared to conventional procedures, according to experimental results using data from Google Cluster Traces. This study’s primary objective was to examine how well neural networks predicted multi-resource allocation. The proposed model predicts using FLNN and trains the network weights using a hybrid GA-PSO. To manage a large number of users, resources must be dynamically scaled for effective usage, low energy consumption, low cost, and higher quality of service (QoS).
A brief report of the above detailed literature review and algorithms mentioned using metaheuristic methods with different workload data is given in Table 3. Table 4, summarises researchers work, methods, and comparison with their benchmark algorithm to evaluate energy consumption. Figure 11 depicts the percentage difference in energy reduction or energy savings in graphical form. The implementation of these algorithms has been tested with different settings. About the host specification, virtual machine characteristics, workload datasets, simulators or tools, and other measures for comparing the proposed method to their benchmark algorithm has already been discussed earlier.
Virtual machine management using machine learning techniques
Machine learning technique are approaches and set of technologies that use AI concepts. Machine learning enables researchers to use data to train a system on how to solve a problem using machine learning algorithms and improve over time. Machine learning is frequently classified by how an algorithm learns to improve its prediction accuracy. Supervised learning, unsupervised learning, semi-supervised learning, and reinforcement learning are the four fundamental methodologies. In a cloud computing environment reinforcement learning, neural network, k-nearest neighbor, and support vector machine algorithm are used by researchers to consume less amount of energy in cloud environment.
Jia et al. (2009)  have proposed a reinforcement learning method called VCONF, which automates the VM configuration process by addressing the system’s scalability and adaptability problems. By learning from repetitions with the environment, virtual machine configuration (VCONF) generates policies for the auto-configuration of VMs. This method achieves the best cloud setup while improving adaptability and scalability also. Experimental results demonstrated the system’s optimality in controlled problems, as well as its scalability and adaptability in a broader system. VCONF could be changed to a good configuration in seven steps and showed a 20% to 100% increase in throughput over simple RL approaches.
Vinh et al. (2010)  developed an energy-aware algorithm that uses a neural network (NN) to forecast upcoming load requirements built on previous data and reduces the number of hosts by shutting them down or restarting them as needed. Their research objective is to moderate the energy used in data centers. When the system load increases or decreases, the system turns on or off some hosts.
Niehorster et al. (2011)  have presented an approach for the provisioning of virtual machines using support vector machines (SVM). They created a self-configurable and self-optimized multi-agent system capable of learning its behaviour and estimating its cost. The system acquires performance models for various applications and develops a behaviour model, after which SVM is used to organize the data in the knowledge base.
Kousiouris et al. (2011)  depend on several parameters on VM performance prediction, persistent allocation proportions, VM co-placement, and instantaneous arrangement on the identical host. They used a genetic algorithm (GA) to improve an ANN and linear regression to study how well it could predict degradation.
Islam et al. (2012)  constructed a model for predicting future CPU resource requirements using the linear regression method. The input data set used historical data obtained by performing the Transaction Processing Performance Council (TPC), a typical client-server benchmark. To train the algorithm for prediction, the CPU utilization percentages of all VMs are used. They also used a neural network in the cloud for resource allocation and management. The neural network was trained with the back-propagation process, and experimental outcomes showed that NN-approximate predictions have a lower proportion error than LR-based predictions.
Cheng et al. (2012)  proposed a unified reinforcement learning technique for autonomously configuring virtual machines and their applications and adjusting the VM resources efficiently and providing quality service assurance. They came up with a good plan for running their research on Xen VMs using different workloads.
Farahnakian et al. (2013)  introduced a dynamic consolidation of virtual machines (DCVM) where the active number of hosts are minimized based on present and historical use. The k-nearest neighbour (KNN) method is used to forecast each host’s CPU utilization. To optimize dynamic VM consolidation, their prediction technique focuses on identifying overloading and underloading of hosts. The results indicated that their system consumes the least amount of energy while maintaining the SLA.
Farahnakian et al. (2014)  suggested a Reinforcement learning (RL) technique for dynamic consolidation of VM that uses a learning agent to find out the host’s power strategy. The agent selects the host to make it active or sleep. The RL learning agent optimizes the active host by learning system behavior. Experiments with PlanetLab workload traces show that their model lowers the cost of using energy, improves performance, and cuts down on SLA violations.
Minal et al. (2016)  Configure live VM migration using a support vector regression (SVR) model to forecast dirty pages using time series analysis. The service interruption time and migration duration were used to assess the performance of the live migration. They also created an ARIMA-based model, and findings show that SVR outperforms ARIMA in predicting dirty pages. Total pages transferred and migration time are the two most critical performance criteria for live migration in their proposed system.
Duggan et al. (2016)  developed a network-aware live migration technique that monitors bandwidth usage and takes appropriate action when there is network congestion based on experience Their structure functions as a decision support system, enabling a mediator to schedule VM migrations by determining the best time to do so. The amount of bandwidth available in the data center influences the migration process. According to their research findings, an agent in a cloud data center can learn available bandwidth during peak network capacity and schedule the migration of VMs from underutilized Hosts at the appropriate time using available bandwidth. They used the local regression approach to determine which hosts were overloaded. The Learning agent selects the best VM for migrating from an overloaded host while balancing migration and energy consumption. The findings of the research point to an autonomous VM selection method that can account for VM migration count and energy cost.
Duggan et al. (2017)  To create reliable predictions using time series data, researchers employed a recurrent neural network (RNN) to forecast future values of CPU consumption. They looked into the network’s accuracy for prediction with a deep effect. Experiments have shown that it is possible to get a very accurate estimate of CPU usage for dynamic data sets that change.
Qazi et al. (2017)  provided a real-time resource consumption prediction classification that takes actual resource usage and sends it to multiple buffers built on time and resource type. A system with real CPU utilization traces from a cloud data center with 120 servers used the autoregressive neural network method on data blocks where the data did not track a Gaussian distribution. The experimental findings suggest that AR-NN outperforms ARIMA for a given data set.
Shaw et al. (2017)  have presented the advanced RL consolidation agent method for VM allocation that is capable of optimizing VM circulation in the cloud data center while saving large amounts of energy and lowering SLA violations. They established a space for state-action. Action is defined as a combination of any host’s utilization rate and the size of the VM to be deployed, and state is well-defined as the entire active host as a percentage of the total host.
Sotiriadis et al. (2018)  proposed a VM scheduling strategy that uses extracted data from past VM and host resource utilizations to define host weights based on the resource utilization of hosted VMs on that host. They used SVM to classify VM states based on historical records. They used the resource utilization dataset (percentage of CPU, RAM, and disc usage) in the X-Y planes and expressed the data as vectors. The results of the experiments reveal that, through learning the system’s behavior, their method improved physical machine selection.
Mason et al. (2018)  using evolutionary NN, created a way to forecast the host’s CPU utilization. For network training, optimization approaches such as particle swarm optimization (PSO), differential evolution (DE), and covariance matrix adaptation evolutionary strategy (CMA-ES) are used. The outcomes of the experiments showed that CMA-ES performs better than other optimization strategies and trains networks to predict CPU consumption accurately.
Patel et al. (2019)  presented a load-balancing method based on energy-aware VM Migration. They perform it by assigning a lower and higher threshold to an individual host, which specifies whether the host is underloaded or overloaded. Before initiating the migrations, they used a prediction approach that predicts the demand on the host. Their process uses an artificial neural network (ANN) with the dynamic double threshold (DDThr) technique to predict VM movement and energy consumption while considering CPU utilization. Not only does it reduce the number of VM movements, but it also saves energy. Graphs comparing VM movement and energy utilization show that when ANN is combined with existing techniques, both VM movements, and energy utilization decrease slightly, saving a significant amount of electricity. To create a cloud environment, the CloudSim simulator was employed, and Matlab2015a was used to implement ANN. Based on the experiments, the proposed strategy uses less energy and has fewer migrations than the competitive approach.
Kumar et al. (2020)  provide a workload forecasting framework based on a NN (WFNN) model with supervised learning. To increase the predictive model’s learning efficiency, an upgraded and adaptable differential evolution method has been designed and developed. The algorithm determines the most appropriate crossover and mutation operators. Because of its adaptive nature in pattern learning from sampled data, the learning’s prediction accuracy and convergence rate have been seen to improve. The prediction model’s performance is assessed using real-world data traces from Google’s cluster and NASA’s Kennedy Space Center. A Python3 Jupyter notebook is used to implement the suggested model. The results are compared with other recent methods, and improvements of up to 97%, 91%, and 97.2% are observed over backpropagation, self-adaptive differential evolution, and average-based workload prediction techniques, respectively.
Saxena et al. (2021)  introduce an energy-efficient resource provisioning and management system to satisfy future applications’ dynamic demands. The proposed system addresses power consumption, performance, resource wastage, and QoS depletion by accurately matching the application’s expected resource demand with VM resource capacity. Consequently, condensing the whole load onto the smallest number of energy-efficient physical machines (PMs). The proposed work makes contributions in the form of online multi-resource feed-forward NN (OM-FNN) to predict resources, autoscaling of VMs, and allocation of scaled VMs on energy-efficient hosts. The suggested integrated solution has been rigorously evaluated using real resource usage traces from the Google cluster dataset, and it outperforms the other VMPs in terms of resource utilization and power savings by up to 21.12% and 88.5%, respectively. Also, the OM-FNN predictor is more accurate, takes less time, and uses less space than the single-input single-output feed-forward NN predictor.
Malik et al. (2022)  focuses on employing a hybrid Genetic Algorithm (GA) and Particle Swarm Optimization with a Functional Link Neural Network (FLNN) to anticipate the multi-resource utilization (CPU, memory, and network bandwidth). For resource usage prediction, the programme employs models from convolutional neural networks (CNN) and long short-term memory (LSTM). Experimental findings using Google cluster traces demonstrate that the suggested model outperforms conventional methods in terms of accuracy. This study’s major objective was to examine how well neural networks forecast the use of several resources. FLNN is used for prediction, while hybrid GA-PSO is used to train the network weights. Therefore, to manage a high number of users, the resources need to be scaled dynamically for optimal use, decreased energy consumption, and cost, with better quality of service (QoS).
A brief description of the above detailed literature review and algorithms developed using machine learning methods with different workload data is given in Table 5. Table 6, summarises the work, methods, and comparison with their benchmark algorithm to evaluate energy consumption by different researchers. Figure 12 depicts the percentage difference in energy reduction or energy savings in graphical form. The implementation of these algorithms has been tested for different settings. The authors have already mentioned the host specification, characteristics of virtual machine, workload datasets, simulators environment, and other criteria for comparing the proposed method to their benchmark algorithm.
Virtual machine management using statistical techniques
Statistical methods are used in research planning, analyzing, data collecting, meaningful interpretations, and reporting the findings of various virtual machine management. In cloud computing researchers work on mean, standard deviation, regression, ARIMA, PPRGear, etc. to detect overload and underload hosts, resource prediction, VM allocation, VM migration, and VM placement to save energy consumption.
Cao et al. (2012)  proposed strategies for dynamically combining VMs in a virtualized data center to reduce SLAV and energy utilization. The authors suggested detecting host overload, VM selection, and allocation strategy. The author’s uses mean and standard deviation CPU utilization metrics to determine overloaded hosts. The extension of the maximum correlation (MCE) strategy was utilized to select VMs for migration with mean and variance-related computations for VM allocation. Experiments using PlanetLab traces on CloudSim revealed that the new framework, which consists of the policies listed above, outperforms the previous policies in the requisites of energy utilization and overall QoS. However, it performed slightly worse in the requisites of energy utilization. As a result, managing the energy-performance trade-off is difficult.
Farahnakian et al. (2013)  Using PlanetLab historical data, a linear regression method was proposed to forecast the upcoming CPU use of the host (LIRCUP). Authors discovered a relationship between expected and current CPU use, where expected utilization is a dependent variable and current utilization is an autonomous variable. The LIRCUP algorithm detects overloaded hosts and maintains SLA and energy utilization by transferring some VMs from the overburdened hosts by comparing the expected CPU utilization value with the present utilization.
Nadjar et al. (2015)  present a decentralized scheduling strategy for DCVMs fitted with an auto-regressive integrated moving average (ARIMA) technique to progress resource provisioning by predicting VM resource usage to decrease SLAV and energy utilization in cloud data centers. Global Manager uses first fit decreasing, Cluster Manager uses max load VMP, and Local Manager uses the ARIMA model in their model. As a result, by utilizing ARIMA upper-bound prediction, it is possible to obtain a 90% reduction in migration and SLA violation rates and a 5.4% increase in energy savings. The CloudSim simulator was used to evaluate the method’s efficiency with recently proposed approaches that employed the same workload and experimental settings.
Ruan et al. (2015)  define performance-to-power ratio (PPR) as conscious virtual machine distribution in energy-efficient clouds. They describe “PPRGear,” a novel VM allocation mechanism that takes advantage of performance-to-power ratios for diverse types of hosts. PPRGear can ensure that the host devices use the least amount of power possible. Thus, this drastically lowers the energy usage with minimal performance loss. The proposed algorithm outperforms the competition.
Abdelsamea et al. (2017)  introduced multiple regression host overload detection (MRHOD) procedures that practice memory, CPU, and bandwidth to detect host overload and save energy significantly. They used a combination of factors to manage VMs while keeping energy consumption and SLAs low. They also created the hybrid local regression host overload detection (HLRHOD) method based on LR with hybrid variables. This algorithm outperforms single-factor methods.
Khoshkholghi et al. (2017)  by developing a method for overloaded host detection using iterative weighted linear regression (IWLR), which takes SLA constraints for data centers into consideration, researchers forecasted a dynamic, cost-effective, and energy-efficient management of virtual machines.
Hemavathy et al. (2019)  provide a prediction-based thermal aware server consolidation (PTASC) model, an integration technique that considers numeric and local architecture, as well as service level agreement. PTASC uses a statistical learning approach to consolidate servers (VM Migration). Cloud computing is a method of supplying essential resources by optimizing the usage of data-center resources, which raises energy costs. To reduce energy costs and enhance usage, new energy-efficient methods are proposed that reduce the overall energy consumption of computing and storage.
Lianpeng et al. (2019)  Based on the suggested robust simple linear regression (RobustSLR) prediction model, the authors developed a host overloading/underloading detection technique and a novel VM placement strategy for SLA-aware and energy-efficient virtual machine consolidation in cloud data centers. Unlike native linear regression, the proposed approaches update the forecast and slant toward over-prediction by including the error using eight ways of calculating the error. Researchers examined suggested techniques for the test by extending the CloudSim simulator with PlanetLab and random workload. The experimental findings demonstrate that the suggested approach can minimize SLA violation rates up to 99.16% and energy usage up to 25.43%.
Xialin Liu et al. (2020)  proposed dynamic consolidation using migration thrashing (MT), which prioritizes VMs with high dimensions, significantly decreasing MT. The degree of migrations required maintaining service level agreements (SLAs) by keeping VMs prone to relocation thrashing on the identical physical servers rather than migrating. Their method improves the relocation thrashing measured around 28%, the number of movements measured around 21%, and the SLAV measured around 19%. When the server is overloaded, their solution detects VMs with sufficient capacity by restricting the transfer of VMs with excessive capacity. The suggested techniques were proven to work by simulating large-scale research setting with a workload data set from many PlanetLab VMs.
Maryam C.-Samani et al. (2020)  suggested predictive consolidation of virtual machines (PCVM) using the ARIMA approach, which focuses on the DCVM over the fewest number of real servers. It also reduces the number of unnecessary migrations, detects PM overloading, and enforces SLAs using the ARIMA prediction model. Furthermore, the DVFS approach is utilized to determine the best frequency for heterogeneous physical devices. The experimental findings reveal that, the given framework greatly reduces energy usage while improving QoS characteristics as compared to various baseline techniques. The suggested solution was simulated using MATLAB and CloudSim with real-world PlanetLab workloads.
A brief description of the above detailed literature review and algorithms developed using statistical methods with different workload data is given in Table 7. Table 8, summarises the work, methods, and comparison with their benchmark algorithm to evaluate energy consumption by different researchers. Figure 13 depicts the percentage difference in energy reduction or energy savings in graphical form. The implementation of these algorithms has been tested with different settings. The authors have already discussed the specification of the host, characteristics of the virtual machine, workload datasets, simulator environment, and other criteria for comparing the proposed method to their benchmark algorithm.
Most of the above researchers have used PlanetLab workload traces, as shown in Table 9, or Bitbrains workload traces, as shown in Table 10 for simulation in CloudSim, Matlab, Java, or other environments are given below. Half of the 800 physical nodes in PlanetLab’s simulated data center are HP ProLiant ML110G4 systems, while the other half are HP ProLiant ML110G5 systems, as depicted in Table 11. For the smooth conduction of simulation, the power modeling has been configured in CloudSim as shown in Table 11.
The result of the review paper work is to find the current research outcomes in energy-efficient resource management as stated in different sections. Table 2, and Fig. 10 represent the saving of energy by up to 90% by different researchers using the heuristic method. The objectives addressed in the evaluation of this method were VM placement, VM allocation, VM migration, and resource utilization. In next section, the authors’ metaheuristic approaches were performed to address the objectives of VM consolidation, load balancing, resource management, PM overloading, VM migration, and VM placement. Metaheuristic methods in Table 4 and Fig. 11 showed an improvement in energy savings of up to 95%. Similarly, machine learning algorithms were presented to address the objectives of VM performance, prediction usage of resources, VM scheduling, dynamic consolidation, and resource management. With this approach, the reduction in energy consumption up to 88% has been shown as compared with other methods which has been illustrated in Table 6 and Fig. 12. In the last approach mentioned in this paper, researchers used statistical methods to perform host overload/underload detection, dynamic consolidation of VMs, utilization prediction, and VM allocation. This approach reduces the energy consumption up to 84%, as shown in Table 8 and Fig. 13. The outcomes of the review work are measured in terms of SLA, energy consumption, and the number of migrations against the different numbers of VMs. This review work focuses on energy utilization by different approaches in consolidating virtual machines. The results show that there has been an improvement in energy saving in the outcome of all the researchers by using different techniques. Other research outcomes include the use of integrated and combined approaches for utilization prediction, utilization, virtual machine consolidation, overload detection, VM selection, VM migration, and VM placement.
Major issues, suggestions, and future works
In this paper, the authors have outlined energy-efficient strategies for cloud computing. Several methods have been investigated, and their findings with parameters are listed in the tables. This paper can help people to find out the pros and cons of proposed energy-efficient algorithms that are motivated by researchers.
One of the main issues in cloud computing is using energy effectively, which necessitates the development of an eco-friendly environment. To meet SLAs, service providers must provide continuous power to data centers. This way, the data centers consume a large amount of energy and raise the cost of investment. However, the rising demand for cloud infrastructure has significantly increased the data center energy usage, which has become a crucial concern. As a result, energy-efficient solutions are necessary to reduce this energy utilization. Another significant challenge is the system’s reliability degradation because of the high frequency of consolidation and deploying VMs on PMs. Cloud efficiency is the capacity to make greater use of cloud resources at the lowest feasible cost. Other issues that must be addressed include scheduling challenges while PM-VM mapping for each user task, resource utilization prediction accuracy, overload, and underload host detection problems, and adaptive threshold estimation. Moreover, VM selection from the overloaded host, access to a real cloud data center to perform an experiment in a real environment, and improving user satisfaction along with the service providers are also various research challenges.
Most of the researchers have performed simulations in the CloudSim framework in an Infrastructure as a Service (IaaS) environment. In CloudSim, development tools, middleware technology, database management, resource computation, etc. help create and control cloud applications. Logical architecture is based on local and global managers. Cloud architecture is the organization of various components, including applications, databases, on-demand resources, storage, middleware, network devices, and software capabilities to provide services. Increased power use is a longstanding problem in today’s computer environment. The rise of applications using complex data has resulted in the construction of large data centers, which has increased the need for energy. According to the above analysis of energy-efficient strategies, the majority of the effort to minimize energy utilization in data centers is done by utilizing dynamic VM consolidation and resource management methods. Some researchers suggest multi-objective  algorithms that primarily address SLA, QoS, and resource usage while consuming less energy in cloud data centers. There has been little work done on heterogeneous physical devices, which requires considerable attention from the scientific community. Some major issues in current energy management techniques are prediction utilization of different resources ; mapping of VMs to PMs; host overload issues; VM selection from overloaded hosts; access to a real cloud data center; and VM placement. As VM placement is an NP-hard problem, metaheuristic approaches are the best suitable technique, which increases the complexity.
This research contributes significantly to provide important information related to the reduction of data center energy consumption, financial expenses, and the provision of QoS, hence assisting in the development of a strong, competitive cloud computing sector. This is especially crucial in the current green environment, where customers are becoming more environmentally concerned. Furthermore, according to recent research, data centers are a huge and rapidly rising energy-consumption sector of the economy, as well as a substantial source of CO2 emissions. Also, the research done by  the use of blockchain technology and cloud solutions facilitates and improves not only the aggregation of data and secured access to it, but also has a huge impact on the reduction of CO2 emissions and reduces the carbon footprint. Hence, reducing greenhouse gas (GHG) emissions is an important energy policy objective for many nations, as well as achieving the United Nations Sustainable Development Goal (SDG) to transform the world by 2030. As a result, global research efforts should focus on the open problems described in this work to improve energy-efficient resource management approaches in cloud computing systems. Also, the researchers’ plan should be centred on reducing energy use and increasing resource use without hurting the performance of the system.
Summary and conclusion
Data centers consume a tremendous amount of electricity for computing user applications as well as cooling their equipment. Improving energy efficiency in data centers may reduce greenhouse gas (GHG) emissions, air pollution, and the amount of water utilized in power generation. So, minimizing energy consumption has been a key challenge in recent years. As a result, it is one of the key study areas in cloud computing. Many researchers are concentrating their efforts on lowering the energy usage of data center infrastructures. This review article looks at virtual and physical machine consolidation strategies using various methodologies to save energy. These strategies look at global energy conservation and resource management. As a result, resource usage increases, and data center energy consumption decreases. This paper aims to identify energy consumption research that has been conducted using various heuristics, metaheuristics, machine learning, and statistical methods. VM selection and migration, host CPU usage prediction, overload detection, and VM placement have been used to manage resources and efficient use energy. The energy savings achieved through various strategies are compared in this paper. Various researchers tested several strategies in cloud data centers to reduce energy consumption and SLAV. In the heuristic approach, researchers have saved from 5.4% to 90% of energy with their proposed method when compared with the existing methods. Similarly, the metaheuristic approach reduces energy consumption from 7.68% to 97%. The machine learning method and the statistical method save energy from 1.6% to 88.5%, and 5.4% to 84% respectively when compared to the benchmark approaches considering a variety of settings and parameters. So, energy saving can be maximized up to 90% using different approaches in respect of consolidation of VMs, prediction of workload traces, utilization of resources, host underload/overload detection, VM selection, VM migration, and VM placement. The results of this study could help researchers come up with new ideas for research that will add to their knowledge and make it easier to use energy efficiently in cloud computing. So, the overall outcome of this review paper is to understand different techniques of energy-saving applied in the cloud data centers. As the field of cloud computing is increasing day by day and its application area is increasing, focus must be on the different methods of energy consumption in cloud data centers.
Mell P, Grance T (2011) The NIST definition of cloud computing
Wyld DC (2009) Moving to the cloud: an introduction to cloud computing in government, IBM center for the business of government
Sethi N (2019) The cloud environment and its basics: a review. Int J Comput Technol 6(1):82–88
Sotomayor B, Montero RS, Llorente IM, Foster I (2009) Virtual infrastructure management in private and hybrid clouds. IEEE Internet Comput 13(5):14–22
Kaur T, Chana I (2018) GreenSched: an intelligent energy-aware scheduling for deadline-and-budget constrained cloud tasks, simulation modelling practice, and theory. 82:55–83. ISSN 1569-190X. https://doi.org/10.1016/j.simpat.2017.11.008
Albers S (2010) Energy-efficient algorithms. Commun ACM 53:86–96. https://doi.org/10.1145/1735223.1735245
Çağlar İ, Altılar DT (2022) Look-ahead energy-efficient VM allocation approach for data centers. J Cloud Comput 11(11). https://doi.org/10.1186/s13677-022-00281-x
Conti J, Holtberg P, Diefenderfer J, LaRose A, Turnure JT, Westfall L International Energy Outlook 2016 With projections to 2040, United States. https://doi.org/10.2172/1296780
Milani AS, Navimipour NJ (2016) Load balancing mechanisms and techniques in the cloud environments: systematic literature review and future trends. J Netw Comput Appl 71:86–98. ISSN 1084–8045. https://doi.org/10.1016/j.jnca.2016.06.003
Malik N, Sardaraz M, Tahir M, Shah B, Ali G, Moreira F (2021) Energy-efficient load balancing algorithm for workflow scheduling in cloud data centers using queuing and thresholds. Appl Sci 11(13):5849. https://doi.org/10.3390/app11135849
Singh S, Chana I (2016) Cloud resource provisioning: survey, status, and future research directions, knowledge and information system. 49:1005–1069. https://doi.org/10.1007/s10115-016-0922-3
Mohit Kumar SC, Sharma AG, Singh SP (2019) A comprehensive survey for scheduling techniques in cloud computing. J Netw Comput Appl 143:1–33. ISSN 1084–8045. https://doi.org/10.1016/j.jnca.2019.06.006
Singh S, Chana I (2016) A survey on resource scheduling in cloud computing: issues and challenges. J Grid Computing 14:217–264. https://doi.org/10.1007/s10723-015-9359-2
Chaurasia N, Kumar M, Chaudhry R et al (2021) Comprehensive survey on energy-aware server consolidation techniques in cloud computing. J Supercomput 77:11682–11737. https://doi.org/10.1007/s11227-021-03760-1
Bui DM, Tu NA, Huh EN (2021) Energy efficiency in cloud computing based on mixture power spectral density prediction. J Supercomput 77:2998–3023. https://doi.org/10.1007/s11227-020-03380-1
Bobroff N, Kochut A, Beaty K (2007) Dynamic placement of virtual machines for managing SLA violations, 10th IFIP/IEEE international symposium on integrated network management
Varrette S, Guzek M, Plugaru V, Besseron X, Bouvry P (2013) HPC performance and energy-efficiency of Xen, KVM and VMWare hypervisors, 25th international symposium on computer architecture and high-performance computing
Gelenbe E (2009) Steps toward self-aware networks. Commun ACM 52(7):66–75
Berl A, Gelenbe E, Girolama M, Giuliani G, Meer H, Dang MQ, Pentikousis K (2010) Energy-efficient cloud computing. Comput J 53(7):1045–1051
Luo, L., et al., (2012) A resource scheduling algorithm of cloud computing based on energy efficient optimization methods
Buyya R, Broberg J, Goscinski AM (2010) Cloud computing: principles and paradigms, Vol. 87. John Wiley & Sons
Zhang Q, Cheng L, Boutaba R (2010) Cloud computing: state-of-the-art and research challenges. J Internet Serv Appl 1:7–18. https://doi.org/10.1007/s13174-010-0007-6
Buyya R, Beloglazov A, Abawajy J (2010) Distributed, parallel, and cluster computing (cs. DC), energy-efficient management of data center resources for cloud computing: a vision, architectural elements, and open challenges. arXiv. https://doi.org/10.48550/arXiv.1006.0308
Buyya R, Ranjan R, Calheiros RN (2009) Modeling and simulation of scalable cloud computing environments and the CloudSim toolkit: challenges and opportunities, in 2009 international conference on high-performance computing & simulation
Calheiros RN et al (2011) CloudSim: a toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisioning algorithms. Software Pract Exper 41(1):23–50
Park K, Pai VS (2006) CoMon: a mostly-scalable monitoring system for PlanetLab. ACM SIGOPS Oper Syst Rev 40(1):65–74
Moges FF, Abebe SL (2019) Energy-aware VM placement algorithms for the OpenStack neat consolidation framework. J Cloud Comput 8(1):1–14
Shen S, Van Beek V, Iosup A (2015) Statistical characterization of business-critical workloads hosted in cloud data centers. In: Proc - 2015 15th IEEE/ACM international symposium on cluster, cloud and grid computing 2015, pp 465–474. https://doi.org/10.1109/CCGrid.2015.60 arXiv:1302.5679v1
Anoep S, Dumitrescu C, Epema D, Iosup A, Jan M, Li H, Wolters L. The grid workloads archive: bitbrains. http://gwa.ewi.tudelft.nl/datasets/gwa-t12-bitbrains
Amvrosiadis G, Park JW, Ganger GR, Gibson GA, Baseman E, DeBardeleben N (2018) On the diversity of cluster workloads and its impact on research results. In: Proceedings of USENIX ATC
Reiss C, Tumanov A, Ganger GR, Katz RH, Kozuch MA (2012) Heterogeneity and dynamicity of clouds at scale: google trace analysis. In: Proceedings of ACM SoCC, pp 1–13
Liu Q, Zhibin Y (2018) The elasticity and plasticity in semicontainerized co-locating cloud workload: a view from Alibaba trace. In: Proceedings of ACM SoCC, pp 347–360
Shahrad M, Fonseca R, Goiri Í, Chaudhry G, Batum P, Cooke J, Laureano E, Tresness C, Russinovich M, Bianchini R (2020) Serverless in the wild: characterizing and optimizing the serverless workload at a large cloud provider. In: Proceedings of USENIX ATC, pp 205–218
Luo S, Xu H, Lu C, Ye K, Xu G, Zhang L, Yu D, He J, Xu C (2021) Characterizing microservice dependency and performance: Alibaba trace analysis, SoCC'21
Buyya R, Beloglazov A, Abawajy J (2010) Energy-efficient management of data center resources for cloud computing: a vision, architectural elements, and open challenges
Katal A, Dahiya S, Choudhury T (2022) Energy efficiency in cloud computing data centers: a survey on software technologies. Clust Comput. https://doi.org/10.1007/s10586-022-03713-0
Zhang Y, Liu J (2022) Prediction of overall energy consumption of data centers in different locations, Sensors. 22:3704. https://doi.org/10.3390/s22103704
Teng F et al (2017) Energy efficiency of VM consolidation in IaaS clouds. J Supercomput 73(2):782–809
Zhou Z et al (2018) Minimizing SLA violation and power consumption in cloud data centers using adaptive energy-aware algorithms. Future Gene Comput Syst 86:836–850
Khosravi A (2017) Energy, and carbon-efficient resource management in geographically distributed cloud data centers
Kaur T, Chana I (2015) Energy efficiency techniques in cloud computing: a survey and taxonomy. ACM Comput Surv 48(2):1–46
Clark C, Fraser K, SH, J., & Hansen, E. (2005) Warfield, live migration of virtual machines. In: Jul C, Limpach I, Pratt A (eds) In proceedings of the 2nd ACM/USENIX symposium on networked systems design and implementation (NSDI), pp 273–286
Beloglazov A, Buyya R (2012) Optimal online deterministic algorithms and adaptive heuristics for energy and performance efficient dynamic consolidation of virtual machines in cloud data centers, concurrency and computation. Pract Exper 24(13):1397–1420
Srikantaiah S, Kansal A, Zhao F (2008) Energy-aware consolidation for cloud computing. Clust Comput 12:1–15
Beloglazov A, Buyya R (2010) Energy-efficient resource management in virtualized cloud data centers,10th IEEE/ACM international conference on cluster, cloud and grid computing
Beloglazov A, Abawajy J, Buyya R (2012) Energy-aware resource allocation heuristics for efficient management of data centers for cloud computing. Future Gene Comput Syst 28(5):755–768
Ghobaei-Arani M et al (2018) A learning-based approach for virtual machine placement in cloud data centers. Int J Commun Syst 31(8):1–18
Wang H, Tianfield H (2018) Energy-aware dynamic virtual machine consolidation for cloud datacenters. IEEE Access 6:15259–15273
Bhattacherjee S et al (2020) Energy-efficient migration techniques for cloud environment: a step toward green computing. J Supercomput 76(7):5192–5220
Liu X et al (2020) Virtual machine consolidation with minimization of migration thrashing for cloud data centers. Math Probl Eng 2020:1–13
Jangiti S, Shankar Sriram VS (2020) EMC2: energy-efficient and multi-resource- fairness virtual machine consolidation in cloud data centers. Sustain Comput: Inform Syst 27:100414. ISSN 2210–5379. https://doi.org/10.1016/j.suscom.2020.100414
Garg V, Jindal B (2021) Energy-efficient virtual machine migration approach with SLA conservation in cloud computing. J Cent South Univ 28(3):760–770
Alharbi F et al (2021) Simultaneous application assignment and virtual machine placement via ant colony optimization for energy-efficient enterprise data centers. Clust Comput 24:1255–1275
Kaur T, Kumar A (2022) Power aware energy efficient virtual machine migration (PAEEVMM) in cloud computing. In: Satyanarayana C, Samanta D, Gao XZ, Kapoor RK (eds) High performance computing and networking. Springer, Singapore. https://doi.org/10.1007/978-981-16-9885-9_46 Lecture notes in electrical engineering, vol 853
Kousiouris G, Cucinotta T, Varvarigou T (2011) The effects of scheduling, workload type, and consolidation scenarios on virtual machine performance and their prediction through optimized artificial neural networks. J Syst Softw 84(8):1270–1291
Aryania A, Aghdasi HS, Khanli LM (2018) Energy-aware virtual machine consolidation algorithm based on ant colony system. J Grid Comput 16(3):477–491
Goyal et al (2019) An optimized model for energy efficiency on cloud system using PSO & CUCKOO search algorithm. Int J Innov Technol Explor Eng 8(9S):2278–3075
Tarahomi M, Izadi M, Ghobaei-Arani M (2020) An efficient power-aware VM allocation mechanism in cloud data centers: a micro genetic-based approach. Clust Comput 24(2):919–934
Dubey et al., (2020), A simulated annealing based energy-efficient vm placement policy in cloud computing, 2020 international conference on emerging trends in information technology and engineering (ic-ETITE)
Barthwal V, Rauthan MMS (2021) AntPu: a meta-heuristic approach for energy-efficient and SLA aware management of virtual machines in cloud computing. Memetic Computing 13(1):91–110
Mirmohseni SM, Javadpour A, Tang C (2021) LBPSGORA: create load balancing with particle swarm genetic optimization algorithm to improve resource allocation and energy consumption in clouds networks. Math Probl Eng 2021:1–15.
Salami et al., (2021). An energy-efficient cuckoo search algorithm for virtual machine placement in cloud computing data centers. The Journal of Supercomputing 77(11):13330–13357.
Sayadnavard MH, Haghighat AT, Rahmani AM (2022) A multi-objective approach for energy-efficient and reliable dynamic VM consolidation in cloud data centers, engineering science and technology. Int J 26:100995. ISSN 2215–0986. https://doi.org/10.1016/j.jestch.2021.04.014
Malik S, Tahir M, Sardaraz M, Alourani A (2022) A resource utilization prediction model for cloud data centers using evolutionary algorithms and machine learning techniques. Appl Sci 12(4):2160. https://doi.org/10.3390/app12042160
Rao J et al (2009) Vconf: a reinforcement learning approach to virtual machines auto-configuration. In: Proceedings of the 6th international conference on Autonomic computing
Duy TVT, Sato Y, Inoguchi Y (2010) Performance evaluation of a green scheduling algorithm for energy savings in cloud computing, IEEE international symposium on parallel & distributed processing, workshops and Ph.D. forum (IPDPSW)
Niehorster, O., et al., (2011), Autonomic resource management with support vector machines, IEEE/ACM 12th international conference on grid computing
Islam S et al (2012) Empirical prediction models for adaptive resource provisioning in the cloud. Future Gene Comput Syst 28(1):155–162
Xu C-Z, Rao J, Bu X (2012) URL: a unified reinforcement learning approach for autonomic cloud management. J Parallel Distrib Comput 72(2):95–105
Farahnakian F et al (2013) Energy-aware consolidation algorithm based on k-nearest neighbor regression for cloud data centers, Department of IT, University of Turku. IEEE/ACM 6th International Conference on Utility and Cloud Computing, Finland
Farahnakian F, Liljeberg P, Plosila J (2014) Energy-efficient virtual machines consolidation in cloud data centers using reinforcement learning. In: 22nd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing, pp 500–507
Patel M, Chaudhary S, Garg S (2016) Machine learning-based statistical prediction model for improving the performance of live virtual machine migration. J Eng 2016:1–9
Duggan M et al (2017) A reinforcement learning approach for the scheduling of live migration from underutilized hosts. Memetic Computing 9(4):283–293
Duggan M et al (2017) Predicting host CPU utilization in cloud computing using recurrent neural networks. In: In 2017 12th international conference for internet technology and secured transactions (ICITST)
Zia Ullah Q, Hassan S, Khan GM (2017) Adaptive resource utilization prediction system for infrastructure as a service cloud. Comput Intell Neurosci 2017:1–12
Shaw R, Howley E, Barrett E (2017) An advanced reinforcement learning approach for energy-aware virtual machine consolidation in cloud data centers. In: 12th international conference for internet technology and secured transactions (ICITST)
Sotiriadis S, Bessis N, Buyya R (2018) Self-managed virtual machine scheduling in cloud systems. Inf Sci 433:381–400
Mason K et al (2018) Predicting host CPU utilization in the cloud using evolutionary neural networks. Future Gene Comput Syst 86:162–173
Patel D, Gupta RK, Pateriya R (2019) Energy-aware prediction-based load balancing approach with VM migration for the cloud environment. In: Data, engineering and applications. Springer, pp 59–74
Kumar J, Saxena D, Singh AK, Mohan A (2020) Biphase adaptive learning-based neural network model for cloud datacenter workload forecasting. Soft Comput 24(19):14593–14610. https://doi.org/10.1007/s00500-020-04808-9
Saxena D, Singh AK (2021) A proactive autoscaling and energy-efficient VM allocation framework using online multi-resource neural network for cloud data center. Neurocomputing 426:248–264. ISSN 0925–2312. https://doi.org/10.1016/j.neucom.2020.08.076
Cao Z, Dong S (2012) Dynamic VM consolidation for energy-aware and SLA violation reduction in cloud computing. In: IEEE 13th international conference on parallel and distributed computing, applications and Technologies
Farahnakian F, Liljeberg P, Plosila J (2013) LiRCUP: linear regression-based CPU usage prediction algorithm for live migration of virtual machines in data centers. In: 39th Euromicro conference on software engineering and advanced applications (SEAA), pp 358–364
Nadjar A, Abrishami S, Deldari H (2015) Hierarchical VM scheduling to improve energy and performance efficiency in IaaS Cloud data centers. In: 5th international conference on computer and knowledge engineering (iccke)
Ruan X, Chen H (2015) Performance-to-power ratio aware virtual machine (VM) allocation in energy-efficient clouds. In: IEEE international conference on cluster computing, pp 264–273
Abdelsamea A et al (2017) Virtual machine consolidation enhancement using hybrid regression algorithms. Egypt Inform J 18(3):161–170
Khoshkholghi MA et al (2017) Energy-efficient algorithms for dynamic virtual machine consolidation in cloud data centers. IEEE Access 5:10709–10722
Hemavathy M, Anitha R (2019) Green aware based VM-placement in cloud computing environment using extended multiple linear regression model. In: Emerging trends in computing and expert technology, COMET 2019. Springer, Cham. https://doi.org/10.1007/978-3-030-32150-5_53 Lecture notes on data engineering and communications technologies, vol 35
Li L, Dong J, Zuo D, Wu J (2019) SLA-aware and energy-efficient VM consolidation in cloud data centers using robust linear regression prediction model. IEEE Access 7:9490–9500
Chehelgerdi-Samani M, Safi-Esfahani F (2020) PCVM.ARIMA: predictive consolidation of virtual machines applying the ARIMA method. J Supercomput 77:2172–2206. https://doi.org/10.1007/s11227-020-03354-3
Chen J, Du T, Xiao G (2021) A multi-objective optimization for resource allocation of emergent demands in cloud computing. J Cloud Computing 10(20):1–17
Karaszewski R, Modrzyński P, Modrzyńska J (2021) The use of Blockchain Technology in Public Sector Entities Management: an example of security and energy efficiency in cloud computing data processing. Energies 14(7):1873. https://doi.org/10.3390/en14071873
The authors would like to thank the editor and reviewers for their suggestions and improvement to the manuscript.
The authors are not affiliated with any entity that has a direct or indirect financial interest in the subject matter covered in the paper.
This manuscript has not been submitted to or is currently being reviewed by any other journal or publication platform.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Panwar, S.S., Rauthan, M.M.S. & Barthwal, V. A systematic review on effective energy utilization management strategies in cloud data centers. J Cloud Comp 11, 95 (2022). https://doi.org/10.1186/s13677-022-00368-5