Multiobjective trajectory optimization algorithms for solving multi‑UAV‑assisted mobile edge computing problem

The Internet of Things (IoT) devices are not able to execute resource‑intensive tasks due to their limited storage and computing power. Therefore, Mobile edge computing (MEC) technology has recently been utilized to pro‑ vide computing and storage capabilities to those devices, enabling them to execute these tasks with less energy consumption and low latency. However, the edge servers in the MEC network are located at fixed positions, which makes them unable to be adjusted according to the requirements of end users. As a result, unmanned aerial vehicles (UAVs) have recently been used to carry the load of these edge servers, making them mobile and capable of meeting the desired requirements for IoT devices. However, the trajectories of the UAVs need to be accurately planned in order to minimize energy consumption for both the IoT devices during data transmission and the UAVs during hovering time and mobility between halting points (HPs). The trajectory planning problem is a complicated optimization prob‑ lem because it involves several factors that need to be taken into consideration. This problem is considered a multi‑ objective optimization problem since it requires simultaneous optimization of both the energy consumption of UAVs and that of IoT devices. However, existing algorithms in the literature for this problem have been based on converting it into a single objective, which may give preference to some objectives over others. Therefore, in this study, several multiobjective trajectory planning algorithms (MTPAs) based on various metaheuristic algorithms with variable population size and the Pareto optimality theory are presented. These algorithms aim to optimize both objectives simultaneously. Additionally, a novel mechanism called the cyclic selection mechanism (CSM) is proposed to manage the population size accurately, optimizing the number of HPs and the maximum function evaluations. Furthermore, the HPs estimated by each MTPA are associated with multiple UAVs using the k‑means clustering algorithm. Then, a low‑complexity greedy mechanism is used to generate the order of HPs assigned to each UAV, determining its trajectory. Several experiments are conducted to assess the effectiveness of the MTPAs with variable population size and cyclic selection mechanisms. The experimental findings demonstrate that the MTPAs with the cyclic selection mechanism outperform all competing algorithms, achieving better outcomes.


Introduction
Mobile edge computing (MEC) technology has recently emerged to enhance the delivery of services to end users, with a focus on minimizing response time, particularly for real-time applications, while also supporting mobility and location awareness [1].However, IoT devices face limitations in terms of storage capacity and computing power, making them unsuitable for resource-intensive tasks.To address these limitations, MEC technology has been employed to provide storage and computing services to IoT devices, enabling them to execute resourceintensive IoT tasks with lower EC and reduced latency [2].Nevertheless, the fixed location of the edge servers poses a significant constraint on the effectiveness of this technology, potentially hindering the provision of desired services to end users.To overcome this limitation, unmanned aerial vehicles (UAVs) have recently been employed in conjunction with edge servers to provide computing power for IoT devices [2].However, careful planning of UAV trajectories is essential to minimize EC during data transmission for IoT devices, as well as reduce energy usage during UAV hovering and mobility between halting points (HPs).The trajectory planning problem (TPP) is a complex optimization challenge due to multiple factors that must be considered, including the number and positioning of HPs, the relationship between HPs and UAVs, and the sequencing of HPs for each UAV.The number of HPs should be variable during the optimization process because it is unknown a priori.This presents a significant difficulty for the traditional gradient-based techniques because a gradient vector is not explicitly defined [3].Metaheuristic algorithms are the best alternative to solve this problem because they are gradient-free.Those algorithms optimize a population of particles or individuals, and each individual typically represents either an entire deployment (number of HPs and their positions) for the UAV or an HP in the entire deployment.
Several recent studies have proposed various approaches to enhance the performance of MEC systems utilizing UAVs.For instance, Zhang et al. [4] addressed the resource allocation problem in UAV-supported MEC systems, aiming to optimize the transmit power of the vehicle and the trajectory of the UAV in order to minimize the total EC.They formulated this problem as an optimization task and employed a search algorithm to find a near-optimal trajectory.In another study, Asim et al. [5] presented a trajectory planning approach based on evolutionary algorithms for determining the best trajectories for multiple UAVs in multi-UAV-assisted MEC systems.This approach consisted of four stages.Firstly, the genetic algorithm (GA) was used to search for the near-optimal deployment of halting points (HPs) for UAVs.Secondly, duplicated HPs were eliminated using a removal operator.Thirdly, a clustering algorithm based on differential evolution was employed to assign the HPs to different clusters.Finally, in the last stage, GA was utilized to establish the order of HPs for each UAV.These studies contribute to the improvement of MEC systems through UAV integration, with a focus on optimizing EC and trajectory planning.
Li et al. [6] proposed an optimization approach for jointly optimizing the resource allocation and UAV trajectory in the UAV-powered MEC systems.This approach based on the improved atomic orbital search (AOS) was used to minimize the total EC by jointly optimizing the transmit power allocation, CPU frequency allocation, time division, and flight trajectory.Wang et al. [7] proposed an optimization framework for the joint optimization of the geographical fairness between the end users (EU), the justice of each UAV's EU-load, and the EU's overall EC.This algorithm was based on multiagent deep reinforcement learning to search for the optimal trajectory for each UAV separately.Savkin et al. [8] proposed a path-planning optimization technique for minimizing the overall EC and maximizing the number of computing tasks.
Asim et al. [9] proposed a trajectory planning technique based on GA with a variable population size (VPs) for minimizing the total EC of multi-UAV-aided MEC systems.This approach was comprised of two stages: the first stage included employing the GA with VPs for finding out the near-optimal deployment of HPs for UAV; and the second stage utilizing the multi-chrome GA to determine the relationship between UAVs and HPs, the UAV's near-optimal number, and the near-optimal order of HPs for UAVs.Zhang et al. [10] proposed an iterative optimization technique with the double-loop structure to search for jointly optimizing the partial computation offloading, power and spectrum resources, allocation of CPU, user association, and UAVs' trajectory to minimize the total EC and maximize the computation efficiency in MEC systems.Sun et al. [11] proposed an optimization approach based on the successive convex approximation for jointly optimizing the UAV's trajectory and the offloading mode in the hope of minimizing the overall EC.
Wu et al. [12] enhanced the tabu search algorithm and introduced a new robust path-planning approach.This approach efficiently optimized the number of UAVs and their path planning with the goal of minimizing total EC.In [2], a three-stage trajectory planning algorithm (TPA) was proposed to minimize the overall EC of UAVs.In the first stage, both the number and positions of halting points (HPs) were simultaneously updated using the differential evolution algorithm combined with a virtual particles (VPs) strategy.
In the second stage, the k-means clustering technique was applied to divide the provided HPs into a number of groups equal to the number of UAVs, with each group containing HPs visited by the same UAV.Finally, in the last stage, a low-complexity greedy strategy was employed to determine the order of HPs in each cluster, thereby generating the trajectory for each UAV.Additionally, there are several other studies in the literature that propose trajectory optimization approaches for UAV-assisted MEC systems.
M. J. Sobouti [13] proposed a new exact approach to solve the multiple 3D trajectory problem for flying base stations (FBS).This approach takes into consideration several constraints when used to solve this problem.Those constraints are FBS energy consumption, flight distance limits, inter-cell interference, and operation time.Broadly speaking, this approach has been split into two stages: FBS trajectory and FBS placement.In this approach, the problem is divided into numerous snapshots.First, the minimal number of FBSs needed and their precise 3D placements in each snapshot are estimated.Then, the trajectory phase is carried out between every two snapshots.During this phase, a binary linear model, which takes into account FBS flight distance and EC limits, is used to determine the ideal path between the origin and destination of each FBS.Afterwards, the shortest path for each FBS, which takes into consideration some constraints such as obstacles and collision avoidance, is computed.The experimental findings showed that this approach could be applied to the real world.
J. Lin [14] proposed a new encoding strategy, namely the cutting and padding encoding, to be employed with the differential evolution for presenting a multiobjective trajectory optimization technique to find the near-optimal flight trajectory of a UAV.This technique was verified using a set of up to 400 IoT devices and compared to some existing optimization techniques to reveal its effectiveness.In [15], several metaheuristic-based optimization algorithms, including the sine cosine algorithm (SCA), salp swarm algorithm (SSA), and flower pollination algorithm (FPA), were developed for precisely optimizing the whole deployment of a UAV utilizing an effective encoding technique.These algorithms were evaluated using numerous instances with a set of up to 700 IoT devices and compared to one another and to some other optimizers to demonstrate their performance.The experimental findings revealed that SCA outperformed in the majority of test instances.
Huang et al. [3] proposed a new encoding strategy for encoding solutions to this problem to avoid excessive dimensionality and mixed variables.This strategy was employed with differential evolution (DE) to offer a new deployment optimization technique, namely DEVIPS.This technique was evaluated using seven instances with several IoT devices ranging between 100 and 700.Furthermore, it was compared to several optimization approaches in order to demonstrate the superiority of this encoding scheme.Zhang [16] used an encoding mechanism to adapt the backtracking search algorithm (BSA) to solve this problem.This encoding mechanism makes each individual responsible for an HP and the population liable for the whole deployment.In addition, BSA was improved by utilizing opposition-based learning in conjunction with the population adjustment mechanism to present a better variant called BSADP.This approach was assessed using numerous instances and contrasted with some competing optimizers to demonstrate its superiority.
Zhang et al. [17] presented a new optimization approach based on integrating an elite-driven DE (EDDE) and DE with a dynamic population (DPDE).This approach was abbreviated EDDE-DPDE and used to optimize the deployment problem of a UAV with the purpose of saving the travel time required while collecting data from IoT devices.The experimental findings show the effectiveness of this algorithm compared to four competitors.Abu-Baker et al. [18] developed a new approach to gathering data from wireless sensor networks using UAVs.This approach incorporates two well-known metaheuristic algorithms, namely GA and particle swarm optimization (PSO).GA was utilized to solve the clustering problem, whilst the other was used to discover the UAV's near-optimal deployment.This approach's performance was measured using two metrics: throughput and lifetime.Some of these include drone swarm path planning [19], a multi-objective trajectory optimization algorithm for a single UAV [14], and reinforcement learning techniques [20][21][22].These studies contribute to the field by offering various approaches for optimizing UAV trajectories in UAV-assisted MEC systems, with a focus on minimizing EC and improving overall efficiency.
The TPP in UAV-assisted MEC systems is considered a multi-objective optimization challenge since it requires optimizing the EC objectives of both UAVs and IoT devices.However, most algorithms proposed in the literature have approached this problem by converting it into a single-objective optimization, potentially favoring one objective over the other.To address this limitation and optimize both objectives simultaneously, this study presents several multi-objective trajectory planning algorithms (MTPAs).These MTPAs are developed by adapting various metaheuristic algorithms, including the artificial gorilla troops optimizer (GTO) [23], gradientbased optimizer (GBO) [24], teaching-learning-based optimization (TLBO) [25], nutcracker optimization algorithm (NOA) [26], slime mould algorithm (SMA) [27], spider wasp optimizer (SWO) [28], differential evolution (DE) [2], RUN [29], and INFO [30].The MTPAs utilize a variable population size mechanism and leverage the Pareto optimality theory.Furthermore, a novel mechanism called the cyclic selection mechanism (CSM) is introduced in this study to effectively manage the population size, precisely optimize the number of halting points (HPs), and ensure accurate utilization of function evaluations.The HPs estimated by each MTPA are associated with multiple UAVs using the k-means clustering algorithm.The greedy mechanism is then employed to determine the order of HPs assigned to each UAV, thereby generating its trajectory.To evaluate the efficacy of the MTPAs with variable population size and cyclic selection mechanism, several experiments are conducted.The experimental results demonstrate that MTPAs employing the cyclic selection mechanism outperform other competing approaches.
The main contributions of this study are listed as follows: • Presenting multiobjective trajectory planning algorithms (MTPAs) for finding the near-optimal trajectories for various UAVs in the UMEC systems • Those MTPAs are based on several metaheuristic algorithms encoded using the variable population size and adapted for the multiobjective problems using the Pareto optimality.• Presenting a novel mechanism, namely the cyclic selection mechanism, to manage the population size for optimizing the number of HPs for each UAV more accurately • Investigating the effectiveness of the Pareto optimality and CSM with several metaheuristic algorithms for solving several instances with several IoT devices ranging between 80 and 400.• The experimental findings show that dealing with this problem as a multiobjective could achieve better total EC than converting it into a single objective.Furthermore, the CSM could significantly improve the performance of the optimization algorithms • Among all the studied algorithms, both MTPA-VTLBO and MTPA-VGBO could achieve superior outcomes.
The following is a summary of the remaining parts of this paper: Problem formulation section explains the problem formulation in a concise manner; the proposed algorithms are described in Proposed algorithm section; the experimental settings are illustrated in Experimental settings section.Results and discussion are presented in Results and discussions section, and the conclusion and recommendations for further research are discussed in Conclusion and future work section.

Problem formulation
The UAVs could be loaded with edge servers to be mobile for quickly receiving the data from the IoT devices for improving the quality of services presented to the end users.For example, Fig. 1 shows a multi-UAV-aided MEC system with n IoT devices and m UAVs.In this study, the IoT devices are denoted as N = {1, 2, . . ., n} , and the m UAVs with edge servers are denoted as M = {1, 2, . . ., m} .This system involves IoT devices that need to execute resource-intensive tasks.The ith task of the ith device is represented in this study using a two-tuple (D i , S i ) , where D i represents the size of the ith task and S i represents the required resources for processing a single bit in the ith task.The IoT devices could not process those tasks due to their limited resources.Therefore, those tasks are first submitted to the MEC servers to be processed, and then the findings are submitted back to the IoT devices.
The UAVs could update their stop positions to decrease the distance with the IoT devices; those positions are referred to as stop points or halting points (HPs).Each jth UAV has a set of k j HPs, which is denoted as K j = 1, 2, . . ., k j .Furthermore, the jth UAV's trajectory is comprised of a sequence of HPs that are visited by UAVs for receiving the data from the IoT devices with the purpose of minimizing the total EC.In this study, the jth UAV's trajectory is denoted as g j = X l j , Y l j , H l j , such that l ∈ K j .The altitude H l j of each stop point for the trajectory of this UAV is set to a fixed value as discussed in [2].The distance between the ith device, which is found at x i , y i , 0 , and the jth HPs could be defined according to the following: where H l j is the fixed altitude of the jth UAV.The IoT devices always chose the nearest HP for sending their tasks to be processed to minimize EC and transmission time.To assign each IoT device to the nearest HPs, a binary variable b ijl is used to determine whether the ith task is allocated to the lth HP of the jth UAV or not [3].This variable b ijl is assigned a value of 1 when the lth HP of the jth UAV is the nearest to the ith IoT device; other- wise, it is set to 0, as defined mathematically in the following mathematical equation: (1) The next constraint should be achieved to ensure that the ith task is assigned to only a UAV at only one HP: The jth UAV at the lth HP is able to simultaneously pro- cess the tasks of δ IoT devices at most due to its limited bandwidth, so the following constraint should be satisfied: In addition to this, each UAV located at each HP provides service to at least one IoT device; hence, the total number of HPs for all UAVs, which is referred to as k, must adhere to the following constraint: (3) where k min is equal to n M , and k max is equal to n .During the data transmission, the channel power gain between the ith IoT device and the jth UAV at the lth HP could be defined as follows: The mathematical model that could be used for computing the data transfer rate from the ith IoT device to the jth UAV at the lth HP is defined as follows: (5) where B refers to the bandwidth, P i is the transmission power of the ith device, and β 2 indicates the white Gauss- ian noise power.Time taken and power used by the ith IoT device to deliver data to the jth UAV at the lth HP are calculated as [3]: The total amount of energy consumed by all IoT devices is stated as [2]: UAVs begin carrying out their missions after they have received the necessary input data.The amount of time it takes to complete the ith task on the jth UAV at the lth HP, given that the computing resource is c ijl , may be calculated as: The jth UAV will wait until the lth HP has finished all of its assigned tasks before proceeding to the next HP.So, the amount of time the jth UAV spends hovering over the lth HP is equal to the total amount of time it takes to complete all assigned tasks.The hovering time of the jth UAV over the lth HP could be defined using the following equation: Afterwards, the energy consumption by the jth UAV over the lth HP until all assigned tasks are completed could be computed using the following formula: where p u stands for the jth UAV's hovering power.The jth UAV will visit all HPs in the estimated trajectory g j , so the time taken to fly among those SPs could be estimated as follows: (7) where v represents the jth UAV's flight velocity.Then, the energy consumed by the jth UAV within the flight time could be estimated according to the following equation: where p F represents the jth UAV's flight power.The total EC of all UAVs is based on the energy consumed in hovering time and flight time, as defined in the following formula: In this paper, we seek to present a new optimization technique that could optimize the UAV trajectories to minimize the energy consumed by the UAVs and IoT devices in the multi-UAV-aided MEC system.The energy consumption of IoT and that of IoT devices is the total energy consumption of this system.This means that this problem contains two objectives that need to be simultaneously optimized.However, several works in the literature have dealt with this problem using a weighted variable to relate two objectives together.This strategy is not effective for optimizing both objectives together because some objectives might have the highest effect on the objective function, and hence the small improvement in this objective might give preference to the generated trajectory even though it affects negatively the other objectives with low influence.Finally, the mathematical model of the multiobjective trajectory planning problem (MTPP) could be defined as follows:

Pareto optimality theory
The trajectory planning optimization problem could be classified as a multiobjective optimization problem (MOP) because it contains two objectives that need to be simultaneously optimized.In addition, the MOP might involve a set of constraints that need to be met by the solutions obtained within the optimization process.For simplification, a multiobjective optimization (15 problem that needs to be minimized is modeled as follows: where L refers to the number of objectives in the tackled MOP,nq i is the ith inequality constraint, z is the number of the inequality constraint, q i is the ith equality con- straint, S is the number of the equality constraint, n 1 is the number of solutions obtained by an optimization algorithm, − → U is the upper bound of all dimensions, and − → L is the lower bound.The multi-objective optimization problems are more complicated than the single-objective problems (SOPs) because the solutions obtained for these problems might improve some objectives and deteriorate others, and hence the methodology for choosing the best solution is hard to be achieved easily.In contrast, SOP only has one objective that the optimization algorithms could directly optimize in order to find a single solution that could either minimize or maximize it depending on its nature.From that, it is clear that SOP could be easily and directly solved by the majority of the optimization techniques, while for a MOP, the optimization technique needs to be first adapted by Pareto optimality to make them relevant for optimizing all objectives simultaneously.As aforementioned, the trajectory planning problem has two objectives that need to be simultaneously optimized using the Pareto optimality theory.This theory is based on estimating a set of solutions, known as nondominated solutions.Those solutions are called nondominated because they could minimize at least one objective without deteriorating any one of the others [31].For more clarification, in Fig. 2, the solution − → x 1 is dominated by the solution − → x 2 because it has one objective that is inferior and one objective that is equal.For solving the multiobjective trajectory planning problem of the UAVs in the multi-UAV-assisted MEC systems, a metaheuristic algorithm will be executed to generate a set of solutions, which are compared with each other under this theory, and the non-dominated solutions are stored in an archive to be compared with the solutions generated in the rest of the optimization process for searching for better nondominated solutions.(17)

Association between HPs and UAVs
In this stage, the HPs generated by the optimization process at generation t need to be associated with the UAVs to determine the HPs for each UAV to be visited.
To do that, according to [2], the K-means clustering (KMC) algorithm, which is classified as an unsupervised machine learning technique, is used to cluster the HPs into m clusters, denoted as C j , where All HPs in the jth cluster are visited by only the jth UAV.In more detail, the KMC algorithm will first initialize m clusters with m HPs selected randomly from the current popu- lation; those HPs are used to represent the centers of the clusters.Then, the Euclidian distance is used to compute the distance between those centers and the other HPs in the current population and assign each HP to the cluster with the closest distance.In algorithm 1, we describe the steps of the KMC algorithm for assigning the different HPs to the best UAV based on computing the distance between each HP and the centroid of various clusters and assigning this HP to the nearest cluster.This process is continued until the termination condition is satisfied.The termination condition in this state is achieved when the centroid in each cluster is unchanged.It is worth stating that the centroid Xj , Yj for each cluster is derived by averaging the HPs allocated to it, as defined by the following formulas:

Low-complexity greedy mechanism for the order of HPs
In [2], a new simple greedy method for generating the order of HPs for each UAV was proposed to be a strong alternative to the traditional and modern optimization techniques, which have high computational costs.This method was called the low-complexity greedy method.
For each UAV, this method selects an HP randomly from its corresponding cluster and then sets this HP to the first cell in the trajectory array of this UAV.Afterwards, the distance between this HP and all HPs in the corresponding cluster is computed, and the nearest HP is set to the second cell in the trajectory array.Likewise, the distance between the last added HP to the trajectory and all remaining HPs in the corresponding cluster is computed, and the nearest HP is added to the next cell in the trajectory array.This procedure is continued until all HPs in all clusters are assigned to the trajectory of each UAV.The pseudocode of this mechanism is presented in Algorithm 2.

Encoding mechanisms
The encoding mechanisms significantly affect the performance of the optimization techniques when tackling a specific problem.The trajectory planning problem has different encoding mechanisms in the literature to encode a set of HPs, which could minimize the energy consumed by UAVs in the MEC system.For example, some works encode the solutions to this problem using a population of individuals, where each individual is responsible for the HPs that construct a possible trajectory for UAVs.However, assume that there are k HPs, to encode the solutions that could estimate those HPs, each solution is comprised of 2k dimensions.Hence, under this encoding mechanism, it is hard to solve this problem due to the high dimensionality, especially with increasing the number of HPs.Also, this mechanism involves an additional array of the same length of the HPs to determine whether the corresponding HP is considered in the trajectory or not.In brief, this mechanism is ineffective because its performance significantly deteriorates with increasing the number of HPs.Therefore, in [2], an alternative encoding mechanism was proposed to represent the solutions to this problem more effectively.This mechanism is called the variable population size (VIPS)based encoding mechanism, which is discussed in detail in the following section.

VIPS mechanism [3]
To better encode the solutions to the trajectory planning problem, this mechanism was devised to avoid the high dimensionality problem in addition to removing the need for auxiliary variables.This mechanism makes each individual responsible for the coordinate x i , y i of a HP, and the entire population represents all possible HPs for the UAVs.According to that, each solution is comprised of only two dimensions, as depicted in Fig. 3.Each solution is responsible for the location of HPs; however, their number is not taken into consideration even now.To handle that, in [3], the authors presented a novel idea based on using removing, insertion, and replacing operators to create three new populations, namely P 1 , P 2 , and P 3 .P 1 includes all the individuals in the current population P , except for only one individual that is added to it from the newly-generated population Q .This means that the first population is increased one by one in the hope of achieving the near-optimal number of HPs.The second population, P 2 , is first assigned to all individ- uals in P , and then an individual from it is randomly selected and replaced with the current individual from the newly generated population.This means that the number of HPs is kept unchanged while their locations are optimized.The third population copies all the individuals from P , and then an individual is randomly selected and removed.This means that the number of stop points is decreased.Based on that, under those three operators, the number of HPs is increased, kept unchanged, or decreased; hence, the near-optimal number for HPs might be achieved.For simplification, those three populations are further discussed in brief in the following list: • First, P 1 = P , then the ith solution in − → x t is added to it.All individuals in P 1 are the same as those of P , except for only one solution.
• First, P 2 = P , then a solution from it is randomly chosen and replaced with the ith solution from − → x t .• First, P 3 = P , then an individual is randomly selected and removed.

Cyclic selection-based encoding mechanism
However, the VIPS-based encoding scheme still needs strong improvement due to creating and evaluating three newly generated populations in each loop.This might require several function evaluations to achieve the required outcomes.Therefore, in this study, we present a novel idea for selecting only one population to be evaluated in each loop in the hope that this population is the best among the others.This idea is based on proposing a new factor, known as the cyclic factor , which starts with a high probability for a tradeoff between the insertion and replacing operators and gradually decreases over time to maximize the probability of the removing operator and minimize the probability for the other two operators.That factor is reset a number of times within the whole optimization process to cover the three operators more accurately.For simplification, our idea is herein based on proposing a novel selection mechanism, namely cyclic selection mechanism (CSM), to accurately tradeoff between insertion, removing, and replacing operators for generating a new population based on a cyclic factor for covering all possible probabilities for each operator within the whole optimization process.In addition, under this mechanism, the operator that could achieve a better solution in the current generation is considered for generating a new population in the next generation in the hope of accelerating the convergence speed.The mathematical model for the cyclic factor is defined as: where t stands for the current function evaluation, T max represents the maximum function evaluation, T stands for the number of cycles (Estimated in the experiments section), and % stands for the remainder operator.The pseudocode of CSM is described in Algorithm 3.

Adaptation of some metaheuristics with VIPS mechanism for MTPP in the UMEC system
In this study, nine metaheuristic algorithms, including GTO [23], GBO [24], TLBO [25], NOA [26], SMA [27], SWO [28], DE [2], RUN [29], and INFO [30], are adapted using the Pareto optimality theory to present multiobjective trajectory planning algorithms (MTPA), namely MTPA-GTO, MTPA-GBO, MTPA-SWO, MTPA-DE, MTPA-TLBO, MTPA-RUN, MTPA-INFO, MTPA-SMA, and MTPA-NOA, for tackling this problem.In more detail, those algorithms first create a two-dimensional matrix of n × 2 , where n represents the population size, and 2 rep- resents the dimension size or coordinate of each HP.This matrix is randomly initialized within the lower bound of 0 and upper bound of 1000 for both X j and Y j , and evalu- ated using F 1 and F 2 .Then, it is added to the archive (A) to represent the non-dominated solution obtained so far since there is no other solutions to be compared with it.Afterwards, the optimization process of those algorithms is independently executed for generating new population (Q) , which is employed with the current population (P) to generate three populations (P 1 , P 2 , P 3 ) according to the insertion, deletion, and removing operators.The HPs in each population from those are associated with m UAVs using the KMC algorithm and the best trajectory of the HPs for each UAV is constructed by the greedy method discussed above.Afterwards, the trajectories estimated from those populations for various UAVs in UMEC system populations are evaluated using the objectives F 1 and F 2 , and compared with each other to identify their domi- nance.A non-dominated population from those three populations is set to P for using in the next generation to search for better solutions.In addition, the nondominated populations from those three populations are compared to those in A , and added to A those that are non-dominated.At the beginning of the optimization process, the studied algorithms will extensively explore the regions around the current population to avoid stagnation into local minima.Then, gradually, with increasing the current function evaluation, they will convert the exploration into exploitation for exploiting the regions around a solution selected randomly from A for accelerating the convergence speed.
Those process is continued until the maximum number of function evaluations is satisfied.A general pseudocode of a MTPA based on one of the investigated metaheuristic algorithms for solving MTPP is presented in Algorithm 4.
Algorithm 4: MTPA Adaptation of some metaheuristics with CSM mechanism for MTPP in the UMEC system In the previous section, we clarified how to adapt the metaheuristic algorithms using the Pareto optimality and VIPS-based encoding mechanism for tackling MTPP.In this section, we will clarify how to adapt those algorithms under the Pareto optimality and CSM for tackling the same problem.at random within the lower and upper bounds of each dimension and evaluated using F 1 and F 2 .Then, it is added to the archive ( A ) to represent the non-dominated solution obtained thus far, as there are no other solutions against which it can be compared.The optimization process of these algorithms is then independently executed to generate a new population ( Q ), which is collaborated with the current population ( P ) under the insertion, removing, and replacing operators to generate a new population with variable population size to optimize the location and number of HPs for the UAVs in UMEC systems, as described in Algorithm 3. The nondominated populations estimated by CSM are added to A whether they are not dominated by any solution inside it.This procedure is repeated until the utmost number of function evaluations has been attained.Algorithm 6 presents a general pseudocode of an MTPA with CSM (namely MTPA-V) for solving MTPP.• M i is randomly set between 1 and 10 3 MB • P i is assigned a value of 0.1W • p u and p F are assigned a value of 1000W • δ is assigned a number of 5 • B is set a value of 1MHz • S i = 100 cycles/bit • D i is scattered at random between 1 and 10 3 MB Nine metaheuristic algorithms are chosen to investigate their performance for finding the best trajectories for four UAVs that could gather data from IoT devices in less EC; those algorithms include GTO [23], GBO [24], TLBO [25], NOA [26], SMA [27], SWO [28], DE [2], RUN [29], and INFO [30].Those algorithms are adapted using the Pareto optimality theory, as discussed in detail in the section of the proposed algorithms, to present multiobjective variants for trajectory planning; those variants are termed MTPA-GTO, MTPA-GBO, MTPA-SWO, MTPA-DE, MTPA-TLBO, MTPA-RUN, MTPA-INFO, MTPA-SMA, and MTPA-NOA.All those algorithms are implemented in MATLAB R2019A within the same environments.The maximum number of function evaluations for all those algorithms is limited to 10,000 to ensure a fair comparison.All those algorithms are first executed 25 independent times on each instance and the obtained solutions are analyzed in terms of the summation indicator (SI) of two optimized objectives, the E UAV objective, and the E iot objective over several performance metrics, including best EC (BEC), average EC (AEC), p-value (PV) of the WRS test, Friedman mean rank (FR), and standard deviation (SD).The SI value of those objectives are computed in this study as follows [2]: where ǫ is set to 10,000 as discussed in [2], and − → x t j is the newly-generated deployment by an optimization algorithm.

Results and discussions
This section first compares the performance of the proposed MTPA-DE and MTPA-VDE to the recently published TPA to highlight the effectiveness of the Pareto optimality theory in optimizing all objectives simultaneously and reveal the performance of the proposed cyclic selection mechanism.Afterwards, the Pareto optimality theory and cyclic selection mechanism are integrated with eight additional metaheuristic algorithms to further observe their effectiveness in presenting a strong (24) SI( − → x t j ) = E UAV + ǫE iot trajectory planning algorithm with a higher ability to minimize total EC.

Parameter adjustment
The proposed cyclic selection mechanism has an effective parameter that has to be accurately estimated to maximize its performance with the proposed algorithms.This parameter determines the number of times the removing, insertion, and replacing operators are applied.Several experiments have been conducted using various values for this parameter when applying MTPA-VGBO for solving three instances such as I-80, I-160, and I-300.The results of these experiments are analyzed using the FR metric and depicted in Fig. 4. Inspecting this figure shows that the performance of MTPA-VGBO is maximized when setting T to 15 for the instances I-300 and I-400.So, this value is considered in all experiments conducted later in this study.

Comparison between MTPA-VDE, MTPA-DE, and TPA-DE
In this section, the proposed MTPA-VDE and MTPA-DE are compared to the recently published TPA-DE in terms of the SI, E UAV objective, and E iot objective to observe their effectiveness in finding the near-optimal deployment of HPs for multiple UAVs.Those algorithms have been executed 25 independent times on each instance, and the values of SI within those runs are analyzed in terms of several performance metrics and presented in Table 1.This table shows that MTPA-VDE is the best for all the performance metrics, except for BEC over I-120.In addition, this WRS test is applied to observe the difference between MTPA-VDE and the other algorithms.The resultant of this test, represented in the PV, is presented in the same table, which shows that the outcomes of MTPA-VDE are significantly different from the other algorithms for all instances.To summarize the outcomes of various algorithms over the SI, Figs. 5 and 6 are presented to compute the average of AEC and FR values over all instances.Those figures show that MTPA-VDE could rank first for the average AEC with a value of 3665.6 and the average FR with a value of 1.2, MTPA-DE ranks second with an AEC value of 3729 and FR value of 2.2, and TPA-DE is the worst algorithm.
The summation indicator sums the values of two optimized objectives without taking into consideration the performance of the algorithms for each objective independently.Therefore, Table 2 is presented to analyze the outcomes for the E UAV objective obtained by each algorithm within 25 independent times.From this table, the proposed MTPA-DE could be the best for all performance indicators, followed by MTPA-VDE as the second high-performing algorithm, while TPA-DE is the worst.The average of both AEC and FR values presented in Table 2 for all instances is computed and reported in Figs.7 and 8, respectively.Those figures show that MTPA-DE is the best, and MTPA-VDE is the second best.From those experiments, it is obvious that the performance of MTPA-DE is significantly better than that of TPA-DE and slightly competitive with the performance of MTPA-VDE.This reveals the significance of the Pareto optimality theory for optimizing all objectives simultaneously without giving higher preference to some objectives over others.
The outcomes of the E iot objective obtained by each algorithm within 25 independent times are analyzed in Table 3. From this table, the proposed MTPA-VDE could be the best for all performance metrics, followed by MTPA-DE as the second high-performing algorithm, while TPA-DE is the worst.The average AEC and FR values shown in this table for all instances are calculated and displayed in Figs. 9 and 10, respectively.These figures confirm that MTPA-VDE is significantly superior to the other algorithms.Those experiments reveal the effectiveness of the Pareto optimality theory for simultaneously optimizing both E iot and E UAV objectives.To further show the effectiveness of MTPA-VDE and MTPA-VDE over TPA-DE, the convergence curves for each of them over all instances are depicted in Fig. 11.This figure reveals the high convergence speed of the proposed algorithms over TPA-DE.However, the proposed MTPA-VDE could be significantly superior to the MTPA-DE.According to the previous experiments, MTPA-VDE could achieve lower energy consumption than the other algorithms, so its flight trajectories might be shorter.To reveal that, Fig. 12 is presented to report the trajectories of each UAV over I-180 and I-200.This figure shows that MTPA-VDE could achieve shorter  flight trajectories for some UAVs and competitive flight trajectories for others.

Comparison among nine multiobjective algorithms with VIPS-based encoding scheme
In this study, eight additional metaheuristic algorithms are integrated with the Pareto optimality theory and VIPS-based encoding scheme to observe their performance for planning the UAV trajectories more efficiently.Those algorithms are executed 25 independent times and the SI values under two optimized objectives are computed according to Eq. ( 24).Those values are analyzed in terms of various performance metrics and reported in Table 4.This table shows that MTPA-GBO could achieve better FR values for five instances, and MTPA-TLBO is the best for the other instances.To summarize the findings in this table, Figs. 13 and 14 are presented to display the average of AEC and FR values for all instances, respectively.Inspecting those figures shows that MTPA-TLBO is the best-performing algorithm in terms of average AEC with a value of 4920.9, and MTPA-GBO is better than all in terms of average FR with a value of 3.6.From that, it is observed that both MTPA-GBO and MTPA-TLBO have competitive performance for updating the deployment of HPs for UAVs with the purpose of minimizing the total energy consumption.
To further observe the performance of those algorithms, the convergence curve obtained by each algorithm for each instance is computed and presented in Fig. 15.This figure shows that MTPA-GBO could converge faster than the others for I-100, I-120, I-160, and I-200; MTPA-TLBO could achieve better convergence speed for I-140, I-180, and I-400; and MTPA-SWO is better for the remaining instances.
Table 5 presents an analysis of the outcomes obtained by these algorithms for the first objective separately.This table demonstrates that MTPA-GBO could attain superior FR values in eight instances, while both MTPA-TLBO and MTPA-GBO are competitive in the remaining instances.The average AEC and FR values for all instances are depicted in Figs.16  and 17, respectively, to provide a summary of the findings in Table 5.According to this figure, MTPA-GBO is the best-performing algorithm in terms of average The analysis of the outcomes obtained by various algorithms for the E iot objective is presented in Table 6.The presented table illustrates that MTPA-GBO exhibits higher FR values in six instances, MTPA-TLBO is the best for two other instances, and MTPA-SWO achieves outstanding outcomes for only one instance.Figures 18  and 19 present the mean values of AEC and FR, respectively, for all instances.These figures serve as a concise representation of the results stated in Table 6.Based on the data presented in the figure, it can be observed that the MTPA-GBO algorithm exhibits the highest performance in terms of FR, while the MTPA-TLBO algorithm exhibits the highest performance in terms of average AEC.

Comparison among some metaheuristic algorithms with CMS
The best five algorithms under Pareto optimality are integrated with the cyclic selection mechanism to further improve their performance for the trajectory planning of UAVs.These algorithms are executed 25 times independently, and the SI values for two optimized objectives are computed based on Eq. ( 24).Table 7 presents the results of an analysis of these values in terms of various performance metrics.This table  demonstrates that MTPA-VGBO could obtain higher FR values in five instances, MTPA-VINFO is superior in two instances, and both MTPA-VTLBO and MTPA-VSWO are superior in only one instance.The average AEC and FR values for all instances are displayed in Figs.20 and 21, respectively, to summarize the data presented in this table.MTPA-VGBO has the highest average AEC with a value of 4843.9 and the highest average FR with a value of 2.51.To further examine the efficacy of these algorithms, Fig. 22 is presented to depict the convergence curves for each algorithm on each instance.This figure demonstrates that MTPA-VDE has the lowest convergence speed; however, the convergence speeds of the other algorithms are somewhat competitive.
To reveal the efficacy of these algorithms for each objective separately, Tables 8 and 9 are presented to report the outcomes obtained by each algorithm for each objective in various instances.Starting with the E UAV objective, Table 8 presents the outcomes obtained by each algorithm for this objective.This table demonstrates that MTPA-VTLBO could achieve superior FR values in two instances; MTPA-VGBO, MTPA-VDE, and MTPA-VINFO are competitive in only one instance; both MTPA-VDE and MTPA-VSWO are superior in two instances; and both MTPA-VINFO and MTPA-VTLBO are superior in only one instance.In addition, the average AEC and FR values for all instances are depicted in Figs.23 and 24, respectively, to provide a summary of the findings in Table 8.According to this data, MTPA-VTLBO is the best-performing algorithm in terms of average AEC, MTPA-VSWO is the second-best algorithm, and MTPA-VINFO is the worst-performing algorithm.For the E iot objec- tive, Table 9 reports the outcomes achieved by each algorithm for this objective.This table illustrates that MTPA-VGBO could achieve superior FR values in six instances, MTPA-VGBO and MTPA-VTLBO are competitive in only one instance, and MTPA-VINFO is superior in only one instance.In addition, to provide a summary of the findings in Table 9, Figs. 25 and 26 are presented to report the average AEC and FR values for all instances, respectively.According to this data, MTPA-VGBO is the most effective algorithm in terms

3.E-06
Fig. 18 Average EC value under the E iot objective for nine multi-objective metaheuristic algorithms Fig. 19 Average FR value under the E iot objective for nine multi-objective metaheuristic algorithms

Comparison among some algorithms under VIPS and CSM
Finally, in this section, we will compare the outcomes obtained by the top-ranked multiobjective metaheuristic algorithms with both VIPS and CSM to highlight their effectiveness in minimizing the total EC of UAVs. Figure 27 presents the average AEC value for SI, the E iot objective, and the E UAV objective obtained by each algorithm in all instances.This figure shows that the performance of the algorithms with CSM could be better than that with the VIPS mechanism for SI and F2 objectives; however, MTPA-VGBO could be better than all the others.On the contrary, for the F1 objective, the multiobjective algorithms with the VIPS mechanism could be slightly better than the algorithms with CSM.Among them, MTPA-INFO could achieve the lowest average value for this objective.From that, it is clear that the cyclic selection mechanism has a significant effect on the performance of the optimization algorithms, where it could accelerate their convergence speed for achieving outstanding outcomes for the F2 objective in fewer function evaluations and somewhat competitive outcomes for the F1 objective.
In addition, it is worth mentioning that the Pareto optimality theory could simultaneously optimize both F1 and F2 objectives because it is not reliant on relating the objectives together using a weighted variable, and hence the solution that could improve at least one objective without deteriorating any of the others is considered a non-dominated solution.On the contrary, the weighted objective function, which relates the two objectives together using a weighted variable, does not take into consideration the deterioration of some objectives, but it relies on the objective values of the estimated solutions.For example, assume that the TPA-DE algorithm could find a new solution with a better value for the F2 objective but a worse value for the F1 objective; however, the F2 objective has the highest effect on the objective function, thereby making the overall objective value for this solution smaller than the best-so-far objective value.Hence, this solution is considered the new best-so-far solution, even though it negatively affects the F1 objective.

Conclusion and future work
Due to their limited storage and computing capacity, IoT devices are incapable of executing resourceintensive tasks.Therefore, the MEC technology has been utilized recently to provide computing and storage capabilities to these devices so that they can execute these tasks with low latency and low EC.However, because the MEC network's edge servers are located in a fixed location, they cannot be adjusted to meet the needs of end users.In order to overcome this limitation, UAVs have recently been loaded with edge servers to assist the MEC systems in presenting better computing services to end users.However, the trajectories of the UAVs must be meticulously planned in order to reduce the amount of energy consumed by IoT devices during data transmission and by UAVs during hovering time and mobility between halting points (HPs).
The trajectory planning problem is a difficult optimization problem due to the multiple factors that should be taken into consideration when solving it; these factors include the location and number of HPs, the relationship between HPs and UAVs, and the sequence of HPs for each UAV.This problem is a multiobjective optimization problem because it requires optimizing the EC   enhance the performance of optimization algorithms.Among all examined algorithms, MTPA-VTLBO and MTPA-VGBO were capable of producing the best results.
Our future work will involve employing the proposed CSM with some recently published metaheuristic algorithms, such as the mantis search algorithm, for tackling resource allocation and mining decisions in MEC-supported blockchain networks.In addition, the greedy mechanism used for constructing the order of HPs for each UAV will be replaced with a metaheuristic algorithm in the hope of achieving better trajectories.

Fig. 1
Fig. 1 Multi-UAV-aided MEC system including m UAVs and n IoT devices

Fig. 2 Algorithm 1
Fig. 2 An illustrative example for the non-dominated solutions

Fig. 3 3
Fig. 3 Solution representation for TPP under the VIPS-based encoding mechanism

Fig. 4
Fig.4Tuning the parameter T over three instances

Algorithm 6 :
MTPA-VExperimental settingsIn this study, nine instances, which include various numbers of IoT devices of I-80, I-100, I-120, I-140, I-160, I-180, I-200, I-300, and I-400, are employed to investigate the performance of the proposed algorithms.Those devices are randomly scattered in a squared area of 1000m , and four UAVs are flying at a velocity of 20m/s and a height of 200m to collect the data from those devices.The other parameters used in the MEC system assisted by multi-UAV are as follows:

Fig. 15 5
Fig. 15 Convergence curve under SI obtained by various multiobjective metaheuristic algorithms on all instances

Fig. 16 Fig. 17
Fig.16 Average EC value under the E UAV objective for nine multi-objective metaheuristic algorithms

Fig. 20 Fig. 21
Fig. 20 Average EC value under SI for algorithms with CSM

Fig. 22
Fig. 22 Convergence curve under SI obtained by five top-ranked multiobjective metaheuristic algorithms with CSM on all instances

m
The number of UAVs n The number of IoT devices t max The maximum function evaluation E UAV Energy consumption of UAV E iot Energy consumption of all IoT devices t The current function evaluation

Table 3
Comparison between MTPA-VDE, MTPA-DE, and TPA-DE over the E

Table 4
Comparison among nine multobjective metaheuristic algorithms over SI

Table 6
Comparison of nine multiobjective metaheuristic algorithms over E

Table 7
Comparison among five multiobjective metaheuristic algorithms with CSM over SI

Table 8
Comparison of five multiobjective metaheuristic algorithms with CSM over E

Table 9
Comparison of five multiobjective metaheuristic algorithms with CSM over E iot Fig. 23 Average EC value under E UAV objective for algorithms with CSM Fig. 24 FR value under E UAV objective for algorithms with CSM Fig. 25 Average EC value under E iot objective for algorithms with CSM Fig. 26 Average FR value under E iot objective for algorithms with CSM