 Research
 Open access
 Published:
NPRLBN: next point of interest recommendation using large bipartite networks with edge and cloud computing
Journal of Cloud Computing volume 12, Article number: 54 (2023)
Abstract
During the last decades, tourism has been augmented worldwide through which the diversity of tourists’ interests is increased and is challenging to tackle with the traditional management system. Such challenges can be overcome by LBSNs (LocationBased Social Networks) such as Yelp, Foursquare, and Facebook which help to collect more personalized information close to tourists’ preferences/interests like checkins, comments, and reviews. In this regard, solutions have been proposed to exploit the POI (Point of Interest) recommendation, but they failed to overcome sparsity and coldstart problems. Existing methods are also not focusing on important aspects, including geographical context, dynamics preferences and social influence, which are essential factors in POI recommendation. Therefore, this work tried to incorporate these factors and present a unified model using bipartite networks to learn users and POI dynamics. For this purpose, we have represented all the factors using eleven networks and combined them into a single latent space. In addition, Edge Computing processes data at the network's edge, reducing latency and bandwidth usage and enabling realtime and personalized recommendations. Furthermore, cloud computing could be used to store and process the large amounts of data collected from LBSNs, to support the proposed model's computational requirements and make it more accessible and scalable, allowing it to be easily used by tourism management systems worldwide. Experimental results show that our model outperforms stateoftheart methods using realworld dataset in terms of accuracy and perform better against sparsity and coldstart problems.
Introduction
“The United Nations World Tourism Organization (UNWTO)” reported that since 1950 the number of tourist arrivals has been raised to 1.4 billion yearly. Also, the fast growth in tourism is expected to reach 1.8 billion worldwide by the year 2030. Tourism plays a vital role in extending economic freedom in developed countries and presents a paradox. To overcome this paradox, different companies related to the tourism sector can play a vital role in different sectors, such as business communities and industries. In the past decade, a significant improvement has been witnessed by development experts, industry leaders, and policymakers toward the tourism sectors in various countries in the world [1]. Consequently, tourism has gained positive economic outcomes, especially by boosting the GDP (Gross Domestic Product) and providing employment opportunities [2]. Considering the growth of tourism and travellers' necessities, it is pertinent to enhance the services provided to travelers according to their needs and interests [3]. Therefore, exploiting the choices and preferences of users is a hot topic in academia as it greatly impacts decisionmaking, decision rules, and choice factors [4]. On the contrary, acute developments in the web, social networks, big data [5], cloud computing, IoVs (Internet of Vehicles) [6, 7], and IoTs (Internet of Things) technologies provide abysmal information that acquaints information overload problems. The individuals are precarious in choosing relevant information and making decisions. Therefore, recommender systems in information technology come in, which cope with the information overload problem [8]. It suggests relevant information to the users, considering their explicit or implicit preferences. Therefore, computer scientists contributed to the tourism industry, and plenty of research has been conducted to facilitate tourists using recommender systems. Due to dynamic and temporal preferences, the existing approaches are limited to coping with the sparsity [9] and coldstart [10] problems. Extensive research [11,12,13,14,15,16,17,18] have been devoted to this area which focuses on users and location relationship, but they exclude sparsity problem due to preference dynamics. In this regard, studies [19,20,21,22] have collected data about the relationship between users and tied it to user location but failed to resolve dynamic and temporal preferences. Besides, research dealing with users' temporal dynamics is still limited in alleviating the sparsity problem since these models do not exploit auxiliary contextual information which changes with user preferences over time [10]. For simplicity let’s say a tourist loves to visit mountains in summer and cities in winter, providing them with mountains in winter and cities in summer will be inappropriate or irrelevant. Therefore, the proposed model temporal factor by splitting the dataset into seasons and categories of locations. Using this approach, user satisfaction with recommendations will be enhanced and can achieve higher accuracy respectively.
To prudently overcome aforesaid issues, there is a need for a unified model that exploits the behavior and preference dynamics of users for a more personalized recommendation. Therefore, the proposed model alleviates such problems and makes the following contributions.

The proposed model uses an approach based on a probabilistic weighting strategy using eleven graphs to tackle the sparsity problem.

Presents two algorithms to get users' favorite season(s) with the most visited categories in a particular season using past checkins history.

The proposed approach uses the work of RELINE (Recommendation with Multiple Network Embeddings) [10] and tries to learn the embeddings of graphs by using the concept of graphs to find the heterogeneous preferences of users.
The rest of the paper is organized as follows: Literature review section presents the contributions being devoted to POI recommendations. Participated networks in the recommendation model section emphasizes participated networks in the proposed model, Proposed nextPOI recommender system section explains the proposed work, and Results and discussion section discusses obtained results. Finally, Conclusion and future work section concludes the proposed work and presents future work.
Literature review
This section discusses the contribution being devoted to recommender systems facilitating POIs. In [21] the authors have proposed a system using Hadoop technology which consists of four phases; scrapping, mapping, deduplication, and recommendation. The shortcoming of this method is a usercentric approach. Due to its complexity, it increases computation time. Likewise, [23] proposed a collaborative filtering approach that performs better due to WSN (Wireless Sensor Network) [24, 25] installations around tourist sites (IoT sensors on edge). It provides the convenience of uploading tourist information and rating POI using smartphones. However, such a method fails to tackle the cold start problem because it is not feasible to implement WSN in all tourist spots, as a result, there may be some locations that remain unrated/unvisited due to the unavailability of sensors/devices at various spots. In this regard, [14] and [25] come up with different approaches to resolving the cold start problem using the notion of CARS (ContextAware Recommender System), they have tried to get contextual information for achieving better results but ignored the importance of preference dynamics. Sampling on graphs has been used in various flavors, but less attention has been paid to matching a large set of graph properties. To this end, various studies employed network embedding models to exploit semantic relations between the network objects and generate their lowdimensional representations. In [26] the authors proposed a graphbased POI recommendation incorporating geographical and temporal influence to tackle the coldstart problem, but they ignored the importance of preference dynamics. Similarly, [27] have considered users’ preference dynamics but ignored social influence. Furthermore, [28] realized the need to provide POI recommendations at an appropriate time rather than only exploiting user, social, and geographical preferences. Finally, authors in [15] upgraded the work of LINE (Largescale Information Network Embedding) [29] and used large bipartite graphs to cope with cold start problems achieving good accuracy using social, geographical, temporalinfluence, along with users’ preference dynamics. To clarify, Table 1 summarizes the incorporating factors of existing methods while exploiting POI recommendations.
To exploit users’ or tourists’ behavioral patterns, we have highlighted the limitations of existing methods, many models [13, 28, 30,31,32] employed POIs as nodes and don’t consider the spatial dimension with distance information. However, [13, 19, 28] have considered location influence but ignore the preference dynamics, which change over time. Methods [31,32,33] used to capture the temporal dynamics elegantly but do not incorporate spatial dimension. Furthermore, methods that tackle spatial and temporal behavior but fail to include preference evaluation, and finally, methods that capture all factors failed to maintain users’ satisfaction regarding recommendations.
Participated networks in the recommendation model
This section presents the proposed model's workflow and discusses the participated networks and problem definition. Figure 1 depicts all incorporated networks in the proposed model, where Table. 2 describes mostly used symbols in the text.
As depicted in Fig. 1, eleven graphs (unipartite and bipartite) have been used in the proposed model. The proposed model consists of the social and location layers, which utilize all the incorporated graphs. The social layer represents the relationship between users (friendship) with each other. Similarly, the location layer describes the physical relationship between various locations like distance, height, and temperature. The embeddings have been generated for each graph and have fed to a collective space, where all the graphs are combined into a single vector space. To further understand the proposed model, the subsequent subsections explain the role of each participated graph as follows.
PointofInterest
PointofInterest is the location where tourists can take interest and have most checkin. It can be represented as a set: \(({s}_{id}, lon, lat)\), where \({s}_{id}\) specifies a location in the dataset, \(lon\) and \(lat\) refer to the longitude and latitude of a particular location.
Checkin
It is the presence of a user u in desired place l at a particular instance of time t, denoted as \(\varvec{c}_{\varvec{i}}=({ }{\varvec{u}},{\varvec{l}},{\varvec{t}})\). A user u can check in only one place where they can record multiple checkins in their profile \({\varvec{c}}_{\varvec{ui}}=\left\{({\varvec{l}}_{\varvec{i}},{\varvec{t}}_{\varvec{i}}),\dots ,({\varvec{l}}_{\varvec{j}},{\varvec{t}}_{\varvec{j}})\right\}\). For each user, a profile is maintained that stores the locations being visited by him/her; as the user profile grows, the preferences of a particular user will be more helpful in the recommendation.
Season
It can be defined as the season of the dataset and is divided into four seasons (Winter, Summer, Spring, and Autumn). Each season has the checkins of all tourists during season ∆S. Every tourist prefers to visit or go for a tour in a particular season, while some tend to go for a visit in a few seasons or even all seasons, respectively. The importance of this season must be realized in the POI recommendation. Therefore, we have used time as a season.
Useruser graph
The social interaction between users can be represented by and Useruser graph as it is a unipartite graph. It can be represented by\({G}_{uu}=\left(U\cup V, {W}_{uv}\right)\), where \(U\) and \(V\) represent the sets of users \(, while {W}_{uv}\) describes the set of weights among users in the network, which can be computed by Eq. (1) as follows
Usercategory graph
A bipartite graph shows the user's and category's connection, considering an entire checkin history. Specifically, it shows the significance of a specific category against all categories for a candidate user. Symbolically, this graph is represented by \({G}_{uk}=\left(U\cup K, {W}_{uk}\right)\), in which U and K are set of users and categories. \({W}_{uk}\) is a set of weighted edges between U and K which can be computed using Eq. (2) and indicates the number of checkins made in desired category k_{i} against overall checkins made by user u_{i.}
Userseason graph
A bipartite graph represents the relationship between user u_{i} and season s_{i}. Algorithm 1 has been used to compute the significance of season s_{i} for each user u_{i}. Can be denoted as \({G}_{us}=\left(U\cup S, {W}_{us}\right)\), in which \(S\) and \(U\) denote the set of seasons set users. \({W}_{us}\) ia s set of weighted links as computed by Eq. (3), the number of user’s checkins in overall season checkins made by the user u_{i}.
Algorithm 1 tries to extract a list of seasons that the user visits most. Checkin history, users, and seasons (winter, summer, spring, autumn) are provided as inputs. In step 1 the checkins have been sorted on timestamp/date, whereas in step 2 create a list L for the most visited season(s). Step 3 runs for each season, and step 4 checks whether the user u_{i} checkedin in S_{i}. If it is true, step 5 increment the season by one and assign it to user u_{i} in list L. Finally, step 6 returns the obtained L that consists of users and their favorite season(s).
Userlocation graph
A bipartite graph represents the degree of a specific location for a given user in a desired category. This relation is represented by \({G}_{ul}=\left(U\cup L, {W}_{ul}\right)\) U and L denote the sets of users and locations respectively. This relation is represented as \({G}_{ul}=\left(U\cup L, {W}_{ul}\right)\). Equation (4) computes the number of times one user visited a particular location in a category while the denominator calculates the user’s overall visits to distinct categories during all seasons:
Category
Five categories are being selected for POIs: Mountains, Rivers, Lakes, Restaurants, and Cities. Every location l must belong to one of the above categories. That is, we have divided the entire dataset into different categories and then extracted the desired category corresponding to the interest of each user.
Categorylocation graph
To represent the relationship between a location and category, the proposed model uses a directed bipartite graph which is different from the previous one based on the weighting mechanism adopted. It is represented by \({G}_{kl}=\left(K\cup L, {W}_{kl}\right)\), in which L and K denote the sets of locations and categories, respectively. \({W}_{kl}\) represents weighted edges and can be computed using Eq. (5). It can be computed as the number of times a place \(l\) is visited in a specific category k against all checkins in the concerned category:
Categoryuser graph
This graph emphasizes the importance of a category that corresponds to each user. The proposed model extracts categories for each POIs using Algorithm 2, which is a modified version of an algorithm proposed in [15]. It correlates each user from \(U\) with a category \(k \in K.\) The graph is denoted by \({G}_{uk}=\left(U\cup K, {W}_{uk}\right)\), where \({W}_{uk}\) denotes a set of weights between users and categories as computed in Eq. (6), the number of times a user visited the desired category to the total number of visits to all categories made by the same user:
Categorycategory graph
It is a bidirectional bipartite graph representing the relationship between pair of categories. For example, if we take two categories viz, \(k\) and \({k}^{{}^{\prime}}\) which are linked using a link if a certain user u checkin both in the same season s. Using this intuition, we construct the graph as \({G}_{k{k}^{{}^{\prime}}}=\left(K\cup K, {W}_{k{k}^{{}^{\prime}}}\right)\), in which \(K\) denotes the set of categories, and \({W}_{k{k}^{{}^{\prime}}}\) is the weighted edges between pair of categories. The weight between categories is calculated using Eq. (7) as it represents the number of times a user u has visited the corresponding categories simultaneously in season s.
Algorithm 2 helps to extract the most visited categories of locations in each season. Such an algorithm aims to get the categories users mostly prefer in the desired season(s). For example, a user u_{i} may love to visit historical places in summer, whereas in spring, she/he prefers to hike the mountains. Considering this approach to provide the most preferable recommendation in various periods can be helpful. Algorithm 2 accepts checkin history and seasons, respectively. Step 1 is sorting the checkins based on date, whereas steps 2 and 3 create lists for time (dividing the dataset into seasons) and categories (categories like mountains, rivers, lakes, meadows, cities, and parks). Step 4 goes through for each season, and n is declared in step 5 to keep track of each checkin of users. Step 6 is whether checkins are fall or not in the desired seasons. Step 7 runs and adds the checkin to the season list if true. Step 8 increment the n, whereas step 9 initializes q to track the desired category in a season. Step 10 runs to make the checkin in the desired category by checking it in step 11. If it is true, the desired checkin is added to the category list in step 12. For false, it increments the q in step 14 and reruns for the next season. Finally, step 15 returns list K.
Categoryseason graph
A directed bipartite graph represents relations between a season and a category. The categoryseason graph is denoted by \({G}_{ks}=\left(K\cup S, {W}_{ks}\right)\), in which K and S show the categories and seasons, respectively. \({W}_{ks}\) is a set of weights established between category and season. Weighted edges between season and category are computed using Eq. (8), where the numerator computes the number of checkins performed in the category \(k\) by all users in seasons \({s}_{i}\), while the denominator represents the whole checkins performed in all seasons for the same category.
Locationlocation graph
This bipartite graph is employed to show the spatial distance between locations if a user u visits two locations \(l\) and \({l}^{{}^{\prime}}\) at the same time and distance in a range \({R}_{g}\), then a link is established between them. The graph is denoted as \({G}_{l{l}^{{}^{\prime}}}=\left(L\cup L, {W}_{l{l}^{{}^{\prime}}}\right)\), in which \(L\) is a set of locations and \({W}_{l{l}^{{}^{\prime}}}\) is a set of weights among them as computed by Eq. (9) using geographical proximity.
Locationuser graph
It represents the relationship between locations and users, also known as a directed bipartite graph. Also, users' interests correspond to a specific location that changes over time. More specifically, this relation is represented as \({G}_{lu}=\left(L\cup U, {W}_{lu}\right)\). The weight \({W}_{lu}\) is calculated using Eq. (10) as the number of times a user u visit a place l to the total number of checkins made by all the users to that location:
Locationseason graph
To show the significance of a certain location for a user u in a season s_{i}, the proposed model uses a locationseason graph, which can be represented by \({G}_{ls}=\left(L\cup S, {W}_{ls}\right)\), \(L\) and \(S\) show a set of locations and seasons. \({W}_{ls}\) represents the weighted edges as calculated using Eq. (11).
Problem definition
Some tourists visit a set of locations in a particular season, while some tend to visit each season. Furthermore, users’ preferences change by location category K, i.e., u_{i} may love to visit l_{i} in winter whereas u_{j} in summer. To consider this problem, we must provide a list of locations L to user u in a season s belong to category k_{i} given that \(Q (u,l,s)\).
Proposed nextPOI recommender system
This section presents the proposed POI recommendation model that jointly learns multiple graph embeddings and encodes them into a lowdimensional embedding space exploiting semantic relations between the nodes of the networks.
Learning embeddings for large information networks
Two nodes m and n are directly connected by an edge, known as firstorder proximity. In contrast, the relation between vertices that share multiple neighbor nodes but are not directly associated with each other is referred to as secondorder proximity. The LINE model [23] tries to learn the relationships of large graphs to extract this kind of proximity. With this concept, we expand our model to learn the embeddings of large information networks.
Consider two disjoint sets \(=({Q}_{A}\cup {Q}_{B}, W)\), where the vertices in \({Q}_{A}\) that collaborate many common neighbors with \({Q}_{B}\) but they are not linked, then there is a high probability that their distributions are the same. To compute the conditional probability of vertex \({n}_{j} \in {Q}_{B}\) given node\({m}_{i} \in {Q}_{A}\), the model employs the following equation:
The vectors of \({m}_{i}\) and \({n}_{j}\) can be represented as \({\overrightarrow{m}}_{i}\) and \({\overrightarrow{n}}_{j}\). Hence, for each vertex \({m}_{i} \in {Q}_{A}\), Eq. (12) presents conditional distribution \(p\left(\bullet {m}_{i}\right)\) to all related vertices in the set \({Q}_{B}\). Then, the model uses the conditional distribution to approximate the empirical distribution \(\widehat{p}\left(\cdot {m}_{i}\right)= \frac{{w}_{i,j}}{\sum {w}_{i,m}}\) employing the following objective function:
where \(d\left(\bullet \bullet \right)\) indicates the Fullback–Leibler divergence between conditional and empirical distributions. To tune the significance of \({m}_{i}\), we have used \({\lambda }_{i}\) as a hyperparameter. This parameter is set to the outdegree of each node. Thus, Eq. (13) tries to optimize the following objective function:
The \({\left\{{\overrightarrow{\varvec{m}}}_{\varvec{i}}\right\}}_{{\varvec{i}}=1\dots {\varvec{Q}}_{\varvec{A}}}\) and \({\left\{{\overrightarrow{\varvec{n}}}_{\varvec{j}}\right\}}_{{\varvec{j}}=1\dots {\varvec{Q}}_{\varvec{B}}}\) that minimizes Eq. (14) are the lowdimensional nodes representations in \({\mathbb{R}}^{d}\) [15].
Optimization of the model
It requires the summation of the complete set to find conditional probability as a result, it enhances computational complexity. To address this problem, we use negative sampling used in [27], which simply samples N negative edges according to the noise distribution for every edge (i, j) as defined in the following equation.
where \(\sigma (x) = 1/1 + exp(x)\) is the sigmoid function, and \({P}_{n}=\left(n \propto {d}_{h}^{{}^{3}\!\left/ \!{}_{4}\right.}\right)\) same as proposed in [29], \({d}_{h}\) is the outdegree of node n. Furthermore, we come up with an asynchronous stochastic gradient algorithm [33] to optimize Eq. (15). If an edge (i, j) has been sampled, the gradient concerning to the embedding of \({\overrightarrow{m}}_{i}\) of node i can be computed as:
It is also considered that the gradient is multiplied by the weight of a link in Eq. (16). It may be problematic if we ignore the balancing of the learning rate. We should have to carefully keep the learning rate because, when selecting the learning rate according to the links with low weights, the gradients on links with high weights will be disastrous. Similarly, when selecting the learning rate with high weight, the gradient will be too small. The model employs the sampling approach adopted in [31] to sample a random edge. Finally, the model draws a sampled edge using alias table according to [29], which minimizes computational complexity to \(O(1)\). Table 3 illustrates the complexity of edge sampling optimization process.
Learning graph dynamics
Initially, by providing bipartite input graphs, the subsequent step combines them into the model. Our input graphs have been divided into three parts, one considering user networks (UserLocation, UserSeason, UserCategory, UserUser). At the same time, the second is related to location (LocationLocation, LocationUser, LocationSeason), and the third one corresponds to a category of places (CategoryUser, CategoryCategory, CategoryLocation). The model collectively integrates the embedding of participating graphs as defined in Eq. (18–28) related to users and POIs relations, optimizing the objective function defined in Eq. (17) as follows:
The model computes these objective functions as follows.
For optimization of object functions, as defined in Eq. (17), it requires merging the links of all networks, and at every step, the model updates a new sample edge. The probability of the desired link is computed based on its associated weight. The training of our model is done jointly using the algorithm in [15] dynamically.
Personalized nextPOI recommendation
After exploiting semantic relations between the nodes of the participating graphs and learning their embeddings, the next step is to make recommendations for a user. Given a query \(Q(u,l,s)\), which specifies a user \(u\) in a location \(l\) and season s we can correspond these values to desire season \(s\). Claiming that a user is willing to attend locations in specific seasons corresponding to category k_{i}. Finally, we rank a list with top@n unvisited location for a user u_{i} related to category k_{i}. The proposed model uses Eq. (29) to recommend unvisited locations:
where \(\overrightarrow{{\varvec{u}}}\) and \(\overrightarrow{{\varvec{l}}}\) is the vector representations of user \(u\), and location \(l\). Similarly,\(\overrightarrow{{\varvec{k}}}\) represents the vector representation of the category \(k\) such checkin. The proposed model uses cloud computing to store and process the data, and to jointly learn the vector representations from various information graphs in the same embedding space. This allows for more efficient handling of large amounts of data and the ability to perform complex computations. The cloudbased solution also allows for the incorporation of more information networks, which in turn reduces sparsity by incorporating more information networks, and jointly learns the dynamics of the social influence \({(\overrightarrow{{\varvec{u}}}}^{{\varvec{T}}}\times \overrightarrow{{\varvec{l}}})\), the geographical influence (\({\overrightarrow{{\varvec{k}}}}^{{\varvec{T}}}\times \overrightarrow{{\varvec{l}}}\)), and the temporal influence (\({\overrightarrow{{\varvec{s}}}}^{{\varvec{T}}}\times \overrightarrow{{\varvec{l}}}\)), simultaneously to provide more accurate and personalized POI recommendations to users.
Employment of cloud and edge computing
Edge computing is a distributed computing paradigm that focuses on processing data near the source of data generation, thereby reducing latency and bandwidth usage. The proposed work leverages edge computing to process data generated by tourists using LBSNs to share their preferences and interests. LBSNs enable the collection of tourists' location and interest data, which can be processed at the network's edge, facilitating realtime and personalized recommendations. In addition to edge computing, cloud computing can be employed to store and process the large amounts of data collected from LBSNs as shown in Fig. 2 consists of two main components. The first component shows tourists using LBSNs to share their preferences and interests, generating data collected and processed at the network's edge through edge computing. The second component displays the cloud computing infrastructure used to store and process the large amounts of data collected from LBSNs. This approach enhances the computational capabilities of the proposed model and enables tourism management systems worldwide to access and use it easily.
Results and discussion
This section focuses on the performance evaluation of the proposed model based on the Foursquare dataset. That is, we compare the results of the proposed model with existing POI models.
Dataset
To analyze the results of the proposed model, we have used a publicly available dataset known as Foursquare.^{Footnote 1} The dataset consists of POIs, users’ checkins, and friendships, which have been collected from the year 2012 to 2014. The distribution of the dataset is depicted in Table 4. The seasons are extracted using the period given in the dataset. Similarly, each POI is associated with its category like restaurant, river, lake, city, and so on.
Baseline models
In this set of experiments, we have comparatively viewed the results with the following models.

RELINE [10]: They have used a graphbased approach to learn users’ and POI relationships from 8 weighted networks in a hidden space and provide location recommendations under a strategy having a probability that examines the influence of social, geographical, temporal, and preference dynamics.

GE [28]: is another graphbased embedding model that exploits geographical influence, sequential effect, temporal cyclic effect, and semantic effect in a unified way and embeds four information graphs into a shared embedding space. Also, a novel timedecay method is proposed that dynamically computes the user’s latest preferences based on the embedding of his/her checkedin POIs learned in the embedding space.

WWO [31]: is a unified POI recommender system with temporal interval assessment that considers temporal interval distributions and developed the lowrank network model, identifying a set of biweighted network bases to learn the static preferences and dynamic preferences coherently.

PGB [33]: This probabilistic model employs the graphbased Markov chain method to improve recommendation accuracy. The choice of suggesting an item is conditioned by considering recommendations generated in previous steps.
Experiments evaluation
To conduct experiments, we have divided the checkins into three partitions for each target user that include: (i) the training set \({\varepsilon }_{C}^{T}\) is the known information, which comprises 80% of the entire checkins, (ii) the probe set \({\varepsilon }_{C}^{P}\), used to test the model, and it contains 10% of the data, and (iii) the validation set \({\varepsilon }_{C}^{V}\) is the rest 10% for validation of the proposed model. Mathematically we can represent it as \({\varepsilon }_{c}= {\varepsilon }_{C}^{T}\cup {\varepsilon }_{C}^{P}\cup {\varepsilon }_{C}^{V}\) and \({\varepsilon }_{C}^{T}\cap {\varepsilon }_{C}^{P}\cap {\varepsilon }_{C}^{V}=\varnothing\). To make recommendations for a target user, the model uses the POIs in set \({\varepsilon }_{C}^{T}\).
For the evaluation, we measure the \(Accuracy@n\) as proposed in [15]. For each \(l \in {\varepsilon }_{C}^{P}\) given as a query \(Q(u, l, s)\), the prediction score for that specific location \(l\) along with all unvisited proximate POIs of the target user is computed based on Eq. (30). The model ranks the predicted scores into a list, and then chooses the top POIs. If the ground truth \(l \in {\varepsilon }_{C}^{P}\) exists in the top recommendations, then the model has accurately recommended that location (i.e., True Positive), otherwise it has suggested a wrong location. To calculate the net accuracy of ten recommendations, the model averages all predictions test cases as follows:
Impact of information graphs
Particularly, we examine how the integration of an information graph influences the topn predictions. Thus, we compare NPRLBN with the four models PGB, GE, WWO, and RELINE which are described in Baseline models section. The results shown in Table 5 reveal that as the model integrates the latest information network, its accuracy increases. This way, the proposed model lessens the sparsity issue by exploiting rich information about the users or the POIs. Also, it is noticeable that the accuracy of all models increases with n, which exhibits that the models fit well to users’ behavior.
Comparative analysis
This section evaluates the results of the proposed work with baselines using accuracy to produce top@n recommendations \([n=1, 5, 10, 15, 20]\) employing a realworld dataset (Foursquare). Specifically, judging the performance of all models that provide POI recommendations against coldstart users and locations.
Cold start user
We have studied the efficacy of the proposed work considering the coldstart user problem. Producing recommendations for such kinds of users is incredibly challenging due to the unavailability of the required information. In this context, we conducted experiments to provide recommendations only to coldstart users and used an accuracy metric to analyze the results of these models, as shown in Fig. 3. Since all models mentioned above support the coldstart recommendation, in this regards we compare our model with all these baselines. We employ side information related to users and locations from eleven information graphs, yielding improved results.
Cold start locations
Similarly, we examine a problem that is known as coldstart locations. The aim is to suggest locations that were not visited for at least one or few checkins at a location with less than 25 checkins. Thus, to cope with this, we have investigated how models behave on unpracticed users or new location is introduced into the system. In addition, we have analyzed whether the new location is listed in the top@n recommendations. Again, the proposed model outperforms in terms of producing quality recommendations, as shown in Fig. 4.
Significance of seasons
Here, we have analyzed the influence of the time interval per season ∆S against the accuracy for different values of the ten recommendations made by the model. ∆S significantly impacts the model's results since it is employed to build multiple information graphs. Table 6 shows the results of the model using the dataset. We can notice that the accuracy score reaches a maximum value at a certain point and then decreases gradually. The reason behind the low accuracy score is the value of ∆S. If its value is exceedingly small, then it means we have less data, and diffidently the accuracy score will be less. On the contrary, for large values of ∆S, a substantial number of nodes related to the target user exists, which causes an overfitting problem. Considering these factors, we choose the size value for the dataset to be 15.
Conclusion and future work
Due to the exponential growth of information on the internet, recommender systems have become prevalent technological assistants to humans. Likewise, LBS have become ubiquitous in different sectors and therefore gained the attention of numerous scientific disciplines. The emergence and usage of communication technologies such as mobile devices allow researchers to come up with more elegant solutions against sparsity and coldstart problems. Geographical information shared on such networks enables researchers to tackle both problems. Various POI approaches that have been proposed using social influences, geographical proximity, and preference dynamics using their social influences. In this work, we have considered coldstart and sparsity problems while providing POI recommendations in the tourism sector. Particularly, our proposed model has been upgraded and comes with additional features such as users’ satisfiability on the system and employs a weighted probabilistic approach over eleven information graphs based on relations established among users, seasons, and categories. The model personalizes locations by jointly learning the embeddings of users and POIs into the same embedding space. The incorporation of edge and cloud computing in our proposed model has improved the system's accuracy and scalability, allowing it to be easily used by tourism management systems worldwide. The influence of social, geographical, and temporal factors in terms of accuracy has been scrutinized. Our model has been evaluated and outperformed against the coldstart users and the coldstart locations. Our future work is to ensure users’ social information security, which is crucial to users, and a stateoftheart problem in POI recommendation.
Availability of data and materials
The materials that support this study are available upon request from the second author.
References
Hamid RA, Albahri AS, Alwan JK, Alqaysi ZT, Albahri OS et al (2021) How smart is etourism? A systematic review of smart tourism recommendation system applying data management. Comput Sci Rev 39:100337
Ashley C, Brine PD, Lehr A, Wilde H (2007) Introduction. The role of the tourism sector in expanding economic opportunity, 1^{st} ed, Coporate Social Resoponsibility Initiative Report No. 23, Cambridge, England. Kennedy School of Government Harvard University, MA, pp 6–7
Manzoor F, Wei L, Asif M, Haq MZ, Rehman H (2019) The contribution of sustainable tourism to economic growth and employment in Pakistan. Int J Environ Res Public Health 16(19):3785
Pei Y, Zhang Y (2021) A study on the integrated development of artificial intelligence and tourism from the perspective of smart tourism. J Phys: Conf Ser 1852(3):2021
Farooqi MM, Shah MA, Wahid A, Akhunzada A, Khan F, Ali I (2019) Big data in healthcare: a survey. Applications of intelligent technologies in healthcare. Springer, Cham, pp 143–152
Malik A, Khan MZ, Faisal M, Khan F, Seo JT (2022) An efficient dynamic solution for the detection and prevention of black hole attack in VANETs. Sensors 22(5):1897
Abbas S, Talib MA, Ahmed A, Khan F, Ahmad S, Kim DH (2021) Blockchainbased authentication in internet of vehicles: a survey. Sensors 21(23):7927
Hoq KMG (2016) Information Overload: Causes, Consequences and Remedies  A Study. Philosophy and Progress 55(12):49–68. https://doi.org/10.3329/pp.v55i12.26390
Achmad KA, Nugroho LE, Djunaedi A (2017) Tourism contextual information for recommender system. In 2017 7th International Annual Engineering Seminar (InAES) (pp. 16). IEEE
Christoforidis G, Kefalas P, Papadopoulos AN, Manopolous Y (2021) RELINE: pointofinterest recommendations using multiple network embeddings. Knowl Inf Syst 63:791–817
Bobadilla J, Ortega F, Hernando A, Gutiérrez A (2013) Recommender systems survey. KnowlBased Syst 46:109–131
Mahadik K, Wu Q, Li S, Sabne A (2020) Fast distributed bandits for online recommendation systems. In Proceedings of the 34th ACM international conference on supercomputing (pp. 113)
Lu J, Wu D, Mao M, Wang W, Zhang G (2015) Recommender system application developments: a survey. Decis Support Syst 74:12–32
Linden G, Smith B, York J (2003) Amazon. com recommendations: Itemtoitem collaborative filtering. IEEE Internet Computing. 7(1):76–80
Figueredo M, Ribeiro J, Cacho N, Thome A, Cacho A, Lopes F, Araujo V (2018) From photos to travel itinerary: A tourism recommender system for smart tourism destination. In 2018 IEEE Fourth International Conference on Big Data Computing Service and Applications (BigDataService) (pp. 8592). IEEE
Bahramianm Z, Abbaspour RA, Claramunt C (2017) A cold start contextaware recommender system for tour planning using artificial neural network and casebased reasoning. Mob Inf Syst 2017:18
Abbas A, Zhang L, Khan SU (2015) A survey on contextaware recommender systems based on computational intelligence techniques. Computing 97:667–690
Artemenko O, Kunanets O, Pasichnyk V (2017) Etourism recommender systems: a survey and development perspectives. Int Q J 6(2):91–95
Bahramian Z, Abbaspour RA, Claramunt C (2017) A CONTEXTAWARE TOURISM RECOMMENDER SYSTEM BASED ON A SPREADING ACTIVATION METHOD. International Archives of the Photogrammetry, Remote Sensing & Spatial Information Sciences, 42. Tehran, pp 7–10
Levandoski JJ, Sarwat M, Eldawy A, Mokbel MF (2012) Lars: A locationaware recommender system. In 2012 IEEE 28th international conference on data engineering (pp. 450461). IEEE
Thasal R, Yelkar S, Tare A, Gaikwad S (2018) Information retrieval and deduplication for tourism recommender system. Int Res J Eng Technol 5(3):1683–1687
Paulino I, Lozano S, Prats L (2021) Identifying tourism destinations from tourists’ travel patterns. J Destin Mark Manag 19:100508
Vijayakumar V, Vairavasundaram S, Logesh R, Sivapathi A (2019) Effective knowledgebased recommender system for tailored multiple point of interest recommendation. Int J Web Portals (IJWP) 11(1):1–18
Khan F, Ahmad S, Gürüler H, Cetin G, Whangbo T, Kim CG (2021) An Efficient and Reliable Algorithm for Wireless Sensor Network. Sensors 21(24):8355
Khan F, Gul T, Ali S, Rashid A, Shah D, Khan S (2018) Energy aware clusterhead selection for improving network life time in wireless sensor network. In Science and Information Conference. Springer, Cham, pp 581–93
Xie M, Yin H, Wang H, Xu F, Chen W, Wang S (2016) Learning graphbased poi embedding for locationbased recommendation. In Proceedings of the 25th ACM international on conference on information and knowledge management (pp. 1524)
Xie M, Yin H, Xu F, Wang H, Zhou X (2016) Graphbased metric embedding for next poi recommendation. In Web Information Systems Engineering–WISE 2016: 17th International Conference, Shanghai, China, November 810, 2016, Proceedings, Part II 17 (pp. 207222). Springer International Publishing
Liu Y, Liu C, Liu B, Qu M, Xiong H (2016) Unified pointofinterest recommendation with temporal interval assessment. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining (pp. 10151024)
Tang J, Qu M, Wang M, Zhang M, Yan J, Mei Q (2015) Line: Largescale information network embedding. In Proceedings of the 24th international conference on world wide web (pp. 10671077)
Kenteris M, Gavalas D, Mpitziopoulos A (2010) A mobile tourism recommender system. In The IEEE symposium on Computers and Communications (pp. 840845). IEEE
Ding F, Zhu G, Li Y, Zhang X, Atrey PK et al (2021) Antiforensics for face swapping videos via adversarial training. IEEE Trans Multimedia 24:3429–3441
Leskovec J, Faloutsos C (2006) Sampling from large graphs. In Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining (pp. 631636)
Joorabloo N, Jalili M, Ren Y (2019) A probabilistic graphbased method to improve recommender system accuracy. In Engineering Applications of Neural Networks: 20th International Conference, EANN 2019, Xersonisos, Crete, Greece, May 2426, 2019, Proceedings 20 (pp. 151163). Springer International Publishing
Acknowledgement
The authors would like to acknowledge Prince Sultan University and EIAS Lab for their valuable support. Further, the authors would like to acknowledge Prince Sultan University for paying the Article Processing Charges (APC) of this publication.
Conflicts of interest
All the authors have no conflicts of interest.
Author information
Authors and Affiliations
Contributions
I.K and A.S are the main writers of the manuscript. They put forward the main ideas of architectural modeling and analysis, and wrote the main part of this manuscript. G.A and M.A supervised the work and provided the funding for the said research. T.S have designed the Algorithms and R.K and validated the results. All the authors reviewed and approved this manuscript.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
The study did not require ethical approval.
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Khan, I., Sadad, A., Ali, G. et al. NPRLBN: next point of interest recommendation using large bipartite networks with edge and cloud computing. J Cloud Comp 12, 54 (2023). https://doi.org/10.1186/s13677023004275
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s13677023004275