Literature DB >> 32802026

Algorithms for Investment Project Distribution on Regions.

Mafawez Alharbi1, Mahdi Jemmali1,2.   

Abstract

This paper proposes an optimization system for solving an NP-hard problem by using several new algorithms and application programs. This study aims to identify a suitable distribution of investment projects across several developed industrial regions. It is assumed that all industrial regions involved have the same economic and strategic characteristics. The problem involves a set of projects that are to be assigned across regions. Each project creates an estimated number of new jobs, and the distribution of projects can be guided by minimizing the maximum total number of newly created jobs. The problem is NP-hard one, and it is difficult to determine the most appropriate distribution. We apply scheduling algorithms in order to solve the analyzed problem. Severalheuristics are developedto obtain the appropriate distribution of newly created jobs across all regions. A branch-and-bound method is employed in order to obtain the exact solution. The performance of the algorithm is demonstrated by the experimental results for a total number of 1850 instances.
Copyright © 2020 Mafawez Alharbi and Mahdi Jemmali.

Entities:  

Mesh:

Year:  2020        PMID: 32802026      PMCID: PMC7416242          DOI: 10.1155/2020/3607547

Source DB:  PubMed          Journal:  Comput Intell Neurosci


1. Introduction

It is important to distribute investments throughout a given country. A “good” distribution of investment resources is very important both from an economic and a social perspective since this may have a direct effect on the behavior of cities in terms of business and economic cycles. The context of this paper is the regional allocation of industrial projects. The problem is as follows: when there are several projects to be assigned across different industrial regions, how should these projects be allocated? The solution to this problem may be based either on the principle of equity distribution or on a distribution process which is meant to minimize the maximum total number of projects to be assigned. We start with a set of projects to be undertaken in several regions, each of which represents an amount of newly created jobs and will be entirely undertaken in the chosen region. For the sake of equity, certain heuristics are applied to guide the allocation of industrial projects to different regions in order to increase the minimum total number of new jobs created [1]. To answer the research question, it is necessary to choose between minimizing the maximum total number of new jobs created and maximizing the minimum total number of new jobs created. In this study, we assume that the industrial regions involved are identical and that they share the same characteristics that affect both investment allocation and investment opportunities. For example, if the investment project consists in the creation of a marine port, then it should be allocated to a coastal region since this is an investment with specific economic characteristics. All types of investments with such specific characteristics will be excluded from this work. Heuristics and exact solutions will be used to ensure a good distribution of all investment projects to industrial regions. The solutions to the aforementioned problem affect individuals and their everyday activities and have significant impacts on the public and private sectors and on society at large. A wide variety of resource allocation methods with different mathematical approaches to medialization have previously been used in various application areas to compute optimal solutions to the problem of the distribution of projects to several regions. The present work aims to determine the distribution of investment projects across regions by using a new objective function and newly proposed mathematical model. New solutions were developed based on the utilization of the heuristics and the optimal solutions for P||Cmax (minimizing the maximum total completion time for parallel machines) and P||Cmin (maximizing the minimum total completion time for parallel machines). For the first problem, which is P||Cmax, we use the heuristics and optimal solution cited by Haouari and Jemmali [2]. The respective derived bounds cited in latter work were embedded in an exact procedure which yielded an exact and effective algorithm based on a symmetry-breaking branching strategy. For the second problem, P||Cmin, a branch-and-bound (B & B) algorithm was developed by Haouari and Jemmali [1] based on enhanced lower and upper bounds. The P||Cmin problem is also referred to as “machine covering.” A study of Walter et al. [3] also proposed an exact B & Bmethod. The rest of this paper is structured as follows. Section 2 discusses previous work related to the aforementioned problem, while Section 3 sets forth the analyzed problem. Section 4 develops the lower bound for the analyzed problem, and Section 5 introduces and describes several heuristics. Section 6 presents the proposed solution, and Section 7 describes the experimental tests applied and their results. Section 8 concludes this paper.

2. Related Work

The theory of optimal control is used to tackle the unequal distribution of income between regions associated with high rates of aggregate growth for the regional allocation of investments [4]. Other applications for optimizing the allocation of investments include linear dynamic programming problems. The objective of a study by Ohtsuki [5] was to search for the optimum solution to the regional allocation problem. Bagchi et al. [6] provided a formulation for the problem of investment allocation across cities in the framework of a planning model, and a case study examining the impact of investment on the economies of Japan was carried out [7]. The study by Castells and Solé-Ollé [8] was undertaken in order to help managers make decisions, and in order to address the need for a system that provided data and information, two metaheuristic algorithms were compared, the first of which was based on a genetic algorithm and the second on the mean variance technique. The work of Salo et al. [9] focused on improved methods of resource allocation compared to the previous work that would facilitate decision analytics by helping public organizations and firms allocate resources for “lumpy” investment opportunities. Resource allocation problems are ubiquitous in governmental organizations and business. In addition to the aforementioned cited works, Luss [10] specifically addressed equitable resource allocation problems with applications in manufacturing, networks, communication, and other areas. Ranjbar et al. [11] investigated the problem of investment project scheduling with the purpose of reducing the related costs and the amount of necessary resources and of avoiding any delay in the implementation of those projects. The project in this case consisted in a set of interrelated activities, and the authors suggested two methods for resolving the project scheduling problem, which are a genetic algorithm and relinking, whereby a schedule is created with a list of priorities. Shadrokh and Kianfar [12] suggested a genetic algorithm to solve the resource investment problem. The performance of this algorithm has been noticed to be satisfactory when compared with that of other algorithms. Castells and Solé-Ollé [8] analyzed the main determinants of investment distribution across different regions. The expected objectives of the investment were set based on a general description of the objectives of the respective country, which focused on effective and fair trade and on the rules arising from political factors. This study employed a dynamic equation that allowed for gradual adjustment and readjustment, and the results indicated that criteria of effective trade played an important part in the geographical distribution of investment within the government infrastructure. Hartmann and Briskorn [13] provided an overview of the aforementioned extensions, which were designed based on the structure of the resource-constrained project scheduling problem (RCPSP). This paper gave a general summary related to the concepts of activity, precedence relationships, and resource constraints and also gave a summary of many lesser-known concepts. Singh [14] developed an algorithm that incorporated the priority of a project with the development of a programmed schedule, with the objective of reducing the project makespan and the penalty cost when some projects have higher priority. The proposed algorithm was shown to be an improvement of the current priority rules. Rodrigues and Yamashita [15] developed a new precise algorithm that solved the cost of the resource availability (RACP) problem in scheduling projects. This new algorithm relied on a hybrid method in which an initial feasible solution was determined heuristically. The branching scheme resolved an RCPSP at each node, where the resources of the RACP were fixed. The new algorithm can be employed for reducing the cost of making resources available. Sanaei et al. [16] studied the firefly algorithm, which was originally developed by Yang to optimize numerical tasks. This method was employed for solving an RCPSP. The difficulty in solving this type of problem arises from a lack of resources and priority of constraints. The algorithm begins by configuring a set of random scheduling solutions, and the primitive scheduling is then repeatedly improved using the firefly algorithm. Qi et al. [17] developed a model for multimode resources that reduced the cost of making resources available for all activities, with the aim of completing all activities of a project on schedule. When the deadline for a project is confirmed, the contractor will attempt to reduce the project cost, and this study proposes an optimization method and a useful heuristic algorithm for identifying the optimal solution. Rahmani et al. [18] proposed a new method for solving an RCPSP based on a differential search. Their approach could provide a set of preliminary schedules. Rabbani et al. [19] suggested a new approach to the multiobjective time-cost constrained RACP. Several other problems related to scheduling problem applications can be cited in [20-23] and [24]. More recently, several works have proposed solutions for the fair distribution problem. For example, Jemmali [25] gave different approximate solutions for the project revenue assignment problem. Another study [26] focused on budget-balancing algorithms for solving the aforementioned problem and gave several algorithms. An application of load balancing of the files on different storage supports was treated in [27].

3. Description of the Problem

The problem analyzed in this paper can be described as follows. Given a set of n industrial investment projects P1 to be undertaken in n different industrial regions Re, each project j can be assigned to only one industrial region i with i={1,…, n}. Each project j with j={1,…, n} can be evaluated by applying three indicators. The number of newly created jobs is denoted by NJ. The objective of this study is to ensure a suitable distribution of projects, and it is essential to define a variable that can be evaluated in order to measure the performance of the project allocation process. In the first approach, the number of new jobs created was set as the objective variable, and the overall objective was to maximize the minimum total number of new jobs created. The question then arises as to whether minimizing the maximum total number of new jobs created yields the same result. When assigning a project j to an industrial region i, the total cumulative number of new jobs created is denoted by t. The total number of new jobs created in each region i after the project allocation is completed is denoted by C. Example 1 illustrates the analyzed problem. The maximum number of new jobs created for all regions is denoted by Cmax, and the minimum number of new jobs created for all regions is denoted by Cmin. For each schedule, the values of C are indexed as follows: C1 ≤ C1 ≤ ⋯≤C.

Example 1 .

We assume that, in the case of industrial investment projects, there are five new projects, i.e., n = 5. In addition, there are two regions, i.e., n = 2. Table 1 displays the number of new jobs created for each project.
Table 1

New jobs created in instance 1.

j 12345
NJ j 16262498
Now, let us suppose that the assignment of projects to regions will be as follows. Projects 1, 4, and 5 are allocated to region 1, while projects 2 and 3 are allocated to region 2. Figure 1 illustrates this allocation.
Figure 1

Assignment in instance 1.

From Figure 1, it can be noticed that there is a large gap between regions 1 and 2, which is equal to C2 − C1=17. This means that more new jobs were created in region 2 than in region 1. The objective is to decrease this gap in order to ensure a better distribution between the regions involved. There is also another arrangement that can be used in order to ameliorate the manner of project assignment with the purpose of achieving a good project distribution. Figure 2 illustrates the new arrangement.
Figure 2

Rearrangement for instance 1.

It can be seen from Figure 2 that the gap between regions is lower compared with the one in the first arrangement shown in Figure 1. After rearrangement, the gap is reduced to three.

Remark 1 .

The gap for each region i is given by Cmax − C.

Proof

The maximum number of new jobs created is Cmax. Thus, for each region, Cmax − C gives the difference between the number of new jobs created in region i and that in the region with the maximum number of new jobs. The indicator that can be measured to minimize the gap between regions is represented by the following formula:

Proposition 1 .

The objective described in equation (1) can be expressed as From equation (1), we have By applying the minimum function to the latter result, we obtain equation (2). Hereafter, the gap between regions that was obtained by using a given heuristic is denoted by gmax=nCmax − ∑C. The objective of this study is to minimize gmax. It is worth mentioning that the constraint applied in this work consists in the fact that the project cannot be divided (i.e., the number of new jobs created by a project cannot be divided across two or more industrial regions). The optimal solution is denoted by gmax, and LBmax denotes the lower bound and UBmax the heuristic value for the P||Cmax problem.

4. Lower Bounds

Two lower bounds are proposed for the analyzed problem. The first is obtained from the optimal solution to P||Cmax and the optimal solution to P||Cmin. Hence, the balance between P||Cmax and P||Cmin will be considered. The second lower bound is obtained from the lower bound on P||Cmax. C denotes the optimal solution for P||Cmax, and C denotes the optimal solution for P||Cmin.

4.1. Min-Max-Based Lower Bound (MML)

This lower bound is determined based on the following proposition:

Proposition 2 .

MML=Cmax − Cmin. The obtained value (MML) is a valid lower bound. In any schedule σ, the value nCmax − ∑C is calculated after Cmax is evaluated. This value is the difference between Cmax and C for each region. This also indicates that the difference between Cmax and C for all the remaining regions (excluding the region that has C=Cmax) will be zero in this case. In addition, σ(max) is the schedule obtained by applying the optimal solution to P||Cmax. In this schedule, it is obvious that no C is greater than Cmax, and hence, this schedule is the optimal one, with Cmax being minimized. Let us suppose that the aforementioned problem may be re-envisaged as follows. Let us assume that, in the ideal case, when the two regions with Cmax and Cmin are excluded, the remaining regions have the same number of new jobs, which is equal to Cmax. In this case, the comparison is only between Cmax and Cmin. Now, if this relaxation is applied while applying the schedule σ(max), the minimum value can be obtained when the value of Cmin is maximized. Thus, the difference will be Cmax − Cmin. Accordingly, σ(max) is then selected, which is the schedule obtained by applying the optimal solution to P||Cmin and the relaxation; the ideal case has a difference of Cmax − Cmin. The proposition can also be written mathematically. For any schedule, Cmin ≤ Cmin can be written as Furthermore, in any schedule, we have Combining equations (5) and (4) yields From equation (6), it can be deduced that Cmax − Cmin is a valid lower bound.

4.2. Cmax-Based Lower Bound (CML)

This lower bound is based on the P||Cmax one. The Cmax-based lower bound, denoted by CML, is based on the following proposition:

Proposition 3 .

For a given lower bound on the P||Cmax problem, LBmax, let CML=nLBmax − ∑NJ. The obtained value for CML is a valid lower bound. From the objective function described in equation (2), we have ∑(Cmax − C)=∑n − ∑C=nCmax − ∑C. Furthermore, for any schedule, we have ∑C=∑NJ. Therefore, When considering the lower bound on the P||Cmax problem, we have From equation (8), after multiplication by a positive integer, we obtain Then, ∑NJ is subtracted from equation (9) to give From equations (7) and (10), it can be deduced that nLBmax − ∑NJ is the lower bound for the problem. In this study, two lower bounds from the literature for the P||Cmax problem are used for developing two versions of CML. Denoted by LBCML and LBCML, these two lower bounds are based on and , respectively, as described by Haouari and Jemmali [2].

5. Approximate Solutions

In this section, certain heuristics developed for the problem under study are presented, and these are compared to demonstrate their performance. The first heuristic is based on the nonincreasing dispatching rule, while the second is based on the nondecreasing dispatching rule. The multifit method is employed for deriving the third heuristic. In the fourth approach, we apply several times the resolution of the subset-sum problem in order to calculate the value of the heuristic. In a similar way, the fifth heuristic will solve a well-known problem iteratively; difference lies in choosing the knapsack problem which is used rather than the subset-sum problem. The last heuristic is based on rescheduling of the most charged and least charged regions to find a better result.

5.1. Heuristic 1: Nonincreasing Number of New Jobs Created (NIJ)

For this schedule, the total number of new jobs created for all projects is placed in a nonincreasing order. Highest NJ is then allocated to the first region on the list, i.e., the one with the lowest number of new jobs created.

Example 2 .

In this example, the same instance cited in Example 1 is considered. Figure 3 shows the schedule after applying NIJ.
Figure 3

Schedule after applying NIJ in instance 1.

5.2. Heuristic 2: Nondecreasing Number of New Jobs Created (NDJ)

In this schedule, the total number of new jobs created for all projects is placed in a nondecreasing order. Lowest NJ is then allocated to the first region on the list, i.e., the one with the lowest number of new jobs created. The schedule obtained after applying NDJ is as follows: region 1: 5,1,2 and region 2: 4,3. Thus, gmax=50 − 33=17.

5.3. Heuristic 3: Multifit Heuristic Based on the Number of New Jobs Created (MFJ)

This approximate solution is inspired by the machine job scheme presented by LEE and David Massey [28]. This method uses the bin-packing method for determining the minimum capacity that will allow all n new jobs created to fit into the m regions. For each bin capacity, the first-fit decreasing (FFD) method is employed for fitting the number of newly created jobs to the bin. Let us assume that the number of new jobs was ordered such that NJ1 ≥ ⋯≥NJ. The FFD method successively assigns the project to the lowest indexed region with the necessary capacity for that respective number of new jobs. Hence, LBmax=max(NJ1, NJ+NJ, [∑NJ/n]), and UBmax is the value achieved by applying the LPT heuristic for P||Cmax. Here, bin denotes the number of bins used after applying FFD, and iter is a predetermined, fixed number of iterations for FFD. In this study, iter=35. The MFJ algorithm is illustrated in Algorithm 1.
Algorithm 1

MFJ algorithm.

Example 3 .

Assume that, in case of industrial investment projects, the number of projects is equal to 10, i.e., n=10. In addition, there are two regions, i.e., n=2. Table 2 presents the number of new jobs created for each project.
Table 2

New jobs created in instance 2.

j 12345678910

Njj 62885521904599798385
For the instance given in Table 2, UBmax=363 and LBmax=354. C=[(363+354)/2]=358. Now, FFD function should be applied for a capacity equal to C and the 10 items represented by instance 2 after arranging them in the nonincreasing order The first bin contains items with weights {79,88,90,99}, while the second contains items with weights {21,45,55,62,83,85}. The first bin has a total load of 356, and the second has a total load of 351, giving gmax=356 − 351=5. Comparing this value with the results obtained using NIJ, we see that the associated gmax is equal to 19. This example shows that the results obtained using MFJ are better than those obtained using NIJ. However, the difference in the number of new jobs created in regions 1 and 2 after applying MFJ is only 5.

5.4. Heuristic 4: Subset-Sum Problem-Based (CSS) Approach

This heuristic is based on a greedy algorithm. It is denoted by (P)(k={1,…, n − 1}) as follows: S1=P and S=S/P, where P is an optimal subset sum for (P)(k=1,…, n − 1). L(S, K) denotes a valid lower bound for P||C on the makespan of a reduced instance, defined by k ≤ n regions and a subset of the number of new jobs created S⊆P. Hence, the first region will be the number of new jobs created until L on (P)1 is reached. The remaining number of new jobs created in the remaining regions will constitute the second problem (P)2, which will consist in solving the new SSP until L is reached, and so on [29]. The subset-sum problem will be solved by using the dynamic programming algorithm developed by Pisinger [30] to solve the subset-sum problem.

5.5. Heuristic 5: Knapsack Problem-Based (CKS) Approach

This heuristic is based on the division of the main problem into m smaller problems, each of which involves scheduling of the projects to regions. In the first region, a knapsack problem is used to assign a project to region 1, and the remaining projects are assigned to the remaining regions using a further knapsack problem, and so on. For each region, a knapsack problem must be solved as follows: F1=P and F=F/P, where P is an optimal knapsack for (kN)(k=1,…, n − 1). w=(|F|/k) × NJ − 1, where |F| is the number of remaining projects, and k is the number of remaining regions. Hence, for the first region, the new jobs created are assigned until [∑NJ/n] on (KN)1 is reached. The remaining number of new jobs created and the remaining regions form the second problem (KN)2 to be solved via a new knapsack problem until [∑NJ/n − 1] is reached, and so on.

5.6. Heuristic 6: Min-Max Iteration Heuristic (MMI)

This heuristic is based on the reformulation of P2||C as a subset-sum problem [29]. In each iteration, the most loaded region r and the least loaded region rmin were fixed, and a subset-sum problem was solved with the projects assigned to rmax and rmin. This procedure was repeated several times, and the best solution was identified. The number of iterations was set to 100 based on an experimental study.

6. Exact Solution

The exact solution was developed using the B & B method.

Corollary 1 .

Minimize[nCmax − ∑C] is equivalent to Minimize Cmax. Proof. For each schedule, ∑C=∑NJ, and the objective of the studied problem can therefore be written as follows: Minimize[nCmax − ∑NJ]. The total number of regions involved m and the summation of all new jobs created ∑NJ are determined in advance and are fixed. This indicates that Minimize[nCmax − ∑NJ] is equivalent to Minimize[nCmax − ∑NJ], which is equivalent to Minimize Cmax.

Lemma 1 .

The optimal solution to the approached problem is Based on the Corollary 1 above, Minimize[nCmax − ∑C] is equivalent to Minimize Cmax. Based on the definition of P||C, the problem is Cmax=Minimize Cmax. Hence, Minimize[nC − ∑C] is equal to nCmax − ∑NJ.

7. Experimental Results

In this section, we present the results given after the implementation of all algorithms coded in C++ using an Intel(R) Xeon(R) CPU E5-2687W v4 @ 3.00 GHz and 64 GB RAM. Several instances and statistical analyses are presented in order to illustrate the results obtained by applying the algorithms described in the previous sections. Different instances were generated which were inspired by Dell'Amico and Martello [31]. The overall instances were based on the choice of n, m, and the type of new jobs created, which is denoted by Class. The selection of the pair (n, n) was as follows: n={10,15,50, 100,200,300} and n={2,3,5,10,20,25,50}. Table 3 shows the fixed distribution of (n, n).
Table 3

Choice of (n, n).

n p n r
102,3,5
152,3,5,10
502,3,5,10,20,25
100,200,3002,3,5,10,20,25,50
The number of new jobs created was calculated according to the 5 distributions in the following, where U[] denotes the discrete uniform distribution and N[] the normal distribution. Class 1: U[20 − 100]; Class 2: U[20 − 500]; Class 3: U[100 − 500]; Class 4: N[50,100]; and Class 5: N[20,100]. For each fixed (n, n, Class), 10 instances were generated, giving 1850 instances in total. The following notations were used: U indicates the best (minimum) value obtained after execution of all heuristics. UB represents the analyzed heuristic. Min is the number of instances for which the analyzed upper bound is equal to U. GAP=(UB − U)/U represents the gap between the value obtained by applying the analyzed heuristic and the one obtained by executing the single best heuristic. AGAP is the average GAP for all heuristics. L is the best (maximum) value obtained after execution of all lower bounds. LB is the lower bound for the problem. Max is the number of instances for which the analyzed lower bound is equal to L. GAP=(L − LB)/LB represents the gap between the value obtained by applying the analyzed lower bound and the one obtained by executing the single best lower bound. AGAP is the average GAP for all lower bounds. Time represents the time required to execute the heuristic or lower bound or exact solution for a corresponding instance. It is measured in seconds and is denoted by “−” if it is less than 0.001 s. Time is the average time for all lower bounds (or heuristics). It is measured in seconds and is denoted by “−” if it is less than 0.001 s. NN is the average number of nodes created in the B & B method. NS is the total number of instances that is unsolved by the B & B algorithm. In this study, the aforementioned analyses are based on several criteria. The main criterion lies in analyzing the results obtained based on the variations in the gap and time according to the variation in n, n, and Class. A comparison between the lower bounds provides information on the time spent and the related gap. The results of comparing the best lower bound and the best heuristic are significant. The average total gap between Min and Max in all 1850 instances is 0.18, with an average time of 19.99 s. This result was obtained based on the performance of the heuristic and lower bounds. Another notable result is the number of times that the lower bound was equal to the upper bound: 1684 from a total of 1850 instances (representing 91.03% of the total number of instances). This indicates that the optimal solution is reached when the lower and upper bounds are calculated. An exact solution is required for only 8.97% of the overall instances. It is worth mentioning that, for the lower bound MML which is based on the solution provided by the B & B method, if the instance is not solved by the B & B method, then the value of Cmax is not determined. In this case, the best lower bound for P||C is chosen. The same procedure is followed for P||Cmin. In Table 4, it is obvious that the gap increases when m increases for MML. This phenomenon is not applicable for LBCML and LBCML. For LBCML, when n=2, GAP=0.06, and when n=5, GAP=1.71, but when n=10, the value of the gap decreases to 0.02. The values of Time, as illustrated by Table 4, do not vary along with other values. The maximum time was 467.640 s and was reached when n=50 for the MML bound. This is because this last lower bound is a consequence of solving two different problems in an exact way. The shortest time was achieved for LBCML and was less than 0.001 s. The total minimum gap was 0.04 and was obtained for n=2.
Table 4

Behavior of GAP and Time according to n.

n r MML LB TV CML LB FS CML Total
GAPlTimeGAPlTimeGAPlTimeAGAPlTimet
20.000.2680.060.0010.060.0020.040.090
30.281.1041.241.230.0010.920.369
51.051.4641.710.0380.480.0461.080.516
103.777.3400.020.0440.000.0601.272.481
207.72419.4150.120.0840.100.1402.65139.880
2511.16316.2030.770.0970.000.1833.98105.495
5023.03467.6402.550.2010.000.4718.53156.104
Other analyses are illustrated in Table 5, which shows the variation in the gap and time according to the number of new jobs created.
Table 5

Behavior of GAP and Time according to n.

n p MML LB TV CML LB FS CML Total
GAPlTimeGAPlTimeGAPlTimeAGAPlTimet
100.530.8143.941.370.0011.950.272
151.123.3261.060.820.0011.001.109
503.12595.6140.740.0010.090.0191.31198.545
1007.70294.1131.700.0060.000.0633.1398.060
2007.6827.4500.000.0310.000.0922.569.191
3008.436.1230.000.0880.000.1652.812.125
5008.298.7630.030.2490.010.3602.783.124
Time increases when n increases for LBCML and LBCML. The smallest gap of zero is obtained for LBCML when n=100,  n=200, and n=300. The maximum gap value, which is 8.413, was obtained for MML when n=300. The total maximum time was 198.545 s for n=50. This explains the nonresolved exact solution provided by employing the B & B method for calculation of MML. Table 6 presents the variation in gap and time according to Class. Table 6 shows that all classes are approximately characterized by the same magnitude of difficulty.
Table 6

Variation of GAP and Time according to Class.

ClassMML LB TV CML LB FS CML Total
GAPlTimeGAPlTimeGAPlTimeAGAPlTimet
15.97149.8720.240.0520.050.0962.0950.007
25.72157.5691.100.0520.330.1092.3852.577
35.86177.2341.850.0650.520.1182.7459.139
45.99144.3490.290.0490.130.1032.1448.167
55.83128.8310.690.0860.040.1422.1943.019
In Table 7, Perc indicates that the percentage of the analyzed lower bound is equal to the best value of all lower bounds for all total number of instances.
Table 7

Comparison between lower bounds.

MML LB TV CML LB FS CML
Max70316761774
Perc38.00%90.59%95.89%
GAPl5.870.840.22
Time151.5710.0610.114
In Table 7, it can be noticed that the best lower bound, which reaches a maximum percentage of 95.89%, is LBCML, with an average gap of 0.22 and an average execution time of 0.114 s. The minimum percentage of approximately 38.00% is obtained by MML, with a gap of 5.87 and an execution time of 151.571 s. Further information on the results obtained is included in Table 8.
Table 8

Lower bound details.

n p n r MML LB TV CML LB FS CML
GAPlTimeGAPlTimeGAPlTime
1020.000.3830.440.44
30.140.9613.673.660.001
51.451.0977.720.000.002

1520.000.3970.000.00
30.381.1880.040.040.001
50.052.6954.173.260.002
104.069.0250.030.000.003

5020.000.2720.000.000.001
30.321.1650.000.0010.000.001
51.221.7780.000.0010.000.003
103.823.9350.000.0010.000.008
202.952070.0170.610.0020.510.027
2510.391496.5193.850.0010.000.076

10020.000.2390.000.000.001
50.901.7190.000.0020.000.007
103.583.4470.000.0040.000.016
208.527.8160.000.0070.000.038
2512.7960.1800.000.0080.000.054
5020.391691.27510.220.0130.000.260

20020.000.2430.000.000.002
51.301.1770.000.0140.000.018
104.023.0650.000.0230.000.043
208.046.7140.000.0360.000.089
2510.468.6380.000.0410.000.115
5022.27144.8660.000.0730.000.287

30020.000.1760.000.000.002
51.401.1340.000.0390.000.045
103.062.8360.000.0650.000.088
2010.986.6010.000.1040.000.178
2511.008.0460.000.1220.000.219
5024.1417.9420.000.1980.000.458

50020.000.1630.000.0060.000.009
51.030.6480.080.2060.080.245
104.1021.7310.100.1720.000.204
208.105.9270.000.2730.000.368
2511.147.6310.000.3150.000.453
5025.3416.480.000.5230.000.881
To show how the analyzed heuristics perform, several tables of results are presented and discussed. The analyses are based on the variations in GAP and Time with n, n, and Class. Table 9 presents the variations in gap and time based on the number of regions.
Table 9

Behavior of GAP and Time according to n.

n r NDJNIJMFJCSSCKSMMI
GAPuTimeGAPuTimeGAPuTimeGAPuTimeGAPuTimeGAPuTime
268.615.452.720.0020.000.000.000.012
3116.2319.8312.150.540.430.000.008
5166.345.404.060.0030.560.760.0010.000.013
10302.9618.4112.360.0040.400.0010.130.0010.000.014
20335.6021.9214.520.0052.730.0013.170.0030.000.014
25371.2724.4615.660.0058.180.0011.320.0030.000.014
50491.6137.9924.190.0068.110.0032.340.0070.000.018
It is obvious in Table 9 that the MMI heuristic has a constant gap of zero for all values of n. The heuristics NDJ and NIJ are much faster than the others, and the necessary execution time for these heuristics is lower than 0.001 s. The maximum gap of 491.61 is achieved for NDJ when n=50. Furthermore, the highest time consumption of 0.018 s is seen for MMI for n=50. The results show that, in almost all cases, MMI provides better results than other heuristics, but there is no dominance amongst the heuristics. Table 10 compares GAP and Time based on n.
Table 10

Behavior of GAP and Time according to n.

n p NDJNIJMFJCSSCKSMMI
GAPuTimeGAPuTimeGAPuTimeGAPuTimeGAPuTimeGAPuTime
1040.598.393.790.0000.550.850.0000.005
1564.6810.926.980.0011.011.180.0000.008
50128.7814.889.430.0012.480.0013.690.0010.0000.010
100252.7023.2715.320.0027.510.0011.410.0020.0000.007
200320.6619.2112.870.0044.410.0010.190.0030.0000.016
300383.5422.0214.390.0060.350.0010.000.0030.0000.024
500402.8814.109.060.0100.120.0020.020.0040.0000.017
As it can be noticed in Table 10, the gap does not vary along with the variation of n values. The time increases according to n for CSS. The results of all executed heuristics are shown in Figure 4, where an increasing curve can be noticed.
Figure 4

Variation of AGAP according to n.

From Table 11, it is vivid that there is a difference between the classes in terms of difficulty.
Table 11

Behavior of GAP and Time according to n.

n p NDJNIJMFJCSSCKSMMI
GAPuTimeGAPuTimeGAPuTimeGAPuTimeGAPuTimeGAPuTime
1101.696.094.400.0050.820.0010.890.0020.0010.378
2432.1224.8012.790.0024.680.0011.360.0020.0010.585
3360.6521.8617.160.0064.960.0011.240.0020.0010.566
4263.5219.539.310.0010.910.0010.890.0020.0010.510
5100.3812.8211.160.0061.460.0010.910.0020.0010.473
For the CSS heuristic, the greatest values for the gap can be noticed for Classes 2 and 3, meaning that these classes are more difficult to solve than the others for CSS. The same behavior was noticed for the CKS, MFJ, NDJ, and NIJ heuristics. It can be concluded that this problem is more difficult to solve for Classes 2 and 3. Furthermore, this heuristic is more time-consuming, reaching 0.585 s for Class 2. In Table 12, it is vivid that the best heuristic is MMI, with the highest percentage of 98.2%, an average gap of zero, and an average execution time of 0.502 s. The lowest percentage that is 14.5% was obtained by NDJ, with a gap of 251.67 and a time of less than 0.001 s. Further information on the results obtained is included in Table 13.
Table 12

Comparison between the heuristics overall in 1850 instances.

NDJNIJMFJCSSCKSMMI
Min268605680150715581817
perct14.5%32.7%36.8%81.5%84.2%98.2%
GAPu251.6717.0210.972.571.060.00
Time0.0040.0010.0020.502
Table 13

Detailed heuristics for all 1850 instances.

n p n r NDJNIJMFJCSSCKSMMI
GAPuTimeGAPuTimeGAPuTimeGAPuTimeGAPuTimeGAPuTime
10276.1013.865.040.0010.000.000.000.004
341.9011.196.320.880.940.000.005
53.770.110.000.761.620.000.006

15269.3415.408.780.000.0010.000.000.007
3162.3223.5516.230.0010.740.360.000.008
524.834.732.930.0013.173.670.000.008
102.220.000.000.0010.140.670.0010.000.009

50254.242.401.040.0010.000.0010.000.000.009
3144.4824.7513.920.000.000.000.010
5176.7612.1310.350.0020.000.000.0010.000.011
10355.6241.1527.370.0011.500.0010.000.0010.000.011
2035.128.753.9110.510.00115.860.0020.000.010
256.480.110.000.0022.840.0016.300.0010.000.008

100259.042.161.360.0030.000.000.000.006
5259.639.717.490.0030.000.0010.000.0010.000.005
10436.9237.0124.130.0020.000.0000.000.0010.000.004
20413.2448.2632.550.0023.120.0010.000.0020.000.006
25339.6342.2726.380.00238.060.0010.280.0030.000.009
507.770.240.000.0033.870.0038.200.0050.000.012

200284.482.361.640.0030.000.0010.000.000.013
5218.624.663.240.0040.000.000.0010.000.015
10240.448.976.200.0040.000.0010.000.0010.000.016
20381.6025.1717.860.0040.000.0010.000.0030.000.017
25541.6435.0522.270.0040.000.0010.000.0030.000.018
50457.1639.0325.990.00326.450.0031.170.0070.000.019

300280.721.000.400.0040.000.000.000.021
5204.783.943.020.0060.000.0010.000.0010.000.023
10431.9512.708.710.0060.000.0010.000.0020.000.023
20269.8611.308.030.0060.000.0010.000.0030.000.023
25458.4926.1316.550.0060.000.0010.000.0040.000.024
50855.4577.0649.650.0072.130.0030.000.0070.000.028

500256.320.960.760.0050.000.0010.000.000.023
5275.982.531.380.0080.000.0010.000.0010.000.021
10350.6210.657.790.0100.740.0010.100.0020.000.018
20578.1816.1010.240.0110.000.0020.000.0040.000.016
25510.1018.7513.090.0110.000.0020.000.0050.000.010
50646.0735.6221.110.0130.000.0040.000.0100.000.014
Now, the experimental study focuses on the gap between the values of the minimum heuristic and the maximum lower bound. This requires a comparison between the best lower bound (L) and the best heuristic (U), denoted by GAP=(U − L)/L. In Table 14, the maximum gap is 0.74, which is obtained when n=50, and the average time is 98.194 s. For n=200, n=300, and n=500, the gap is zero, indicating that the optimal solution is obtained.
Table 14

Variation of GAP according to n.

n p GAPluTime
100.150.274
150.370.547
500.7498.194
1000.0348.519
2000.004.609
3000.001.119
5000.001.632
Figure 5 shows the variation in GAP with the number of new jobs created, which peaks when n=50 and tends asymptotically to zero when n is higher.
Figure 5

Variation of the GAP value according to n.

Table 15 illustrates the impact of the number of regions on the analyzed problem and shows that the maximum gap of 0.86 is reached when n=20. However, a minimum gap of zero is reached for n=2 with a reasonable time of 0.048 s and for n=10 with a time of 1.262 s.
Table 15

Variation of GAP according to n.

n r GAPluTime
20.000.048
30.150.186
50.210.268
100.001.262
200.8687.197
250.0353.175
500.0578.235
Table 16 shows the variation in GAP and Time with Class. This table confirms once again that Classes 2 and 3 are more difficult to solve and require more time for that by comparison with Classes 1, 4, and 5. The maximum gap of 0.23 is obtained for Class 2, and a value of 0.28 is seen for Class 3.
Table 16

Variation of GAP according to Class.

ClassGAPluTime
10.1425.035
20.2326.337
30.2829.617
40.1824.126
50.0621.550
The performance of the exact solution was determined by applying the B & B method, and the time limit settled at 1200 s. If the time required for solving an instance and obtaining the exact solution exceeded this time limit, the resolution process was stopped, and it was noted that the respective instance remained unresolved. Table 17 presents the results obtained by employing the B & B method based on the number of new jobs created. It is vivid that the number of unsolved instances is higher when n=50 with NS=48 and for n=100 with NS=20. The corresponding numbers of nodes are 3,019,072 and 725,107, respectively. However, when the number of new jobs created is greater than 100, the problem is solved rapidly and easily, with a time of 6.572 s for n=500 as compared with a time of 195.614 s for n=50. A total of 70 instances remained unresolved when employing the B & B method which represents 3.78% of the total number of instances that is 1850.
Table 17

Behavior of NS, NN, and Time according to n.

n p g max NJ∗
NS NN Time
100540.805
15017121.239
50483,019,072195.614
10020725,10786.107
200246,38815.409
300016.108
500077786.572
Table 18 shows that the instances that remained unresolved when using the exact method are concentrated for n=20, n=25, and n=50. The maximum value of Time was observed when n=20 and was equal to 179.407 s.
Table 18

Behavior of NS, NN, and Time according to n.

n r g max NJ∗
NS NN Time
2090.267
304271.098
508111.390
10077783.853
20362,344,630179.407
25121,278,25766.588
50221,157,241149.563
Table 19 shows the differences between classes for the exact solution gmax.
Table 19

Behavior of NS, NN, and Time according to Class.

Class g max NJ∗
NS NN Time
115763,86652.394
214510,57351.717
3201,008,23969.979
413582,93746.725
58215,07231.380
In Table 19, it obvious that the maximum number of unsolved instances was obtained for Class 3 with NS=20. Further results obtained by employing the B & B method is presented in Table 20.
Table 20

Branch-and-bound result.

n p n r g max NJ
NS NN Time
1020260.382
301310.956
5061.078

1520330.397
3011491.175
5056651.957
10011.428

502010.272
3011.164
5011.778
10013.934
203611,723,146870.017
25126,391,280296.519

1002010.238
5011.718
10013.447
20017.813
250112.151
50204,350,639491.275

2002010.243
5011.176
10013.064
20016.705
25018.623
502278,32472.643

3002010.176
5011.132
10012.829
20016.586
25018.030
500117.896

5002010.163
5010.888
10046,6648.418
20015.914
25017.615
500116.44
The results included in Tables 17 and 18 motivated an investigation of n and n in order to determine the instances which are the most difficult to solve. A study based on the ratio n/n was carried out. For any ratio n/n for the given instances amongst the total number of 1850 instances, the unsolved instances NS, the average number of nodes NN, and the average value of Time were calculated. For the total number of instances given at the beginning of this section, there were 22 different values of the ratio n/n. The results are illustrated in Table 21. Indeed, Table 21 shows the performance indicators used for measuring the results obtained by the B & B method for variations in the ratio n/n.
Table 21

Ratio n/n results of branch-and-bound analysis.

Index n p/nr NS NN Time
11.5011.428
22323,580,642262.957
32.53611,723,146870.017
43056651.957
53.3301310.956
642139,16242.397
7502943.326
860117.896
97.50330.397
108018.623
1110017.092
1212018.030
1315016.586
1416.66011.164
1520014.132
1625013.093
1730012.829
1840011.176
1950023,3324.328
20100011.132
21150010.176
22250010.163
The maximum average time of 870.017 s was obtained for (n/n)=2.5. It is obvious that the unsolved instances were concentrated for (n/n)=2 and (n/n)=2.5. It can therefore be concluded that these types of instances are the most difficult to solve. Figure 6 shows the concentrated unsolved instances for index 2 and index 3, which are represented in Table 21 and referred to as (n/n)=2 and (n/n)=2.5, respectively.
Figure 6

Variation of NS according to n/m.

The ratio n/n represents the economic scale that can define the hardness of the problem to give a fair distribution of the projects to create new jobs on the regions. The experimental results give an appreciated gap between regions using heuristics. On the contrary, the exact method gives the optimal solution of the problem analyzed. Reducing the gap related to the new job created between regions decreases the unemployment rate in the internal cities of the country and strengthens regional development.

8. Conclusion

The problem analyzed above is of important practical interest as it models real-life situations in the field of economics and finance, such as equal distribution between different regions, and attempts to minimize the differences between them in terms of levels of investment. The solution presented in this paper will therefore be useful from a practical point of view. From a theoretical perspective, the approached problem is a challenging one as it is difficult to solve; thus, its resolution will enhance the theoretical knowledge in this field. The analyses and resolution of this problem may therefore have an important impact on the economy and the industrial sector. In this paper, a comprehensive literature review on resource allocation and optimization studies was presented, and the theory of optimization was applied to a real-life case study to illustrate the importance of this practice. The theoretical and practical analyses are closely related. This study was applied to the industrial sector, and the results achieved by applying heuristics and an exact solution to the analyzed problem demonstrate the importance of optimization applied on the project distribution field. The results contribute to reducing the difference between various industrial regions and to achieving equity among them. More other branching methods can be developed to ameliorate the execution time of the exact solution. The proposed solutions in this paper can be applied to several fields.
  4 in total

1.  Algorithms for the executable programs planning on supercomputers.

Authors:  Abdullah M Algashami
Journal:  PLoS One       Date:  2022-09-26       Impact factor: 3.752

2.  Intelligent algorithms and complex system for a smart parking for vaccine delivery center of COVID-19.

Authors:  Mahdi Jemmali
Journal:  Complex Intell Systems       Date:  2021-10-06

3.  Equity distribution of quality evaluation reports to doctors in health care organizations.

Authors:  Mahdi Jemmali; Loai Kayed B Melhim; Abdullah Alourani; Md Moddassir Alam
Journal:  PeerJ Comput Sci       Date:  2022-01-21

4.  Smart-parking management algorithms in smart city.

Authors:  Mahdi Jemmali; Loai Kayed B Melhim; Mafawez T Alharbi; Abdullah Bajahzar; Mohamed Nazih Omri
Journal:  Sci Rep       Date:  2022-04-20       Impact factor: 4.996

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.