Literature DB >> 31042709

A portfolio selection model based on the knapsack problem under uncertainty.

Fereshteh Vaezi1, Seyed Jafar Sadjadi1, Ahmad Makui1.   

Abstract

One of the primary concerns in investment planning is to determine the number of shares for asset with relatively high net value of share such as Berkshire Hathaway on Stock market. Traditional asset allocation methods like Markowitz theorem gives the solution as a percentage and this ratio may suggest allocation of half of a share on the market, which is impractical. Thus, it is necessary to propose a method to determine the number of shares for each asset. This paper presents a knapsack based portfolio selection model where the expected returns, prices, and budget are characterized by interval values. The study determines the priority and importance of each share in the proposed model by extracting the interval weights from an interval comparison matrix. The resulted model is converted into a parametric linear programming model in which the decision maker is able to determine the optimism threshold. Finally, a discrete firefly algorithm is designed to find the near optional solutions in large dimensions. The proposed study is implemented for some data from the US stock exchange.

Entities:  

Mesh:

Year:  2019        PMID: 31042709      PMCID: PMC6493714          DOI: 10.1371/journal.pone.0213652

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Markowitz [1,2] introduced a mean-variance model to select the optimal asset portfolio. The main task of the portfolio selection model is to allocate capital among different assets such that the risk and return are simultaneously optimized in the portfolio. Markowitz model is based on some assumptions that are rarely feasible in reality. Hence, there has been widespread efforts by many researchers to provide methods for analyzing stocks in financial markets and improving these methods in the world (see [3]). The standard mean-variance Markowitz model does not consider the real financial market constraints, including transaction costs [4,5], Cardinality Constraint [6,7], and multi-period Cardinality Constraint Mean-Variance (CCMV) [8-10]. Another problem in the real financial market is that decision-making processes often face with complexity and uncertainty. So, many approaches based on the uncertain conditions have been developed to consider the real condition of financial markets in portfolio selection models, including, a robust-based approach in [11], a scenario-based approach in [12,13], and fuzzy methods in [14,15]. Fuzzy sets was introduced by Zadeh [16] and interval-valued fuzzy sets was presented by Turksen [17]. During the past few years, a number of studies have examined the interval fuzzy uncertainty for the mean-variance Markowitz model [18-20]. The use of interval approaches converts the model into a nonlinear model, therefore, Liu [21] proposed two linear programming approaches to obtain the upper and lower bounds of the interval number of returns in the Markowitz model. In addition, a multi-period MV model with the interval returns, risk and turnover rates of risky assets was proposed by Liu et al. [22,23]. When all or part of the quantitative and accurate data is not available, some qualitative portfolio models under the fuzzy environment was proposed by Zhou and Xu [24]. On the other hand in the Markowitz model, the variance is considered as a risk measurement factor. Variance is known as a symmetric risk measurement factor and has been criticized by many researchers. Thus, other studies also used different risk measures in the portfolio optimization model and then examined the model under interval uncertainty. For example, mean-Value-at-Risk (VAR) portfolio optimization model [25], Mean-Variance-Skewness (MVS) portfolio selection model [26], and mean-semi-absolute deviation portfolio selection model [27-29] are some of the examples which have examined the portfolio optimization under interval uncertainty. Moreover, note that the structure of the decision variable in the mean-variance Markowitz model is a continuous variable. One of particular interest is the finite divisibility of the (stock) assets, i.e. the obligation to buy/sell only integer quantities of asset lots whose number is pre-established. In some cases, we need to properly allocate the number of shares to different assets. Thus, Li et al. [30] introduced a hybrid algorithm based on a convergent Lagrangian and a contour-domain cut method for solving a nonlinear integer portfolio optimization problem with concave transaction costs and cardinality constraint. Corazza and Favaretto [31] presented a hybrid algorithm based on the relaxation method and two rounding approaches for quadratic mixed-integer programming problem based on one riskless asset and n risky assets. Li and Tsai [32] presented a hybrid algorithm based on the dual Lagrangian relaxation and transformation approaches to examine a portfolio selection model with a nonlinear objective function, transaction costs, and integer variables. Bonami and Lejeune [33] proposed a non-linear branch-and-bound algorithm based on a stochastic branching and an integrated dynamic branching for portfolio optimization with integer variables. Anagnostopoulos and Mamanis [34] compared different multi-objective evolutionary algorithms to study a nonlinear mixed-integer three-objective problem with class and quantity limitations. Castro et al. [35] proposed a mathematical algorithm based on various test sets to solve a portfolio selection model with a nonlinear constraint and integer variable. Therefore, it is very important to provide portfolio selection models that can properly allocate the number of shares to different assets. Therefore, in this research, a portfolio selection model based on the knapsack problem with the risk preferences of investors, cardinality constraint, floor and ceiling constraint, and budget constraint is presented. Since it is far from the fact that information data are considered accurate and definitive, all information or part of them is handled using the fuzzy sets or interval values. In this paper, the interval-valued fuzzy sets are used to examine a portfolio optimization problem under uncertain conditions. Also, a discrete firefly algorithm is designed for the solution. A numerical example of the US stock exchange is given to illustrate the application of the proposed model and demonstrates the effectiveness of the designed algorithm for solving the proposed model. The structure of this paper is as follows. In section 2, some preliminary notions such as the interval numbers, interval comparison matrix, and interval weights are defined. In Section 3, the knapsack based portfolio optimization model considering the risk preferences of investors under interval uncertainty is presented. In Section 4, Discrete Firefly Algorithm (DFA) is presented for the proposed optimization model. A numerical example and sensitivity analysis are examined in Section 5. Finally, a brief summary of the paper, some conclusions, and some future studies are given in Section 6.

Preliminaries

In this section, a review of some necessary concepts is presented:

Interval numbers

Interval is represented as A = 〈m(A), w(A)〉 where: Alefeld and Herzberger [36,37] presented the mathematical operators (〈OP〉 ∈ {±,⊗}) between interval numbers and as follows: In addition, the multiplication operator (⊗) between number λ and interval number is defined as follows: Also, the division operator between interval numbers and is as follows: More related information of the interval numbers can be found in [38].

Interval comparison matrix and interval weights

Wang and Elhag [39] introduced a method for extracting interval weights from an interval comparison matrix. In this method, an interval comparison matrix is considered as follows: where l ≥ 0, l ≤ u, l = 1/u, and u = 1/l for i, j = 1,…,n; i ≠ j. A is divided into two Crisp matrices as follows: where A ≤ A ≤ A.Vector is the normalized interval weight vector, which is close to A in the sense that for i, j = 1,…,n; i ≠ j. Vector W is normalized [40] as follows: which can be rewritten as follows: Given that the interval comparison matrix A is equivalent to the weight vector W , A is written as follows: According to (Eq 4): . Therefore, the interval comparison matrix A is rewritten as follows: A is divided into two Crisp matrices as follows: It is easy to prove that AW = W + (n − 1)W and AW = W + (n − 1)W, Where and . Because the judgment of the decision maker may not be exact, the value of the deviations is defined as the following vectors: where E = (ε1,…ε), Γ = (γ1,…,γ) and I is a unit matrix of order n. Finally, the optimization model for extracting the interval weights from an interval comparison matrix is as follows:

Proposed optimization model

Assumptions and presentation

In order to conveniently describe the model, the following notations are used: n, the total number of stocks; k, the desired number of stocks which can be chosen in the portfolio; B, the total available budget; P, the price of stock i R, the return of stock i u, the upper bound of stock i l, the lower bound of stock i. x, the integer variable that represents the number of each stock; y, the binary variable indicating whether stock i is included in the portfolio or not. y = 1, if stock i is included in the portfolio, and y = 0 otherwise; i ∈ {1,2,…,n}.

Model formulation

The basic model is based on the knapsack problem, which is as follows: The target function (Eq (16)) maximizes the sum of the expected returns in the stock portfolio in which the sum of the weighted prices is less than or equal to the budget (Eq (17)). (Eq 18) states that each stock should be only between its lower bound (ly) and its upper bound (uy) in case the asset is chosen, i.e. y = 1. (Eq 19) states that the portfolio should contain a certain number of stocks (k). All financial managers who are active on the biggest financial houses normally consider only selected number of shares, e.g. the shares of 10–12 firms, which means we need to consider cardinality constraints. Most financial managers are interested in investment in any particular company as long as they invest a minimum amount. The main reason is because they have to study all related press news of the companies on a daily basis. In fact, we needs a big investment team to follow all news and reports and literally hundreds of firms are assigned in a portfolio but when there are limited number of firms in the investment, a relatively small group could manage the fund. On the other hand, it is an easy assignment to build an example where Markowitz theorem either focuses on single investment or gives us investment on more than 10–12 firms. In many cases, the results of Markowitz theorem is impractical since one has to invest a small amount of money on one share and big portion of investment needs to be devoted on another one (see [41]). Therefore, (Eq 19) is considered for the proposed portfolio optimization problem that is the cardinality constrains. Note that the structure of the decision variable in the knapsack-based portfolio optimization model (Eq (20)) differs from the decision variable in the mean-variance Markowitz model. In the mean-variance Markowitz model, the decision variable (the weight of each share (w)) is a continuous variable. However, in the knapsack-based portfolio optimization model, the decision variable (the number of each stock (x)) is an integer and/or binary variable [42,43]. In some cases, rounding the output of the Markowitz model may yield an infeasible solution or a very bad approximation to the optimal integer solution (see [31,32,35]). Therefore, this gap disrupts the optimization process and the portfolio selection model based on knapsack problem is proposed to properly allocate the number of shares to different assets. Also, the knapsack-based portfolio optimization model has an advantage over Markowitz theorem. For instance, suppose, there are three assets in a basket with the optimal weights of 0.2, 0.5 and 0.3 and we plan to invest one million dollars in these three shares whose market prices are 345,000$, 1000$ and 1400$, respectively. Consequently, the number of shares is 500, 214.285 and 0.579, respectively. As we can observe, 214.285 is not an integer number and 0.579 is less than one share. Therefore, according to Markowitz theorem; we need to purchase less than one share, which is obviously impractical. By using knapsack type optimization technique, we may be able to find the optimum asset allocation for especial cases with relatively large stock prices. In the following, with regard to the probable risks, uncertainty conditions, and the lack of accurate information on financial markets, fuzzy and interval programming techniques have been used. In this case, instead of using the exact values, the interval values are used such that the lowest and highest expected values of the parameter are placed in the lower and upper bounds of the range, respectively. To determine the estimated values of these limits, you can use the information of past years and consult with experts. On the other hand, the risk and uncertainty conditions are not only specific to the objective function coefficients, but also the technical coefficients and the values of the right-side of the constraints may also have these conditions. Therefore, the expected return of each stock , the price of each stock , and the budget are defined as the interval numbers. Therefore, the model is rewritten as follows: As can be seen, instead of using the exact values of the coefficients, their interval values are used; For example, in the objective function means that the target function coefficient of the i decision variable is possible to vary in the range to and to fluctuate. In the real world, decision-making processes often have complexity and uncertainty. Sometimes it is far from the fact that judgments of decision-makers are considered accurate and definitive. For this reason, all judgments or part of them are considered as interval values or fuzzy numbers. Interval judgments can examine uncertainty in judgments without the intervention of probability distribution functions in the weight extraction models of the pairwise comparison matrix and give closer results to the decision-making in uncertainty conditions. One of the major problems in extracting weights from the interval pairwise matrix is the problem of the incompatibility of matrices containing mental judgments. Therefore, a model used to derive weights from the interval comparison matrix should be able to protect against changes to these minor incompatibilities. Two types of model responses are defined in weight extraction models: point estimates [44] and interval estimates [39]. Point estimates make decision-making processes easy, but do not reflect uncertainty in responses as an interval. Therefore, a definite answer is obtained. Interval estimates show uncertainty in decision-making processes. Also, the length of the estimated intervals can be a criterion of uncertainty. On the other hand, a stable model should be consistent with the mental incompatibilities of the decision maker, which means that it can provide responses close to the responses extracted from the compatible pairwise comparison matrix. Therefore, in order to determine the importance and priorities of each stock in the stock market, the interval weights extracting from an interval comparison matrix are added to the model (M2). Therefore, the model is written as follows: Given these weights, the budget is also proportional to the scale. According to (Eq 2), the multiplication operator between interval values is defined as: In order to conveniently describe the model (M3), the following notations are used: According to Eqs (24–27), the model (M3) is rewritten as follows: Then, the model (M4) is converted into a crisp form using the technique presented by Sengupta [45]. Therefore, a solvable parametric linear programming model is as follows: where α (α ∈ [0,1]) is the data optimism threshold determined by the decision-maker. If the decision-maker is optimistic about the data, α is greater than 0.5 (50%) and otherwise less than 0.5. In fact, α is a controlling factor for data uncertainty determined by the decision-maker. Empirical studies show that it is usually better to choose α = 0.5 (see [46,47]). Finally, the features and advantages of the proposed optimization model are as follows: The knapsack based portfolio selection model can properly allocate the number of shares to different assets. In the proposed optimization model, the interval uncertainty of the parameters is considered in the objective functions and constraints simultaneously. In the other words, the expected returns, prices, and budget are characterized by interval values. The proposed optimization model considers the risk preferences of investors. In the other words, the importance and priority of each share are considered in the proposed optimization model, which are defined as interval values. In the proposed optimization model, the decision-maker is able to determine the optimism threshold. The proposed optimization model is very suitable for the value of a particular share becomes relatively large. Finally, the proposed optimization model based on the knapsack problem maximizes the returns considering the risk preferences of investors, budget constraint, cardinality constraint, floor and ceiling constraints under interval uncertainty.

Discrete firefly algorithm for portfolio selection

The complexity of the knapsack problem is NP-complete [48]. Therefore, in order to solve the proposed portfolio optimization model based on the knapsack problem with arbitrary inputs for large dimensions, we need an approximate algorithm that gives near-optimal solution. Also, a meta-heuristic method is used to solve a large-scale problem. Many studies compared the effectiveness of the firefly algorithm with other algorithms for different types of knapsack problem (see [49-51]). They concluded that firefly algorithm and its branches are a very powerful algorithm for solving the different types of knapsack problems for both static and dynamic environments. Yang [52] introduced firefly-inspired algorithm in 2008. Then in 2010, Sayadi et al. [53] presented Discrete Firefly Algorithm (DFA). Some related researches in portfolio optimization by firefly algorithm can be found in [54,55]. In general, there are three ideal rules for the development of firefly inspired algorithms that are: 1) All fireflies are considered unisex. This means that a firefly will be absorbed into another firefly, regardless of its sex. 2) The degree of attractiveness of a firefly is proportional to its brightness. If the distance between the two fireflies increases, the brightness decreases and consequently the attractiveness decrease. If none of the fireflies are brighter than the other, the firefly will move randomly. 3) The brightness of a firefly is determined by the objective function or affected by it. Therefore: The attractiveness function (β) is defined as follows: where r is the distance, β0 is the attractiveness at r = 0 and γ is a fixed light absorption coefficient. The distance between any two fireflies i and j at locations x and x is defined as follows: D is the dimension of the problem and x is the d − th component of the i − th firefly (x). The movement of a firefly i to another more attractive (brighter) firefly j is defined as follows: where φ is the random parameter, ξ is a vector of random numbers with Gaussian or uniform distributions in [0,1] and k is the iteration number. When a firefly i moves toward firefly j, the position of firefly i changes from a binary number to a real number. Therefore, the following sigmoid function is used to achieve binary position after the displacement. where is the probability of bit considered as 1. The operating steps of this algorithm are as follow: Algorithm 1. Pseudocode of DFA. Suppose that f(X) is the objective function of X = (x1,x2,…,x)T. Assign value for β0, φ, γ and MaxGeneration. Generate initial population of fireflies x for i = 1,2,…,n. Determine the light intensify I at x using f(x). while (t < MaxGeneration) do for i = 1: n all n fireflies do for i = 1: n all n fireflies do if (I < I) then Move firefly i towards j End if Vary attractiveness with distance r using exp(-γr2) Discrete the position of i − th firefly (31) Evaluate new solution (position of i − th firefly) and update light intensify I End for End for Rank the fireflies and find the current global best End while Show result and visualization.

Numerical example

In this section, a numerical example is presented to express the idea of the proposed optimization model. The case study includes the shares of the Dow Jones Industrial Average (DJIA) listed in New York Stock Exchange (NYSE) and covers weekly financial time series data over a period of five years from 18/10/2013 to 18/10/2018 (Appendix A). Also, the minimum and maximum number of each share is generated randomly and interval returns are calculated from the following formula: The upper bound of the return = ((Minimum close price during the period- Minimum value of investment during the period+ Minimum Dividend during the period)/ Minimum value of investment during the period)*100. The lower bound of the return = ((Maximum value of investment during the period- Maximum close price during the period+ Maximum Dividend during the period)/ Maximum close price during the period)*100 Table 1 shows the information on the minimum and maximum number, prices and returns of each share.
Table 1

Research data.

symbolCompany nameP_iP¯iR¯i%R_i%liui
S1BAThe Boeing Company102.099394.2796.4572.4632048
S2GEGeneral Electric Company11.21033.0001.7711.094166180
S3MMM3M Company120.709259.7692.2500.9661560
S4PGThe Procter & Gamble Company65.01994.6695.8482.06234100
S5KOThe Coca-Cola Company36.56048.6192.4200.989125167
S6AAPLApple Inc.70.507233.4703.6062.8865060
S7AXPAmerican Express Company50.270111.7694.9751.1353342
S8UTXUnited Technologies Corporation83.389144.1493.3051.9487580
S9CVXChevron Corporation69.580135.1009.4771.96189101
S10JNJJohnson & Johnson81.790148.3208.2961.2623545
S11NKENIKE, Inc.34.92486.0402.4160.946111167
S12UNHUnitedHealth Group Incorporated66.720272.8091.7451.6751215
S13MSFTMicrosoft Corporation33.570116.184.7761.9494563
S14IBMInternational Business Machines Corporation116.900199.2104.2052.8632140
S15TRVThe Travelers Companies, Inc.79.889150.5501.9041.2722588
S16MRKMerck & Co., Inc.44.61972.8892.3232.15671125
S17XOMExxon Mobil Corporation66.550104.7208.6081.646100120
S18WMTWal-Mart Stores, Inc.56.299109.9801.9471.0454750
S19GSThe Goldman Sachs Group, Inc.138.199275.3092.9681.9725590
S20CATCaterpillar Inc.56.360173.2406.8632.1652345
S21VVisa Inc.48.564151.5591.4501.2993340
S22CSCOCisco Systems, Inc.20.25049.4704.9692.363142176
S23HDThe Home Depot, Inc.75.480215.4292.5311.9163366
S24JPMJPMorgan Chase & Co.50.070119.3305.3681.7705272
S25PFEPfizer Inc.27.51045.8102.7612.012142167
S26MCDMcDonald’s Corporation87.500178.6993.2030.7562555
S27VZVerizon Communications Inc.38.06056.95012.3952.35299167
S28INTCIntel Corporation23.40057.5992.7891.434142167
S29DISThe Walt Disney Company65.980122.0802.7252.6314153
S30DWDPDowDuPont Inc.35.11077.08010.6800.67655120
The first step in the proposed approach is to obtain an interval comparison matrix. Therefore, a detailed questionnaire is prepared which can be found in Appendix B. Questionnaire form was sent to three experts via email. In this study, an expert is someone who has the information and operational experience on the New York Stock Exchange. In addition, the interval compare n matrix of DJIA can be found in Appendix C. The second step in the proposed approach is to extract the interval weights from an interval comparison matrix. For this purpose, interval weights are obtained using the Wang’s approach [39] and are reported in Table 2.
Table 2

Extracting interval weights from the interval comparison matrix.

Weight of shares
WS1=[0.074,0.095]WS2=[0.005,0.006]WS3=[0.055,0.065]
WS4=[0.020,0.024]WS5=[0.006,0.008]WS6=[0.042,0.054]
WS7=[0.024,0.030]WS8=[0.028,0.032]WS9=[0.028,0.032]
WS10=[0.032,0.038]WS11=[0.006,0.009]WS12=[0.068,0.079]
WS13=[0.016,0.022]WS14=[0.037,0.047]WS15=[0.032,0.039]
WS16=[0.008,0.014]WS17=[0.015,0.020]WS18=[0.020,0.021]
WS19=[0.062,0.075]WS20=[0.033,0.042]WS21=[0.025,0.030]
WS22=[0.006,0.007]WS23=[0.054,0.059]WS24=[0.014,0.019]
WS25=[0.006,0.007]WS26=[0.052,0.059]WS27=[0.006,0.010]
WS28=[0.006,0.007]WS29=[0.019,0.024]WS30=[0.013,0.018]
In the following, the optimization model with the interval weights, the interval prices and the interval expected returns is solved in GAMS software. Table 3 shows the results of the exact solution of the proposed optimization model based on the different values of the uncertain parameter (α = {0,0.1.0.3,0.5,0.7,1}) in small dimensions.
Table 3

Exact solution.

GAMS solution
α = 0α = 0.1α = 0.3α = 0.5α = 0.7α = 1
K = 6E(R) = 49.234% X4 = 36; X9 = 97; X16 = 72; X17 = 100; X27 = 100; X30 = 100.E(R) = 50.679% X4 = 46; X9 = 100; X16 = 72; X17 = 100; X27 = 100; X30 = 100.E(R) = 53.378% X4 = 76; X9 = 100; X16 = 71; X17 = 100; X27 = 100; X30 = 100.E(R) = 55.782% X4 = 100; X9 = 100; X16 = 80; X17 = 100; X27 = 100; X30 = 100.E(R) = 57.744% X4 = 75; X9 = 100; X10 = 35; X17 = 100; X27 = 100; X30 = 100.E(R) = 60.813% X4 = 99; X9 = 100; X10 = 40; X17 = 100; X27 = 100; X30 = 100
K = 7E(R) = 46.974% X4 = 34; X9 = 89; X13 = 45; X16 = 71;X17 = 100; X27 = 99;X30 = 64.E(R) = 49.254% X4 = 34; X9 = 89; X13 = 45; X16 = 71; X17 = 100; X27 = 100;X30 = 86.E(R) = 52.455% X4 = 36; X9 = 98;X13 = 45; X16 = 71;X17 = 100; X27 = 100;X30 = 100.E(R) = 54.981% X4 = 58; X9 = 100; X13 = 45; X16 = 71;X17 = 100; X27 = 100;X30 = 100.E(R) = 57.038% X4 = 80; X9 = 100;X16 = 71; X17 = 100;X24 = 52; X27 = 100;.X30 = 100.E(R) = 59.925% X4 = 79; X9 = 100;X10 = 35; X16 = 73;X17 = 100; X27 = 100; X30 = 100.
K = 8E(R) = 45.625% X4 = 86; X7 = 33;X13 = 45; X16 = 71;X17 = 100; X24 = 52;X27 = 100; X30 = 100.E(R) = 46.860% X4 = 98; X7 = 33; X13 = 46; X16 = 73; X17 = 100; X24 = 52;X27 = 100; X30 = 100.E(R) = 49.484% X4 = 34; X7 = 33;X9 = 89; X13 = 45;X16 = 71; X17 = 100;X27 = 99; X30 = 60.E(R) = 53.303% X4 = 34; X7 = 33;X9 = 89; X13 = 45;X16 = 71; X17 = 100;X27 = 99; X30 = 98.E(R) = 55.749% X4 = 34; X9 = 99;X13 = 45; X16 = 71;X17 = 100; X24 = 52;X27 = 100;. X30 = 100E(R) = 58.682% X4 = 68; X7 = 33;X9 = 99; X16 = 71;X17 = 100; X24 = 52;X27 = 100; X30 = 100.
K = 9E(R) = 43.762% X4 = 34; X7 = 33; X13 = 45; X16 = 71;X17 = 100; X20 = 23;X24 = 52; X27 = 100;.X30 = 87.E(R) = 45.431% X4 = 38; X7 = 33;X13 = 45; X16 = 71;X17 = 100; X20 = 20;X24 = 52; X27 = 100;X30 = 100.E(R) = 47.906% X4 = 42; X7 = 33;X10 = 35; X13 = 45;X16 = 72; X17 = 100;X24 = 52; X27 = 100;X30 = 100.E(R) = 50.061% X4 = 66; X7 = 33;X10 = 35; X13 = 45;X16 = 71; X17 = 100;X24 = 52; X27 = 100;X30 = 100.E(R) = 52.480% X4 = 34; X7 = 33; X9 = 89; X13 = 45; X16 = 71; X17 = 100; X24 = 52; X27 = 99;X30 = 57.E(R) = 57.077% X4 = 36; X7 = 33;X9 = 89; X13 = 45;X16 = 72; X17 = 100;X24 = 52; X27 = 100;X30 = 100.
K = 10E(R) = 36.058% X4 = 34; X7 = 33;X13 = 45; X16 = 71;X18 = 47; X20 = 23;X24 = 52; X27 = 99;X29 = 41; X30 = 71.E(R) = 40.326% X4 = 34; X7 = 33;X13 = 45; X16 = 71;X17 = 100; X18 = 47;X24 = 52; X27 = 100;X29 = 41; X30 = 56.E(R) = 44.145% X4 = 34; X7 = 33;X13 = 45; X16 = 71;X17 = 100; X18 = 47;X24 = 52; X27 = 100;X29 = 41; X30 = 94.E(R) = 47.032% X4 = 34; X7 = 33;X13 = 45; X16 = 71;X17 = 100; X20 = 23;X24 = 52; X27 = 100;X29 = 41; X30 = 96.E(R) = 49.702% X4 = 36; X7 = 33; X10 = 35; X13 = 45; X16 = 71; X17 = 100; X24 = 52; X27 = 100; X29 = 41; X30 = 100.E(R) = 52.426% X4 = 66; X7 = 33;X10 = 35; X13 = 45; X16 = 71; X17 = 100; X24 = 52; X27 = 100; X29 = 41; X30 = 100.
As shown in Table 3, the solution of the model (M5) is not unique. This means that different values of α yield various solutions. When α is equal to zero, the expected return of the portfolio is minimum. When the level of data satisfaction (α) increases, the expected return of the portfolio will also increase. When α is equal to one, the expected return of the portfolio is maximum. For instance, the expected return of the portfolio with cardinality k = 6 lies in [0.49234 0.60813] based on different values of α (α ∈ [0,1]). The proposed portfolio selection model based on the knapsack problem is a mixed-integer optimization problem. Therefore, the exact methods such as the implementation of GAMS software package does not yield appropriate answers for large-size problems. In order to use the proposed model in the large scale, it is necessary to use the meta-heuristic algorithms. Therefore, Discrete Firefly Algorithm (DFA) is used. To select appropriate parameters, the Taguchi method is used for five parameters of DFA (Number of iterations, Number of fireflies, Randomization parameter, Absorption coefficient, Attractiveness). Finally, the selected parameters of this algorithm are attractiveness (β0 = 1), randomization parameter (φ = 0.2), absorption coefficient (γ = 1), number of iterations (MaxGeneration = 500), and number of fireflies (Population = 30). The results of discrete firefly algorithm in small dimensions are reported in Table 4. Then, the results of the exact method are compared with the results of DFA and are reported in Table 5:
Table 4

DFA solution.

tInit. assign. no.DFAsolution
K = 6α = 0α = 0.1α = 0.3α = 0.5α = 0.7α = 1
150.971%52.527%53.048%55.618%55.289%57.867%
250.939%50.895%52.507%54.336%55.936%57.923%
349.451%50.331%52.352%55.624%56.744%57.788%
449.828%52.194%52.804%55.419%56.680%58.604%
550.987%51.459%53.029%54.610%55.638%58.441%
Mean50.435%51.481%52.748%55.121%56.057%58.125%
SE Mean0.0030.0040.0010.0020.0020.001
S.D.0.0070.0090.0030.0060.0060.003
K = 7α = 0α = 0.1α = 0.3α = 0.5α = 0.7α = 1
153.469%54.087%54.431%56.192%58.116%57.534%
252.830%54.220%55.039%57.179%58.503%59.046%
353.923%53.005%54.424%57.781%56.723%58.635%
453.580%53.969%53.859%56.617%57.617%59.180%
553.282%53.656%55.539%54.825%57.236%59.041%
Mean53.417%53.787%54.658%56.519%57.639%58.687%
SE Mean0.0010.0020.0020.0050.0030.003
S.D.0.0040.0040.0060.0110.0070.006
K = 8α = 0α = 0.1α = 0.3α = 0.5α = 0.7α = 1
153.160%53.692%54.438%57.332%59.896%61.281%
251.837%56.064%57.462%57.531%59.483%62.910%
352.511%52.538%55.797%58.439%59.061%62.089%
453.437%53.928%57.652%58.694%59.821%60.880%
553.823%54.777%54.765%58.648%58.930%60.379%
Mean52.954%54.200%56.023%58.129%59.438%61.508%
SE Mean0.0030.0050.0060.0020.0010.004
S.D.0.0070.0130.0140.0060.0040.010
K = 9α = 0α = 0.1α = 0.3α = 0.5α = 0.7α = 1
153.268%56.171%56.016%57.217%60.927%61.928%
254.554%56.412%58.045%59.243%59.971%61.676%
352.942%54.454%58.006%57.593%58.478%62.929%
452.866%55.073%57.444%58.143%59.191%62.821%
552.616%54.091%56.257%58.585%60.941%62.020%
Mean53.249%55.240%57.154%58.156%59.902%62.275%
SE Mean0.0030.0040.0040.0030.0040.002
S.D.0.0070.0100.0090.0080.0100.005
K = 10α = 0α = 0.1α = 0.3α = 0.5α = 0.7α = 1
154.988%55.175%56.752%59.166%60.544%63.510%
252.911%52.341%56.883%58.985%61.030%62.007%
353.477%53.014%57.139%58.885%59.531%63.767%
453.942%54.696%57.481%59.369%60.470%63.151%
551.117%55.491%55.189%58.697%61.679%63.087%
Mean53.287%54.143%56.689%59.020%60.651%63.104%
SE Mean0.0060.0060.0030.0010.0030.003
S.D.0.0140.0130.0080.0020.0070.006
Table 5

Comparison of the exact solution with DFA solution.

Mean solution of DFAGAMS solutionS.D.SE MeanP-Value*
K = 6α = 050.435%49.234%0.0080.0060.008
α = 0.151.481%50.679%0.0050.0040.005
α = 0.352.748%53.378%0.0040.0030.004
α = 0.555.121%55.782%0.0040.0030.004
α = 0.756.057%57.744%0.0110.0080.010
α = 158.125%60.813%0.0190.0130.015
K = 7α = 053.417%46.974%0.0450.0320.042
α = 0.153.787%49.254%0.0320.0220.029
α = 0.354.658%52.455%0.0150.0110.013
α = 0.556.519%54.811%0.0120.0080.010
α = 0.757.639%57.038%0.0040.0030.003
α = 158.687%59.925%0.0080.0060.007
K = 8α = 052.954%45.652%0.0510.0360.048
α = 0.154.200%46.860%0.0510.0360.047
α = 0.356.023%49.489%0.0460.0320.040
α = 0.558.129%53.303%0.0340.0240.028
α = 0.759.438%55.749%0.0260.0180.021
α = 161.508%58.682%0.0200.0140.015
K = 9α = 053.249%43.762%0.0670.0470.063
α = 0.155.240%45.431%0.0690.0490.063
α = 0.357.154%47.906%0.0650.0460.057
α = 0.558.156%50.061%0.0570.0400.048
α = 0.759.902%52.480%0.0520.0370.043
α = 162.275%57.077%0.0360.0260.028
K = 10α = 053.287%36.058%0.1210.0860.124
α = 0.154.143%40.326%0.0970.0690.094
α = 0.356.689%44.145%0.0880.0620.080
α = 0.559.020%47.032%0.0840.0590.073
α = 0.760.651%49.702%0.0770.0540.064
α = 163.104%52.426%0.0750.0530.060

* denotes rejection of the hypothesis at the 0.01 level.

* denotes rejection of the hypothesis at the 0.01 level. As shown in Table 5, Standard Deviations (SD) and Standard Errors of the Mean (SE Mean) between GAMS solutions and DFA solutions lie in [0.004 0.121] and [0.003 0.086], respectively. These results are very reasonable and show that DFA solutions are very close to the exact solutions in small dimensions. Also, as shown in S1 Fig, as the data optimism threshold increases, the results of the two approaches converge more into each other. Therefore, for α greater than 0.5 (α≥0.5), the results of the DFA are more valid. Finally, these results indicate the validity of the used discrete firefly algorithm for the proposed portfolio selection model. Therefore, this algorithm can be used to solve the proposed optimization model in larger dimensions as well. Therefore, the sensitivity analysis and the set of answers in small and large dimensions are reported in Table 6:
Table 6

Set of answers in small and large dimension.

DDFA SolutionGAMS Solution
α = 0α = 0.1α = 0.3α = 0.5α = 0.7α = 1α = 0α = 0.1α = 0.3α = 0.5α = 0.7α = 1
Low -DK = 650.435%51.481%52.748%55.121%56.057%58.125%49.234%50.679%53.378%55.782%57.744%60.813%
K = 753.417%53.787%54.658%56.519%57.639%58.687%46.974%49.254%52.455%54.811%57.038%59.925%
K = 852.954%54.200%56.023%58.129%59.438%61.508%45.652%46.860%49.489%53.303%55.749%58.682%
K = 953.249%55.240%57.154%58.156%59.902%62.275%36.058%40.326%44.145%47.032%49.702%52.426%
K = 1053.287%54.143%56.689%59.020%60.651%63.104%36.058%40.326%44.145%47.032%49.702%52.426%
High- DK = 1152.664%55.066%56.087%59.616%62.251%63.694%------
K = 1250.432%52.244%55.373%58.155%60.899%63.898%------
K = 1348.351%49.467%52.614%55.060%57.667%61.688%------
K = 1447.935%48.951%49.584%52.767%55.157%57.829%------
K = 1544.075%45.247%47.652%49.948%52.576%55.531%------
According to Table 6, the proposed discrete firefly algorithm is appropriate for solving the proposed portfolio selection model based on the knapsack problem with arbitrary inputs (different cardinalities) in large dimensions. Therefore, in the large-scale optimizations with a relatively large portfolio, the proposed discrete firefly algorithm is used.

Conclusions

In this paper, we have presented a portfolio selection model based on the knapsack problem under interval uncertainty, in which the expected returns, prices, and budget are characterized by interval values. We have also considered the importance and the priority of each share in the portfolio optimization model by extracting the interval weights from an interval comparison matrix. Finally, we have converted the proposed model into a crisp form and a solvable parametric linear programming model using the technique presented by Sengupta [45] in which the decision maker is able to determine the optimism threshold. The proposed optimization model has maximized the returns of the investment by taking into account the risk preferences of investors and budget, cardinality, floor and ceiling constraints under interval uncertainty of the parameters in the objective functions and constraints, simultaneously. The proposed model has the ability to determine the optimal portfolio of assets. In fact, the main contribution of this work is to present a portfolio selection model based on the knapsack problem, which eliminates the mentioned gaps in the Markowitz theorem. The portfolio selection model based on knapsack problem is proposed to properly allocate the number of shares to different assets. In addition, by using knapsack type optimization technique, we may be able to find the optimum asset allocation for especial cases with relatively large stock prices. A discrete firefly algorithm has also been designed to handle the complexity of the problem. A numerical example of the US stock exchange has been given to illustrate the application of the proposed model and demonstrated the effectiveness of the designed algorithm for solving the proposed model. Numerical results have shown that the proposed optimization model and the implementation of discrete firefly algorithm may yield promising results. Finally, the inclusion of other constraints in the proposed model such as class constraint and chance constraints and the inclusion of risk measurement factors such as variance and Value-at-Risk (VaR) will be an interesting extension of the proposed model and is regarded as the future scope of the study. Fig A. Comparison of the exact solution with DFA solution. Fig B. Comparison of the exact solution with DFA solution. Fig C. Comparison of the exact solution with DFA solution. Fig D. Comparison of the exact solution with DFA solution. Fig E. Comparison of the exact solution with DFA solution. (TIFF) Click here for additional data file.

Research data.

(PDF) Click here for additional data file.

Extracting interval weights from the interval comparison matrix.

(PDF) Click here for additional data file.

Exact solution.

(PDF) Click here for additional data file.

DFA solution.

(PDF) Click here for additional data file.

Comparison of the exact solution with DFA solution.

(PDF) Click here for additional data file.

Set of answers in small and large dimension.

(PDF) Click here for additional data file.

Pseudocode of DFA.

(PDF) Click here for additional data file.

Appendix A.

Data related to the article. (XLSX) Click here for additional data file.

Appendix B.

Survey form. (PDF) Click here for additional data file.

Appendix C.

Interval comparison matrix. (PDF) Click here for additional data file.
  1 in total

1.  Firefly algorithm for cardinality constrained mean-variance portfolio optimization problem with entropy diversity constraint.

Authors:  Nebojsa Bacanin; Milan Tuba
Journal:  ScientificWorldJournal       Date:  2014-05-29
  1 in total
  2 in total

1.  Constrained portfolio optimization with discrete variables: An algorithmic method based on dynamic programming.

Authors:  Fereshteh Vaezi Jezeie; Seyed Jafar Sadjadi; Ahmad Makui
Journal:  PLoS One       Date:  2022-07-28       Impact factor: 3.752

2.  Financial Network Analysis on the Performance of Companies Using Integrated Entropy-DEMATEL-TOPSIS Model.

Authors:  Kah Fai Liew; Weng Siew Lam; Weng Hoe Lam
Journal:  Entropy (Basel)       Date:  2022-07-31       Impact factor: 2.738

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.