Academia.eduAcademia.edu

A dynamic Stackelberg game for green supply chain management

2015, arXiv (Cornell University)

In this paper, we establish a dynamic game to allocate CSR (Corporate Social Responsibility) to the members of a supply chain. We propose a model of a three-tier supply chain in a decentralized state which includes a supplier, a manufacturer and a retailer. For analyzing supply chain performance in decentralized state and the relationships between the members of the supply chain, we use a Stackelberg game and consider in this paper a hierarchical equilibrium solution for a two-level game. In particular, we formulate a model that crosses through multi-periods with the help of a dynamic discrete Stackelberg game. We obtain an equilibrium point at which both the profits of members and the level of CSR taken up by supply chains is maximized.

A dynamic Stackelberg game for green supply chain management arXiv:1506.06408v1 [math.OC] 21 Jun 2015 Mehrnoosh Khademi a∗ Massimiliano Ferrara Somayeh Sharifi e§ b,d† Mehdi Salimi c,d‡ a Department of Industrial Engineering, Mazandaran University of Science and Technology, Babol, Iran b Department of Law and Economics, Università Mediterranea di Reggio Calabria, Italy c Center for Dynamics, Department of Mathematics, Technische Universität Dresden, Germany d MEDAlics, Research Center at Università per Stranieri Dante Alighieri, Reggio Calabria, Italy e Young Researchers and Elite Club, Hamedan Branch, Islamic Azad University, Hamedan, Iran Abstract In this paper, we establish a dynamic game to allocate CSR (Corporate Social Responsibility) to the members of a supply chain. We propose a model of a three-tier supply chain in a decentralized state which includes a supplier, a manufacturer and a retailer. For analyzing supply chain performance in decentralized state and the relationships between the members of the supply chain, we use a Stackelberg game and consider in this paper a hierarchical equilibrium solution for a two-level game. In particular, we formulate a model that crosses through multi-periods with the help of a dynamic discrete Stackelberg game. We obtain an equilibrium point at which both the profits of members and the level of CSR taken up by supply chains is maximized. Keywords: Dynamic Game; Supply Chain; CSR; Stackelberg Game. 1 Introduction In recent years, companies and firms have been showing an ongoing interest in favor of CSR. This is mainly because of increasing consumer awareness of several CSR issues, e.g. the environment, human rights and safety. In addition, the firms are also forced to accept CSR due to government policies and regulations. Recently CSR has gained recognition and importance as field of research ∗ [email protected] [email protected] ‡ Corresponding author: [email protected] & [email protected] § [email protected] † 1 field [8, 15]. However, the research field still lacks a consistent definition of CSR and this has been the center of discussion since several decades. Dahlsrud [9] presented an overview of different definitions of CSR and summarized the number of dimensions included in each definition. There is a positive correlation between CSR and profit [19, 20]. Moreover, CSR is an effective tool for supply chain management, for coordination, purchasing, manufacturing, distribution, and marketing functions [14]. According to previous studies, the long term investment on CSR is beneficial for a supply chain. Furthermore, a sustainable supply chain requires consideration of the social aspects of the business [23]. Carter et al. [6] established an effective approach and demonstrated that environmental purchasing is significantly related to both net income and cost of goods sold. Carter et al. [7] also pointed towards the importance of CSR in the supply chain, in particular the role played by the purchasing managers in socially responsible activities and the effect of these activities on the supply chain. Sethi [21] introduced a taxonomy in which a firm’s social activities include social obligations as well as more voluntary social responsibility. And, Carroll [4, 5] developed a framework for CSR that consists of economic, legal, and ethical responsibilities. The members of a supply chain take their decisions based on maximizing their individual net benefits. In addition, when they need to accept CSR; this situation leads to an equilibrium status. Game theory is one of the most effective tools to deal with this kind of management problems. A growing number of research papers use game theoretical applications in supply chain management. Cachon et al. [3] discuss Nash equilibrium in noncooperative cases in a supply chain with one supplier and multiple retailers. Hennet et al. [13] presented a paper to evaluate the efficiency of different types of contracts between the industrial partners of a supply chain. They applied game theory methods for decisional purposes. Tian et al. [24] presented a system dynamics model based on evolutionary game theory for green supply chain management. In this paper, we consider a discrete time version of the dynamic differential game. The optimal control theory is the standard tool for analyzing the differential game theory [16]. There are two different types of information structures in a differential game, open-loop and feedback information structures. In an open-loop strategy, the players choose their decisions at time t, with information of the state at time zero. In contrast, in a feedback information structure, the players use their knowledge of the current state at time t in order to formulate their decisions at time t [11]. We formulate a model and study the behavior for decentralized supply chain networks under CSR conditions with one leader and two followers. The Stackelberg game model is recommended and applied here to find an equilibrium point at which the profit of the members of the supply chain is maximized and the level of CSR is adopted in the supply chain. We develop an open-loop Stackelberg game by selecting the supplier as the leader and both the manufacturer and the retailer as the followers. Using this approach, the supplier as a leader, can know the optimal reaction of his followers, and utilizes such processes to maximize his own profit. The manufacturer and the retailer as followers, try to maximize their profits by considering all the conditions. Our model has two levels, at the first level the manufacturer is the leader and the retailer is the follower and we find the equilibrium point. At the second level, we consider the supplier as the leader and the manufacturer as the follower. In fact, we substitute the response functions of the follower into the objective function of the leader and we find the final equilibrium point. We propose a Hamiltonian matrix to solve the optimal control problem to obtain the equilibrium in this game. The paper is organized as follows: Section 2 is devoted to the problem description and assumptions. Objective functions, constraints and solution of the game are illustrated in Section 3. A numerical example is shown in Section 4 and we close with a conclusion in Section 5. 2 2 Problem Description and Assumptions We consider a three stage Stackelberg differential game which has three players playing the game over a fixed finite horizon model. This model is a three-tier, decentralized vertical control supply chain network (Figure 1). All retailers and suppliers at the same level make the same decision. Therefore, consequently the model has only one supplier, one manufacture and one retailer. The simplified model is shown in Figure 2. The dynamic game goes through multi-periods as a repeated game with complete information. This model has a state variable and control variables like any dynamic game. We define the state variable as the level of social responsibility taken up by companies, and the control variables are the capital amounts invested while fulfilling the social responsibility. Specifically, all of the social responsibilities taken up by the firm j at period t can be expressed as the investment Itj . We suppose that xt evolves according to the following rule: xt+1 = f (It , xt ). Figure 1: Three-tier supply chain network More specifically we have the following assumptions: The function Bt (xt ) = δxt represents social benefit which is proportional to social responsibility taken up by the supply chain system [1].   The function Tt = τ It 1 + θ(It ) measures the value of the tax return to the members of the supply chain [10]. Both τ and θ are tax return policy parameters. Specifically, τ is the rate of individual post tax return on investment (ROI), and θ is the rate of supply chain’s post tax return on investment (ROI). The market inverse demand is P M (qt ) = a − bqt [17]. 3 Figure 2: Simplified model of three-tier supply chain network The accumulation of the level of social responsibility taken up by the firms is given by xt+1 = αxt + β1 ItS + β2 ItM + β3 ItR . Here, β1 is the rate of converting the supplier’s capital investment in CSR to the amount of CSR taken up by the supply chain; β2 is the rate of converting the manufacturer’s capital investment in CSR to the amount of CSR taken up by the supply chain and β3 is the rate of converting the retailer’s capital investment in CSR to the amount of CSR taken up by the supply chain as well [22]. 2.1 The General Framework He et al. [12] illustrate an open-loop Stackelberg differential game model over a fixed finite horizon time as detailed in the following: The follower’s optimal control problem is: Z t n o Maxr(·) JR (X0 , r(·); w(·)) = e−ρt πR (X(t), w(t), r(t))dt , (2.1) 0 subject to the state equation Ẋ(t) = F (X(t), w(t), r(t)), X(0) = X0 . (2.2) where the function F represents the rate of sales, ρ is the followers’s discount rate, and X0 , is the initial condition. The follower’s Hamiltonian is HR (X, r, λR , w) = πR (X, w, r) + λR F (X, w, r), (2.3) where λR is the vector of the shadow prices associated with the state variable X; and it satisfies the adjoint equation ∂HR (X, r, λR , w) λ̇R = ρλR − , λR (T ) = 0. (2.4) ∂X 4 The necessary optimality condition for the follower’s problem satisfies ∂HR ∂πR (X, w, r) ∂F (X, w, r) = 0 =⇒ + λR = 0. ∂r ∂r ∂r (2.5) We derive the follower’s best response r ∗ (X, w, λR ). The leader’s problem is n Maxw(·) JM (X0 , w(·)) = Z 0 t o e−µt πM (x, w, r(x, w, λR))dt , Ẋ = F (X, w, r(x, w, λR)), X0 (0) = X0 , ∂HR (x, r(x, w, λR), λR , w) λ̇R = ρλR − , ∂x (2.6) λR (T ) = 0, where µ is the leader’s discount rate and the above differential equations are obtained by substituting the follower’s best response r ∗ (X, w, λR ) into the state equation and the adjoint equation of the follower, respectively. We formulate the leader’s Hamiltonian as follows ∂HR (X, r(X, w, λR ), λR , w) , ∂X (2.7) where λM and µ are the shadow associated with X and λR , respectively, and they satisfy the adjoint equations HM = πM (x, λR , w, r(X, w, λR), λM , ϕ) + λF (X, w, r(X, w, λR)) − µ ∂HM (X, λR , w, r(X, w, λR), λM , w) ∂X ∂πM (X, w, r(X, w, λR)) = µλM − ∂X ∂F (X, w, r(X, w, λR)) ∂ 2 HR (x, r(X, w, λR), λR , w) − λM −µ , ∂X ∂X 2 ∂HM (X, λR , w, r(X, w, λR), λM , ϕ) ϕ̇ = µϕ − ∂λR ∂F (x, w, r(X, w, λR)) ∂ 2 HR (X, r(X, w, λR ), λR , w) = µϕ − λM −µ , ∂λR ∂λ∂X λ̇M = µλM − where λM (T ) = 0 and ϕ(0) = 0 are the boundary conditions. We apply the algorithm of the above general model as part of our model. 2.2 Notations and Definitions To facilitate the model, certain parameters and decision variables are used. Table 1 shows notations and definitions that we use in our model. 5 (2.8) Variables t T qt a b xt HS HM HR JtS JtM JtR B M (xt ) B S (xt ) B R (xt ) T S (xt ) T M (xt ) T R (xt ) ItM ItS ItR d db w z δ δb b δb λ α τ θ β1 β2 β3 Period t Planning horizon Demand quantity at period t Market potential Price sensitivity State variable, degree of taking SR Hamiltonian function of the supplier Hamiltonian function of the manufacturer Hamiltonian function of the retailer Objective function of the supplier Objective function of the manufacturer Objective function of the retailer Social benefit of the supplier Social benefit of the manufacturer Social benefit of the retailer Tax return of the supplier Tax return of the manufacturer Tax return of the retailer The amount of investment done by the manufacturer The amount of investment done by the supplier The amount of investment done by the retailer The percentage of investment of the supplier payoff The percentage of investment of the manufacturer payoff Price of the supplier’s raw material Price of sold product by the retailer Parameter of the supplier’s social benefit Parameter of the manufacturer’s social benefit Parameter of the retailer’s social benefit Quantity discount parameter of the price of raw material Deteriorating rate of the level of current social responsibility The rate of individual post tax return on investment (ROI) The rate of supply chain’s post tax return on investment (ROI) The rate of converting the supplier’s capital investment in CSR to the amount of CSR taken up by the supply chain The rate of converting the manufacturer’s capital investment in CSR to the amount of CSR taken up by the supply chain The rate of converting the retailer’s capital investment in CSR to the amount of CSR taken up by the supply chain Table 1: Notations and Definitions. 3 Objective Functions and Constraints The objective functions are made to depend on the control vectors and the static variable. The members of the supply chain attempt to optimize their net profits, which includes minimizing the cost of raw materials and investment in social responsibility, and maximizing sale revenues and 6 benefits from taking social responsibility as well as tax returns. Thus, the objective function of the supplier is S J = = T X t=1 T X PtS qt − cqt + BtS (xt ) + TtS (ItS , It ) − ItS + dItM wqt − cqt + δx2t + τ ItS [1 + θ(ItS + ItM + ItR )] − ItS + dItM , t=1 where, PtS is the price of the supplier’s raw material. PtS = w. BtS (xt ) is the social benefit of the supplier, δ is the parameter of the supplier’s social benefit and TtS (ItS , It ) is the tax return of the supplier. Similarly, the objective function of the manufacturer is JM = = T X t=1 T X t=1 bR PtM (qt )qt − PtS qt + B M (xt ) + T M (ItM , It ) − ItM + dI t b 2 + τ I M (1 + θ(I S + I M + I R )) − I M + dI b R, (a − bqt )qt − wqt + δx t t t t t t t where PtM (qt ) is the retail price of the product of the manufacturer. BtM (xt ) is the social benefit of the manufacturer, δb is the parameter of the manufacturer’s social benefit and TtM (ItM , It ) is the tax return of the manufacturer. The objective function of the retailer is JR = = T X t=1 T X t=1 PtR qt − PtM (qt )qt + B R (xt ) + T R (ItR , It ) − ItR bb 2 R S M R R zqt − (a − bqt )qt + δx t + τ It (1 + θ(It + It + It )) − It , where PtR is the price at which the retailer sells the product to the consumer. PtR = Z. BtR (xt ) is b the social benefit of the retailer, δb is the parameter of the retailer’s social benefit and TtR (ItR , It ) is the tax return of the retailer. 3.1 Mathematical Model: Level One At this level, we establish a Stackelberg game between the manufacturer as the leader and the retailer as the follower. To calculate the equilibrium at this level, first we calculate the best reaction function of the retailer, then we determine the manufacturer’s optimal decisions based on the retailer’s best reactions. Since our dynamic differential game is an optimal control problem, we can apply the Hamiltonian function to find the equilibrium of the game [21]. Suppose the time interval is [1, T ]. For any fixed ItS and ItM the retailer solves arg max ItR T X PtR qt − PtM (qt )qt + B R (xt ) + T R (ItR , It ) − ItR , t=1 7 subject to xt+1 = αxt + β1 ItS + β2 ItM + β3 ItR . We define the retailer’s Hamiltonian for fixed ItS and ItM as R HtR = JtR + Pt+1 (xt+1 ). By using the conditions for a maximization of this Hamiltonian, we compute: ItR = R 1 − Pt+1 β3 − τ θ(ItM + ItS ) − τ . 2θτ (3.1) The equation of ItR which depends on ItS and ItM , says that for any given strategy of ItS and ItM , there is a unique optimal response ItR . xt+1 = ∂HtR = αxt + β1 ItS + β2 ItM + β3 ItR , R ∂Pt+1 (3.2) and by substituting (3.1) in (3.2), we obtain xt+1 = (β1 − β3 /2)ItS + (β2 − β3 /2)ItM + αxt + β3 R 1 − Pt+1 β3 − τ . 2τ θ (3.3) We also have PtR = ∂HtR b b t + αP R . = 2δx t+1 ∂xt (3.4) The above sets of equations define the reaction function of the retailer. For any fixed ItS the manufacturer solves arg max ItM T X t=1 b R, PtM (qt )qt − PtS qt + B M (xt ) + T M (ItM , It ) − ItM + dI t subject to xt+1 = αxt + β1 ItS + β2 ItM + β3 ItR . Now, we substitute the value of ItR in (3.1) into JtM , and we obtain R b b b b 2 + τ θ − d I S I M + τ − 1 − β3 Pt+1 − d I M − d . JtM = (a − bqt )qt − wqt + δx t t t t 2 2 2θ (3.5) The Hamiltonian function of the manufacturer for fixed ItS is M HtM = JtM + Pt+1 (xt+1 ) + ut (PtR ), (3.6) consequently, we can obtain the unique optimal response of the follower from the equations as follows. 8 R  1 − β3 Pt+1 − τ − τ db ∂HtM M S M + (β2 − β3 /2) Pt+1 , = τ 1 + θ (1 − τ θ)I + I − 1 + t t ∂ItM 2 and we get ItM = Other constraints are M R + (1 + db − τ ) (β2 − β3 /2)Pt+1 −τ θItS + β3 Pt+1 − . 2τ θ τθ PtM = ut+1 = ∂HtM b t + αP M + 2b b t, = 2δx δu t+1 ∂xt M 2 b 3 Pt+1 β3 dβ ∂HtM M = −β /2I − − + αut . 3 t R ∂Pt+1 2τ θ 2τ θ (3.7) (3.8) (3.9) The equation of ItM depends on ItS , and we obtain the final equilibrium in the next section, at level two. 3.2 Mathematical Model: Level Two At the previous level, the manufacturer’s optimal function was calculated by using a reaction function of the retailer. At this level, the reaction functions of two followers (retailer and manufacturer) are inserted into the objective function of the leader (supplier), and we can find the final equilibrium point. The problem facing the supplier is simply given by arg max ItS T X PtS qt − cqt + BtS (xt ) + TtS (ItS , It ) − ItS + dItM , t=1 subject to xt+1 = αxt + β1 ItS + β2 ItM + β3 ItR . The Hamiltonian function of the supplier is defined by S HtS = JtS + Pt+1 (xt+1 ) + µt (PtM ) + ut (pR t ). (3.10) Substitute (3.1) and (3.7) into (3.10), we get the value of ItS , xt+1 and µt+1 ∂HtS = 0, ∂ItS 9 (3.11) therefore ItS = (β3 /2 + β2 − 2β1 ) S (β2 − β3 /2) M β3 R 3 − 3τ − db+ 2d Pt+1 + Pt+1 + Pt+1 + . τθ τθ 2τ θ 2τ θ (3.12) We have xt+1 = ∂HtS , S ∂Pt+1 (3.13) therefore we obtain (−2β2 + β3 )(β2 − β3 /2) M (2β2 β3) − (3β32 ) R xt+1 = αxt + (β1 − β2 /2 − + Pt+1 + Pt+1 2τ θ 4τ θ (3.14) b + 2β3 (1 − τ ) (2β2 − β3 )(1 − τ + d) + . 4τ θ β3 /4)ItS We also have (β2 − β3 /2) S (β2 − β3 /2)(β3 − 2β2 ) S ∂HtS = αµt − It + Pt+1 M ∂Pt+1 2 2τ θ (β2 − β3 /2)d − . τθ µt+1 = (3.15) And we obtain PtS = ∂HtS S b t. = 2δxt + αPt+1 + 2δu ∂xt (3.16) Since we use open-loop information, the structure variables depend on the time variable and the initial state variables. The x1 is given initial parameter, u1 = 0 and µ1 = 0. Furthermore, the R M S boundary condition are Pt+1 = 0, Pt+1 =0 and Pt+1 = 0. 3.3 Augmented Discrete Hamiltonian Matrix In this section for solving the optimal control problem formulated in Section 3.1 and 3.2, we chose an algorithm given by Medanic and Radojevic which is based on an augmented discrete Hamiltonian matrix [18]. First, we assume #      "   e x et+1 x et A B D Ae xt + B Pt+1 + D = = + , e e C D E Pt Pt+1 Cx et + APet+1 + E    S  xt+1 pt+1 , where x et+1 =  ut+1  and Pet+1 =  pM t+1 R µt+1 pt+1 10 A, B, and C are 3 × 3 matrices, and D and E are 3 × 1 matrices, such that        S  xt+1 xt a11 a12 a13 b11 b12 b13 Pt+1 h i M           Pt+1 µt + b21 b22 b13 x et+1 = µt+1 = Ae xt + B Pet+1 + D = a21 a22 a23 R ut+1 a31 a32 a33 ut Pt+1 b31 b32 b33     S M S d1 a11 xt + a12 µt + a13 ut + b11 Pt+1 + b12 Pt+1 + b13 Pt+1 + d1 S M S    + d2 = a21 xt + a22 µt + a23 ut + b21 Pt+1 + b22 Pt+1 + b23 Pt+1 + d2  . M S S d3 + b32 Pt+1 + b33 Pt+1 + d3 a31 xt + a32 µt + a33 ut + b31 Pt+1 (3.17)     1 0 The boundary conditions are x e1 =  0  and PeT +1 =  0 . 0 0   a11 a12 a13  A = a21 a22 a23  , a31 a32 a33 where a11 = α, a12 = 0, a13 = 0, a21 = 0, a22 = α, a23 = 0, a31 = 0, a32 = 0,   b11 b12 b13 B =  b21 b22 b23 , b31 b32 b33 where b11 = b12 = (β1 − β2 /2 − β3 /4)(−2β1 + β2 + β3 /2) . τθ (β1 − β2 /2 − β3 /4)(β2 − β3 /2) (β2 − β3 /2)(−2β2 + β3 ) + . τθ 2τ θ b13 = (β1 − β2 /2 − β3 /4)(β3) (2β2 β3 )(−3β32 ) + . 2τ θ 4τ θ b21 = (β2 − β3 /2)(2β1 − 3β2 + β3 /2) . 2τ θ (β2 − β3 /2)2 . b22 = − 2τ θ β3 (β2 − β3 /2) b23 = − . 4τ θ β3 (−2β1 + β2 + β3 /2) . 4τ θ −β32 + 3/2β3 (β2 − β3 /2) = . 2τ θ b31 = − b32 11 a33 = α. b33 −β32 . = 8τ θ   d1 D =  d2  , where d3 d1 = b + 2β3 (1 − τ ) (β1 − β2 /2 − β3 /4)(3 − 3τ − db + 2d) (2β2 − β3 )(1 − τ + d) + . 2τ θ 4τ θ d2 = (−β2 − β3 /2)(6d − db − 3τ + 3) . 4τ θ d3 = −β3 (−7db + 2d − τ + 1) . 8τ θ Similarly, we can get the values of the matrices C and E      S    S   e1 c11 c12 c13 xt Pt+1 a11 a12 a13 Pt M  +  e2  Pet =  PtM  =  c21 c22 c23   µt  +  a21 a22 a23   Pt+1 R R e3 Pt+1 c31 c32 c33 ut Pt a31 a32 a33   S M R c11 xt + c12 µt + c13 ut + a11 Pt+1 + a12 Pt+1 + a13 Pt+1 + e1 S M R  = c21 xt + c22 µt + c23 ut + a21 Pt+1 + a22 Pt+1 + a23 Pt+1 + e2  . S M R c31 xt + c32 µt + c33 ut + a31 Pt+1 + a32 Pt+1 + a33 Pt+1 + e3 (3.18)     b 2δ 2δb 2δb 0   b    Therefore C =  2δb 0 2δb  and E = 0  . 0 b 2δb 0 0 3.4 Resolution The above problem is solved by the sweep method [2], by assuming a linear relation between pet and xet Thus, we can compute pek = Sk x ek − gk . (3.19) −1 xg et − Bgk+1 + D). k+1 = (I2∗2 − BSk+1 ) (Ax (3.20) 12 Then by substituting (3.19) and (3.20) into the definition of pk+1 as given by the augmented Hamiltonian matrix, and equating both sides we finally get the difference equations: Sk = C + ASk+1 (I2∗2 − BSk+1 )−1 . (3.21) gk = ASk+1 (I2∗2 − BSk+1)−1 + Bgk+1 − D + Agk+1 − E. (3.22)   x1 The boundary conditions are x e1 =  0  0 gT +1 = 03∗1 . and PeT +1   0 =  0 . And then ST +1 = 03∗3 and 0 From the boundary conditions we get ST = C and gT = E. Once we get the different values of Sk and gk by the backward loop, then the values of x et and pet are computed by a forward loop. And, S M R R consequently we get the values of xt , It , It , It , pSt , pM t , pt , for all points in time. 4 Numerical Example In this section we provide a numerical example. We run the following numerical simulations with mathematica 8. The results presented here are obtained for the following values of the parameters: a = 6, w = 3.8, c = 2.4, qt = 100000, d = 0.6, c = 0.00001, d = 0.4, db = 0.4, z = 6, θ = 0.01, and τ = 0.2. We set β1 = 0.3, β2 = 0.5 and β3 = 0.8; and β1 = 0.3, β2 = 0.5. Bt (xt ) = δxt , the potential benefits firms obtain from taking social responsibility, such as increased demand, better b reputation and so on. We set δ = 0.2, δb = 0.2 and δb = 0.2. We assume that the time horizon is T =10. The initial level of social responsibility is supposed to be x1 =1. We draw the results of the equilibrium from our model, a three-stage Stackelberg dynamic game. The figure 3 shows the trend of profits from periods one to ten in a Stackelberg game. JS is the supplier’s profit, JM is manufacturer’s profit and JR is retailer’s profit. We compare the profits of the supplier, manufacturer and retailer over a time horizon, first while playing the game and then, without playing the game. Figure 4 shows the difference in supplier’s profits when playing the game and without playing. JSO is supplier’s profit without playing the game; JS is supplier’s profit when playing the game. As in the first graph, the second and third one (figure 5, 6) shows the difference in manufacturer’s profit and retailer’s profits when playing the game and without playing. JMO is manufacturer’s profit without playing the game; JM is manufacturer’s profit when playing the game. JRO is retailer’s profit without playing the game; JR is retailer’s profit when playing the game. Obviously, all of players gain extra profit from playing the games. Figure 7 compares the cumulated profits of the member’s of supply chain, playing game one and without playing game. In sum, the supplier, manufacturer and retailer are motivated to play the game because their respective benefit increases and the supplier as the leader in the game earns more benefit than the 13 followers. Of course, this result is obtained with a very specific dynamic game model. Another one may give different results. Figure 3: Profits of supplier, manufacturer and retailer. Figure 4: Comparison of the supplier’s profit, playing game one and without playing any game. Figure 5: Comparison of the manufacturer’s profit, playing game one and without playing any game. 14 Figure 6: Comparison of the retailer’s profit, playing game one and without playing any game. Figure 7: Comparison of the cumulated profits of the member’s of supply chain, playing game one and without playing any game. 5 Conclusion In this paper we investigated a decentralized three-tire supply chain consisting of supplier, manufacturer and retailer with the aim of allocating CSR to members of the supply chain system over time. We considered two-level Stackelberg game consisting of two followers and one leader. The members of a supply chain play games with each other to maximize their own profits; thus, the model used was a long-term co-investment game model. The equilibrium point in a time horizon was determined at where the profit of supply chain’s members was maximized and CSR was implemented among members of the supply chain. We applied control theory and used an algorithm (augmented discrete Hamiltonian matrix) to obtain an optimal solution for the dynamic game model. We presented a numerical example and we found that, the benefits of the player increased when they played the game. Acknowledgments. The present research was supported by the MEDAlics, Research Center at Università per Stranieri Dante Alighieri, Reggio Calabria, Italy. 15 References [1] Batabyal, A.A.: Consistency and optimality in a dynamic game of pollution control II: Monopoly, Environmental and Resource Economics, 8, 315-330 (1996). [2] Bryson, A.E., Ho, Y.C.: Applied Optimal Control: Optimization, Estimation and Control, John Wiley and Sons, New York (1975). [3] Cachon, G.P., Zipkin, P.H.: Competitive and cooperative inventory policies in a two-stage supply chain, Management Science, 45, 936-953 (1999). [4] Carroll, A.B.: The pyramid of corporate social responsibility: Toward the moral management of organizational stakeholders, Business Horizons, 34, 39-48 (1991). [5] Carroll, A.B.: A three-dimensional conceptual model of corporate performance, Academy of Management Review, 4, 497-505 (1979). [6] Carter, C.R., Kale, R., Grimm, C.M.: Environmental purchasing and firm performance: An empirical investigation, Logistics and Transportation Review, 36, 219-228 (2000). [7] Carter, C.R., Jennings, M.M.: Social responsibility and supply chain relationships, Transportation Research, 38, 37-52 (2002). [8] Cetindamar, D., Husoy, K.: Corporate social responsibility practices and environmentally responsible behavior: The case of the united nations global compact, Journal of Business Ethics, 76, 163-176 (2007). [9] Dahlsrud, A.: How corporate social responsibility is defined: An analysis of 37 definitions, Corporate Social Responsibility and Environmental Management, 15, 1-13 (2008). [10] Feibel, B.J.: Investment Performance Measurement, John Wiley and Sons, New York, (2003). [11] He, X., Gutierrez, G.J., Sethi, S.P.: A survey of Stackelberg differential game models in supply and marketing channels, Journal of Systems Science and Systems Engineering, 16, 385-413 (2007). [12] He, X., Gutierrez, G., Sethi, S.P.: A review of Stackelberg differential game models in supply chain management, Service Systems and Service Management, (International Conference), 1, 9-11 (2007). [13] Hennet, J.C., Arda, Y.: Supply chain coordination: A game-theory approach, Engineering Applications of Artificial Intelligence, 21, 399-405 (2008). [14] Hervani, A.A., Helms, M.M., Sarkis, J.: Performance measurement for green supply management, Benchmarking: An International Journal, 12, 330-353 (2005). [15] Joyner, B.E., Payne, D.: Evolution and implementation: A study of values, business ethics and corporate social responsibility, Journal of Business Ethics, 41, 297-311 (2002). [16] Kamien, M.I., Schwartz, N.L: Dynamic Optimization: The Calculus of Variations and Optimal Control in Economics and Management, Elsevier, Amsterdam, (1991). 16 [17] Mankiw, N.G.: Principles of Microeconomics, Mason, Ohio: Thomson/South-Western, (2004). [18] Medanic, J., Radojevic, D.: Multilevel Stackelberg strategies in linear-quadratic systems, Journal of Optimization Theory and Applications, 24, 485-497 (1978). [19] Orlitzky, M.: Does firm size confound the relationship between corporate social performance and firm financial performance?, Journal of Business Ethics, 33, 167-180 (2001). [20] Preston, L.E.: The corporate social-financial performance relationship: A Typology and Analysis, Business and Society, 36, 419-429 (1997). [21] Sethi, S.P.: Dimensions of corporate social responsibility: An Analytical Framework, California Management Review, 17, 58-64 (1975). [22] Shi, H.: A Game theoretic approach in green supply chain management, M.S. Thesis, University of Windsor, (2011). [23] Svensson, G.: Aspects of sustainable supply chain management (SSCM): conceptual framework and empirical example, Supply Chain Management, 12, 262-266 (2007). [24] Tian, Y., Govindan, K., Zhu, Q.: A system dynamics model based on evolutionary game theory for green supply chain management diffusion among Chinese manufacturers, Journal of Cleaner Production, 80, 96-105 (2014). 17