Академический Документы
Профессиональный Документы
Культура Документы
Merouane Debbah
Alcatel-Lucent Chair
SUPELEC
Gif-sur-Yvette, France
Email: merouane.debbah@supelec.fr
I. I NTRODUCTION
Due to the ever increasing need for higher data rates, it
is envisaged that a two-tier cellular networks, comprising of
a central macrocell underlaid with short range femtocell hotspots offer an economical way to improve cellular capacity [1].
Therefore, wireless operators are on the verge of augmenting
the macrocell network with femtocell networks through which
higher throughput gains, better indoor coverage and lower
transmit power are expected.
Before femtocells can be rolled out by mobile operators, a
number of technical issues should be tackled such as interference avoidance, security, billing etc. Clearly, interference is
detrimental in femtocell networks due to cross-tier interference
in which macrocell users experience interference coming from
different femto BSs (in the downlink) while femtocell user
perceives significant interference from loud macrocell users
within the vicinity of the femtocell. Moreover, two types of
femtocell deployments are envisioned: closed access (CA)
where a macrocell user cannot be served by a femtocell base
and open access (OA) where a macrocell user can be handed
in to the femtocell. This work assumes CA which means only
licensed home users within radio range can communicate with
their own femtocell.
185
Fig. 1. Closed Access (CA) scenario in which the interference inflicted from
macro-users to femto-BS is accounted for and vice-versa. Here, only K = 2
femto-cells are illustrated.
K
i = 1, ..., K
n = 1, ..., N
(1)
j=1
pni Pi ,
i = 1, ..., K
(2)
n=1
pni |hnii |2
K
2 + j=1,j=i pnj |hnji |2
(3)
1 Due
186
Ri =
N
n=1
(4)
max Ri = max
pi
pi
s.t.
N
N
log2
n=1
|hnii |2 pni
1+
K
2 + j=i |hnji |2 pnj
pni Pi
n=1
pni 0
(6)
=
A
hierarchical
(Stackelberg)
game
G2
{K, {P}iK , {ui }iK } is proposed to model the spectrum
sharing problem. Motivated by the fact that interference
coming from femto-BSs at the macro-users has to be
minimized, we adopt the framework of hierarchical games
wherein more priority is given at the macro-BS. The spectrum
sharing game is modeled as a Stackelberg game in which
the macro-BS is the leader and the femto-BSs are the
followers. The rationale behind is that the macro-BSs being
the primary networks are first deployed in the network
whereas (secondary) femto-BSs are more of a random and
uncoordinated nature.
The Stackelberg Equilibrium (S.E) [19] is the best response
where a hierarchy of actions exists between players. Backward
induction [5] is applied assuming that players can reliably
forecast the behavior of other players and that they believe
that the other player can do the same.
Assuming that MBS 1 is cast as leader and the FBSs (FBS
2,...,FBS K) as followers in the game, the optimal strategy
of the interferers (femto-BSs) is taken into account in the
optimization problem. Moreover, the optimization problem for
the Macro BS is written as:
N
|hn11 |2 pn1
log2 1 +
(7)
max
p1
2 + j=1 |hnj1 |2 pn,SE
j
n=1
s.t.
N
pn1 P1
n=1
p1 0
pn,SE
j
= BRj (pn,SE
, .., pn,SE
and
1
j ) and is a function of p1
(hence the objective function in (7) is non-convex), BRj (.) is
the best response function of player j.
The leader (MBS 1) maximizes its achievable rate while
taking into account the strategy of the followers (i.e., femtoBSs) and the spectrum sharing game G2 boils down to solving
(7). First, the femto-BSs locally optimize their utility function
using waterfilling technique, the solution of which is given as:
+
2 + i |hnii |2 pni
1
i = 2, ..., K n = 1, ..., N
pni =
i
|hnii |2
(8)
where (x)+ = max{x, 0} and i > 0 is the Lagrangian
multiplier chosen to satisfy the power constraint for femtoBSs i.
It turns out that finding the equilibrium of the leader is
not straightforward because its utility function is non-convex
(7). Nevertheless, there exists sub-optimal and low-complexity
methods to solve the problem. To this end and motivated by
the work of [17], we use lagrangian duality theory wherein the
duality gap [18] provides a nice tool for solving non-convex
optimization problem.
187
= max
p1
N
n=1
log2
(9)
n=1
(10)
which is in other words, the empirical frequency (or probability) that follower j takes action Aj,t
f given that the leader takes
action Atleader . Furthermore, whenever the leader takes action
Atleader , the propensity vector rfj,t (Af |Aleader ) is updated as
follows:
j,t1
j,t1
t
rfj,t (Aj,t
(Afj,t1 |At1
) (11)
f |Aleader ) = rf
leader )+I(Af
j,t
188
3 Note
rit (Ati )
t
t
AAi ri (Ai )
Sit (Ati ) =
35
(12)
30
rit (Ati )
Ati at
)I(At1
)
rit = rit1 + (1 )ui (At1
i
i
where
represents the tendency of player i choosing
time-slot t. In addition, player i updates the
action
)
tendency vector based on the experienced utility ui (At1
i
is taken in time-slot t 1 as follows:
when action At1
i
(13)
N = 10
N=5
N = 15
25
20
15
10
V. S IMULATION RESULTS
4 This is referred to as the Braess paradox [22], [23], which implies that
increasing the space of strategies of each player, i.e., the number of BSs each
player can use, ends up degenerating the global performance of the network
10
15
number of base stations
20
25
Fig. 3. Total sum-rate of the network versus the number of base stations K,
for different values of carriers N .
Centralized
12
Stackelberg
10
sumrate of the network
Selfish
10
15
number of base stations
20
25
VI. CONCLUSION
In this paper, we studied the problem of coexistence between a macro-cell and underlaid femto-cells sharing the same
spectrum. The problem is investigated from a game theoretic
perspective where each base station is modeled as a player
in the game who decides in a distributed way the strategy of
allocating its total power across a set of carriers. Both of the
non-cooperative and hierarchical spectrum sharing scenarios
are investigated. It turns out that for a given region with
an arbitrary number of base stations, there exists an optimal
number of base stations to be deployed in the network. In
addition, it is shown that the hierarchical approach outperforms
the non-cooperative approach, hence bridging the gap between
the selfish and centralized approach. The second contribution
of the paper looks at the framework of machine learning with
public information exchange among players, in which case the
leader computes an estimate of the strategies of the followers
189
20
withlearning (case 1)
withlearning (case 2)
nolearning
18
16
14
12
10
8
6
4
2
0
10
time
15
20
Fig. 5. Achievable rate of the leader with and without learning. By exchange
public information (case 1), the leader improves his payoffs compared to the
no-learning case as well as case 2 (reinforcement learning).
[14] B. Wang, Z. Han and K. J. Ray Liu, Stackelberg game for distributed
relay selection and power control for multiuser cooperative communication networks, to appear IEEE Transactions on Mobile Computing,
2009.
[15] A. Ozgur, O. Leveque and D. Tse Hierarchical Cooperation Achieves
Optimal Capacity Scaling in Ad Hoc Networks, IEEE Transactions on
Information Theory, vol. 10, October 2007.
[16] W. Saad, Z. Han, M. Debbah and A. Hjorungnes, A Distributed Merge
and Split Algorithm for Fair Cooperation in Wireless Networks, in proc.
IEEE ICC, May 2008.
[17] W. Yu and R. Lui, Dual methods for nonconvex spectrum optimization
of multicarrier systems, IEEE Transactions on Communications, vol. 54,
No. 7, 2006.
[18] S. Boyd and L. Vandenberghe, Convex Optimization. Cambridge University Press, 2004.
[19] V. H. Stackelberg, Marketform und Gleichgewicht, Oxford, University
Press, 1934.
[20] L. Busoniu, R. Babuska and B. D. Schutter, A comprehensive survey
of multiagent reinforcement learning, IEEE Trans. Systems, Man and
Cybernetics, Part C, vol.38, no.2, pp.156172, March 2008.
[21] J. Hu and M. P. Wellman, Multiagent reinforcement learning: Theoretical
framework and an algorithm, in Proc. 15th Int. Conf. Mach. Learn.
(ICML), July 1998.
[22] D. Braess. Uber ein paradoxon aus der verkehrsplanung. Unternehmensforschung, 24(5):258-268, May 1969.
[23] S. Medina Perlaza, E. V. Belmega, S. Lasaulce and M. Debbah, On
the base station selection and base station sharing in self-configuring
networks, Fourth International Conference on Performance Evaluation
Methodologies and Tools, Pisa, Italy, Oct. 2009.
[24] H. Li, Multi-agent Q-Learning of Channel Selection in Multi-user
Cognitive Radio Systems: A Two by Two Case, IEEE Conference on
System, Man and Cybernetics (SMC), 2009.
[25] M. Bennis, S. Lasaulce and M. Debbah, Inter-operator spectrum sharing
from a game theoretical perspective, EURASIP Journal of Advances
in Signal Processing (JASP), Special issue on dynamic spectrum access for wireless networking, Aug. 2009, 1-12, Article ID 295739,
doi:10.1155/2009/29573.
[26] Z. Han, C. Pandana and K. Liu, Distributive opportunistic spectrum
access for cognitive radio using correlated equilibrium and no-regret
learning, in Proc. of IEEE Wireless Communications and Netwowrking
Conference (WCNC), 2007.
[27] W. Yu, W. Rhee, S. Boyd and J. Cioffi Iterative Water-filling for
Gaussian Vector Multiple Access Channels, IEEE Transactions on
Information Theory, vol. 50, no. 1, pp.145-151, January 2004.
190