[PPT] - Stochastic Control for Energy Efficient Resource Allocation in PowerPoint Presentation

SLIDE 1

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope

Stochastic Control for Energy Efficient Resource Allocation in Wireless Networks

Abhijeet Bhorkar (01D07014)

under the guidance of

Prof. A. Karandikar

Indian Institute of Technology Bombay, Powai, Mumbai-76 July 05, 2006

Abhijeet Bhorkar Resource Allocation in Wireless Networks

SLIDE 2

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope

Outline

1

Introduction and Scope of the Thesis

2

Power Optimal Scheduling Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link

Learning Algorithms: Overview Problem Formulation

Energy Efficient Video Transmis- sion

3

Summary and Future Scope

Abhijeet Bhorkar Resource Allocation in Wireless Networks

SLIDE 3

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope

Motivation

Wireless LANs, ad-Hoc networks, sensor networks limited battery life, bandwidth Maintain acceptable QoS metric rate, delay, fairness Efficient utilization of limited resources Time varying wireless channel conditions Exploit channel variations

pportunistic scheduling :

Schedule user with the best channel condition

Can we perform power control to exploit channel variations? Joint opportunistic and energy efficient control

Abhijeet Bhorkar Resource Allocation in Wireless Networks

SLIDE 4

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope

Motivation

Wireless LANs, ad-Hoc networks, sensor networks limited battery life, bandwidth Maintain acceptable QoS metric rate, delay, fairness Efficient utilization of limited resources Time varying wireless channel conditions Exploit channel variations

pportunistic scheduling :

Schedule user with the best channel condition

Can we perform power control to exploit channel variations? Joint opportunistic and energy efficient control

Abhijeet Bhorkar Resource Allocation in Wireless Networks

SLIDE 5

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope

Motivation

Wireless LANs, ad-Hoc networks, sensor networks limited battery life, bandwidth Maintain acceptable QoS metric rate, delay, fairness Efficient utilization of limited resources Time varying wireless channel conditions Exploit channel variations

pportunistic scheduling :

Schedule user with the best channel condition

Can we perform power control to exploit channel variations? Joint opportunistic and energy efficient control

Abhijeet Bhorkar Resource Allocation in Wireless Networks

SLIDE 6

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Scenario

✁

✂ ✂ ✂ ✂ ✄ ✄ ✄ ✄ ☎ ☎ ✆ ✆ ✝ ✝ ✝ ✝ ✝ ✝ ✞ ✞ ✞ ✞ ✞ ✞

Power optimal scheduling Minimum rate guarantee (Multi-user) Fairness guarantee (Multi-user) Average delay guarantee with finite buffer (point to point link)

Abhijeet Bhorkar Resource Allocation in Wireless Networks

SLIDE 7

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Preliminaries(1)

Shannon’s capacity:

P = N0W x

eu/W −1
,

N0 : Spectral density of AWGN channel W : Spectrum bandwidth u : Transmission rate

Stochastic approximation:

λ(n+1) = λ(n)+α(n)(H(λ(n))+M(n+1))(1) E[h(λ,x)] = H(λ) Martingale M(n+1) = h(λ(n),x(n))−H(λ(n)) If step sizes satisfy ,

∞

∑

n=0

α(n) = ∞,

∞

∑

n=0

α(n)2 < ∞

then (1) tracks the Ordinary Differential equation (ODE),

˙ λ(t) = H(λ(t)) (2)

Abhijeet Bhorkar Resource Allocation in Wireless Networks

SLIDE 8

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

System Model

✂✁☎✄✝✆✟✞ ✠☛✡✌☞✎✍✑✏ ✓✒✔✄✝✆✟✞ ✕✗✖✙✘☎✚✜✛ ✕✗✖✙✘☎✚✣✢ ✕✗✖✙✘☎✚✥✤ ✦ ✁ ✄✝✆✟✞ ✦ ✒ ✄✝✆✟✞ ✦✑✧ ✄✝✆✟✞ ★✪✩✬✫ ✘☎✭✯✮✯✰✱✘☎✚ ✲✳✫✵✴✔✶✯✶ ✘✷✰ ✲✹✸✺✶ ✭✼✻✾✽✙✻ ✸✿✶ ✖ ✄❁❀ ✸❃❂ ✘✷✚✣❄✣✘ ✩ ✻❅✖❆✻ ✸✺✶ ✞ ✄✝❇ ✁ ✄✝✆✟✞❉❈❊❇ ✒ ✄✝✆✟✞❉❈●❋❍❋●❋●❈❊❇ ✧ ✄✝✆✟✞■✞ ✄ ✕✗✖✬✘✷✚ ★ ✘❉✰✱✘ ✩ ✽✙✻ ✸✺✶ ✞ ✄❑❏ ✁ ✄✝✆✟✞❉❈▲❏ ✒ ✄❑✆▼✞✷❈✔❋❍❋❍❋✔❈▲❏ ✧ ✄✝✆✟✞■✞ ◆ ✁ ✄✝✆✟✞ ◆ ✒❖✄✝✆✟✞ ◆ ✧ ✄❑✆▼✞

Figure: Single hop system model

Slotted single-hop TDMA system Uplink scheduling Perfect channel state information Channel process ergodic (i.i.d. or Markovian)

Abhijeet Bhorkar Resource Allocation in Wireless Networks

SLIDE 9

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Problem Formulation

Minimize average power

min limsup

M→∞

1 M

M

∑

n=1

q(n),

Subject to average rate constraints Ci

limsup

M→∞

1 M

M

∑

n=1

Ui(qi(n),xi(n))) ≥ Ci ∀i, q(n) ≥ 0,

N

∑

i=1

yi(n) ≤ 1 ∀n (3)

U is information theoretic rate and is concave differentiable function of xi, qi U = log(1+xiqi) x = (x1,x2,··· ,xN) y = (y1,y2,··· ,yN)

Abhijeet Bhorkar Resource Allocation in Wireless Networks

SLIDE 10

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Problem Formulation

Minimize average power

min limsup

M→∞

1 M

M

∑

n=1

q(n),

Subject to average rate constraints Ci

limsup

M→∞

1 M

M

∑

n=1

Ui(qi(n),xi(n))) ≥ Ci ∀i, q(n) ≥ 0,

N

∑

i=1

yi(n) ≤ 1 ∀n (3)

U is information theoretic rate and is concave differentiable function of xi, qi U = log(1+xiqi) x = (x1,x2,··· ,xN) y = (y1,y2,··· ,yN)

Abhijeet Bhorkar Resource Allocation in Wireless Networks

SLIDE 11

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Multiuser Optimal Solution

Proposition Optimal Policy for multiple users is to select kth user and transmit with power q∗ Sketch of Proof Use ergodicity to convert optimization problem (3) in continuous domain Minimize Lagrangian of (3) w.r.t. q first, then w.r.t. y Optimal power for single user, q∗

i =

λi − 1

xi

+ ,where λi is the Lagrange multiplier Minimizing w.r.t. y, we get, k = argmin

i (q∗ i −λi [log(1+q∗ i xi)−Ci])

Details Abhijeet Bhorkar Resource Allocation in Wireless Networks

SLIDE 12

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Online Algorithm (1)

After minimizing over the primal variables, optimal value of Lagrangian is,

F(λ) = [E(min

i (q∗ i −λi log(1+q∗ i xi(n))−Ci)]

F(λ) strictly concave → unique maximum Need to find saddle point

Consider for example f(x) is continuous differentiable Gradient ascent scheme for maximizing f is,

xn+1 = xn +αn ˙ f(xn)

It tracks the differential equation,

˙ x(t) = d f dx = ˙ f

Abhijeet Bhorkar Resource Allocation in Wireless Networks

SLIDE 13

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Online Algorithm (2)

Estimate λi online

Channel Measurement

ptimal power

with Update Parameter user Find the Optimal power, Transmit

Figure: Block diagram for on-line policy

Update Equation

λi(n+1) = {λi(n)−α(n)[yi(n)log

1+
λi(n)−

1 xi(n) + xi(n)

]−Ci
hi(λ)

}+ ∀i, (4)

Abhijeet Bhorkar Resource Allocation in Wireless Networks

SLIDE 14

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Optimality and Stability of Update Equation

Iterations converge to differential inclusion almost surely to ,

˙ λ(t) = h(λ(t))

and thus to a supergradient ascent scheme

˙ λ(t) ∈ ∂F(λ(t))

∂F supergradient of F

Stability Boundedness of λi using projection method or linear stochastic approximation method

Abhijeet Bhorkar Resource Allocation in Wireless Networks

SLIDE 15

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Simulations

0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2 x 10

4

5 10 15 20 25

Lagrange Multiplier λi No of iterations n

User 1 User 2 User 3 User 4

Figure: Convergence for Markovian channel α = 0.3, C = (0.6,0.8,0.7,0.2), λ(0) = (1,1,1,1) and σ = (1,1,0.9,0.3)

5 10 15 1 1.5 2 2.5 3 3.5

No of Users Ratio of power for optimal policy to round robin policy

Figure: Gain of the optimal policy over variable power round robin policy, C=0.6, γ = 0.7

Abhijeet Bhorkar Resource Allocation in Wireless Networks

SLIDE 16

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Temporal Fairness

Time as resource Different users receive different time resource

Long term fair: Proportional share on long run

Starvation HOL blocking

Short term fair: Proportional share on finite window size M

Abhijeet Bhorkar Resource Allocation in Wireless Networks

SLIDE 17

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Temporal Fairness

Time as resource Different users receive different time resource

Long term fair: Proportional share on long run

Starvation HOL blocking

Short term fair: Proportional share on finite window size M

Abhijeet Bhorkar Resource Allocation in Wireless Networks

SLIDE 18

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Temporal Fair scheduler

Long term temporal fair

φi temporal share to user i on infinite run liminf

M→∞

1 M

M

∑

n=1

E yi(n) ≥ φi ∀i. Modified update equation

λi(n+1) =

λi(n)−a(n)
yi(n)log
1+
λi −

1 xi(n) + xi(n)

−Ci

+ λ ′

i (n+1)

= [λ ′

i (n)−a(n)(yi(n)−φi)]+

∀i

Short term temporal fair

φi temporal share on finite window of M Heuristic: Elimination policy Remove the user if its share is exhausted

Abhijeet Bhorkar Resource Allocation in Wireless Networks

SLIDE 19

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Simulations

0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2 x 10

4

2 4 6 8 10 12 14 16 18

Lagrange Multiplier λi No of iterations (n)

User 1 User 2 User 3 User 4

Figure: Trajectory of λi(n) φ = (0.3,0.4,0.2,0.1) Figure: Power required for the short term and long term fairness

Short term Thoughput Abhijeet Bhorkar Resource Allocation in Wireless Networks

SLIDE 20

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Preliminaries(2)

Average cost Markov Decision Process:

Path wise Average Cost = limsup

N→∞

E

1

N

N−1

∑

n=0

c(sn,un)

φ∗ +h(s) = min

u∈U {c(s,u)+P(¯

s|s,u)h(¯ s)} Bellman equation with unique h(s) exists iff, h(s) = min

u∈U

c(s,u)+Eh(¯

s)−h(s0)

(5)

where, s ∈ S: State of the process u ∈ U : Action taken c : Immediate cost ¯ s : Next state h : Difference value function φ : Average cost s0 ∈ S : Some arbitrary state P(¯ s|s,u) :Transition matrix (kernel)

Abhijeet Bhorkar Resource Allocation in Wireless Networks

SLIDE 21

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Reinforcement Learning

✁

✂ ✂ ✂ ✂ ✄ ✄ ✄ ✄ ☎✝✆✟✞✡✠☞☛✍✌ ✎ ☛✍✏✑✞ ✒ ✞✟✓✔✞✖✕ ✗ ☛✍✘✙✠☞✆✟✚✜✛✣✢✤✓✥✘✙✦✧✓✔✞✡✠☞☛✍✌ ★ ✎✪✩ ✠✫✞✡✠☞✆✭✬ ★ ☎✝✆✟✞✖☛ ✩ ✬ ✗ ☛✍✘✙✠☞✆✟✚✯✮✱✰✳✲ ✩ ☛✴✢✤✕✵✰✶✕✷✌✸✞ ✎✺✹ ✓✥✌✻✌✼✕✵✘

Learn how to take action in each state to minimize the cost

Abhijeet Bhorkar Resource Allocation in Wireless Networks

SLIDE 22

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

System Model and Problem Formulation

✂✁ ✄ ✁ ☎ ✁ ✆ ✁ ✝✟✞✡✠☞☛✌☛✎✍✑✏ ✒✌✁ ✓

Figure: Transmitter of a point to point link

Optimization problem

Minimize P = limsup

N→∞

1 N

N

∑

n=1

P

n,

(6) Subject to D− ¯ D ≤ Delay constraint (7) ε − ¯ ε ≤ 0a Drop probability constraint

aWe denote the constraint by ‘¯’ .

Abhijeet Bhorkar Resource Allocation in Wireless Networks

SLIDE 23

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission n=1 n=0 n=k n=k+1 n=k+2

✂✁☎✄✝✆ ✞ ✁ ✟✠✁ ✡ ✞☛✁ ☞✍✌ ✄✝✆✏✎✒✑✔✓✖✕✘✗ ☞✙✌✛✚

✌

✄✝✆✢✜✣✟ ✌✥✤✧✦✩★

Figure: System Model

By Little’s law (7) is,

Qavg − ¯ a ¯ D ≤ 0 where, ε = limsup

N→∞

∑N

n=0 dn

max(0,Qn −un +an+1 −B)

∑N

n=0 an

¯ a = aavg (1−ε) (Effective arrival rate) Qavg = limsup

N→∞

1 N

N

∑

n=0

Qn, aavg = limsup

N→∞

1 N

N

∑

n=0

an

Abhijeet Bhorkar Resource Allocation in Wireless Networks

SLIDE 24

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Discrete State Space Markov Decision Formulation : Post Decision

The immediate cost of the constrained MDP is given by,

cn = P

n +λ1(Constraint1)+λ2(Constraint2)

Constraint1 : (Qn − ¯ D(an+1 −dn)) Constraint2 : (dn − ¯ εan+1)

cn is convex function of un and sn. Hence no duality gap. Post decision state ˜ s :State after decision is taken Bellman Equation (5) using the post-decision state is given as,

˜ hn+1(˜ s) = E

min

u∈U

Interchange of min and E operator

c(s,λ,u)+ ˜

hn+1(˜ ¯ s|˜ sn)− ˜ h(˜ s0)

Abhijeet Bhorkar

Resource Allocation in Wireless Networks

SLIDE 25

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Two time scale update

✂✁☎✄✝✆✟✞✡✠☞☛✍✌✏✎✑✞✒✂✓✕✔✖✁✗✞ ✘ ✔✖✁☎✙✚✞✜✛✢✙✤✣✤✓✦✥✧✌✏✄★✣ ✞✡✩✪✔✖✁✏✙✫✔✬✥✭✌☎✄★✣ ✮ ✔✪✯✰✥✱✞✡✠✍☛✍✌☎✎✲✞✳✂✓✴✔✪✁✏✞ ✞★✵✷✶✫✵✹✸✻✺ ✼ ✽✿✾❁❀❃❂✦❄✷✽✡❅ ❆ ✔✪✶★✠✭✔✖✣✚✶★✞❈❇❊❉●❋✫✔✬✥✱✞ ✓✕✄★✣✤✯❍✥✧✔✖✣■✥ ✞★✵✷✶✫✵❑❏▲✺ ✼ ✽ ▼✞✡✞❖◆✚P✏◗

Post learning: Faster time scale

˜ hn+1(˜ s) = ˜ hn(˜ s)+α(ν(˜ s,n))I{˜ sn = ˜ s}

min

u∈U

c(s,λ n,u)+ ˜

hn(˜ ¯ s)

− ˜

hn(˜ s0)− ˜ hn((˜ s)

∀˜

s

Lagrangian update: Slower time scale

λ1n+1 = Γ1[λ1n +β(n)(Qn − ¯ D(an −dn))] λ2n+1 = Γ2[λ2n +β(n)(dn − ¯ εan)] Γ1,Γ2 are projection operators.

Abhijeet Bhorkar Resource Allocation in Wireless Networks

SLIDE 26

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Simulations

5 10 15 20 25 30 35 40 0.04 0.045 0.05 0.055 0.06 0.065 0.07 0.075

Delay Constaint Power (in watts)

Figure: Power delay curve with finite state space

Abhijeet Bhorkar Resource Allocation in Wireless Networks

SLIDE 27

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Continuous State Space Formulation

How to deal with large buffer and continuous channel? Approximate the value function:

h(x) =

K

∑

i=1

fi(x)ri, where, f = [f1, f2,··· fK] :Feature vectors e.g. [1, Q, x, Qx] r = [r1,r2,···rK] : Weights

Details Abhijeet Bhorkar Resource Allocation in Wireless Networks

SLIDE 28

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Joint Source and Channel Coding

✂✁ ✄ ✂☎✝✆✟✞ ✄ ✂☎✝✆✟✞ ✞✡✠☞☛ ✌ ✞✍☎✎✠✎✏✂✑✒☛✓✞✕✔ ✄ ✂☎✝✆✟✞ ✖ ✁✎✗✘✞✍☎✝✆✘✔ ✙ ✆✟✗✘✞✍☎✝✆✘✔ ✄ ✂☎✝✆✟✞ ✚ ✛✔✜✆✢✏✣✆✢✤✜✤ ✥✧✦ ✑✕✁✎✁✝✆✟✏ ✙ ✆✢★✩✞✍☎✎✠✎✏✂✑✒☛✓✞✕✔

Device an online algorithm to find: Quantization qk per macro block (MB) Transmission rate uk for each MB and minimize transmission energy subject to distortion and absolute delay constraint per MB

Abhijeet Bhorkar Resource Allocation in Wireless Networks

SLIDE 29

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Finite Horizon MDP Optimization Model

✂✁☎✄✝✆✟✞✡✠☞☛✍✌ ✂✁☎✄✝✆✟✞

Figure: kth MB transmission at nth slot

M :No of MB in Frame δk: Delay of kth MB Dk : Distortion of kth MB min

uk,qk E M

∑

k=0

Lk+n−1

∑

l=n

P(xl,uk)Tc |xn

such that 1

M E∑

k

DM−1

k=0 ≤ 1

M Dmax (8) δk ≤ Tmax,∀k, (9) Formulate as finite Horizon MDP with immediate cost: cn = En +λDn .

Abhijeet Bhorkar Resource Allocation in Wireless Networks

SLIDE 30

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Finite horizon to Infinite Horizon

Learning algorithms specifically for infinite horizon Join horizon M to horizon 0 to get infinite cycle Modified learning for finite horizon

✁

✂ ✄✆☎ ✝✞☎ ✝✠✟ ✄✡✟ ✝☞☛ ✄✌☛

Figure: Finite horizon to infinite horizon

Learn Online Reinforcement-Learning

Q Learning Abhijeet Bhorkar Resource Allocation in Wireless Networks

SLIDE 31

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Simulation

1500 2000 2500 3000 3500 4000 4500 5000 5500 110 115 120 125 130 135 140

Average Distortion per frame Average Ebergy per frame (J)

Figure: Power-Distortion curve

Abhijeet Bhorkar Resource Allocation in Wireless Networks

SLIDE 32

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope

Summary

Considered joint opportunistic and power optimal solution for minimum rate guarantee Proposed power optimal temporal fair scheduling Proposed average delay constrained power optimal scheme using MDP formulation. Proved convergence of the online policies Investigated the issue of power optimal video transmission

ver wireless

Abhijeet Bhorkar Resource Allocation in Wireless Networks

SLIDE 33

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope

Future Research Direction

More practical, discrete rate scheduler needs to be designed for minimum rate guarantee Definition of fairness in fading is an open issue Convergence of function approximation algorithms using multiple policies is unresolved Power optimal variable packet length online scheduling algorithm for video transmission can be considered

Abhijeet Bhorkar Resource Allocation in Wireless Networks

SLIDE 34

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope

Thank you

Abhijeet Bhorkar Resource Allocation in Wireless Networks

SLIDE 35

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope

Using ergodicity from (3), min

ν(dx1,···dxN) ∑

y∈A

[0,∞) p1(dq|y,x)p2(y|x)q,

subject to

ν(dx1 ···dxN) ∑

y∈A

[0,∞) p1(dq|y,x)p2(y|x)log(1+qyixi)

≥ Ci ∀i, q ≥ 0. (10) The Lagrange function associated with (10) is, f(p1, p2,λ) ∆ =

ν(dx1 ···dxN) ∑

y∈A

[0,∞) p1(dq|y,x)p2(y|x)
q−∑

i

λi [log(1+qyixi)−Ci]

.

(11) ν Joint distribution of channel p1, p2 conditional distributions Minimizing w.r.t. p1, p2, F(λ) = min

p1,p2 f(p1, p2,λ) Back Abhijeet Bhorkar Resource Allocation in Wireless Networks

SLIDE 36

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope

Short Term Thoughput Fair

Thoughput: MTCi for window of slot M, each slot duration T State :{r(n),x(n)}

r(n) Residual thoughput at slot n x(n) Channel state vector u(n) Transmission vector at slot n

DP formulation

V(n,r(n),x(n)) = min

q(n)+ ¯

V(n+1,r(n+1))

,

= min

yi(n),q(n) N

∑

i

1 xi

e(ui(n)yi(n)) −1
+ ¯

V(n+1,r(n+1))

,

¯ V(M +1,r(M +1)) = ∞. (12)

Heuristic : Elimination policy

Back Abhijeet Bhorkar Resource Allocation in Wireless Networks

SLIDE 37

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope

Function Approximation

Least Square Policy Iteration: Minimize the difference between actual and estimated approximated value function by,

¯ rn = argmin

r n

∑

m=0

(f(sm)′r − f(sm)′rn −

n

∑

k=m

(αΛ)k−mdn(sm,sm+1) 2 rn+1 = rn +βn(¯ rn −rn) (13) dn(sm,sm+1) = c(sm,sm+1)−φn +(f(sm+1)− f(sm)rn,∀k,n (14) φn+1 = φn +γn (c(sn,sn+1)−φn) (15)

Existence of fixed point proved Convergence not proved

Back Abhijeet Bhorkar Resource Allocation in Wireless Networks

SLIDE 38

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope

Solution using Q Learning

Qn+1(s,u) = Qn(s,u)+αn(s,u)en φn+1 = φn +βne′

n,

en =            cn −φn +maxbQn(yn,b)−Qn(s,u) if (s,u) = (sn,un),xn ∈ Si,i < N cn −φn if (s,u) = (sn,un),xn ∈ SN 0 otherwise , e′

n

=            cn −φn +maxbQn(yn,b)−Qn(s,u) if s = sn,sn ∈ Si,i < N cn −φn if s = sn,sn ∈ SN 0 otherwise , (16)

Back Abhijeet Bhorkar Resource Allocation in Wireless Networks