Stochastic Control for Energy Efficient Resource Allocation in - - PowerPoint PPT Presentation

stochastic control for energy efficient resource
SMART_READER_LITE
LIVE PREVIEW

Stochastic Control for Energy Efficient Resource Allocation in - - PowerPoint PPT Presentation

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Stochastic Control for Energy Efficient Resource Allocation in Wireless Networks Abhijeet Bhorkar (01D07014) under the guidance of Prof. A. Karandikar


slide-1
SLIDE 1

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope

Stochastic Control for Energy Efficient Resource Allocation in Wireless Networks

Abhijeet Bhorkar (01D07014)

under the guidance of

  • Prof. A. Karandikar

Indian Institute of Technology Bombay, Powai, Mumbai-76 July 05, 2006

Abhijeet Bhorkar Resource Allocation in Wireless Networks

slide-2
SLIDE 2

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope

Outline

1

Introduction and Scope of the Thesis

2

Power Optimal Scheduling Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link

Learning Algorithms: Overview Problem Formulation

Energy Efficient Video Transmis- sion

3

Summary and Future Scope

Abhijeet Bhorkar Resource Allocation in Wireless Networks

slide-3
SLIDE 3

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope

Motivation

Wireless LANs, ad-Hoc networks, sensor networks limited battery life, bandwidth Maintain acceptable QoS metric rate, delay, fairness Efficient utilization of limited resources Time varying wireless channel conditions Exploit channel variations

  • pportunistic scheduling :

Schedule user with the best channel condition

Can we perform power control to exploit channel variations? Joint opportunistic and energy efficient control

Abhijeet Bhorkar Resource Allocation in Wireless Networks

slide-4
SLIDE 4

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope

Motivation

Wireless LANs, ad-Hoc networks, sensor networks limited battery life, bandwidth Maintain acceptable QoS metric rate, delay, fairness Efficient utilization of limited resources Time varying wireless channel conditions Exploit channel variations

  • pportunistic scheduling :

Schedule user with the best channel condition

Can we perform power control to exploit channel variations? Joint opportunistic and energy efficient control

Abhijeet Bhorkar Resource Allocation in Wireless Networks

slide-5
SLIDE 5

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope

Motivation

Wireless LANs, ad-Hoc networks, sensor networks limited battery life, bandwidth Maintain acceptable QoS metric rate, delay, fairness Efficient utilization of limited resources Time varying wireless channel conditions Exploit channel variations

  • pportunistic scheduling :

Schedule user with the best channel condition

Can we perform power control to exploit channel variations? Joint opportunistic and energy efficient control

Abhijeet Bhorkar Resource Allocation in Wireless Networks

slide-6
SLIDE 6

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Scenario

✂ ✂ ✂ ✂ ✄ ✄ ✄ ✄ ☎ ☎ ✆ ✆ ✝ ✝ ✝ ✝ ✝ ✝ ✞ ✞ ✞ ✞ ✞ ✞

Power optimal scheduling Minimum rate guarantee (Multi-user) Fairness guarantee (Multi-user) Average delay guarantee with finite buffer (point to point link)

Abhijeet Bhorkar Resource Allocation in Wireless Networks

slide-7
SLIDE 7

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Preliminaries(1)

Shannon’s capacity:

P = N0W x

  • eu/W −1
  • ,

N0 : Spectral density of AWGN channel W : Spectrum bandwidth u : Transmission rate

Stochastic approximation:

λ(n+1) = λ(n)+α(n)(H(λ(n))+M(n+1))(1) E[h(λ,x)] = H(λ) Martingale M(n+1) = h(λ(n),x(n))−H(λ(n)) If step sizes satisfy ,

n=0

α(n) = ∞,

n=0

α(n)2 < ∞

then (1) tracks the Ordinary Differential equation (ODE),

˙ λ(t) = H(λ(t)) (2)

Abhijeet Bhorkar Resource Allocation in Wireless Networks

slide-8
SLIDE 8

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

System Model

✂✁☎✄✝✆✟✞ ✠☛✡✌☞✎✍✑✏ ✓✒✔✄✝✆✟✞ ✕✗✖✙✘☎✚✜✛ ✕✗✖✙✘☎✚✣✢ ✕✗✖✙✘☎✚✥✤ ✦ ✁ ✄✝✆✟✞ ✦ ✒ ✄✝✆✟✞ ✦✑✧ ✄✝✆✟✞ ★✪✩✬✫ ✘☎✭✯✮✯✰✱✘☎✚ ✲✳✫✵✴✔✶✯✶ ✘✷✰ ✲✹✸✺✶ ✭✼✻✾✽✙✻ ✸✿✶ ✖ ✄❁❀ ✸❃❂ ✘✷✚✣❄✣✘ ✩ ✻❅✖❆✻ ✸✺✶ ✞ ✄✝❇ ✁ ✄✝✆✟✞❉❈❊❇ ✒ ✄✝✆✟✞❉❈●❋❍❋●❋●❈❊❇ ✧ ✄✝✆✟✞■✞ ✄ ✕✗✖✬✘✷✚ ★ ✘❉✰✱✘ ✩ ✽✙✻ ✸✺✶ ✞ ✄❑❏ ✁ ✄✝✆✟✞❉❈▲❏ ✒ ✄❑✆▼✞✷❈✔❋❍❋❍❋✔❈▲❏ ✧ ✄✝✆✟✞■✞ ◆ ✁ ✄✝✆✟✞ ◆ ✒❖✄✝✆✟✞ ◆ ✧ ✄❑✆▼✞

Figure: Single hop system model

Slotted single-hop TDMA system Uplink scheduling Perfect channel state information Channel process ergodic (i.i.d. or Markovian)

Abhijeet Bhorkar Resource Allocation in Wireless Networks

slide-9
SLIDE 9

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Problem Formulation

Minimize average power

min limsup

M→∞

1 M

M

n=1

q(n),

Subject to average rate constraints Ci

limsup

M→∞

1 M

M

n=1

Ui(qi(n),xi(n))) ≥ Ci ∀i, q(n) ≥ 0,

N

i=1

yi(n) ≤ 1 ∀n (3)

U is information theoretic rate and is concave differentiable function of xi, qi U = log(1+xiqi) x = (x1,x2,··· ,xN) y = (y1,y2,··· ,yN)

Abhijeet Bhorkar Resource Allocation in Wireless Networks

slide-10
SLIDE 10

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Problem Formulation

Minimize average power

min limsup

M→∞

1 M

M

n=1

q(n),

Subject to average rate constraints Ci

limsup

M→∞

1 M

M

n=1

Ui(qi(n),xi(n))) ≥ Ci ∀i, q(n) ≥ 0,

N

i=1

yi(n) ≤ 1 ∀n (3)

U is information theoretic rate and is concave differentiable function of xi, qi U = log(1+xiqi) x = (x1,x2,··· ,xN) y = (y1,y2,··· ,yN)

Abhijeet Bhorkar Resource Allocation in Wireless Networks

slide-11
SLIDE 11

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Multiuser Optimal Solution

Proposition Optimal Policy for multiple users is to select kth user and transmit with power q∗ Sketch of Proof Use ergodicity to convert optimization problem (3) in continuous domain Minimize Lagrangian of (3) w.r.t. q first, then w.r.t. y Optimal power for single user, q∗

i =

  • λi − 1

xi

+ ,where λi is the Lagrange multiplier Minimizing w.r.t. y, we get, k = argmin

i (q∗ i −λi [log(1+q∗ i xi)−Ci])

Details Abhijeet Bhorkar Resource Allocation in Wireless Networks

slide-12
SLIDE 12

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Online Algorithm (1)

After minimizing over the primal variables, optimal value of Lagrangian is,

F(λ) = [E(min

i (q∗ i −λi log(1+q∗ i xi(n))−Ci)]

F(λ) strictly concave → unique maximum Need to find saddle point

Consider for example f(x) is continuous differentiable Gradient ascent scheme for maximizing f is,

xn+1 = xn +αn ˙ f(xn)

It tracks the differential equation,

˙ x(t) = d f dx = ˙ f

Abhijeet Bhorkar Resource Allocation in Wireless Networks

slide-13
SLIDE 13

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Online Algorithm (2)

Estimate λi online

Channel Measurement

  • ptimal power

with Update Parameter user Find the Optimal power, Transmit

Figure: Block diagram for on-line policy

Update Equation

λi(n+1) = {λi(n)−α(n)[yi(n)log

  • 1+
  • λi(n)−

1 xi(n) + xi(n)

  • ]−Ci
  • hi(λ)

}+ ∀i, (4)

Abhijeet Bhorkar Resource Allocation in Wireless Networks

slide-14
SLIDE 14

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Optimality and Stability of Update Equation

Iterations converge to differential inclusion almost surely to ,

˙ λ(t) = h(λ(t))

and thus to a supergradient ascent scheme

˙ λ(t) ∈ ∂F(λ(t))

∂F supergradient of F

Stability Boundedness of λi using projection method or linear stochastic approximation method

Abhijeet Bhorkar Resource Allocation in Wireless Networks

slide-15
SLIDE 15

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Simulations

0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2 x 10

4

5 10 15 20 25

Lagrange Multiplier λi No of iterations n

User 1 User 2 User 3 User 4

Figure: Convergence for Markovian channel α = 0.3, C = (0.6,0.8,0.7,0.2), λ(0) = (1,1,1,1) and σ = (1,1,0.9,0.3)

5 10 15 1 1.5 2 2.5 3 3.5

No of Users Ratio of power for optimal policy to round robin policy

Figure: Gain of the optimal policy over variable power round robin policy, C=0.6, γ = 0.7

Abhijeet Bhorkar Resource Allocation in Wireless Networks

slide-16
SLIDE 16

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Temporal Fairness

Time as resource Different users receive different time resource

Long term fair: Proportional share on long run

Starvation HOL blocking

Short term fair: Proportional share on finite window size M

Abhijeet Bhorkar Resource Allocation in Wireless Networks

slide-17
SLIDE 17

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Temporal Fairness

Time as resource Different users receive different time resource

Long term fair: Proportional share on long run

Starvation HOL blocking

Short term fair: Proportional share on finite window size M

Abhijeet Bhorkar Resource Allocation in Wireless Networks

slide-18
SLIDE 18

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Temporal Fair scheduler

Long term temporal fair

φi temporal share to user i on infinite run liminf

M→∞

1 M

M

n=1

E yi(n) ≥ φi ∀i. Modified update equation

λi(n+1) =

  • λi(n)−a(n)
  • yi(n)log
  • 1+
  • λi −

1 xi(n) + xi(n)

  • −Ci

+ λ ′

i (n+1)

= [λ ′

i (n)−a(n)(yi(n)−φi)]+

∀i

Short term temporal fair

φi temporal share on finite window of M Heuristic: Elimination policy Remove the user if its share is exhausted

Abhijeet Bhorkar Resource Allocation in Wireless Networks

slide-19
SLIDE 19

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Simulations

0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2 x 10

4

2 4 6 8 10 12 14 16 18

Lagrange Multiplier λi No of iterations (n)

User 1 User 2 User 3 User 4

Figure: Trajectory of λi(n) φ = (0.3,0.4,0.2,0.1) Figure: Power required for the short term and long term fairness

Short term Thoughput Abhijeet Bhorkar Resource Allocation in Wireless Networks

slide-20
SLIDE 20

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Preliminaries(2)

Average cost Markov Decision Process:

Path wise Average Cost = limsup

N→∞

E

  • 1

N

N−1

n=0

c(sn,un)

  • φ∗ +h(s) = min

u∈U {c(s,u)+P(¯

s|s,u)h(¯ s)} Bellman equation with unique h(s) exists iff, h(s) = min

u∈U

  • c(s,u)+Eh(¯

s)−h(s0)

  • (5)

where, s ∈ S: State of the process u ∈ U : Action taken c : Immediate cost ¯ s : Next state h : Difference value function φ : Average cost s0 ∈ S : Some arbitrary state P(¯ s|s,u) :Transition matrix (kernel)

Abhijeet Bhorkar Resource Allocation in Wireless Networks

slide-21
SLIDE 21

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Reinforcement Learning

✂ ✂ ✂ ✂ ✄ ✄ ✄ ✄ ☎✝✆✟✞✡✠☞☛✍✌ ✎ ☛✍✏✑✞ ✒ ✞✟✓✔✞✖✕ ✗ ☛✍✘✙✠☞✆✟✚✜✛✣✢✤✓✥✘✙✦✧✓✔✞✡✠☞☛✍✌ ★ ✎✪✩ ✠✫✞✡✠☞✆✭✬ ★ ☎✝✆✟✞✖☛ ✩ ✬ ✗ ☛✍✘✙✠☞✆✟✚✯✮✱✰✳✲ ✩ ☛✴✢✤✕✵✰✶✕✷✌✸✞ ✎✺✹ ✓✥✌✻✌✼✕✵✘

Learn how to take action in each state to minimize the cost

Abhijeet Bhorkar Resource Allocation in Wireless Networks

slide-22
SLIDE 22

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

System Model and Problem Formulation

✂✁ ✄ ✁ ☎ ✁ ✆ ✁ ✝✟✞✡✠☞☛✌☛✎✍✑✏ ✒✌✁ ✓

Figure: Transmitter of a point to point link

Optimization problem

Minimize P = limsup

N→∞

1 N

N

n=1

P

n,

(6) Subject to D− ¯ D ≤ Delay constraint (7) ε − ¯ ε ≤ 0a Drop probability constraint

aWe denote the constraint by ‘¯’ .

Abhijeet Bhorkar Resource Allocation in Wireless Networks

slide-23
SLIDE 23

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission n=1 n=0 n=k n=k+1 n=k+2

✂✁☎✄✝✆ ✞ ✁ ✟✠✁ ✡ ✞☛✁ ☞✍✌ ✄✝✆✏✎✒✑✔✓✖✕✘✗ ☞✙✌✛✚
✄✝✆✢✜✣✟ ✌✥✤✧✦✩★

Figure: System Model

By Little’s law (7) is,

Qavg − ¯ a ¯ D ≤ 0 where, ε = limsup

N→∞

∑N

n=0 dn

  • max(0,Qn −un +an+1 −B)

∑N

n=0 an

¯ a = aavg (1−ε) (Effective arrival rate) Qavg = limsup

N→∞

1 N

N

n=0

Qn, aavg = limsup

N→∞

1 N

N

n=0

an

Abhijeet Bhorkar Resource Allocation in Wireless Networks

slide-24
SLIDE 24

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Discrete State Space Markov Decision Formulation : Post Decision

The immediate cost of the constrained MDP is given by,

cn = P

n +λ1(Constraint1)+λ2(Constraint2)

Constraint1 : (Qn − ¯ D(an+1 −dn)) Constraint2 : (dn − ¯ εan+1)

cn is convex function of un and sn. Hence no duality gap. Post decision state ˜ s :State after decision is taken Bellman Equation (5) using the post-decision state is given as,

˜ hn+1(˜ s) = E

  • min

u∈U

Interchange of min and E operator

  • c(s,λ,u)+ ˜

hn+1(˜ ¯ s|˜ sn)− ˜ h(˜ s0)

  • Abhijeet Bhorkar

Resource Allocation in Wireless Networks

slide-25
SLIDE 25

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Two time scale update

✂✁☎✄✝✆✟✞✡✠☞☛✍✌✏✎✑✞✒✂✓✕✔✖✁✗✞ ✘ ✔✖✁☎✙✚✞✜✛✢✙✤✣✤✓✦✥✧✌✏✄★✣ ✞✡✩✪✔✖✁✏✙✫✔✬✥✭✌☎✄★✣ ✮ ✔✪✯✰✥✱✞✡✠✍☛✍✌☎✎✲✞✳✂✓✴✔✪✁✏✞ ✞★✵✷✶✫✵✹✸✻✺ ✼ ✽✿✾❁❀❃❂✦❄✷✽✡❅ ❆ ✔✪✶★✠✭✔✖✣✚✶★✞❈❇❊❉●❋✫✔✬✥✱✞ ✓✕✄★✣✤✯❍✥✧✔✖✣■✥ ✞★✵✷✶✫✵❑❏▲✺ ✼ ✽ ▼✞✡✞❖◆✚P✏◗

Post learning: Faster time scale

˜ hn+1(˜ s) = ˜ hn(˜ s)+α(ν(˜ s,n))I{˜ sn = ˜ s}

  • min

u∈U

  • c(s,λ n,u)+ ˜

hn(˜ ¯ s)

  • − ˜

hn(˜ s0)− ˜ hn((˜ s)

  • ∀˜

s

Lagrangian update: Slower time scale

λ1n+1 = Γ1[λ1n +β(n)(Qn − ¯ D(an −dn))] λ2n+1 = Γ2[λ2n +β(n)(dn − ¯ εan)] Γ1,Γ2 are projection operators.

Abhijeet Bhorkar Resource Allocation in Wireless Networks

slide-26
SLIDE 26

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Simulations

5 10 15 20 25 30 35 40 0.04 0.045 0.05 0.055 0.06 0.065 0.07 0.075

Delay Constaint Power (in watts)

Figure: Power delay curve with finite state space

Abhijeet Bhorkar Resource Allocation in Wireless Networks

slide-27
SLIDE 27

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Continuous State Space Formulation

How to deal with large buffer and continuous channel? Approximate the value function:

h(x) =

K

i=1

fi(x)ri, where, f = [f1, f2,··· fK] :Feature vectors e.g. [1, Q, x, Qx] r = [r1,r2,···rK] : Weights

Details Abhijeet Bhorkar Resource Allocation in Wireless Networks

slide-28
SLIDE 28

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Joint Source and Channel Coding

✂✁ ✄ ✂☎✝✆✟✞ ✄ ✂☎✝✆✟✞ ✞✡✠☞☛ ✌ ✞✍☎✎✠✎✏✂✑✒☛✓✞✕✔ ✄ ✂☎✝✆✟✞ ✖ ✁✎✗✘✞✍☎✝✆✘✔ ✙ ✆✟✗✘✞✍☎✝✆✘✔ ✄ ✂☎✝✆✟✞ ✚ ✛✔✜✆✢✏✣✆✢✤✜✤ ✥✧✦ ✑✕✁✎✁✝✆✟✏ ✙ ✆✢★✩✞✍☎✎✠✎✏✂✑✒☛✓✞✕✔

Device an online algorithm to find: Quantization qk per macro block (MB) Transmission rate uk for each MB and minimize transmission energy subject to distortion and absolute delay constraint per MB

Abhijeet Bhorkar Resource Allocation in Wireless Networks

slide-29
SLIDE 29

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Finite Horizon MDP Optimization Model

✂✁☎✄✝✆✟✞✡✠☞☛✍✌ ✂✁☎✄✝✆✟✞

Figure: kth MB transmission at nth slot

M :No of MB in Frame δk: Delay of kth MB Dk : Distortion of kth MB min

uk,qk E M

k=0

Lk+n−1

l=n

P(xl,uk)Tc |xn

  • such that 1

M E∑

k

DM−1

k=0 ≤ 1

M Dmax (8) δk ≤ Tmax,∀k, (9) Formulate as finite Horizon MDP with immediate cost: cn = En +λDn .

Abhijeet Bhorkar Resource Allocation in Wireless Networks

slide-30
SLIDE 30

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Finite horizon to Infinite Horizon

Learning algorithms specifically for infinite horizon Join horizon M to horizon 0 to get infinite cycle Modified learning for finite horizon

✂ ✄✆☎ ✝✞☎ ✝✠✟ ✄✡✟ ✝☞☛ ✄✌☛

Figure: Finite horizon to infinite horizon

Learn Online Reinforcement-Learning

Q Learning Abhijeet Bhorkar Resource Allocation in Wireless Networks

slide-31
SLIDE 31

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope Minimum Rate Guarantee Fairness Guarantee Average Delay Guarantee- point To point Link Energy Efficient Video Transmission

Simulation

1500 2000 2500 3000 3500 4000 4500 5000 5500 110 115 120 125 130 135 140

Average Distortion per frame Average Ebergy per frame (J)

Figure: Power-Distortion curve

Abhijeet Bhorkar Resource Allocation in Wireless Networks

slide-32
SLIDE 32

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope

Summary

Considered joint opportunistic and power optimal solution for minimum rate guarantee Proposed power optimal temporal fair scheduling Proposed average delay constrained power optimal scheme using MDP formulation. Proved convergence of the online policies Investigated the issue of power optimal video transmission

  • ver wireless

Abhijeet Bhorkar Resource Allocation in Wireless Networks

slide-33
SLIDE 33

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope

Future Research Direction

More practical, discrete rate scheduler needs to be designed for minimum rate guarantee Definition of fairness in fading is an open issue Convergence of function approximation algorithms using multiple policies is unresolved Power optimal variable packet length online scheduling algorithm for video transmission can be considered

Abhijeet Bhorkar Resource Allocation in Wireless Networks

slide-34
SLIDE 34

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope

Thank you

Abhijeet Bhorkar Resource Allocation in Wireless Networks

slide-35
SLIDE 35

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope

Using ergodicity from (3), min

  • ν(dx1,···dxN) ∑

y∈A

  • [0,∞) p1(dq|y,x)p2(y|x)q,

subject to

  • ν(dx1 ···dxN) ∑

y∈A

  • [0,∞) p1(dq|y,x)p2(y|x)log(1+qyixi)

≥ Ci ∀i, q ≥ 0. (10) The Lagrange function associated with (10) is, f(p1, p2,λ) ∆ =

  • ν(dx1 ···dxN) ∑

y∈A

  • [0,∞) p1(dq|y,x)p2(y|x)
  • q−∑

i

λi [log(1+qyixi)−Ci]

  • .

(11) ν Joint distribution of channel p1, p2 conditional distributions Minimizing w.r.t. p1, p2, F(λ) = min

p1,p2 f(p1, p2,λ) Back Abhijeet Bhorkar Resource Allocation in Wireless Networks

slide-36
SLIDE 36

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope

Short Term Thoughput Fair

Thoughput: MTCi for window of slot M, each slot duration T State :{r(n),x(n)}

r(n) Residual thoughput at slot n x(n) Channel state vector u(n) Transmission vector at slot n

DP formulation

V(n,r(n),x(n)) = min

  • q(n)+ ¯

V(n+1,r(n+1))

  • ,

= min

yi(n),q(n) N

i

1 xi

  • e(ui(n)yi(n)) −1
  • + ¯

V(n+1,r(n+1))

  • ,

¯ V(M +1,r(M +1)) = ∞. (12)

Heuristic : Elimination policy

Back Abhijeet Bhorkar Resource Allocation in Wireless Networks

slide-37
SLIDE 37

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope

Function Approximation

Least Square Policy Iteration: Minimize the difference between actual and estimated approximated value function by,

¯ rn = argmin

r n

m=0

  • (f(sm)′r − f(sm)′rn −

n

k=m

(αΛ)k−mdn(sm,sm+1) 2 rn+1 = rn +βn(¯ rn −rn) (13) dn(sm,sm+1) = c(sm,sm+1)−φn +(f(sm+1)− f(sm)rn,∀k,n (14) φn+1 = φn +γn (c(sn,sn+1)−φn) (15)

Existence of fixed point proved Convergence not proved

Back Abhijeet Bhorkar Resource Allocation in Wireless Networks

slide-38
SLIDE 38

Introduction and Scope of the Thesis Power Optimal Scheduling Summary and Future Scope

Solution using Q Learning

Qn+1(s,u) = Qn(s,u)+αn(s,u)en φn+1 = φn +βne′

n,

en =            cn −φn +maxbQn(yn,b)−Qn(s,u) if (s,u) = (sn,un),xn ∈ Si,i < N cn −φn if (s,u) = (sn,un),xn ∈ SN 0 otherwise , e′

n

=            cn −φn +maxbQn(yn,b)−Qn(s,u) if s = sn,sn ∈ Si,i < N cn −φn if s = sn,sn ∈ SN 0 otherwise , (16)

Back Abhijeet Bhorkar Resource Allocation in Wireless Networks