
Traffic Behavior and Queuing

in QoS Environment
Hamid R. Rabiee
Mostafa Salehi, Fatemeh Dabiran, Hoda Ayatollahi
Spring 2011

Objectives
Provide some basic understanding of queuing phenomena
Explain the available solution approaches and associated
trade-offs
Give guidelines on how to match applications and solutions

Outline
Basic concepts
Source models
Service models (demo)
Single-queue systems
Priority/shared service systems
Networks of queues

Outline
Basic concepts

Performance measures
Solution methodologies
Queuing system concepts
Stability and steady-state
Causes of delay and bottlenecks

Source models
Service models(demo)
Single-queue systems
Priority/shared service systems
Networks of queues

Performance Measures
Delay
Delay variation (jitter)
Packet loss
Efficient sharing of bandwidth
Relative importance depends on traffic type (audio/video, file transfer,
interactive)
Challenge: Provide adequate performance for (possibly) heterogeneous
traffic

Queuing System Concepts:
Arrival Rate, Occupancy, Time in the System
Queuing system
Data network where packets arrive, wait in various queues, receive service at
various points, and exit after some time

Arrival rate
Long-term number of arrivals per unit time

Occupancy
Number of packets in the system (averaged over a long time)

Time in the system (delay)
Time from packet entry to exit (averaged over many packets)

Stability and Steady-State

A single-queue system is stable if
packet arrival rate < system transmission capacity

For a single queue, the ratio
packet arrival rate / system transmission capacity
is called the utilization factor
It describes the loading of a queue

In an unstable system, packets accumulate in various queues and/or get dropped
For unstable systems with large buffers, some packet delays become very large
Flow/admission control may be used to limit the packet arrival rate
Prioritization of flows keeps delays bounded for the important traffic

Stable systems with time-stationary arrival traffic approach a steady-state

Little's Law
For a given arrival rate, the time in the system is proportional to the packet occupancy:
N = λT
where
N: average number of packets in the system
λ: packet arrival rate (packets per unit time)
T: average delay (time in the system) per packet
Examples:
On rainy days, streets and highways are more crowded
Fast food restaurants need a smaller dining room than regular restaurants with the same customer arrival rate
Large buffering may cause large delays

Explanation of Little's Law

Amusement park analogy: people arrive, spend time at various sites, and leave
They pay $1 per unit time spent in the park
The rate at which the park earns is $N per unit time
(N: average number of people in the park)
The rate at which people pay is $λT per unit time
(λ: traffic arrival rate; T: time per person)
Over a long horizon:
Rate of park earnings = Rate of people's payment
or
N = λT
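Little's law can be checked numerically. The sketch below (all parameter values are our own illustrations, not from the slides) simulates an M/M/1 FIFO queue, measures the average delay T, and compares λT against the M/M/1 occupancy formula that appears later in the deck:

```python
import random

# A minimal sketch verifying Little's law (N = lam*T) on a simulated
# M/M/1 FIFO queue. All parameter values are illustrative assumptions.
random.seed(1)
lam, mu, n_packets = 5.0, 8.0, 200_000   # arrival rate, service rate

clock = 0.0        # arrival time of the current packet
server_free = 0.0  # time at which the server next becomes idle
total_delay = 0.0  # sum of per-packet times in the system

for _ in range(n_packets):
    clock += random.expovariate(lam)               # Poisson arrivals
    start = max(clock, server_free)                # FIFO: wait if busy
    server_free = start + random.expovariate(mu)   # exponential service
    total_delay += server_free - clock             # time in the system

T = total_delay / n_packets   # measured average delay
N = lam * T                   # occupancy predicted by Little's law
print(f"measured T = {T:.4f} s  ->  N = lam*T = {N:.3f}")
print(f"M/M/1 theory: T = {1/(mu-lam):.4f} s, N = {lam/(mu-lam):.3f}")
```

With these rates the theoretical values are T = 1/3 s and N = 5/3, and the simulated estimates land close to them.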

Delay is Caused by Packet Interference

If arrivals are regular or sufficiently spaced apart, no queuing delay occurs

[Figure: arrival and departure timelines for regular traffic and for irregular but spaced-apart traffic; in both cases each packet departs one transmission time after it arrives, with no queuing]

Burstiness Causes Interference

Note that the departures are less bursty

[Figure: arrival and departure timelines for bursty traffic; back-to-back arrivals queue behind one another, producing queuing delays, and the departure stream is smoother than the arrival stream]

Burstiness Example

[Figure: different burstiness levels at the same packet rate]

Source: Fei Xue and S. J. Ben Yoo, UC Davis, "On the Generation and Shaping Self-similar Traffic in Optical Packet-switched Networks," OPNETWORK 2002

Packet Length Variation Causes Interference

Regular arrivals, irregular packet lengths

[Figure: regularly spaced arrivals with varying packet lengths; long packets delay the packets behind them, causing queuing delays]

High Utilization Exacerbates Interference

[Figure: densely spaced arrivals producing queuing delays]

As the work arrival rate (packet arrival rate × packet length) increases, the opportunity for interference increases

Bottlenecks
Types of bottlenecks
At access points (flow control, prioritization, QoS enforcement needed)
At points within the network core
Isolated (can be analyzed in isolation)
Interrelated (network or chain analysis needed)

Bottlenecks result from overloads caused by:

High-load sessions, or
Convergence of a sufficient number of moderate-load sessions at the same queue

Bottlenecks Cause Shaping

The departure traffic from a bottleneck is more regular than the arrival traffic
The inter-departure time between two packets is at least as large as the transmission time of the 2nd packet
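This shaping property is easy to check in a toy FIFO simulation. The sketch below (arrival and service distributions are made-up illustrations) verifies that every interdeparture gap is at least the second packet's transmission time:

```python
import random

# Sketch of the shaping property: in a FIFO queue, the gap between two
# consecutive departures is at least the transmission time of the second
# packet of the pair. Arrival and service parameters are illustrative.
random.seed(2)
arrivals, services = [], []
t = 0.0
for _ in range(10_000):
    t += random.expovariate(1.0)               # Poisson arrivals
    arrivals.append(t)
    services.append(random.uniform(0.5, 0.9))  # per-packet transmission time

departures = []
server_free = 0.0
for a, s in zip(arrivals, services):
    server_free = max(a, server_free) + s      # FIFO service
    departures.append(server_free)

# Every interdeparture gap >= transmission time of the 2nd packet
ok = all(departures[i + 1] - departures[i] >= services[i + 1] - 1e-12
         for i in range(len(departures) - 1))
print("interdeparture >= 2nd packet's transmission time:", ok)
```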

Bottlenecks Cause Shaping

[Figure: a bottleneck at 90% utilization fed by traffic with exponential interarrival times and fixed packet length; the incoming interarrival-time histogram is exponential, while the outgoing interdeparture-time histogram has a spike at the transmission time, showing the shaping effect]

[Figure: the same bottleneck at 90% utilization fed with small, medium, and large packets; the outgoing interdeparture-time histogram shows distinct peaks at the corresponding transmission times]

Packet Trains

[Figure: histogram of inter-departure times for small packets]

Outline
Basic concepts
Source models
Poisson traffic
Batch arrivals
Example applications: voice, video, file transfer

Service models (demo)


Single-queue systems
Priority/shared service systems
Networks of queues

Poisson Process with Rate λ

Interarrival times are independent and exponentially distributed
Models well the accumulated traffic of many independent sources
The average interarrival time is 1/λ (secs/packet), so λ is the arrival rate (packets/sec)

[Figure: timeline of arrivals with irregular interarrival times]
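A Poisson stream can be generated directly from this defining property. The sketch below (the rate value is an arbitrary illustration) draws exponential interarrival gaps and checks that their mean is 1/λ:

```python
import random

# Sketch: a Poisson process with rate lam has i.i.d. exponential
# interarrival times with mean 1/lam. The rate value is illustrative.
random.seed(3)
lam = 50.0                       # packets/sec
gaps = [random.expovariate(lam) for _ in range(100_000)]
mean_gap = sum(gaps) / len(gaps)
print(f"mean interarrival ~ {mean_gap:.5f} s (theory 1/lam = {1/lam:.5f} s)")
```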

Batch Arrivals
Some sources transmit in packet bursts
May be better modeled by a batch arrival process
(e.g., bursts of packets arriving according to a Poisson process)
The case for a batch model is weaker at queues after the first, because of
shaping
Interarrival Times

Time

Markov Modulated Rate Process (MMRP)

State 0: OFF
State 1: ON

Stay in each state an exponentially distributed time
Transmit according to a deterministic process in each state

Extension: models with more than two states and/or a stochastic transmission process
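A two-state on-off MMRP source can be sketched in a few lines. All rates below are illustrative assumptions; while ON the source emits at a deterministic peak rate, so the long-run average rate equals the peak rate times the fraction of time spent ON:

```python
import random

# Sketch of a two-state on-off MMRP source: exponentially distributed
# sojourn times in ON and OFF, deterministic emission while ON.
# All rates below are illustrative assumptions.
random.seed(4)
on_rate, off_rate = 1.0, 0.5   # 1/mean sojourn time in ON and OFF (1/sec)
peak_rate = 100.0              # packets/sec while ON

t, horizon, packets = 0.0, 10_000.0, 0
state_on = False
while t < horizon:
    if state_on:
        stay = random.expovariate(on_rate)
        packets += int(peak_rate * stay)   # deterministic emission while ON
    else:
        stay = random.expovariate(off_rate)
    t += stay
    state_on = not state_on

# Long-run ON fraction = (1/on_rate) / (1/on_rate + 1/off_rate) = 1/3 here
avg_rate = packets / horizon
print(f"average rate ~ {avg_rate:.1f} pkt/s (theory ~ {peak_rate/3:.1f})")
```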

Source Types
Voice sources
Video sources
File transfers
Web traffic
Interactive traffic
Different application types have different QoS requirements, e.g., delay,
jitter, loss, throughput, etc.

Source Type Properties

Voice
Characteristics: alternating talk-spurts and silence intervals; talk-spurts produce constant packet-rate traffic
QoS requirements: delay < ~150 ms, jitter < ~30 ms, packet loss < ~1%
Model: two-state (on-off) Markov Modulated Rate Process (MMRP), exponentially distributed time in each state

Video
Characteristics: highly bursty traffic (when encoded); long-range dependencies
QoS requirements: delay < ~400 ms, jitter < ~30 ms, packet loss < ~1%
Model: K-state Markov Modulated Rate Process (MMRP)

Data (FTP, telnet, web)
Characteristics: Poisson type; sometimes batch arrivals, bursty, or on-off
QoS requirements: zero or near-zero packet loss; delay may be important
Model: Poisson, Poisson with batch arrivals, or two-state MMRP

Typical Voice Source Behavior

[Figure: voice packet stream alternating between talk-spurts and silence intervals]

MPEG1 Video Source Model

The MPEG1 MMRP model can be extremely bursty, and has long-range dependency behavior due to the deterministic frame sequence

Diagram source: Mark W. Garrett and Walter Willinger, "Analysis, Modeling, and Generation of Self-Similar VBR Video Traffic," BELLCORE, 1994

Outline
Basic concepts
Source models
Service models
Single vs. multiple-servers
FIFO, priority, and shared servers
Demo

Single-queue systems
Priority/shared service systems
Networks of queues

Device Queuing Mechanisms


Common queue examples for IP routers
FIFO: First In First Out
PQ: Priority Queuing
WFQ: Weighted Fair Queuing
Combinations of the above

Service types from a queuing theory standpoint


Single server (one queue - one transmission line)
Multiple server (one queue - several transmission lines)
Priority server (several queues with hard priorities - one transmission line)
Shared server (several queues with soft priorities - one transmission line)

Single Server FIFO

Single transmission line serving packets on a FIFO (First-In-First-Out) basis
Each packet must wait for all packets found in the system to complete transmission before starting its own transmission
Departure Time = Arrival Time + Workload Found in the System + Transmission Time
Packets arriving to a full buffer are dropped

[Figure: arrivals feeding a single queue in front of one transmission line]

Multiple Servers
Multiple packets are transmitted simultaneously on multiple lines/servers
Head-of-the-line service: packets wait in a FIFO queue, and when a server becomes free, the first packet goes into service

[Figure: arrivals feeding one queue served by several transmission lines]

Priority Servers
Packets form priority classes (each may have several flows)
There is a separate FIFO queue for each priority class
Packets of lower priority start transmission only if no higher priority packet is waiting
Priority types:
Non-preemptive (high priority packet must wait for a lower priority packet found under transmission
upon arrival)
Preemptive (high priority packet does not have to wait)

[Figure: class 1 (high priority), class 2 (intermediate priority), and class 3 (low priority) arrivals, each with its own queue, feeding one transmission line]

Priority Queuing
Packets are classified into separate queues
E.g., based on source/destination IP address, source/destination TCP port, etc.
All packets in a higher priority queue are served before a lower priority queue is served
Typically in routers, if a higher priority packet arrives while a lower priority packet is
being transmitted, it waits until the lower priority packet completes

Shared Servers
Again we have multiple classes/queues, but they are served with a soft priority
scheme
Round-robin
Weighted fair queuing

[Figure: class 1 (weight 10), class 2 (weight 3), and class 3 (weight 1) arrivals, each with its own queue, sharing one transmission line]

Round-Robin/Cyclic Service
Round-robin serves each queue in sequence
A queue that is empty is skipped
Each queue when served may have limited service (at most k packets
transmitted with k = 1 or k > 1)
Round-robin is fair for all queues (in terms of packet transmission rate)
Round-robin cannot be used to enforce bandwidth allocation among the
queues.
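The round-robin discipline described above can be sketched as follows (queue contents and the per-turn limit k are illustrative):

```python
from collections import deque

# Minimal round-robin sketch: serve up to k packets per queue per turn,
# skipping empty queues. Queue contents and k are illustrative.
def round_robin(queues, k=1):
    order = []                       # transmission order of packet ids
    while any(queues):
        for q in queues:
            for _ in range(k):
                if q:
                    order.append(q.popleft())
                else:
                    break            # empty queue is skipped
    return order

qs = [deque(["a1", "a2", "a3"]), deque(["b1"]), deque(["c1", "c2"])]
print(round_robin(qs))   # ['a1', 'b1', 'c1', 'a2', 'c2', 'a3']
```

Note that every non-empty queue gets the same packet transmission rate regardless of load, which is why plain round-robin cannot enforce unequal bandwidth shares.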

Fair Queuing
This scheduling method is inspired by the most fair of methods:
Transmit one bit from each queue in cyclic order (bit-by-bit round robin)
Skip queues that are empty

To approximate the bit-by-bit processing behavior, for each packet:
We calculate upon arrival its finish time under bit-by-bit round robin (service is FIFO within each queue)
We transmit next the packet with the minimum finish time
Important properties:
Priority is given to short packets
Equal bandwidth is allocated to all queues that are continuously busy
[Figure: two example scenarios showing arrival times, bit-by-bit finish times, and departure times for packets i-1 and i; the packet with the smaller finish time departs first]

Weighted Fair Queuing

Fair queuing cannot be used to implement bandwidth allocation and soft priorities
Weighted fair queuing is a variation that corrects this deficiency
Let wk be the weight of the kth queue
Think of round-robin with queue k transmitting wk bits upon its turn
If all queues always have something to send, the kth queue receives bandwidth equal to a fraction wk / Σi wi of the total bandwidth

Fair queuing corresponds to wk = 1
Priority queuing corresponds to the weights growing very large as we move to higher priorities
Implementation: For each packet
Calculate its finish time (under the weighted bit-by-bit round robin scheme)
Transmit the packet with the minimum finish time
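The finish-time computation can be sketched for the static case. The code below is a simplified illustration that assumes all packets are already queued at time 0; a full WFQ implementation additionally needs a GPS virtual clock to handle dynamic arrivals:

```python
import heapq

# Simplified WFQ sketch: all packets are assumed present at time 0, so
# the bit-by-bit finish time of the i-th packet in queue k is just the
# cumulative bits of that queue divided by its weight w_k. (Handling
# dynamic arrivals requires a GPS virtual clock, omitted here.)
def wfq_order(queues, weights):
    heap = []
    for k, (pkts, w) in enumerate(zip(queues, weights)):
        cum = 0.0
        for i, bits in enumerate(pkts):
            cum += bits / w              # weighted bit-by-bit finish time
            heapq.heappush(heap, (cum, k, i))
    # Transmit in increasing finish-time order
    return [(k, i) for _, k, i in sorted(heap)]

# Queue 0 has weight 3, queues 1 and 2 have weight 1
queues = [[1000, 1000, 1000], [1000], [1000]]
order = wfq_order(queues, [3, 1, 1])
print(order)
```

With these weights, queue 0's first two packets finish (in virtual time) before any packet of the weight-1 queues, reflecting its larger bandwidth share.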

Weighted Fair Queuing Illustration

[Figure: three queues sharing a link with weights: Queue 1 = 3, Queue 2 = 1, Queue 3 = 1]

A Practical Combination (e.g., Cisco)

Example: voice in PQ, guaranteed-bandwidth traffic in WFQ queues (all at middle priority), and best-effort traffic in a low-priority queue

Demo: Comparing FIFO, WFQ and PQ


Two traffic streams mixing on a common interface
Video
FTP

Apply different service schemes


FIFO
PQ
WFQ

Run simulation and compare queuing delays

Demo: FIFO

[Figure: the two streams feeding a FIFO queue at a bottleneck running at 90% utilization]

Demo: FIFO Queuing Delay

Applications have different requirements:
Video: delay, jitter
FTP: packet loss
Control beyond best effort is needed:
Priority Queuing (PQ)
Weighted Fair Queuing (WFQ)

Demo: Priority Queuing (PQ)

[Figure: the same setup with PQ at the bottleneck, 90% utilization]

Demo: PQ Queuing Delays

[Figure: queuing delay curves for PQ FTP, FIFO, and PQ video]

Demo: Weighted Fair Queuing (WFQ)

[Figure: the same setup with WFQ at the bottleneck, 90% utilization]

Demo: WFQ Queuing Delays

[Figure: queuing delay curves for PQ FTP, WFQ FTP, FIFO, and WFQ/PQ video]

Queuing: Summary Points


Choice of queuing mechanism can have a profound effect on performance
To achieve desired service differentiation, appropriate queuing
mechanisms can be used
Complex queuing mechanisms may require simulation techniques to
analyze behavior
Improper configuration (e.g., queuing mechanism selection or weights)
may impact performance of low priority traffic

Outline
Basic concepts
Source models
Service models (demo)
Single-queue systems
M/M/1, M/M/m, M/M/m/k
M/G/1, G/G/1
Demo: Analytics vs. simulation

Priority/shared service systems


Networks of queues

M/M/1 System
Nomenclature: M stands for Memoryless (a property of the exponential distribution)
The first M stands for a Poisson arrival process (which is probabilistically memoryless)
The second M stands for exponentially distributed transmission times
Assumptions:
Arrival process is Poisson with rate λ packets/sec
Packet transmission times are exponentially distributed with mean 1/μ
One server
Independent interarrival times and packet transmission times
Transmission time is proportional to packet length
Note: 1/μ is secs/packet, so μ is packets/sec (the packet transmission rate of the queue)
Utilization factor: ρ = λ/μ (stable system if ρ < 1)

Delay Calculation
Let
Q = Average time spent waiting in queue
T = Average packet delay (transmission plus queuing)
Note that T = 1/μ + Q
Also, by Little's law,
N = λT and Nq = λQ
where
Nq = Average number waiting in queue
These quantities can be calculated with formulas derived by Markov chain analysis (see references)

M/M/1 Results
The analysis gives the steady-state probabilities of the number of packets in queue or transmission:
P{n packets} = ρ^n (1 − ρ)
where ρ = λ/μ
From this we can get the averages:
N = ρ/(1 − ρ)
T = N/λ = ρ/(λ(1 − ρ)) = 1/(μ − λ)

[Figure: T plotted against ρ; T starts at 1/μ for ρ = 0 and grows without bound as ρ approaches 1]
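These formulas are easy to evaluate directly. The sketch below uses illustrative rates (not from the deck) and cross-checks Little's law:

```python
# Sketch: M/M/1 steady-state formulas with illustrative rates
# (lam and mu are our own assumptions, not from the slides).
lam, mu = 50.0, 100.0          # arrival and service rates (packets/sec)
rho = lam / mu                 # utilization factor
assert rho < 1, "unstable queue"

N = rho / (1 - rho)            # average number in the system
T = 1 / (mu - lam)             # average delay (Little: T = N/lam)

def p(n):
    """P{n packets in the system} = rho^n * (1 - rho)."""
    return rho**n * (1 - rho)

print(f"rho = {rho}, N = {N}, T = {T*1000:.1f} ms")
print(f"P0 = {p(0)}, P1 = {p(1)}")
```

Here ρ = 0.5, so N = 1 packet and T = 20 ms, consistent with N = λT.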

Example: How Delay Scales with Bandwidth

Occupancy and delay formulas:
N = ρ/(1 − ρ)
T = 1/(μ − λ)
ρ = λ/μ

Assume:
Traffic arrival rate is doubled
System transmission capacity is doubled

Then:
Queue sizes stay at the same level (ρ stays the same)
Packet delay is cut in half (λ and μ are doubled)

A conclusion: In high-speed networks
propagation delay increases in importance relative to transmission and queuing delays
buffer size and packet loss may still be a problem

M/M/m, M/M/∞ Systems

Same as M/M/1, but with m (or ∞) servers
In M/M/m, the packet at the head of the queue moves into service when a server becomes free
Qualitative result:
Delay increases to ∞ as ρ = λ/(mμ) approaches 1
There are analytical formulas for the occupancy probabilities and average delay of these systems

Finite Buffer Systems: M/M/m/k

The M/M/m/k system
Same as M/M/m, but there is buffer space for at most k packets. Packets arriving at a full buffer are dropped
There are formulas for average delay, steady-state occupancy probabilities, and loss probability
The M/M/m/m system is used widely to size telephone or circuit switching systems

Characteristics of M/M/· Systems

Advantage: Simple analytical formulas
Disadvantages:
The Poisson assumption may be violated
The exponential transmission time distribution is an approximation at best
Interarrival and packet transmission times may be dependent (particularly in the network core)
The head-of-the-line assumption precludes heterogeneous input traffic with priorities (hard or soft)

M/G/1 System
Same as M/M/1, but the packet transmission time distribution is general, with given mean 1/μ and variance σ²
Utilization factor: ρ = λ/μ
Pollaczek-Khinchine formula:
Average time in queue = λ(σ² + 1/μ²) / (2(1 − ρ))
Average delay = 1/μ + λ(σ² + 1/μ²) / (2(1 − ρ))

The formulas for the steady-state occupancy probabilities are more complicated
Insight: As σ² increases, delay increases
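The P-K formula can be wrapped in a small helper (the function names are ours). As a sanity check, exponential service, whose variance is 1/μ², should recover the M/M/1 delay 1/(μ − λ):

```python
# Sketch of the Pollaczek-Khinchine mean-value formula. Function names
# are our own; mu is the service rate, var_s the service-time variance.
def pk_wait(lam, mu, var_s):
    """Average time in queue for M/G/1 (stable: lam/mu < 1)."""
    rho = lam / mu
    assert rho < 1, "unstable queue"
    return lam * (var_s + 1 / mu**2) / (2 * (1 - rho))

def pk_delay(lam, mu, var_s):
    """Average time in system: transmission plus queuing."""
    return 1 / mu + pk_wait(lam, mu, var_s)

# Sanity check: exponential service (var_s = 1/mu^2) recovers M/M/1
lam, mu = 50.0, 100.0
print(pk_delay(lam, mu, 1 / mu**2), 1 / (mu - lam))  # both ~ 0.02 s
```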

G/G/1 System
Same as M/G/1, but now the packet interarrival time distribution is also general, with mean 1/λ and variance σa²
We still assume FIFO and independent interarrival times and packet transmission times

Heavy traffic approximation:
Average time in queue ≈ λ(σa² + σ²) / (2(1 − ρ))
Becomes increasingly accurate as ρ → 1

Demo: M/G/1
Packet inter-arrival times: exponential, mean 0.02 sec
Packet size: 1250 bytes (10000 bits)
Capacity: 1 Mbps

Packet size distribution:
exponential
constant
lognormal

What are the average delay and queue size?
Compare analytical formulas with simulation results

Demo: M/G/1 Analytical Results

Packet Size Distribution                        Delay T (sec)   Queue Size (packets)
Exponential (mean = 10000, variance = 1.0e8)    0.02            1.0
Constant    (mean = 10000, variance = 0.0)      0.015           0.75
Lognormal   (mean = 10000, variance = 9.0e8)    0.06            3.0
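The table values follow directly from the P-K formula. The sketch below recomputes them from the demo parameters (packet-size variances are in bits², converted to seconds² via the 1 Mbps link rate):

```python
# Recomputing the demo's analytical table with the P-K formula. The link
# is 1 Mbps, packets average 10000 bits, and interarrivals are
# exponential with mean 0.02 s, so rho = 0.5.
lam = 1 / 0.02                # 50 packets/sec
mu = 1_000_000 / 10_000       # 100 packets/sec (1 Mbps / 10000 bits)
rho = lam / mu

for name, var_bits in [("exponential", 1.0e8), ("constant", 0.0),
                       ("lognormal", 9.0e8)]:
    var_s = var_bits / 1_000_000**2   # bits^2 -> sec^2 at 1 Mbps
    T = 1/mu + lam * (var_s + 1/mu**2) / (2 * (1 - rho))  # P-K delay
    print(f"{name:12s} T = {T:.3f} s, N = lam*T = {lam*T:.2f} packets")
```

The printed values reproduce the table: 0.02 s / 1.0, 0.015 s / 0.75, and 0.06 s / 3.0.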

Demo: M/G/1 Simulation Results

[Figure: simulated average delay (sec) and average queue size (packets) for the three packet size distributions]

Demo: M/G/1 Limitations

Application traffic mix is not memoryless:
Video: constant packet inter-arrivals
HTTP: bursty traffic

[Figure: delay predicted by the P-K formula vs. delay observed in simulation]

Outline
Basic concepts
Source models
Service models (demo)
Single-queue systems
Priority/shared service systems
Preemptive vs. non-preemptive
Cyclic, WFQ, PQ systems
Demo: Simulation results

Networks of queues
Hybrid simulation (demo)

Non-preemptive Priority Systems

We distinguish between different classes of traffic (flows)
Non-preemptive priority: a packet under transmission is not preempted by a packet of higher priority
The P-K formula for delay generalizes

[Figure: class 1 (high priority), class 2 (intermediate priority), and class 3 (low priority) arrivals, each with its own queue, feeding one transmission line]

Cyclic Service Systems

Multiple flows, each with its own queue
Fair system: each flow gets access to the transmission line in turn
Several possible assumptions about how many packets each flow can transmit when it gets access
Formulas for delay under M/G/1-type assumptions are available

[Figure: three arrival classes, each with its own queue, served in cyclic order by one transmission line]

Weighted Fair Queuing


A combination of priority and cyclic service
No exact analytical formulas are available

Outline
Basic concepts
Source models
Service models (demo)
Single-queue systems
Priority/shared service systems
Networks of queues
Violation of M/M/· assumptions
Effects on delays and traffic shaping
Analytical approximations

Two Queues in Series

The first queue shapes the traffic into the second queue
Arrival times and packet lengths are correlated
M/G/1 formulas yield significant error for the second queue
How about M/M/1, if packet lengths are exponential?

[Figure: packet timelines at the first and second queue; the second queue sees already-shaped traffic]

Two Bottlenecks in Series

[Figure: exponential inter-arrivals enter the first bottleneck, where they incur queuing delay and depart spaced apart; the spaced-apart stream then passes through the second bottleneck with no queuing delay, departing still spaced apart]

Approximations
Kleinrock independence approximation:
Perform a delay calculation in each queue independently of other queues
Add the results (including propagation delay)

Note: In the preceding example, the Kleinrock independence approximation overestimates the queuing delay by 100%
Tends to be more accurate in networks with lots of traffic mixing, e.g., nodes serving many relatively small flows from several different locations
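Under this approximation, an end-to-end delay estimate is just the sum of per-hop M/M/1 delays plus propagation. The sketch below uses a made-up two-hop path (the helper name and all numbers are our own illustrations):

```python
# Sketch of the Kleinrock independence approximation: treat each hop as
# an independent M/M/1 queue and add per-hop delays plus propagation.
# The example path (rates, propagation delays) is a made-up assumption.
def kleinrock_path_delay(hops):
    """hops: list of (lam, mu, prop) per queue; returns end-to-end delay."""
    total = 0.0
    for lam, mu, prop in hops:
        assert lam < mu, "each hop must be stable"
        total += 1 / (mu - lam) + prop     # M/M/1 delay + propagation
    return total

path = [(50.0, 100.0, 0.001),   # hop 1: rho = 0.5, 1 ms propagation
        (80.0, 100.0, 0.002)]   # hop 2: rho = 0.8, 2 ms propagation
print(f"approx end-to-end delay: {kleinrock_path_delay(path)*1000:.1f} ms")
```

As the preceding slides note, this can substantially overestimate delay when downstream queues see already-shaped traffic.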

References
Networking
Bertsekas and Gallager, Data Networks, Prentice-Hall, 1992

Device Queuing Implementations
Vegesna, IP Quality of Service, Cisco Press, 2001
http://www.juniper.net/techcenter/techpapers/200020.pdf

Probability and Queuing Models
Bertsekas and Tsitsiklis, Introduction to Probability, Athena Scientific, 2002, http://www.athenasc.com/probbook.html
Cohen, The Single Server Queue, North-Holland, 1992
Takagi, Queueing Analysis: A Foundation of Performance Evaluation (3 volumes), North-Holland, 1991
Gross and Harris, Fundamentals of Queueing Theory, Wiley, 1985
Cooper, Introduction to Queueing Theory, CEEPress, 1981

OPNET Hybrid Simulation and Micro Simulation
See the Case Studies papers at
http://secure.opnet.com/services/muc/mtdlogis_cse_stdies_81.html
