Академический Документы
Профессиональный Документы
Культура Документы
net/publication/311532272
CITATION READS
1 176
2 authors:
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by Alessandro Vizzarri on 31 August 2017.
Abstract —We consider Quality of Experience (QoE), experiments, reported in Sect. 4. In Sect. 5 QoS metrics have
measured in terms of Mean Opinion Score Listening Quality been used as predictors, together with a feed-forward Neural
Subjective (MOS_LQS) for a Voice over LTE (VoLTE) Network compared with a linear multi-regression technique.
application in realistic situations. A set of service scenarios have This is motivated by a shared evidence of nonlinear behavior
been identified and the network performance simulated. We for MOS versus network parameters in realistic scenarios. Sect.
organized a listening panel to measure MOS according to the 6 will draft the most relevant conclusions out of the present
standard procedures. MOS-LQS results and QoS metrics have approach.
been correlated using a set of Artificial Neural Network (ANN)
models. As a clear result the ANN accurately models the II. QOE ANALYSIS FOR A VOLTE APPLICATION
relationship. This confirms again the need for an approach that is
non linear and capable of generalization, as made by the human Since LTE is fully IP-based wireless standard it only
judgment. Future research is necessary to generalize the ANN enables entire wireless transmission over Packet Switching (PS)
model to a far larger scale while controlling the combinatorial paths using the Internet Protocol (IP) protocol as the network
explosion of scenarios and technical parameters. protocol. All applications delivered over LTE systems are IP-
based, including voice applications as Voice Over LTE
Keywords—LTE; VoIP; VoLTE; end-to-end QoS; QoE; LTE (VoLTE) [3], [4], [5]. VoLTE is delivered in best effort
KPIs; IP cloud; ANN; MOS; MOS-LQS. modality. This implies VoLTE, as a best effort service, requires
All the authors contributed equally to this work.
an effective service management in order to guarantee both an
acceptable end-to-end QoS at network layer and an acceptable
QoE perceived by end-users. Vizzarri in [6] presents a review
of the most important papers focused on the end-to-end QoS in
I. INTRODUCTION case of applications delivered over LTE networks. Reference
Voice over LTE (VoLTE) is a key service for Long Term [7] analyses the QoS for a VoLTE service through and end-to-
Evolution (LTE) networks, and needs specific attention for end approach. Reference [8] studies the impact of network
quality of service (QoS). End-to-end approach for QoS is congestions on the VoLTE end-to-end performance. Major
strongly recommended by the standard not only for data-based standardization entities already treated the QoS issue in LTE
services but also for delay sensitive services VoLTE [1]. through an end-to-end approach. In [9] ETSI provides end-to-
MNOs analyze and monitor continuously the main network end QoS reference architecture for LTE and a description of
parameters (Key Performance Indicators, KPI) at network level relevant management functions. Implementation of QoS
and try to estimate Quality of Experience (QoE) perceived by policies and strategies are left to MNOs. QoE of a voice
end user at the application level. application is usually represented by the Mean Opinion Score
The present work considers QoE measured in terms of (MOS). MOS is a scalar variable, whose values are limited in
Mean Opinion Score Listening Quality Subjective for a VoLTE the range from 1 (worst case) to 5 (best case) [10], which
application in realistic situations. We are interested in studying indicates an average level of service acceptance for the end
the behavior for MOS (as QoE parameter) versus the network user. The MOS value is strictly related to the R factor provided
KPIs (i.e. QoS parameters). Research presents extensive by the ITU E-Model [11]. The E-Model combines a number of
samples of LTE networks modelled in realistic usage different impairments to be considered for an overall quality
conditions through scenario making. We propose the approach measure.
of modelling multiuser and multiservice scenarios according to
There are various different MOS measures, as shown in Fig.
reported conditions, and simulate the network performance
1. We can distinguish objective measurement that rely on
using the OPNET simulation software (as described in Sect. 2).
estimates of conversational and listening quality, based on
Many authors applied polynomial, exponential and logarithmic
technical measures and subjective measurements that rely on
functions to fit the objective MOS value (see references in [2]).
collaboration of human subjects. In case measures are
Then we attacked the subjective measurements for MOS. This
performed before the receiver we speak of MOS
required to invest a certain amount of resources to train and
Conversational Quality Estimated (MOS-CQE) and MOS
employ a panel of listeners for an extensive series of
Listening Quality Estimated (MOS-LQE). In case measures are
performed after the receiver we speak of MOS Listening the callee might start to talk at the same moment or interrupt
Quality Objective (MOS-LQO) and MOS Listening Quality each other. Further an excess of delay might generate Packet
Subjective (MOS-LQS), respectively objective and subjective. Loss because the delayed voice packet could be dropped. Jitter
In the present paper we are interested in subjective is the variability in the arrival time between packets, caused by
measurement of MOS. network congestion or route changes [17]. Negative effects of
Delay and Jitter are voice echoes, while high value of Packet
Loss can produce overlapping of words with a strong negative
impact on voice intelligibility. The effects of these QoS metrics
on QoE are well known, though we are interested in correlating
the QoE with the measured QoS metrics via neural networks.
There are two set of reasons. First, neural networks are a well-
known tool to fit non-linear input-output functions, and may be
easily used in our problem. Second, neural networks were
demonstrated to have a great potential in modeling human
behavior. On consequence we are interested in exploiting this
subtler property to find a way to reduce the expensive process
of full MOS-LQS measurement.
Fig. 1. MOS and voice quality estimation schemes for VoIP Services.
Available methods can be grouped in three classes. The III. RELATED WORK
“Conversational Opinion Test” requires a preparation in the The approach of the authors is to correlate the objective
laboratory in terms of creation of impairments in the VoLTE QoS KPIs at network level with MOS as QoE subjective
testbed system such as loss, delay and echo. Each test requires indicator. We will apply in the following different statistical
two subjects who have to sit in separate sound-proof rooms and regression techniques: Multiple Linear Regression (MLR) and
then talk to each other before finally rating the individual score, Artificial Neural Network (ANN) with different training
using 5-point scale. Also, each condition requires participants algorithms. In [18] the authors propose fuzzy-logic for
at least 24-32 subjects to test. The final result is the MOS QoS/QoE mapping. QoS/QoE mapping is carried out
Conversational Quality Subjective (a variant of MOS-LQS correlating Mean Opinion Score (MOS) as QoE parameter to
shown on Fig. 1). The “LQS Test” requires a laboratory too network KPIs as QoS parameter. In particular Delay, Jitter and
[12]. Preparation is somehow more complicate, because Packet Loss Rate are considered as KPIs. A comparison of
participants are taken individually and provided each with different approaches for the best fitting curve of the simulation
audio files that is the result of a high fidelity recording results is in [19] [20].
transmitted along the VoLTE service chain, including the
receiver. Each participant has to give the individual score, In [21] the authors conducted experimental tests in order to
using the usual 5-point scale. Measurement of each scenario evaluate the QoS of a generic VoIP service. On the basis of the
should be conducted with at least 16 subjects [13]. obtained test results, the authors tried to correlate the metrics
measured at network level (Packet Loss and Packet Delay) to
In the following we will focus on the listening test MOS- the corresponding MOS-CQS resulting from the subjective test.
LQS. We based on a transmission test carried out with pre- They proceeded to fit the obtained values through a MLR and
recorded phrases; the goal of this test is to obtain the absolute identified a polynomial relationship. Finally they calculated the
quality of the voice sample after transmission, through the related Mean Absolute Percent Error (MAPE).
direct hearing of the sample, without a reference sample. There
are various listening tests, i.e. Absolute Category Rating QoE/QoS mapping can be also analyzed and modelled
(ACR), Degradation Category Rating (DCR) and Comparison through the involvement of the ANNs [22] [23], since a
Category Rating (CCR) [14]. The quality must be evaluated desiderable output (QoE in our case) can be predicted on the
through different opinion scales: Listening quality scale, basis of multiple inputs (network QoS metrics at network
Listening-effort scale and Loudness-preference scale. After level). In [24] the authors try to correlate the QoE to QoS in
collection of each participant’s opinion of quality (expressed in case of a mobile data service through an ANN. After training,
the usual range), the MOS is computed as the mean of the test the ANN model gives two different types of relationship for the
panel. The ITU-T Rec. P.800 (1996) defined the requirements a QoE: direct if QoE is estimated against the bandwidth; inverse
MOS test has to comply with [15]. in case of estimation of Delay. In [25] V. A. Machado et alii
defined an ANN-based network model for QoS/QoE correlation
The final objective of our research is to correlate the MOS- in case of a video service delivered over a WiMAX network.
LQS with the QoS metrics of the VoLTE applications. Here we Test results confirm the goodness of the ANN model, which
adopt as the QoS metrics relevant for a VoLTE service three exibits an acceptable error. In [26] the authors apply the ANN
Key Performance Indicators (KPIs): end-to-end Delay, Packet to the estimation of the volume traffic matrix in a large scale [P
Loss rate (PLR) and Jitter (other quality of service measures for network. They compare different training algorithms (as
a generic VoIP application can be found in [16]). Delay is Levenberg-Marquardt and Bayesian Regularization) in terms of
expressed as the amount of time a packet sent by the source robustness and accuracy in order to perform a good prediction
(caller) takes to reach destination (callee). An excess of delay of traffic. Results confirm the Levenberg-Marquardt training
can make an audio conversation very difficult: both caller and algorithm to have the best error robustness.
IV. EXPERIMENTAL QCI 1 (GBR) and Allocation and Retention Priority (ARP 1).
The VoLTE application has been launched with a start offset of
A. Methodology 20s till the end of simulation period. In the HTTP application,
Our rationale is as follows: concerning scenarios from n.25 to n.48, UE_2 can download 1
∑ Define realistic scenarios for VoLTE application, to KB web page, n. 5 medium images with dimension up to 2 KB
be simulated for computing KPIs. and two short videos with dimension up to 350 KB. HTTP
∑ Perform MOS LQS testing per each scenario. application is launched with a start offset of 40s. Since modeled
∑ Attempt multi parameter fitting of QoE vs QoS KPIs profiles add a start offset around 40s, VoLTE and HTTP
through the ANN. applications start after 80s from simulation initiation.
B. Simulated scenarios
Realistic scenarios are simulated on the basis of network
impairments that disturb VoLTE calls. Impairments are
represented by:
∑ mixed traffic: VoLTE application is delivered over the
LTE network together with HTTP web browsing
application.
∑ insertion of IP cloud: additive delay and IP packet
discard ratio are produced across entire end-to-end
transmission chain.
This framework is typical when a backbone network
section is involved in service delivery. The HTTP application
is the application considered for modeling a set of multiuser a) b)
and multiservice scenarios. We made the exercise of
considering different LTE network topology (with or without
IP cloud) and traffic flows (single or mixed), as shown in Fig.
2, and identified a series of 48 scenarios.
Scenarios from n.1 to n.12 are characterized by two UEs:
UE_1 (caller) is performing a VoLTE call to UE_2 (callee)
using a direct link. Traffic flow is single. Topology is shown in
Fig. 2.a.
Scenarios from n.13 to n.24 are characterized by three UEs
and one HTTP web server: UE_1 and UE_2 are performing a
VoLTE call while UE_3 is performing an HTTP web browsing
session. Traffic flow is mixed: VoLTE and HTTP web
browsing services are performed by UEs (see Fig. 2.b). c) d)
Fig. 2. LTE network topology simulated for VoLTE application. One UE for
Scenarios from n.25 to n.36 are similar to those in the group VoLTE without IP cloud (a), two UEs for VoLTE and HTTP browsing
1-12: UE_1 is performing a VoLTE call to UE_2, but link without IP cloud (b), One UE for VoLTE with IP cloud (c), two UEs for
among them is affected by insertion of IP cloud. It adds 1% VoLTE and HTTP browsing with IP cloud (d).
packet discard ratio and 0.1s delay between caller (UE_1) and
callee (UE_2) (refer to Fig. 2.c). The simulation period is equal to 5 (3mins plus offset) for
each scenario.
Scenarios from n.37 to n.48 are similar to those in the group
13-24: UE_1 and UE_2 are performing a VoLTE call using a TABLE III. SCENARIO CONFIGURATION
direct link interrupted by IP cloud, UE_3 is performing HTTP
web session (refer to Fig. 2.d). Table III resumes the main LTE LTE S1 eNB
Scen. IP User Hop
Serv. Serv. Capacity BW
characteristics for each simulated scenario. No.
Type No.
cloud
[%]
No. No.
[MHz]
C. Simulation settings 1-12 VoLTE 1 NO
100; 75;
2 2
5; 10;
50;30 20
Scenarios have been simulated using the LTE network
model provided by OPNET 17.5 PL6. The UE’s antenna gain 100; 75; 5; 10;
13-24 VoLTE 1 YES 2 3
is -1 dBi with a receiver sensitivity of -200 dBm. eNodeB uses 50;30 20
10 MHz LTE bandwidths and FDD Duplex Mode. Link among
LTE network nodes is of type PPP D3, with a data rate of VoLTE 100; 75;
44.736 Mbps. The simulation area is a typical campus area 25-36 2 NO 3 2 5; 10; 20
+ HTTP 50;30
(100 Km-square wide). We employed the Voice Codec GSM
EFR and one voice frame per packet. As per the LTE standard,
the VoLTE application is carried out over the EPS bearer with
scenario number) versus each of the KPIs (which show abrupt
VoLTE 100; 75; 5; 10; changes among contiguous scenarios).
37-48 2 YES 3 3
+ HTTP 50;30 20
D. Experimental Setup
The experimental setting is based on the OPNET simulation
tool configured in the System-in-the-loop (SITL) modality.
V. ANALYSIS OF RESULTS
A. Multi Linear Regression
In [2] we studied the relationship between QoS KPIs and a
MOS, intended as a QoE metrics, fitting the linear model with
bias shown in (1) :
Fig. 3. The experimental testbed.
Fig. 6. Error histogram for the ANN trained with the Levenberg-Marquardt
Algorithm.
Fig. 5. Statistical indicators of the MLR approach.
Statistical Indicators
Dataset
Samples MSE R-factor