Fehlertolerante Videokommunikation über verlustbehaftete

Transcrição

Fehlertolerante Videokommunikation über verlustbehaftete
Traffic Charaterization and Fault Tolerance of H.264/AVC Encoded
Video Streams for QoS-Management in Computer Networks
Klaus Heidtmann, Michael Kiritz, Jens Norgall
Telecommunications and Computer Networks Division
Department of Computer Science, Hamburg University,
Vogt-Kölln-Str. 30, D-22527 Hamburg, Germany
[email protected]
Abstract Distributed video applications are an emerging area of our modern information society.
They are frequently integrated into multimedia-applications and process video data streams
produced by video encoders. This processing imposes high storage and throughput requirements
as well as temporal constrains due to the real-time aspect on communication systems. For instance
real-time video communication like video conferencing does not perform very favourable when
run over best-effort networks without quality-of-service management. Because of data losses or
high delays as consequences of congestions, bottlenecks or radio-interference in WLANs the
service offered by such a video application can deteriorate. To support the solution of such
problems this paper presents a study of the output process of video encoders to quantify and
characterize the requirements of video applications on networks. In particular we measure and
investigate the stochastic traffic characteristics of H.264/AVC-encoded video streams as network
load. This can support the solution of the problem to dimension, configure or parameterize video
applications or required services of underlying communication systems. We used the presented
results especially to derive and calibrate our analytical and simulation models and tools for load
generation as well as reliability and performance evaluation of real-time video communication.
Key words: multimedia system, quality of service, real-time video encoding, video stream, frame
length, compression, traffic characterization, network load, H.264, video communication
1. INTRODUCTION
Video applications like video telephony, video conferencing, video mail and videoon-demand are emerging areas and are already resp. will soon be part of our working
and private life. These applications impose high throughput requirements on computer
and communication systems, which range from high data rates, due to the voluminous
nature of video data, to temporal constraints, due to the continuous resp. real-time
aspects of video presentation and communication. To quantify and characterize these
requirements we study the output process of video encoders, which represent an
integral subsystem of video applications and produce video streams. As results we
present detailed measurements and traffic characteristics of H.264/AVC-encoded
video streams when leaving the video encoder as network load in order to be
transmitted by a computer network or communication system.
During the last decade we carried out comprehensive and detailed measurements
based on many different video sequences and the well-established H.261, H.263 and
H.264/AVC standards for video encoding [3,4,5,9,10,16]. In the following we present
some examples of the latest results [8,11]. So in section 2 this paper starts with the
presentation of measurements and comparison of the frame lengths of H.263- and
H.264/AVC-encoded video streams as their characteristic attribute. In the following
section 3 the previous measurements are used to look for distributions which reflect
the main characteristics of the observed video application. It follows an investigation
of autocorrelations of frame lengths in section 4. Finally we investigate the influences
of fault tolerance mechanisms integrated into the H.264/AVC-standard. These
mechanisms can be used to reconcile the size of the video stream and the picture
quality of the video to integrate quality of service in computer networks, especially in
case of faulty real-time video communication.
The presented results give some insight into the stochastic process of video
streams as produced by video encoders and can be applied to design and manage video
systems as subsystems of computer and communication systems skilfully. Video
systems as an integral part of multimedia systems are characterized by the computercontrolled integrated generation, manipulation, representation, storage and communication of digital video information. With the derived traffic characteristics one can
develop efficient mechanisms to control the video encoder so that it produces an
appropriate video data stream as network load for transmission. Based on the
characterization of a single video source introduced in this paper one can characterize
any mix of video streams by means of overlaying single stream models. Furthermore
the results are useful to provide a suitable or optimal quality of service of the
communication system for distributed video applications. The measurements and
traffic characterization can support the solution of the problem to dimension,
configure or parameterize video applications or required services of underlying
communication systems. Here, parameterization comprises a careful tuning of video
coding parameters. In a distributed environment with fixed resources our results can
be used to provide the best video quality from a single user point of view or from an
overall network point of view, where a maximum number of distributed users has to
be served. We use the presented results especially to derive and calibrate our
analytical and simulation models and tools for reliability and performance evaluation
of real-time video communication [3,6,13,15,16], especially for our eLearning tool
VideoExplorativ [2] and our load generator UniLoG (Universal Load Generator) [1].
2. MEASUREMENT OF FRAME LENGTHS
The standard H.261 defines two types of frames, i.e. I- and P-frames, while H.263
and H.264 define additional optional frame types [7]. The relative frequency of these
types of frames within a video stream is a considerable parameter and at least the first
picture of a video stream must be an intraframe. More interframes means more
computation, less data and a stronger dependence among different frames. Important
2
aspects for the picture sequence structure in an implementation are time constraints,
buffer size or the desired compression and quality trade-off. Real-time services
distinguish two different coding options, constant bit rate (CBR) and variable bit rate
(VBR). VBR uses fixed coding options (as opposed to CBR), e.g. fixed quantization
level, so the induced data rate is dependent on the motion intensity and entropy of the
coded video. Because of these fluctuations and their effects, e.g. on network
congestion, it is very important to derive adequate models for this type of traffic. So,
our following studies will refer to VBR encoded video streams with a beginning Iframe followed exclusively by P-frames and baseline profile in case of H.264/AVC.
In the following we want to observe the compressed video stream as induced by the
video encoder just using I- and P-frames. As we refer to interactive video applications
with real-time constraints the arrival process of the isochronous video streams is very
regular, so we can restrict ourselves to measurements characterizing the attributes of
interest of the compressed video frames to be transmitted. As is usual in modeling
video sources in the following we assume that length of frames is the only attribute of
interest for the requests. Therefore, we have to measure the length xi (in Bit resp. Byte)
of the i-th video frame leaving the video encoder. Thus, the collected trace of frame
lengths X = {xi | i=1,2,...,n} with n observed frames describes the video stream. This
trace leads to an empirical distribution function, which can be considered as traffic
characterization concerning the marginal distribution function of frame lengths.
Evidently, the distributions of lengths of data units have a strong impact in case of
static resource reservations during data transmission as well as in case of an adaptive
model-based quality-of-service management of communication systems.
During the last decade we carried out comprehensive measurements based on many
different video sequences and the well-established H.261, H.263 and H.264 standards
for video encoding. So now we want to exemplify the latest results [8,11] by
discussing two series of experiments in some more detail, in particular choosing
mainly H.264/AVC encoding of the sequences
 claire, a news announcer in a sequence with very low motion intensity.
 mobile, a video-recording taken from within a driving car and representing a
sequence with periods of rat her high motion intensity.
The frame lengths of the videos with H.264/AVC-encoding show the same
structure as H.263, but are 40-70% smaller. Two representative example traces of Pframe lengths are shown in Fig. 1. The traces of H.264-encodings with different
quantization are nearly interchangeable. For the same quantization steps size H.264encoded videos have a 2-4 dB higher mean PSNR compared to H.263. Detailed
measurements show that this advantage is based on the combination of the deblocking
filter, the extended intra-prediction, the variable size of the predicted blocks and the
motion compensation at subpixel level. The measurements to compare the
compression of the different encoding standards presume the same video quality, e.g.
identical PSNR.
3
Fig. 1. P-Frame lengths for video claire (left) and mobile (right), H263 resp. H.264 encoding
Table 1 shows the different compression achieved by the various encodings. One
can see that from one standard to the next there is nearly a doubling of the
compression factor. For a communication system with a fixed bandwidth this means
that it cam support twice as much video connections from one encoding to the next
higher one.
H.261
H.263
H.264
claire
compression factor
51,2
111,4
216,6
mobile
compression factor
3,1
6,4
14,6
Tab. 1: Compression factors for 261, H.263 and H.264 for the two example videos
The drawings of the empirical frame length distribution for the measured traces
look like astonishingly good approximation of Gaussian distributions. This leads to the
hypothesis that lengths of P-frames produced as a result of H.263- resp. H.264/AVCencodings can be closely approximated by normal distributions.
3. APPROXIMATE DISTRIBUTION OF FRAME LENGTH
In order to investigate the validity of the above hypothesis we repeated
approximation of observed empirical lengths distributions by Gaussian distribution for
all these standards and a variety of video sequences. The level of accuracy achievable
by the approximation was very satisfactory for all samples. As a graphical illustration
of typical observations we refer to Fig. 2 with the H.264/AVC-encoded video
sequences claire and mobile.
4
Fig. 2. Length distribution of P-frames for Claire (left) and mobile (right)
In order to judge the accuracy of the maximum likelihood estimates quantitatively,
by means of a ²-test, we tested the empirical distribution for normal distribution.
Here, classification into 13 partitions (10 degrees of freedom with 2 estimated
parameters) has been carried out. The significance level has been chosen to be =0.01
leading to a significance size of ²0.01,10 = 23.209 which is considerably higher than
the values reached by any of the video sequences observed. So we can accept the
hypothesis of normal distribution and even some more restrictive values of the
significance size would not directly lead to rejection of the hypothesis.
Thus, it seems acceptable to characterize the marginal distribution function of
video frame lengths by approximate normal distributions. An important advantage of
this approach results from the fact that the normal distribution is determined by only
two parameters and it allows a straight-forward derivation of quantiles and other
statistical quantities of the distribution function.
4. AUTOCORRELATION
Based on the results of modeling the one-dimensional marginal distribution of the
frame length we now want to take a closer look at their autocorrelations. Although the
marginal distribution of lengths is a basic and very important measure to characterize
the data streams of video applications, it is necessary to evaluate the autocorrelation
coefficients to get an adequate characterization. This is a result of the fact, that
independence assumptions induce a smoother stream than highly correlated processes
such as fractal traffic processes, and therefore independence assumptions can be
misleading in many cases. So we have to take a closer look at the autocorrelation
function of the traces. The measurements of autocorrelation coefficients (autocorrelation function) are shown in Fig. 3.
5
Fig. 3: The empirical autocorrelation function of the P-frame length traces of H.264 for Claire and mobile
Analyzing the measurement results in Fig. 3 leads to the conclusion, that a non
negligible autocorrelation up to lag  = 30 does exist in the trace mobile, while the
video claire, whose picture contents show the strongest correlation as consequence of
low motions, shows the lowest autocorrelation regarding the frame lengths. The
oscillating behavior of the autocorrelation function can be explained by low motion,
so that the coding decisions of producing predictive difference pictures alternate
periodically. A closer look at the more strongly correlated frame length traces of
mobile and other video sequences reveal changes in the level of required bandwidth,
corresponding to the level of motion within these videos. This leads to the assumption,
that correlation is based on changes in bandwidth requirements and finally on
alteration of motion and picture contents. In most video sequences the major fraction
of a picture content persists for a longer while and the motion intensity is maintained
over several frames, so that in these subintervals no rapid changes in bandwidth
requirements can be observed.
5. FAULT TOLERANCE
Finally we show the influence of the fault tolerance mechanism within H.264/AVC,
where 11% additional I-macroblocks in P-frames were enforced. The effects of single
packet losses at equidistant points of time for such videos can be subdivided into two
categories: If the number of packet losses is small and they have a great distance, the
video quality can recover the faults between the lost frames. Then the faults behave
like a sequence of single losses, which do not affect one another. In the second
situation the number of packet losses is higher, so that the distance between lost
frames is smaller than what is needed to recover from the faults. So the negative
influence of a loss on the video quality lasts longer than the time until the next loss.
Hence, the negative influences aggregate and prevent the video from recovering. Fig.
4 illustrates these two different situations in case of 1% and 10 % packet losses. We
did a lot of measurements concerning various fault tolerance mechanisms in
6
H.246/AVC and many different situations of packet loss [8], e.g. bursts, and these
investigations are still in progress.
Fig. 4. Quality of received video after regular single packet losses
6. SUMMARY AND OUTLOOK
In this paper we presented a selection of our detailed measurements video streams,
especially for H.264/AVC. First we showed traces of P-frame lengths. The observed
distributions of their lengths lead to the hypothesis that frame lengths can be approximated by a normal distribution. So we approximated the observed empirical lengths
distributions by Gaussian distribution for a variety of different video sequences
achieving a very satisfactory level of accuracy in all cases. As independence induces a
smoother stream than highly correlated processes we took a closer look at the autocorrelation noticing that a non negligible autocorrelation exists. Furthermore we
realized that correlation is based on changes in bandwidth requirements and finally on
alteration of motion and picture contents. Hence, the correlation structure is a result of
long-term correlations. Consequently, modeling of the frame length process over a
long time has to regard non negligible autocorrelation structure. Finally we investigated the influence of a fault tolerance mechanism on the video quality.
The derived characteristics of video streams can be applied to develop efficient
mechanisms to control the video codex so that it produces an appropriate video data
stream to be transmitted. They can be used straight forward to compute characteristics
of network load and traffic induced by video communication [15,16]. Moreover, our
results can be applied to manage networks for this type of real-time traffic or to
develop new components for suitable communication systems. In the case of computer
network analyses, our realistic and manageable load characterization is useful, e.g.
when applying analytical models or when executing simulation experiments
[1,6,11,15,16]. Currently the measured traces and distributions regarding H.264/AVC
are integrated into our load generator UniLoG, which can be used within our network
emulator NetEmu [12].
7
REFERENCES
[1] CONG J., Load Specification and Load Generation for Multimedia Traffic Load in Computer
Networks, PhD thesis, Dept. Comp. Science, Univ. Hamburg, 2006, published as: WOLFINGER B.
(Ed.) Berichte aus dem Forschungsschwerpunkt Telekommunikation und Rechnernetze, ShakerVerlag, Vol. 4, Aachen, 2006
[2] FIOLKA K., HEIDTMANN K., WOLFINGER B., Ein eLearning -Werkzeug zur Videokommunikation über simulierte verlustbehaftete Netze, GI-Workshop "Elektronische Unterstützung der Präsenzlehre", GI-Jahrestagung INFORMATIK 2004, Ulm, Sept. 2004
[3] HEIDTMANN K, Evaluation of Video Communication over Packet Switching Networks, 3rd
European Dependable Computing Conference EDCC-3, Prague, 1999
[4] HEIDTMANN K., Fault-Tolerant Real-Time Videocommunication via Best-Effort Networks, 4th
CaberNet Workshop, Pisa, Oct. 2001
[5] HEIDTMANN K., KOHLHAAS C., ZADDACH M., Messung der Netzlast und Bewertung der
Bildqualität bei Videokommunikation über Paketvermittlungsnetze, Proc. ARCS’99, Jena, Oct. 1999
[6] HEIDTMANN K., WOLFINGER B., Analytische Leistungsbewertung von Videokommunikation
gemäss H.261 über verlustbehaftete Paketvermittlungsnetze, 10. GI/ITG-Fachtagung über Messung,
Modellierung und Bewertung von Rechen- und Kommunikationssystemen MMB'99, Trier, Sept.
1999, 87-104
[7] ITU-T Recommendation H.264: Advanced video coding for generic audiovisual services, 2005
[8] KIRITZ M., Untersuchung von Fehlerrobustheitstechniken für H.264/AVC-kodierte Videoströme,
diploma thesis, Dept. Comp. Science, Univ. Hamburg, 2006
[9] KIRITZ M., Untersuchung der Fehlerrobustheit H.264/AVC-kodierter Videoströme bei simulierten
Übertragungsverlusten, MMBnet 2007,4. GI/ITG-Workshop, Hamburg, 13./14. Sept. 2007, in [14]
[10] NORGALL J., Lastanalyse zur Modellierung H.264/AVC-kodierter Videoströme, MMBnet 2005, 3
GI/ITG-Workshop, Hamburg, 8./9. Sept. 2007in [12]
[11] NORGALL J., Verkehrslast und Fehlerverhalten H.264/AVC-kodierter Videoströme in paketbasierten Kommunikationssystemen, diploma thesis, Dept. Comp. Science, Univ. Hamburg, 2006
[12] SCHERPE C., Emulation gekoppelter Rechnernetze mit lastabhängigem Verzögerungs- und Verlustverhalte - Architekturkonzepte, prototypische Realisierung und Fallstudien, PhD thesis, Dept. Comp.
Science, Univ. Hamburg, 2006, published as: WOLFINGER B. (Ed.), Berichte aus dem Forschungsschwerpunkt Telekommunikation und Rechnernetze, Vol. 4, Shaker-Verlag, Aachen, 2006
[13] WOLFINGER B., HEIDTMANN K. (Ed.), Leistungs-, Zuverlässigkeits- und Verlässlichkeitsbewertung von Kommunikationsnetzen und verteilten Systemen, 3. GI/ITG-Workshop MMBnet
2005, 8./9. Sept. Hamburg, Report No. 263, Dept. Comp. Science, Univ. Hamburg, 2005
[14] WOLFINGER B., HEIDTMANN K. (Ed.), Leistungs-, Zuverlässigkeits- und Verlässlichkeitsbewertung von Kommunikationsnetzen und verteilten Systemen, 4. GI/ITG-Workshop MMBnet
2007, 13./14. Sept. Hamburg, Report No. 281, Dept. Comp. Science, Univ. Hamburg, 2007
[15] WOLFINGER B., ZADDACH M., BAI G., HEIDTMANN K., Analytical Modeling of Primary and
Secondary Load as Induced by Video Applications Using UDP/IP, Intern. Symp. on Performance
Evaluation of Computer and Telecommunications Systems, SPECTS 2001, Orlando. Also in:
Computer Communications Journal, 2002
[16] ZADDACH M., Modellierung, Charakterisierung und Transformation von Videoverkehrslasten, PhD
thesis, Dept. Comp. Science, Univ. Hamburg, 2001, published as: WOLFINGER B. (Ed.) Berichte
aus dem Forschungsschwerpunkt Telekommunikation und Rechnernetze, Vol. 2, Shaker-Verlag,
Aachen, 2001
8