Article info:
Available online 6 September 2013.
Keywords: High dynamic range video; Stereo camera setup; Stereo matching.

Abstract:
To overcome the dynamic range limitations in images taken with regular consumer cameras, several methods exist for creating high dynamic range (HDR) content. Current low-budget solutions apply a temporal exposure bracketing which is not applicable for dynamic scenes or HDR video. In this article, a framework is presented that utilizes two cameras to realize a spatial exposure bracketing, for which the different exposures are distributed among the cameras. Such a setup allows for HDR images of dynamic scenes and HDR video due to its frame-by-frame operating principle, but faces challenges in the stereo matching and HDR generation steps. Therefore, the modules in this framework are selected to alleviate these challenges and to properly handle under- and oversaturated regions. In comparison to existing work, the camera response calculation is shifted to an offline process and a masking with a saturation map before the actual HDR generation is proposed. The first aspect enables the use of more complex camera setups with different sensors and provides robust camera responses. The second one makes sure that only necessary pixel values are used from the additional camera view, and thus, reduces errors in the final HDR image. The resulting HDR images are compared with the quality metric HDR-VDP-2 and numerical results are given for the first time. For the Middlebury test images, an average gain of 52 points on a 0-100 mean opinion score is achieved in comparison to temporal exposure bracketing with camera motion. Finally, HDR video results are provided.
© 2013 Elsevier B.V. All rights reserved.
http://dx.doi.org/10.1016/j.image.2013.08.016
M. Bätz et al. / Signal Processing: Image Communication 29 (2014) 191-202
is only relevant for computer-generated content, and hence, is not applicable for improving images of a real scene. The third class of HDR acquisition is a low-budget solution. In recent years, many consumer cameras have started to offer exposure bracketing as a method to create HDR images of static scenes. For exposure bracketing, a series of LDR pictures is taken with different exposure times one after the other in order to estimate the camera response, convert the pixel values into radiance values, and then merge them together to generate the HDR output. Two well-known approaches are the algorithm by Debevec and Malik [3] and the method by Robertson et al. [4]. The different exposures can also be realized by using different apertures or neutral density (ND) filters instead of exposure times. However, this approach is not applicable for capturing dynamic scenes or even HDR video due to the temporal fashion of the exposure bracketing, since the scene is changing over time. An example for a static scene without and with camera motion is illustrated in Fig. 1. It can be seen that temporal exposure bracketing completely fails when there is motion. There are approaches which compensate for the motion [5], but they have the drawback of not being able to easily apply ND-filters, and hence, yield artifacts from different motion blur or depth of field.

Fig. 1. Temporal exposure bracketing without (top) and with (bottom) camera motion.

The goal of this work is therefore to allow for HDR image acquisition of dynamic scenes and HDR video. The proposed framework utilizes a stereo camera setup to realize a spatial exposure bracketing where the differently exposed images are distributed among the two views. Due to this, all necessary information to create an HDR image is available at the same time instant. This stereo setup leads to two challenges. On the one hand, HDR image reconstruction requires perfect alignment of the views after image warping. Therefore, a robust disparity estimation is needed. On the other hand, typical stereo matching algorithms rely on the brightness constancy assumption, which does not hold for differently exposed views. Each stage of the proposed framework is built to properly handle the just mentioned challenges and also under-/oversaturated regions. Instead of applying a stereo matching approach, it would also be possible to utilize time-of-flight cameras in order to obtain the disparity maps. However, they can only generate low resolution disparity maps, increase the complexity of the setup, and are still considerably expensive.

For testing purposes, datasets from the Middlebury Stereo Vision Page [6] and two self-recorded datasets are used. Besides visual quality evaluation, a suitable quality metric is applied with respect to a single-view HDR reference image.

The key novel contributions are the new one-stage framework compared to the two-stage frameworks in the literature [7,8] and the introduction of a saturation map to reduce errors in the final HDR image. Beyond this, an objective quality assessment, which is directly computed on the HDR images, is done for the first time.

The remainder of this article is structured as follows. The upcoming section presents previous work, whereas Section 3 covers the layout and the details of the framework and its modules. Experimental results are then given in Section 4. Finally, Section 5 concludes this contribution.

2. Previous work

This contribution is mainly inspired by the work in [7,8], where stereo matching is done in two stages and the camera responses are computed from the matches found in the first stage. These camera responses are then used for converting the input images to radiance space, and a second stereo matching for disparity refinement is performed. In this work, however, unlike the existing algorithms, the camera responses are not estimated from the stereo images. This step is done only once in an offline step using temporal exposure bracketing on a single-view setup. For that reason, the disparity maps can be computed directly from the radiance space images (RSI) and the need for a second stereo matching stage is removed. In addition, the proposed offline step allows for camera setups with different sensors since each camera can be calibrated separately. Besides the novel one-stage framework, this work also presents the concept of applying a so-called saturation map before the actual HDR generation. In addition to that, objective quality evaluation results are provided for the first time. Furthermore, many ideas for the disparity estimation are taken from the currently second-placed stereo matching algorithm [9] on the Middlebury Stereo Vision Page [6].

3. Proposed stereo high dynamic range imaging framework

Before the particular stages of the proposed stereo high dynamic range (Stereo-HDR) imaging framework are discussed in detail, an overview of the framework is given. A schematic layout in the form of a block diagram is provided in Fig. 2. Basically, the framework can be split into the main workflow and an offline step.

The new offline step is responsible for calculating the camera response curves and weighting functions (see Section 3.4), which are both necessary for the conversion of the LDR images to radiance space. This calibration step has to be done only once. In general, the camera response curves for both cameras in the setup have to be calculated. However, when using the same camera hardware for both views, it is sufficient to conduct the calibration only for one view. Therefore, multi-exposed LDR images of a single view serve as input for this step. Due to the temporal exposure bracketing applied in this offline step, a static scene is required for the camera calibration. This approach avoids errors introduced by disparity estimation and image warping, and it also allows for a camera setup with different hardware.

The main workflow comprises six stages and, in the case of a video input, operates frame by frame. The input for this part consists of the LDR images, where each image represents a view taken from a different position with a different exposure. From these images the output HDR image is created.

The first stage is responsible for the geometric calibration of the input images and consists of undistorting and rectifying the input (see Section 3.1). In the next stage, the LDR images are converted to radiance space images. Here, the pixel values of the differently exposed images are brought into a common value space where properly exposed pixels in all images yield approximately the same radiance value. This is done to ease the finding of correct correspondences, which is performed by the stereo matching algorithm. For proper conversion, all the side information like camera response curves, exposure times, apertures, ISO speeds, and weighting functions is necessary. The third stage deals with the disparity estimation. Since it contains a lot of modules, it is described in detail in Section 3.2. The output of this stage is a set of disparity maps. The following stage handles the image warping, for which the calibrated LDR image to be warped and its corresponding disparity map are required. This is discussed more precisely in Section 3.3. In the last stage, the HDR generation is conducted, which requires three different inputs. The first one is the calibrated LDR image of the view that is selected to be enhanced to an HDR image. For best results, this view should contain the least under- or overexposed regions. The other two inputs are the associated warped LDR image and the necessary side information which was already needed for the radiance space conversion. More details on the HDR generation are given in Section 3.5.
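The radiance-space conversion and weighted merging sketched in this overview can be illustrated with a small numpy example. The linear inverse response, the hat-shaped weighting, and the toy exposure times below are illustrative assumptions, not the calibrated curves and weights produced by the offline step:

```python
import numpy as np

def to_radiance(ldr, exposure_time, inv_response):
    # radiance space image (RSI): E = g^{-1}(z) / exposure_time
    return inv_response[ldr] / exposure_time

def merge_hdr(radiance_maps, ldr_images, weight):
    # weighted average of the RSIs; weights are looked up per pixel value
    num = np.zeros(radiance_maps[0].shape)
    den = np.zeros(radiance_maps[0].shape)
    for rsi, ldr in zip(radiance_maps, ldr_images):
        w = weight[ldr]
        num += w * rsi
        den += w
    return num / np.maximum(den, 1e-8)

# hypothetical linear inverse response and hat-shaped weighting
inv_response = np.linspace(0.0, 1.0, 256)        # stands in for g^{-1}
weight = 1.0 - np.abs(np.linspace(-1.0, 1.0, 256))
weight[0] = weight[255] = 0.0                    # exclude saturated codes

short = np.array([[40, 200]], dtype=np.uint8)    # 5 ms exposure (toy 1x2 image)
long_ = np.array([[160, 255]], dtype=np.uint8)   # 20 ms exposure
rsis = [to_radiance(short, 0.005, inv_response),
        to_radiance(long_, 0.020, inv_response)]
hdr = merge_hdr(rsis, [short, long_], weight)
# the saturated long-exposure pixel (255) gets zero weight, so the HDR
# value there comes from the short exposure alone
```

Because both exposures map a properly exposed scene point to approximately the same radiance value, stereo matching on such RSIs can again rely on (near) brightness constancy.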
and right epipolar lines to merge, and thus, the search for the proper matching point only has to be done in the same image row, easing the stereo matching. To realize this calibration step, the intrinsic and extrinsic camera parameters have to be known or estimated.

After the rectification of both views, the disparity maps can be computed.
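Why rectification simplifies the search can be shown with a minimal sketch: once both views are row-aligned, a winner-take-all scan over candidate disparities within the same row suffices. Plain absolute differences are used here for brevity; the framework in this article instead uses the exposure-invariant ZNCC with cost aggregation:

```python
import numpy as np

def disparity_row_wta(left_row, right_row, max_disp):
    # After rectification, the correspondence of left pixel x lies at
    # x - d in the same right-image row, so a 1-D search is enough.
    width = len(left_row)
    disp = np.zeros(width, dtype=int)
    for x in range(width):
        costs = np.full(max_disp + 1, np.inf)
        for d in range(max_disp + 1):
            if x - d >= 0:
                costs[d] = abs(float(left_row[x]) - float(right_row[x - d]))
        disp[x] = int(np.argmin(costs))  # winner-take-all over disparities
    return disp

# toy rectified row pair: the right view equals the left view shifted by
# a disparity of two pixels (zero padding at the border)
left = np.array([10, 20, 30, 40, 50])
right = np.array([30, 40, 50, 0, 0])
disp = disparity_row_wta(left, right, max_disp=3)
```

For the pixels whose correspondence is actually visible, the recovered disparity is the simulated shift of two pixels.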
the radiance space images are used for the matching, there can still be variations in corresponding values, and thus, a robust matching cost is preferable.

According to [12], matching costs can be grouped into parametric costs, nonparametric costs, and mutual information. Common parametric costs are the sum of absolute differences (SAD), the sampling-insensitive absolute difference of Birchfield and Tomasi (BT), or the normalized cross-correlation (NCC). A variant of the NCC is the zero-mean normalized cross-correlation (ZNCC), which can compensate for both changes in gain and offset inside the correlation window. For that reason, it is a good choice for matching images of different exposure. However, because of the fixed support window size, it produces a fattening effect near object boundaries [13]. There is also an adaptive normalized cross-correlation (ANCC) [13] which tries to decrease the fattening effect. In comparison to parametric costs, nonparametric costs are not based on intensity values, but on the local ordering. That is why they can make up for radiometric changes well as long as the ordering is not affected. Common matching costs of this class are the rank filter and the census transform [14]. However, they are only manipulating the input, and hence, are not directly matching costs. For the census transform, the actual matching cost is computed with the aid of the Hamming distance.

According to the extensive results in [12], the census transform and the ZNCC are among the best matching costs when it comes to exposure variations. In their experiments, the census transform outperforms the ZNCC. For our case, though, the ZNCC turned out to deliver slightly better results, and hence, it was chosen for the final setup of this work. Its equation is as follows:

C(p, d) = \frac{\sum_{q \in N_p} \left( I_L(q) - \bar{I}_L(p) \right) \left( I_R(q - (d, 0)) - \bar{I}_R(p - (d, 0)) \right)}{\sqrt{\sum_{q \in N_p} \left( I_L(q) - \bar{I}_L(p) \right)^2 \, \sum_{q \in N_p} \left( I_R(q - (d, 0)) - \bar{I}_R(p - (d, 0)) \right)^2}} \quad (1)

This step is primarily done to smooth the DSI, and thus, to get more reliable and robust costs. However, the aggregation should not be done across object boundaries. The cost aggregation can be realized as an averaging over a 2-D support window in each slice separately or as an averaging over a 3-D window. For this work, only cost aggregations with a 2-D support window were taken further into account. There are several approaches like square window, shiftable window, adaptive weight, or adaptive window [17]. Since adaptive window approaches can fit the input data very well, a particular adaptive window method, called cross-based aggregation [18,9], is selected in this contribution for cost aggregation purposes. The work in [9] also shows that the improved version of the cross-based aggregation can achieve similar or slightly better results compared to the adaptive weight aggregation. Based on a comparison conducted in [17], the adaptive weight aggregation is said to be one of the best approaches. Furthermore, the cross-based aggregation needs less memory and computation time and creates a support window for each pixel that can be used for later steps as well.

In principle, the cross-based aggregation of [9] is divided into the cross construction and the actual cost aggregation. These two steps are illustrated in Figs. 6 and 7. In the first step, for each pixel p a cross is set up, which is highlighted in blue, and its four arms are labelled left, right, upper, and bottom arm. In contrast to the actual matching, this cross creation is performed on the RGB input images. Each arm length is calculated with the help of three rules. Given p, the left arm extends up to a pixel p_l which is the last point satisfying the following three rules:

D_c(p_l, p) < \tau_1 \quad \text{and} \quad D_c(p_l, p_l + (1, 0)) < \tau_1 \quad (2)

with

D_c(p_l, p) = \max_{i \in \{R, G, B\}} \left| I_i(p_l) - I_i(p) \right|
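A direct numpy transcription of Eq. (1) over a single correlation window illustrates the gain and offset invariance that motivates the choice of the ZNCC. This is a sketch, not the article's optimized implementation:

```python
import numpy as np

def zncc(left_patch, right_patch):
    # zero-mean normalized cross-correlation, cf. Eq. (1):
    # mean subtraction removes offset changes, and normalizing by the
    # window energies removes gain changes between the two windows
    l = left_patch.astype(float) - left_patch.mean()
    r = right_patch.astype(float) - right_patch.mean()
    denom = np.sqrt((l * l).sum() * (r * r).sum())
    if denom == 0.0:
        return 0.0  # textureless window: correlation undefined
    return float((l * r).sum() / denom)

a = np.array([[1.0, 2.0], [3.0, 4.0]])
b = 2.0 * a + 5.0   # the same patch under a gain and offset change
score = zncc(a, b)  # close to 1.0 despite the exposure-like change
```

This invariance is exactly what makes the ZNCC usable on differently exposed views, where corresponding windows differ by an approximately affine intensity change.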
The reason for giving brighter values a higher weighting factor compared to darker values is that their SNR is higher. As a consequence, noise in dark regions is suppressed whereas bright intensity values gain a bigger impact. The reason for suppressing values at the boundaries, however, comes from the fact that camera responses are typically most sensitive in the middle of the value range. In other words, the mapping is most accurate near the center of the pixel value range. For better illustration, the camera response curves for the three RGB channels and the composite weighting function of the test dataset IIS Jumble are shown in Fig. 9. The value range of the weighting function is between 0 and slightly above 1.

4. Experimental results

This section is divided into three parts. The first part gives an overview of the simulation setup. The second part is about visual quality evaluation and the last one states some objective quality results.

4.1. Simulation setup

Four different static test datasets and a dynamic test dataset were selected in this contribution. The first three are called Aloe, Moebius, and Art and are provided by the Middlebury Stereo Vision Page [6,22]. They consist of three different exposures for three different illuminations and a total of seven views in three possible resolutions. Furthermore, ground truth disparity maps, created by a structured light approach [23], are available for the second most left and right views (view1 and view5). All tests in this work were done on those views having a ground truth disparity map. For each set, a fixed illumination was selected and the medium resolutions of the images were taken. The illuminations Illum3, Illum2, and Illum1 were chosen for the datasets Aloe, Moebius, and Art, respectively. The first dataset has a resolution of 641 × 555 while the other two have a resolution of 695 × 555. The disparity search ranges were set to 100, 120, and 120 pixels, respectively. Moreover, the different exposure times for Aloe are 125 ms, 500 ms, and 2000 ms, whereas for Moebius and Art they are 250 ms, 1000 ms, and 4000 ms. The fourth dataset, named IIS Jumble, was created in the course of this work and is composed of 15 different views arranged in a 5 × 3 array. Each view provides five different exposures of the scene with a resolution of 2560 × 1920. The exposure times are 5 ms, 30 ms, 61 ms, 122 ms, and 280 ms. The testing on the fourth dataset was conducted only on view12 and view13, which were downsampled by a factor of 2 in each dimension. After rectification, they were cropped to 1200 × 900. Finally, the disparity search range was selected to be 100 pixels.

In general, to achieve different exposures for each view, the usage of different ND-filters is required in the case of a dynamic scene or video. However, since the first four datasets in this work show static scenes, the utilization of different exposure times was possible in order to simplify the test setup. It should also be mentioned that all HDR images in this contribution are tone-mapped and gamma corrected for displaying with the help of pfstools. The chosen tone mapping operator is the contrast mapping by Mantiuk [24] with 0.5 as contrast scaling factor. As an example, the applied left and right views of the Art and IIS Jumble datasets are depicted in Fig. 10.

Aside from these static scene setups, a dynamic scene setup was simulated and an HDR video was created. The dynamic scene LMS Pillars was artificially generated using the open-source 3-D computer graphics software Blender [25]. It contains moving objects with different textures, dark areas, and overexposed spots. The scene is lighted by two distinct light sources and image-based lighting, which employs an HDR environment map from [26]. In order to simulate a stereo camera setup, the scene was rendered with different exposures at two camera positions. The first frame of both the left and right camera together with the tone-mapped HDR sequence is provided in Fig. 11. Unlike the static scenes, the applied tone-mapping operator is the photographic tone reproduction by Reinhard et al. [27]. The complete videos can be found at [28].

For all the tests, the left view is considered to be the target view for HDR reconstruction. Therefore, it is assumed to have the least under- or oversaturated regions. For the Middlebury datasets, the right view was selected to be a brighter version, whereas for the last two datasets a darker version was chosen.

4.2. Visual quality evaluation

In order to evaluate the test results, reference images are required which show the best case. For this setup, it is possible to create two different references: a single-view, and thus, overall best case and a stereo best case which employs cross-checked ground truth disparity maps.
Fig. 10. Stereo, multi-exposure input images depicting Art (a,b) and IIS Jumble (c,d): (a) Left view with normal exposure, (b) right view with long
exposure, (c) left view with normal exposure, and (d) right view with short exposure.
Fig. 12. Tone-mapped HDR images for different test cases and datasets. Datasets from top to bottom: Aloe, Moebius, Art, zoomed version of Art, and IIS
Jumble. Test cases from left to right: single-view reference (SV), single-view with camera movement (SVCM), stereo employing cross-checked ground
truth disparity maps (Stereo GT-DM), and proposed stereo setup. No images are given for the camera movement and the ground truth disparity map case
for IIS Jumble.
enhance, or attenuate artifacts, and thus, distort the results.

To substantiate the previously obtained visual results, mean opinion scores are predicted with the HDR-VDP-2 and shown in Table 1. It can be seen that the single-view case completely fails when there is a camera movement. For this case, the average MOS drops to 41 points. Furthermore, the proposed refinement including cross-checking, subpixel enhancement, median filtering, minimum interpolation, and saturation masking can even achieve slightly better results compared to simply employing a cross-checked ground truth disparity map. That means that even with ground truth data, errors are introduced during image warping, especially at object boundaries. In addition, the ground truth disparity maps are normally not available anyhow. Since the IIS Jumble dataset provides no ground truth disparities, no comparison could be made.
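The effect of the saturation masking evaluated here can be illustrated with a small sketch: values from the warped second view are taken only where the target view itself is under- or oversaturated, so warping errors cannot leak into well-exposed regions. The thresholds and the toy radiance values below are assumptions for illustration, not the article's actual parameters:

```python
import numpy as np

def saturation_map(ldr, low=5, high=250):
    # True marks under-/oversaturated pixels of the target view;
    # the thresholds are illustrative only
    return (ldr <= low) | (ldr >= high)

def masked_merge(target_rsi, warped_rsi, mask):
    # use the warped second view only inside the saturation mask
    return np.where(mask, warped_rsi, target_rsi)

target_ldr = np.array([[0, 120, 255]], dtype=np.uint8)
mask = saturation_map(target_ldr)
hdr = masked_merge(np.array([[0.0, 1.2, 9.9]]),   # target-view radiance (toy)
                   np.array([[0.4, 7.7, 2.5]]),   # warped-view radiance (toy)
                   mask)
```

Only the first and last pixels, which the target view cannot represent, receive values from the warped view; the well-exposed middle pixel is left untouched.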
Table 1
Mean opinion scores, where 0 is the worst case and 100 the best, for all four datasets compared to their single-view HDR references. The last column gives
average values for the 3 Middlebury datasets. SV denotes single-view and GT-DM means ground truth disparity map. The best values are highlighted.
The MOS values for the Middlebury datasets vary less than those for the self-recorded dataset. This is based on the fact that a properly exposed LDR image of the Middlebury datasets covers a large portion of the dynamic range of the scene. However, because of the bright light bulb in IIS Jumble, bigger portions of that image require additional information from the other view.

The reason for two of the Middlebury images having slightly better MOS values for no cost aggregation is also based on the fact that only little information is needed and, therefore, filling holes in the disparity map by cost aggregation does not yield a gain. However, for the IIS Jumble dataset the reconstruction of disparity values in large under- or oversaturated regions is necessary. These regions lead to holes in the disparity map which have to be filled by all means in order to achieve good results. Due to the big benefit for datasets with large under- or oversaturated regions, the cross-based aggregation is recommended to be always used together with the proposed refinement steps. On average, the proposed framework achieves a gain of 52 points in the MOS compared to the single-view case with camera motion for the Middlebury datasets. The peak gain is 55.64 points.

5. Conclusion

In this contribution, a stereo high dynamic range imaging framework was presented, which is capable of handling dynamic scenes and video. Beyond that, the proposed framework is easily extendable to an arbitrary number of cameras. Nevertheless, the stereo case can be regarded as an important case, as it is a good representative for off-the-shelf low-cost solutions. The main idea of this framework is to apply a spatial instead of a temporal exposure bracketing. However, this leads to two challenges. On the one hand, typical stereo matching algorithms rely on the brightness constancy assumption, which does not hold for differently exposed views. On the other hand, HDR image reconstruction requires perfectly aligned images. The modules in the proposed framework were selected to alleviate these challenges. First of all, the camera responses are computed in an offline step, which allows for robust conversion to radiance space and makes it possible to employ camera setups with different hardware elements. In the main workflow, the input images are rectified before performing the disparity estimation. As matching cost for the stereo matching, the exposure-invariant ZNCC is applied. Then, instead of simply matching the differently exposed LDR images, the matching is performed on the grayscale radiance space images. Furthermore, a cross-based cost aggregation, a local winner-take-all optimization, and a set of disparity refinement steps are done. After the disparity estimation, a backward image warping and the presented HDR generation are conducted. The introduction of a newly proposed masking step with the aid of a saturation map during the HDR generation stage leads to improved results. Testing was done on the three Middlebury datasets Aloe, Moebius, and Art and on the two self-recorded datasets IIS Jumble and LMS Pillars. Besides visual quality assessment, objective quality was evaluated with the help of the quality metric HDR-VDP-2, which yields a predicted mean opinion score. To the best of our knowledge, an objective quality assessment of the HDR images, obtained by a multi-view approach, is done for the first time in this contribution. The proposed framework yields an average gain of 52 points in the MOS compared to single-view HDR with camera motion for the Middlebury datasets.

Due to the large variety of options for disparity estimation, image warping, and HDR reconstruction, there are a lot of possibilities for improvements and future work. A first idea would be to use a semiglobal or global disparity optimization approach instead of the local one. The ZNCC could also be supplied with an adaptive support window to reduce fattening effects. Although the framework at hand is ready for more than two views, all tests and evaluations so far were limited to the stereo case. To further improve image quality, a cancellation of small erroneous regions in the disparity maps could be applied. Research on this multi-view HDR topic is ongoing.

Appendix A. Supplementary material

Supplementary data associated with this article can be found in the online version at http://dx.doi.org/10.1016/j.image.2013.08.016.

References

[1] E. Reinhard, G. Ward, S. Pattanaik, P. Debevec, High Dynamic Range Imaging: Acquisition, Display, and Image-Based Lighting (The Morgan Kaufmann Series in Computer Graphics), Morgan Kaufmann, 2005.
[2] J. Guo, S. Sonkusale, A high dynamic range CMOS image sensor for scientific imaging applications, IEEE Sensors Journal 9 (10) (2009) 1209-1218.
[3] P.E. Debevec, J. Malik, Recovering high dynamic range radiance maps from photographs, in: Proceedings of the 24th Annual Conference on Computer Graphics and Interactive Techniques, SIGGRAPH '97, ACM Press/Addison-Wesley Publishing Co., New York, NY, USA, 1997, pp. 369-378.
[4] M. Robertson, S. Borman, R. Stevenson, Dynamic range improvement through multiple exposures, in: Proceedings of International Conference on Image Processing (ICIP), vol. 3, Kobe, Japan, 1999, pp. 159-163.
[5] S.B. Kang, M. Uyttendaele, S. Winder, R. Szeliski, High dynamic range video, ACM Transactions on Graphics 22 (3) (2003) 319-325.
[6] Middlebury Stereo Vision Page, http://vision.middlebury.edu/stereo/.
[7] N. Sun, H. Mansour, R. Ward, HDR image construction from multi-exposed stereo LDR images, in: Proceedings of the 17th IEEE International Conference on Image Processing (ICIP), Hong Kong, China, 2010, pp. 2973-2976.
[8] A. Troccoli, S.B. Kang, S. Seitz, Multi-view multi-exposure stereo, in: Proceedings of the Third International Symposium on 3D Data Processing, Visualization, and Transmission, Chapel Hill, North Carolina, USA, 2006, pp. 861-868.
[9] X. Mei, X. Sun, M. Zhou, S. Jiao, H. Wang, X. Zhang, On building an accurate stereo matching system on graphics hardware, in: Proceedings of IEEE International Conference on Computer Vision Workshops (ICCV Workshops), Barcelona, Spain, 2011, pp. 467-474.
[10] F. Kainz, R. Bogart, Technical Introduction to OpenEXR, Industrial Light & Magic (February 2009).
[11] A. Bovik, Handbook of Image and Video Processing, Academic Press, San Diego, 2000.
[12] H. Hirschmüller, D. Scharstein, Evaluation of stereo matching costs on images with radiometric differences, IEEE Transactions on Pattern Analysis and Machine Intelligence 31 (9) (2009) 1582-1599.
[13] Y.S. Heo, K.M. Lee, S.U. Lee, Robust stereo matching using adaptive normalized cross-correlation, IEEE Transactions on Pattern Analysis and Machine Intelligence 33 (4) (2011) 807-822.
[14] K. Ambrosch, C. Zinner, H. Leopold, A miniature embedded stereo vision system for automotive applications, in: Proceedings of IEEE 26th Convention of Electrical and Electronics Engineers in Israel (IEEEI), Eilat, Israel, 2010, pp. 786-789.
[15] M. Bleyer, S. Chambon, Does color really help in dense stereo matching?, in: Proceedings of International Symposium on 3D Data Processing, Visualization and Transmission (3DPVT), Paris, France, 2010, pp. 1-8.
[16] D. Scharstein, R. Szeliski, R. Zabih, A taxonomy and evaluation of dense two-frame stereo correspondence algorithms, in: Proceedings of IEEE Workshop on Stereo and Multi-Baseline Vision (SMBV), Kauai, Hawaii, 2001, pp. 131-140.
[17] M. Gong, R. Yang, L. Wang, M. Gong, A performance study on different cost aggregation approaches used in real-time stereo matching, International Journal of Computer Vision 75 (2007) 283-296.
[18] K. Zhang, J. Lu, G. Lafruit, Cross-based local stereo matching using orthogonal integral images, IEEE Transactions on Circuits and Systems for Video Technology 19 (7) (2009) 1073-1079.
[19] H. Hirschmüller, Stereo processing by semiglobal matching and mutual information, IEEE Transactions on Pattern Analysis and Machine Intelligence 30 (2) (2008) 328-341.
[20] D. Tian, P.-L. Lai, P. Lopez, C. Gomila, View synthesis techniques for 3D video, in: Proceedings of SPIE Optical Engineering Applications, International Society for Optics and Photonics, San Diego, California, USA, 2009, p. 74430T.
[21] R. Mantiuk, G. Krawczyk, R. Mantiuk, H.-P. Seidel, High dynamic range imaging pipeline: perception-motivated representation of visual content, in: B.E. Rogowitz, T.N. Pappas, S.J. Daly (Eds.), Human Vision and Electronic Imaging XII, Proceedings of SPIE, vol. 6492, San Jose, California, USA, 2007.
[22] H. Hirschmüller, D. Scharstein, Evaluation of cost functions for stereo matching, in: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Minneapolis, Minnesota, USA, 2007, pp. 1-8.
[23] D. Scharstein, R. Szeliski, High-accuracy stereo depth maps using structured light, in: Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 1, Madison, Wisconsin, USA, 2003, pp. I-195-I-202.
[24] R. Mantiuk, K. Myszkowski, H.-P. Seidel, A perceptual framework for contrast processing of high dynamic range images, in: Proceedings of the Second Symposium on Applied Perception in Graphics and Visualization (APGV), ACM Press, New York, NY, USA, 2005, pp. 87-94.
[25] Blender, http://www.blender.org/.
[26] HDRI-Hub.com, http://www.hdri-hub.com/free-samples/.
[27] E. Reinhard, M. Stark, P. Shirley, J. Ferwerda, Photographic tone reproduction for digital images, ACM Transactions on Graphics 21 (3) (2002) 267-276.
[28] HDR Video: LMS Pillars, http://www.lms.lnt.de/forschung/arbeitsgebiete/video/vanal/hdrvideo.php.
[29] R. Mantiuk, K.J. Kim, A.G. Rempel, W. Heidrich, HDR-VDP-2: a calibrated visual metric for visibility and quality predictions in all luminance conditions, ACM Transactions on Graphics 30 (4) (2011) 40:1-40:14.
[30] Z. Wang, E. Simoncelli, A. Bovik, Multiscale structural similarity for image quality assessment, in: Proceedings of Conference Record of the 37th Asilomar Conference on Signals, Systems and Computers, vol. 2, Pacific Grove, California, USA, 2003, pp. 1398-1402.
[31] H. Yeganeh, Z. Wang, Objective assessment of tone mapping algorithms, in: Proceedings of the 17th IEEE International Conference on Image Processing (ICIP), 2010, pp. 2477-2480.
[32] H. Yeganeh, Z. Wang, Objective quality assessment of tone-mapped images, IEEE Transactions on Image Processing 22 (2) (2013) 657-667.