Session 1
Michael Bleyer
Stereo Vision Course
VU Stereo Vision (3.0 ECTS/2.0 WS)
Credit eligibility:
• Elective in the master's program “Computergraphik & Digitale
Bildverarbeitung”
• Elective in the master's program “Medieninformatik”
Course website:
• http://www.ims.tuwien.ac.at/teaching_detail.php?ims_id=188.HQK
Lecture dates (9 sessions):
• Fri, March 5 (10:00-11:30)
• Fri, March 12 (10:00-11:30)
• Fri, March 19 (10:00-11:30)
• Fri, March 26 (10:00-11:30)
• Fri, April 16 (10:00-11:30)
• Fri, April 23 (10:00-11:30)
• Wed, April 28 (10:00-11:30)
• Fri, May 7 (10:00-11:30)
• Wed, May 12 (10:00-11:30)
• Oral exam by appointment
Location:
• Seminar room 188/2
Topics Covered in the Lecture (Preliminary List)
Basics:
• 3D Perception, Stereo Matching Problem, Applications
• Stereo Pipeline, Challenges in Stereo Matching, Middlebury benchmark
Local Stereo Methods:
• Adaptive Window Methods
Global Stereo Matching:
• Global optimization:
− Dynamic Programming, Belief Propagation, Move-Making Algorithms
• Stereo Models:
− Smoothness Priors, Occlusion Handling
The Data Term:
• Sampling-insensitive measures, Role of color, Illumination-invariant measures
Segmentation-Based Stereo
Recent work at IMS
Homework
Step 1:
• You will implement block matching for stereo.
Step 2:
• You will make the algorithm of step 1 computationally fast (sliding
window technique).
• Reflects the first branch of stereo research: real-time matching
Step 3:
• You will improve the algorithm of step 1 to deliver high-quality
results.
• Competition:
− What is your ranking in the Middlebury benchmark?
• It is up to you which techniques from this lecture you
use.
• Reflects the second branch of stereo research: high-quality
(but slow) matching
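A minimal sketch of steps 1 and 2, not the reference solution for the homework. It assumes rectified grayscale images stored as lists of rows, a sum-of-absolute-differences (SAD) cost, and the convention that left pixel (x, y) matches right pixel (x - d, y). The function names `block_matching`, `block_matching_fast` and `synthetic_pair` are hypothetical. The fast variant uses a summed-area table per disparity, which is one way to avoid re-summing each window; a running-sum sliding window would serve the same purpose.

```python
import random


def block_matching(left, right, max_disp, radius):
    """Naive SAD block matching: re-sums every window for every disparity."""
    h, w = len(left), len(left[0])
    disp = [[0] * w for _ in range(h)]
    for y in range(radius, h - radius):
        for x in range(radius, w - radius):
            best_cost, best_d = None, 0
            # Only test disparities whose window stays inside the right image.
            for d in range(min(max_disp, x - radius) + 1):
                cost = 0
                for dy in range(-radius, radius + 1):
                    for dx in range(-radius, radius + 1):
                        cost += abs(left[y + dy][x + dx]
                                    - right[y + dy][x + dx - d])
                if best_cost is None or cost < best_cost:
                    best_cost, best_d = cost, d
            disp[y][x] = best_d  # winner-takes-all
    return disp


def block_matching_fast(left, right, max_disp, radius):
    """Same result, but each window sum costs four table lookups."""
    h, w = len(left), len(left[0])
    best = [[float("inf")] * w for _ in range(h)]
    disp = [[0] * w for _ in range(h)]
    for d in range(max_disp + 1):
        # Summed-area table of per-pixel absolute differences at disparity d.
        # Columns x < d have no match; they add zero and are never queried.
        sat = [[0] * (w + 1) for _ in range(h + 1)]
        for y in range(h):
            row_sum = 0
            for x in range(w):
                if x >= d:
                    row_sum += abs(left[y][x] - right[y][x - d])
                sat[y + 1][x + 1] = sat[y][x + 1] + row_sum
        for y in range(radius, h - radius):
            for x in range(radius + d, w - radius):
                y0, y1 = y - radius, y + radius + 1
                x0, x1 = x - radius, x + radius + 1
                cost = sat[y1][x1] - sat[y0][x1] - sat[y1][x0] + sat[y0][x0]
                if cost < best[y][x]:
                    best[y][x] = cost
                    disp[y][x] = d
    return disp


def synthetic_pair(h, w, true_d, seed=0):
    """Random texture (right) and a copy shifted by true_d pixels (left)."""
    rng = random.Random(seed)
    right = [[rng.randrange(256) for _ in range(w)] for _ in range(h)]
    left = [[right[y][x - true_d] if x >= true_d else rng.randrange(256)
             for x in range(w)] for y in range(h)]
    return left, right
```

Both functions leave border pixels at disparity 0. The naive version costs on the order of (window size) operations per pixel and disparity, the fast version a constant number.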
3D Perception
(Figure: human eye separation of ~6.5 cm; brain)
If the left eye sees one 2D image and the right eye sees
another one, our brain will try to
overlay the images to generate a 3D
impression.
How can we use this for watching
3D movies?
(Figure: Left 2D Image, Right 2D Image, 3D View)
Anaglyphs
Two images of complementary
color are overlaid to generate
one image.
Glasses required (e.g.
red/green)
Red filter cancels out red
image component, green filter
cancels out green component
(Anaglyph Image)
Each eye gets one image =>
3D impression
Current 3D cinemas use this
principle. However,
polarization filters are used
instead of color filters.
(Red/Green Glasses)
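The overlay described above can be sketched as follows. This is an illustration under assumptions, not the method used to produce the slide image: it assumes a red/green anaglyph, images as rows of (R, G, B) tuples, and that the left view goes into the red channel and the right view into the green channel. The function name `make_anaglyph` is hypothetical.

```python
def make_anaglyph(left_rgb, right_rgb):
    """Combine two equally sized RGB images into one red/green anaglyph.

    Assumed convention: red channel from the left view, green channel
    from the right view, blue channel set to zero.
    """
    return [
        [(l_px[0], r_px[1], 0) for l_px, r_px in zip(l_row, r_row)]
        for l_row, r_row in zip(left_rgb, right_rgb)
    ]


# Usage: a 1x2 left image and a 1x2 right image.
left = [[(10, 20, 30), (200, 100, 50)]]
right = [[(40, 50, 60), (0, 255, 0)]]
anaglyph = make_anaglyph(left, right)
```

Red-cyan anaglyphs follow the same idea but also copy the blue channel from the right view.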
Shutter Glasses
The display alternates between the left
and right image (i.e. each
even frame shows the left image,
each odd frame shows the right
image).
(Shutter Glasses and 120 Hz Display)
When the left frame is shown, the
shutter glasses close the right
eye, and vice versa.
Requires new displays with a
high frame rate (120 Hz).
Currently pushed by Nvidia
to address the gaming market.
(Nvidia Artwork)
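The frame alternation can be sketched as a simple schedule. This is only an illustration: the function name `shutter_schedule` and the tuple layout are hypothetical, and real displays synchronize the glasses through a hardware signal that is not modeled here.

```python
def shutter_schedule(num_frames, display_hz=120):
    """List (time in seconds, eye that sees the frame, eye that is closed).

    Even frames show the left image, odd frames the right image,
    so each eye receives display_hz / 2 frames per second.
    """
    schedule = []
    for i in range(num_frames):
        shown = "left" if i % 2 == 0 else "right"
        closed = "right" if shown == "left" else "left"
        schedule.append((i / display_hz, shown, closed))
    return schedule


frames = shutter_schedule(4)
per_eye_hz = 120 / 2  # each eye effectively sees 60 frames per second
```

This is why a high display rate is needed: at 120 Hz each eye still gets 60 Hz.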
Autostereoscopic Displays
No glasses required!
Matrix of many transparent lenses
put on the display.
Lenses distort pixels so that left eye
gets a left image and right eye gets
a right image (if you are standing in
a sweet spot) => 3D impression
Novel viewpoint capability:
• You can walk in front of the display and
get a perceptually correct depth
impression depending on your current
viewpoint.
(Philips WOWvx Display)
You will get a demo soon
Free Viewing (No glasses required, but some practice)
How you usually look at the display (no 3D):
Parallel Viewing:
Left Image Right Image
Cross Eye Viewing:
Right Image Left Image
Replace human eyes with a pair of slightly
displaced cameras.
How can we accomplish a fully
automatic 2D to 3D conversion?
Background: Disparity (Small)
Foreground: Disparity (Large)
The challenging part is to compute
the disparity map.
This task is known as the stereo
matching problem.
Stereo matching will be the topic of
this lecture!!!
The disparity map contains sufficient information for
generating a 3D model.
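Why the disparity map suffices can be sketched with the standard relation for a rectified stereo pair, Z = f * B / d. The slides do not state this formula, so the setup here is an assumption: pinhole cameras with focal length f in pixels, baseline B in meters, and principal point (cx, cy). The function names are hypothetical.

```python
def disparity_to_depth(disparity_px, focal_px, baseline_m):
    """Depth Z = f * B / d; zero or negative disparity maps to infinity."""
    if disparity_px <= 0:
        return float("inf")
    return focal_px * baseline_m / disparity_px


def backproject(x, y, disparity_px, focal_px, baseline_m, cx, cy):
    """Return the 3D point (X, Y, Z) for pixel (x, y), or None if d <= 0."""
    if disparity_px <= 0:
        return None
    z = disparity_to_depth(disparity_px, focal_px, baseline_m)
    return ((x - cx) * z / focal_px, (y - cy) * z / focal_px, z)


# Usage with made-up camera values: f = 500 px, B = 0.1 m.
near = disparity_to_depth(20, 500.0, 0.1)  # large disparity: foreground
far = disparity_to_depth(5, 500.0, 0.1)    # small disparity: background
```

Depth is inversely proportional to disparity, which matches the observation that the foreground has large disparity and the background small disparity.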
Applications
(just a few examples)
3D Reconstruction from aerial images