Computer Vision Introduction
Computer Vision Introduction
Computer Vision Introduction
Computer Vision”?
Image Computer
Processing Vision
manipulate image
process image data
data
generate symbolic
generate another
data
image
1
Computer Vision
• Reconstruction
– Recover 3D information from data
• Recognition
– Detect and identify objects
• Understanding
– What is happening in the scene?
Historical overview
• 1920s
– Coding images for transmission by telegraph (3 hours)
• 1960s
– Computers powerful enough to store images and process in
realistic times
– Space program
1960s - 1970s
• Applications
– Medical imaging
– Remote sensing
– Astronomy
Today
– DTV
– Image interpretation
– Biometry
– GIS
– Tele-surgery
System Overview
Captured data
Enhancement
Feature Extraction
Feature Recognition
Image Recognition
Labels
Why study Computer Vision?
• Filter outputs
– essentially form a dot-product between a pattern and an image,
while shifting the pattern across the image
– strong response -> image locally looks like the pattern
– e.g. derivatives measured by filtering with a kernel that looks like a
big derivative (bright bar next to dark bar)
Convolve this image
To get this
• Digital libraries
– Find me the pic of a certain posture from skating video
• Surveillance
– Warn me if there is a mugging in the grove
• HCI
– Do what I show you
• Military
– Shoot this, not that
What are the problems in recognition?
• Which bits of image should be recognised together?
– Segmentation.
• How can objects be recognised without focusing on detail?
– Abstraction.
• How can objects with many free parameters be
recognised?
– No popular name, but it’s a crucial problem anyhow.
• How do we structure very large modelbases?
– again, no popular name; abstraction and learning come into this
Segmentation
http://www.ri.cmu.edu/projects/project_320.html
Tracking
• Rigid Motion
• Reconstruction:
2D 3D
2D 3D
2D+3D
Clouds: Interpolation
Clouds: Reconstruction
Tongue: Reconstruction
More..
• Tongue Tracking
• Face Tracking
• Bio-medical:
Tongue-head Tongue-skull
Few More..
• ACCESS
• Stereo-Face Tracker
Project on Image Guided Surgery:
A collaboration between the MIT AI Lab and Brigham
and Women's Surgical Planning Laboratory
• The Computer Vision Group of the MIT Artificial
Intelligence Lab has been collaborating closely for several
years with the Surgical Planning Laboratory of Brigham
and Women's Hospital. As part of the collaboration, tools
are being developed to support image guided surgery. Such
tools will enable surgeons to visualize internal structures
through an automated overlay of 3D reconstructions of
internal anatomy on top of live video views of a patient.
We are developing image analysis tools for leveraging the
detailed three-dimensional structure and relationships in
medical images. Sample applications are in preoperative
surgical planning, intraoperative surgical guidance,
navigation, and instrument tracking.
Figures by kind permission of Eric Grimson; further information can be
obtained from his web site http://www.ai.mit.edu/people/welg/welg.html.
Figures by kind permission of Eric Grimson; further information can be
obtained from his web site http://www.ai.mit.edu/people/welg/welg.html.
Some Results
• MRI data
• Rotate Model
• Peel