Opencv Tutorials 2.4.3
Opencv Tutorials 2.4.3
Opencv Tutorials 2.4.3
Release 2.4.3
November 03, 2012
CONTENTS
1 Introduction to OpenCV 5
1.1 Installation in Linux . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
1.2 Using OpenCV with gcc and CMake . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
1.3 Using OpenCV with Eclipse (plugin CDT) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
1.4 Installation in Windows . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
1.5 How to build applications with OpenCV inside the Microsoft Visual Studio . . . . . . . . . . . . . . 26
1.6 Introduction into Android Development . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 36
1.7 OpenCV4Android SDK . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 68
1.8 Android development with OpenCV . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 78
1.9 Installation in iOS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 92
1.10 Load and Display an Image . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 93
1.11 Load, Modify, and Save an Image . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 95
1.12 How to write a tutorial for OpenCV? . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 98
2 core module. The Core Functionality 107
2.1 Mat - The Basic Image Container . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 109
2.2 How to scan images, lookup tables and time measurement with OpenCV . . . . . . . . . . . . . . . 115
2.3 Mask operations on matrices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 120
2.4 Adding (blending) two images using OpenCV . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 122
2.5 Changing the contrast and brightness of an image! . . . . . . . . . . . . . . . . . . . . . . . . . . . 124
2.6 Basic Drawing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 128
2.7 Random generator and text with OpenCV . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 133
2.8 Discrete Fourier Transform . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 137
2.9 File Input and Output using XML and YAML les . . . . . . . . . . . . . . . . . . . . . . . . . . . 141
2.10 Interoperability with OpenCV 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 148
3 imgproc module. Image Processing 153
3.1 Smoothing Images . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 158
3.2 Eroding and Dilating . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 163
3.3 More Morphology Transformations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 169
3.4 Image Pyramids . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 175
3.5 Basic Thresholding Operations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 181
3.6 Making your own linear lters! . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 189
3.7 Adding borders to your images . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 193
3.8 Sobel Derivatives . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 197
3.9 Laplace Operator . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 203
3.10 Canny Edge Detector . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 207
3.11 Hough Line Transform . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 212
3.12 Hough Circle Transform . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 218
i
3.13 Remapping . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 222
3.14 Afne Transformations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 228
3.15 Histogram Equalization . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 234
3.16 Histogram Calculation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 240
3.17 Histogram Comparison . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 247
3.18 Back Projection . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 252
3.19 Template Matching . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 258
3.20 Finding contours in your image . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 266
3.21 Convex Hull . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 267
3.22 Creating Bounding boxes and circles for contours . . . . . . . . . . . . . . . . . . . . . . . . . . . . 269
3.23 Creating Bounding rotated boxes and ellipses for contours . . . . . . . . . . . . . . . . . . . . . . . 272
3.24 Image Moments . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 274
3.25 Point Polygon Test . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 276
4 highgui module. High Level GUI and Media 279
4.1 Adding a Trackbar to our applications! . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 280
4.2 Video Input with OpenCV and similarity measurement . . . . . . . . . . . . . . . . . . . . . . . . . 283
4.3 Creating a video with OpenCV . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 291
5 calib3d module. Camera calibration and 3D reconstruction 297
5.1 Camera calibration with square chessboard . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 298
5.2 Camera calibration With OpenCV . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 298
6 feature2d module. 2D Features framework 309
6.1 Feature Description . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 311
6.2 Harris corner detector . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 313
6.3 Feature Matching with FLANN . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 317
6.4 Features2D + Homography to nd a known object . . . . . . . . . . . . . . . . . . . . . . . . . . . 320
6.5 Shi-Tomasi corner detector . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 323
6.6 Creating yor own corner detector . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 325
6.7 Detecting corners location in subpixeles . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 329
6.8 Feature Detection . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 332
6.9 Feature Matching with FLANN . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 334
6.10 Features2D + Homography to nd a known object . . . . . . . . . . . . . . . . . . . . . . . . . . . 337
6.11 Detection of planar objects . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 340
7 video module. Video analysis 343
8 objdetect module. Object Detection 345
8.1 Cascade Classier . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 346
9 ml module. Machine Learning 351
9.1 Introduction to Support Vector Machines . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 352
9.2 Support Vector Machines for Non-Linearly Separable Data . . . . . . . . . . . . . . . . . . . . . . . 357
10 gpu module. GPU-Accelerated Computer Vision 365
10.1 Similarity check (PNSR and SSIM) on the GPU . . . . . . . . . . . . . . . . . . . . . . . . . . . . 366
11 contrib module. The additional contributions made available ! 375
11.1 Discovering the human retina and its use for image processing . . . . . . . . . . . . . . . . . . . . . 376
12 OpenCV iOS 389
12.1 OpenCV iOS Hello . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 390
12.2 OpenCV iOS - Image Processing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 394
12.3 OpenCV iOS - Video Processing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 396
ii
13 General tutorials 403
iii
iv
The OpenCV Tutorials, Release 2.4.3
The following links describe a set of basic OpenCV tutorials. All the source code mentioned here is provide as part
of the OpenCV regular releases, so check before you start copy & pasting the code. The list of tutorials below is
automatically generated from reST les located in our GIT repository.
As always, we would be happy to hear your comments and receive your contributions on any tutorial.
Introduction to OpenCV
You will learn how to setup OpenCV on your computer!
core module. The Core Functionality
Here you will learn the about the basic building blocks of the library. A
must read and know for understanding how to manipulate the images on a
pixel level.
imgproc module. Image Processing
In this section you will learn about the image processing (manipulation)
functions inside OpenCV.
highgui module. High Level GUI and Media
This section contains valuable tutorials about how to read/save your im-
age/video les and how to use the built-in graphical user interface of the
library.
calib3d module. Camera calibration and 3D reconstruction
Although we got most of our images in a 2D format they do come from a 3D
world. Here you will learn how to nd out from the 2D images information
about the 3D world.
feature2d module. 2D Features framework
CONTENTS 1
The OpenCV Tutorials, Release 2.4.3
Learn about how to use the feature points detectors, descriptors and match-
ing framework found inside OpenCV.
video module. Video analysis
Look here in order to nd use on your video stream algoritms like: motion
extraction, feature tracking and foreground extractions.
objdetect module. Object Detection
Ever wondered how your digital camera detects peoples and faces? Look
here to nd out!
ml module. Machine Learning
Use the powerfull machine learning classes for statistical classication, re-
gression and clustering of data.
gpu module. GPU-Accelerated Computer Vision
Squeeze out every little computation power from your system by using the
power of your video card to run the OpenCV algorithms.
contrib module. The additional contributions made available !
Discover additional contribution to OpenCV.
OpenCV iOS
2 CONTENTS
The OpenCV Tutorials, Release 2.4.3
Run OpenCV and your vision apps on an iDevice
General tutorials
These tutorials are the bottom of the iceberg as they link together multiple
of the modules presented above in order to solve complex problems.
CONTENTS 3
The OpenCV Tutorials, Release 2.4.3
4 CONTENTS
CHAPTER
ONE
INTRODUCTION TO OPENCV
Here you can read tutorials about how to set up your computer to work with the OpenCV library. Additionaly you can
nd a few very basic sample source code that will let introduce you to the world of the OpenCV.
Linux
Title: Installation in Linux
Compatibility: > OpenCV 2.0
Author: Ana Huamn
We will learn how to setup OpenCV in your computer!
Title: Using OpenCV with gcc and CMake
Compatibility: > OpenCV 2.0
Author: Ana Huamn
We will learn how to compile your rst project using gcc and CMake
Title: Using OpenCV with Eclipse (plugin CDT)
Compatibility: > OpenCV 2.0
Author: Ana Huamn
We will learn how to compile your rst project using the Eclipse environ-
ment
Windows
Title: Installation in Windows
Compatibility: > OpenCV 2.0
Author: Bernt Gbor
You will learn how to setup OpenCV in your Windows Operating System!
5
The OpenCV Tutorials, Release 2.4.3
Title: How to build applications with OpenCV inside the Microsoft Visual
Studio
Compatibility: > OpenCV 2.0
Author: Bernt Gbor
You will learn what steps you need to perform in order to use the OpenCV
library inside a new Microsoft Visual Studio project.
Android
Title: Introduction into Android Development
Compatibility: > OpenCV 2.4.2
Author: Vsevolod Glumov
Not a tutorial, but a guide introducing Android development basics and
environment setup
Title: OpenCV4Android SDK
Compatibility: > OpenCV 2.4.2
Author: Vsevolod Glumov
OpenCV4Android SDK: general info, installation, running samples
Title: Android development with OpenCV
Compatibility: > OpenCV 2.4.2
Author: Vsevolod Glumov
Development with OpenCV4Android SDK
iOS
Title: Installation in iOS
Compatibility: > OpenCV 2.4.2
Author: Artem Myagkov, Eduard Feicho
We will learn how to setup OpenCV for using it in iOS!
6 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
Title: Load and Display an Image
Compatibility: > OpenCV 2.0
Author: Ana Huamn
We will learn how to display an image using OpenCV
Title: Load, Modify, and Save an Image
Compatibility: > OpenCV 2.0
Author: Ana Huamn
We will learn how to save an Image in OpenCV...plus a small conversion to
grayscale
Want to contribute, and see your own work between the OpenCV tutorials?
Title: How to write a tutorial for OpenCV?
Compatibility: > OpenCV 1.0
Author: Bernt Gbor
If you already have a good grasp on using OpenCV and have made some
projects that would be perfect presenting an OpenCV feature not yet part of
these tutorials, here it is what you need to know.
7
The OpenCV Tutorials, Release 2.4.3
1.1 Installation in Linux
These steps have been tested for Ubuntu 10.04 but should work with other distros as well.
Required Packages
GCC 4.4.x or later. This can be installed with:
sudo apt-get install build-essential
CMake 2.6 or higher;
Git;
GTK+2.x or higher, including headers (libgtk2.0-dev);
pkgcong;
Python 2.6 or later and Numpy 1.5 or later with developer packages (python-dev, python-numpy);
ffmpeg or libav development packages: libavcodec-dev, libavformat-dev, libswscale-dev;
[optional] libdc1394 2.x;
[optional] libjpeg-dev, libpng-dev, libtiff-dev, libjasper-dev.
All the libraries above can be installed via Terminal or by using Synaptic Manager.
Getting OpenCV Source Code
You can use the latest stable OpenCV version available in sourceforge or you can grab the latest snapshot from our Git
repository.
Getting the Latest Stable OpenCV Version
Go to our page on Sourceforge;
Download the source tarball and unpack it.
Getting the Cutting-edge OpenCV from the Git Repository
Launch Git client and clone OpenCV repository
In Linux it can be achieved with the following command in Terminal:
cd ~/<my
_
working
_
directory>
git clone https://github.com/Itseez/opencv.git
Building OpenCV from Source Using CMake, Using the Command Line
1. Create a temporary directory, which we denote as <cmake_binary_dir>, where you want to put the generated
Makeles, project les as well the object les and output binaries.
2. Enter the <cmake_binary_dir> and type
8 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
cmake [<some optional parameters>] <path to the OpenCV source directory>
For example
cd ~/opencv
mkdir release
cd release
cmake -D CMAKE
_
BUILD
_
TYPE=RELEASE -D CMAKE
_
INSTALL
_
PREFIX=/usr/local ..
3. Enter the created temporary directory (<cmake_binary_dir>) and proceed with:
make
sudo make install
Note: If the size of the created library is a critical issue (like in case of an Android build) you can use the
install/strip command to get the smallest size as possible. The stripped version appears to be twice as small.
However, we do not recommend using this unless those extra megabytes do really matter.
1.2 Using OpenCV with gcc and CMake
Note: We assume that you have successfully installed OpenCV in your workstation.
The easiest way of using OpenCV in your code is to use CMake. A few advantages (taken from the Wiki):
1. No need to change anything when porting between Linux and Windows
2. Can easily be combined with other tools by CMake( i.e. Qt, ITK and VTK )
If you are not familiar with CMake, checkout the tutorial on its website.
Steps
Create a program using OpenCV
Lets use a simple program such as DisplayImage.cpp shown below.
#include <cv.h>
#include <highgui.h>
using namespace cv;
int main( int argc, char
**
argv )
{
Mat image;
image = imread( argv[1], 1 );
if( argc != 2 || !image.data )
{
printf( "No image data \n" );
return -1;
}
namedWindow( "Display Image", CV
_
WINDOW
_
AUTOSIZE );
1.2. Using OpenCV with gcc and CMake 9
The OpenCV Tutorials, Release 2.4.3
imshow( "Display Image", image );
waitKey(0);
return 0;
}
Create a CMake le
Now you have to create your CMakeLists.txt le. It should look like this:
project( DisplayImage )
find
_
package( OpenCV REQUIRED )
add
_
executable( DisplayImage DisplayImage )
target
_
link
_
libraries( DisplayImage ${OpenCV
_
LIBS} )
Generate the executable
This part is easy, just proceed as with any other project using CMake:
cd <DisplayImage
_
directory>
cmake .
make
Result
By now you should have an executable (called DisplayImage in this case). You just have to run it giving an image
location as an argument, i.e.:
./DisplayImage lena.jpg
You should get a nice window as the one shown below:
10 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
1.3 Using OpenCV with Eclipse (plugin CDT)
Note: Two ways, one by forming a project directly, and another by CMake
Prerequisites
1. Having installed Eclipse in your workstation (only the CDT plugin for C/C++ is needed). You can follow the
following steps:
Go to the Eclipse site
Download Eclipse IDE for C/C++ Developers . Choose the link according to your workstation.
2. Having installed OpenCV. If not yet, go here.
Making a project
1. Start Eclipse. Just run the executable that comes in the folder.
2. Go to File -> New -> C/C++ Project
3. Choose a name for your project (i.e. DisplayImage). An Empty Project should be okay for this example.
4. Leave everything else by default. Press Finish.
5. Your project (in this case DisplayImage) should appear in the Project Navigator (usually at the left side of your
window).
1.3. Using OpenCV with Eclipse (plugin CDT) 11
The OpenCV Tutorials, Release 2.4.3
6. Now, lets add a source le using OpenCV:
Right click on DisplayImage (in the Navigator). New -> Folder .
Name your folder src and then hit Finish
Right click on your newly created src folder. Choose New source le:
Call it DisplayImage.cpp. Hit Finish
7. So, now you have a project with a empty .cpp le. Lets ll it with some sample code (in other words, copy and
paste the snippet below):
12 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
#include <cv.h>
#include <highgui.h>
using namespace cv;
int main( int argc, char
**
argv )
{
Mat image;
image = imread( argv[1], 1 );
if( argc != 2 || !image.data )
{
printf( "No image data \n" );
return -1;
}
namedWindow( "Display Image", CV
_
WINDOW
_
AUTOSIZE );
imshow( "Display Image", image );
waitKey(0);
return 0;
}
8. We are only missing one nal step: To tell OpenCV where the OpenCV headers and libraries are. For this, do
the following:
Go to Project>Properties
In C/C++ Build, click on Settings. At the right, choose the Tool Settings Tab. Here we will enter the
headers and libraries info:
(a) In GCC C++ Compiler, go to Includes. In Include paths(-l) you should include the path of the
folder where opencv was installed. In our example, this is /usr/local/include/opencv.
Note: If you do not know where your opencv les are, open the Terminal and type:
pkg-config --cflags opencv
For instance, that command gave me this output:
-I/usr/local/include/opencv -I/usr/local/include
(b) Now go to GCC C++ Linker,there you have to ll two spaces:
First in Library search path (-L) you have to write the path to where the opencv libraries reside, in
my case the path is:
/usr/local/lib
Then in Libraries(-l) add the OpenCV libraries that you may need. Usually just the 3 rst on the list
below are enough (for simple applications) . In my case, I am putting all of them since I plan to use
the whole bunch:
1.3. Using OpenCV with Eclipse (plugin CDT) 13
The OpenCV Tutorials, Release 2.4.3
opencv_core opencv_imgproc opencv_highgui opencv_ml opencv_video opencv_features2d
opencv_calib3d opencv_objdetect opencv_contrib opencv_legacy opencv_ann
If you dont know where your libraries are (or you are just psychotic and want to make sure the path
is ne), type in Terminal:
pkg-config --libs opencv
My output (in case you want to check) was: .. code-block:: bash
-L/usr/local/lib -lopencv_core -lopencv_imgproc -lopencv_highgui -lopencv_ml -
lopencv_video -lopencv_features2d -lopencv_calib3d -lopencv_objdetect -lopencv_contrib
-lopencv_legacy -lopencv_ann
Now you are done. Click OK
Your project should be ready to be built. For this, go to Project->Build all
In the Console you should get something like
If you check in your folder, there should be an executable there.
14 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
Running the executable
So, now we have an executable ready to run. If we were to use the Terminal, we would probably do something like:
cd <DisplayImage
_
directory>
cd src
./DisplayImage ../images/HappyLittleFish.png
Assuming that the image to use as the argument would be located in <DisplayIm-
age_directory>/images/HappyLittleFish.png. We can still do this, but lets do it from Eclipse:
1. Go to Run->Run Congurations
2. Under C/C++ Application you will see the name of your executable + Debug (if not, click over C/C++ Applica-
tion a couple of times). Select the name (in this case DisplayImage Debug).
3. Now, in the right side of the window, choose the Arguments Tab. Write the path of the image le we want to
open (path relative to the workspace/DisplayImage folder). Lets use HappyLittleFish.png:
4. Click on the Apply button and then in Run. An OpenCV window should pop up with the sh image (or whatever
you used).
5. Congratulations! You are ready to have fun with OpenCV using Eclipse.
V2: Using CMake+OpenCV with Eclipse (plugin CDT)
(See the getting started <http://opencv.willowgarage.com/wiki/Getting_started> section of the OpenCV Wiki)
Say you have or create a new le, helloworld.cpp in a directory called foo:
#include <cv.h>
#include <highgui.h>
int main ( int argc, char
**
argv )
{
cvNamedWindow( "My Window", 1 );
IplImage
*
img = cvCreateImage( cvSize( 640, 480 ), IPL
_
DEPTH
_
8U, 1 );
CvFont font;
1.3. Using OpenCV with Eclipse (plugin CDT) 15
The OpenCV Tutorials, Release 2.4.3
double hScale = 1.0;
double vScale = 1.0;
int lineWidth = 1;
cvInitFont( &font, CV
_
FONT
_
HERSHEY
_
SIMPLEX | CV
_
FONT
_
ITALIC,
hScale, vScale, 0, lineWidth );
cvPutText( img, "Hello World!", cvPoint( 200, 400 ), &font,
cvScalar( 255, 255, 0 ) );
cvShowImage( "My Window", img );
cvWaitKey();
return 0;
}
1. Create a build directory, say, under foo: mkdir /build. Then cd build.
2. Put a CmakeLists.txt le in build:
PROJECT( helloworld
_
proj )
FIND
_
PACKAGE( OpenCV REQUIRED )
ADD
_
EXECUTABLE( helloworld helloworld.cxx )
TARGET
_
LINK
_
LIBRARIES( helloworld ${OpenCV
_
LIBS} )
1. Run: cmake-gui .. and make sure you ll in where opencv was built.
2. Then click configure and then generate. If its OK, quit cmake-gui
3. Run make -j4 (the -j4 is optional, it just tells the compiler to build in 4 threads). Make sure it builds.
4. Start eclipse . Put the workspace in some directory but not in foo or foo\\build
5. Right click in the Project Explorer section. Select Import And then open the C/C++ lter. Choose Existing
Code as a Makele Project
6. Name your project, say helloworld. Browse to the Existing Code location foo\\build (where you ran your
cmake-gui from). Select Linux GCC in the Toolchain for Indexer Settings and press Finish.
7. Right click in the Project Explorer section. Select Properties. Under C/C++ Build, set the build direc-
tory: from something like ${workspace
_
loc:/helloworld} to ${workspace
_
loc:/helloworld}/build
since thats where you are building to.
1. You can also optionally modify the Build command: from make to something like make VERBOSE=1 -j4
which tells the compiler to produce detailed symbol les for debugging and also to compile in 4 parallel threads.
1. Done!
1.4 Installation in Windows
The description here was tested on Windows 7 SP1. Nevertheless, it should also work on any other relatively modern
version of Windows OS. If you encounter errors after following the steps described below, feel free to contact us via
our OpenCV Q&A forum. Well do our best to help you out.
Note: To use the OpenCV library you have two options: Installation by Using the Pre-built Libraries or Installation
by Making Your Own Libraries from the Source Files. While the rst one is easier to complete, it only works if you
are coding with the latest Microsoft Visual Studio IDE and doesnt take advantage of the most advanced technologies
we integrate into our library.
16 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
Installation by Using the Pre-built Libraries
1. Launch a web browser of choice and go to our page on Sourceforge.
2. Choose a build you want to use and download it.
3. Make sure you have admin rights. Start the setup and follow the wizard.
4. While adding the OpenCV library to the system path is a good decision for a better control, we will do it
manually for the sake of this tutorial. Make sure you do not set this option.
5. Most of the time it is a good idea to install the source les too, as this will allow for you to debug into the
OpenCV library, if it is necessary. Follow the default settings of the wizard and nish the installation.
6. You can check the installation at the chosen path as you can see below.
7. To nalize the installation go to the Set the OpenCV enviroment variable and add it to the systems path section.
Installation by Making Your Own Libraries from the Source Files
You may nd the content of this tutorial also inside the following videos: Part 1 and Part 2, hosted on YouTube.
Warning: These videos above are long-obsolete and contain inaccurate information. Be careful, since solutions
described in those videos are no longer supported and may even break your install.
If you are building your own libraries you can take the source les from our Git repository.
Building the OpenCV library from scratch requires a couple of tools installed beforehand:
An IDE of choice (preferably), or just a CC++ compiler that will actually make the binary les. Here we will
use the Microsoft Visual Studio. However, you can use any other IDE that has a valid CC++ compiler.
CMake, which is a neat tool to make the project les (for your choosen IDE) from the OpenCV source les. It
will also allow an easy conguration of the OpenCV build les, in order to make binary les that ts exactly to
your needs.
Git to acquire the OpenCV source les. A good tool for this is TortoiseGit. Alternatively, you can just download
an archived version of the source les from our page on Sourceforge
OpenCV may come in multiple avors. There is a core section that will work on its own. Nevertheless, there is a
couple of tools, libraries made by 3rd parties that offer services of which the OpenCV may take advantage. These will
1.4. Installation in Windows 17
The OpenCV Tutorials, Release 2.4.3
improve its capabilities in many ways. In order to use any of them, you need to download and install them on your
system.
The Python libraries are required to build the Python interface of OpenCV. For now use the version 2.7.x. This
is also a must if you want to build the OpenCV documentation.
Numpy is a scientic computing package for Python. Required for the Python interface.
Intel Threading Building Blocks (TBB) is used inside OpenCV for parallel code snippets. Using this will
make sure that the OpenCV library will take advantage of all the cores you have in your systems CPU.
Intel Integrated Performance Primitives (IPP) may be used to improve the performance of color conversion,
Haar training and DFT functions of the OpenCV library. Watch out, since this isnt a free service.
OpenCV offers a somewhat fancier and more useful graphical user interface, than the default one by using the
Qt framework. For a quick overview of what this has to offer look into the documentations highgui module,
under the Qt New Functions section. Version 4.6 or later of the framework is required.
Eigen is a C++ template library for linear algebra.
The latest CUDA Toolkit will allow you to use the power lying inside your GPU. This will drastically improve
performance for some algorithms (e.g the HOG descriptor). Getting more and more of our algorithms to work
on the GPUs is a constant effort of the OpenCV team.
OpenEXR source les are required for the library to work with this high dynamic range (HDR) image le
format.
The OpenNI Framework contains a set of open source APIs that provide support for natural interaction with
devices via methods such as voice command recognition, hand gestures and body motion tracking.
Miktex is the best TEX implementation on the Windows OS. It is required to build the OpenCV documentation.
Sphinx is a python documentation generator and is the tool that will actually create the OpenCV documentation.
This on its own requires a couple of tools installed, We will cover this in depth at the How to Install Sphinx
section.
Now we will describe the steps to follow for a full build (using all the above frameworks, tools and libraries). If you
do not need the support for some of these you can just freely skip this section.
Building the library
1. Make sure you have a working IDE with a valid compiler. In case of the Microsoft Visual Studio just install it
and make sure it starts up.
2. Install CMake. Simply follow the wizard, no need to add it to the path. The default install options are OK.
3. Download and install an up-to-date version of msysgit from its ofcial site. There is also the portable version,
which you need only to unpack to get access to the console version of Git. Supposing that for some of us it
could be quite enough.
4. Install TortoiseGit. Choose the 32 or 64 bit version according to the type of OS you work in. While installing,
locate your msysgit (if it doesnt do that automatically). Follow the wizard the default options are OK for the
most part.
5. Choose a directory in your le system, where you will download the OpenCV libraries to. I recommend creating
a new one that has short path and no special charachters in it, for example D:/OpenCV. For this tutorial Ill
suggest you do so. If you use your own path and know, what youre doing its OK.
(a) Clone the repository to the selected directory. After clicking Clone button, a windowwill appear where you
can select from what repository you want to download source les (https://github.com/Itseez/opencv.git)
and to what directory (D:/OpenCV).
18 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
(b) Push the OK button and be patient as the repository is quite a heavy download. It will take some time
depending on your Internet connection.
6. In this section I will cover installing the 3rd party libraries.
(a) Download the Python libraries and install it with the default options. You will need a couple other python
extensions. Luckily installing all these may be automated by a nice tool called Setuptools. Download and
install again.
(b) Installing Sphinx is easy once you have installed Setuptools. This contains a little application that will
automatically connect to the python databases and download the latest version of many python scripts.
Start up a command window (enter cmd into the windows start menu and press enter) and use the CD
command to navigate to your Python folders Script sub-folder. Here just pass to the easy_install.exe as
argument the name of the program you want to install. Add the sphinx argument.
Note: The CD navigation command works only inside a drive. For example if you are somewhere in
the C: drive you cannot use it this to go to another drive (like for example D:). To do so you rst need
to change drives letters. For this simply enter the command D:. Then you can use the CD to navigate to
specic folder inside the drive. Bonus tip: you can clear the screen by using the CLS command.
This will also install its prerequisites Jinja2 and Pygments.
(c) The easiest way to install Numpy is to just download its binaries from the sourceforga page. Make sure
your download and install exactly the binary for your python version (so for version 2.7).
(d) Download the Miktex and install it. Again just follow the wizard. At the fourth step make sure you select
for the Install missing packages on-the-y the Yes option, as you can see on the image below. Again this
will take quite some time so be patient.
1.4. Installation in Windows 19
The OpenCV Tutorials, Release 2.4.3
(e) For the Intel Threading Building Blocks (TBB) download the source les and extract it inside a directory
on your system. For example let there be D:/OpenCV/dep. For installing the Intel Integrated Perfor-
mance Primitives (IPP) the story is the same. For exctracting the archives I recommend using the 7-Zip
application.
(f) In case of the Eigen library it is again a case of download and extract to the D:/OpenCV/dep directory.
(g) Same as above with OpenEXR.
(h) For the OpenNI Framework you need to install both the development build and the PrimeSensor Module.
(i) For the CUDAyou need again two modules: the latest CUDAToolkit and the CUDA Tools SDK. Download
and install both of them with a complete option by using the 32 or 64 bit setups according to your OS.
(j) In case of the Qt framework you need to build yourself the binary les (unless you use the Microsoft Visual
Studio 2008 with 32 bit compiler). To do this go to the Qt Downloads page. Download the source les
(not the installers!!!):
20 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
Extract it into a nice and short named directory like D:/OpenCV/dep/qt/ . Then you need to build it. Start
up a Visual Studio Command Prompt (2010) by using the start menu search (or navigate through the start
menu All Programs Microsoft Visual Studio 2010 Visual Studio Tools Visual Studio Command
Prompt (2010)).
Now navigate to the extracted folder and enter inside it by using this console window. You should have a
folder containing les like Install, Make and so on. Use the dir command to list les inside your current
directory. Once arrived at this directory enter the following command:
configure.exe -release -no-webkit -no-phonon -no-phonon-backend -no-script -no-scripttools
-no-qt3support -no-multimedia -no-ltcg
Completing this will take around 10-20 minutes. Then enter the next command that will take a lot longer
(can easily take even more than a full hour):
nmake
After this set the Qt enviroment variables using the following command on Windows 7:
setx -m QTDIR D:/OpenCV/dep/qt/qt-everywhere-opensource-src-4.7.3
Also, add the built binary les path to the system path by using the Path Editor. In our case this is
D:/OpenCV/dep/qt/qt-everywhere-opensource-src-4.7.3/bin.
Note: If you plan on doing Qt application development you can also install at this point the Qt Visual
Studio Add-in. After this you can make and build Qt applications without using the Qt Creator. Everything
1.4. Installation in Windows 21
The OpenCV Tutorials, Release 2.4.3
is nicely integrated into Visual Studio.
7. Nowstart the CMake (cmake-gui). You may again enter it in the start menu search or get it fromthe All Programs
CMake 2.8 CMake (cmake-gui). First, select the directory for the source les of the OpenCV library (1).
Then, specify a directory where you will build the binary les for OpenCV (2).
Press the Congure button to specify the compiler (and IDE) you want to use. Note that in case you can choose
between different compilers for making either 64 bit or 32 bit libraries. Select the one you use in your application
development.
CMake will start out and based on your system variables will try to automatically locate as many packages as
possible. You can modify the packages to use for the build in the WITH WITH_X menu points (where X is
the package abbreviation). Here are a list of current packages you can turn on or off:
22 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
Select all the packages you want to use and press again the Congure button. For an easier overview of the
build options make sure the Grouped option under the binary directory selection is turned on. For some of the
packages CMake may not nd all of the required les or directories. In case of these CMake will throw an error
in its output window (located at the bottom of the GUI) and set its eld values, to not found constants. For
example:
For these you need to manually set the queried directories or les path. After this press again the Congure
button to see if the value entered by you was accepted or not. Do this until all entries are good and you cannot
see errors in the eld/value or the output part of the GUI. Now I want to emphasize an option that you will
denitely love: ENABLE ENABLE_SOLUTION_FOLDERS. OpenCV will create many-many projects and
turning this option will make sure that they are categorized inside directories in the Solution Explorer. It is a
must have feature, if you ask me.
Furthermore, you need to select what part of OpenCV you want to build.
BUILD_DOCS -> It creates two projects for building the documentation of OpenCV (there will be a
separate project for building the HTML and the PDF les). Note that these arent built together with the
solution. You need to make an explicit build project command on these to do so.
BUILD_EXAMPLES -> OpenCV comes with many example applications from which you may learn most
of the libraries capabilities. This will also come handy to easily try out if OpenCV is fully functional on
your computer.
BUILD_PACKAGE -> Prior to version 2.3 with this you could build a project that will build an OpenCV
installer. With this you can easily install your OpenCV avor on other systems. For the latest source les
of OpenCV it generates a new project that simply creates zip archive with OpenCV sources.
1.4. Installation in Windows 23
The OpenCV Tutorials, Release 2.4.3
BUILD_SHARED_LIBS -> With this you can control to build DLL les (when turned on) or static library
les (*.lib) otherwise.
BUILD_TESTS -> Each module of OpenCV has a test project assigned to it. Building these test projects is
also a good way to try out, that the modules work just as expected on your system too.
BUILD_PERF_TESTS -> There are also performance tests for many OpenCV functions. If youre con-
cerned about performance, build them and run.
BUILD_opencv_python -> Self-explanatory. Create the binaries to use OpenCV from the Python language.
Press again the Congure button and ensure no errors are reported. If this is the case you can tell CMake to
create the project les by pushing the Generate button. Go to the build directory and open the created OpenCV
solution. Depending on just how much of the above options you have selected the solution may contain quite a
lot of projects so be tolerant on the IDE at the startup. Now you need to build both the Release and the Debug
binaries. Use the drop-down menu on your IDE to change to another of these after building for one of them.
In the end you can observe the built binary les inside the bin directory:
For the documentation you need to explicitly issue the build commands on the doc project for the PDF les
and on the doc_html for the HTML ones. Each of these will call Sphinx to do all the hard work. You can nd
the generated documentation inside the Build/Doc/
_
html for the HTML pages and within the Build/Doc the
PDF manuals.
To collect the header and the binary les, that you will use during your own projects, into a separate directory
(simillary to how the pre-built binaries ship) you need to explicitely build the Install project.
This will create an install directory inside the Build one collecting all the built binaries into a single place. Use
this only after you built both the Release and Debug versions.
24 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
Note: To create an installer you need to install NSIS. Then just build the Package project to build the installer
into the Build/
_
CPack
_
Packages/win32/NSIS folder. You can then use this to distribute OpenCV with your
build settings on other systems.
To test your build just go into the Build/bin/Debug or Build/bin/Release directory and start a couple of
applications like the contours.exe. If they run, you are done. Otherwise, something denitely went awfully
wrong. In this case you should contact us via our user group. If everything is okay the contours.exe output
should resemble the following image (if built with Qt support):
Note: If you use the GPU module (CUDA libraries) make sure you also upgrade to the latest drivers of your
GPU. Error messages containing invalid entries in (or cannot nd) the nvcuda.dll are caused mostly by old video
card drivers. For testing the GPU (if built) run the performance_gpu.exe sample application.
1.4. Installation in Windows 25
The OpenCV Tutorials, Release 2.4.3
Set the OpenCV enviroment variable and add it to the systems path
First we set an enviroment variable to make easier our work. This will hold the install directory of our OpenCV library
that we use in our projects. Start up a command window and enter:
setx -m OPENCV
_
DIR D:\OpenCV\Build\Install
Here the directory is where you have your OpenCV binaries (installed or built). Inside this you should have folders
like bin and include. The -m should be added if you wish to make the settings computer wise, instead of user wise.
If you built static libraries then you are done. Otherwise, you need to add the bin folders path to the systems path.This
is cause you will use the OpenCV library in form of Dynamic-link libraries (also known as DLL). Inside these are
stored all the algorithms and information the OpenCV library contains. The operating system will load them only on
demand, during runtime. However, to do this he needs to know where they are. The systems PATH contains a list of
folders where DLLs can be found. Add the OpenCV library path to this and the OS will know where to look if he ever
needs the OpenCV binaries. Otherwise, you will need to copy the used DLLs right beside the applications executable
le (exe) for the OS to nd it, which is highly unpleasent if you work on many projects. To do this start up again the
Path Editor and add the following new entry (right click in the application to bring up the menu):
%OPENCV
_
DIR%\bin
Save it to the registry and you are done. If you ever change the location of your install directories or want to try out
your applicaton with a different build all you will need to do is to update the OPENCV_DIR variable via the setx
command inside a command window.
Now you can continue reading the tutorials with the How to build applications with OpenCV inside the Microsoft
Visual Studio section. There you will nd out how to use the OpenCV library in your own projects with the help of
the Microsoft Visual Studio IDE.
1.5 How to build applications with OpenCV inside the Microsoft Vi-
sual Studio
Everything I describe here will apply to the C\C++ interface of OpenCV. I start out from the assumption that you have
read and completed with success the Installation in Windows tutorial. Therefore, before you go any further make sure
you have an OpenCV directory that contains the OpenCV header les plus binaries and you have set the environment
variables as described here.
26 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
The OpenCV libraries, distributed by us, on the Microsoft Windows operating system are in a Dynamic Linked
Libraries (DLL). These have the advantage that all the content of the library are loaded only at runtime, on demand,
and that countless programs may use the same library le. This means that if you have ten applications using the
OpenCV library, no need to have around a version for each one of them. Of course you need to have the dll of the
OpenCV on all systems where you want to run your application.
Another approach is to use static libraries that have lib extensions. You may build these by using our source les as
described in the Installation in Windows tutorial. When you use this the library will be built-in inside your exe le. So
there is no chance that the user deletes them, for some reason. As a drawback your application will be larger one and
as, it will take more time to load it during its startup.
To build an application with OpenCV you need to do two things:
Tell to the compiler how the OpenCV library looks. You do this by showing it the header les.
Tell to the linker from where to get the functions or data structures of OpenCV, when they are needed.
If you use the lib system you must set the path where the library les are and specify in which one of them to
look. During the build the linker will look into these libraries and add the denitions and implementation of all
used functions and data structures to the executable le.
If you use the DLL system you must again specify all this, however now for a different reason. This is a
Microsoft OS specic stuff. It seems that the linker needs to know that where in the DLL to search for the data
structure or function at the runtime. This information is stored inside lib les. Nevertheless, they arent static
libraries. They are so called import libraries. This is why when you make some DLLs in Windows you will also
end up with some lib extension libraries. The good part is that at runtime only the DLL is required.
To pass on all this information to the Visual Studio IDE you can either do it globally (so all your future projects will
get these information) or locally (so only for you current project). The advantage of the global one is that you only
need to do it once; however, it may be undesirable to clump all your projects all the time with all these information. In
case of the global one how you do it depends on the Microsoft Visual Studio you use. There is a 2008 and previous
versions and a 2010 way of doing it. Inside the global section of this tutorial Ill show what the main differences are.
The base item of a project in Visual Studio is a solution. A solution may contain multiple projects. Projects are the
building blocks of an application. Every project will realize something and you will have a main project in which you
can put together this project puzzle. In case of the many simple applications (like many of the tutorials will be) you
do not need to break down the application into modules. In these cases your main project will be the only existing
one. Now go create a new solution inside Visual studio by going through the File New Project menu selection.
Choose Win32 Console Application as type. Enter its name and select the path where to create it. Then in the upcoming
dialog make sure you create an empty project.
1.5. How to build applications with OpenCV inside the Microsoft Visual Studio 27
The OpenCV Tutorials, Release 2.4.3
The local method
Every project is built separately from the others. Due to this every project has its own rule package. Inside this rule
packages are stored all the information the IDE needs to know to build your project. For any application there are
at least two build modes: a Release and a Debug one. The Debug has many features that exist so you can nd and
resolve easier bugs inside your application. In contrast the Release is an optimized version, where the goal is to make
the application run as fast as possible or to be as small as possible. You may gure that these modes also require
different rules to use during build. Therefore, there exist different rule packages for each of your build modes. These
rule packages are called inside the IDE as project properties and you can view and modify them by using the Property
Manger. You can bring up this with View Property Pages. Expand it and you can see the existing rule packages
(called Proporty Sheets).
The really useful stuff of these is that you may create a rule package once and you can later just add it to your new
projects. Create it once and reuse it later. We want to create a new Property Sheet that will contain all the rules that
the compiler and linker needs to know. Of course we will need a separate one for the Debug and the Release Builds.
Start up with the Debug one as shown in the image below:
28 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
Use for example the OpenCV_Debug name. Then by selecting the sheet Right Click Properties. In the following I
will show to set the OpenCV rules locally, as I nd unnecessary to pollute projects with custom rules that I do not use
it. Go the C++ groups General entry and under the Additional Include Directories add the path to your OpenCV
include. If you dont have C/C++ group, you should add any .c/.cpp le to the project.
$(OPENCV
_
DIR)\include
When adding third party libraries settings it is generally a good idea to use the power behind the environment variables.
The full location of the OpenCV library may change on each system. Moreover, you may even end up yourself with
moving the install directory for some reason. If you would give explicit paths inside your property sheet your project
will end up not working when you pass it further to someone else who has a different OpenCV install path. Moreover,
xing this would require to manually modifying every explicit path. A more elegant solution is to use the environment
variables. Anything that you put inside a parenthesis started with a dollar sign will be replaced at runtime with the
current environment variables value. Here comes in play the environment variable setting we already made in our
previous tutorial.
Next go to the Linker General and under the Additional Library Directories add the libs directory:
$(OPENCV
_
DIR)\libs
Then you need to specify the libraries in which the linker should look into. To do this go to the Linker Input and
under the Additional Dependencies entry add the name of all modules which you want to use:
1.5. How to build applications with OpenCV inside the Microsoft Visual Studio 29
The OpenCV Tutorials, Release 2.4.3
The names of the libraries are as follow:
opencv
_
(The Name of the module)(The version Number of the library you use)d.lib
A full list, for the currently latest trunk version would contain:
opencv
_
core231d.lib
opencv
_
imgproc231d.lib
opencv
_
highgui231d.lib
opencv
_
ml231d.lib
opencv
_
video231d.lib
opencv
_
features2d231d.lib
opencv
_
calib3d231d.lib
opencv
_
objdetect231d.lib
opencv
_
contrib231d.lib
opencv
_
legacy231d.lib
opencv
_
flann231d.lib
The letter d at the end just indicates that these are the libraries required for the debug. Now click ok to save and do the
same with a new property inside the Release rule section. Make sure to omit the d letters from the library names and
to save the property sheets with the save icon above them.
You can nd your property sheets inside your projects directory. At this point it is a wise decision to back them up
into some special directory, to always have them at hand in the future, whenever you create an OpenCV project. Note
that for Visual Studio 2010 the le extension is props, while for 2008 this is vsprops.
30 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
Next time when you make a new OpenCV project just use the Add Existing Property Sheet... menu entry inside the
Property Manager to easily add the OpenCV build rules.
The global method
In case you nd to troublesome to add the property pages to each and every one of your projects you can also add this
rules to a global property page. However, this applies only to the additional include and library directories. The
name of the libraries to use you still need to specify manually by using for instance: a Property page.
In Visual Studio 2008 you can nd this under the: Tools Options Projects and Solutions VC++ Directories.
In Visual Studio 2010 this has been moved to a global property sheet which is automatically added to every project
you create:
The process is the same as described in case of the local approach. Just add the include directories by using the
environment variable OPENCV_DIR.
1.5. How to build applications with OpenCV inside the Microsoft Visual Studio 31
The OpenCV Tutorials, Release 2.4.3
Test it!
Now to try this out download our little test source code or get it from the sample code folder of the OpenCV sources.
Add this to your project and build it. Heres its content:
1 // Video Image PSNR and SSIM
2 #include <iostream> // for standard I/O
3 #include <string> // for strings
4 #include <iomanip> // for controlling float print precision
5 #include <sstream> // string to number conversion
6
7 #include <opencv2/imgproc/imgproc.hpp> // Gaussian Blur
8 #include <opencv2/core/core.hpp> // Basic OpenCV structures (cv::Mat, Scalar)
9 #include <opencv2/highgui/highgui.hpp> // OpenCV window I/O
10
11 using namespace std;
12 using namespace cv;
13
14 double getPSNR ( const Mat& I1, const Mat& I2);
15 Scalar getMSSIM( const Mat& I1, const Mat& I2);
16
17 void help()
18 {
19 cout
20 << "\n--------------------------------------------------------------------------" << endl
21 << "This program shows how to read a video file with OpenCV. In addition, it tests the"
22 << " similarity of two input videos first with PSNR, and for the frames below a PSNR " << endl
23 << "trigger value, also with MSSIM."<< endl
24 << "Usage:" << endl
25 << "./video-source referenceVideo useCaseTestVideo PSNR
_
Trigger
_
Value Wait
_
Between
_
Frames " << endl
26 << "--------------------------------------------------------------------------" << endl
27 << endl;
28 }
29 int main(int argc, char
*
argv[], char
*
window
_
name)
30 {
31 help();
32 if (argc != 5)
33 {
34 cout << "Not enough parameters" << endl;
35 return -1;
36 }
37 stringstream conv;
38
39 const string sourceReference = argv[1],sourceCompareWith = argv[2];
40 int psnrTriggerValue, delay;
41 conv << argv[3] << argv[4]; // put in the strings
42 conv >> psnrTriggerValue >> delay;// take out the numbers
43
44 char c;
45 int frameNum = -1; // Frame counter
46
47 VideoCapture captRefrnc(sourceReference),
48 captUndTst(sourceCompareWith);
49
50 if ( !captRefrnc.isOpened())
51 {
52 cout << "Could not open reference " << sourceReference << endl;
53 return -1;
54 }
32 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
55
56 if( !captUndTst.isOpened())
57 {
58 cout << "Could not open case test " << sourceCompareWith << endl;
59 return -1;
60 }
61
62 Size refS = Size((int) captRefrnc.get(CV
_
CAP
_
PROP
_
FRAME
_
WIDTH),
63 (int) captRefrnc.get(CV
_
CAP
_
PROP
_
FRAME
_
HEIGHT)),
64 uTSi = Size((int) captUndTst.get(CV
_
CAP
_
PROP
_
FRAME
_
WIDTH),
65 (int) captUndTst.get(CV
_
CAP
_
PROP
_
FRAME
_
HEIGHT));
66
67 if (refS != uTSi)
68 {
69 cout << "Inputs have different size!!! Closing." << endl;
70 return -1;
71 }
72
73 const char
*
WIN
_
UT = "Under Test";
74 const char
*
WIN
_
RF = "Reference";
75
76 // Windows
77 namedWindow(WIN
_
RF, CV
_
WINDOW
_
AUTOSIZE );
78 namedWindow(WIN
_
UT, CV
_
WINDOW
_
AUTOSIZE );
79 cvMoveWindow(WIN
_
RF, 400 , 0); //750, 2 (bernat =0)
80 cvMoveWindow(WIN
_
UT, refS.width, 0); //1500, 2
81
82 cout << "Frame resolution: Width=" << refS.width << " Height=" << refS.height
83 << " of nr#: " << captRefrnc.get(CV
_
CAP
_
PROP
_
FRAME
_
COUNT) << endl;
84
85 cout << "PSNR trigger value " <<
86 setiosflags(ios::fixed) << setprecision(3) << psnrTriggerValue << endl;
87
88 Mat frameReference, frameUnderTest;
89 double psnrV;
90 Scalar mssimV;
91
92 while( true) //Show the image captured in the window and repeat
93 {
94 captRefrnc >> frameReference;
95 captUndTst >> frameUnderTest;
96
97 if( frameReference.empty() || frameUnderTest.empty())
98 {
99 cout << " < < < Game over! > > > ";
100 break;
101 }
102
103 ++frameNum;
104 cout <<"Frame:" << frameNum;
105
106 ///////////////////////////////// PSNR ////////////////////////////////////////////////////
107 psnrV = getPSNR(frameReference,frameUnderTest); //get PSNR
108 cout << setiosflags(ios::fixed) << setprecision(3) << psnrV << "dB";
109
110 //////////////////////////////////// MSSIM /////////////////////////////////////////////////
111 if (psnrV < psnrTriggerValue)
112 {
1.5. How to build applications with OpenCV inside the Microsoft Visual Studio 33
The OpenCV Tutorials, Release 2.4.3
113 mssimV = getMSSIM(frameReference,frameUnderTest);
114
115 cout << " MSSIM: "
116 << "R" << setiosflags(ios::fixed) << setprecision(3) << mssimV.val[2]
*
100
117 << "G" << setiosflags(ios::fixed) << setprecision(3) << mssimV.val[1]
*
100
118 << "B" << setiosflags(ios::fixed) << setprecision(3) << mssimV.val[0]
*
100;
119 }
120
121 cout << endl;
122
123 ////////////////////////////////// Show Image /////////////////////////////////////////////
124 imshow( WIN
_
RF, frameReference);
125 imshow( WIN
_
UT, frameUnderTest);
126
127 c = cvWaitKey(delay);
128 if (c == 27) break;
129 }
130
131 return 0;
132 }
133
134 double getPSNR(const Mat& I1, const Mat& I2)
135 {
136 Mat s1;
137 absdiff(I1, I2, s1); // |I1 - I2|
138 s1.convertTo(s1, CV
_
32F); // cannot make a square on 8 bits
139 s1 = s1.mul(s1); // |I1 - I2|^2
140
141 Scalar s = sum(s1); // sum elements per channel
142
143 double sse = s.val[0] + s.val[1] + s.val[2]; // sum channels
144
145 if( sse <= 1e-10) // for small values return zero
146 return 0;
147 else
148 {
149 double mse =sse /(double)(I1.channels()
*
I1.total());
150 double psnr = 10.0
*
log10((255
*
255)/mse);
151 return psnr;
152 }
153 }
154
155 Scalar getMSSIM( const Mat& i1, const Mat& i2)
156 {
157 const double C1 = 6.5025, C2 = 58.5225;
158 /
*****************************
INITS
**********************************
/
159 int d = CV
_
32F;
160
161 Mat I1, I2;
162 i1.convertTo(I1, d); // cannot calculate on one byte large values
163 i2.convertTo(I2, d);
164
165 Mat I2
_
2 = I2.mul(I2); // I2^2
166 Mat I1
_
2 = I1.mul(I1); // I1^2
167 Mat I1
_
I2 = I1.mul(I2); // I1
*
I2
168
169 /
***************************
END INITS
**********************************
/
170
34 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
171 Mat mu1, mu2; // PRELIMINARY COMPUTING
172 GaussianBlur(I1, mu1, Size(11, 11), 1.5);
173 GaussianBlur(I2, mu2, Size(11, 11), 1.5);
174
175 Mat mu1
_
2 = mu1.mul(mu1);
176 Mat mu2
_
2 = mu2.mul(mu2);
177 Mat mu1
_
mu2 = mu1.mul(mu2);
178
179 Mat sigma1
_
2, sigma2
_
2, sigma12;
180
181 GaussianBlur(I1
_
2, sigma1
_
2, Size(11, 11), 1.5);
182 sigma1
_
2 -= mu1
_
2;
183
184 GaussianBlur(I2
_
2, sigma2
_
2, Size(11, 11), 1.5);
185 sigma2
_
2 -= mu2
_
2;
186
187 GaussianBlur(I1
_
I2, sigma12, Size(11, 11), 1.5);
188 sigma12 -= mu1
_
mu2;
189
190 ///////////////////////////////// FORMULA ////////////////////////////////
191 Mat t1, t2, t3;
192
193 t1 = 2
*
mu1
_
mu2 + C1;
194 t2 = 2
*
sigma12 + C2;
195 t3 = t1.mul(t2); // t3 = ((2
*
mu1
_
mu2 + C1).
*
(2
*
sigma12 + C2))
196
197 t1 = mu1
_
2 + mu2
_
2 + C1;
198 t2 = sigma1
_
2 + sigma2
_
2 + C2;
199 t1 = t1.mul(t2); // t1 =((mu1
_
2 + mu2
_
2 + C1).
*
(sigma1
_
2 + sigma2
_
2 + C2))
200
201 Mat ssim
_
map;
202 divide(t3, t1, ssim
_
map); // ssim
_
map = t3./t1;
203
204 Scalar mssim = mean( ssim
_
map ); // mssim = average of ssim map
205 return mssim;
206 }
You can start a Visual Studio build from two places. Either inside from the IDE (keyboard combination: Control-F5)
or by navigating to your build directory and start the application with a double click. The catch is that these two arent
the same. When you start it from the IDE its current working directory is the projects directory, while otherwise it is
the folder where the application le currently is (so usually your build directory). Moreover, in case of starting from
the IDE the console window will not close once nished. It will wait for a keystroke of yours.
This is important to remember when you code inside the code open and save commands. Youre resources will be
saved ( and queried for at opening!!!) relatively to your working directory. This is unless you give a full, explicit path
as parameter for the I/O functions. In the code above we open this OpenCV logo. Before starting up the application
make sure you place the image le in your current working directory. Modify the image le name inside the code to
try it out on other images too. Run it and voil:
1.5. How to build applications with OpenCV inside the Microsoft Visual Studio 35
The OpenCV Tutorials, Release 2.4.3
Command line arguments with Visual Studio
Throughout some of our future tutorials youll see that the programs main input method will be by giving a runtime
argument. To do this you can just start up a commmand windows (cmd + Enter in the start menu), navigate to your
executable le and start it with an argument. So for example in case of my upper project this would look like:
1 D:
2 CD OpenCV\MySolutionName\Release
3 MySolutionName.exe exampleImage.jpg
Here I rst changed my drive (if your project isnt on the OS local drive), navigated to my project and start it with an
example image argument. While under Linux system it is common to ddle around with the console window on the
Microsoft Windows many people come to use it almost never. Besides, adding the same argument again and again
while you are testing your application is, somewhat, a cumbersome task. Luckily, in the Visual Studio there is a menu
to automate all this:
Specify here the name of the inputs and while you start your application from the Visual Studio enviroment you have
automatic argument passing. In the next introductionary tutorial youll see an in-depth explanation of the upper source
code: Load and Display an Image.
1.6 Introduction into Android Development
This guide was designed to help you in learning Android development basics and seting up your working environment
quickly. It was written with Windows 7 in mind, though it would work with Linux (Ubuntu), Mac OS X and any other
OS supported by Android SDK.
If you encounter any error after thoroughly following these steps, feel free to contact us via OpenCV4Android discus-
sion group or OpenCV Q&A forum. Well do our best to help you out.
36 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
Preface
Android is a Linux-based, open source mobile operating system developed by Open Handset Alliance led by Google.
See the Android home site for general details.
Development for Android signicantly differs from development for other platforms. So before starting programming
for Android we recommend you make sure that you are familiar with the following key topis:
1. Java programming language that is the primary development technology for Android OS. Also, you can nd
Oracle docs on Java useful.
2. Java Native Interface (JNI) that is a technology of running native code in Java virtual machine. Also, you can
nd Oracle docs on JNI useful.
3. Android Activity and its lifecycle, that is an essential Android API class.
4. OpenCV development will certainly require some knowlege of the Android Camera specics.
Quick environment setup for Android development
If you are making a clean environment install, then you can try Tegra Android Development Pack (TADP) released by
NVIDIA.
Note: Starting the version 2.0 the TADP package includes OpenCV for Tegra SDK that is a regular
OpenCV4Android SDK extended with Tegra-specic stuff.
When unpacked, TADP will cover all of the environment setup automatically and you can skip the rest of the guide.
If you are a beginner in Android development then we also recommend you to start with TADP.
Note: NVIDIAs Tegra Android Development Pack includes some special features for NVIDIAs Tegra
platform but its use is not limited to Tegra devices only.
You need at least 1.6 Gb free disk space for the install.
TADP will download Android SDK platforms and Android NDK from Googles server, so Internet connection
is required for the installation.
TADP may ask you to ash your development kit at the end of installation process. Just skip this step if you
have no Tegra Development Kit.
(UNIX) TADP will ask you for root in the middle of installation, so you need to be a member of sudo group.
Manual environment setup for Android development
Development in Java
You need the following software to be installed in order to develop for Android in Java:
1. Sun JDK 6
Visit Java SE Downloads page and download an installer for your OS.
Here is a detailed JDK (Java Development Kit) installation guide for Ubuntu and Mac OS (only JDK sections
are applicable for OpenCV)
1.6. Introduction into Android Development 37
The OpenCV Tutorials, Release 2.4.3
Note: OpenJDK is not suitable for Android development, since Android SDK supports only Sun JDK. If you
use Ubuntu, after installation of Sun JDK you should run the following command to set Sun java environment:
sudo update-java-alternatives --set java-6-sun
2. Android SDK
Get the latest Android SDK from http://developer.android.com/sdk/index.html
Here is Googles install guide for the SDK.
Note: If you choose SDK packed into a Windows installer, then you should have 32-bit JRE installed. It is not
a prerequisite for Android development, but installer is a x86 application and requires 32-bit Java runtime.
Note:
If you are running x64 version of Ubuntu Linux, then you need ia32 shared libraries for use on amd64
and ia64 systems to be installed. You can install them with the following command:
sudo apt-get install ia32-libs
For Red Hat based systems the following command might be helpful:
sudo yum install libXtst.i386
3. Android SDK components
You need the following SDK components to be installed:
Android SDK Tools, revision 20 or newer.
Older revisions should also work, but they are not recommended.
SDK Platform Android 3.0 (API 11).
The minimal platform supported by OpenCV Java API is Android 2.2 (API 8). This is
also the minimum API Level required for the provided samples to run. See the <uses-sdk
android:minSdkVersion="8"/> tag in their AndroidManifest.xml les. But for successful compila-
tion the target platform should be set to Android 3.0 (API 11) or higher. It will not prevent them from
running on Android 2.2.
38 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
See Adding Platforms and Packages for help with installing/updating SDK components.
4. Eclipse IDE
Check the Android SDK System Requirements document for a list of Eclipse versions that are compatible with
the Android SDK. For OpenCV 2.4.x we recommend Eclipse 3.7 (Indigo) or later versions. They work well for
OpenCV under both Windows and Linux.
If you have no Eclipse installed, you can get it from the ofcial site.
5. ADT plugin for Eclipse
These instructions are copied from Android Developers site, check it out in case of any ADT-related problem.
Assuming that you have Eclipse IDE installed, as described above, follow these steps to download and install
the ADT plugin:
(a) Start Eclipse, then select Help Install New Software...
(b) Click Add (in the top-right corner).
(c) In the Add Repository dialog that appears, enter ADT Plugin for the Name and the following URL for
the Location:
https://dl-ssl.google.com/android/eclipse/
(d) Click OK
Note: If you have trouble acquiring the plugin, try using http in the Location URL, instead of https
(https is preferred for security reasons).
(e) In the Available Software dialog, select the checkbox next to Developer Tools and click Next.
1.6. Introduction into Android Development 39
The OpenCV Tutorials, Release 2.4.3
(f) In the next window, youll see a list of the tools to be downloaded. Click Next.
Note: If you also plan to develop native C++ code with Android NDK dont forget to enable NDK Plugins
installations as well.
(g) Read and accept the license agreements, then click Finish.
Note: If you get a security warning saying that the authenticity or validity of the software cant be
established, click OK.
(h) When the installation completes, restart Eclipse.
Native development in C++
You need the following software to be installed in order to develop for Android in C++:
40 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
1. Android NDK
To compile C++ code for Android platform you need Android Native Development Kit (NDK).
You can get the latest version of NDK from the download page. To install Android NDK just extract the archive
to some folder on your computer. Here are installation instructions.
Note: Before start you can read ofcial Android NDK documentation which is in the Android NDK archive,
in the folder docs/. The main article about using Android NDK build system is in the ANDROID-MK.html
le. Some additional information you can nd in the APPLICATION-MK.html, NDK-BUILD.html les, and
CPU-ARM-NEON.html, CPLUSPLUS-SUPPORT.html, PREBUILTS.html.
2. CDT plugin for Eclipse
There are several possible ways to integrate compilation of C++ code by Android NDK into Eclipse compilation
process. We recommend the approach based on Eclipse CDT (C/C++ Development Tooling) Builder. Make
sure your Eclipse IDE has the CDT plugin installed. Menu Help -> About Eclipse SDK -> Installation Details.
1.6. Introduction into Android Development 41
The OpenCV Tutorials, Release 2.4.3
Note: If youre using the latest ADT plugin for Eclipse (version 20 and above), most likely you already have
the CDT plugin and dont need to install it.
42 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
To install the CDT plugin use menu Help -> Install New Software..., then paste the CDT 8.0 repository URL
http://download.eclipse.org/tools/cdt/releases/indigo as shown in the picture below and click Add..., name it
CDT and click OK.
1.6. Introduction into Android Development 43
The OpenCV Tutorials, Release 2.4.3
CDT Main Features should be enough:
44 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
Thats it. Compilation of C++ code is fully integrated into Eclipse building process now.
Android application structure
Usually source code of an Android application has the following structure:
root folder of the project/
jni/
libs/
res/
src/
AndroidManifest.xml
project.properties
... other files ...
1.6. Introduction into Android Development 45
The OpenCV Tutorials, Release 2.4.3
Where:
the src folder contains Java code of the application,
the res folder contains resources of the application (images, xml les describing UI layout, etc),
the libs folder will contain native libraries after a successful build,
and the jni folder contains C/C++ application source code and NDKs build scripts Android.mk and
Application.mk producing the native libraries,
AndroidManifest.xml le presents essential information about application to the Android system (name of
the Application, name of main applications package, components of the application, required permissions, etc).
It can be created using Eclipse wizard or android tool from Android SDK.
project.properties is a text le containing information about target Android platform and other build details.
This le is generated by Eclipse or can be created with android tool included in Android SDK.
Note: Both AndroidManifest.xml and project.properties les are required to compile the C++ part of the
application, since Android NDK build system relies on them. If any of these les does not exist, compile the Java part
of the project before the C++ part.
Android.mk and Application.mk scripts
The script Android.mk usually has the following structure:
1 LOCAL
_
PATH := $(call my-dir)
2
3 include $(CLEAR
_
VARS)
4 LOCAL
_
MODULE := <module
_
name>
5 LOCAL
_
SRC
_
FILES := <list of .c and .cpp project files>
6 <some variable name> := <some variable value>
7 ...
8 <some variable name> := <some variable value>
9
10 include $(BUILD
_
SHARED
_
LIBRARY)
This is the minimal le Android.mk, which builds C++ source code of an Android application. Note that the rst two
lines and the last line are mandatory for any Android.mk.
Usually the le Application.mk is optional, but in case of project using OpenCV, when STL and exceptions are used
in C++, it also should be created. Example of the le Application.mk:
1 APP
_
STL := gnustl
_
static
2 APP
_
CPPFLAGS := -frtti -fexceptions
3 APP
_
ABI := armeabi-v7a
Building application native part from command line
Here is the standard way to compile C++ part of an Android application:
1. Open console and go to the root folder of an Android application
cd <root folder of the project>/
2. Run the following command
46 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
<path
_
where
_
NDK
_
is
_
placed>/ndk-build
Note: On Windows we recommend to use ndk-build.cmd in standard Windows console (cmd.exe) rather
than the similar bash script in Cygwin shell.
3. After executing this command the C++ part of the source code is compiled.
After that the Java part of the application can be (re)compiled (using either Eclipse or Ant build tool).
Note: Some parameters can be set for the ndk-build:
Example 1: Verbose compilation
<path
_
where
_
NDK
_
is
_
placed>/ndk-build V=1
Example 2: Rebuild all
<path
_
where
_
NDK
_
is
_
placed>/ndk-build -B
Building application native part from Eclipse (CDT Builder)
There are several possible ways to integrate compilation of native C++ code by Android NDK into Eclipse build
process. We recommend the approach based on Eclipse CDT Builder.
Important: OpenCV for Android package since version 2.4.2 contains sample projects pre-congured CDT Builders.
For your own projects follow the steps below.
1. Dene the NDKROOT environment variable containing the path to Android NDK in your system (e.g.
"X:\\Apps\\android-ndk-r8" or "/opt/android-ndk-r8").
On Windows an environment variable can be set via My Computer -> Properties -> Advanced -> Environment
variables and restarting Eclipse. On Windows 7 its also possible to use setx command in a console session.
On Linux and MacOS an environment variable can be set via appending a "export VAR
_
NAME=VAR
_
VALUE"
line to the "~/.bashrc" le and logging off and then on.
2. Open Eclipse and load the Android app project to congure.
3. Add C/C++ Nature to the project via Eclipse menu New -> Other -> C/C++ -> Convert to a C/C++ Project.
1.6. Introduction into Android Development 47
The OpenCV Tutorials, Release 2.4.3
And:
48 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
4. Select the project(s) to convert. Specify Project type = Makefile project, Toolchains = Other
Toolchain.
1.6. Introduction into Android Development 49
The OpenCV Tutorials, Release 2.4.3
5. Open Project Properties -> C/C++ Build, uncheck Use default build command, replace Build command
text from "make" to
"${NDKROOT}/ndk-build.cmd" on Windows,
"${NDKROOT}/ndk-build" on Linux and MacOS.
50 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
6. Go to Behaviour tab and change Workbench build type section like shown below:
7. Press OK and make sure the ndk-build is successfully invoked when building the project.
1.6. Introduction into Android Development 51
The OpenCV Tutorials, Release 2.4.3
8. If you open your C++ source le in Eclipse editor, youll see syntax error notications. They are not real errors,
but additional CDT conguring is required.
9. Open Project Properties -> C/C++ General -> Paths and Symbols and add the following Include paths for
C++:
${NDKROOT}/platforms/android-9/arch-arm/usr/include
${NDKROOT}/sources/cxx-stl/gnu-libstdc++/include
${NDKROOT}/sources/cxx-stl/gnu-libstdc++/libs/armeabi-v7a/include
${ProjDirPath}/../../sdk/native/jni/include
The last path should be changed to the correct absolute or relative path to OpenCV4Android SDK location.
This should clear the syntax error notications in Eclipse C++ editor.
52 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
Note: The latest Android NDK r8b uses different STL headers path. So if you use this NDK release add the
following Include paths list instead:
${NDKROOT}/platforms/android-9/arch-arm/usr/include
${NDKROOT}/sources/cxx-stl/gnu-libstdc++/4.6/include
${NDKROOT}/sources/cxx-stl/gnu-libstdc++/4.6/libs/armeabi-v7a/include
${ProjDirPath}/../../sdk/native/jni/include
Debugging and Testing
In this section we will give you some easy-to-follow instructions on how to set up an emulator or hardware device for
testing and debugging an Android project.
AVD
AVD (Android Virtual Device) is not probably the most convenient way to test an OpenCV-dependent application, but
sure the most uncomplicated one to congure.
1. Assuming you already have Android SDK and Eclipse IDE installed, in Eclipse go Window -> AVD Manager.
2. Press the New button in AVD Manager window.
3. Create new Android Virtual Device window will let you select some properties for your new device, like target
API level, size of SD-card and other.
1.6. Introduction into Android Development 53
The OpenCV Tutorials, Release 2.4.3
4. When you click the Create AVD button, your new AVD will be availible in AVD Manager.
5. Press Start to launch the device. Be aware that any AVD (a.k.a. Emulator) is usually much slower than a
hardware Android device, so it may take up to several minutes to start.
6. Go Run -> Run/Debug in Eclipse IDE to run your application in regular or debugging mode. Device Chooser
will let you choose among the running devices or to start a new one.
Hardware Device
If you have an Android device, you can use it to test and debug your applications. This way is more authentic, though a
little bit harder to set up. You need to make some actions for Windows and Linux operating systems to be able to work
with Android devices. No extra actions are required for Mac OS. See detailed information on conguring hardware
54 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
devices in subsections below.
You may also consult the ofcial Android Developers site instructions for more information.
Windows host computer
1. Enable USB debugging on the Android device (via Settings menu).
2. Attach the Android device to your PC with a USB cable.
3. Go to Start Menu and right-click on Computer. Select Manage in the context menu. You may be asked for
Administrative permissions.
4. Select Device Manager in the left pane and nd an unknown device in the list. You may try unplugging it and
then plugging back in order to check whether its your exact equipment appears in the list.
5. Try your luck installing Google USB drivers without any modications: right-click on the unknown device,
select Properties menu item > Details tab > Update Driver button.
1.6. Introduction into Android Development 55
The OpenCV Tutorials, Release 2.4.3
6. Select Browse computer for driver software.
56 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
7. Specify the path to <Android SDK folder>/extras/google/usb
_
driver/ folder.
1.6. Introduction into Android Development 57
The OpenCV Tutorials, Release 2.4.3
8. If you get the prompt to install unveried drivers and report about success - youve nished with USB driver
installation.
58 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
9. Otherwise (getting the failure like shown below) follow the next steps.
1.6. Introduction into Android Development 59
The OpenCV Tutorials, Release 2.4.3
10. Again right-click on the unknown device, select Properties > Details > Hardware Ids and copy the line like
USB\VID
_
XXXX&PID
_
XXXX&MI
_
XX.
60 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
11. Now open le <Android SDK folder>/extras/google/usb
_
driver/android
_
winusb.inf. Select either
Google.NTx86 or Google.NTamd64 section depending on your host system architecture.
1.6. Introduction into Android Development 61
The OpenCV Tutorials, Release 2.4.3
12. There should be a record like existing ones for your device and you need to add one manually.
62 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
13. Save the android
_
winusb.inf le and try to install the USB driver again.
1.6. Introduction into Android Development 63
The OpenCV Tutorials, Release 2.4.3
64 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
14. This time installation should go successfully.
1.6. Introduction into Android Development 65
The OpenCV Tutorials, Release 2.4.3
15. And an unknown device is now recognized as an Android phone.
66 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
16. Successful device USB connection can be veried in console via adb devices command.
17. Now, in Eclipse go Run -> Run/Debug to run your application in regular or debugging mode. Device Chooser
will let you choose among the devices.
1.6. Introduction into Android Development 67
The OpenCV Tutorials, Release 2.4.3
Linux host computer
By default Linux doesnt recognize Android devices, but its easy to x this issue. On Ubuntu Linux you have to create
a new /etc/udev/rules.d/51-android.rules conguration le that contains information about your Android device. You
may nd some Vendor IDs here or execute lsusb command to view VendorID of plugged Android device. Here is an
example of such le for LG device:
SUBSYSTEM=="usb", ATTR{idVendor}=="1004", MODE="0666", GROUP="plugdev"
Then restart your adb server (even better to restart the system), plug in your Android device and execute adb devices
command. You will see the list of attached devices:
MacOS host computer
No actions are required, just connect your device via USB and run adb devices to check connection.
Whats next
Now, when you have your development environment set up and congured, you may want to proceed to installing
OpenCV4Android SDK. You can learn how to do that in a separate OpenCV4Android SDK tutorial.
1.7 OpenCV4Android SDK
This tutorial was designed to help you with installation and conguration of OpenCV4Android SDK.
This guide was written with MS Windows 7 in mind, though it should work with GNU Linux and Apple Mac OS as
well.
This tutorial assumes you have the following software installed and congured:
JDK
Android SDK and NDK
Eclipse IDE
ADT and CDT plugins for Eclipse
If you need help with anything of the above, you may refer to our Introduction into Android Development guide.
If you encounter any error after thoroughly following these steps, feel free to contact us via OpenCV4Android discus-
sion group or OpenCV Q&A forum. Well do our best to help you out.
68 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
Tegra Android Development Pack users
You may have used Tegra Android Development Pack (TADP) released by NVIDIA for Android development envi-
ronment setup.
Beside Android development tools the TADP 2.0 includes OpenCV4Android SDK, so it can be already installed in
your system and you can skip to Running OpenCV Samples section of this tutorial.
More details regarding TADP can be found in the Introduction into Android Development guide.
General info
OpenCV4Android SDK package enables development of Android applications with use of OpenCV library.
The structure of package contents looks as follows:
OpenCV-2.4.3-android-sdk
|
_
apk
| |
_
OpenCV
_
2.4.3
_
binary
_
pack
_
XXX.apk
| |
_
OpenCV
_
2.4.3
_
Manager.apk
|
|
_
doc
|
_
samples
|
_
sdk
| |
_
etc
| |
_
java
| |
_
native
| |
_
3rdparty
| |
_
jni
| |
_
libs
| |
_
armeabi
| |
_
armeabi-v7a
| |
_
x86
|
|
_
license.txt
|
_
README.android
sdk folder contains OpenCV API and libraries for Android:
sdk/java folder contains an Android library Eclipse project providing OpenCV Java API that can be imported
into developers workspace;
sdk/native folder contains OpenCV C++ headers (for JNI code) and native Android libraries (*.so and *.a)
for ARM-v5, ARM-v7a and x86 architectures;
sdk/etc folder contains Haar and LBP cascades distributed with OpenCV.
apk folder contains Android packages that should be installed on the target Android device to enable OpenCV
library access via OpenCV Manager API (see details below).
On production devices that have access to Google Play Market (and Internet) these packages will be installed
from Market on the rst start of an application using OpenCV Manager API. But devkits without Market or In-
ternet connection require this packages to be installed manually. Install the Manager.apk and the corresponding
binary_pack.apk depending on the device CPU, the Manager GUI provides this info. Below youll see exact
commands on how to do this.
Note: Installation from Internet is the preferable way since OpenCV team may publish updated versions of this
packages on the Market.
1.7. OpenCV4Android SDK 69
The OpenCV Tutorials, Release 2.4.3
samples folder contains sample applications projects and their prebuilt packages (APK). Import them into
Eclipse workspace (like described below) and browse the code to learn possible ways of OpenCV use on An-
droid.
doc folder contains various OpenCV documentation in PDF format. Its also available online at
http://docs.opencv.org.
Note: The most recent docs (nightly build) are at http://docs.opencv.org/trunk/. Generally, its more up-to-date,
but can refer to not-yet-released functionality.
Starting from version 2.4.3 OpenCV4Android SDK uses OpenCV Manager API for library initialization. OpenCV
Manager is an Android service based solution providing the following benets for OpenCV applications developers:
Compact apk-size, since all applications use the same binaries from Manager and do not store native libs within
themselves;
Hardware specic optimizations are automatically enabled on all supported platforms;
Automatic updates and bug xes;
Trusted OpenCV library source. All packages with OpenCV are published on Google Play;
For additional information on OpenCV Manager see the:
Slides
Reference Manual
Manual OpenCV4Android SDK setup
Get the OpenCV4Android SDK
1. Go to the OpenCV download page on SourceForge and download the latest available version. Currently its
OpenCV-2.4.3-android-sdk.zip.
2. Create a new folder for Android with OpenCV development. For this tutorial we have unpacked OpenCV SDK
to the C:\Work\OpenCV4Android\ directory.
Note: Better to use a path without spaces in it. Otherwise you may have problems with ndk-build.
3. Unpack the SDK archive into the chosen directory.
You can unpack it using any popular archiver (e.g with 7-Zip):
70 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
On Unix you can use the following command:
unzip ~/Downloads/OpenCV-2.4.3-android-sdk.zip
Import OpenCV library and samples to the Eclipse
1. Start Eclipse and choose your workspace location.
We recommend to start working with OpenCV for Android from a new clean workspace. A new Eclipse
workspace can for example be created in the folder where you have unpacked OpenCV4Android SDK package:
2. Import OpenCV library and samples into workspace.
OpenCV library is packed as a ready-for-use Android Library Project. You can simply reference it in your
projects.
Each sample included into the OpenCV-2.4.3-android-sdk.zip is a regular Android project that already refer-
ences OpenCV library.Follow the steps below to import OpenCV and samples into the workspace:
Right click on the Package Explorer window and choose Import... option from the context menu:
1.7. OpenCV4Android SDK 71
The OpenCV Tutorials, Release 2.4.3
In the main panel select General Existing Projects into Workspace and press Next button:
72 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
In the Select root directory eld locate your OpenCV package folder. Eclipse should automatically locate
OpenCV library and samples:
1.7. OpenCV4Android SDK 73
The OpenCV Tutorials, Release 2.4.3
Click Finish button to complete the import operation.
After clicking Finish button Eclipse will load all selected projects into workspace, and you have to wait some
time while it is building OpenCV samples. Just give a minute to Eclipse to complete initialization.
Note: After the initial import, on a non-Windows (Linux and Mac OS) operating system Eclipse will still show
build errors for applications with native C++ code. To resolve the issues, please do the following:
Open Project Properties -> C/C++ Build, and replace Build command text to "${NDKROOT}/ndk-build"
(remove .cmd at the end).
74 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
Once Eclipse completes build you will have the clean workspace without any build errors:
Running OpenCV Samples
At this point you should be able to build and run the samples. Keep in mind, that face-detection, Tutorial 3 and
Tutorial 4 include some native code and require Android NDK and CDT plugin for Eclipse to build working appli-
cations. If you havent installed these tools see the corresponding section of Introduction into Android Development.
Also, please consider that Tutorial 0 and Tutorial 1 samples use Java Camera API that denitelly accessible on
emulator from the Android SDK. Other samples use OpenCV Native Camera which may not work with emulator.
Note: Recent Android SDK tools, revision 19+ can run ARM v7a OS images but they available not for all Android
versions.
Well, running samples from Eclipse is very simple:
1.7. OpenCV4Android SDK 75
The OpenCV Tutorials, Release 2.4.3
Connect your device with adb tool from Android SDK or create an emulator with camera support.
See Managing Virtual Devices document for help with Android Emulator.
See Using Hardware Devices for help with real devices (not emulators).
Select project you want to start in Package Explorer and just press Ctrl + F11 or select option Run Run
from the main menu, or click Run button on the toolbar.
Note: Android Emulator can take several minutes to start. So, please, be patient.
On the rst run Eclipse will ask you about the running mode for your application:
Select the Android Application option and click OK button. Eclipse will install and run the sample.
Chances are that on the rst launch you will not have the OpenCV Manager package installed. In this case you
will see the following message:
76 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
To get rid of the message you will need to install OpenCV Manager and the appropriate OpenCV binary pack.
Simply tap Yes if you have Google Play Market installed on your device/emulator. It will redirect you to the
corresponding page on Google Play Market.
If you have no access to the Market, which is often the case with emulators - you will need to install the
packages from OpenCV4Android SDK folder manually. Open the console/terminal and type in the following
two commands:
1 <Android SDK path>/platform-tools/adb install <OpenCV4Android SDK path>/apk/OpenCV
_
2.4.3
_
Manager.apk
2 <Android SDK path>/platform-tools/adb install <OpenCV4Android SDK path>/apk/OpenCV
_
2.4.3
_
binary
_
pack
_
armv7a.apk
If youre running Windows, that will probably look like this:
When done, you will be able to run OpenCV samples on your device/emulator seamlessly.
Here is Tutorial 2 - Use OpenCV Camera sample, running on top of stock camera-preview of the emulator.
1.7. OpenCV4Android SDK 77
The OpenCV Tutorials, Release 2.4.3
Whats next
Now, when you have your instance of OpenCV4Adroid SDK set up and congured, you may want to proceed to using
OpenCV in your own application. You can learn how to do that in a separate Android development with OpenCV
tutorial.
1.8 Android development with OpenCV
This tutorial is created to help you use OpenCV library within your Android project.
This guide was written with Windows 7 in mind, though it should work with any other OS supported by
OpenCV4Android SDK.
This tutorial assumes you have the following installed and congured:
JDK
Android SDK and NDK
Eclipse IDE
ADT and CDT plugins for Eclipse
If you need help with anything of the above, you may refer to our Introduction into Android Development guide.
78 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
This tutorial also assumes you have OpenCV4Android SDK already installed on your development machine and
OpenCV Manager on your testing device correspondingly. If you need help with any of these, you may consult our
OpenCV4Android SDK tutorial.
If you encounter any error after thoroughly following these steps, feel free to contact us via OpenCV4Android discus-
sion group or OpenCV Q&A forum . Well do our best to help you out.
Using OpenCV library within your Android project
In this section we will explain how to make some existing project to use OpenCV. Starting with 2.4.2 release for
Android, OpenCV Manager is used to provide apps with the best available version of OpenCV. You can get more
information here: Android_OpenCV_Manager and in these slides.
Java
Application development with async initialization
Using async initialization is a recommended way for application development. It uses the OpenCV Manager to access
OpenCV libraries externally installed in the target system.
1. Add OpenCVlibrary project to your workspace. Use menu File ->Import ->Existing project in your workspace,
press Browse button and locate OpenCV4Android SDK (OpenCV-2.4.3-android-sdk/sdk).
1.8. Android development with OpenCV 79
The OpenCV Tutorials, Release 2.4.3
2. In application project add a reference to the OpenCV Java SDK in Project -> Properties -> Android -> Library
-> Add select OpenCV Library - 2.4.3.
80 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
To run OpenCV Manager-based application for the rst time you need to install package with the OpenCV Manager
for your platform. Armeabi, Armeabi-v7a with NEON, x86 and MIPS achitectures supported. You can do it using
Google Play Market or manually with adb tool:
1 <Android SDK path>/platform-tools/adb install <OpenCV4Android SDK path>/apk/OpenCV
_
2.4.3
_
Manager.apk
For rare cases if NEON instruction set is not supported you need to install aditional OpenCV Library package:
1 <Android SDK path>/platform-tools/adb install <OpenCV4Android SDK path>/apk/OpenCV
_
2.4.3
_
binary
_
pack
_
armv7a.apk
There is a very base code snippet implementing the async initialization. It shows basic principles. See the 15-puzzle
OpenCV sample for details.
1.8. Android development with OpenCV 81
The OpenCV Tutorials, Release 2.4.3
1 public class MyActivity extends Activity implements HelperCallbackInterface
2 {
3 private BaseLoaderCallback mOpenCVCallBack = new BaseLoaderCallback(this) {
4 @Override
5 public void onManagerConnected(int status) {
6 switch (status) {
7 case LoaderCallbackInterface.SUCCESS:
8 {
9 Log.i(TAG, "OpenCV loaded successfully");
10 // Create and set View
11 mView = new puzzle15View(mAppContext);
12 setContentView(mView);
13 } break;
14 default:
15 {
16 super.onManagerConnected(status);
17 } break;
18 }
19 }
20 };
21
22 /
**
Call on every application resume
**
/
23 @Override
24 protected void onResume()
25 {
26 Log.i(TAG, "called onResume");
27 super.onResume();
28
29 Log.i(TAG, "Trying to load OpenCV library");
30 if (!OpenCVLoader.initAsync(OpenCVLoader.OPENCV
_
VERSION
_
2
_
4
_
2, this, mOpenCVCallBack))
31 {
32 Log.e(TAG, "Cannot connect to OpenCV Manager");
33 }
34 }
It this case application works with OpenCV Manager in asynchronous fashion. OnManagerConnected callback will
be called in UI thread, when initialization nishes. Please note, that it is not allowed to use OpenCV calls or load
OpenCV-dependent native libs before invoking this callback. Load your own native libraries that depend on OpenCV
after the successful OpenCV initialization. Default BaseLoaderCallback implementation treat application context as
Activity and calls Activity.nish() method to exit in case of initialization failure. To override this behavior you need
to override nish() method of BaseLoaderCallback class and implement your own nalization method.
Application development with static initialization
According to this approach all OpenCV binaries are included into your application package. It is designed mostly
for development purposes. This approach is deprecated for the production code, release package is recommended to
communicate with OpenCV Manager via the async initialization described above.
1. Add the OpenCV library project to your workspace the same way as for the async initialization above. Use
menu File -> Import -> Existing project in your workspace, push Browse button and select OpenCV SDK path
(OpenCV-2.4.3-android-sdk/sdk).
82 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
2. In the application project add a reference to the OpenCV4Android SDK in Project -> Properties -> Android ->
Library -> Add select OpenCV Library - 2.4.3;
1.8. Android development with OpenCV 83
The OpenCV Tutorials, Release 2.4.3
3. If your application project doesnt have a JNI part, just copy the corresponding OpenCV native libs
from <OpenCV-2.4.3-android-sdk>/sdk/native/libs/<target
_
arch> to your project directory to folder
libs/<target
_
arch>.
In case of the application project with a JNI part, instead of manual libraries copying you need to modify your
Android.mk le: add the following two code lines after the "include $(CLEAR
_
VARS)" and before "include
path
_
to
_
OpenCV-2.4.3-android-sdk/sdk/native/jni/OpenCV.mk"
1 OPENCV
_
CAMERA
_
MODULES:=on
2 OPENCV
_
INSTALL
_
MODULES:=on
The result should look like the following:
84 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
1 include $(CLEAR
_
VARS)
2
3 # OpenCV
4 OPENCV
_
CAMERA
_
MODULES:=on
5 OPENCV
_
INSTALL
_
MODULES:=on
6 include ../../sdk/native/jni/OpenCV.mk
After that the OpenCV libraries will be copied to your application libs folder during the JNI part build.
Eclipse will automatically include all the libraries from the libs folder to the application package (APK).
4. The last step of enabling OpenCV in your application is Java initialization code before call to OpenCV API. It
can be done, for example, in the static section of the Activity class:
1 static {
2 if (!OpenCVLoader.initDebug()) {
3 // Handle initialization error
4 }
5 }
If you application includes other OpenCV-dependent native libraries you should load them after OpenCV ini-
tialization:
1 static {
2 if (!OpenCVLoader.initDebug()) {
3 // Handle initialization error
4 } else {
5 System.loadLibrary("my
_
jni
_
lib1");
6 System.loadLibrary("my
_
jni
_
lib2");
7 }
8 }
Native/C++
To build your own Android application, which uses OpenCV from native part, the following steps should be done:
1. You can use an environment variable to specify the location of OpenCV package or just hardcode absolute or
relative path in the jni/Android.mk of your projects.
2. The le jni/Android.mk should be written for the current application using the common rules for this le.
For detailed information see the Android NDK documentation from the Android NDK archive, in the le
<path
_
where
_
NDK
_
is
_
placed>/docs/ANDROID-MK.html
3. The line
include C:\Work\OpenCV4Android\OpenCV-2.4.3-android-sdk\sdk\native\jni\OpenCV.mk
should be inserted into the jni/Android.mk le after the line
include $(CLEAR
_
VARS)
4. Several variables can be used to customize OpenCV stuff, but you dont need to use them when your application
uses the async initialization via the OpenCV Manager API.
Note: these variables should be set before the "include .../OpenCV.mk" line:
OPENCV
_
INSTALL
_
MODULES:=on
Copies necessary OpenCV dynamic libs to the project libs folder in order to include them into the APK.
1.8. Android development with OpenCV 85
The OpenCV Tutorials, Release 2.4.3
OPENCV
_
CAMERA
_
MODULES:=off
Skip native OpenCV camera related libs copying to the project libs folder.
OPENCV
_
LIB
_
TYPE:=STATIC
Perform static link with OpenCV. By default dynamic link is used and the project JNI lib depends on
libopencv
_
java.so.
5. The le Application.mk should exist and should contain lines:
APP
_
STL := gnustl
_
static
APP
_
CPPFLAGS := -frtti -fexceptions
Also the line like this one:
APP
_
ABI := armeabi-v7a
should specify the application target platforms.
In some cases a linkage error (like "In function cv::toUtf16(std::basic
_
string<...>...
undefined reference to mbstowcs") happens when building an application JNI library depending on
OpenCV. The following line in the Application.mk usually xes it:
APP
_
PLATFORM := android-9
6. Either use manual ndk-build invocation or setup Eclipse CDT Builder to build native JNI lib before Java part
[re]build and APK creation.
Hello OpenCV Sample
Here are basic steps to guide you trough the process of creating a simple OpenCV-centric application. It will be
capable of accessing camera output, processing it and displaying the result.
1. Open Eclipse IDE, create a new clean workspace, create a new Android project (File -> New -> Android
Project).
2. Set name, target, package and minSDKVersion accordingly.
3. Create a new class (File -> New -> Class). Name it for example: HelloOpenCVView.
86 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
It should extend SurfaceView class.
It also should implement SurfaceHolder.Callback, Runnable.
4. Edit HelloOpenCVView class.
Add an import line for android.content.context.
Modify autogenerated stubs: HelloOpenCVView, surfaceCreated, surfaceDestroyed and surfaceChanged.
1 package com.hello.opencv.test;
2
3 import android.content.Context;
4
5 public class HelloOpenCVView extends SurfaceView implements Callback, Runnable {
6
1.8. Android development with OpenCV 87
The OpenCV Tutorials, Release 2.4.3
7 public HelloOpenCVView(Context context) {
8 super(context);
9 getHolder().addCallback(this);
10 }
11
12 public void surfaceCreated(SurfaceHolder holder) {
13 (new Thread(this)).start();
14 }
15
16 public void surfaceDestroyed(SurfaceHolder holder) {
17 cameraRelease();
18 }
19
20 public void surfaceChanged(SurfaceHolder holder, int format, int width, int height) {
21 cameraSetup(width, height);
22 }
23
24 //...
Add cameraOpen, cameraRelease and cameraSetup voids as shown below.
Also, dont forget to add the public void run() as follows:
1 public void run() {
2 // TODO: loop { getFrame(), processFrame(), drawFrame() }
3 }
4
5 public boolean cameraOpen() {
6 return false; //TODO: open camera
7 }
8
9 private void cameraRelease() {
10 // TODO release camera
11 }
12
13 private void cameraSetup(int width, int height) {
14 // TODO setup camera
15 }
5. Create a new Activity (New -> Other -> Android -> Android Activity) and name it, for example: HelloOpenC-
VActivity. For this activity dene onCreate, onResume and onPause voids.
1 public void onCreate (Bundle savedInstanceState) {
2 super.onCreate(savedInstanceState);
3 mView = new HelloOpenCVView(this);
4 setContentView (mView);
5 }
6
7 protected void onPause() {
8 super.onPause();
9 mView.cameraRelease();
10 }
11
12 protected void onResume() {
13 super.onResume();
14 if( !mView.cameraOpen() ) {
15 // MessageBox and exit app
16 AlertDialog ad = new AlertDialog.Builder(this).create();
17 ad.setCancelable(false); // This blocks the "BACK" button
88 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
18 ad.setMessage("Fatal error: cant open camera!");
19 ad.setButton("OK", new DialogInterface.OnClickListener() {
20 public void onClick(DialogInterface dialog, int which) {
21 dialog.dismiss();
22 finish();
23 }
24 });
25 ad.show();
26 }
27 }
6. Add the following permissions to the AndroidManifest.xml le:
1 </application>
2
3 <uses-permission android:name="android.permission.CAMERA" />
4 <uses-feature android:name="android.hardware.camera" />
5 <uses-feature android:name="android.hardware.camera.autofocus" />
7. Reference OpenCV library within your project properties.
1.8. Android development with OpenCV 89
The OpenCV Tutorials, Release 2.4.3
8. We now need some code to handle the camera. Update the HelloOpenCVView class as follows:
1 private VideoCapture mCamera;
2
3 public boolean cameraOpen() {
4 synchronized (this) {
5 cameraRelease();
6 mCamera = new VideoCapture(Highgui.CV
_
CAP
_
ANDROID);
7 if (!mCamera.isOpened()) {
8 mCamera.release();
9 mCamera = null;
10 Log.e("HelloOpenCVView", "Failed to open native camera");
11 return false;
12 }
13 }
90 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
14 return true;
15 }
16
17 public void cameraRelease() {
18 synchronized(this) {
19 if (mCamera != null) {
20 mCamera.release();
21 mCamera = null;
22 }
23 }
24 }
25
26 private void cameraSetup(int width, int height) {
27 synchronized (this) {
28 if (mCamera != null && mCamera.isOpened()) {
29 List<Size> sizes = mCamera.getSupportedPreviewSizes();
30 int mFrameWidth = width;
31 int mFrameHeight = height;
32 { // selecting optimal camera preview size
33 double minDiff = Double.MAX
_
VALUE;
34 for (Size size : sizes) {
35 if (Math.abs(size.height - height) < minDiff) {
36 mFrameWidth = (int) size.width;
37 mFrameHeight = (int) size.height;
38 minDiff = Math.abs(size.height - height);
39 }
40 }
41 }
42 mCamera.set(Highgui.CV
_
CAP
_
PROP
_
FRAME
_
WIDTH, mFrameWidth);
43 mCamera.set(Highgui.CV
_
CAP
_
PROP
_
FRAME
_
HEIGHT, mFrameHeight);
44 }
45 }
46 }
9. The last step would be to update the run() void in HelloOpenCVView class as follows:
1 public void run() {
2 while (true) {
3 Bitmap bmp = null;
4 synchronized (this) {
5 if (mCamera == null)
6 break;
7 if (!mCamera.grab())
8 break;
9
10 bmp = processFrame(mCamera);
11 }
12 if (bmp != null) {
13 Canvas canvas = getHolder().lockCanvas();
14 if (canvas != null) {
15 canvas.drawBitmap(bmp, (canvas.getWidth() - bmp.getWidth()) / 2,
16 (canvas.getHeight() - bmp.getHeight()) / 2, null);
17 getHolder().unlockCanvasAndPost(canvas);
18
19 }
20 bmp.recycle();
21 }
22 }
1.8. Android development with OpenCV 91
The OpenCV Tutorials, Release 2.4.3
23 }
24
25 protected Bitmap processFrame(VideoCapture capture) {
26 Mat mRgba = new Mat();
27 capture.retrieve(mRgba, Highgui.CV
_
CAP
_
ANDROID
_
COLOR
_
FRAME
_
RGBA);
28 //process mRgba
29 Bitmap bmp = Bitmap.createBitmap(mRgba.cols(), mRgba.rows(), Bitmap.Config.ARGB
_
8888);
30 try {
31 Utils.matToBitmap(mRgba, bmp);
32 } catch(Exception e) {
33 Log.e("processFrame", "Utils.matToBitmap() throws an exception: " + e.getMessage());
34 bmp.recycle();
35 bmp = null;
36 }
37 return bmp;
38 }
1.9 Installation in iOS
Required Packages
CMake 2.8.8 or higher
Xcode 4.2 or higher
Getting the Cutting-edge OpenCV from Git Repository
Launch GIT client and clone OpenCV repository from here
In MacOS it can be done using the following command in Terminal:
cd ~/<my
_
working
_
directory>
git clone https://github.com/Itseez/opencv.git
Building OpenCV from Source, using CMake and Command Line
1. Make symbolic link for Xcode to let OpenCV build scripts nd the compiler, header les etc.
cd /
sudo ln -s /Applications/Xcode.app/Contents/Developer Developer
2. Build OpenCV framework:
cd ~/<my
_
working
_
directory>
python opencv/ios/build
_
framework.py ios
If everythings ne, a few minutes later you will get ~/<my_working_directory>/ios/opencv2.framework. You can add
this framework to your Xcode projects.
Further Reading
You can nd several OpenCV+iOS tutorials here OpenCV iOS.
92 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
1.10 Load and Display an Image
Goal
In this tutorial you will learn how to:
Load an image (using imread)
Create a named OpenCV window (using namedWindow)
Display an image in an OpenCV window (using imshow)
Source Code
Download the source code from here.
1 #include <opencv2/core/core.hpp>
2 #include <opencv2/highgui/highgui.hpp>
3 #include <iostream>
4
5 using namespace cv;
6 using namespace std;
7
8 int main( int argc, char
**
argv )
9 {
10 if( argc != 2)
11 {
12 cout <<" Usage: display
_
image ImageToLoadAndDisplay" << endl;
13 return -1;
14 }
15
16 Mat image;
17 image = imread(argv[1], CV
_
LOAD
_
IMAGE
_
COLOR); // Read the file
18
19 if(! image.data ) // Check for invalid input
20 {
21 cout << "Could not open or find the image" << std::endl ;
22 return -1;
23 }
24
25 namedWindow( "Display window", CV
_
WINDOW
_
AUTOSIZE );// Create a window for display.
26 imshow( "Display window", image ); // Show our image inside it.
27
28 waitKey(0); // Wait for a keystroke in the window
29 return 0;
30 }
Explanation
In OpenCV 2 we have multiple modules. Each one takes care of a different area or approach towards image processing.
You could already observe this in the structure of the user guide of these tutorials itself. Before you use any of them
you rst need to include the header les where the content of each individual module is declared.
Youll almost always end up using the:
core section, as here are dened the basic building blocks of the library
1.10. Load and Display an Image 93
The OpenCV Tutorials, Release 2.4.3
highgui module, as this contains the functions for input and output operations
// Video Image PSNR and SSIM
#include <iostream> // for standard I/O
#include <string> // for strings
We also include the iostream to facilitate console line output and input. To avoid data structure and function name
conicts with other libraries, OpenCV has its own namespace: cv. To avoid the need appending prior each of these the
cv:: keyword you can import the namespace in the whole le by using the lines:
using namespace cv;
using namespace std;
This is true for the STL library too (used for console I/O). Now, lets analyze the main function. We start up assuring
that we acquire a valid image name argument from the command line.
if( argc != 2)
{
cout <<" Usage: display
_
image ImageToLoadAndDisplay" << endl;
return -1;
}
Then create a Mat object that will store the data of the loaded image.
Mat image;
Now we call the imread function which loads the image name specied by the rst argument (argv[1]). The second
argument species the format in what we want the image. This may be:
CV_LOAD_IMAGE_UNCHANGED (<0) loads the image as is (including the alpha channel if present)
CV_LOAD_IMAGE_GRAYSCALE ( 0) loads the image as an intensity one
CV_LOAD_IMAGE_COLOR (>0) loads the image in the RGB format
image = imread(argv[1], CV
_
LOAD
_
IMAGE
_
COLOR); // Read the file
Note: OpenCV offers support for the image formats Windows bitmap (bmp), portable image formats (pbm, pgm,
ppm) and Sun raster (sr, ras). With help of plugins (you need to specify to use them if you build yourself the library,
nevertheless in the packages we ship present by default) you may also load image formats like JPEG (jpeg, jpg, jpe),
JPEG 2000 (jp2 - codenamed in the CMake as Jasper), TIFF les (tiff, tif) and portable network graphics (png).
Furthermore, OpenEXR is also a possibility.
After checking that the image data was loaded correctly, we want to display our image, so we create an OpenCV
window using the namedWindow function. These are automatically managed by OpenCV once you create them. For
this you need to specify its name and how it should handle the change of the image it contains from a size point of
view. It may be:
CV_WINDOW_AUTOSIZE is the only supported one if you do not use the Qt backend. In this case the window
size will take up the size of the image it shows. No resize permitted!
CV_WINDOW_NORMAL on Qt you may use this to allow window resize. The image will resize itself according
to the current window size. By using the | operator you also need to specify if you would like the image to keep
its aspect ratio (CV_WINDOW_KEEPRATIO) or not (CV_WINDOW_FREERATIO).
namedWindow( "Display window", CV
_
WINDOW
_
AUTOSIZE );// Create a window for display.
Finally, to update the content of the OpenCV window with a new image use the imshow function. Specify the OpenCV
window name to update and the image to use during this operation:
94 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
imshow( "Display window", image ); // Show our image inside it.
Because we want our window to be displayed until the user presses a key (otherwise the program would end far too
quickly), we use the waitKey function whose only parameter is just how long should it wait for a user input (measured
in milliseconds). Zero means to wait forever.
waitKey(0); // Wait for a keystroke in the window
Result
Compile your code and then run the executable giving an image path as argument. If youre on Windows the
executable will of course contain an exe extension too. Of course assure the image le is near your program le.
./DisplayImage HappyFish.jpg
You should get a nice window as the one shown below:
1.11 Load, Modify, and Save an Image
Note: We assume that by now you know how to load an image using imread and to display it in a window (using
imshow). Read the Load and Display an Image tutorial otherwise.
Goals
In this tutorial you will learn how to:
Load an image using imread
Transform an image from RGB to Grayscale format by using cvtColor
Save your transformed image in a le on disk (using imwrite)
Code
Here it is:
1.11. Load, Modify, and Save an Image 95
The OpenCV Tutorials, Release 2.4.3
1 #include <cv.h>
2 #include <highgui.h>
3
4 using namespace cv;
5
6 int main( int argc, char
**
argv )
7 {
8 char
*
imageName = argv[1];
9
10 Mat image;
11 image = imread( imageName, 1 );
12
13 if( argc != 2 || !image.data )
14 {
15 printf( " No image data \n " );
16 return -1;
17 }
18
19 Mat gray
_
image;
20 cvtColor( image, gray
_
image, CV
_
RGB2GRAY );
21
22 imwrite( "../../images/Gray
_
Image.jpg", gray
_
image );
23
24 namedWindow( imageName, CV
_
WINDOW
_
AUTOSIZE );
25 namedWindow( "Gray image", CV
_
WINDOW
_
AUTOSIZE );
26
27 imshow( imageName, image );
28 imshow( "Gray image", gray
_
image );
29
30 waitKey(0);
31
32 return 0;
33 }
Explanation
1. We begin by:
Creating a Mat object to store the image information
Load an image using imread, located in the path given by imageName. Fort this example, assume you are
loading a RGB image.
2. Now we are going to convert our image from RGB to Grayscale format. OpenCV has a really nice function to
do this kind of transformations:
cvtColor( image, gray
_
image, CV
_
RGB2GRAY );
As you can see, cvtColor takes as arguments:
a source image (image)
a destination image (gray_image), in which we will save the converted image.
an additional parameter that indicates what kind of transformation will be performed. In this case we use
CV_RGB2GRAY (self-explanatory).
3. So now we have our new gray_image and want to save it on disk (otherwise it will get lost after the program
ends). To save it, we will use a function analagous to imread: imwrite
96 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
imwrite( "../../images/Gray
_
Image.jpg", gray
_
image );
Which will save our gray_image as Gray_Image.jpg in the folder images located two levels up of my current
location.
4. Finally, lets check out the images. We create two windows and use them to show the original image as well as
the new one:
namedWindow( imageName, CV
_
WINDOW
_
AUTOSIZE );
namedWindow( "Gray image", CV
_
WINDOW
_
AUTOSIZE );
imshow( imageName, image );
imshow( "Gray image", gray
_
image );
5. Add add the waitKey(0) function call for the program to wait forever for an user key press.
Result
When you run your program you should get something like this:
And if you check in your folder (in my case images), you should have a newly .jpg le named Gray_Image.jpg:
1.11. Load, Modify, and Save an Image 97
The OpenCV Tutorials, Release 2.4.3
Congratulations, you are done with this tutorial!
1.12 How to write a tutorial for OpenCV?
Okay, so assume you have just nished a project of yours implementing something based on OpenCV and you want
to present/share it with the community. Luckily, OpenCV is an open source project. This means that in theory anyone
has access to the full source code and may extend it. While making a robust and practical library (like OpenCV) is
great, the success of a library also depends on how user friendly it is. To improve on this aspect, the OpenCV team has
already been listening to user feedback from its Yahoo user group and by making samples you can nd in the source
directories sample folder. The addition of the tutorials (in both online and PDF format) is an extension of these efforts.
Goal
The tutorials are just as an important part of the library as the implementation of those crafty data structures and
algorithms you can nd in OpenCV. Therefore, the source codes for the tutorials are part of the library. And yes, I
meant source codes. The reason for this formulation is that the tutorials are written by using the Sphinx documen-
tation generation system. This is based on the popular python documentation system called reStructuredText (reST).
ReStructuredText is a really neat language that by using a few simple conventions (indentation, directives) and emu-
lating old school e-mail writing techniques (text only) tries to offer a simple way to create and edit documents. Sphinx
extends this with some new features and creates the resulting document in both HTML (for web) and PDF (for ofine
usage) format.
Usually, an OpenCV tutorial has the following parts:
1. A source code demonstration of an OpenCV feature:
1. One or more CPP, Python, Java or other type of les depending for what OpenCV offers support and
for what language you make the tutorial.
2. Occasionaly, input resource les required for running your tutorials application.
2. A table of content entry (so people may easily nd the tutorial):
1. Adding your stuff to the tutorials table of content (reST le).
2. Add an image le near the TOC entry.
3. The content of the tutorial itself:
1. The reST text of the tutorial
98 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
2. Images following the idea that A picture is worth a thousand words.
3. For more complex demonstrations you may create a video.
As you can see you will need at least some basic knowledge of the reST system in order to complete the task at hand
with success. However, dont worry reST (and Sphinx) was made with simplicity in mind. It is easy to grasp its basics.
I found that the OpenAlea documentations introduction on this subject (or the Thomas Cokelaer one ) should enough
for this. If for some directive or feature you need a more in-depth description look it up in the ofcial reStructuredText
help les or at the Sphinx documentation. In our world achieving some tasks is possible in multiple ways. However,
some of the roads to take may have obvious or hidden advantages over others. Then again, in some other cases it may
come down to just simple user preference. Here, Ill present how I decided to write the tutorials, based on my personal
experience. If for some of them you know a better solution and you can back it up feel free to use that. Ive nothing
against it, as long as it gets the job done in an elegant fashion. Now the best would be if you could make the integration
yourself. For this you need rst to have the source code. I recommend following the guides for your operating system
on acquiring OpenCV sources. For Linux users look here and for Windows here. You must also install python and
sphinx with its dependencies in order to be able to build the documentation. Once you have downloaded the repository
to your hard drive you can take a look in the OpenCV directory to make sure you have both the samples and doc folder
present. Anyone may download the trunk source les from git://code.opencv.org/opencv.git . Nevertheless,
not everyone has upload (commit/submit) rights. This is to protect the integrity of the library. If you plan doing more
than one tutorial, and would like to have an account with commit user rights you should rst register an account at
http://code.opencv.org/ and then contact dr. Gary Bradski at [email protected]. Otherwise,
you can just send the resulting les to us via the Yahoo user group or to me at [email protected]
and Ill add it. If you have questions, suggestions or constructive critics I will gladly listen to them. If you send it to
the OpenCV group please tag its subject with a [Tutorial] entry.
Format the Source Code
Before I start this let it be clear: the main goal is to have a working sample code. However, for your tutorial to be of
a top notch quality you should follow a few guide lines I am going to present here. In case you have an application
by using the older interface (with IplImage, CVMat, cvLoadImage and such) consider migrating it to the new C++
interface. The tutorials are intended to be an up to date help for our users. And as of OpenCV 2 the OpenCV emphasis
on using the less error prone and clearer C++ interface. Therefore, if possible please convert your code to the C++
interface. For this it may help to read the Interoperability with OpenCV 1 tutorial. However, once you have an OpenCV
2 working code, then you should make your source code snippet as easy to read as possible. Herere a couple of advices
for this:
Add a standard output with the description of what your program does. Keep it short and yet, descriptive. This
output is at the start of the program. In my example les this usually takes the form of a help function containing
the output. This way both the source le viewer and application runner can see what all is about in your sample.
Heres an instance of this:
void help()
{
cout
<< "--------------------------------------------------------------------------" << endl
<< "This program shows how to write video files. You can extract the R or G or B color channel "
<< " of the input video. You can choose to use the source codec (Y) or select a custom one. (N)"<< endl
<< "Usage:" << endl
<< "./video-write inputvideoName [ R | G | B] [Y | N]" << endl
<< "--------------------------------------------------------------------------" << endl
<< endl;
}
// ...
int main(int argc, char
*
argv[], char
*
window
_
name)
{
help();
1.12. How to write a tutorial for OpenCV? 99
The OpenCV Tutorials, Release 2.4.3
// here comes the actual source code
}
Additionally, nalize the description with a short usage guide. This way the user will know how to call your
programs, what leads us to the next point.
Prefer command line argument controlling instead of hard coded one. If your program has some variables that
may be changed use command line arguments for this. The tutorials, can be a simple try-out ground for the user.
If you offer command line controlling for the input image (for example), then you offer the possibility for the
user to try it out with his/her own images, without the need to mess in the source code. In the upper example
you can see that the input image, channel and codec selection may all be changed from the command line. Just
compile the program and run it with your own input arguments.
Be as verbose as possible. There is no shame in lling the source code with comments. This way the more
advanced user may gure out whats happening right from the sample code. This advice goes for the output
console too. Specify to the user whats happening. Never leave the user hanging there and thinking on: Is this
program now crashing or just doing some computationally intensive task?. So, if you do a training task that
may take some time, make sure you print out a message about this before starting and after nishing it.
Throw out unnecessary stuff from your source code. This is a warning to not take the previous point too
seriously. Balance is the key. If its something that can be done in a fewer lines or simpler than thats the way
you should do it. Nevertheless, if for some reason you have such sections notify the user why you have chosen
to do so. Keep the amount of information as low as possible, while still getting the job done in an elegant way.
Put your sample le into the opencv/samples/cpp/tutorial
_
code/sectionName folder. If you write a
tutorial for other languages than cpp, then change that part of the path. Before completing this you need to
decide that to what section (module) does your tutorial goes. Think about on what module relies most heavily
your code and that is the one to use. If the answer to this question is more than one modules then the general
section is the one to use. For nding the opencv directory open up your le system and navigate where you
downloaded our repository.
If the input resources are hard to acquire for the end user consider adding a few of them to the
opencv/samples/cpp/tutorial
_
code/images. Make sure that who reads your code can try it out!
Add the TOC entry
For this you will need to know some reStructuredText. There is no going around this. reStructuredText les have rst
extensions. However, these are simple text les. Use any text editor you like. Finding a text editor that offers syntax
highlighting for reStructuredText was quite a challenge at the time of writing this tutorial. In my experience, Intype is
a solid option on Windows, although there is still place for improvement.
Adding your source code to a table of content is important for multiple reasons. First and foremost this will allow for
the user base to nd your tutorial from our websites tutorial table of content. Secondly, if you omit this Sphinx will
throw a warning that your tutorial le isnt part of any TOC tree entry. And there is nothing more than the developer
team hates than an ever increasing warning/error list for their builds. Sphinx also uses this to build up the previous-
back-up buttons on the website. Finally, omitting this step will lead to that your tutorial will not be added to the PDF
version of the tutorials.
Navigate to the opencv/doc/tutorials/section/table
_
of
_
content
_
section folder (where the section is the
module to which youre adding the tutorial). Open the table_of_content_section le. Now this may have two forms.
If no prior tutorials are present in this section that there is a template message about this and has the following form:
..
_
Table-Of-Content-Section:
Section title
-----------------------------------------------------------
Description about the section.
100 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
.. include:: ../../definitions/noContent.rst
.. raw:: latex
\pagebreak
The rst line is a reference to the section title in the reST system. The section title will be a link and you may refer to it
via the :ref: directive. The include directive imports the template text from the denitions directories noContent.rst
le. Sphinx does not creates the PDF from scratch. It does this by rst creating a latex le. Then creates the PDF from
the latex le. With the raw directive you can directly add to this output commands. Its unique argument is for what
kind of output to add the content of the directive. For the PDFs it may happen that multiple sections will overlap on a
single page. To avoid this at the end of the TOC we add a pagebreak latex command, that hints to the LATEX system
that the next line should be on a new page. If you have one of this, try to transform it to the following form:
..
_
Table-Of-Content-Section:
Section title
-----------------------------------------------------------
.. include:: ../../definitions/tocDefinitions.rst
+
.. tabularcolumns:: m{100pt} m{300pt}
.. cssclass:: toctableopencv
=============== ======================================================
|MatBasicIma|
**
Title:
**
:ref:matTheBasicImageContainer
*
Compatibility:
*
> OpenCV 2.0
*
Author:
*
|Author
_
BernatG|
You will learn how to store images in the memory and how to print out their content to the console.
=============== =====================================================
.. |MatBasicIma| image:: images/matTheBasicImageStructure.jpg
:height: 90pt
:width: 90pt
.. raw:: latex
\pagebreak
.. toctree::
:hidden:
../mat - the basic image container/mat - the basic image container
If this is already present just add a new section of the content between the include and the raw directives (excluding
those lines). Here youll see a new include directive. This should be present only once in a TOC tree and the reST le
contains the denitions of all the authors contributing to the OpenCV tutorials. We are a multicultural community and
some of our name may contain some funky characters. However, reST only supports ANSI characters. Luckily we can
specify Unicode characters with the unicode directive. Doing this for all of your tutorials is a troublesome procedure.
Therefore, the tocDenitions le contains the denition of your author name. Add it here once and afterwards just use
the replace construction. For example heres the denition for my name:
.. |Author
_
BernatG| unicode:: Bern U+00E1 t U+0020 G U+00E1 bor
The |Author
_
BernatG| is the text denitions alias. I can use later this to add the denition, like Ive done in the
TOCs Author part. After the :: and a space you start the denition. If you want to add an UNICODE character
(non-ASCI) leave an empty space and specify it in the format U+(UNICODE code). To nd the UNICODE code of
a character I recommend using the FileFormat websites service. Spaces are trimmed from the denition, therefore we
add a space by its UNICODE character (U+0020). Until the raw directive what you can see is a TOC tree entry. Heres
how a TOC entry will look like:
.. tabularcolumns:: m{100pt} m{300pt}
.. cssclass:: toctableopencv
=============== ======================================================
|MatBasicIma|
**
Title:
**
:ref:matTheBasicImageContainer
1.12. How to write a tutorial for OpenCV? 101
The OpenCV Tutorials, Release 2.4.3
*
Compatibility:
*
> OpenCV 2.0
*
Author:
*
|Author
_
BernatG|
You will learn how to store images in the memory and how to print out their content to the console.
=============== ======================================================
.. |MatBasicIma| image:: images/matTheBasicImageStructure.jpg
:height: 90pt
:width: 90pt
As you can see we have an image to the left and a description box to the right. To create two boxes we use a table with
two columns and a single row. In the left column is the image and in the right one the description. However, the image
directive is way too long to t in a column. Therefore, we need to use the substitution denition system. We add this
denition after the TOC tree. All images for the TOC tree are to be put in the images folder near its reStructuredText
le. We use the point measurement system because we are also creating PDFs. PDFs are printable documents, where
there is no such thing that pixels (px), just points (pt). And while generally space is no problem for web pages (we
have monitors with huge resolutions) the size of the paper (A4 or letter) is constant and will be for a long time in the
future. Therefore, size constrains come in play more like for the PDF, than the generated HTML code. Now your
images should be as small as possible, while still offering the intended information for the user. Remember that the
tutorial will become part of the OpenCV source code. If you add large images (that manifest in form of large image
size) it will just increase the size of the repository pointlessly. If someone wants to download it later, its download
time will be that much longer. Not to mention the larger PDF size for the tutorials and the longer load time for the
web pages. In terms of pixels a TOC image should not be larger than 120 X 120 pixels. Resize your images if they are
larger!
Note: If you add a larger image and specify a smaller image size, Sphinx will not resize that. At build time will
add the full size image and the resize will be done by your browser after the image is loaded. A 120 X 120 image is
somewhere below 10KB. If you add a 110KB image, you have just pointlessly added a 100KB extra data to transfer
over the internet for every user!
Generally speaking you shouldnt need to specify your images size (excluding the TOC entries). If no such is found
Sphinx will use the size of the image itself (so no resize occurs). Then again if for some reason you decide to specify a
size that should be the width of the image rather than its height. The reason for this again goes back to the PDFs. On a
PDF page the height is larger than the width. In the PDF the images will not be resized. If you specify a size that does
not t in the page, then what does not ts in will be cut off. When creating your images for your tutorial you should
try to keep the image widths below 500 pixels, and calculate with around 400 point page width when specifying image
widths.
The image format depends on the content of the image. If you have some complex scene (many random like colors)
then use jpg. Otherwise, prefer using png. They are even some tools out there that optimize the size of PNG images,
such as PNGGauntlet. Use them to make your images as small as possible in size. Now on the right side column of
the table we add the information about the tutorial:
In the rst line it is the title of the tutorial. However, there is no need to specify it explicitly. We use the reference
system. Well start up our tutorial with a reference specication, just like in case of this TOC entry with its ..
_Table-Of-Content-Section: . If after this you have a title (pointed out by the following line of -), then Sphinx
will replace the :ref:Table-Of-Content-Section directive with the tile of the section in reference form
(creates a link in web page). Heres how the denition looks in my case:
..
_
matTheBasicImageContainer:
Mat - The Basic Image Container
*******************************
Note, that according to the reStructuredText rules the * should be as long as your title.
Compatibility. What version of OpenCV is required to run your sample code.
Author. Use the substitution markup of reStructuredText.
102 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
A short sentence describing the essence of your tutorial.
Now before each TOC entry you need to add the three lines of:
+
.. tabularcolumns:: m{100pt} m{300pt}
.. cssclass:: toctableopencv
The plus sign (+) is to enumerate tutorials by using bullet points. So for every TOC entry we have a corresponding
bullet point represented by the +. Sphinx is highly indenting sensitive. Indentation is used to express from which point
until to which point does a construction last. Un-indentation means end of that construction. So to keep all the bullet
points to the same group the following TOC entries (until the next +) should be indented by two spaces. Here, I should
also mention that always prefer using spaces instead of tabs. Working with only spaces makes possible that if we both
use monotype fonts we will see the same thing. Tab size is text editor dependent and as should be avoided. Sphinx
translates all tabs into 8 spaces before interpreting it. It turns out that the automatic formatting of both the HTML and
PDF(LATEX) system messes up our tables. Therefore, we need to help them out a little. For the PDF generation we
add the .. tabularcolumns:: m{100pt} m{300pt} directive. This means that the rst column should be 100
points wide and middle aligned. For the HTML look we simply name the following table of a toctableopencv class
type. Then, we can modify the look of the table by modifying the CSS of our web page. The CSS denitions go into
the opencv/doc/
_
themes/blue/static/default.css
_
t le.
.toctableopencv
{
width: 100% ;
table-layout: fixed;
}
.toctableopencv colgroup col:first-child
{
width: 100pt !important;
max-width: 100pt !important;
min-width: 100pt !important;
}
.toctableopencv colgroup col:nth-child(2)
{
width: 100% !important;
}
However, you should not need to modify this. Just add these three lines (plus keep the two space indentation) for all
TOC entries you add. At the end of the TOC le youll nd:
.. raw:: latex
\pagebreak
.. toctree::
:hidden:
../mat - the basic image container/mat - the basic image container
The page break entry comes for separating sections and should be only one in a TOC tree reStructuredText le. Finally,
at the end of the TOC tree we need to add our tutorial to the Sphinx TOC tree system. Sphinx will generate from this
the previous-next-up information for the HTML le and add items to the PDF according to the order here. By default
this TOC tree directive generates a simple table of contents. However, we already created a fancy looking one so we
no longer need this basic one. Therefore, we add the hidden option to do not show it. The path is of a relative type.
We step back in the le system and then go into the mat - the basic image container directory for the mat -
the basic image container.rst le. Putting out the rst extension for the le is optional.
1.12. How to write a tutorial for OpenCV? 103
The OpenCV Tutorials, Release 2.4.3
Write the tutorial
Create a folder with the name of your tutorial. Preferably, use small letters only. Then create a text le in this
folder with rst extension and the same name. If you have images for the tutorial create an images folder and add
your images there. When creating your images follow the guidelines described in the previous part! Now heres our
recommendation for the structure of the tutorial (although, remember that this is not carved in the stone; if you have a
better idea, use it!):
Create the reference point and the title.
..
_
matTheBasicImageContainer:
Mat - The Basic Image Container
*******************************
You start the tutorial by specifying a reference point by the ..
_
matTheBasicImageContainer: and then its
title. The name of the reference point should be a unique one over the whole documentation. Therefore, do not
use general names like tutorial1. Use the * character to underline the title for its full width. The subtitles of the
tutorial should be underlined with = charachter.
Goals. You start your tutorial by specifying what you will present. You can also enumerate the sub jobs to be
done. For this you can use a bullet point construction. There is a single conguration le for both the reference
manual and the tutorial documentation. In the reference manuals at the argument enumeration we do not want
any kind of bullet point style enumeration. Therefore, by default all the bullet points at this level are set to do
not show the dot before the entries in the HTML. You can override this by putting the bullet point in a container.
Ive dened a square type bullet point view under the name enumeratevisibleitemswithsquare. The CSS style
denition for this is again in the opencvdoc
_
themesbluestaticdefault.css
_
t le. Heres a quick example
of using it:
.. container:: enumeratevisibleitemswithsquare
+ Create the reference point and the title.
+ Second entry
+ Third entry
Note that you need the keep the indentation of the container directive. Directive indentations are always three
(3) spaces. Here you may even give usage tips for your sample code.
Source code. Present your samples code to the user. Its a good idea to offer a quick download link for the
HTML page by using the download directive and pointing out where the user may nd your source code in the
le system by using the le directive:
Text :file:samples/cpp/tutorial
_
code/highgui/video-write/ folder of the OpenCV source library
or :download:text to appear in the webpage
<../../../../samples/cpp/tutorial
_
code/HighGUI/video-write/video-write.cpp>.
For the download link the path is a relative one, hence the multiple back stepping operations (..). Then you can
add the source code either by using the code block directive or the literal include one. In case of the code block
you will need to actually add all the source code text into your reStructuredText text and also apply the required
indentation:
.. code-block:: cpp
int i = 0;
l = ++j;
The only argument of the directive is the language used (here CPP). Then you add the source code into its
content (meaning one empty line after the directive) by keeping the indentation of the directive (3 spaces). With
the literal include directive you do not need to add the source code of the sample. You just specify the sample
and Sphinx will load it for you, during build time. Heres an example usage:
104 Chapter 1. Introduction to OpenCV
The OpenCV Tutorials, Release 2.4.3
.. literalinclude:: ../../../../samples/cpp/tutorial
_
code/HighGUI/video-write/video-write.cpp
:language: cpp
:linenos:
:tab-width: 4
:lines: 1-8, 21-22, 24-
After the directive you specify a relative path to the le from what to import. It has four options: the lan-
guage to use, if you add the :linenos: the line numbers will be shown, you can specify the tab size with the
:tab-width: and you do not need to load the whole le, you can show just the important lines. Use the lines
option to do not show redundant information (such as the help function). Here basically you specify ranges, if
the second range line number is missing than that means that until the end of the le. The ranges specied here
do no need to be in an ascending order, you may even reorganize the structure of how you want to show your
sample inside the tutorial.
The tutorial. Well here goes the explanation for why and what have you used. Try to be short, clear, concise
and yet a thorough one. Theres no magic formula. Look into a few already made tutorials and start out from
there. Try to mix sample OpenCV code with your explanations. If with words is hard to describe something do
not hesitate to add in a reasonable size image, to overcome this issue. When you present OpenCV functionality
its a good idea to give a link to the used OpenCV data structure or function. Because the OpenCV tutorials
and reference manual are in separate PDF les it is not possible to make this link work for the PDF format.
Therefore, we use here only web page links to the opencv.itseez.com website. The OpenCV functions and
data structures may be used for multiple tasks. Nevertheless, we want to avoid that every users creates its own
reference to a commonly used function. So for this we use the global link collection of Sphinx. This is dened
in the le:opencv/doc/conf.py conguration le. Open it and go all the way down to the last entry:
# ---- External links for tutorials -----------------
extlinks = {
hgvideo : (http://opencv.itseez.com/modules/highgui/doc/reading
_
and
_
writing
_
images
_
and
_
video.html#%s, None)
}
In short here we dened a new hgvideo directive that refers to an external webpage link. Its usage is:
A sample function of the highgui modules image write and read page is the :hgvideo:imread() function <imread>.
Which turns to: A sample function of the highgui modules image write and read page is the imread()
function. The argument you give between the <> will be put in place of the %s in the upper deni-
tion, and as the link will anchor to the correct function. To nd out the anchor of a given function
just open up a web page, search for the function and click on it. In the address bar it should appear like:
http://opencv.itseez.com/modules/highgui/doc/reading
_
and
_
writing
_
images
_
and
_
video.html#imread
. Look here for the name of the directives for each page of the OpenCV reference manual. If none present for
one of them feel free to add one for it. For formulas you can add LATEX code that will translate in the web
pages into images. You do this by using the math directive. A usage tip:
.. math::
MSE = \frac{1}{c
*
i
*
j} \sum{(I
_
1-I
_
2)^2}
That after build turns into:
MSE =
1
c i j
(I
1
I
2
)
2
You can even use it inline as :math: MSE = \frac{1}{c
*
i
*
j} \sum{(I
_
1-I
_
2)^2} that turns into
MSE =
1
cij
(I
1
I
2
)
2
. If you use some crazy LATEX library extension you need to add those to the
ones to use at build time. Look into the le:opencv/doc/conf.py conguration le for more information on this.
Results. Well, here depending on your program show one of more of the following: - Console outputs by using
the code block directive. - Output images. - Runtime videos, visualization. For this use your favorite screens
capture software. Camtasia Studio certainly is one of the better choices, however their prices are out of this
1.12. How to write a tutorial for OpenCV? 105
The OpenCV Tutorials, Release 2.4.3
world. CamStudio is a free alternative, but less powerful. If you do a video you can upload it to YouTube and
then use the raw directive with HTML option to embed it into the generated web page:
You may observe a runtime instance of this on the YouTube here <https://www.youtube.com/watch?v=jpBwHxsl1
_
0>
_
.
.. raw:: html
<div align="center">
<iframe title="Creating a video with OpenCV" width="560" height="349" src="http://www.youtube.com/embed/jpBwHxsl1
_
0?rel=0&loop=1" frameborder="0" allowfullscreen align="middle"></iframe>
</div>
This results in the text and video: You may observe a runtime instance of this on the YouTube here.
When these arent self-explanatory make sure to throw in a few guiding lines about what and why we can see.
Build the documentation and check for errors or warnings. In the CMake make sure you check or pass the option
for building documentation. Then simply build the docs project for the PDF le and the docs_html project for
the web page. Read the output of the build and check for errors/warnings for what you have added. This is also
the time to observe and correct any kind of not so good looking parts. Remember to keep clean our build logs.
Read again your tutorial and check for both programming and spelling errors. If found any, please correct them.
Take home the pride and joy of a job well done!
Once you are done contact me or dr. Gary Bradski with the tutorial. We may submit the tutorial ourselves to the
trunk branch of our repository or ask you to do so. Now, to see your work live you may need to wait some time. The
PDFs are updated usually at the launch of a new OpenCV version. The web pages are a little more diverse. They are
automatically rebuilt in each evening. However, the opencv.itseez.com website contains only the most recent stable
branch of OpenCV. Currently this is 2.3. When we add something new (like a tutorial) that rst goes to the trunk
branch of our repository. A build of this you may nd on the opencv.itseez.com/trunk website. Although, we try to
make a build every night occasionally we might freeze any of the branches to x upcoming issues. During this it may
take a little longer to see your work live, however if you submited it, be sure that eventually it will show up. If you
have any questions or advices relating to this tutorial you can contact me at [email protected].
Of course, delete the -delete- parts of that e-mail address.
106 Chapter 1. Introduction to OpenCV
CHAPTER
TWO
CORE MODULE. THE CORE
FUNCTIONALITY
Here you will learn the about the basic building blocks of the library. A must read and know for understanding how to
manipulate the images on a pixel level.
Title: How to scan images, lookup tables and time measurement with
OpenCV
Compatibility: > OpenCV 2.0
Author: Bernt Gbor
Youll nd out how to scan images (go through each of the image pixels)
with OpenCV. Bonus: time measurement with OpenCV.
Title: File Input and Output using XML and YAML les
Compatibility: > OpenCV 2.0
Author: Bernt Gbor
You will see how to use the FileStorage data structure of OpenCV to write
and read data to XML or YAML le format.
i=0
N1
j=0
f(i, j)e
i2(
ki
N
+
lj
N
)
e
ix
= cos x +i sin x
Here f is the image value in its spatial domain and F in its frequency domain. The result of the transformation is
complex numbers. Displaying this is possible either via a real image and a complex image or via a magnitude and a
phase image. However, throughout the image processing algorithms only the magnitude image is interesting as this
contains all the information we need about the images geometric structure. Nevertheless, if you intend to make some
modications of the image in these forms and then you need to retransform it youll need to preserve both of these.
In this sample Ill show how to calculate and show the magnitude image of a Fourier Transform. In case of digital
images are discrete. This means they may take up a value from a given domain value. For example in a basic gray
scale image values usually are between zero and 255. Therefore the Fourier Transform too needs to be of a discrete
type resulting in a Discrete Fourier Transform (DFT). Youll want to use this whenever you need to determine the
structure of an image from a geometrical point of view. Here are the steps to follow (in case of a gray scale input
image I):
1. Expand the image to an optimal size. The performance of a DFT is dependent of the image size. It tends to be
the fastest for image sizes that are multiple of the numbers two, three and ve. Therefore, to achieve maximal
performance it is generally a good idea to pad border values to the image to get a size with such traits. The
getOptimalDFTSize() returns this optimal size and we can use the copyMakeBorder() function to expand the
borders of an image:
Mat padded; //expand input image to optimal size
int m = getOptimalDFTSize( I.rows );
int n = getOptimalDFTSize( I.cols ); // on the border add zero pixels
copyMakeBorder(I, padded, 0, m - I.rows, 0, n - I.cols, BORDER
_
CONSTANT, Scalar::all(0));
The appended pixels are initialized with zero.
2. Make place for both the complex and the real values. The result of a Fourier Transform is complex. This
implies that for each image value the result is two image values (one per component). Moreover, the frequency
2.8. Discrete Fourier Transform 139
The OpenCV Tutorials, Release 2.4.3
domains range is much larger than its spatial counterpart. Therefore, we store these usually at least in a oat
format. Therefore well convert our input image to this type and expand it with another channel to hold the
complex values:
Mat planes[] = {Mat
_
<float>(padded), Mat::zeros(padded.size(), CV
_
32F)};
Mat complexI;
merge(planes, 2, complexI); // Add to the expanded another plane with zeros
3. Make the Discrete Fourier Transform. Its possible an in-place calculation (same input as output):
dft(complexI, complexI); // this way the result may fit in the source matrix
4. Transform the real and complex values to magnitude. A complex number has a real (Re) and a complex
(imaginary - Im) part. The results of a DFT are complex numbers. The magnitude of a DFT is:
M =
2
_
Re(DFT(I))
2
+Im(DFT(I))
2
Translated to OpenCV code:
split(complexI, planes); // planes[0] = Re(DFT(I), planes[1] = Im(DFT(I))
magnitude(planes[0], planes[1], planes[0]);// planes[0] = magnitude
Mat magI = planes[0];
5. Switch to a logarithmic scale. It turns out that the dynamic range of the Fourier coefcients is too large to be
displayed on the screen. We have some small and some high changing values that we cant observe like this.
Therefore the high values will all turn out as white points, while the small ones as black. To use the gray scale
values to for visualization we can transform our linear scale to a logarithmic one:
M
1
= log (1 +M)
Translated to OpenCV code:
magI += Scalar::all(1); // switch to logarithmic scale
log(magI, magI);
6. Crop and rearrange. Remember, that at the rst step, we expanded the image? Well, its time to throw away
the newly introduced values. For visualization purposes we may also rearrange the quadrants of the result, so
that the origin (zero, zero) corresponds with the image center.
magI = magI(Rect(0, 0, magI.cols & -2, magI.rows & -2));
int cx = magI.cols/2;
int cy = magI.rows/2;
Mat q0(magI, Rect(0, 0, cx, cy)); // Top-Left - Create a ROI per quadrant
Mat q1(magI, Rect(cx, 0, cx, cy)); // Top-Right
Mat q2(magI, Rect(0, cy, cx, cy)); // Bottom-Left
Mat q3(magI, Rect(cx, cy, cx, cy)); // Bottom-Right
Mat tmp; // swap quadrants (Top-Left with Bottom-Right)
q0.copyTo(tmp);
q3.copyTo(q0);
tmp.copyTo(q3);
q1.copyTo(tmp); // swap quadrant (Top-Right with Bottom-Left)
q2.copyTo(q1);
tmp.copyTo(q2);
7. Normalize. This is done again for visualization purposes. We now have the magnitudes, however this are still
out of our image display range of zero to one. We normalize our values to this range using the normalize()
function.
140 Chapter 2. core module. The Core Functionality
The OpenCV Tutorials, Release 2.4.3
normalize(magI, magI, 0, 1, CV
_
MINMAX); // Transform the matrix with float values into a
// viewable image form (float between values 0 and 1).
Result
An application idea would be to determine the geometrical orientation present in the image. For example, let us nd
out if a text is horizontal or not? Looking at some text youll notice that the text lines sort of form also horizontal lines
and the letters form sort of vertical lines. These two main components of a text snippet may be also seen in case of the
Fourier transform. Let us use this horizontal and this rotated image about a text.
In case of the horizontal text:
In case of a rotated text:
You can see that the most inuential components of the frequency domain (brightest dots on the magnitude image)
follow the geometric rotation of objects on the image. From this we may calculate the offset and perform an image
rotation to correct eventual miss alignments.
2.9 File Input and Output using XML and YAML les
Goal
Youll nd answers for the following questions:
How to print and read text entries to a le and OpenCV using YAML or XML les?
How to do the same for OpenCV data structures?
How to do this for your data structures?
Usage of OpenCV data structures such as FileStorage, FileNode or FileNodeIterator.
2.9. File Input and Output using XML and YAML les 141
The OpenCV Tutorials, Release 2.4.3
Source code
You can download this from here or nd it in the samples/cpp/tutorial
_
code/core/file
_
input
_
output/file
_
input
_
output.cpp
of the OpenCV source code library.
Heres a sample code of how to achieve all the stuff enumerated at the goal list.
1 #include <opencv2/core/core.hpp>
2 #include <iostream>
3 #include <string>
4
5 using namespace cv;
6 using namespace std;
7
8 class MyData
9 {
10 public:
11 MyData() : A(0), X(0), id()
12 {}
13 explicit MyData(int) : A(97), X(CV
_
PI), id("mydata1234") // explicit to avoid implicit conversion
14 {}
15 void write(FileStorage& fs) const //Write serialization for this class
16 {
17 fs << "{" << "A" << A << "X" << X << "id" << id << "}";
18 }
19 void read(const FileNode& node) //Read serialization for this class
20 {
21 A = (int)node["A"];
22 X = (double)node["X"];
23 id = (string)node["id"];
24 }
25 public: // Data Members
26 int A;
27 double X;
28 string id;
29 };
30
31 //These write and read functions must be defined for the serialization in FileStorage to work
32 void write(FileStorage& fs, const std::string&, const MyData& x)
33 {
34 x.write(fs);
35 }
36 void read(const FileNode& node, MyData& x, const MyData& default
_
value = MyData()){
37 if(node.empty())
38 x = default
_
value;
39 else
40 x.read(node);
41 }
42
43 // This function will print our custom class to the console
44 ostream& operator<<(ostream& out, const MyData& m)
45 {
46 out << "{ id = " << m.id << ", ";
47 out << "X = " << m.X << ", ";
48 out << "A = " << m.A << "}";
49 return out;
50 }
51
52 int main(int ac, char
**
av)
142 Chapter 2. core module. The Core Functionality
The OpenCV Tutorials, Release 2.4.3
53 {
54 if (ac != 2)
55 {
56 help(av);
57 return 1;
58 }
59
60 string filename = av[1];
61 { //write
62 Mat R = Mat
_
<uchar>::eye(3, 3),
63 T = Mat
_
<double>::zeros(3, 1);
64 MyData m(1);
65
66 FileStorage fs(filename, FileStorage::WRITE);
67
68 fs << "iterationNr" << 100;
69 fs << "strings" << "["; // text - string sequence
70 fs << "image1.jpg" << "Awesomeness" << "baboon.jpg";
71 fs << "]"; // close sequence
72
73 fs << "Mapping"; // text - mapping
74 fs << "{" << "One" << 1;
75 fs << "Two" << 2 << "}";
76
77 fs << "R" << R; // cv::Mat
78 fs << "T" << T;
79
80 fs << "MyData" << m; // your own data structures
81
82 fs.release(); // explicit close
83 cout << "Write Done." << endl;
84 }
85
86 {//read
87 cout << endl << "Reading: " << endl;
88 FileStorage fs;
89 fs.open(filename, FileStorage::READ);
90
91 int itNr;
92 //fs["iterationNr"] >> itNr;
93 itNr = (int) fs["iterationNr"];
94 cout << itNr;
95 if (!fs.isOpened())
96 {
97 cerr << "Failed to open " << filename << endl;
98 help(av);
99 return 1;
100 }
101
102 FileNode n = fs["strings"]; // Read string sequence - Get node
103 if (n.type() != FileNode::SEQ)
104 {
105 cerr << "strings is not a sequence! FAIL" << endl;
106 return 1;
107 }
108
109 FileNodeIterator it = n.begin(), it
_
end = n.end(); // Go through the node
110 for (; it != it
_
end; ++it)
2.9. File Input and Output using XML and YAML les 143
The OpenCV Tutorials, Release 2.4.3
111 cout << (string)
*
it << endl;
112
113
114 n = fs["Mapping"]; // Read mappings from a sequence
115 cout << "Two " << (int)(n["Two"]) << "; ";
116 cout << "One " << (int)(n["One"]) << endl << endl;
117
118
119 MyData m;
120 Mat R, T;
121
122 fs["R"] >> R; // Read cv::Mat
123 fs["T"] >> T;
124 fs["MyData"] >> m; // Read your own structure
_
125
126 cout << endl
127 << "R = " << R << endl;
128 cout << "T = " << T << endl << endl;
129 cout << "MyData = " << endl << m << endl << endl;
130
131 //Show default behavior for non existing nodes
132 cout << "Attempt to read NonExisting (should initialize the data structure with its default).";
133 fs["NonExisting"] >> m;
134 cout << endl << "NonExisting = " << endl << m << endl;
135 }
136
137 cout << endl
138 << "Tip: Open up " << filename << " with a text editor to see the serialized data." << endl;
139
140 return 0;
141 }
Explanation
Here we talk only about XML and YAML le inputs. Your output (and its respective input) le may have only one of
these extensions and the structure coming from this. They are two kinds of data structures you may serialize: mappings
(like the STL map) and element sequence (like the STL vector>. The difference between these is that in a map every
element has a unique name through what you may access it. For sequences you need to go through them to query a
specic item.
1. XML\YAML File Open and Close. Before you write any content to such le you need to open it and at the
end to close it. The XMLYAML data structure in OpenCV is FileStorage. To specify that this structure to which
le binds on your hard drive you can use either its constructor or the open() function of this:
string filename = "I.xml";
FileStorage fs(filename, FileStorage::WRITE);
\\...
fs.open(filename, FileStorage::READ);
Either one of this you use the second argument is a constant specifying the type of operations youll be able to
on them: WRITE, READ or APPEND. The extension specied in the le name also determinates the output
format that will be used. The output may be even compressed if you specify an extension such as .xml.gz.
The le automatically closes when the FileStorage objects is destroyed. However, you may explicitly call for
this by using the release function:
144 Chapter 2. core module. The Core Functionality
The OpenCV Tutorials, Release 2.4.3
fs.release(); // explicit close
2. Input and Output of text and numbers. The data structure uses the same << output operator that the STL
library. For outputting any type of data structure we need rst to specify its name. We do this by just simply
printing out the name of this. For basic types you may follow this with the print of the value :
fs << "iterationNr" << 100;
Reading in is a simple addressing (via the [] operator) and casting operation or a read via the >> operator :
int itNr;
fs["iterationNr"] >> itNr;
itNr = (int) fs["iterationNr"];
3. Input\Output of OpenCV Data structures. Well these behave exactly just as the basic C++ types:
Mat R = Mat
_
<uchar >::eye (3, 3),
T = Mat
_
<double>::zeros(3, 1);
fs << "R" << R; // Write cv::Mat
fs << "T" << T;
fs["R"] >> R; // Read cv::Mat
fs["T"] >> T;
4. Input\Output of vectors (arrays) and associative maps. As I mentioned beforehand we can output maps and
sequences (array, vector) too. Again we rst print the name of the variable and then we have to specify if our
output is either a sequence or map.
For sequence before the rst element print the [ character and after the last one the ] character:
fs << "strings" << "["; // text - string sequence
fs << "image1.jpg" << "Awesomeness" << "baboon.jpg";
fs << "]"; // close sequence
For maps the drill is the same however now we use the { and } delimiter characters:
fs << "Mapping"; // text - mapping
fs << "{" << "One" << 1;
fs << "Two" << 2 << "}";
To read from these we use the FileNode and the FileNodeIterator data structures. The [] operator of the FileStor-
age class returns a FileNode data type. If the node is sequential we can use the FileNodeIterator to iterate through
the items:
FileNode n = fs["strings"]; // Read string sequence - Get node
if (n.type() != FileNode::SEQ)
{
cerr << "strings is not a sequence! FAIL" << endl;
return 1;
}
FileNodeIterator it = n.begin(), it
_
end = n.end(); // Go through the node
for (; it != it
_
end; ++it)
cout << (string)
*
it << endl;
For maps you can use the [] operator again to acces the given item (or the >> operator too):
2.9. File Input and Output using XML and YAML les 145
The OpenCV Tutorials, Release 2.4.3
n = fs["Mapping"]; // Read mappings from a sequence
cout << "Two " << (int)(n["Two"]) << "; ";
cout << "One " << (int)(n["One"]) << endl << endl;
5. Read and write your own data structures. Suppose you have a data structure such as:
class MyData
{
public:
MyData() : A(0), X(0), id() {}
public: // Data Members
int A;
double X;
string id;
};
Its possible to serialize this through the OpenCV I/O XML/YAML interface (just as in case of the OpenCV data
structures) by adding a read and a write function inside and outside of your class. For the inside part:
void write(FileStorage& fs) const //Write serialization for this class
{
fs << "{" << "A" << A << "X" << X << "id" << id << "}";
}
void read(const FileNode& node) //Read serialization for this class
{
A = (int)node["A"];
X = (double)node["X"];
id = (string)node["id"];
}
Then you need to add the following functions denitions outside the class:
void write(FileStorage& fs, const std::string&, const MyData& x)
{
x.write(fs);
}
void read(const FileNode& node, MyData& x, const MyData& default
_
value = MyData())
{
if(node.empty())
x = default
_
value;
else
x.read(node);
}
Here you can observe that in the read section we dened what happens if the user tries to read a non-existing
node. In this case we just return the default initialization value, however a more verbose solution would be to
return for instance a minus one value for an object ID.
Once you added these four functions use the >> operator for write and the << operator for read:
MyData m(1);
fs << "MyData" << m; // your own data structures
fs["MyData"] >> m; // Read your own structure
_
Or to try out reading a non-existing read:
fs["NonExisting"] >> m; // Do not add a fs << "NonExisting" << m command for this to work
cout << endl << "NonExisting = " << endl << m << endl;
146 Chapter 2. core module. The Core Functionality
The OpenCV Tutorials, Release 2.4.3
Result
Well mostly we just print out the dened numbers. On the screen of your console you could see:
Write Done.
Reading:
100image1.jpg
Awesomeness
baboon.jpg
Two 2; One 1
R = [1, 0, 0;
0, 1, 0;
0, 0, 1]
T = [0; 0; 0]
MyData =
{ id = mydata1234, X = 3.14159, A = 97}
Attempt to read NonExisting (should initialize the data structure with its default).
NonExisting =
{ id = , X = 0, A = 0}
Tip: Open up output.xml with a text editor to see the serialized data.
Nevertheless, its much more interesting what you may see in the output xml le:
<?xml version="1.0"?>
<opencv
_
storage>
<iterationNr>100</iterationNr>
<strings>
image1.jpg Awesomeness baboon.jpg</strings>
<Mapping>
<One>1</One>
<Two>2</Two></Mapping>
<R type
_
id="opencv-matrix">
<rows>3</rows>
<cols>3</cols>
<dt>u</dt>
<data>
1 0 0 0 1 0 0 0 1</data></R>
<T type
_
id="opencv-matrix">
<rows>3</rows>
<cols>1</cols>
<dt>d</dt>
<data>
0. 0. 0.</data></T>
<MyData>
<A>97</A>
<X>3.1415926535897931e+000</X>
<id>mydata1234</id></MyData>
</opencv
_
storage>
Or the YAML le:
%YAML:1.0
iterationNr: 100
2.9. File Input and Output using XML and YAML les 147
The OpenCV Tutorials, Release 2.4.3
strings:
- "image1.jpg"
- Awesomeness
- "baboon.jpg"
Mapping:
One: 1
Two: 2
R: !!opencv-matrix
rows: 3
cols: 3
dt: u
data: [ 1, 0, 0, 0, 1, 0, 0, 0, 1 ]
T: !!opencv-matrix
rows: 3
cols: 1
dt: d
data: [ 0., 0., 0. ]
MyData:
A: 97
X: 3.1415926535897931e+000
id: mydata1234
You may observe a runtime instance of this on the YouTube here .
2.10 Interoperability with OpenCV 1
Goal
For the OpenCV developer team its important to constantly improve the library. We are constantly thinking about
methods that will ease your work process, while still maintain the libraries exibility. The new C++ interface is a
development of us that serves this goal. Nevertheless, backward compatibility remains important. We do not want
to break your code written for earlier version of the OpenCV library. Therefore, we made sure that we add some
functions that deal with this. In the following youll learn:
What changed with the version 2 of OpenCV in the way you use the library compared to its rst version
How to add some Gaussian noise to an image
What are lookup tables and why use them?
General
When making the switch you rst need to learn some about the new data structure for images: Mat - The Basic Image
Container, this replaces the old CvMat and IplImage ones. Switching to the new functions is easier. You just need to
remember a couple of new things.
OpenCV 2 received reorganization. No longer are all the functions crammed into a single library. We have many
modules, each of them containing data structures and functions relevant to certain tasks. This way you do not need to
ship a large library if you use just a subset of OpenCV. This means that you should also include only those headers
you will use. For example:
#include <opencv2/core/core.hpp>
#include <opencv2/imgproc/imgproc.hpp>
#include <opencv2/highgui/highgui.hpp>
148 Chapter 2. core module. The Core Functionality
The OpenCV Tutorials, Release 2.4.3
All the OpenCV related stuff is put into the cv namespace to avoid name conicts with other libraries data structures
and functions. Therefore, either you need to prepend the cv:: keyword before everything that comes from OpenCV or
after the includes, you just add a directive to use this:
using namespace cv; // The new C++ interface API is inside this namespace. Import it.
Because the functions are already in a namespace there is no need for them to contain the cv prex in their name.
As such all the new C++ compatible functions dont have this and they follow the camel case naming rule. This
means the rst letter is small (unless its a name, like Canny) and the subsequent words start with a capital letter (like
copyMakeBorder).
Now, remember that you need to link to your application all the modules you use, and in case you are on Windows
using the DLL system you will need to add, again, to the path all the binaries. For more in-depth information if youre
on Windows read How to build applications with OpenCV inside the Microsoft Visual Studio and for Linux an example
usage is explained in Using OpenCV with Eclipse (plugin CDT).
Now for converting the Mat object you can use either the IplImage or the CvMat operators. While in the C interface
you used to work with pointers here its no longer the case. In the C++ interface we have mostly Mat objects. These
objects may be freely converted to both IplImage and CvMat with simple assignment. For example:
Mat I;
IplImage pI = I;
CvMat mI = I;
Now if you want pointers the conversion gets just a little more complicated. The compilers can no longer automatically
determinate what you want and as you need to explicitly specify your goal. This is to call the IplImage and CvMat
operators and then get their pointers. For getting the pointer we use the & sign:
Mat I;
IplImage
*
pI = &I.operator IplImage();
CvMat
*
mI = &I.operator CvMat();
One of the biggest complaints of the C interface is that it leaves all the memory management to you. You need to gure
out when it is safe to release your unused objects and make sure you do so before the program nishes or you could
have troublesome memory leeks. To work around this issue in OpenCV there is introduced a sort of smart pointer.
This will automatically release the object when its no longer in use. To use this declare the pointers as a specialization
of the Ptr :
Ptr<IplImage> piI = &I.operator IplImage();
Converting from the C data structures to the Mat is done by passing these inside its constructor. For example:
Mat K(piL), L;
L = Mat(pI);
A case study
Now that you have the basics done heres an example that mixes the usage of the C interface with
the C++ one. You will also nd it in the sample directory of the OpenCV source code library at the
samples/cpp/tutorial
_
code/core/interoperability
_
with
_
OpenCV
_
1/interoperability
_
with
_
OpenCV
_
1.cpp
. To further help on seeing the difference the programs supports two modes: one mixed C and C++ and one pure C++.
If you dene the DEMO_MIXED_API_USE youll end up using the rst. The program separates the color planes,
does some modications on them and in the end merge them back together.
1 #include <stdio.h>
2 #include <iostream>
3
2.10. Interoperability with OpenCV 1 149
The OpenCV Tutorials, Release 2.4.3
4 #include <opencv2/core/core.hpp>
5 #include <opencv2/imgproc/imgproc.hpp>
6 #include <opencv2/highgui/highgui.hpp>
7
8 using namespace cv; // The new C++ interface API is inside this namespace. Import it.
9 using namespace std;
10 #define DEMO
_
MIXED
_
API
_
USE
11
12 int main( int argc, char
**
argv )
13 {
14 const char
*
imagename = argc > 1 ? argv[1] : "lena.jpg";
15
16 #ifdef DEMO
_
MIXED
_
API
_
USE
17 Ptr<IplImage> IplI = cvLoadImage(imagename); // Ptr<T> is safe ref-counting pointer class
18 if(IplI.empty())
19 {
20 cerr << "Can not load image " << imagename << endl;
21 return -1;
22 }
23 Mat I(IplI); // Convert to the new style container. Only header created. Image not copied.
24 #else
25 Mat I = imread(imagename); // the newer cvLoadImage alternative, MATLAB-style function
26 if( I.empty() ) // same as if( !I.data )
27 {
28 cerr << "Can not load image " << imagename << endl;
29 return -1;
30 }
31 #endif
Here you can observe that with the new structure we have no pointer problems, although it is possible to use the old
functions and in the end just transform the result to a Mat object.
1 // convert image to YUV color space. The output image will be created automatically.
2 Mat I
_
YUV;
3 cvtColor(I, I
_
YUV, CV
_
BGR2YCrCb);
4
5 vector<Mat> planes; // Use the STLs vector structure to store multiple Mat objects
6 split(I
_
YUV, planes); // split the image into separate color planes (Y U V)
Because, we want to mess around with the images luma component we rst convert from the default RGB to the YUV
color space and then split the result up into separate planes. Here the program splits: in the rst example it processes
each plane using one of the three major image scanning algorithms in OpenCV (C [] operator, iterator, individual
element access). In a second variant we add to the image some Gaussian noise and then mix together the channels
according to some formula.
The scanning version looks like:
1 // Method 1. process Y plane using an iterator
2 MatIterator
_
<uchar> it = planes[0].begin<uchar>(), it
_
end = planes[0].end<uchar>();
3 for(; it != it
_
end; ++it)
4 {
5 double v =
*
it
*
1.7 + rand()%21 - 10;
6
*
it = saturate
_
cast<uchar>(v
*
v/255);
7 }
8
9 for( int y = 0; y < I
_
YUV.rows; y++ )
10 {
11 // Method 2. process the first chroma plane using pre-stored row pointer.
12 uchar
*
Uptr = planes[1].ptr<uchar>(y);
150 Chapter 2. core module. The Core Functionality
The OpenCV Tutorials, Release 2.4.3
13 for( int x = 0; x < I
_
YUV.cols; x++ )
14 {
15 Uptr[x] = saturate
_
cast<uchar>((Uptr[x]-128)/2 + 128);
16
17 // Method 3. process the second chroma plane using individual element access
18 uchar& Vxy = planes[2].at<uchar>(y, x);
19 Vxy = saturate
_
cast<uchar>((Vxy-128)/2 + 128);
20 }
21 }
Here you can observe that we may go through all the pixels of an image in three fashions: an iterator, a C pointer
and an individual element access style. You can read a more in-depth description of these in the How to scan images,
lookup tables and time measurement with OpenCV tutorial. Converting from the old function names is easy. Just
remove the cv prex and use the new Mat data structure. Heres an example of this by using the weighted addition
function:
1 Mat noisyI(I.size(), CV
_
8U); // Create a matrix of the specified size and type
2
3 // Fills the matrix with normally distributed random values (around number with deviation off).
4 // There is also randu() for uniformly distributed random number generation
5 randn(noisyI, Scalar::all(128), Scalar::all(20));
6
7 // blur the noisyI a bit, kernel size is 3x3 and both sigmas are set to 0.5
8 GaussianBlur(noisyI, noisyI, Size(3, 3), 0.5, 0.5);
9
10 const double brightness
_
gain = 0;
11 const double contrast
_
gain = 1.7;
12
13 #ifdef DEMO
_
MIXED
_
API
_
USE
14 // To pass the new matrices to the functions that only work with IplImage or CvMat do:
15 // step 1) Convert the headers (tip: data will not be copied).
16 // step 2) call the function (tip: to pass a pointer do not forget unary "&" to form pointers)
17
18 IplImage cv
_
planes
_
0 = planes[0], cv
_
noise = noisyI;
19 cvAddWeighted(&cv
_
planes
_
0, contrast
_
gain, &cv
_
noise, 1, -128 + brightness
_
gain, &cv
_
planes
_
0);
20 #else
21 addWeighted(planes[0], contrast
_
gain, noisyI, 1, -128 + brightness
_
gain, planes[0]);
22 #endif
23
24 const double color
_
scale = 0.5;
25 // Mat::convertTo() replaces cvConvertScale.
26 // One must explicitly specify the output matrix type (we keep it intact - planes[1].type())
27 planes[1].convertTo(planes[1], planes[1].type(), color
_
scale, 128
*
(1-color
_
scale));
28
29 // alternative form of cv::convertScale if we know the datatype at compile time ("uchar" here).
30 // This expression will not create any temporary arrays ( so should be almost as fast as above)
31 planes[2] = Mat
_
<uchar>(planes[2]
*
color
_
scale + 128
*
(1-color
_
scale));
32
33 // Mat::mul replaces cvMul(). Again, no temporary arrays are created in case of simple expressions.
34 planes[0] = planes[0].mul(planes[0], 1./255);
As you may observe the planes variable is of type Mat. However, converting from Mat to IplImage is easy and made
automatically with a simple assignment operator.
1 merge(planes, I
_
YUV); // now merge the results back
2 cvtColor(I
_
YUV, I, CV
_
YCrCb2BGR); // and produce the output RGB image
3
4
2.10. Interoperability with OpenCV 1 151
The OpenCV Tutorials, Release 2.4.3
5 namedWindow("image with grain", CV
_
WINDOW
_
AUTOSIZE); // use this to create images
6
7 #ifdef DEMO
_
MIXED
_
API
_
USE
8 // this is to demonstrate that I and IplI really share the data - the result of the above
9 // processing is stored in I and thus in IplI too.
10 cvShowImage("image with grain", IplI);
11 #else
12 imshow("image with grain", I); // the new MATLAB style function show
The new imshow highgui function accepts both the Mat and IplImage data structures. Compile and run the program
and if the rst image below is your input you may get either the rst or second as output:
You may observe a runtime instance of this on the YouTube here and you can download the source code from
here or nd it in the samples/cpp/tutorial
_
code/core/interoperability
_
with
_
OpenCV
_
1/interoperability
_
with
_
OpenCV
_
1.cpp
of the OpenCV source code library.
152 Chapter 2. core module. The Core Functionality
CHAPTER
THREE
IMGPROC MODULE. IMAGE
PROCESSING
In this section you will learn about the image processing (manipulation) functions inside OpenCV.
Title: Remapping
Compatibility: > OpenCV 2.0
Author: Ana Huamn
Where we learn how to manipulate pixels locations
k,l
f(i +k, j +l)h(k, l)
h(k, l) is called the kernel, which is nothing more than the coefcients of the lter.
It helps to visualize a lter as a window of coefcients sliding across the image.
There are many kind of lters, here we will mention the most used:
Normalized Box Filter
This lter is the simplest of all! Each output pixel is the mean of its kernel neighbors ( all of them contribute
with equal weights)
The kernel is below:
K =
1
K
width
K
height
_
_
1 1 1 ... 1
1 1 1 ... 1
. . . ... 1
. . . ... 1
1 1 1 ... 1
_
_
158 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
Gaussian Filter
Probably the most useful lter (although not the fastest). Gaussian ltering is done by convolving each point in
the input array with a Gaussian kernel and then summing them all to produce the output array.
Just to make the picture clearer, remember how a 1D Gaussian kernel look like?
Assuming that an image is 1D, you can notice that the pixel located in the middle would have the biggest weight.
The weight of its neighbors decreases as the spatial distance between them and the center pixel increases.
Note: Remember that a 2D Gaussian can be represented as :
G
0
(x, y) = Ae
(x
x
)
2
2
2
x
+
(y
y
)
2
2
2
y
where is the mean (the peak) and represents the variance (per each of the variables x and y)
Median Filter
The median lter run through each element of the signal (in this case the image) and replace each pixel with the
median of its neighboring pixels (located in a square neighborhood around the evaluated pixel).
Bilateral Filter
So far, we have explained some lters which main goal is to smooth an input image. However, sometimes the
lters do not only dissolve the noise, but also smooth away the edges. To avoid this (at certain extent at least),
we can use a bilateral lter.
In an analogous way as the Gaussian lter, the bilateral lter also considers the neighboring pixels with weights
assigned to each of them. These weights have two components, the rst of which is the same weighting used by
the Gaussian lter. The second component takes into account the difference in intensity between the neighboring
pixels and the evaluated one.
For a more detailed explanation you can check this link
3.1. Smoothing Images 159
The OpenCV Tutorials, Release 2.4.3
Code
What does this program do?
Loads an image
Applies 4 different kinds of lters (explained in Theory) and show the ltered images sequentially
Downloadable code: Click here
Code at glance:
#include "opencv2/imgproc/imgproc.hpp"
#include "opencv2/highgui/highgui.hpp"
using namespace std;
using namespace cv;
/// Global Variables
int DELAY
_
CAPTION = 1500;
int DELAY
_
BLUR = 100;
int MAX
_
KERNEL
_
LENGTH = 31;
Mat src; Mat dst;
char window
_
name[] = "Filter Demo 1";
/// Function headers
int display
_
caption( char
*
caption );
int display
_
dst( int delay );
/
**
*
function main
*
/
int main( int argc, char
**
argv )
{
namedWindow( window
_
name, CV
_
WINDOW
_
AUTOSIZE );
/// Load the source image
src = imread( "../images/lena.jpg", 1 );
if( display
_
caption( "Original Image" ) != 0 ) { return 0; }
dst = src.clone();
if( display
_
dst( DELAY
_
CAPTION ) != 0 ) { return 0; }
/// Applying Homogeneous blur
if( display
_
caption( "Homogeneous Blur" ) != 0 ) { return 0; }
for ( int i = 1; i < MAX
_
KERNEL
_
LENGTH; i = i + 2 )
{ blur( src, dst, Size( i, i ), Point(-1,-1) );
if( display
_
dst( DELAY
_
BLUR ) != 0 ) { return 0; } }
/// Applying Gaussian blur
if( display
_
caption( "Gaussian Blur" ) != 0 ) { return 0; }
for ( int i = 1; i < MAX
_
KERNEL
_
LENGTH; i = i + 2 )
{ GaussianBlur( src, dst, Size( i, i ), 0, 0 );
if( display
_
dst( DELAY
_
BLUR ) != 0 ) { return 0; } }
/// Applying Median blur
160 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
if( display
_
caption( "Median Blur" ) != 0 ) { return 0; }
for ( int i = 1; i < MAX
_
KERNEL
_
LENGTH; i = i + 2 )
{ medianBlur ( src, dst, i );
if( display
_
dst( DELAY
_
BLUR ) != 0 ) { return 0; } }
/// Applying Bilateral Filter
if( display
_
caption( "Bilateral Blur" ) != 0 ) { return 0; }
for ( int i = 1; i < MAX
_
KERNEL
_
LENGTH; i = i + 2 )
{ bilateralFilter ( src, dst, i, i
*
2, i/2 );
if( display
_
dst( DELAY
_
BLUR ) != 0 ) { return 0; } }
/// Wait until user press a key
display
_
caption( "End: Press a key!" );
waitKey(0);
return 0;
}
int display
_
caption( char
*
caption )
{
dst = Mat::zeros( src.size(), src.type() );
putText( dst, caption,
Point( src.cols/4, src.rows/2),
CV
_
FONT
_
HERSHEY
_
COMPLEX, 1, Scalar(255, 255, 255) );
imshow( window
_
name, dst );
int c = waitKey( DELAY
_
CAPTION );
if( c >= 0 ) { return -1; }
return 0;
}
int display
_
dst( int delay )
{
imshow( window
_
name, dst );
int c = waitKey ( delay );
if( c >= 0 ) { return -1; }
return 0;
}
Explanation
1. Lets check the OpenCV functions that involve only the smoothing procedure, since the rest is already known
by now.
2. Normalized Block Filter:
OpenCV offers the function blur to perform smoothing with this lter.
for ( int i = 1; i < MAX
_
KERNEL
_
LENGTH; i = i + 2 )
{ blur( src, dst, Size( i, i ), Point(-1,-1) );
if( display
_
dst( DELAY
_
BLUR ) != 0 ) { return 0; } }
We specify 4 arguments (more details, check the Reference):
src: Source image
3.1. Smoothing Images 161
The OpenCV Tutorials, Release 2.4.3
dst: Destination image
Size( w,h ): Denes the size of the kernel to be used ( of width w pixels and height h pixels)
Point(-1, -1): Indicates where the anchor point (the pixel evaluated) is located with respect to the neigh-
borhood. If there is a negative value, then the center of the kernel is considered the anchor point.
3. Gaussian Filter:
It is performed by the function GaussianBlur :
for ( int i = 1; i < MAX
_
KERNEL
_
LENGTH; i = i + 2 )
{ GaussianBlur( src, dst, Size( i, i ), 0, 0 );
if( display
_
dst( DELAY
_
BLUR ) != 0 ) { return 0; } }
Here we use 4 arguments (more details, check the OpenCV reference):
src: Source image
dst: Destination image
Size(w, h): The size of the kernel to be used (the neighbors to be considered). w and h have to be odd and
positive numbers otherwise thi size will be calculated using the
x
and
y
arguments.
x
: The standard deviation in x. Writing 0 implies that
x
is calculated using kernel size.
y
: The standard deviation in y. Writing 0 implies that
y
is calculated using kernel size.
4. Median Filter:
This lter is provided by the medianBlur function:
for ( int i = 1; i < MAX
_
KERNEL
_
LENGTH; i = i + 2 )
{ medianBlur ( src, dst, i );
if( display
_
dst( DELAY
_
BLUR ) != 0 ) { return 0; } }
We use three arguments:
src: Source image
dst: Destination image, must be the same type as src
i: Size of the kernel (only one because we use a square window). Must be odd.
5. Bilateral Filter
Provided by OpenCV function bilateralFilter
for ( int i = 1; i < MAX
_
KERNEL
_
LENGTH; i = i + 2 )
{ bilateralFilter ( src, dst, i, i
*
2, i/2 );
if( display
_
dst( DELAY
_
BLUR ) != 0 ) { return 0; } }
We use 5 arguments:
src: Source image
dst: Destination image
d: The diameter of each pixel neighborhood.
Color
: Standard deviation in the color space.
Space
: Standard deviation in the coordinate space (in pixel terms)
162 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
Results
The code opens an image (in this case lena.jpg) and display it under the effects of the 4 lters explained.
Here is a snapshot of the image smoothed using medianBlur:
3.2 Eroding and Dilating
Goal
In this tutorial you will learn how to:
Apply two very common morphology operators: Dilation and Erosion. For this purpose, you will use the
following OpenCV functions:
erode
dilate
Cool Theory
Note: The explanation below belongs to the book Learning OpenCV by Bradski and Kaehler.
3.2. Eroding and Dilating 163
The OpenCV Tutorials, Release 2.4.3
Morphological Operations
In short: A set of operations that process images based on shapes. Morphological operations apply a structuring
element to an input image and generate an output image.
The most basic morphological operations are two: Erosion and Dilation. They have a wide array of uses, i.e. :
Removing noise
Isolation of individual elements and joining disparate elements in an image.
Finding of intensity bumps or holes in an image
We will explain dilation and erosion briey, using the following image as an example:
Dilation
This operations consists of convoluting an image A with some kernel (B), which can have any shape or size,
usually a square or circle.
The kernel B has a dened anchor point, usually being the center of the kernel.
As the kernel B is scanned over the image, we compute the maximal pixel value overlapped by B and replace the
image pixel in the anchor point position with that maximal value. As you can deduce, this maximizing operation
causes bright regions within an image to grow (therefore the name dilation). Take as an example the image
above. Applying dilation we can get:
The background (bright) dilates around the black regions of the letter.
164 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
Erosion
This operation is the sister of dilation. What this does is to compute a local minimum over the area of the kernel.
As the kernel B is scanned over the image, we compute the minimal pixel value overlapped by B and replace
the image pixel under the anchor point with that minimal value.
Analagously to the example for dilation, we can apply the erosion operator to the original image (shown above).
You can see in the result below that the bright areas of the image (the background, apparently), get thinner,
whereas the dark zones (the writing( gets bigger.
Code
This tutorial codes is shown lines below. You can also download it from here
#include "opencv2/imgproc/imgproc.hpp"
#include "opencv2/highgui/highgui.hpp"
#include "highgui.h"
#include <stdlib.h>
#include <stdio.h>
using namespace cv;
/// Global variables
Mat src, erosion
_
dst, dilation
_
dst;
int erosion
_
elem = 0;
int erosion
_
size = 0;
int dilation
_
elem = 0;
int dilation
_
size = 0;
int const max
_
elem = 2;
int const max
_
kernel
_
size = 21;
/
**
Function Headers
*
/
void Erosion( int, void
*
);
void Dilation( int, void
*
);
/
**
@function main
*
/
int main( int argc, char
**
argv )
{
/// Load an image
src = imread( argv[1] );
3.2. Eroding and Dilating 165
The OpenCV Tutorials, Release 2.4.3
if( !src.data )
{ return -1; }
/// Create windows
namedWindow( "Erosion Demo", CV
_
WINDOW
_
AUTOSIZE );
namedWindow( "Dilation Demo", CV
_
WINDOW
_
AUTOSIZE );
cvMoveWindow( "Dilation Demo", src.cols, 0 );
/// Create Erosion Trackbar
createTrackbar( "Element:\n 0: Rect \n 1: Cross \n 2: Ellipse", "Erosion Demo",
&erosion
_
elem, max
_
elem,
Erosion );
createTrackbar( "Kernel size:\n 2n +1", "Erosion Demo",
&erosion
_
size, max
_
kernel
_
size,
Erosion );
/// Create Dilation Trackbar
createTrackbar( "Element:\n 0: Rect \n 1: Cross \n 2: Ellipse", "Dilation Demo",
&dilation
_
elem, max
_
elem,
Dilation );
createTrackbar( "Kernel size:\n 2n +1", "Dilation Demo",
&dilation
_
size, max
_
kernel
_
size,
Dilation );
/// Default start
Erosion( 0, 0 );
Dilation( 0, 0 );
waitKey(0);
return 0;
}
/
**
@function Erosion
*
/
void Erosion( int, void
*
)
{
int erosion
_
type;
if( erosion
_
elem == 0 ){ erosion
_
type = MORPH
_
RECT; }
else if( erosion
_
elem == 1 ){ erosion
_
type = MORPH
_
CROSS; }
else if( erosion
_
elem == 2) { erosion
_
type = MORPH
_
ELLIPSE; }
Mat element = getStructuringElement( erosion
_
type,
Size( 2
*
erosion
_
size + 1, 2
*
erosion
_
size+1 ),
Point( erosion
_
size, erosion
_
size ) );
/// Apply the erosion operation
erode( src, erosion
_
dst, element );
imshow( "Erosion Demo", erosion
_
dst );
}
/
**
@function Dilation
*
/
void Dilation( int, void
*
)
{
int dilation
_
type;
if( dilation
_
elem == 0 ){ dilation
_
type = MORPH
_
RECT; }
else if( dilation
_
elem == 1 ){ dilation
_
type = MORPH
_
CROSS; }
else if( dilation
_
elem == 2) { dilation
_
type = MORPH
_
ELLIPSE; }
166 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
Mat element = getStructuringElement( dilation
_
type,
Size( 2
*
dilation
_
size + 1, 2
*
dilation
_
size+1 ),
Point( dilation
_
size, dilation
_
size ) );
/// Apply the dilation operation
dilate( src, dilation
_
dst, element );
imshow( "Dilation Demo", dilation
_
dst );
}
Explanation
1. Most of the stuff shown is known by you (if you have any doubt, please refer to the tutorials in previous sections).
Lets check the general structure of the program:
Load an image (can be RGB or grayscale)
Create two windows (one for dilation output, the other for erosion)
Create a set of 02 Trackbars for each operation:
The rst trackbar Element returns either erosion_elem or dilation_elem
The second trackbar Kernel size return erosion_size or dilation_size for the corresponding opera-
tion.
Every time we move any slider, the users function Erosion or Dilation will be called and it will update
the output image based on the current trackbar values.
Lets analyze these two functions:
2. erosion:
/
**
@function Erosion
*
/
void Erosion( int, void
*
)
{
int erosion
_
type;
if( erosion
_
elem == 0 ){ erosion
_
type = MORPH
_
RECT; }
else if( erosion
_
elem == 1 ){ erosion
_
type = MORPH
_
CROSS; }
else if( erosion
_
elem == 2) { erosion
_
type = MORPH
_
ELLIPSE; }
Mat element = getStructuringElement( erosion
_
type,
Size( 2
*
erosion
_
size + 1, 2
*
erosion
_
size+1 ),
Point( erosion
_
size, erosion
_
size ) );
/// Apply the erosion operation
erode( src, erosion
_
dst, element );
imshow( "Erosion Demo", erosion
_
dst );
}
The function that performs the erosion operation is erode. As we can see, it receives three arguments:
src: The source image
erosion_dst: The output image
element: This is the kernel we will use to perform the operation. If we do not specify, the default
is a simple 3x3 matrix. Otherwise, we can specify its shape. For this, we need to use the function
getStructuringElement:
Mat element = getStructuringElement( erosion
_
type,
Size( 2
*
erosion
_
size + 1, 2
*
erosion
_
size+1 ),
Point( erosion
_
size, erosion
_
size ) );
3.2. Eroding and Dilating 167
The OpenCV Tutorials, Release 2.4.3
We can choose any of three shapes for our kernel:
*
Rectangular box: MORPH_RECT
*
Cross: MORPH_CROSS
*
Ellipse: MORPH_ELLIPSE
Then, we just have to specify the size of our kernel and the anchor point. If not specied, it is assumed
to be in the center.
That is all. We are ready to perform the erosion of our image.
Note: Additionally, there is another parameter that allows you to perform multiple erosions (iterations) at once.
We are not using it in this simple tutorial, though. You can check out the Reference for more details.
3. dilation:
The code is below. As you can see, it is completely similar to the snippet of code for erosion. Here we also have the
option of dening our kernel, its anchor point and the size of the operator to be used.
/
**
@function Dilation
*
/
void Dilation( int, void
*
)
{
int dilation
_
type;
if( dilation
_
elem == 0 ){ dilation
_
type = MORPH
_
RECT; }
else if( dilation
_
elem == 1 ){ dilation
_
type = MORPH
_
CROSS; }
else if( dilation
_
elem == 2) { dilation
_
type = MORPH
_
ELLIPSE; }
Mat element = getStructuringElement( dilation
_
type,
Size( 2
*
dilation
_
size + 1, 2
*
dilation
_
size+1 ),
Point( dilation
_
size, dilation
_
size ) );
/// Apply the dilation operation
dilate( src, dilation
_
dst, element );
imshow( "Dilation Demo", dilation
_
dst );
}
Results
Compile the code above and execute it with an image as argument. For instance, using this image:
168 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
We get the results below. Varying the indices in the Trackbars give different output images, naturally. Try them
out! You can even try to add a third Trackbar to control the number of iterations.
3.3 More Morphology Transformations
Goal
In this tutorial you will learn how to:
Use the OpenCV function morphologyEx to apply Morphological Transformation such as:
Opening
Closing
3.3. More Morphology Transformations 169
The OpenCV Tutorials, Release 2.4.3
Morphological Gradient
Top Hat
Black Hat
Theory
Note: The explanation below belongs to the book Learning OpenCV by Bradski and Kaehler.
In the previous tutorial we covered two basic Morphology operations:
Erosion
Dilation.
Based on these two we can effectuate more sophisticated transformations to our images. Here we discuss briey 05
operations offered by OpenCV:
Opening
It is obtained by the erosion of an image followed by a dilation.
dst = open(src, element) = dilate(erode(src, element))
Useful for removing small objects (it is assumed that the objects are bright on a dark foreground)
For instance, check out the example below. The image at the left is the original and the image at the right is
the result after applying the opening transformation. We can observe that the small spaces in the corners of the
letter tend to dissapear.
Closing
It is obtained by the dilation of an image followed by an erosion.
dst = close(src, element) = erode(dilate(src, element))
Useful to remove small holes (dark regions).
170 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
Morphological Gradient
It is the difference between the dilation and the erosion of an image.
dst = morph
grad
(src, element) = dilate(src, element) erode(src, element)
It is useful for nding the outline of an object as can be seen below:
Top Hat
It is the difference between an input image and its opening.
dst = tophat(src, element) = src open(src, element)
3.3. More Morphology Transformations 171
The OpenCV Tutorials, Release 2.4.3
Black Hat
It is the difference between the closing and its input image
dst = blackhat(src, element) = close(src, element) src
Code
This tutorial codes is shown lines below. You can also download it from here
#include "opencv2/imgproc/imgproc.hpp"
#include "opencv2/highgui/highgui.hpp"
#include <stdlib.h>
#include <stdio.h>
using namespace cv;
/// Global variables
Mat src, dst;
int morph
_
elem = 0;
int morph
_
size = 0;
int morph
_
operator = 0;
int const max
_
operator = 4;
int const max
_
elem = 2;
int const max
_
kernel
_
size = 21;
char
*
window
_
name = "Morphology Transformations Demo";
/
**
Function Headers
*
/
void Morphology
_
Operations( int, void
*
);
/
**
@function main
*
/
int main( int argc, char
**
argv )
{
/// Load an image
src = imread( argv[1] );
if( !src.data )
{ return -1; }
/// Create window
namedWindow( window
_
name, CV
_
WINDOW
_
AUTOSIZE );
172 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
/// Create Trackbar to select Morphology operation
createTrackbar("Operator:\n 0: Opening - 1: Closing \n 2: Gradient - 3: Top Hat \n 4: Black Hat", window
_
name, &morph
_
operator, max
_
operator, Morphology
_
Operations );
/// Create Trackbar to select kernel type
createTrackbar( "Element:\n 0: Rect - 1: Cross - 2: Ellipse", window
_
name,
&morph
_
elem, max
_
elem,
Morphology
_
Operations );
/// Create Trackbar to choose kernel size
createTrackbar( "Kernel size:\n 2n +1", window
_
name,
&morph
_
size, max
_
kernel
_
size,
Morphology
_
Operations );
/// Default start
Morphology
_
Operations( 0, 0 );
waitKey(0);
return 0;
}
/
**
*
@function Morphology
_
Operations
*
/
void Morphology
_
Operations( int, void
*
)
{
// Since MORPH
_
X : 2,3,4,5 and 6
int operation = morph
_
operator + 2;
Mat element = getStructuringElement( morph
_
elem, Size( 2
*
morph
_
size + 1, 2
*
morph
_
size+1 ), Point( morph
_
size, morph
_
size ) );
/// Apply the specified morphology operation
morphologyEx( src, dst, operation, element );
imshow( window
_
name, dst );
}
Explanation
1. Lets check the general structure of the program:
Load an image
Create a window to display results of the Morphological operations
Create 03 Trackbars for the user to enter parameters:
The rst trackbar Operator returns the kind of morphology operation to use (morph_operator).
createTrackbar("Operator:\n 0: Opening - 1: Closing \n 2: Gradient - 3: Top Hat \n 4: Black Hat",
window
_
name, &morph
_
operator, max
_
operator,
Morphology
_
Operations );
The second trackbar Element returns morph_elem, which indicates what kind of structure our
kernel is:
createTrackbar( "Element:\n 0: Rect - 1: Cross - 2: Ellipse", window
_
name,
&morph
_
elem, max
_
elem,
Morphology
_
Operations );
3.3. More Morphology Transformations 173
The OpenCV Tutorials, Release 2.4.3
The nal trackbar Kernel Size returns the size of the kernel to be used (morph_size)
createTrackbar( "Kernel size:\n 2n +1", window
_
name,
&morph
_
size, max
_
kernel
_
size,
Morphology
_
Operations );
Every time we move any slider, the users function Morphology_Operations will be called to effectuate
a new morphology operation and it will update the output image based on the current trackbar values.
/
**
*
@function Morphology
_
Operations
*
/
void Morphology
_
Operations( int, void
*
)
{
// Since MORPH
_
X : 2,3,4,5 and 6
int operation = morph
_
operator + 2;
Mat element = getStructuringElement( morph
_
elem, Size( 2
*
morph
_
size + 1, 2
*
morph
_
size+1 ), Point( morph
_
size, morph
_
size ) );
/// Apply the specified morphology operation
morphologyEx( src, dst, operation, element );
imshow( window
_
name, dst );
}
We can observe that the key function to perform the morphology transformations is morphologyEx. In this
example we use four arguments (leaving the rest as defaults):
src : Source (input) image
dst: Output image
operation: The kind of morphology transformation to be performed. Note that we have 5 alternatives:
*
Opening: MORPH_OPEN : 2
*
Closing: MORPH_CLOSE: 3
*
Gradient: MORPH_GRADIENT: 4
*
Top Hat: MORPH_TOPHAT: 5
*
Black Hat: MORPH_BLACKHAT: 6
As you can see the values range from <2-6>, that is why we add (+2) to the values entered by the
Trackbar:
int operation = morph
_
operator + 2;
element: The kernel to be used. We use the function getStructuringElement to dene our own struc-
ture.
Results
After compiling the code above we can execute it giving an image path as an argument. For this tutorial we use
as input the image: baboon.png:
174 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
And here are two snapshots of the display window. The rst picture shows the output after using the operator
Opening with a cross kernel. The second picture (right side, shows the result of using a Blackhat operator with
an ellipse kernel.
3.4 Image Pyramids
Goal
In this tutorial you will learn how to:
Use the OpenCV functions pyrUp and pyrDown to downsample or upsample a given image.
Theory
Note: The explanation below belongs to the book Learning OpenCV by Bradski and Kaehler.
Usually we need to convert an image to a size different than its original. For this, there are two possible options:
1. Upsize the image (zoom in) or
2. Downsize it (zoom out).
Although there is a geometric transformation function in OpenCV that -literally- resize an image (resize, which
we will show in a future tutorial), in this section we analyze rst the use of Image Pyramids, which are widely
applied in a huge range of vision applications.
3.4. Image Pyramids 175
The OpenCV Tutorials, Release 2.4.3
Image Pyramid
An image pyramid is a collection of images - all arising from a single original image - that are successively
downsampled until some desired stopping point is reached.
There are two common kinds of image pyramids:
Gaussian pyramid: Used to downsample images
Laplacian pyramid: Used to reconstruct an upsampled image from an image lower in the pyramid (with
less resolution)
In this tutorial well use the Gaussian pyramid.
Gaussian Pyramid
Imagine the pyramid as a set of layers in which the higher the layer, the smaller the size.
Every layer is numbered from bottom to top, so layer (i +1) (denoted as G
i+1
is smaller than layer i (G
i
).
To produce layer (i +1) in the Gaussian pyramid, we do the following:
Convolve G
i
with a Gaussian kernel:
1
16
_
_
1 4 6 4 1
4 16 24 16 4
6 24 36 24 6
4 16 24 16 4
1 4 6 4 1
_
_
Remove every even-numbered row and column.
You can easily notice that the resulting image will be exactly one-quarter the area of its predecessor. Iterating
this process on the input image G
0
(original image) produces the entire pyramid.
The procedure above was useful to downsample an image. What if we want to make it bigger?:
First, upsize the image to twice the original in each dimension, wit the new even rows and columns lled
with zeros (0)
Perform a convolution with the same kernel shown above (multiplied by 4) to approximate the values of
the missing pixels
176 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
These two procedures (downsampling and upsampling as explained above) are implemented by the OpenCV
functions pyrUp and pyrDown, as we will see in an example with the code below:
Note: When we reduce the size of an image, we are actually losing information of the image.
Code
This tutorial codes is shown lines below. You can also download it from here
#include "opencv2/imgproc/imgproc.hpp"
#include "opencv2/highgui/highgui.hpp"
#include <math.h>
#include <stdlib.h>
#include <stdio.h>
using namespace cv;
/// Global variables
Mat src, dst, tmp;
char
*
window
_
name = "Pyramids Demo";
/
**
*
@function main
*
/
int main( int argc, char
**
argv )
{
/// General instructions
printf( "\n Zoom In-Out demo \n " );
printf( "------------------ \n" );
printf( "
*
[u] -> Zoom in \n" );
printf( "
*
[d] -> Zoom out \n" );
printf( "
*
[ESC] -> Close program \n \n" );
/// Test image - Make sure it s divisible by 2^{n}
src = imread( "../images/chicky
_
512.jpg" );
if( !src.data )
{ printf(" No data! -- Exiting the program \n");
return -1; }
tmp = src;
dst = tmp;
/// Create window
namedWindow( window
_
name, CV
_
WINDOW
_
AUTOSIZE );
imshow( window
_
name, dst );
/// Loop
while( true )
{
int c;
c = waitKey(10);
if( (char)c == 27 )
{ break; }
if( (char)c == u )
3.4. Image Pyramids 177
The OpenCV Tutorials, Release 2.4.3
{ pyrUp( tmp, dst, Size( tmp.cols
*
2, tmp.rows
*
2 ) );
printf( "
**
Zoom In: Image x 2 \n" );
}
else if( (char)c == d )
{ pyrDown( tmp, dst, Size( tmp.cols/2, tmp.rows/2 ) );
printf( "
**
Zoom Out: Image / 2 \n" );
}
imshow( window
_
name, dst );
tmp = dst;
}
return 0;
}
Explanation
1. Lets check the general structure of the program:
Load an image (in this case it is dened in the program, the user does not have to enter it as an argument)
/// Test image - Make sure it s divisible by 2^{n}
src = imread( "../images/chicky
_
512.jpg" );
if( !src.data )
{ printf(" No data! -- Exiting the program \n");
return -1; }
Create a Mat object to store the result of the operations (dst) and one to save temporal results (tmp).
Mat src, dst, tmp;
/
*
...
*
/
tmp = src;
dst = tmp;
Create a window to display the result
namedWindow( window
_
name, CV
_
WINDOW
_
AUTOSIZE );
imshow( window
_
name, dst );
Perform an innite loop waiting for user input.
while( true )
{
int c;
c = waitKey(10);
if( (char)c == 27 )
{ break; }
if( (char)c == u )
{ pyrUp( tmp, dst, Size( tmp.cols
*
2, tmp.rows
*
2 ) );
printf( "
**
Zoom In: Image x 2 \n" );
}
else if( (char)c == d )
{ pyrDown( tmp, dst, Size( tmp.cols/2, tmp.rows/2 ) );
printf( "
**
Zoom Out: Image / 2 \n" );
}
imshow( window
_
name, dst );
178 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
tmp = dst;
}
Our program exits if the user presses ESC. Besides, it has two options:
Perform upsampling (after pressing u)
pyrUp( tmp, dst, Size( tmp.cols
*
2, tmp.rows
*
2 )
We use the function pyrUp with 03 arguments:
*
tmp: The current image, it is initialized with the src original image.
*
dst: The destination image (to be shown on screen, supposedly the double of the input image)
*
Size( tmp.cols*2, tmp.rows*2 ) : The destination size. Since we are upsampling, pyrUp expects a
size double than the input image (in this case tmp).
Perform downsampling (after pressing d)
pyrDown( tmp, dst, Size( tmp.cols/2, tmp.rows/2 )
Similarly as with pyrUp, we use the function pyrDown with 03 arguments:
*
tmp: The current image, it is initialized with the src original image.
*
dst: The destination image (to be shown on screen, supposedly half the input image)
*
Size( tmp.cols/2, tmp.rows/2 ) : The destination size. Since we are upsampling, pyrDown expects
half the size the input image (in this case tmp).
Notice that it is important that the input image can be divided by a factor of two (in both dimensions).
Otherwise, an error will be shown.
Finally, we update the input image tmp with the current image displayed, so the subsequent operations
are performed on it.
tmp = dst;
Results
After compiling the code above we can test it. The program calls an image chicky_512.jpg that comes in the
tutorial_code/image folder. Notice that this image is 512 512, hence a downsample wont generate any error
(512 = 2
9
). The original image is shown below:
3.4. Image Pyramids 179
The OpenCV Tutorials, Release 2.4.3
First we apply two successive pyrDown operations by pressing d. Our output is:
Note that we should have lost some resolution due to the fact that we are diminishing the size of the image. This
is evident after we apply pyrUp twice (by pressing u). Our output is now:
180 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
3.5 Basic Thresholding Operations
Goal
In this tutorial you will learn how to:
Perform basic thresholding operations using OpenCV function threshold
Cool Theory
Note: The explanation below belongs to the book Learning OpenCV by Bradski and Kaehler.
What is Thresholding?
The simplest segmentation method
Application example: Separate out regions of an image corresponding to objects which we want to analyze.
This separation is based on the variation of intensity between the object pixels and the background pixels.
To differentiate the pixels we are interested in from the rest (which will eventually be rejected), we perform a
comparison of each pixel intensity value with respect to a threshold (determined according to the problem to
solve).
3.5. Basic Thresholding Operations 181
The OpenCV Tutorials, Release 2.4.3
Once we have separated properly the important pixels, we can set them with a determined value to identify them
(i.e. we can assign them a value of 0 (black), 255 (white) or any value that suits your needs).
Types of Thresholding
OpenCV offers the function threshold to perform thresholding operations.
We can effectuate 5 types of Thresholding operations with this function. We will explain them in the following
subsections.
To illustrate how these thresholding processes work, lets consider that we have a source image with pixels with
intensity values src(x, y). The plot below depicts this. The horizontal blue line represents the threshold thresh
(xed).
Threshold Binary
This thresholding operation can be expressed as:
dst(x, y) =
i=0
M
j
1
j=0
I(x +i a
i
, y +j a
j
)K(i, j)
Fortunately, OpenCV provides you with the function lter2D so you do not have to code all these operations.
Code
1. What does this program do?
Loads an image
Performs a normalized box lter. For instance, for a kernel of size size = 3, the kernel would be:
K =
1
3 3
_
_
1 1 1
1 1 1
1 1 1
_
_
The program will perform the lter operation with kernels of sizes 3, 5, 7, 9 and 11.
The lter output (with each kernel) will be shown during 500 milliseconds
2. The tutorial codes is shown lines below. You can also download it from here
#include "opencv2/imgproc/imgproc.hpp"
#include "opencv2/highgui/highgui.hpp"
#include <stdlib.h>
#include <stdio.h>
190 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
using namespace cv;
/
**
@function main
*
/
int main ( int argc, char
**
argv )
{
/// Declare variables
Mat src, dst;
Mat kernel;
Point anchor;
double delta;
int ddepth;
int kernel
_
size;
char
*
window
_
name = "filter2D Demo";
int c;
/// Load an image
src = imread( argv[1] );
if( !src.data )
{ return -1; }
/// Create window
namedWindow( window
_
name, CV
_
WINDOW
_
AUTOSIZE );
/// Initialize arguments for the filter
anchor = Point( -1, -1 );
delta = 0;
ddepth = -1;
/// Loop - Will filter the image with different kernel sizes each 0.5 seconds
int ind = 0;
while( true )
{
c = waitKey(500);
/// Press ESC to exit the program
if( (char)c == 27 )
{ break; }
/// Update kernel size for a normalized box filter
kernel
_
size = 3 + 2
*
( ind%5 );
kernel = Mat::ones( kernel
_
size, kernel
_
size, CV
_
32F )/ (float)(kernel
_
size
*
kernel
_
size);
/// Apply filter
filter2D(src, dst, ddepth , kernel, anchor, delta, BORDER
_
DEFAULT );
imshow( window
_
name, dst );
ind++;
}
return 0;
}
Explanation
1. Load an image
3.6. Making your own linear lters! 191
The OpenCV Tutorials, Release 2.4.3
src = imread( argv[1] );
if( !src.data )
{ return -1; }
2. Create a window to display the result
namedWindow( window
_
name, CV
_
WINDOW
_
AUTOSIZE );
3. Initialize the arguments for the linear lter
anchor = Point( -1, -1 );
delta = 0;
ddepth = -1;
4. Perform an innite loop updating the kernel size and applying our linear lter to the input image. Lets analyze
that more in detail:
5. First we dene the kernel our lter is going to use. Here it is:
kernel
_
size = 3 + 2
*
( ind%5 );
kernel = Mat::ones( kernel
_
size, kernel
_
size, CV
_
32F )/ (float)(kernel
_
size
*
kernel
_
size);
The rst line is to update the kernel_size to odd values in the range: [3, 11]. The second line actually builds the
kernel by setting its value to a matrix lled with 1
_
2 4 5 4 2
4 9 12 9 4
5 12 15 12 5
4 9 12 9 4
2 4 5 4 2
_
_
2. Find the intensity gradient of the image. For this, we follow a procedure analogous to Sobel:
(a) Apply a pair of convolution masks (in x and y directions:
G
x
=
_
_
1 0 +1
2 0 +2
1 0 +1
_
_
G
y
=
_
_
1 2 1
0 0 0
+1 +2 +1
_
_
(b) Find the gradient strength and direction with:
G =
_
G
2
x
+G
2
y
= arctan(
G
y
G
x
)
The direction is rounded to one of four possible angles (namely 0, 45, 90 or 135)
3. Non-maximum suppression is applied. This removes pixels that are not considered to be part of an edge. Hence,
only thin lines (candidate edges) will remain.
4. Hysteresis: The nal step. Canny does use two thresholds (upper and lower):
(a) If a pixel gradient is higher than the upper threshold, the pixel is accepted as an edge
(b) If a pixel gradient value is below the lower threshold, then it is rejected.
(c) If the pixel gradient is between the two thresholds, then it will be accepted only if it is connected to a pixel
that is above the upper threshold.
Canny recommended a upper:lower ratio between 2:1 and 3:1.
5. For more details, you can always consult your favorite Computer Vision book.
Code
1. What does this program do?
208 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
Asks the user to enter a numerical value to set the lower threshold for our Canny Edge Detector (by means
of a Trackbar)
Applies the Canny Detector and generates a mask (bright lines representing the edges on a black back-
ground).
Applies the mask obtained on the original image and display it in a window.
2. The tutorial codes is shown lines below. You can also download it from here
#include "opencv2/imgproc/imgproc.hpp"
#include "opencv2/highgui/highgui.hpp"
#include <stdlib.h>
#include <stdio.h>
using namespace cv;
/// Global variables
Mat src, src
_
gray;
Mat dst, detected
_
edges;
int edgeThresh = 1;
int lowThreshold;
int const max
_
lowThreshold = 100;
int ratio = 3;
int kernel
_
size = 3;
char
*
window
_
name = "Edge Map";
/
**
*
@function CannyThreshold
*
@brief Trackbar callback - Canny thresholds input with a ratio 1:3
*
/
void CannyThreshold(int, void
*
)
{
/// Reduce noise with a kernel 3x3
blur( src
_
gray, detected
_
edges, Size(3,3) );
/// Canny detector
Canny( detected
_
edges, detected
_
edges, lowThreshold, lowThreshold
*
ratio, kernel
_
size );
/// Using Cannys output as a mask, we display our result
dst = Scalar::all(0);
src.copyTo( dst, detected
_
edges);
imshow( window
_
name, dst );
}
/
**
@function main
*
/
int main( int argc, char
**
argv )
{
/// Load an image
src = imread( argv[1] );
if( !src.data )
{ return -1; }
/// Create a matrix of the same type and size as src (for dst)
dst.create( src.size(), src.type() );
3.10. Canny Edge Detector 209
The OpenCV Tutorials, Release 2.4.3
/// Convert the image to grayscale
cvtColor( src, src
_
gray, CV
_
BGR2GRAY );
/// Create a window
namedWindow( window
_
name, CV
_
WINDOW
_
AUTOSIZE );
/// Create a Trackbar for user to enter threshold
createTrackbar( "Min Threshold:", window
_
name, &lowThreshold, max
_
lowThreshold, CannyThreshold );
/// Show the image
CannyThreshold(0, 0);
/// Wait until user exit program by pressing a key
waitKey(0);
return 0;
}
Explanation
1. Create some needed variables:
Mat src, src
_
gray;
Mat dst, detected
_
edges;
int edgeThresh = 1;
int lowThreshold;
int const max
_
lowThreshold = 100;
int ratio = 3;
int kernel
_
size = 3;
char
*
window
_
name = "Edge Map";
Note the following:
a. We establish a ratio of lower:upper threshold of 3:1 (with the variable
*
ratio
*
)
b. We set the kernel size of :math:3 (for the Sobel operations to be performed internally by the Canny function)
c. We set a maximum value for the lower Threshold of :math:100.
2. Loads the source image:
/// Load an image
src = imread( argv[1] );
if( !src.data )
{ return -1; }
3. Create a matrix of the same type and size of src (to be dst)
dst.create( src.size(), src.type() );
4. Convert the image to grayscale (using the function cvtColor:
cvtColor( src, src
_
gray, CV
_
BGR2GRAY );
5. Create a window to display the results
210 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
namedWindow( window
_
name, CV
_
WINDOW
_
AUTOSIZE );
6. Create a Trackbar for the user to enter the lower threshold for our Canny detector:
createTrackbar( "Min Threshold:", window
_
name, &lowThreshold, max
_
lowThreshold, CannyThreshold );
Observe the following:
(a) The variable to be controlled by the Trackbar is lowThreshold with a limit of max_lowThreshold (which
we set to 100 previously)
(b) Each time the Trackbar registers an action, the callback function CannyThreshold will be invoked.
7. Lets check the CannyThreshold function, step by step:
(a) First, we blur the image with a lter of kernel size 3:
blur( src
_
gray, detected
_
edges, Size(3,3) );
(b) Second, we apply the OpenCV function Canny:
Canny( detected
_
edges, detected
_
edges, lowThreshold, lowThreshold
*
ratio, kernel
_
size );
where the arguments are:
detected_edges: Source image, grayscale
detected_edges: Output of the detector (can be the same as the input)
lowThreshold: The value entered by the user moving the Trackbar
highThreshold: Set in the program as three times the lower threshold (following Cannys recommen-
dation)
kernel_size: We dened it to be 3 (the size of the Sobel kernel to be used internally)
8. We ll a dst image with zeros (meaning the image is completely black).
dst = Scalar::all(0);
9. Finally, we will use the function copyTo to map only the areas of the image that are identied as edges (on a
black background).
src.copyTo( dst, detected
_
edges);
copyTo copy the src image onto dst. However, it will only copy the pixels in the locations where they have non-
zero values. Since the output of the Canny detector is the edge contours on a black background, the resulting
dst will be black in all the area but the detected edges.
10. We display our result:
imshow( window
_
name, dst );
Result
After compiling the code above, we can run it giving as argument the path to an image. For example, using as
an input the following image:
3.10. Canny Edge Detector 211
The OpenCV Tutorials, Release 2.4.3
Moving the slider, trying different threshold, we obtain the following result:
Notice how the image is superposed to the black background on the edge regions.
3.11 Hough Line Transform
Goal
In this tutorial you will learn how to:
Use the OpenCV functions HoughLines and HoughLinesP to detect lines in an image.
Theory
Note: The explanation below belongs to the book Learning OpenCV by Bradski and Kaehler.
Hough Line Transform
1. The Hough Line Transform is a transform used to detect straight lines.
2. To apply the Transform, rst an edge detection pre-processing is desirable.
212 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
How does it work?
1. As you know, a line in the image space can be expressed with two variables. For example:
(a) In the Cartesian coordinate system: Parameters: (m, b).
(b) In the Polar coordinate system: Parameters: (r, )
For Hough Transforms, we will express lines in the Polar system. Hence, a line equation can be written as:
y =
_
cos
sin
_
x +
_
r
sin
_
Arranging the terms: r = x cos +ysin
1. In general for each point (x
0
, y
0
), we can dene the family of lines that goes through that point as:
r
= x
0
cos +y
0
sin
Meaning that each pair (r
)
In OpenCV it is implemented with the function HoughLines
2. The Probabilistic Hough Line Transform
A more efcient implementation of the Hough Line Transform. It gives as output the extremes of the detected
lines (x
0
, y
0
, x
1
, y
1
)
In OpenCV it is implemented with the function HoughLinesP
Code
1. What does this program do?
Loads an image
214 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
Applies either a Standard Hough Line Transform or a Probabilistic Line Transform.
Display the original image and the detected line in two windows.
2. The sample code that we will explain can be downloaded from here. A slightly fancier version (which shows
both Hough standard and probabilistic with trackbars for changing the threshold values) can be found here.
#include "opencv2/highgui/highgui.hpp"
#include "opencv2/imgproc/imgproc.hpp"
#include <iostream>
using namespace cv;
using namespace std;
void help()
{
cout << "\nThis program demonstrates line finding with the Hough transform.\n"
"Usage:\n"
"./houghlines <image
_
name>, Default is pic1.jpg\n" << endl;
}
int main(int argc, char
**
argv)
{
const char
*
filename = argc >= 2 ? argv[1] : "pic1.jpg";
Mat src = imread(filename, 0);
if(src.empty())
{
help();
cout << "can not open " << filename << endl;
return -1;
}
Mat dst, cdst;
Canny(src, dst, 50, 200, 3);
cvtColor(dst, cdst, CV
_
GRAY2BGR);
#if 0
vector<Vec2f> lines;
HoughLines(dst, lines, 1, CV
_
PI/180, 100, 0, 0 );
for( size
_
t i = 0; i < lines.size(); i++ )
{
float rho = lines[i][0], theta = lines[i][1];
Point pt1, pt2;
double a = cos(theta), b = sin(theta);
double x0 = a
*
rho, y0 = b
*
rho;
pt1.x = cvRound(x0 + 1000
*
(-b));
pt1.y = cvRound(y0 + 1000
*
(a));
pt2.x = cvRound(x0 - 1000
*
(-b));
pt2.y = cvRound(y0 - 1000
*
(a));
line( cdst, pt1, pt2, Scalar(0,0,255), 3, CV
_
AA);
}
#else
vector<Vec4i> lines;
HoughLinesP(dst, lines, 1, CV
_
PI/180, 50, 50, 10 );
for( size
_
t i = 0; i < lines.size(); i++ )
{
3.11. Hough Line Transform 215
The OpenCV Tutorials, Release 2.4.3
Vec4i l = lines[i];
line( cdst, Point(l[0], l[1]), Point(l[2], l[3]), Scalar(0,0,255), 3, CV
_
AA);
}
#endif
imshow("source", src);
imshow("detected lines", cdst);
waitKey();
return 0;
}
Explanation
1. Load an image
Mat src = imread(filename, 0);
if(src.empty())
{
help();
cout << "can not open " << filename << endl;
return -1;
}
2. Detect the edges of the image by using a Canny detector
Canny(src, dst, 50, 200, 3);
Now we will apply the Hough Line Transform. We will explain how to use both OpenCV functions available
for this purpose:
3. Standard Hough Line Transform
(a) First, you apply the Transform:
vector<Vec2f> lines;
HoughLines(dst, lines, 1, CV
_
PI/180, 100, 0, 0 );
with the following arguments:
dst: Output of the edge detector. It should be a grayscale image (although in fact it is a binary one)
lines: A vector that will store the parameters (r, ) of the detected lines
rho : The resolution of the parameter r in pixels. We use 1 pixel.
theta: The resolution of the parameter in radians. We use 1 degree (CV_PI/180)
threshold: The minimum number of intersections to detect a line
srn and stn: Default parameters to zero. Check OpenCV reference for more info.
(b) And then you display the result by drawing the lines.
for( size
_
t i = 0; i < lines.size(); i++ )
{
float rho = lines[i][0], theta = lines[i][1];
Point pt1, pt2;
double a = cos(theta), b = sin(theta);
double x0 = a
*
rho, y0 = b
*
rho;
pt1.x = cvRound(x0 + 1000
*
(-b));
216 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
pt1.y = cvRound(y0 + 1000
*
(a));
pt2.x = cvRound(x0 - 1000
*
(-b));
pt2.y = cvRound(y0 - 1000
*
(a));
line( cdst, pt1, pt2, Scalar(0,0,255), 3, CV
_
AA);
}
4. Probabilistic Hough Line Transform
(a) First you apply the transform:
vector<Vec4i> lines;
HoughLinesP(dst, lines, 1, CV
_
PI/180, 50, 50, 10 );
with the arguments:
dst: Output of the edge detector. It should be a grayscale image (although in fact it is a binary one)
lines: A vector that will store the parameters (x
start
, y
start
, x
end
, y
end
) of the detected lines
rho : The resolution of the parameter r in pixels. We use 1 pixel.
theta: The resolution of the parameter in radians. We use 1 degree (CV_PI/180)
threshold: The minimum number of intersections to detect a line
minLinLength: The minimum number of points that can form a line. Lines with less than this number
of points are disregarded.
maxLineGap: The maximum gap between two points to be considered in the same line.
(b) And then you display the result by drawing the lines.
for( size
_
t i = 0; i < lines.size(); i++ )
{
Vec4i l = lines[i];
line( cdst, Point(l[0], l[1]), Point(l[2], l[3]), Scalar(0,0,255), 3, CV
_
AA);
}
5. Display the original image and the detected lines:
imshow("source", src);
imshow("detected lines", cdst);
6. Wait until the user exits the program
waitKey();
Result
Note: The results below are obtained using the slightly fancier version we mentioned in the Code section. It still
implements the same stuff as above, only adding the Trackbar for the Threshold.
Using an input image such as:
3.11. Hough Line Transform 217
The OpenCV Tutorials, Release 2.4.3
We get the following result by using the Probabilistic Hough Line Transform:
You may observe that the number of lines detected vary while you change the threshold. The explanation is sort of
evident: If you establish a higher threshold, fewer lines will be detected (since you will need more points to declare a
line detected).
3.12 Hough Circle Transform
Goal
In this tutorial you will learn how to:
Use the OpenCV function HoughCircles to detect circles in an image.
218 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
Theory
Hough Circle Transform
The Hough Circle Transform works in a roughly analogous way to the Hough Line Transform explained in the
previous tutorial.
In the line detection case, a line was dened by two parameters (r, ). In the circle case, we need three parame-
ters to dene a circle:
C : (x
center
, y
center
, r)
where (x
center
, y
center
) dene the center position (gree point) and r is the radius, which allows us to com-
pletely dene a circle, as it can be seen below:
For sake of efciency, OpenCV implements a detection method slightly trickier than the standard Hough Trans-
form: The Hough gradient method. For more details, please check the book Learning OpenCV or your favorite
Computer Vision bibliography
Code
1. What does this program do?
Loads an image and blur it to reduce the noise
Applies the Hough Circle Transform to the blurred image .
Display the detected circle in a window.
2. The sample code that we will explain can be downloaded from here. A slightly fancier version (which shows
both Hough standard and probabilistic with trackbars for changing the threshold values) can be found here.
#include "opencv2/highgui/highgui.hpp"
#include "opencv2/imgproc/imgproc.hpp"
#include <iostream>
#include <stdio.h>
using namespace cv;
/
**
@function main
*
/
3.12. Hough Circle Transform 219
The OpenCV Tutorials, Release 2.4.3
int main(int argc, char
**
argv)
{
Mat src, src
_
gray;
/// Read the image
src = imread( argv[1], 1 );
if( !src.data )
{ return -1; }
/// Convert it to gray
cvtColor( src, src
_
gray, CV
_
BGR2GRAY );
/// Reduce the noise so we avoid false circle detection
GaussianBlur( src
_
gray, src
_
gray, Size(9, 9), 2, 2 );
vector<Vec3f> circles;
/// Apply the Hough Transform to find the circles
HoughCircles( src
_
gray, circles, CV
_
HOUGH
_
GRADIENT, 1, src
_
gray.rows/8, 200, 100, 0, 0 );
/// Draw the circles detected
for( size
_
t i = 0; i < circles.size(); i++ )
{
Point center(cvRound(circles[i][0]), cvRound(circles[i][1]));
int radius = cvRound(circles[i][2]);
// circle center
circle( src, center, 3, Scalar(0,255,0), -1, 8, 0 );
// circle outline
circle( src, center, radius, Scalar(0,0,255), 3, 8, 0 );
}
/// Show your results
namedWindow( "Hough Circle Transform Demo", CV
_
WINDOW
_
AUTOSIZE );
imshow( "Hough Circle Transform Demo", src );
waitKey(0);
return 0;
}
Explanation
1. Load an image
src = imread( argv[1], 1 );
if( !src.data )
{ return -1; }
2. Convert it to grayscale:
cvtColor( src, src
_
gray, CV
_
BGR2GRAY );
3. Apply a Gaussian blur to reduce noise and avoid false circle detection:
GaussianBlur( src
_
gray, src
_
gray, Size(9, 9), 2, 2 );
4. Proceed to apply Hough Circle Transform:
220 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
vector<Vec3f> circles;
HoughCircles( src
_
gray, circles, CV
_
HOUGH
_
GRADIENT, 1, src
_
gray.rows/8, 200, 100, 0, 0 );
with the arguments:
src_gray: Input image (grayscale)
circles: A vector that stores sets of 3 values: x
c
, y
c
, r for each detected circle.
CV_HOUGH_GRADIENT: Dene the detection method. Currently this is the only one available in
OpenCV
dp = 1: The inverse ratio of resolution
min_dist = src_gray.rows/8: Minimum distance between detected centers
param_1 = 200: Upper threshold for the internal Canny edge detector
param_2 = 100*: Threshold for center detection.
min_radius = 0: Minimum radio to be detected. If unknown, put zero as default.
max_radius = 0: Maximum radius to be detected. If unknown, put zero as default
5. Draw the detected circles:
for( size
_
t i = 0; i < circles.size(); i++ )
{
Point center(cvRound(circles[i][0]), cvRound(circles[i][1]));
int radius = cvRound(circles[i][2]);
// circle center
circle( src, center, 3, Scalar(0,255,0), -1, 8, 0 );
// circle outline
circle( src, center, radius, Scalar(0,0,255), 3, 8, 0 );
}
You can see that we will draw the circle(s) on red and the center(s) with a small green dot
6. Display the detected circle(s):
namedWindow( "Hough Circle Transform Demo", CV
_
WINDOW
_
AUTOSIZE );
imshow( "Hough Circle Transform Demo", src );
7. Wait for the user to exit the program
waitKey(0);
Result
The result of running the code above with a test image is shown below:
3.12. Hough Circle Transform 221
The OpenCV Tutorials, Release 2.4.3
3.13 Remapping
Goal
In this tutorial you will learn how to:
1. Use the OpenCV function remap to implement simple remapping routines.
Theory
What is remapping?
It is the process of taking pixels from one place in the image and locating them in another position in a new
image.
To accomplish the mapping process, it might be necessary to do some interpolation for non-integer pixel loca-
tions, since there will not always be a one-to-one-pixel correspondence between source and destination images.
We can express the remap for every pixel location (x, y) as:
g(x, y) = f(h(x, y))
where g() is the remapped image, f() the source image and h(x, y) is the mapping function that operates on
(x, y).
Lets think in a quick example. Imagine that we have an image I and, say, we want to do a remap such that:
h(x, y) = (I.cols x, y)
What would happen? It is easily seen that the image would ip in the x direction. For instance, consider the
input image:
222 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
observe how the red circle changes positions with respect to x (considering x the horizontal direction):
In OpenCV, the function remap offers a simple remapping implementation.
Code
1. What does this program do?
Loads an image
Each second, apply 1 of 4 different remapping processes to the image and display them indenitely in a
window.
Wait for the user to exit the program
2. The tutorial codes is shown lines below. You can also download it from here
#include "opencv2/highgui/highgui.hpp"
#include "opencv2/imgproc/imgproc.hpp"
#include <iostream>
#include <stdio.h>
using namespace cv;
/// Global variables
Mat src, dst;
3.13. Remapping 223
The OpenCV Tutorials, Release 2.4.3
Mat map
_
x, map
_
y;
char
*
remap
_
window = "Remap demo";
int ind = 0;
/// Function Headers
void update
_
map( void );
/
**
*
@function main
*
/
int main( int argc, char
**
argv )
{
/// Load the image
src = imread( argv[1], 1 );
/// Create dst, map
_
x and map
_
y with the same size as src:
dst.create( src.size(), src.type() );
map
_
x.create( src.size(), CV
_
32FC1 );
map
_
y.create( src.size(), CV
_
32FC1 );
/// Create window
namedWindow( remap
_
window, CV
_
WINDOW
_
AUTOSIZE );
/// Loop
while( true )
{
/// Each 1 sec. Press ESC to exit the program
int c = waitKey( 1000 );
if( (char)c == 27 )
{ break; }
/// Update map
_
x & map
_
y. Then apply remap
update
_
map();
remap( src, dst, map
_
x, map
_
y, CV
_
INTER
_
LINEAR, BORDER
_
CONSTANT, Scalar(0,0, 0) );
/// Display results
imshow( remap
_
window, dst );
}
return 0;
}
/
**
*
@function update
_
map
*
@brief Fill the map
_
x and map
_
y matrices with 4 types of mappings
*
/
void update
_
map( void )
{
ind = ind%4;
for( int j = 0; j < src.rows; j++ )
{ for( int i = 0; i < src.cols; i++ )
{
switch( ind )
{
case 0:
if( i > src.cols
*
0.25 && i < src.cols
*
0.75 && j > src.rows
*
0.25 && j < src.rows
*
0.75 )
{
224 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
map
_
x.at<float>(j,i) = 2
*
( i - src.cols
*
0.25 ) + 0.5 ;
map
_
y.at<float>(j,i) = 2
*
( j - src.rows
*
0.25 ) + 0.5 ;
}
else
{ map
_
x.at<float>(j,i) = 0 ;
map
_
y.at<float>(j,i) = 0 ;
}
break;
case 1:
map
_
x.at<float>(j,i) = i ;
map
_
y.at<float>(j,i) = src.rows - j ;
break;
case 2:
map
_
x.at<float>(j,i) = src.cols - i ;
map
_
y.at<float>(j,i) = j ;
break;
case 3:
map
_
x.at<float>(j,i) = src.cols - i ;
map
_
y.at<float>(j,i) = src.rows - j ;
break;
} // end of switch
}
}
ind++;
}
Explanation
1. Create some variables we will use:
Mat src, dst;
Mat map
_
x, map
_
y;
char
*
remap
_
window = "Remap demo";
int ind = 0;
2. Load an image:
src = imread( argv[1], 1 );
3. Create the destination image and the two mapping matrices (for x and y )
dst.create( src.size(), src.type() );
map
_
x.create( src.size(), CV
_
32FC1 );
map
_
y.create( src.size(), CV
_
32FC1 );
4. Create a window to display results
namedWindow( remap
_
window, CV
_
WINDOW
_
AUTOSIZE );
5. Establish a loop. Each 1000 ms we update our mapping matrices (mat_x and mat_y) and apply them to our
source image:
while( true )
{
/// Each 1 sec. Press ESC to exit the program
int c = waitKey( 1000 );
if( (char)c == 27 )
3.13. Remapping 225
The OpenCV Tutorials, Release 2.4.3
{ break; }
/// Update map
_
x & map
_
y. Then apply remap
update
_
map();
remap( src, dst, map
_
x, map
_
y, CV
_
INTER
_
LINEAR, BORDER
_
CONSTANT, Scalar(0,0, 0) );
/// Display results
imshow( remap
_
window, dst );
}
The function that applies the remapping is remap. We give the following arguments:
src: Source image
dst: Destination image of same size as src
map_x: The mapping function in the x direction. It is equivalent to the rst component of h(i, j)
map_y: Same as above, but in y direction. Note that map_y and map_x are both of the same size as src
CV_INTER_LINEAR: The type of interpolation to use for non-integer pixels. This is by default.
BORDER_CONSTANT: Default
How do we update our mapping matrices mat_x and mat_y? Go on reading:
6. Updating the mapping matrices: We are going to perform 4 different mappings:
(a) Reduce the picture to half its size and will display it in the middle:
h(i, j) = (2 i src.cols/2 +0.5, 2 j src.rows/2 +0.5)
for all pairs (i, j) such that:
src.cols
4
< i <
3 src.cols
4
and
src.rows
4
< j <
3 src.rows
4
(b) Turn the image upside down: h(i, j) = (i, src.rows j)
(c) Reect the image from left to right: h(i, j) = (src.cols i, j)
(d) Combination of b and c: h(i, j) = (src.cols i, src.rows j)
This is expressed in the following snippet. Here, map_x represents the rst coordinate of h(i,j) and map_y
the second coordinate.
for( int j = 0; j < src.rows; j++ )
{ for( int i = 0; i < src.cols; i++ )
{
switch( ind )
{
case 0:
if( i > src.cols
*
0.25 && i < src.cols
*
0.75 && j > src.rows
*
0.25 && j < src.rows
*
0.75 )
{
map
_
x.at<float>(j,i) = 2
*
( i - src.cols
*
0.25 ) + 0.5 ;
map
_
y.at<float>(j,i) = 2
*
( j - src.rows
*
0.25 ) + 0.5 ;
}
else
{ map
_
x.at<float>(j,i) = 0 ;
map
_
y.at<float>(j,i) = 0 ;
}
break;
case 1:
map
_
x.at<float>(j,i) = i ;
map
_
y.at<float>(j,i) = src.rows - j ;
226 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
break;
case 2:
map
_
x.at<float>(j,i) = src.cols - i ;
map
_
y.at<float>(j,i) = j ;
break;
case 3:
map
_
x.at<float>(j,i) = src.cols - i ;
map
_
y.at<float>(j,i) = src.rows - j ;
break;
} // end of switch
}
}
ind++;
}
Result
1. After compiling the code above, you can execute it giving as argument an image path. For instance, by using
the following image:
2. This is the result of reducing it to half the size and centering it:
3. Turning it upside down:
3.13. Remapping 227
The OpenCV Tutorials, Release 2.4.3
4. Reecting it in the x direction:
5. Reecting it in both directions:
3.14 Afne Transformations
Goal
In this tutorial you will learn how to:
1. Use the OpenCV function warpAfne to implement simple remapping routines.
228 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
2. Use the OpenCV function getRotationMatrix2D to obtain a 2 3 rotation matrix
Theory
What is an Afne Transformation?
1. It is any transformation that can be expressed in the form of a matrix multiplication (linear transformation)
followed by a vector addition (translation).
2. From the above, We can use an Afne Transformation to express:
(a) Rotations (linear transformation)
(b) Translations (vector addition)
(c) Scale operations (linear transformation)
you can see that, in essence, an Afne Transformation represents a relation between two images.
3. The usual way to represent an Afne Transform is by using a 2 3 matrix.
A =
_
a
00
a
01
a
10
a
11
_
22
B =
_
b
00
b
10
_
21
M =
_
A B
=
_
a
00
a
01
b
00
a
10
a
11
b
10
_
23
Considering that we want to transform a 2D vector X =
_
x
y
_
by using A and B, we can do it equivalently with:
T = A
_
x
y
_
+B or T = M [x, y, 1]
T
T =
_
a
00
x +a
01
y +b
00
a
10
x +a
11
y +b
10
_
How do we get an Afne Transformation?
1. Excellent question. We mentioned that an Afne Transformation is basically a relation between two images.
The information about this relation can come, roughly, in two ways:
(a) We know both X and T and we also know that they are related. Then our job is to nd M
(b) We know M and :math:X. To obtain T we only need to apply T = M X. Our information for M may
be explicit (i.e. have the 2-by-3 matrix) or it can come as a geometric relation between points.
2. Lets explain a little bit better (b). Since M relates 02 images, we can analyze the simplest case in which it
relates three points in both images. Look at the gure below:
3.14. Afne Transformations 229
The OpenCV Tutorials, Release 2.4.3
the points 1, 2 and 3 (forming a triangle in image 1) are mapped into image 2, still forming a triangle, but now
they have changed notoriously. If we nd the Afne Transformation with these 3 points (you can choose them
as you like), then we can apply this found relation to the whole pixels in the image.
Code
1. What does this program do?
Loads an image
Applies an Afne Transform to the image. This Transform is obtained from the relation between three
points. We use the function warpAfne for that purpose.
Applies a Rotation to the image after being transformed. This rotation is with respect to the image center
Waits until the user exits the program
2. The tutorial codes is shown lines below. You can also download it from here
#include "opencv2/highgui/highgui.hpp"
#include "opencv2/imgproc/imgproc.hpp"
#include <iostream>
#include <stdio.h>
using namespace cv;
using namespace std;
/// Global variables
char
*
source
_
window = "Source image";
char
*
warp
_
window = "Warp";
char
*
warp
_
rotate
_
window = "Warp + Rotate";
/
**
@function main
*
/
int main( int argc, char
**
argv )
{
Point2f srcTri[3];
Point2f dstTri[3];
Mat rot
_
mat( 2, 3, CV
_
32FC1 );
Mat warp
_
mat( 2, 3, CV
_
32FC1 );
Mat src, warp
_
dst, warp
_
rotate
_
dst;
/// Load the image
src = imread( argv[1], 1 );
230 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
/// Set the dst image the same type and size as src
warp
_
dst = Mat::zeros( src.rows, src.cols, src.type() );
/// Set your 3 points to calculate the Affine Transform
srcTri[0] = Point2f( 0,0 );
srcTri[1] = Point2f( src.cols - 1, 0 );
srcTri[2] = Point2f( 0, src.rows - 1 );
dstTri[0] = Point2f( src.cols
*
0.0, src.rows
*
0.33 );
dstTri[1] = Point2f( src.cols
*
0.85, src.rows
*
0.25 );
dstTri[2] = Point2f( src.cols
*
0.15, src.rows
*
0.7 );
/// Get the Affine Transform
warp
_
mat = getAffineTransform( srcTri, dstTri );
/// Apply the Affine Transform just found to the src image
warpAffine( src, warp
_
dst, warp
_
mat, warp
_
dst.size() );
/
**
Rotating the image after Warp
*
/
/// Compute a rotation matrix with respect to the center of the image
Point center = Point( warp
_
dst.cols/2, warp
_
dst.rows/2 );
double angle = -50.0;
double scale = 0.6;
/// Get the rotation matrix with the specifications above
rot
_
mat = getRotationMatrix2D( center, angle, scale );
/// Rotate the warped image
warpAffine( warp
_
dst, warp
_
rotate
_
dst, rot
_
mat, warp
_
dst.size() );
/// Show what you got
namedWindow( source
_
window, CV
_
WINDOW
_
AUTOSIZE );
imshow( source
_
window, src );
namedWindow( warp
_
window, CV
_
WINDOW
_
AUTOSIZE );
imshow( warp
_
window, warp
_
dst );
namedWindow( warp
_
rotate
_
window, CV
_
WINDOW
_
AUTOSIZE );
imshow( warp
_
rotate
_
window, warp
_
rotate
_
dst );
/// Wait until user exits the program
waitKey(0);
return 0;
}
Explanation
1. Declare some variables we will use, such as the matrices to store our results and 2 arrays of points to store the
2D points that dene our Afne Transform.
Point2f srcTri[3];
Point2f dstTri[3];
Mat rot
_
mat( 2, 3, CV
_
32FC1 );
3.14. Afne Transformations 231
The OpenCV Tutorials, Release 2.4.3
Mat warp
_
mat( 2, 3, CV
_
32FC1 );
Mat src, warp
_
dst, warp
_
rotate
_
dst;
2. Load an image:
src = imread( argv[1], 1 );
3. Initialize the destination image as having the same size and type as the source:
warp
_
dst = Mat::zeros( src.rows, src.cols, src.type() );
4. Afne Transform: As we explained lines above, we need two sets of 3 points to derive the afne transform
relation. Take a look:
srcTri[0] = Point2f( 0,0 );
srcTri[1] = Point2f( src.cols - 1, 0 );
srcTri[2] = Point2f( 0, src.rows - 1 );
dstTri[0] = Point2f( src.cols
*
0.0, src.rows
*
0.33 );
dstTri[1] = Point2f( src.cols
*
0.85, src.rows
*
0.25 );
dstTri[2] = Point2f( src.cols
*
0.15, src.rows
*
0.7 );
You may want to draw the points to make a better idea of how they change. Their locations are approximately
the same as the ones depicted in the example gure (in the Theory section). You may note that the size and
orientation of the triangle dened by the 3 points change.
5. Armed with both sets of points, we calculate the Afne Transform by using OpenCV function getAfneTrans-
form:
warp
_
mat = getAffineTransform( srcTri, dstTri );
We get as an output a 2 3 matrix (in this case warp_mat)
6. We apply the Afne Transform just found to the src image
warpAffine( src, warp
_
dst, warp
_
mat, warp
_
dst.size() );
with the following arguments:
src: Input image
warp_dst: Output image
warp_mat: Afne transform
warp_dst.size(): The desired size of the output image
We just got our rst transformed image! We will display it in one bit. Before that, we also want to rotate it...
7. Rotate: To rotate an image, we need to know two things:
(a) The center with respect to which the image will rotate
(b) The angle to be rotated. In OpenCV a positive angle is counter-clockwise
(c) Optional: A scale factor
We dene these parameters with the following snippet:
Point center = Point( warp
_
dst.cols/2, warp
_
dst.rows/2 );
double angle = -50.0;
double scale = 0.6;
232 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
8. We generate the rotation matrix with the OpenCV function getRotationMatrix2D, which returns a 2 3 matrix
(in this case rot_mat)
rot
_
mat = getRotationMatrix2D( center, angle, scale );
9. We now apply the found rotation to the output of our previous Transformation.
warpAffine( warp
_
dst, warp
_
rotate
_
dst, rot
_
mat, warp
_
dst.size() );
10. Finally, we display our results in two windows plus the original image for good measure:
namedWindow( source
_
window, CV
_
WINDOW
_
AUTOSIZE );
imshow( source
_
window, src );
namedWindow( warp
_
window, CV
_
WINDOW
_
AUTOSIZE );
imshow( warp
_
window, warp
_
dst );
namedWindow( warp
_
rotate
_
window, CV
_
WINDOW
_
AUTOSIZE );
imshow( warp
_
rotate
_
window, warp
_
rotate
_
dst );
11. We just have to wait until the user exits the program
waitKey(0);
Result
1. After compiling the code above, we can give it the path of an image as argument. For instance, for a picture
like:
after applying the rst Afne Transform we obtain:
3.14. Afne Transformations 233
The OpenCV Tutorials, Release 2.4.3
and nally, after applying a negative rotation (remember negative means clockwise) and a scale factor, we get:
3.15 Histogram Equalization
Goal
In this tutorial you will learn:
What an image histogram is and why it is useful
To equalize histograms of images by using the OpenCV function:equalize_hist:equalizeHist <>
Theory
What is an Image Histogram?
It is a graphical representation of the intensity distribution of an image.
234 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
It quanties the number of pixels for each intensity value considered.
What is Histogram Equalization?
It is a method that improves the contrast in an image, in order to stretch out the intensity range.
To make it clearer, from the image above, you can see that the pixels seem clustered around the middle of the
available range of intensities. What Histogram Equalization does is to stretch out this range. Take a look at the
gure below: The green circles indicate the underpopulated intensities. After applying the equalization, we get
an histogram like the gure in the center. The resulting image is shown in the picture at right.
How does it work?
Equalization implies mapping one distribution (the given histogram) to another distribution (a wider and more
uniform distribution of intensity values) so the intensity values are spreaded over the whole range.
To accomplish the equalization effect, the remapping should be the cumulative distribution function (cdf) (more
3.15. Histogram Equalization 235
The OpenCV Tutorials, Release 2.4.3
details, refer to Learning OpenCV). For the histogram H(i), its cumulative distribution H
(i) is:
H
(i) =
0j<i
H(j)
To use this as a remapping function, we have to normalize H
(src(x, y))
Code
What does this program do?
Loads an image
Convert the original image to grayscale
Equalize the Histogram by using the OpenCV function EqualizeHist
Display the source and equalized images in a window.
Downloadable code: Click here
Code at glance:
#include "opencv2/highgui/highgui.hpp"
#include "opencv2/imgproc/imgproc.hpp"
#include <iostream>
#include <stdio.h>
using namespace cv;
using namespace std;
/
**
@function main
*
/
int main( int argc, char
**
argv )
{
Mat src, dst;
236 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
char
*
source
_
window = "Source image";
char
*
equalized
_
window = "Equalized Image";
/// Load image
src = imread( argv[1], 1 );
if( !src.data )
{ cout<<"Usage: ./Histogram
_
Demo <path
_
to
_
image>"<<endl;
return -1;}
/// Convert to grayscale
cvtColor( src, src, CV
_
BGR2GRAY );
/// Apply Histogram Equalization
equalizeHist( src, dst );
/// Display results
namedWindow( source
_
window, CV
_
WINDOW
_
AUTOSIZE );
namedWindow( equalized
_
window, CV
_
WINDOW
_
AUTOSIZE );
imshow( source
_
window, src );
imshow( equalized
_
window, dst );
/// Wait until user exits the program
waitKey(0);
return 0;
}
Explanation
1. Declare the source and destination images as well as the windows names:
Mat src, dst;
char
*
source
_
window = "Source image";
char
*
equalized
_
window = "Equalized Image";
2. Load the source image:
src = imread( argv[1], 1 );
if( !src.data )
{ cout<<"Usage: ./Histogram
_
Demo <path
_
to
_
image>"<<endl;
return -1;}
3. Convert it to grayscale:
cvtColor( src, src, CV
_
BGR2GRAY );
4. Apply histogram equalization with the function equalizeHist :
equalizeHist( src, dst );
As it can be easily seen, the only arguments are the original image and the output (equalized) image.
5. Display both images (original and equalized) :
3.15. Histogram Equalization 237
The OpenCV Tutorials, Release 2.4.3
namedWindow( source
_
window, CV
_
WINDOW
_
AUTOSIZE );
namedWindow( equalized
_
window, CV
_
WINDOW
_
AUTOSIZE );
imshow( source
_
window, src );
imshow( equalized
_
window, dst );
6. Wait until user exists the program
waitKey(0);
return 0;
Results
1. To appreciate better the results of equalization, lets introduce an image with not much contrast, such as:
which, by the way, has this histogram:
238 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
notice that the pixels are clustered around the center of the histogram.
2. After applying the equalization with our program, we get this result:
this image has certainly more contrast. Check out its new histogram like this:
3.15. Histogram Equalization 239
The OpenCV Tutorials, Release 2.4.3
Notice how the number of pixels is more distributed through the intensity range.
Note: Are you wondering how did we draw the Histogram gures shown above? Check out the following tutorial!
3.16 Histogram Calculation
Goal
In this tutorial you will learn how to:
Use the OpenCV function split to divide an image into its correspondent planes.
To calculate histograms of arrays of images by using the OpenCV function calcHist
To normalize an array by using the function normalize
Note: In the last tutorial (Histogram Equalization) we talked about a particular kind of histogram called Image
histogram. Now we will considerate it in its more general concept. Read on!
What are histograms?
Histograms are collected counts of data organized into a set of predened bins
When we say data we are not restricting it to be intensity values (as we saw in the previous Tutorial). The data
collected can be whatever feature you nd useful to describe your image.
Lets see an example. Imagine that a Matrix contains information of an image (i.e. intensity in the range 0255):
240 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
What happens if we want to count this data in an organized way? Since we know that the range of information
value for this case is 256 values, we can segment our range in subparts (called bins) like:
[0, 255] = [0, 15] [16, 31] .... [240, 255]
range = bin
1
bin
2
.... bin
n=15
and we can keep count of the number of pixels that fall in the range of each bin
i
. Applying this to the example
above we get the image below ( axis x represents the bins and axis y the number of pixels in each of them).
3.16. Histogram Calculation 241
The OpenCV Tutorials, Release 2.4.3
This was just a simple example of how an histogram works and why it is useful. An histogram can keep count
not only of color intensities, but of whatever image features that we want to measure (i.e. gradients, directions,
etc).
Lets identify some parts of the histogram:
1. dims: The number of parameters you want to collect data of. In our example, dims = 1 because we are
only counting the intensity values of each pixel (in a greyscale image).
2. bins: It is the number of subdivisions in each dim. In our example, bins = 16
3. range: The limits for the values to be measured. In this case: range = [0,255]
What if you want to count two features? In this case your resulting histogram would be a 3D plot (in which x
and y would be bin
x
and bin
y
for each feature and z would be the number of counts for each combination of
(bin
x
, bin
y
). The same would apply for more features (of course it gets trickier).
What OpenCV offers you
For simple purposes, OpenCV implements the function calcHist, which calculates the histogram of a set of arrays
(usually images or image planes). It can operate with up to 32 dimensions. We will see it in the code below!
Code
What does this program do?
Loads an image
Splits the image into its R, G and B planes using the function split
Calculate the Histogram of each 1-channel plane by calling the function calcHist
Plot the three histograms in a window
Downloadable code: Click here
Code at glance:
#include "opencv2/highgui/highgui.hpp"
#include "opencv2/imgproc/imgproc.hpp"
#include <iostream>
#include <stdio.h>
using namespace std;
using namespace cv;
242 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
/
**
*
@function main
*
/
int main( int argc, char
**
argv )
{
Mat src, dst;
/// Load image
src = imread( argv[1], 1 );
if( !src.data )
{ return -1; }
/// Separate the image in 3 places ( B, G and R )
vector<Mat> bgr
_
planes;
split( src, bgr
_
planes );
/// Establish the number of bins
int histSize = 256;
/// Set the ranges ( for B,G,R) )
float range[] = { 0, 256 } ;
const float
*
histRange = { range };
bool uniform = true; bool accumulate = false;
Mat b
_
hist, g
_
hist, r
_
hist;
/// Compute the histograms:
calcHist( &bgr
_
planes[0], 1, 0, Mat(), b
_
hist, 1, &histSize, &histRange, uniform, accumulate );
calcHist( &bgr
_
planes[1], 1, 0, Mat(), g
_
hist, 1, &histSize, &histRange, uniform, accumulate );
calcHist( &bgr
_
planes[2], 1, 0, Mat(), r
_
hist, 1, &histSize, &histRange, uniform, accumulate );
// Draw the histograms for B, G and R
int hist
_
w = 512; int hist
_
h = 400;
int bin
_
w = cvRound( (double) hist
_
w/histSize );
Mat histImage( hist
_
h, hist
_
w, CV
_
8UC3, Scalar( 0,0,0) );
/// Normalize the result to [ 0, histImage.rows ]
normalize(b
_
hist, b
_
hist, 0, histImage.rows, NORM
_
MINMAX, -1, Mat() );
normalize(g
_
hist, g
_
hist, 0, histImage.rows, NORM
_
MINMAX, -1, Mat() );
normalize(r
_
hist, r
_
hist, 0, histImage.rows, NORM
_
MINMAX, -1, Mat() );
/// Draw for each channel
for( int i = 1; i < histSize; i++ )
{
line( histImage, Point( bin
_
w
*
(i-1), hist
_
h - cvRound(b
_
hist.at<float>(i-1)) ) ,
Point( bin
_
w
*
(i), hist
_
h - cvRound(b
_
hist.at<float>(i)) ),
Scalar( 255, 0, 0), 2, 8, 0 );
line( histImage, Point( bin
_
w
*
(i-1), hist
_
h - cvRound(g
_
hist.at<float>(i-1)) ) ,
Point( bin
_
w
*
(i), hist
_
h - cvRound(g
_
hist.at<float>(i)) ),
Scalar( 0, 255, 0), 2, 8, 0 );
line( histImage, Point( bin
_
w
*
(i-1), hist
_
h - cvRound(r
_
hist.at<float>(i-1)) ) ,
Point( bin
_
w
*
(i), hist
_
h - cvRound(r
_
hist.at<float>(i)) ),
Scalar( 0, 0, 255), 2, 8, 0 );
}
3.16. Histogram Calculation 243
The OpenCV Tutorials, Release 2.4.3
/// Display
namedWindow("calcHist Demo", CV
_
WINDOW
_
AUTOSIZE );
imshow("calcHist Demo", histImage );
waitKey(0);
return 0;
}
Explanation
1. Create the necessary matrices:
Mat src, dst;
2. Load the source image
src = imread( argv[1], 1 );
if( !src.data )
{ return -1; }
3. Separate the source image in its three R,G and B planes. For this we use the OpenCV function split:
vector<Mat> bgr
_
planes;
split( src, bgr
_
planes );
our input is the image to be divided (this case with three channels) and the output is a vector of Mat )
4. Now we are ready to start conguring the histograms for each plane. Since we are working with the B, G and
R planes, we know that our values will range in the interval [0, 255]
(a) Establish number of bins (5, 10...):
int histSize = 256; //from 0 to 255
(b) Set the range of values (as we said, between 0 and 255 )
/// Set the ranges ( for B,G,R) )
float range[] = { 0, 256 } ; //the upper boundary is exclusive
const float
*
histRange = { range };
(c) We want our bins to have the same size (uniform) and to clear the histograms in the beginning, so:
bool uniform = true; bool accumulate = false;
(d) Finally, we create the Mat objects to save our histograms. Creating 3 (one for each plane):
Mat b
_
hist, g
_
hist, r
_
hist;
(e) We proceed to calculate the histograms by using the OpenCV function calcHist:
/// Compute the histograms:
calcHist( &bgr
_
planes[0], 1, 0, Mat(), b
_
hist, 1, &histSize, &histRange, uniform, accumulate );
calcHist( &bgr
_
planes[1], 1, 0, Mat(), g
_
hist, 1, &histSize, &histRange, uniform, accumulate );
calcHist( &bgr
_
planes[2], 1, 0, Mat(), r
_
hist, 1, &histSize, &histRange, uniform, accumulate );
where the arguments are:
&bgr_planes[0]: The source array(s)
244 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
1: The number of source arrays (in this case we are using 1. We can enter here also a list of arrays )
0: The channel (dim) to be measured. In this case it is just the intensity (each array is single-channel)
so we just write 0.
Mat(): A mask to be used on the source array ( zeros indicating pixels to be ignored ). If not dened
it is not used
b_hist: The Mat object where the histogram will be stored
1: The histogram dimensionality.
histSize: The number of bins per each used dimension
histRange: The range of values to be measured per each dimension
uniform and accumulate: The bin sizes are the same and the histogram is cleared at the beginning.
5. Create an image to display the histograms:
// Draw the histograms for R, G and B
int hist
_
w = 512; int hist
_
h = 400;
int bin
_
w = cvRound( (double) hist
_
w/histSize );
Mat histImage( hist
_
h, hist
_
w, CV
_
8UC3, Scalar( 0,0,0) );
6. Notice that before drawing, we rst normalize the histogram so its values fall in the range indicated by the
parameters entered:
/// Normalize the result to [ 0, histImage.rows ]
normalize(b
_
hist, b
_
hist, 0, histImage.rows, NORM
_
MINMAX, -1, Mat() );
normalize(g
_
hist, g
_
hist, 0, histImage.rows, NORM
_
MINMAX, -1, Mat() );
normalize(r
_
hist, r
_
hist, 0, histImage.rows, NORM
_
MINMAX, -1, Mat() );
this function receives these arguments:
b_hist: Input array
b_hist: Output normalized array (can be the same)
0 and**histImage.rows**: For this example, they are the lower and upper limits to normalize the values
of r_hist
NORM_MINMAX: Argument that indicates the type of normalization (as described above, it adjusts the
values between the two limits set before)
-1: Implies that the output normalized array will be the same type as the input
Mat(): Optional mask
7. Finally, observe that to access the bin (in this case in this 1D-Histogram):
/// Draw for each channel
for( int i = 1; i < histSize; i++ )
{
line( histImage, Point( bin
_
w
*
(i-1), hist
_
h - cvRound(b
_
hist.at<float>(i-1)) ) ,
Point( bin
_
w
*
(i), hist
_
h - cvRound(b
_
hist.at<float>(i)) ),
Scalar( 255, 0, 0), 2, 8, 0 );
line( histImage, Point( bin
_
w
*
(i-1), hist
_
h - cvRound(g
_
hist.at<float>(i-1)) ) ,
Point( bin
_
w
*
(i), hist
_
h - cvRound(g
_
hist.at<float>(i)) ),
Scalar( 0, 255, 0), 2, 8, 0 );
line( histImage, Point( bin
_
w
*
(i-1), hist
_
h - cvRound(r
_
hist.at<float>(i-1)) ) ,
Point( bin
_
w
*
(i), hist
_
h - cvRound(r
_
hist.at<float>(i)) ),
3.16. Histogram Calculation 245
The OpenCV Tutorials, Release 2.4.3
Scalar( 0, 0, 255), 2, 8, 0 );
}
we use the expression:
b
_
hist.at<float>(i)
where i indicates the dimension. If it were a 2D-histogram we would use something like:
b
_
hist.at<float>( i, j )
8. Finally we display our histograms and wait for the user to exit:
namedWindow("calcHist Demo", CV
_
WINDOW
_
AUTOSIZE );
imshow("calcHist Demo", histImage );
waitKey(0);
return 0;
Result
1. Using as input argument an image like the shown below:
2. Produces the following histogram:
246 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
3.17 Histogram Comparison
Goal
In this tutorial you will learn how to:
Use the function compareHist to get a numerical parameter that express how well two histograms match with
each other.
Use different metrics to compare histograms
Theory
To compare two histograms ( H
1
and H
2
), rst we have to choose a metric (d(H
1
, H
2
)) to express how well
both histograms match.
OpenCV implements the function compareHist to perform a comparison. It also offers 4 different metrics to
compute the matching:
1. Correlation ( CV_COMP_CORREL )
d(H
1
, H
2
) =
I
(H
1
(I)
H
1
)(H
2
(I)
H
2
)
_
I
(H
1
(I)
H
1
)
2
I
(H
2
(I)
H
2
)
2
3.17. Histogram Comparison 247
The OpenCV Tutorials, Release 2.4.3
where
H
k
=
1
N
J
H
k
(J)
and N is the total number of histogram bins.
2. Chi-Square ( CV_COMP_CHISQR )
d(H
1
, H
2
) =
I
(H
1
(I) H
2
(I))
2
H
1
(I)
3. Intersection ( method=CV_COMP_INTERSECT )
d(H
1
, H
2
) =
I
min(H
1
(I), H
2
(I))
4. Bhattacharyya distance ( CV_COMP_BHATTACHARYYA )
d(H
1
, H
2
) =
1
1
_
H
1
H
2
N
2
I
_
H
1
(I) H
2
(I)
Code
What does this program do?
Loads a base image and 2 test images to be compared with it.
Generate 1 image that is the lower half of the base image
Convert the images to HSV format
Calculate the H-S histogram for all the images and normalize them in order to compare them.
Compare the histogram of the base image with respect to the 2 test histograms, the histogram of the lower
half base image and with the same base image histogram.
Display the numerical matching parameters obtained.
Downloadable code: Click here
Code at glance:
#include "opencv2/highgui/highgui.hpp"
#include "opencv2/imgproc/imgproc.hpp"
#include <iostream>
#include <stdio.h>
using namespace std;
using namespace cv;
/
**
@function main
*
/
int main( int argc, char
**
argv )
{
Mat src
_
base, hsv
_
base;
Mat src
_
test1, hsv
_
test1;
Mat src
_
test2, hsv
_
test2;
Mat hsv
_
half
_
down;
248 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
/// Load three images with different environment settings
if( argc < 4 )
{ printf("
**
Error. Usage: ./compareHist
_
Demo <image
_
settings0> <image
_
setting1> <image
_
settings2>\n");
return -1;
}
src
_
base = imread( argv[1], 1 );
src
_
test1 = imread( argv[2], 1 );
src
_
test2 = imread( argv[3], 1 );
/// Convert to HSV
cvtColor( src
_
base, hsv
_
base, CV
_
BGR2HSV );
cvtColor( src
_
test1, hsv
_
test1, CV
_
BGR2HSV );
cvtColor( src
_
test2, hsv
_
test2, CV
_
BGR2HSV );
hsv
_
half
_
down = hsv
_
base( Range( hsv
_
base.rows/2, hsv
_
base.rows - 1 ), Range( 0, hsv
_
base.cols - 1 ) );
/// Using 30 bins for hue and 32 for saturation
int h
_
bins = 50; int s
_
bins = 60;
int histSize[] = { h
_
bins, s
_
bins };
// hue varies from 0 to 256, saturation from 0 to 180
float h
_
ranges[] = { 0, 256 };
float s
_
ranges[] = { 0, 180 };
const float
*
ranges[] = { h
_
ranges, s
_
ranges };
// Use the o-th and 1-st channels
int channels[] = { 0, 1 };
/// Histograms
MatND hist
_
base;
MatND hist
_
half
_
down;
MatND hist
_
test1;
MatND hist
_
test2;
/// Calculate the histograms for the HSV images
calcHist( &hsv
_
base, 1, channels, Mat(), hist
_
base, 2, histSize, ranges, true, false );
normalize( hist
_
base, hist
_
base, 0, 1, NORM
_
MINMAX, -1, Mat() );
calcHist( &hsv
_
half
_
down, 1, channels, Mat(), hist
_
half
_
down, 2, histSize, ranges, true, false );
normalize( hist
_
half
_
down, hist
_
half
_
down, 0, 1, NORM
_
MINMAX, -1, Mat() );
calcHist( &hsv
_
test1, 1, channels, Mat(), hist
_
test1, 2, histSize, ranges, true, false );
normalize( hist
_
test1, hist
_
test1, 0, 1, NORM
_
MINMAX, -1, Mat() );
calcHist( &hsv
_
test2, 1, channels, Mat(), hist
_
test2, 2, histSize, ranges, true, false );
normalize( hist
_
test2, hist
_
test2, 0, 1, NORM
_
MINMAX, -1, Mat() );
/// Apply the histogram comparison methods
for( int i = 0; i < 4; i++ )
{ int compare
_
method = i;
double base
_
base = compareHist( hist
_
base, hist
_
base, compare
_
method );
double base
_
half = compareHist( hist
_
base, hist
_
half
_
down, compare
_
method );
double base
_
test1 = compareHist( hist
_
base, hist
_
test1, compare
_
method );
double base
_
test2 = compareHist( hist
_
base, hist
_
test2, compare
_
method );
printf( " Method [%d] Perfect, Base-Half, Base-Test(1), Base-Test(2) : %f, %f, %f, %f \n", i, base
_
base, base
_
half , base
_
test1, base
_
test2 );
3.17. Histogram Comparison 249
The OpenCV Tutorials, Release 2.4.3
}
printf( "Done \n" );
return 0;
}
Explanation
1. Declare variables such as the matrices to store the base image and the two other images to compare ( RGB and
HSV )
Mat src
_
base, hsv
_
base;
Mat src
_
test1, hsv
_
test1;
Mat src
_
test2, hsv
_
test2;
Mat hsv
_
half
_
down;
2. Load the base image (src_base) and the other two test images:
if( argc < 4 )
{ printf("
**
Error. Usage: ./compareHist
_
Demo <image
_
settings0> <image
_
setting1> <image
_
settings2>\n");
return -1;
}
src
_
base = imread( argv[1], 1 );
src
_
test1 = imread( argv[2], 1 );
src
_
test2 = imread( argv[3], 1 );
3. Convert them to HSV format:
cvtColor( src
_
base, hsv
_
base, CV
_
BGR2HSV );
cvtColor( src
_
test1, hsv
_
test1, CV
_
BGR2HSV );
cvtColor( src
_
test2, hsv
_
test2, CV
_
BGR2HSV );
4. Also, create an image of half the base image (in HSV format):
hsv
_
half
_
down = hsv
_
base( Range( hsv
_
base.rows/2, hsv
_
base.rows - 1 ), Range( 0, hsv
_
base.cols - 1 ) );
5. Initialize the arguments to calculate the histograms (bins, ranges and channels H and S ).
int h
_
bins = 50; int s
_
bins = 32;
int histSize[] = { h
_
bins, s
_
bins };
float h
_
ranges[] = { 0, 256 };
float s
_
ranges[] = { 0, 180 };
const float
*
ranges[] = { h
_
ranges, s
_
ranges };
int channels[] = { 0, 1 };
6. Create the MatND objects to store the histograms:
MatND hist
_
base;
MatND hist
_
half
_
down;
MatND hist
_
test1;
MatND hist
_
test2;
7. Calculate the Histograms for the base image, the 2 test images and the half-down base image:
250 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
calcHist( &hsv
_
base, 1, channels, Mat(), hist
_
base, 2, histSize, ranges, true, false );
normalize( hist
_
base, hist
_
base, 0, 1, NORM
_
MINMAX, -1, Mat() );
calcHist( &hsv
_
half
_
down, 1, channels, Mat(), hist
_
half
_
down, 2, histSize, ranges, true, false );
normalize( hist
_
half
_
down, hist
_
half
_
down, 0, 1, NORM
_
MINMAX, -1, Mat() );
calcHist( &hsv
_
test1, 1, channels, Mat(), hist
_
test1, 2, histSize, ranges, true, false );
normalize( hist
_
test1, hist
_
test1, 0, 1, NORM
_
MINMAX, -1, Mat() );
calcHist( &hsv
_
test2, 1, channels, Mat(), hist
_
test2, 2, histSize, ranges, true, false );
normalize( hist
_
test2, hist
_
test2, 0, 1, NORM
_
MINMAX, -1, Mat() );
8. Apply sequentially the 4 comparison methods between the histogram of the base image (hist_base) and the other
histograms:
for( int i = 0; i < 4; i++ )
{ int compare
_
method = i;
double base
_
base = compareHist( hist
_
base, hist
_
base, compare
_
method );
double base
_
half = compareHist( hist
_
base, hist
_
half
_
down, compare
_
method );
double base
_
test1 = compareHist( hist
_
base, hist
_
test1, compare
_
method );
double base
_
test2 = compareHist( hist
_
base, hist
_
test2, compare
_
method );
printf( " Method [%d] Perfect, Base-Half, Base-Test(1), Base-Test(2) : %f, %f, %f, %f \n", i, base
_
base, base
_
half , base
_
test1, base
_
test2 );
}
Results
1. We use as input the following images:
where the rst one is the base (to be compared to the others), the other 2 are the test images. We will also
compare the rst image with respect to itself and with respect of half the base image.
2. We should expect a perfect match when we compare the base image histogram with itself. Also, compared with
the histogram of half the base image, it should present a high match since both are from the same source. For the
other two test images, we can observe that they have very different lighting conditions, so the matching should
not be very good:
3. Here the numeric results:
Method Base - Base Base - Half Base - Test 1 Base - Test 2
Correlation 1.000000 0.930766 0.182073 0.120447
Chi-square 0.000000 4.940466 21.184536 49.273437
Intersection 24.391548 14.959809 3.889029 5.775088
Bhattacharyya 0.000000 0.222609 0.646576 0.801869
3.17. Histogram Comparison 251
The OpenCV Tutorials, Release 2.4.3
For the Correlation and Intersection methods, the higher the metric, the more accurate the match. As
we can see, the match base-base is the highest of all as expected. Also we can observe that the match
base-half is the second best match (as we predicted). For the other two metrics, the less the result, the
better the match. We can observe that the matches between the test 1 and test 2 with respect to the base
are worse, which again, was expected.
3.18 Back Projection
Goal
In this tutorial you will learn:
What is Back Projection and why it is useful
How to use the OpenCV function calcBackProject to calculate Back Projection
How to mix different channels of an image by using the OpenCV function mixChannels
Theory
What is Back Projection?
Back Projection is a way of recording how well the pixels of a given image t the distribution of pixels in a
histogram model.
To make it simpler: For Back Projection, you calculate the histogram model of a feature and then use it to nd
this feature in an image.
Application example: If you have a histogram of esh color (say, a Hue-Saturation histogram ), then you can
use it to nd esh color areas in an image:
How does it work?
We explain this by using the skin example:
Lets say you have gotten a skin histogram (Hue-Saturation) based on the image below. The histogram besides
is going to be our model histogram (which we know represents a sample of skin tonality). You applied some
mask to capture only the histogram of the skin area:
252 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
Now, lets imagine that you get another hand image (Test Image) like the one below: (with its respective his-
togram):
What we want to do is to use our model histogram (that we know represents a skin tonality) to detect skin areas
in our Test Image. Here are the steps
1. In each pixel of our Test Image (i.e. p(i, j) ), collect the data and nd the correspondent bin location for
that pixel (i.e. (h
i,j
, s
i,j
) ).
2. Lookup the model histogram in the correspondent bin - (h
i,j
, s
i,j
) - and read the bin value.
3. Store this bin value in a new image (BackProjection). Also, you may consider to normalize the model
histogram rst, so the output for the Test Image can be visible for you.
4. Applying the steps above, we get the following BackProjection image for our Test Image:
3.18. Back Projection 253
The OpenCV Tutorials, Release 2.4.3
5. In terms of statistics, the values stored in BackProjection represent the probability that a pixel in Test
Image belongs to a skin area, based on the model histogram that we use. For instance in our Test image,
the brighter areas are more probable to be skin area (as they actually are), whereas the darker areas have
less probability (notice that these dark areas belong to surfaces that have some shadow on it, which in
turns affects the detection).
Code
What does this program do?
Loads an image
Convert the original to HSV format and separate only Hue channel to be used for the Histogram (using the
OpenCV function mixChannels)
Let the user to enter the number of bins to be used in the calculation of the histogram.
Calculate the histogram (and update it if the bins change) and the backprojection of the same image.
Display the backprojection and the histogram in windows.
Downloadable code:
1. Click here for the basic version (explained in this tutorial).
2. For stuff slightly fancier (using H-S histograms and oodFill to dene a mask for the skin area) you can
check the improved demo
3. ...or you can always check out the classical camshiftdemo in samples.
Code at glance:
#include "opencv2/imgproc/imgproc.hpp"
#include "opencv2/highgui/highgui.hpp"
#include <iostream>
using namespace cv;
using namespace std;
/// Global Variables
Mat src; Mat hsv; Mat hue;
int bins = 25;
254 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
/// Function Headers
void Hist
_
and
_
Backproj(int, void
*
);
/
**
@function main
*
/
int main( int argc, char
**
argv )
{
/// Read the image
src = imread( argv[1], 1 );
/// Transform it to HSV
cvtColor( src, hsv, CV
_
BGR2HSV );
/// Use only the Hue value
hue.create( hsv.size(), hsv.depth() );
int ch[] = { 0, 0 };
mixChannels( &hsv, 1, &hue, 1, ch, 1 );
/// Create Trackbar to enter the number of bins
char
*
window
_
image = "Source image";
namedWindow( window
_
image, CV
_
WINDOW
_
AUTOSIZE );
createTrackbar("
*
Hue bins: ", window
_
image, &bins, 180, Hist
_
and
_
Backproj );
Hist
_
and
_
Backproj(0, 0);
/// Show the image
imshow( window
_
image, src );
/// Wait until user exits the program
waitKey(0);
return 0;
}
/
**
*
@function Hist
_
and
_
Backproj
*
@brief Callback to Trackbar
*
/
void Hist
_
and
_
Backproj(int, void
*
)
{
MatND hist;
int histSize = MAX( bins, 2 );
float hue
_
range[] = { 0, 180 };
const float
*
ranges = { hue
_
range };
/// Get the Histogram and normalize it
calcHist( &hue, 1, 0, Mat(), hist, 1, &histSize, &ranges, true, false );
normalize( hist, hist, 0, 255, NORM
_
MINMAX, -1, Mat() );
/// Get Backprojection
MatND backproj;
calcBackProject( &hue, 1, 0, hist, backproj, &ranges, 1, true );
/// Draw the backproj
imshow( "BackProj", backproj );
/// Draw the histogram
int w = 400; int h = 400;
int bin
_
w = cvRound( (double) w / histSize );
Mat histImg = Mat::zeros( w, h, CV
_
8UC3 );
3.18. Back Projection 255
The OpenCV Tutorials, Release 2.4.3
for( int i = 0; i < bins; i ++ )
{ rectangle( histImg, Point( i
*
bin
_
w, h ), Point( (i+1)
*
bin
_
w, h - cvRound( hist.at<float>(i)
*
h/255.0 ) ), Scalar( 0, 0, 255 ), -1 ); }
imshow( "Histogram", histImg );
}
Explanation
1. Declare the matrices to store our images and initialize the number of bins to be used by our histogram:
Mat src; Mat hsv; Mat hue;
int bins = 25;
2. Read the input image and transform it to HSV format:
src = imread( argv[1], 1 );
cvtColor( src, hsv, CV
_
BGR2HSV );
3. For this tutorial, we will use only the Hue value for our 1-D histogram (check out the fancier code in the links
above if you want to use the more standard H-S histogram, which yields better results):
hue.create( hsv.size(), hsv.depth() );
int ch[] = { 0, 0 };
mixChannels( &hsv, 1, &hue, 1, ch, 1 );
as you see, we use the function http://opencv.itseez.com/modules/core/doc/operations_on_arrays.html?highlight=mixchannels#mixchannelsmixChannels
to get only the channel 0 (Hue) from the hsv image. It gets the following parameters:
&hsv: The source array from which the channels will be copied
1: The number of source arrays
&hue: The destination array of the copied channels
1: The number of destination arrays
ch[] = {0,0}: The array of index pairs indicating how the channels are copied. In this case, the Hue(0)
channel of &hsv is being copied to the 0 channel of &hue (1-channel)
1: Number of index pairs
4. Create a Trackbar for the user to enter the bin values. Any change on the Trackbar means a call to the
Hist_and_Backproj callback function.
char
*
window
_
image = "Source image";
namedWindow( window
_
image, CV
_
WINDOW
_
AUTOSIZE );
createTrackbar("
*
Hue bins: ", window
_
image, &bins, 180, Hist
_
and
_
Backproj );
Hist
_
and
_
Backproj(0, 0);
5. Show the image and wait for the user to exit the program:
imshow( window
_
image, src );
waitKey(0);
return 0;
6. Hist_and_Backproj function: Initialize the arguments needed for calcHist. The number of bins comes from
the Trackbar:
256 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
void Hist
_
and
_
Backproj(int, void
*
)
{
MatND hist;
int histSize = MAX( bins, 2 );
float hue
_
range[] = { 0, 180 };
const float
*
ranges = { hue
_
range };
7. Calculate the Histogram and normalize it to the range [0, 255]
calcHist( &hue, 1, 0, Mat(), hist, 1, &histSize, &ranges, true, false );
normalize( hist, hist, 0, 255, NORM
_
MINMAX, -1, Mat() );
8. Get the Backprojection of the same image by calling the function calcBackProject
MatND backproj;
calcBackProject( &hue, 1, 0, hist, backproj, &ranges, 1, true );
all the arguments are known (the same as used to calculate the histogram), only we add the backproj matrix,
which will store the backprojection of the source image (&hue)
9. Display backproj:
imshow( "BackProj", backproj );
10. Draw the 1-D Hue histogram of the image:
int w = 400; int h = 400;
int bin
_
w = cvRound( (double) w / histSize );
Mat histImg = Mat::zeros( w, h, CV
_
8UC3 );
for( int i = 0; i < bins; i ++ )
{ rectangle( histImg, Point( i
*
bin
_
w, h ), Point( (i+1)
*
bin
_
w, h - cvRound( hist.at<float>(i)
*
h/255.0 ) ), Scalar( 0, 0, 255 ), -1 ); }
imshow( "Histogram", histImg );
Results
1. Here are the output by using a sample image ( guess what? Another hand ). You can play with the bin values
and you will observe how it affects the results:
3.18. Back Projection 257
The OpenCV Tutorials, Release 2.4.3
3.19 Template Matching
Goal
In this tutorial you will learn how to:
Use the OpenCV function matchTemplate to search for matches between an image patch and an input image
Use the OpenCV function minMaxLoc to nd the maximum and minimum values (as well as their positions) in
a given array.
Theory
What is template matching?
Template matching is a technique for nding areas of an image that match (are similar) to a template image (patch).
How does it work?
We need two primary components:
1. Source image (I): The image in which we expect to nd a match to the template image
2. Template image (T): The patch image which will be compared to the template image
our goal is to detect the highest matching area:
To identify the matching area, we have to compare the template image against the source image by sliding it:
258 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
By sliding, we mean moving the patch one pixel at a time (left to right, up to down). At each location, a metric
is calculated so it represents how good or bad the match at that location is (or how similar the patch is to
that particular area of the source image).
For each location of T over I, you store the metric in the result matrix (R). Each location (x, y) in R contains
the match metric:
the image above is the result R of sliding the patch with a metric TM_CCORR_NORMED. The brightest
locations indicate the highest matches. As you can see, the location marked by the red circle is probably the
one with the highest value, so that location (the rectangle formed by that point as a corner and width and height
equal to the patch image) is considered the match.
3.19. Template Matching 259
The OpenCV Tutorials, Release 2.4.3
In practice, we use the function minMaxLoc to locate the highest value (or lower, depending of the type of
matching method) in the R matrix.
Which are the matching methods available in OpenCV?
Good question. OpenCV implements Template matching in the function matchTemplate. The available methods are
6:
1. method=CV_TM_SQDIFF
R(x, y) =
,y
(T(x
, y
) I(x +x
, y +y
))
2
2. method=CV_TM_SQDIFF_NORMED
R(x, y) =
,y
(T(x
, y
) I(x +x
, y +y
))
2
_
,y
T(x
, y
)
2
,y
I(x +x
, y +y
)
2
3. method=CV_TM_CCORR
R(x, y) =
,y
(T(x
, y
) I(x +x
, y +y
))
4. method=CV_TM_CCORR_NORMED
R(x, y) =
,y
(T(x
, y
) I
(x +x
, y +y
))
_
,y
T(x
, y
)
2
,y
I(x +x
, y +y
)
2
5. method=CV_TM_CCOEFF
R(x, y) =
,y
(T
(x
, y
) I(x +x
, y +y
))
where
T
(x
, y
) = T(x
, y
) 1/(w h)
,y
T(x
, y
)
I
(x +x
, y +y
) = I(x +x
, y +y
) 1/(w h)
,y
I(x +x
, y +y
)
6. method=CV_TM_CCOEFF_NORMED
R(x, y) =
,y
(T
(x
, y
) I
(x +x
, y +y
))
_
,y
T
(x
, y
)
2
,y
I
(x +x
, y +y
)
2
Code
What does this program do?
Loads an input image and a image patch (template)
Perform a template matching procedure by using the OpenCV function matchTemplate with any of the
6 matching methods described before. The user can choose the method by entering its selection in the
Trackbar.
260 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
Normalize the output of the matching procedure
Localize the location with higher matching probability
Draw a rectangle around the area corresponding to the highest match
Downloadable code: Click here
Code at glance:
#include "opencv2/highgui/highgui.hpp"
#include "opencv2/imgproc/imgproc.hpp"
#include <iostream>
#include <stdio.h>
using namespace std;
using namespace cv;
/// Global Variables
Mat img; Mat templ; Mat result;
char
*
image
_
window = "Source Image";
char
*
result
_
window = "Result window";
int match
_
method;
int max
_
Trackbar = 5;
/// Function Headers
void MatchingMethod( int, void
*
);
/
**
@function main
*
/
int main( int argc, char
**
argv )
{
/// Load image and template
img = imread( argv[1], 1 );
templ = imread( argv[2], 1 );
/// Create windows
namedWindow( image
_
window, CV
_
WINDOW
_
AUTOSIZE );
namedWindow( result
_
window, CV
_
WINDOW
_
AUTOSIZE );
/// Create Trackbar
char
*
trackbar
_
label = "Method: \n 0: SQDIFF \n 1: SQDIFF NORMED \n 2: TM CCORR \n 3: TM CCORR NORMED \n 4: TM COEFF \n 5: TM COEFF NORMED";
createTrackbar( trackbar
_
label, image
_
window, &match
_
method, max
_
Trackbar, MatchingMethod );
MatchingMethod( 0, 0 );
waitKey(0);
return 0;
}
/
**
*
@function MatchingMethod
*
@brief Trackbar callback
*
/
void MatchingMethod( int, void
*
)
{
/// Source image to display
Mat img
_
display;
img.copyTo( img
_
display );
3.19. Template Matching 261
The OpenCV Tutorials, Release 2.4.3
/// Create the result matrix
int result
_
cols = img.cols - templ.cols + 1;
int result
_
rows = img.rows - templ.rows + 1;
result.create( result
_
cols, result
_
rows, CV
_
32FC1 );
/// Do the Matching and Normalize
matchTemplate( img, templ, result, match
_
method );
normalize( result, result, 0, 1, NORM
_
MINMAX, -1, Mat() );
/// Localizing the best match with minMaxLoc
double minVal; double maxVal; Point minLoc; Point maxLoc;
Point matchLoc;
minMaxLoc( result, &minVal, &maxVal, &minLoc, &maxLoc, Mat() );
/// For SQDIFF and SQDIFF
_
NORMED, the best matches are lower values. For all the other methods, the higher the better
if( match
_
method == CV
_
TM
_
SQDIFF || match
_
method == CV
_
TM
_
SQDIFF
_
NORMED )
{ matchLoc = minLoc; }
else
{ matchLoc = maxLoc; }
/// Show me what you got
rectangle( img
_
display, matchLoc, Point( matchLoc.x + templ.cols , matchLoc.y + templ.rows ), Scalar::all(0), 2, 8, 0 );
rectangle( result, matchLoc, Point( matchLoc.x + templ.cols , matchLoc.y + templ.rows ), Scalar::all(0), 2, 8, 0 );
imshow( image
_
window, img
_
display );
imshow( result
_
window, result );
return;
}
Explanation
1. Declare some global variables, such as the image, template and result matrices, as well as the match method and
the window names:
Mat img; Mat templ; Mat result;
char
*
image
_
window = "Source Image";
char
*
result
_
window = "Result window";
int match
_
method;
int max
_
Trackbar = 5;
2. Load the source image and template:
img = imread( argv[1], 1 );
templ = imread( argv[2], 1 );
3. Create the windows to show the results:
namedWindow( image
_
window, CV
_
WINDOW
_
AUTOSIZE );
namedWindow( result
_
window, CV
_
WINDOW
_
AUTOSIZE );
4. Create the Trackbar to enter the kind of matching method to be used. When a change is detected the callback
function MatchingMethod is called.
262 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
char
*
trackbar
_
label = "Method: \n 0: SQDIFF \n 1: SQDIFF NORMED \n 2: TM CCORR \n 3: TM CCORR NORMED \n 4: TM COEFF \n 5: TM COEFF NORMED";
createTrackbar( trackbar
_
label, image
_
window, &match
_
method, max
_
Trackbar, MatchingMethod );
5. Wait until user exits the program.
waitKey(0);
return 0;
6. Lets check out the callback function. First, it makes a copy of the source image:
Mat img
_
display;
img.copyTo( img
_
display );
7. Next, it creates the result matrix that will store the matching results for each template location. Observe in detail
the size of the result matrix (which matches all possible locations for it)
int result
_
cols = img.cols - templ.cols + 1;
int result
_
rows = img.rows - templ.rows + 1;
result.create( result
_
cols, result
_
rows, CV
_
32FC1 );
8. Perform the template matching operation:
matchTemplate( img, templ, result, match
_
method );
the arguments are naturally the input image I, the template T, the result R and the match_method (given by the
Trackbar)
9. We normalize the results:
normalize( result, result, 0, 1, NORM
_
MINMAX, -1, Mat() );
10. We localize the minimum and maximum values in the result matrix R by using minMaxLoc.
double minVal; double maxVal; Point minLoc; Point maxLoc;
Point matchLoc;
minMaxLoc( result, &minVal, &maxVal, &minLoc, &maxLoc, Mat() );
the function calls as arguments:
result: The source array
&minVal and &maxVal: Variables to save the minimum and maximum values in result
&minLoc and &maxLoc: The Point locations of the minimum and maximum values in the array.
Mat(): Optional mask
11. For the rst two methods ( CV_SQDIFF and CV_SQDIFF_NORMED ) the best match are the lowest values.
For all the others, higher values represent better matches. So, we save the corresponding value in the matchLoc
variable:
if( match
_
method == CV
_
TM
_
SQDIFF || match
_
method == CV
_
TM
_
SQDIFF
_
NORMED )
{ matchLoc = minLoc; }
else
{ matchLoc = maxLoc; }
12. Display the source image and the result matrix. Draw a rectangle around the highest possible matching area:
3.19. Template Matching 263
The OpenCV Tutorials, Release 2.4.3
rectangle( img
_
display, matchLoc, Point( matchLoc.x + templ.cols , matchLoc.y + templ.rows ), Scalar::all(0), 2, 8, 0 );
rectangle( result, matchLoc, Point( matchLoc.x + templ.cols , matchLoc.y + templ.rows ), Scalar::all(0), 2, 8, 0 );
imshow( image
_
window, img
_
display );
imshow( result
_
window, result );
Results
1. Testing our program with an input image such as:
and a template image:
2. Generate the following result matrices (rst row are the standard methods SQDIFF, CCORR and CCOEFF,
second row are the same methods in its normalized version). In the rst column, the darkest is the better match,
for the other two columns, the brighter a location, the higher the match.
264 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
3. The right match is shown below (black rectangle around the face of the guy at the right). Notice that CCORR
and CCDEFF gave erroneous best matches, however their normalized version did it right, this may be due to the
fact that we are only considering the highest match and not the other possible high matches.
3.19. Template Matching 265
The OpenCV Tutorials, Release 2.4.3
3.20 Finding contours in your image
Goal
In this tutorial you will learn how to:
Use the OpenCV function ndContours
Use the OpenCV function drawContours
Theory
Code
This tutorial codes is shown lines below. You can also download it from here
#include "opencv2/highgui/highgui.hpp"
#include "opencv2/imgproc/imgproc.hpp"
#include <iostream>
#include <stdio.h>
#include <stdlib.h>
using namespace cv;
using namespace std;
Mat src; Mat src
_
gray;
int thresh = 100;
int max
_
thresh = 255;
RNG rng(12345);
/// Function header
void thresh
_
callback(int, void
*
);
/
**
@function main
*
/
int main( int argc, char
**
argv )
{
/// Load source image and convert it to gray
src = imread( argv[1], 1 );
/// Convert image to gray and blur it
cvtColor( src, src
_
gray, CV
_
BGR2GRAY );
blur( src
_
gray, src
_
gray, Size(3,3) );
/// Create Window
char
*
source
_
window = "Source";
namedWindow( source
_
window, CV
_
WINDOW
_
AUTOSIZE );
imshow( source
_
window, src );
createTrackbar( " Canny thresh:", "Source", &thresh, max
_
thresh, thresh
_
callback );
thresh
_
callback( 0, 0 );
waitKey(0);
return(0);
}
/
**
@function thresh
_
callback
*
/
266 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
void thresh
_
callback(int, void
*
)
{
Mat canny
_
output;
vector<vector<Point> > contours;
vector<Vec4i> hierarchy;
/// Detect edges using canny
Canny( src
_
gray, canny
_
output, thresh, thresh
*
2, 3 );
/// Find contours
findContours( canny
_
output, contours, hierarchy, CV
_
RETR
_
TREE, CV
_
CHAIN
_
APPROX
_
SIMPLE, Point(0, 0) );
/// Draw contours
Mat drawing = Mat::zeros( canny
_
output.size(), CV
_
8UC3 );
for( int i = 0; i< contours.size(); i++ )
{
Scalar color = Scalar( rng.uniform(0, 255), rng.uniform(0,255), rng.uniform(0,255) );
drawContours( drawing, contours, i, color, 2, 8, hierarchy, 0, Point() );
}
/// Show in a window
namedWindow( "Contours", CV
_
WINDOW
_
AUTOSIZE );
imshow( "Contours", drawing );
}
Explanation
Result
1. Here it is:
3.21 Convex Hull
Goal
In this tutorial you will learn how to:
3.21. Convex Hull 267
The OpenCV Tutorials, Release 2.4.3
Use the OpenCV function convexHull
Theory
Code
This tutorial codes is shown lines below. You can also download it from here
#include "opencv2/highgui/highgui.hpp"
#include "opencv2/imgproc/imgproc.hpp"
#include <iostream>
#include <stdio.h>
#include <stdlib.h>
using namespace cv;
using namespace std;
Mat src; Mat src
_
gray;
int thresh = 100;
int max
_
thresh = 255;
RNG rng(12345);
/// Function header
void thresh
_
callback(int, void
*
);
/
**
@function main
*
/
int main( int argc, char
**
argv )
{
/// Load source image and convert it to gray
src = imread( argv[1], 1 );
/// Convert image to gray and blur it
cvtColor( src, src
_
gray, CV
_
BGR2GRAY );
blur( src
_
gray, src
_
gray, Size(3,3) );
/// Create Window
char
*
source
_
window = "Source";
namedWindow( source
_
window, CV
_
WINDOW
_
AUTOSIZE );
imshow( source
_
window, src );
createTrackbar( " Threshold:", "Source", &thresh, max
_
thresh, thresh
_
callback );
thresh
_
callback( 0, 0 );
waitKey(0);
return(0);
}
/
**
@function thresh
_
callback
*
/
void thresh
_
callback(int, void
*
)
{
Mat src
_
copy = src.clone();
Mat threshold
_
output;
vector<vector<Point> > contours;
vector<Vec4i> hierarchy;
/// Detect edges using Threshold
threshold( src
_
gray, threshold
_
output, thresh, 255, THRESH
_
BINARY );
268 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
/// Find contours
findContours( threshold
_
output, contours, hierarchy, CV
_
RETR
_
TREE, CV
_
CHAIN
_
APPROX
_
SIMPLE, Point(0, 0) );
/// Find the convex hull object for each contour
vector<vector<Point> >hull( contours.size() );
for( int i = 0; i < contours.size(); i++ )
{ convexHull( Mat(contours[i]), hull[i], false ); }
/// Draw contours + hull results
Mat drawing = Mat::zeros( threshold
_
output.size(), CV
_
8UC3 );
for( int i = 0; i< contours.size(); i++ )
{
Scalar color = Scalar( rng.uniform(0, 255), rng.uniform(0,255), rng.uniform(0,255) );
drawContours( drawing, contours, i, color, 1, 8, vector<Vec4i>(), 0, Point() );
drawContours( drawing, hull, i, color, 1, 8, vector<Vec4i>(), 0, Point() );
}
/// Show in a window
namedWindow( "Hull demo", CV
_
WINDOW
_
AUTOSIZE );
imshow( "Hull demo", drawing );
}
Explanation
Result
1. Here it is:
3.22 Creating Bounding boxes and circles for contours
Goal
In this tutorial you will learn how to:
Use the OpenCV function boundingRect
3.22. Creating Bounding boxes and circles for contours 269
The OpenCV Tutorials, Release 2.4.3
Use the OpenCV function minEnclosingCircle
Theory
Code
This tutorial codes is shown lines below. You can also download it from here
#include "opencv2/highgui/highgui.hpp"
#include "opencv2/imgproc/imgproc.hpp"
#include <iostream>
#include <stdio.h>
#include <stdlib.h>
using namespace cv;
using namespace std;
Mat src; Mat src
_
gray;
int thresh = 100;
int max
_
thresh = 255;
RNG rng(12345);
/// Function header
void thresh
_
callback(int, void
*
);
/
**
@function main
*
/
int main( int argc, char
**
argv )
{
/// Load source image and convert it to gray
src = imread( argv[1], 1 );
/// Convert image to gray and blur it
cvtColor( src, src
_
gray, CV
_
BGR2GRAY );
blur( src
_
gray, src
_
gray, Size(3,3) );
/// Create Window
char
*
source
_
window = "Source";
namedWindow( source
_
window, CV
_
WINDOW
_
AUTOSIZE );
imshow( source
_
window, src );
createTrackbar( " Threshold:", "Source", &thresh, max
_
thresh, thresh
_
callback );
thresh
_
callback( 0, 0 );
waitKey(0);
return(0);
}
/
**
@function thresh
_
callback
*
/
void thresh
_
callback(int, void
*
)
{
Mat threshold
_
output;
vector<vector<Point> > contours;
vector<Vec4i> hierarchy;
/// Detect edges using Threshold
threshold( src
_
gray, threshold
_
output, thresh, 255, THRESH
_
BINARY );
/// Find contours
270 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
findContours( threshold
_
output, contours, hierarchy, CV
_
RETR
_
TREE, CV
_
CHAIN
_
APPROX
_
SIMPLE, Point(0, 0) );
/// Approximate contours to polygons + get bounding rects and circles
vector<vector<Point> > contours
_
poly( contours.size() );
vector<Rect> boundRect( contours.size() );
vector<Point2f>center( contours.size() );
vector<float>radius( contours.size() );
for( int i = 0; i < contours.size(); i++ )
{ approxPolyDP( Mat(contours[i]), contours
_
poly[i], 3, true );
boundRect[i] = boundingRect( Mat(contours
_
poly[i]) );
minEnclosingCircle( contours
_
poly[i], center[i], radius[i] );
}
/// Draw polygonal contour + bonding rects + circles
Mat drawing = Mat::zeros( threshold
_
output.size(), CV
_
8UC3 );
for( int i = 0; i< contours.size(); i++ )
{
Scalar color = Scalar( rng.uniform(0, 255), rng.uniform(0,255), rng.uniform(0,255) );
drawContours( drawing, contours
_
poly, i, color, 1, 8, vector<Vec4i>(), 0, Point() );
rectangle( drawing, boundRect[i].tl(), boundRect[i].br(), color, 2, 8, 0 );
circle( drawing, center[i], (int)radius[i], color, 2, 8, 0 );
}
/// Show in a window
namedWindow( "Contours", CV
_
WINDOW
_
AUTOSIZE );
imshow( "Contours", drawing );
}
Explanation
Result
1. Here it is:
3.22. Creating Bounding boxes and circles for contours 271
The OpenCV Tutorials, Release 2.4.3
3.23 Creating Bounding rotated boxes and ellipses for contours
Goal
In this tutorial you will learn how to:
Use the OpenCV function minAreaRect
Use the OpenCV function tEllipse
Theory
Code
This tutorial codes is shown lines below. You can also download it from here
#include "opencv2/highgui/highgui.hpp"
#include "opencv2/imgproc/imgproc.hpp"
#include <iostream>
#include <stdio.h>
#include <stdlib.h>
using namespace cv;
using namespace std;
Mat src; Mat src
_
gray;
int thresh = 100;
int max
_
thresh = 255;
RNG rng(12345);
/// Function header
void thresh
_
callback(int, void
*
);
/
**
@function main
*
/
int main( int argc, char
**
argv )
{
/// Load source image and convert it to gray
src = imread( argv[1], 1 );
/// Convert image to gray and blur it
cvtColor( src, src
_
gray, CV
_
BGR2GRAY );
blur( src
_
gray, src
_
gray, Size(3,3) );
/// Create Window
char
*
source
_
window = "Source";
namedWindow( source
_
window, CV
_
WINDOW
_
AUTOSIZE );
imshow( source
_
window, src );
createTrackbar( " Threshold:", "Source", &thresh, max
_
thresh, thresh
_
callback );
thresh
_
callback( 0, 0 );
waitKey(0);
return(0);
}
/
**
@function thresh
_
callback
*
/
272 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
void thresh
_
callback(int, void
*
)
{
Mat threshold
_
output;
vector<vector<Point> > contours;
vector<Vec4i> hierarchy;
/// Detect edges using Threshold
threshold( src
_
gray, threshold
_
output, thresh, 255, THRESH
_
BINARY );
/// Find contours
findContours( threshold
_
output, contours, hierarchy, CV
_
RETR
_
TREE, CV
_
CHAIN
_
APPROX
_
SIMPLE, Point(0, 0) );
/// Find the rotated rectangles and ellipses for each contour
vector<RotatedRect> minRect( contours.size() );
vector<RotatedRect> minEllipse( contours.size() );
for( int i = 0; i < contours.size(); i++ )
{ minRect[i] = minAreaRect( Mat(contours[i]) );
if( contours[i].size() > 5 )
{ minEllipse[i] = fitEllipse( Mat(contours[i]) ); }
}
/// Draw contours + rotated rects + ellipses
Mat drawing = Mat::zeros( threshold
_
output.size(), CV
_
8UC3 );
for( int i = 0; i< contours.size(); i++ )
{
Scalar color = Scalar( rng.uniform(0, 255), rng.uniform(0,255), rng.uniform(0,255) );
// contour
drawContours( drawing, contours, i, color, 1, 8, vector<Vec4i>(), 0, Point() );
// ellipse
ellipse( drawing, minEllipse[i], color, 2, 8 );
// rotated rectangle
Point2f rect
_
points[4]; minRect[i].points( rect
_
points );
for( int j = 0; j < 4; j++ )
line( drawing, rect
_
points[j], rect
_
points[(j+1)%4], color, 1, 8 );
}
/// Show in a window
namedWindow( "Contours", CV
_
WINDOW
_
AUTOSIZE );
imshow( "Contours", drawing );
}
Explanation
Result
1. Here it is:
3.23. Creating Bounding rotated boxes and ellipses for contours 273
The OpenCV Tutorials, Release 2.4.3
3.24 Image Moments
Goal
In this tutorial you will learn how to:
Use the OpenCV function moments
Use the OpenCV function contourArea
Use the OpenCV function arcLength
Theory
Code
This tutorial codes is shown lines below. You can also download it from here
#include "opencv2/highgui/highgui.hpp"
#include "opencv2/imgproc/imgproc.hpp"
#include <iostream>
#include <stdio.h>
#include <stdlib.h>
using namespace cv;
using namespace std;
Mat src; Mat src
_
gray;
int thresh = 100;
int max
_
thresh = 255;
RNG rng(12345);
/// Function header
void thresh
_
callback(int, void
*
);
/
**
@function main
*
/
int main( int argc, char
**
argv )
{
274 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
/// Load source image and convert it to gray
src = imread( argv[1], 1 );
/// Convert image to gray and blur it
cvtColor( src, src
_
gray, CV
_
BGR2GRAY );
blur( src
_
gray, src
_
gray, Size(3,3) );
/// Create Window
char
*
source
_
window = "Source";
namedWindow( source
_
window, CV
_
WINDOW
_
AUTOSIZE );
imshow( source
_
window, src );
createTrackbar( " Canny thresh:", "Source", &thresh, max
_
thresh, thresh
_
callback );
thresh
_
callback( 0, 0 );
waitKey(0);
return(0);
}
/
**
@function thresh
_
callback
*
/
void thresh
_
callback(int, void
*
)
{
Mat canny
_
output;
vector<vector<Point> > contours;
vector<Vec4i> hierarchy;
/// Detect edges using canny
Canny( src
_
gray, canny
_
output, thresh, thresh
*
2, 3 );
/// Find contours
findContours( canny
_
output, contours, hierarchy, CV
_
RETR
_
TREE, CV
_
CHAIN
_
APPROX
_
SIMPLE, Point(0, 0) );
/// Get the moments
vector<Moments> mu(contours.size() );
for( int i = 0; i < contours.size(); i++ )
{ mu[i] = moments( contours[i], false ); }
/// Get the mass centers:
vector<Point2f> mc( contours.size() );
for( int i = 0; i < contours.size(); i++ )
{ mc[i] = Point2f( mu[i].m10/mu[i].m00 , mu[i].m01/mu[i].m00 ); }
/// Draw contours
Mat drawing = Mat::zeros( canny
_
output.size(), CV
_
8UC3 );
for( int i = 0; i< contours.size(); i++ )
{
Scalar color = Scalar( rng.uniform(0, 255), rng.uniform(0,255), rng.uniform(0,255) );
drawContours( drawing, contours, i, color, 2, 8, hierarchy, 0, Point() );
circle( drawing, mc[i], 4, color, -1, 8, 0 );
}
/// Show in a window
namedWindow( "Contours", CV
_
WINDOW
_
AUTOSIZE );
imshow( "Contours", drawing );
/// Calculate the area with the moments 00 and compare with the result of the OpenCV function
printf("\t Info: Area and Contour Length \n");
for( int i = 0; i< contours.size(); i++ )
{
3.24. Image Moments 275
The OpenCV Tutorials, Release 2.4.3
printf("
*
Contour[%d] - Area (M
_
00) = %.2f - Area OpenCV: %.2f - Length: %.2f \n", i, mu[i].m00, contourArea(contours[i]), arcLength( contours[i], true ) );
Scalar color = Scalar( rng.uniform(0, 255), rng.uniform(0,255), rng.uniform(0,255) );
drawContours( drawing, contours, i, color, 2, 8, hierarchy, 0, Point() );
circle( drawing, mc[i], 4, color, -1, 8, 0 );
}
}
Explanation
Result
1. Here it is:
3.25 Point Polygon Test
Goal
In this tutorial you will learn how to:
Use the OpenCV function pointPolygonTest
Theory
Code
This tutorial codes is shown lines below. You can also download it from here
#include "opencv2/highgui/highgui.hpp"
#include "opencv2/imgproc/imgproc.hpp"
#include <iostream>
#include <stdio.h>
#include <stdlib.h>
using namespace cv;
using namespace std;
/
**
@function main
*
/
int main( int argc, char
**
argv )
{
/// Create an image
const int r = 100;
276 Chapter 3. imgproc module. Image Processing
The OpenCV Tutorials, Release 2.4.3
Mat src = Mat::zeros( Size( 4
*
r, 4
*
r ), CV
_
8UC1 );
/// Create a sequence of points to make a contour:
vector<Point2f> vert(6);
vert[0] = Point( 1.5
*
r, 1.34
*
r );
vert[1] = Point( 1
*
r, 2
*
r );
vert[2] = Point( 1.5
*
r, 2.866
*
r );
vert[3] = Point( 2.5
*
r, 2.866
*
r );
vert[4] = Point( 3
*
r, 2
*
r );
vert[5] = Point( 2.5
*
r, 1.34
*
r );
/// Draw it in src
for( int j = 0; j < 6; j++ )
{ line( src, vert[j], vert[(j+1)%6], Scalar( 255 ), 3, 8 ); }
/// Get the contours
vector<vector<Point> > contours; vector<Vec4i> hierarchy;
Mat src
_
copy = src.clone();
findContours( src
_
copy, contours, hierarchy, RETR
_
TREE, CHAIN
_
APPROX
_
SIMPLE);
/// Calculate the distances to the contour
Mat raw
_
dist( src.size(), CV
_
32FC1 );
for( int j = 0; j < src.rows; j++ )
{ for( int i = 0; i < src.cols; i++ )
{ raw
_
dist.at<float>(j,i) = pointPolygonTest( contours[0], Point2f(i,j), true ); }
}
double minVal; double maxVal;
minMaxLoc( raw
_
dist, &minVal, &maxVal, 0, 0, Mat() );
minVal = abs(minVal); maxVal = abs(maxVal);
/// Depicting the distances graphically
Mat drawing = Mat::zeros( src.size(), CV
_
8UC3 );
for( int j = 0; j < src.rows; j++ )
{ for( int i = 0; i < src.cols; i++ )
{
if( raw
_
dist.at<float>(j,i) < 0 )
{ drawing.at<Vec3b>(j,i)[0] = 255 - (int) abs(raw
_
dist.at<float>(j,i))
*
255/minVal; }
else if( raw
_
dist.at<float>(j,i) > 0 )
{ drawing.at<Vec3b>(j,i)[2] = 255 - (int) raw
_
dist.at<float>(j,i)
*
255/maxVal; }
else
{ drawing.at<Vec3b>(j,i)[0] = 255; drawing.at<Vec3b>(j,i)[1] = 255; drawing.at<Vec3b>(j,i)[2] = 255; }
}
}
/// Create Window and show your results
char
*
source
_
window = "Source";
namedWindow( source
_
window, CV
_
WINDOW
_
AUTOSIZE );
imshow( source
_
window, src );
namedWindow( "Distance", CV
_
WINDOW
_
AUTOSIZE );
imshow( "Distance", drawing );
waitKey(0);
return(0);
3.25. Point Polygon Test 277
The OpenCV Tutorials, Release 2.4.3
}
Explanation
Result
1. Here it is:
278 Chapter 3. imgproc module. Image Processing
CHAPTER
FOUR
HIGHGUI MODULE. HIGH LEVEL GUI
AND MEDIA
This section contains valuable tutorials about how to read/save your image/video les and how to use the built-in
graphical user interface of the library.
(I
1
I
2
)
2
Then the PSNR is expressed as:
PSNR = 10 log
10
_
MAX
2
I
MSE
_
Here the MAX
2
I
is the maximum valid value for a pixel. In case of the simple single byte image per pixel per channel
this is 255. When two images are the same the MSE will give zero, resulting in an invalid divide by zero operation in
the PSNR formula. In this case the PSNR is undened and as well need to handle this case separately. The transition
to a logarithmic scale is made because the pixel values have a very wide dynamic range. All this translated to OpenCV
and a C++ function looks like:
double getPSNR(const Mat& I1, const Mat& I2)
{
Mat s1;
absdiff(I1, I2, s1); // |I1 - I2|
288 Chapter 4. highgui module. High Level GUI and Media
The OpenCV Tutorials, Release 2.4.3
s1.convertTo(s1, CV
_
32F); // cannot make a square on 8 bits
s1 = s1.mul(s1); // |I1 - I2|^2
Scalar s = sum(s1); // sum elements per channel
double sse = s.val[0] + s.val[1] + s.val[2]; // sum channels
if( sse <= 1e-10) // for small values return zero
return 0;
else
{
double mse =sse /(double)(I1.channels()
*
I1.total());
double psnr = 10.0
*
log10((255
*
255)/mse);
return psnr;
}
}
Typically result values are anywhere between 30 and 50 for video compression, where higher is better. If the images
signicantly differ youll get much lower ones like 15 and so. This similarity check is easy and fast to calculate,
however in practice it may turn out somewhat inconsistent with human eye perception. The structural similarity
algorithm aims to correct this.
Describing the methods goes well beyond the purpose of this tutorial. For that I invite you to read the article introducing
it. Nevertheless, you can get a good image of it by looking at the OpenCV implementation below.
See Also:
SSIM is described more in-depth in the: Z. Wang, A. C. Bovik, H. R. Sheikh and E. P. Simoncelli, Image quality
assessment: From error visibility to structural similarity, IEEE Transactions on Image Processing, vol. 13, no. 4, pp.
600-612, Apr. 2004. article.
Scalar getMSSIM( const Mat& i1, const Mat& i2)
{
const double C1 = 6.5025, C2 = 58.5225;
/
*****************************
INITS
**********************************
/
int d = CV
_
32F;
Mat I1, I2;
i1.convertTo(I1, d); // cannot calculate on one byte large values
i2.convertTo(I2, d);
Mat I2
_
2 = I2.mul(I2); // I2^2
Mat I1
_
2 = I1.mul(I1); // I1^2
Mat I1
_
I2 = I1.mul(I2); // I1
*
I2
/
***********************
PRELIMINARY COMPUTING
******************************
/
Mat mu1, mu2; //
GaussianBlur(I1, mu1, Size(11, 11), 1.5);
GaussianBlur(I2, mu2, Size(11, 11), 1.5);
Mat mu1
_
2 = mu1.mul(mu1);
Mat mu2
_
2 = mu2.mul(mu2);
Mat mu1
_
mu2 = mu1.mul(mu2);
Mat sigma1
_
2, sigma2
_
2, sigma12;
GaussianBlur(I1
_
2, sigma1
_
2, Size(11, 11), 1.5);
sigma1
_
2 -= mu1
_
2;
4.2. Video Input with OpenCV and similarity measurement 289
The OpenCV Tutorials, Release 2.4.3
GaussianBlur(I2
_
2, sigma2
_
2, Size(11, 11), 1.5);
sigma2
_
2 -= mu2
_
2;
GaussianBlur(I1
_
I2, sigma12, Size(11, 11), 1.5);
sigma12 -= mu1
_
mu2;
///////////////////////////////// FORMULA ////////////////////////////////
Mat t1, t2, t3;
t1 = 2
*
mu1
_
mu2 + C1;
t2 = 2
*
sigma12 + C2;
t3 = t1.mul(t2); // t3 = ((2
*
mu1
_
mu2 + C1).
*
(2
*
sigma12 + C2))
t1 = mu1
_
2 + mu2
_
2 + C1;
t2 = sigma1
_
2 + sigma2
_
2 + C2;
t1 = t1.mul(t2); // t1 =((mu1
_
2 + mu2
_
2 + C1).
*
(sigma1
_
2 + sigma2
_
2 + C2))
Mat ssim
_
map;
divide(t3, t1, ssim
_
map); // ssim
_
map = t3./t1;
Scalar mssim = mean( ssim
_
map ); // mssim = average of ssim map
return mssim;
}
This will return a similarity index for each channel of the image. This value is between zero and one, where one
corresponds to perfect t. Unfortunately, the many Gaussian blurring is quite costly, so while the PSNR may work
in a real time like environment (24 frame per second) this will take signicantly more than to accomplish similar
performance results.
Therefore, the source code presented at the start of the tutorial will perform the PSNR measurement for each frame,
and the SSIM only for the frames where the PSNR falls below an input value. For visualization purpose we show both
images in an OpenCV window and print the PSNR and MSSIM values to the console. Expect to see something like:
290 Chapter 4. highgui module. High Level GUI and Media
The OpenCV Tutorials, Release 2.4.3
You may observe a runtime instance of this on the YouTube here.
4.3 Creating a video with OpenCV
Goal
Whenever you work with video feeds you may eventually want to save your image processing result in a form of a
new video le. For simple video outputs you can use the OpenCV built-in VideoWriter class, designed for this.
How to create a video le with OpenCV
What type of video les you can create with OpenCV
How to extract a given color channel from a video
As a simple demonstration Ill just extract one of the RGB color channels of an input video le into a new video. You
can control the ow of the application from its console line arguments:
The rst argument points to the video le to work on
The second argument may be one of the characters: R G B. This will specify which of the channels to extract.
The last argument is the character Y (Yes) or N (No). If this is no, the codec used for the input video le will be
the same as for the output. Otherwise, a window will pop up and allow you to select yourself the codec to use.
For example, a valid command line would look like:
video-write.exe video/Megamind.avi R Y
The source code
You may also nd the source code and these video le in the samples/cpp/tutorial
_
code/highgui/video-write/
folder of the OpenCV source library or download it from here.
1 #include <iostream> // for standard I/O
2 #include <string> // for strings
3
4 #include <opencv2/core/core.hpp> // Basic OpenCV structures (cv::Mat)
5 #include <opencv2/highgui/highgui.hpp> // Video write
6
7 using namespace std;
8 using namespace cv;
9 int main(int argc, char
*
argv[], char
*
window
_
name)
10 {
11 if (argc != 4)
12 {
13 cout << "Not enough parameters" << endl;
14 return -1;
15 }
16
17 const string source = argv[1]; // the source file name
18 const bool askOutputType = argv[3][0] ==Y; // If false it will use the inputs codec type
19
20 VideoCapture inputVideo(source); // Open input
21 if ( !inputVideo.isOpened())
22 {
23 cout << "Could not open the input video." << source << endl;
4.3. Creating a video with OpenCV 291
The OpenCV Tutorials, Release 2.4.3
24 return -1;
25 }
26
27 string::size
_
type pAt = source.find
_
last
_
of(.); // Find extension point
28 const string NAME = source.substr(0, pAt) + argv[2][0] + ".avi"; // Form the new name with container
29 int ex = static
_
cast<int>(inputVideo.get(CV
_
CAP
_
PROP
_
FOURCC)); // Get Codec Type- Int form
30
31 // Transform from int to char via Bitwise operators
32 char EXT[] = {ex & 0XFF , (ex & 0XFF00) >> 8,(ex & 0XFF0000) >> 16,(ex & 0XFF000000) >> 24, 0};
33
34 Size S = Size((int) inputVideo.get(CV
_
CAP
_
PROP
_
FRAME
_
WIDTH), //Acquire input size
35 (int) inputVideo.get(CV
_
CAP
_
PROP
_
FRAME
_
HEIGHT));
36
37 VideoWriter outputVideo; // Open the output
38 if (askOutputType)
39 outputVideo.open(NAME , ex=-1, inputVideo.get(CV
_
CAP
_
PROP
_
FPS),S, true);
40 else
41 outputVideo.open(NAME , ex, inputVideo.get(CV
_
CAP
_
PROP
_
FPS),S, true);
42
43 if (!outputVideo.isOpened())
44 {
45 cout << "Could not open the output video for write: " << source << endl;
46 return -1;
47 }
48
49 union { int v; char c[5];} uEx ;
50 uEx.v = ex; // From Int to char via union
51 uEx.c[4]=\0;
52
53 cout << "Input frame resolution: Width=" << S.width << " Height=" << S.height
54 << " of nr#: " << inputVideo.get(CV
_
CAP
_
PROP
_
FRAME
_
COUNT) << endl;
55 cout << "Input codec type: " << EXT << endl;
56
57 int channel = 2; // Select the channel to save
58 switch(argv[2][0])
59 {
60 case R : {channel = 2; break;}
61 case G : {channel = 1; break;}
62 case B : {channel = 0; break;}
63 }
64 Mat src,res;
65 vector<Mat> spl;
66
67 while( true) //Show the image captured in the window and repeat
68 {
69 inputVideo >> src; // read
70 if( src.empty()) break; // check if at end
71
72 split(src, spl); // process - extract only the correct channel
73 for( int i =0; i < 3; ++i)
74 if (i != channel)
75 spl[i] = Mat::zeros(S, spl[0].type());
76 merge(spl, res);
77
78 //outputVideo.write(res); //save or
79 outputVideo << res;
80 }
81
292 Chapter 4. highgui module. High Level GUI and Media
The OpenCV Tutorials, Release 2.4.3
82 cout << "Finished writing" << endl;
83 return 0;
84 }
The structure of a video
For start, you should have an idea of just how a video le looks. Every video le in itself is a container. The type of
the container is expressed in the les extension (for example avi, mov or mkv). This contains multiple elements like:
video feeds, audio feeds or other tracks (like for example subtitles). How these feeds are stored is determined by the
codec used for each one of them. In case of the audio tracks commonly used codecs are mp3 or aac. For the video
les the list is somehow longer and includes names such as XVID, DIVX, H264 or LAGS (Lagarith Lossless Codec).
The full list of codecs you may use on a system depends on just what one you have installed.
As you can see things can get really complicated with videos. However, OpenCV is mainly a computer vision library,
not a video stream, codec and write one. Therefore, the developers tried to keep this part as simple as possible. Due
to this OpenCV for video containers supports only the avi extension, its rst version. A direct limitation of this is that
you cannot save a video le larger than 2 GB. Furthermore you can only create and expand a single video track inside
the container. No audio or other track editing support here. Nevertheless, any video codec present on your system
might work. If you encounter some of these limitations you will need to look into more specialized video writing
libraries such as FFMpeg or codecs as HuffYUV, CorePNG and LCL. As an alternative, create the video track with
OpenCV and expand it with sound tracks or convert it to other formats by using video manipulation programs such
as VirtualDub or AviSynth. The VideoWriter class ======================= The content written here builds on
the assumption you already read the Video Input with OpenCV and similarity measurement tutorial and you know how
to read video les. To create a video le you just need to create an instance of the VideoWriter class. You can specify
its properties either via parameters in the constructor or later on via the open function. Either way, the parameters
are the same: 1. The name of the output that contains the container type in its extension. At the moment only avi is
supported. We construct this from the input le, add to this the name of the channel to use, and nish it off with the
container extension.
const string source = argv[1]; // the source file name
string::size
_
type pAt = source.find
_
last
_
of(.); // Find extension point
const string NAME = source.substr(0, pAt) + argv[2][0] + ".avi"; // Form the new name with container
1. The codec to use for the video track. Now all the video codecs have a unique short name of maximum four
characters. Hence, the XVID, DIVX or H264 names. This is called a four character code. You may also ask this
from an input video by using its get function. Because the get function is a general function it always returns
double values. A double value is stored on 64 bits. Four characters are four bytes, meaning 32 bits. These four
characters are coded in the lower 32 bits of the double. A simple way to throw away the upper 32 bits would be
to just convert this value to int:
4.3. Creating a video with OpenCV 293
The OpenCV Tutorials, Release 2.4.3
VideoCapture inputVideo(source); // Open input
int ex = static
_
cast<int>(inputVideo.get(CV
_
CAP
_
PROP
_
FOURCC)); // Get Codec Type- Int form
OpenCV internally works with this integer type and expect this as its second parameter. Now to convert from the
integer form to string we may use two methods: a bitwise operator and a union method. The rst one extracting
from an int the characters looks like (an and operation, some shifting and adding a 0 at the end to close the
string):
char EXT[] = {ex & 0XFF , (ex & 0XFF00) >> 8,(ex & 0XFF0000) >> 16,(ex & 0XFF000000) >> 24, 0};
You can do the same thing with the union as:
union { int v; char c[5];} uEx ;
uEx.v = ex; // From Int to char via union
uEx.c[4]=\0;
The advantage of this is that the conversion is done automatically after assigning, while for the bitwise operator
you need to do the operations whenever you change the codec type. In case you know the codecs four character
code beforehand, you can use the CV_FOURCC macro to build the integer:
If you pass for this argument minus one than a window will pop up at runtime that contains all the codec installed
on your system and ask you to select the one to use:
2. The frame per second for the output video. Again, here I keep the input videos frame per second by using the
get function.
3. The size of the frames for the output video. Here too I keep the input videos frame size per second by using the
get function.
4. The nal argument is an optional one. By default is true and says that the output will be a colorful one (so for
write you will send three channel images). To create a gray scale video pass a false parameter here.
Here it is, how I use it in the sample:
VideoWriter outputVideo;
Size S = Size((int) inputVideo.get(CV
_
CAP
_
PROP
_
FRAME
_
WIDTH), //Acquire input size
(int) inputVideo.get(CV
_
CAP
_
PROP
_
FRAME
_
HEIGHT));
outputVideo.open(NAME , ex, inputVideo.get(CV
_
CAP
_
PROP
_
FPS),S, true);
Afterwards, you use the isOpened() function to nd out if the open operation succeeded or not. The video le au-
tomatically closes when the VideoWriter object is destroyed. After you open the object with success you can send
the frames of the video in a sequential order by using the write function of the class. Alternatively, you can use its
overloaded operator << :
outputVideo.write(res); //or
outputVideo << res;
294 Chapter 4. highgui module. High Level GUI and Media
The OpenCV Tutorials, Release 2.4.3
Extracting a color channel from an RGB image means to set to zero the RGB values of the other channels. You can
either do this with image scanning operations or by using the split and merge operations. You rst split the channels
up into different images, set the other channels to zero images of the same size and type and nally merge them back:
split(src, spl); // process - extract only the correct channel
for( int i =0; i < 3; ++i)
if (i != channel)
spl[i] = Mat::zeros(S, spl[0].type());
merge(spl, res);
Put all this together and youll get the upper source code, whose runtime result will show something around the idea:
You may observe a runtime instance of this on the YouTube here.
4.3. Creating a video with OpenCV 295
The OpenCV Tutorials, Release 2.4.3
296 Chapter 4. highgui module. High Level GUI and Media
CHAPTER
FIVE
CALIB3D MODULE. CAMERA
CALIBRATION AND 3D
RECONSTRUCTION
Although we got most of our images in a 2D format they do come from a 3D world. Here you will learn how to nd
out from the 2D images information about the 3D world.
x,y
w(x, y)[I(x +u, y +v) I(x, y)]
2
where:
w(x, y) is the window at position (x, y)
6.2. Harris corner detector 313
The OpenCV Tutorials, Release 2.4.3
I(x, y) is the intensity at (x, y)
I(x +u, y +v) is the intensity at the moved window (x +u, y +v)
Since we are looking for windows with corners, we are looking for windows with a large variation in intensity.
Hence, we have to maximize the equation above, specically the term:
x,y
[I(x +u, y +v) I(x, y)]
2
Using Taylor expansion:
E(u, v)
x,y
[I(x, y) +uI
x
+vI
y
I(x, y)]
2
Expanding the equation and cancelling properly:
E(u, v)
x,y
u
2
I
2
x
+2uvI
x
I
y
+v
2
I
2
y
Which can be expressed in a matrix form as:
E(u, v)
_
u v
x,y
w(x, y)
_
I
2
x
I
x
I
y
I
x
I
y
I
2
y
_
_
_
u
v
_
Lets denote:
M =
x,y
w(x, y)
_
I
2
x
I
x
I
y
I
x
I
y
I
2
y
_
So, our equation now is:
E(u, v)
_
u v
M
_
u
v
_
A score is calculated for each window, to determine if it can possibly contain a corner:
R = det(M) k(trace(M))
2
where:
det(M) =
1
2
trace(M) =
1
+
2
a window with a score R greater than a certain value is considered a corner
Code
This tutorial codes is shown lines below. You can also download it from here
#include "opencv2/highgui/highgui.hpp"
#include "opencv2/imgproc/imgproc.hpp"
#include <iostream>
#include <stdio.h>
#include <stdlib.h>
314 Chapter 6. feature2d module. 2D Features framework
The OpenCV Tutorials, Release 2.4.3
using namespace cv;
using namespace std;
/// Global variables
Mat src, src
_
gray;
int thresh = 200;
int max
_
thresh = 255;
char
*
source
_
window = "Source image";
char
*
corners
_
window = "Corners detected";
/// Function header
void cornerHarris
_
demo( int, void
*
);
/
**
@function main
*
/
int main( int argc, char
**
argv )
{
/// Load source image and convert it to gray
src = imread( argv[1], 1 );
cvtColor( src, src
_
gray, CV
_
BGR2GRAY );
/// Create a window and a trackbar
namedWindow( source
_
window, CV
_
WINDOW
_
AUTOSIZE );
createTrackbar( "Threshold: ", source
_
window, &thresh, max
_
thresh, cornerHarris
_
demo );
imshow( source
_
window, src );
cornerHarris
_
demo( 0, 0 );
waitKey(0);
return(0);
}
/
**
@function cornerHarris
_
demo
*
/
void cornerHarris
_
demo( int, void
*
)
{
Mat dst, dst
_
norm, dst
_
norm
_
scaled;
dst = Mat::zeros( src.size(), CV
_
32FC1 );
/// Detector parameters
int blockSize = 2;
int apertureSize = 3;
double k = 0.04;
/// Detecting corners
cornerHarris( src
_
gray, dst, blockSize, apertureSize, k, BORDER
_
DEFAULT );
/// Normalizing
normalize( dst, dst
_
norm, 0, 255, NORM
_
MINMAX, CV
_
32FC1, Mat() );
convertScaleAbs( dst
_
norm, dst
_
norm
_
scaled );
/// Drawing a circle around corners
for( int j = 0; j < dst
_
norm.rows ; j++ )
{ for( int i = 0; i < dst
_
norm.cols; i++ )
{
if( (int) dst
_
norm.at<float>(j,i) > thresh )
{
circle( dst
_
norm
_
scaled, Point( i, j ), 5, Scalar(0), 2, 8, 0 );
6.2. Harris corner detector 315
The OpenCV Tutorials, Release 2.4.3
}
}
}
/// Showing the result
namedWindow( corners
_
window, CV
_
WINDOW
_
AUTOSIZE );
imshow( corners
_
window, dst
_
norm
_
scaled );
}
Explanation
Result
The original image:
The detected corners are surrounded by a small black circle
316 Chapter 6. feature2d module. 2D Features framework
The OpenCV Tutorials, Release 2.4.3
6.3 Feature Matching with FLANN
Goal
In this tutorial you will learn how to:
Use the FlannBasedMatcher interface in order to perform a quick and efcient matching by using the FLANN (
Fast Approximate Nearest Neighbor Search Library )
Theory
Code
This tutorial codes is shown lines below. You can also download it from here
#include <stdio.h>
#include <iostream>
#include "opencv2/core/core.hpp"
#include "opencv2/features2d/features2d.hpp"
#include "opencv2/highgui/highgui.hpp"
using namespace cv;
void readme();
/
**
@function main
*
/
int main( int argc, char
**
argv )
{
if( argc != 3 )
{ readme(); return -1; }
Mat img
_
1 = imread( argv[1], CV
_
LOAD
_
IMAGE
_
GRAYSCALE );
6.3. Feature Matching with FLANN 317
The OpenCV Tutorials, Release 2.4.3
Mat img
_
2 = imread( argv[2], CV
_
LOAD
_
IMAGE
_
GRAYSCALE );
if( !img
_
1.data || !img
_
2.data )
{ std::cout<< " --(!) Error reading images " << std::endl; return -1; }
//-- Step 1: Detect the keypoints using SURF Detector
int minHessian = 400;
SurfFeatureDetector detector( minHessian );
std::vector<KeyPoint> keypoints
_
1, keypoints
_
2;
detector.detect( img
_
1, keypoints
_
1 );
detector.detect( img
_
2, keypoints
_
2 );
//-- Step 2: Calculate descriptors (feature vectors)
SurfDescriptorExtractor extractor;
Mat descriptors
_
1, descriptors
_
2;
extractor.compute( img
_
1, keypoints
_
1, descriptors
_
1 );
extractor.compute( img
_
2, keypoints
_
2, descriptors
_
2 );
//-- Step 3: Matching descriptor vectors using FLANN matcher
FlannBasedMatcher matcher;
std::vector< DMatch > matches;
matcher.match( descriptors
_
1, descriptors
_
2, matches );
double max
_
dist = 0; double min
_
dist = 100;
//-- Quick calculation of max and min distances between keypoints
for( int i = 0; i < descriptors
_
1.rows; i++ )
{ double dist = matches[i].distance;
if( dist < min
_
dist ) min
_
dist = dist;
if( dist > max
_
dist ) max
_
dist = dist;
}
printf("-- Max dist : %f \n", max
_
dist );
printf("-- Min dist : %f \n", min
_
dist );
//-- Draw only "good" matches (i.e. whose distance is less than 2
*
min
_
dist )
//-- PS.- radiusMatch can also be used here.
std::vector< DMatch > good
_
matches;
for( int i = 0; i < descriptors
_
1.rows; i++ )
{ if( matches[i].distance < 2
*
min
_
dist )
{ good
_
matches.push
_
back( matches[i]); }
}
//-- Draw only "good" matches
Mat img
_
matches;
drawMatches( img
_
1, keypoints
_
1, img
_
2, keypoints
_
2,
good
_
matches, img
_
matches, Scalar::all(-1), Scalar::all(-1),
vector<char>(), DrawMatchesFlags::NOT
_
DRAW
_
SINGLE
_
POINTS );
//-- Show detected matches
imshow( "Good Matches", img
_
matches );
318 Chapter 6. feature2d module. 2D Features framework
The OpenCV Tutorials, Release 2.4.3
for( int i = 0; i < good
_
matches.size(); i++ )
{ printf( "-- Good Match [%d] Keypoint 1: %d -- Keypoint 2: %d \n", i, good
_
matches[i].queryIdx, good
_
matches[i].trainIdx ); }
waitKey(0);
return 0;
}
/
**
@function readme
*
/
void readme()
{ std::cout << " Usage: ./SURF
_
FlannMatcher <img1> <img2>" << std::endl; }
Explanation
Result
1. Here is the result of the feature detection applied to the rst image:
2. Additionally, we get as console output the keypoints ltered:
6.3. Feature Matching with FLANN 319
The OpenCV Tutorials, Release 2.4.3
6.4 Features2D + Homography to nd a known object
Goal
In this tutorial you will learn how to:
Use the function ndHomography to nd the transform between matched keypoints.
Use the function perspectiveTransform to map the points.
Theory
Code
This tutorial codes is shown lines below. You can also download it from here
#include <stdio.h>
#include <iostream>
#include "opencv2/core/core.hpp"
#include "opencv2/features2d/features2d.hpp"
#include "opencv2/highgui/highgui.hpp"
#include "opencv2/calib3d/calib3d.hpp"
using namespace cv;
void readme();
/
**
@function main
*
/
int main( int argc, char
**
argv )
{
if( argc != 3 )
320 Chapter 6. feature2d module. 2D Features framework
The OpenCV Tutorials, Release 2.4.3
{ readme(); return -1; }
Mat img
_
object = imread( argv[1], CV
_
LOAD
_
IMAGE
_
GRAYSCALE );
Mat img
_
scene = imread( argv[2], CV
_
LOAD
_
IMAGE
_
GRAYSCALE );
if( !img
_
object.data || !img
_
scene.data )
{ std::cout<< " --(!) Error reading images " << std::endl; return -1; }
//-- Step 1: Detect the keypoints using SURF Detector
int minHessian = 400;
SurfFeatureDetector detector( minHessian );
std::vector<KeyPoint> keypoints
_
object, keypoints
_
scene;
detector.detect( img
_
object, keypoints
_
object );
detector.detect( img
_
scene, keypoints
_
scene );
//-- Step 2: Calculate descriptors (feature vectors)
SurfDescriptorExtractor extractor;
Mat descriptors
_
object, descriptors
_
scene;
extractor.compute( img
_
object, keypoints
_
object, descriptors
_
object );
extractor.compute( img
_
scene, keypoints
_
scene, descriptors
_
scene );
//-- Step 3: Matching descriptor vectors using FLANN matcher
FlannBasedMatcher matcher;
std::vector< DMatch > matches;
matcher.match( descriptors
_
object, descriptors
_
scene, matches );
double max
_
dist = 0; double min
_
dist = 100;
//-- Quick calculation of max and min distances between keypoints
for( int i = 0; i < descriptors
_
object.rows; i++ )
{ double dist = matches[i].distance;
if( dist < min
_
dist ) min
_
dist = dist;
if( dist > max
_
dist ) max
_
dist = dist;
}
printf("-- Max dist : %f \n", max
_
dist );
printf("-- Min dist : %f \n", min
_
dist );
//-- Draw only "good" matches (i.e. whose distance is less than 3
*
min
_
dist )
std::vector< DMatch > good
_
matches;
for( int i = 0; i < descriptors
_
object.rows; i++ )
{ if( matches[i].distance < 3
*
min
_
dist )
{ good
_
matches.push
_
back( matches[i]); }
}
Mat img
_
matches;
drawMatches( img
_
object, keypoints
_
object, img
_
scene, keypoints
_
scene,
good
_
matches, img
_
matches, Scalar::all(-1), Scalar::all(-1),
vector<char>(), DrawMatchesFlags::NOT
_
DRAW
_
SINGLE
_
POINTS );
//-- Localize the object
std::vector<Point2f> obj;
6.4. Features2D + Homography to nd a known object 321
The OpenCV Tutorials, Release 2.4.3
std::vector<Point2f> scene;
for( int i = 0; i < good
_
matches.size(); i++ )
{
//-- Get the keypoints from the good matches
obj.push
_
back( keypoints
_
object[ good
_
matches[i].queryIdx ].pt );
scene.push
_
back( keypoints
_
scene[ good
_
matches[i].trainIdx ].pt );
}
Mat H = findHomography( obj, scene, CV
_
RANSAC );
//-- Get the corners from the image
_
1 ( the object to be "detected" )
std::vector<Point2f> obj
_
corners(4);
obj
_
corners[0] = cvPoint(0,0); obj
_
corners[1] = cvPoint( img
_
object.cols, 0 );
obj
_
corners[2] = cvPoint( img
_
object.cols, img
_
object.rows ); obj
_
corners[3] = cvPoint( 0, img
_
object.rows );
std::vector<Point2f> scene
_
corners(4);
perspectiveTransform( obj
_
corners, scene
_
corners, H);
//-- Draw lines between the corners (the mapped object in the scene - image
_
2 )
line( img
_
matches, scene
_
corners[0] + Point2f( img
_
object.cols, 0), scene
_
corners[1] + Point2f( img
_
object.cols, 0), Scalar(0, 255, 0), 4 );
line( img
_
matches, scene
_
corners[1] + Point2f( img
_
object.cols, 0), scene
_
corners[2] + Point2f( img
_
object.cols, 0), Scalar( 0, 255, 0), 4 );
line( img
_
matches, scene
_
corners[2] + Point2f( img
_
object.cols, 0), scene
_
corners[3] + Point2f( img
_
object.cols, 0), Scalar( 0, 255, 0), 4 );
line( img
_
matches, scene
_
corners[3] + Point2f( img
_
object.cols, 0), scene
_
corners[0] + Point2f( img
_
object.cols, 0), Scalar( 0, 255, 0), 4 );
//-- Show detected matches
imshow( "Good Matches & Object detection", img
_
matches );
waitKey(0);
return 0;
}
/
**
@function readme
*
/
void readme()
{ std::cout << " Usage: ./SURF
_
descriptor <img1> <img2>" << std::endl; }
Explanation
Result
1. And here is the result for the detected object (highlighted in green)
322 Chapter 6. feature2d module. 2D Features framework
The OpenCV Tutorials, Release 2.4.3
6.5 Shi-Tomasi corner detector
Goal
In this tutorial you will learn how to:
Use the function goodFeaturesToTrack to detect corners using the Shi-Tomasi method.
Theory
Code
This tutorial codes is shown lines below. You can also download it from here
#include "opencv2/highgui/highgui.hpp"
#include "opencv2/imgproc/imgproc.hpp"
#include <iostream>
#include <stdio.h>
#include <stdlib.h>
using namespace cv;
using namespace std;
/// Global variables
Mat src, src
_
gray;
int maxCorners = 23;
int maxTrackbar = 100;
RNG rng(12345);
char
*
source
_
window = "Image";
/// Function header
void goodFeaturesToTrack
_
Demo( int, void
*
);
/
**
6.5. Shi-Tomasi corner detector 323
The OpenCV Tutorials, Release 2.4.3
*
@function main
*
/
int main( int argc, char
**
argv )
{
/// Load source image and convert it to gray
src = imread( argv[1], 1 );
cvtColor( src, src
_
gray, CV
_
BGR2GRAY );
/// Create Window
namedWindow( source
_
window, CV
_
WINDOW
_
AUTOSIZE );
/// Create Trackbar to set the number of corners
createTrackbar( "Max corners:", source
_
window, &maxCorners, maxTrackbar, goodFeaturesToTrack
_
Demo );
imshow( source
_
window, src );
goodFeaturesToTrack
_
Demo( 0, 0 );
waitKey(0);
return(0);
}
/
**
*
@function goodFeaturesToTrack
_
Demo.cpp
*
@brief Apply Shi-Tomasi corner detector
*
/
void goodFeaturesToTrack
_
Demo( int, void
*
)
{
if( maxCorners < 1 ) { maxCorners = 1; }
/// Parameters for Shi-Tomasi algorithm
vector<Point2f> corners;
double qualityLevel = 0.01;
double minDistance = 10;
int blockSize = 3;
bool useHarrisDetector = false;
double k = 0.04;
/// Copy the source image
Mat copy;
copy = src.clone();
/// Apply corner detection
goodFeaturesToTrack( src
_
gray,
corners,
maxCorners,
qualityLevel,
minDistance,
Mat(),
blockSize,
useHarrisDetector,
k );
/// Draw corners detected
cout<<"
**
Number of corners detected: "<<corners.size()<<endl;
int r = 4;
for( int i = 0; i < corners.size(); i++ )
324 Chapter 6. feature2d module. 2D Features framework
The OpenCV Tutorials, Release 2.4.3
{ circle( copy, corners[i], r, Scalar(rng.uniform(0,255), rng.uniform(0,255),
rng.uniform(0,255)), -1, 8, 0 ); }
/// Show what you got
namedWindow( source
_
window, CV
_
WINDOW
_
AUTOSIZE );
imshow( source
_
window, copy );
}
Explanation
Result
6.6 Creating yor own corner detector
Goal
In this tutorial you will learn how to:
Use the OpenCV function cornerEigenValsAndVecs to nd the eigenvalues and eigenvectors to determine if a
pixel is a corner.
Use the OpenCV function cornerMinEigenVal to nd the minimum eigenvalues for corner detection.
To implement our own version of the Harris detector as well as the Shi-Tomasi detector, by using the two
functions above.
6.6. Creating yor own corner detector 325
The OpenCV Tutorials, Release 2.4.3
Theory
Code
This tutorial codes is shown lines below. You can also download it from here
#include "opencv2/highgui/highgui.hpp"
#include "opencv2/imgproc/imgproc.hpp"
#include <iostream>
#include <stdio.h>
#include <stdlib.h>
using namespace cv;
using namespace std;
/// Global variables
Mat src, src
_
gray;
Mat myHarris
_
dst; Mat myHarris
_
copy; Mat Mc;
Mat myShiTomasi
_
dst; Mat myShiTomasi
_
copy;
int myShiTomasi
_
qualityLevel = 50;
int myHarris
_
qualityLevel = 50;
int max
_
qualityLevel = 100;
double myHarris
_
minVal; double myHarris
_
maxVal;
double myShiTomasi
_
minVal; double myShiTomasi
_
maxVal;
RNG rng(12345);
char
*
myHarris
_
window = "My Harris corner detector";
char
*
myShiTomasi
_
window = "My Shi Tomasi corner detector";
/// Function headers
void myShiTomasi
_
function( int, void
*
);
void myHarris
_
function( int, void
*
);
/
**
@function main
*
/
int main( int argc, char
**
argv )
{
/// Load source image and convert it to gray
src = imread( argv[1], 1 );
cvtColor( src, src
_
gray, CV
_
BGR2GRAY );
/// Set some parameters
int blockSize = 3; int apertureSize = 3;
/// My Harris matrix -- Using cornerEigenValsAndVecs
myHarris
_
dst = Mat::zeros( src
_
gray.size(), CV
_
32FC(6) );
Mc = Mat::zeros( src
_
gray.size(), CV
_
32FC1 );
cornerEigenValsAndVecs( src
_
gray, myHarris
_
dst, blockSize, apertureSize, BORDER
_
DEFAULT );
/
*
calculate Mc
*
/
for( int j = 0; j < src
_
gray.rows; j++ )
{ for( int i = 0; i < src
_
gray.cols; i++ )
{
float lambda
_
1 = myHarris
_
dst.at<float>( j, i, 0 );
float lambda
_
2 = myHarris
_
dst.at<float>( j, i, 1 );
326 Chapter 6. feature2d module. 2D Features framework
The OpenCV Tutorials, Release 2.4.3
Mc.at<float>(j,i) = lambda
_
1
*
lambda
_
2 - 0.04
*
pow( ( lambda
_
1 + lambda
_
2 ), 2 );
}
}
minMaxLoc( Mc, &myHarris
_
minVal, &myHarris
_
maxVal, 0, 0, Mat() );
/
*
Create Window and Trackbar
*
/
namedWindow( myHarris
_
window, CV
_
WINDOW
_
AUTOSIZE );
createTrackbar( " Quality Level:", myHarris
_
window, &myHarris
_
qualityLevel, max
_
qualityLevel,
myHarris
_
function );
myHarris
_
function( 0, 0 );
/// My Shi-Tomasi -- Using cornerMinEigenVal
myShiTomasi
_
dst = Mat::zeros( src
_
gray.size(), CV
_
32FC1 );
cornerMinEigenVal( src
_
gray, myShiTomasi
_
dst, blockSize, apertureSize, BORDER
_
DEFAULT );
minMaxLoc( myShiTomasi
_
dst, &myShiTomasi
_
minVal, &myShiTomasi
_
maxVal, 0, 0, Mat() );
/
*
Create Window and Trackbar
*
/
namedWindow( myShiTomasi
_
window, CV
_
WINDOW
_
AUTOSIZE );
createTrackbar( " Quality Level:", myShiTomasi
_
window, &myShiTomasi
_
qualityLevel, max
_
qualityLevel,
myShiTomasi
_
function );
myShiTomasi
_
function( 0, 0 );
waitKey(0);
return(0);
}
/
**
@function myShiTomasi
_
function
*
/
void myShiTomasi
_
function( int, void
*
)
{
myShiTomasi
_
copy = src.clone();
if( myShiTomasi
_
qualityLevel < 1 ) { myShiTomasi
_
qualityLevel = 1; }
for( int j = 0; j < src
_
gray.rows; j++ )
{ for( int i = 0; i < src
_
gray.cols; i++ )
{
if( myShiTomasi
_
dst.at<float>(j,i) > myShiTomasi
_
minVal + ( myShiTomasi
_
maxVal -
myShiTomasi
_
minVal )
*
myShiTomasi
_
qualityLevel/max
_
qualityLevel )
{ circle( myShiTomasi
_
copy, Point(i,j), 4, Scalar( rng.uniform(0,255),
rng.uniform(0,255), rng.uniform(0,255) ), -1, 8, 0 ); }
}
}
imshow( myShiTomasi
_
window, myShiTomasi
_
copy );
}
/
**
@function myHarris
_
function
*
/
void myHarris
_
function( int, void
*
)
{
myHarris
_
copy = src.clone();
if( myHarris
_
qualityLevel < 1 ) { myHarris
_
qualityLevel = 1; }
for( int j = 0; j < src
_
gray.rows; j++ )
{ for( int i = 0; i < src
_
gray.cols; i++ )
{
if( Mc.at<float>(j,i) > myHarris
_
minVal + ( myHarris
_
maxVal - myHarris
_
minVal )
6.6. Creating yor own corner detector 327
The OpenCV Tutorials, Release 2.4.3
*
myHarris
_
qualityLevel/max
_
qualityLevel )
{ circle( myHarris
_
copy, Point(i,j), 4, Scalar( rng.uniform(0,255), rng.uniform(0,255),
rng.uniform(0,255) ), -1, 8, 0 ); }
}
}
imshow( myHarris
_
window, myHarris
_
copy );
}
Explanation
Result
328 Chapter 6. feature2d module. 2D Features framework
The OpenCV Tutorials, Release 2.4.3
6.7 Detecting corners location in subpixeles
Goal
In this tutorial you will learn how to:
Use the OpenCV function cornerSubPix to nd more exact corner positions (more exact than integer pixels).
Theory
Code
This tutorial codes is shown lines below. You can also download it from here
#include "opencv2/highgui/highgui.hpp"
#include "opencv2/imgproc/imgproc.hpp"
#include <iostream>
#include <stdio.h>
#include <stdlib.h>
using namespace cv;
using namespace std;
/// Global variables
Mat src, src
_
gray;
int maxCorners = 10;
int maxTrackbar = 25;
RNG rng(12345);
char
*
source
_
window = "Image";
6.7. Detecting corners location in subpixeles 329
The OpenCV Tutorials, Release 2.4.3
/// Function header
void goodFeaturesToTrack
_
Demo( int, void
*
);
/
**
@function main
*
/
int main( int argc, char
**
argv )
{
/// Load source image and convert it to gray
src = imread( argv[1], 1 );
cvtColor( src, src
_
gray, CV
_
BGR2GRAY );
/// Create Window
namedWindow( source
_
window, CV
_
WINDOW
_
AUTOSIZE );
/// Create Trackbar to set the number of corners
createTrackbar( "Max corners:", source
_
window, &maxCorners, maxTrackbar, goodFeaturesToTrack
_
Demo);
imshow( source
_
window, src );
goodFeaturesToTrack
_
Demo( 0, 0 );
waitKey(0);
return(0);
}
/
**
*
@function goodFeaturesToTrack
_
Demo.cpp
*
@brief Apply Shi-Tomasi corner detector
*
/
void goodFeaturesToTrack
_
Demo( int, void
*
)
{
if( maxCorners < 1 ) { maxCorners = 1; }
/// Parameters for Shi-Tomasi algorithm
vector<Point2f> corners;
double qualityLevel = 0.01;
double minDistance = 10;
int blockSize = 3;
bool useHarrisDetector = false;
double k = 0.04;
/// Copy the source image
Mat copy;
copy = src.clone();
/// Apply corner detection
goodFeaturesToTrack( src
_
gray,
corners,
maxCorners,
qualityLevel,
minDistance,
Mat(),
blockSize,
useHarrisDetector,
k );
/// Draw corners detected
cout<<"
**
Number of corners detected: "<<corners.size()<<endl;
330 Chapter 6. feature2d module. 2D Features framework
The OpenCV Tutorials, Release 2.4.3
int r = 4;
for( int i = 0; i < corners.size(); i++ )
{ circle( copy, corners[i], r, Scalar(rng.uniform(0,255), rng.uniform(0,255),
rng.uniform(0,255)), -1, 8, 0 ); }
/// Show what you got
namedWindow( source
_
window, CV
_
WINDOW
_
AUTOSIZE );
imshow( source
_
window, copy );
/// Set the neeed parameters to find the refined corners
Size winSize = Size( 5, 5 );
Size zeroZone = Size( -1, -1 );
TermCriteria criteria = TermCriteria( CV
_
TERMCRIT
_
EPS + CV
_
TERMCRIT
_
ITER, 40, 0.001 );
/// Calculate the refined corner locations
cornerSubPix( src
_
gray, corners, winSize, zeroZone, criteria );
/// Write them down
for( int i = 0; i < corners.size(); i++ )
{ cout<<" -- Refined Corner ["<<i<<"] ("<<corners[i].x<<","<<corners[i].y<<")"<<endl; }
}
Explanation
Result
Here is the result:
6.7. Detecting corners location in subpixeles 331
The OpenCV Tutorials, Release 2.4.3
6.8 Feature Detection
Goal
In this tutorial you will learn how to:
Use the FeatureDetector interface in order to nd interest points. Specically:
Use the SurfFeatureDetector and its function detect to perform the detection process
Use the function drawKeypoints to draw the detected keypoints
Theory
Code
This tutorial codes is shown lines below. You can also download it from here
#include <stdio.h>
#include <iostream>
#include "opencv2/core/core.hpp"
#include "opencv2/features2d/features2d.hpp"
#include "opencv2/highgui/highgui.hpp"
using namespace cv;
void readme();
/
**
@function main
*
/
int main( int argc, char
**
argv )
{
if( argc != 3 )
{ readme(); return -1; }
Mat img
_
1 = imread( argv[1], CV
_
LOAD
_
IMAGE
_
GRAYSCALE );
Mat img
_
2 = imread( argv[2], CV
_
LOAD
_
IMAGE
_
GRAYSCALE );
if( !img
_
1.data || !img
_
2.data )
{ std::cout<< " --(!) Error reading images " << std::endl; return -1; }
//-- Step 1: Detect the keypoints using SURF Detector
int minHessian = 400;
332 Chapter 6. feature2d module. 2D Features framework
The OpenCV Tutorials, Release 2.4.3
SurfFeatureDetector detector( minHessian );
std::vector<KeyPoint> keypoints
_
1, keypoints
_
2;
detector.detect( img
_
1, keypoints
_
1 );
detector.detect( img
_
2, keypoints
_
2 );
//-- Draw keypoints
Mat img
_
keypoints
_
1; Mat img
_
keypoints
_
2;
drawKeypoints( img
_
1, keypoints
_
1, img
_
keypoints
_
1, Scalar::all(-1), DrawMatchesFlags::DEFAULT );
drawKeypoints( img
_
2, keypoints
_
2, img
_
keypoints
_
2, Scalar::all(-1), DrawMatchesFlags::DEFAULT );
//-- Show detected (drawn) keypoints
imshow("Keypoints 1", img
_
keypoints
_
1 );
imshow("Keypoints 2", img
_
keypoints
_
2 );
waitKey(0);
return 0;
}
/
**
@function readme
*
/
void readme()
{ std::cout << " Usage: ./SURF
_
detector <img1> <img2>" << std::endl; }
Explanation
Result
1. Here is the result of the feature detection applied to the rst image:
2. And here is the result for the second image:
6.8. Feature Detection 333
The OpenCV Tutorials, Release 2.4.3
6.9 Feature Matching with FLANN
Goal
In this tutorial you will learn how to:
Use the FlannBasedMatcher interface in order to perform a quick and efcient matching by using the FLANN (
Fast Approximate Nearest Neighbor Search Library )
Theory
Code
This tutorial codes is shown lines below. You can also download it from here
#include <stdio.h>
#include <iostream>
#include "opencv2/core/core.hpp"
#include "opencv2/features2d/features2d.hpp"
#include "opencv2/highgui/highgui.hpp"
using namespace cv;
void readme();
/
**
@function main
*
/
int main( int argc, char
**
argv )
{
if( argc != 3 )
{ readme(); return -1; }
Mat img
_
1 = imread( argv[1], CV
_
LOAD
_
IMAGE
_
GRAYSCALE );
Mat img
_
2 = imread( argv[2], CV
_
LOAD
_
IMAGE
_
GRAYSCALE );
if( !img
_
1.data || !img
_
2.data )
334 Chapter 6. feature2d module. 2D Features framework
The OpenCV Tutorials, Release 2.4.3
{ std::cout<< " --(!) Error reading images " << std::endl; return -1; }
//-- Step 1: Detect the keypoints using SURF Detector
int minHessian = 400;
SurfFeatureDetector detector( minHessian );
std::vector<KeyPoint> keypoints
_
1, keypoints
_
2;
detector.detect( img
_
1, keypoints
_
1 );
detector.detect( img
_
2, keypoints
_
2 );
//-- Step 2: Calculate descriptors (feature vectors)
SurfDescriptorExtractor extractor;
Mat descriptors
_
1, descriptors
_
2;
extractor.compute( img
_
1, keypoints
_
1, descriptors
_
1 );
extractor.compute( img
_
2, keypoints
_
2, descriptors
_
2 );
//-- Step 3: Matching descriptor vectors using FLANN matcher
FlannBasedMatcher matcher;
std::vector< DMatch > matches;
matcher.match( descriptors
_
1, descriptors
_
2, matches );
double max
_
dist = 0; double min
_
dist = 100;
//-- Quick calculation of max and min distances between keypoints
for( int i = 0; i < descriptors
_
1.rows; i++ )
{ double dist = matches[i].distance;
if( dist < min
_
dist ) min
_
dist = dist;
if( dist > max
_
dist ) max
_
dist = dist;
}
printf("-- Max dist : %f \n", max
_
dist );
printf("-- Min dist : %f \n", min
_
dist );
//-- Draw only "good" matches (i.e. whose distance is less than 2
*
min
_
dist )
//-- PS.- radiusMatch can also be used here.
std::vector< DMatch > good
_
matches;
for( int i = 0; i < descriptors
_
1.rows; i++ )
{ if( matches[i].distance < 2
*
min
_
dist )
{ good
_
matches.push
_
back( matches[i]); }
}
//-- Draw only "good" matches
Mat img
_
matches;
drawMatches( img
_
1, keypoints
_
1, img
_
2, keypoints
_
2,
good
_
matches, img
_
matches, Scalar::all(-1), Scalar::all(-1),
vector<char>(), DrawMatchesFlags::NOT
_
DRAW
_
SINGLE
_
POINTS );
//-- Show detected matches
imshow( "Good Matches", img
_
matches );
for( int i = 0; i < good
_
matches.size(); i++ )
{ printf( "-- Good Match [%d] Keypoint 1: %d -- Keypoint 2: %d \n", i, good
_
matches[i].queryIdx, good
_
matches[i].trainIdx ); }
6.9. Feature Matching with FLANN 335
The OpenCV Tutorials, Release 2.4.3
waitKey(0);
return 0;
}
/
**
@function readme
*
/
void readme()
{ std::cout << " Usage: ./SURF
_
FlannMatcher <img1> <img2>" << std::endl; }
Explanation
Result
1. Here is the result of the feature detection applied to the rst image:
2. Additionally, we get as console output the keypoints ltered:
336 Chapter 6. feature2d module. 2D Features framework
The OpenCV Tutorials, Release 2.4.3
6.10 Features2D + Homography to nd a known object
Goal
In this tutorial you will learn how to:
Use the function ndHomography to nd the transform between matched keypoints.
Use the function perspectiveTransform to map the points.
Theory
Code
This tutorial codes is shown lines below. You can also download it from here
#include <stdio.h>
#include <iostream>
#include "opencv2/core/core.hpp"
#include "opencv2/features2d/features2d.hpp"
#include "opencv2/highgui/highgui.hpp"
#include "opencv2/calib3d/calib3d.hpp"
using namespace cv;
void readme();
/
**
@function main
*
/
int main( int argc, char
**
argv )
{
if( argc != 3 )
6.10. Features2D + Homography to nd a known object 337
The OpenCV Tutorials, Release 2.4.3
{ readme(); return -1; }
Mat img
_
object = imread( argv[1], CV
_
LOAD
_
IMAGE
_
GRAYSCALE );
Mat img
_
scene = imread( argv[2], CV
_
LOAD
_
IMAGE
_
GRAYSCALE );
if( !img
_
object.data || !img
_
scene.data )
{ std::cout<< " --(!) Error reading images " << std::endl; return -1; }
//-- Step 1: Detect the keypoints using SURF Detector
int minHessian = 400;
SurfFeatureDetector detector( minHessian );
std::vector<KeyPoint> keypoints
_
object, keypoints
_
scene;
detector.detect( img
_
object, keypoints
_
object );
detector.detect( img
_
scene, keypoints
_
scene );
//-- Step 2: Calculate descriptors (feature vectors)
SurfDescriptorExtractor extractor;
Mat descriptors
_
object, descriptors
_
scene;
extractor.compute( img
_
object, keypoints
_
object, descriptors
_
object );
extractor.compute( img
_
scene, keypoints
_
scene, descriptors
_
scene );
//-- Step 3: Matching descriptor vectors using FLANN matcher
FlannBasedMatcher matcher;
std::vector< DMatch > matches;
matcher.match( descriptors
_
object, descriptors
_
scene, matches );
double max
_
dist = 0; double min
_
dist = 100;
//-- Quick calculation of max and min distances between keypoints
for( int i = 0; i < descriptors
_
object.rows; i++ )
{ double dist = matches[i].distance;
if( dist < min
_
dist ) min
_
dist = dist;
if( dist > max
_
dist ) max
_
dist = dist;
}
printf("-- Max dist : %f \n", max
_
dist );
printf("-- Min dist : %f \n", min
_
dist );
//-- Draw only "good" matches (i.e. whose distance is less than 3
*
min
_
dist )
std::vector< DMatch > good
_
matches;
for( int i = 0; i < descriptors
_
object.rows; i++ )
{ if( matches[i].distance < 3
*
min
_
dist )
{ good
_
matches.push
_
back( matches[i]); }
}
Mat img
_
matches;
drawMatches( img
_
object, keypoints
_
object, img
_
scene, keypoints
_
scene,
good
_
matches, img
_
matches, Scalar::all(-1), Scalar::all(-1),
vector<char>(), DrawMatchesFlags::NOT
_
DRAW
_
SINGLE
_
POINTS );
//-- Localize the object
std::vector<Point2f> obj;
338 Chapter 6. feature2d module. 2D Features framework
The OpenCV Tutorials, Release 2.4.3
std::vector<Point2f> scene;
for( int i = 0; i < good
_
matches.size(); i++ )
{
//-- Get the keypoints from the good matches
obj.push
_
back( keypoints
_
object[ good
_
matches[i].queryIdx ].pt );
scene.push
_
back( keypoints
_
scene[ good
_
matches[i].trainIdx ].pt );
}
Mat H = findHomography( obj, scene, CV
_
RANSAC );
//-- Get the corners from the image
_
1 ( the object to be "detected" )
std::vector<Point2f> obj
_
corners(4);
obj
_
corners[0] = cvPoint(0,0); obj
_
corners[1] = cvPoint( img
_
object.cols, 0 );
obj
_
corners[2] = cvPoint( img
_
object.cols, img
_
object.rows ); obj
_
corners[3] = cvPoint( 0, img
_
object.rows );
std::vector<Point2f> scene
_
corners(4);
perspectiveTransform( obj
_
corners, scene
_
corners, H);
//-- Draw lines between the corners (the mapped object in the scene - image
_
2 )
line( img
_
matches, scene
_
corners[0] + Point2f( img
_
object.cols, 0), scene
_
corners[1] + Point2f( img
_
object.cols, 0), Scalar(0, 255, 0), 4 );
line( img
_
matches, scene
_
corners[1] + Point2f( img
_
object.cols, 0), scene
_
corners[2] + Point2f( img
_
object.cols, 0), Scalar( 0, 255, 0), 4 );
line( img
_
matches, scene
_
corners[2] + Point2f( img
_
object.cols, 0), scene
_
corners[3] + Point2f( img
_
object.cols, 0), Scalar( 0, 255, 0), 4 );
line( img
_
matches, scene
_
corners[3] + Point2f( img
_
object.cols, 0), scene
_
corners[0] + Point2f( img
_
object.cols, 0), Scalar( 0, 255, 0), 4 );
//-- Show detected matches
imshow( "Good Matches & Object detection", img
_
matches );
waitKey(0);
return 0;
}
/
**
@function readme
*
/
void readme()
{ std::cout << " Usage: ./SURF
_
descriptor <img1> <img2>" << std::endl; }
Explanation
Result
1. And here is the result for the detected object (highlighted in green)
6.10. Features2D + Homography to nd a known object 339
The OpenCV Tutorials, Release 2.4.3
6.11 Detection of planar objects
The goal of this tutorial is to learn how to use features2d and calib3d modules for detecting known planar objects in
scenes.
Test data: use images in your data folder, for instance, box.png and box
_
in
_
scene.png.
1. Create a new console project. Read two input images.
Mat img1 = imread(argv[1], CV
_
LOAD
_
IMAGE
_
GRAYSCALE);
Mat img2 = imread(argv[2], CV
_
LOAD
_
IMAGE
_
GRAYSCALE);
2. Detect keypoints in both images.
// detecting keypoints
FastFeatureDetector detector(15);
vector<KeyPoint> keypoints1;
detector.detect(img1, keypoints1);
... // do the same for the second image
3. Compute descriptors for each of the keypoints.
// computing descriptors
SurfDescriptorExtractor extractor;
Mat descriptors1;
extractor.compute(img1, keypoints1, descriptors1);
... // process keypoints from the second image as well
4. Now, nd the closest matches between descriptors from the rst image to the second:
// matching descriptors
BruteForceMatcher<L2<float> > matcher;
vector<DMatch> matches;
matcher.match(descriptors1, descriptors2, matches);
5. Visualize the results:
340 Chapter 6. feature2d module. 2D Features framework
The OpenCV Tutorials, Release 2.4.3
// drawing the results
namedWindow("matches", 1);
Mat img
_
matches;
drawMatches(img1, keypoints1, img2, keypoints2, matches, img
_
matches);
imshow("matches", img
_
matches);
waitKey(0);
6. Find the homography transformation between two sets of points:
vector<Point2f> points1, points2;
// fill the arrays with the points
....
Mat H = findHomography(Mat(points1), Mat(points2), CV
_
RANSAC, ransacReprojThreshold);
7. Create a set of inlier matches and draw them. Use perspectiveTransform function to map points with homogra-
phy:
Mat points1Projected; perspectiveTransform(Mat(points1), points1Projected, H);
8. Use drawMatches for drawing inliers.
6.11. Detection of planar objects 341
The OpenCV Tutorials, Release 2.4.3
342 Chapter 6. feature2d module. 2D Features framework
CHAPTER
SEVEN
VIDEO MODULE. VIDEO ANALYSIS
Look here in order to nd use on your video stream algoritms like: motion extraction, feature tracking and foreground
extractions.
Note: Unfortunetly we have no tutorials into this section. Nevertheless, our tutorial writting team is working on it. If
you have a tutorial suggestion or you have writen yourself a tutorial (or coded a sample code) that you would like to
see here please contact us via our user group.
343
The OpenCV Tutorials, Release 2.4.3
344 Chapter 7. video module. Video analysis
CHAPTER
EIGHT
OBJDETECT MODULE. OBJECT
DETECTION
Ever wondered how your digital camera detects peoples and faces? Look here to nd out!
i
subject to y
i
(
T
x
i
+
0
) 1
i
and
i
0 i
How should the parameter C be chosen? It is obvious that the answer to this question depends on how the training
data is distributed. Although there is no general answer, it is useful to take into account these rules:
Large values of C give solutions with less misclassication errors but a smaller margin. Consider that in this
case it is expensive to make misclassication errors. Since the aim of the optimization is to minimize the
argument, few misclassications errors are allowed.
Small values of C give solutions with bigger margin and more classication errors. In this case the minimization
does not consider that much the term of the sum so it focuses more on nding a hyperplane with big margin.
Source Code
You may also nd the source code and these video le in the samples/cpp/tutorial
_
code/gpu/non
_
linear
_
svms/non
_
linear
_
svms
folder of the OpenCV source library or download it from here.
1 #include <iostream>
2 #include <opencv2/core/core.hpp>
3 #include <opencv2/highgui/highgui.hpp>
4 #include <opencv2/ml/ml.hpp>
5
6 #define NTRAINING
_
SAMPLES 100 // Number of training samples per class
358 Chapter 9. ml module. Machine Learning
The OpenCV Tutorials, Release 2.4.3
7 #define FRAC
_
LINEAR
_
SEP 0.9f // Fraction of samples which compose the linear separable part
8
9 using namespace cv;
10 using namespace std;
11
12 int main()
13 {
14 // Data for visual representation
15 const int WIDTH = 512, HEIGHT = 512;
16 Mat I = Mat::zeros(HEIGHT, WIDTH, CV
_
8UC3);
17
18 //--------------------- 1. Set up training data randomly ---------------------------------------
19 Mat trainData(2
*
NTRAINING
_
SAMPLES, 2, CV
_
32FC1);
20 Mat labels (2
*
NTRAINING
_
SAMPLES, 1, CV
_
32FC1);
21
22 RNG rng(100); // Random value generation class
23
24 // Set up the linearly separable part of the training data
25 int nLinearSamples = (int) (FRAC
_
LINEAR
_
SEP
*
NTRAINING
_
SAMPLES);
26
27 // Generate random points for the class 1
28 Mat trainClass = trainData.rowRange(0, nLinearSamples);
29 // The x coordinate of the points is in [0, 0.4)
30 Mat c = trainClass.colRange(0, 1);
31 rng.fill(c, RNG::UNIFORM, Scalar(1), Scalar(0.4
*
WIDTH));
32 // The y coordinate of the points is in [0, 1)
33 c = trainClass.colRange(1,2);
34 rng.fill(c, RNG::UNIFORM, Scalar(1), Scalar(HEIGHT));
35
36 // Generate random points for the class 2
37 trainClass = trainData.rowRange(2
*
NTRAINING
_
SAMPLES-nLinearSamples, 2
*
NTRAINING
_
SAMPLES);
38 // The x coordinate of the points is in [0.6, 1]
39 c = trainClass.colRange(0 , 1);
40 rng.fill(c, RNG::UNIFORM, Scalar(0.6
*
WIDTH), Scalar(WIDTH));
41 // The y coordinate of the points is in [0, 1)
42 c = trainClass.colRange(1,2);
43 rng.fill(c, RNG::UNIFORM, Scalar(1), Scalar(HEIGHT));
44
45 //------------------ Set up the non-linearly separable part of the training data ---------------
46
47 // Generate random points for the classes 1 and 2
48 trainClass = trainData.rowRange( nLinearSamples, 2
*
NTRAINING
_
SAMPLES-nLinearSamples);
49 // The x coordinate of the points is in [0.4, 0.6)
50 c = trainClass.colRange(0,1);
51 rng.fill(c, RNG::UNIFORM, Scalar(0.4
*
WIDTH), Scalar(0.6
*
WIDTH));
52 // The y coordinate of the points is in [0, 1)
53 c = trainClass.colRange(1,2);
54 rng.fill(c, RNG::UNIFORM, Scalar(1), Scalar(HEIGHT));
55
56 //------------------------- Set up the labels for the classes ---------------------------------
57 labels.rowRange( 0, NTRAINING
_
SAMPLES).setTo(1); // Class 1
58 labels.rowRange(NTRAINING
_
SAMPLES, 2
*
NTRAINING
_
SAMPLES).setTo(2); // Class 2
59
60 //------------------------ 2. Set up the support vector machines parameters --------------------
61 CvSVMParams params;
62 params.svm
_
type = SVM::C
_
SVC;
63 params.C = 0.1;
64 params.kernel
_
type = SVM::LINEAR;
9.2. Support Vector Machines for Non-Linearly Separable Data 359
The OpenCV Tutorials, Release 2.4.3
65 params.term
_
crit = TermCriteria(CV
_
TERMCRIT
_
ITER, (int)1e7, 1e-6);
66
67 //------------------------ 3. Train the svm ----------------------------------------------------
68 cout << "Starting training process" << endl;
69 CvSVM svm;
70 svm.train(trainData, labels, Mat(), Mat(), params);
71 cout << "Finished training process" << endl;
72
73 //------------------------ 4. Show the decision regions ----------------------------------------
74 Vec3b green(0,100,0), blue (100,0,0);
75 for (int i = 0; i < I.rows; ++i)
76 for (int j = 0; j < I.cols; ++j)
77 {
78 Mat sampleMat = (Mat
_
<float>(1,2) << i, j);
79 float response = svm.predict(sampleMat);
80
81 if (response == 1) I.at<Vec3b>(j, i) = green;
82 else if (response == 2) I.at<Vec3b>(j, i) = blue;
83 }
84
85 //----------------------- 5. Show the training data --------------------------------------------
86 int thick = -1;
87 int lineType = 8;
88 float px, py;
89 // Class 1
90 for (int i = 0; i < NTRAINING
_
SAMPLES; ++i)
91 {
92 px = trainData.at<float>(i,0);
93 py = trainData.at<float>(i,1);
94 circle(I, Point( (int) px, (int) py ), 3, Scalar(0, 255, 0), thick, lineType);
95 }
96 // Class 2
97 for (int i = NTRAINING
_
SAMPLES; i <2
*
NTRAINING
_
SAMPLES; ++i)
98 {
99 px = trainData.at<float>(i,0);
100 py = trainData.at<float>(i,1);
101 circle(I, Point( (int) px, (int) py ), 3, Scalar(255, 0, 0), thick, lineType);
102 }
103
104 //------------------------- 6. Show support vectors --------------------------------------------
105 thick = 2;
106 lineType = 8;
107 int x = svm.get
_
support
_
vector
_
count();
108
109 for (int i = 0; i < x; ++i)
110 {
111 const float
*
v = svm.get
_
support
_
vector(i);
112 circle( I, Point( (int) v[0], (int) v[1]), 6, Scalar(128, 128, 128), thick, lineType);
113 }
114
115 imwrite("result.png", I); // save the Image
116 imshow("SVM for Non-Linear Training Data", I); // show it to the user
117 waitKey(0);
118 }
360 Chapter 9. ml module. Machine Learning
The OpenCV Tutorials, Release 2.4.3
Explanation
1. Set up the training data
The training data of this exercise is formed by a set of labeled 2D-points that belong to one of two different
classes. To make the exercise more appealing, the training data is generated randomly using a uniform
probability density functions (PDFs). We have divided the generation of the training data into two main
parts. In the rst part we generate data for both classes that is linearly separable.
// Generate random points for the class 1
Mat trainClass = trainData.rowRange(0, nLinearSamples);
// The x coordinate of the points is in [0, 0.4)
Mat c = trainClass.colRange(0, 1);
rng.fill(c, RNG::UNIFORM, Scalar(1), Scalar(0.4
*
WIDTH));
// The y coordinate of the points is in [0, 1)
c = trainClass.colRange(1,2);
rng.fill(c, RNG::UNIFORM, Scalar(1), Scalar(HEIGHT));
// Generate random points for the class 2
trainClass = trainData.rowRange(2
*
NTRAINING
_
SAMPLES-nLinearSamples, 2
*
NTRAINING
_
SAMPLES);
// The x coordinate of the points is in [0.6, 1]
c = trainClass.colRange(0 , 1);
rng.fill(c, RNG::UNIFORM, Scalar(0.6
*
WIDTH), Scalar(WIDTH));
// The y coordinate of the points is in [0, 1)
c = trainClass.colRange(1,2);
rng.fill(c, RNG::UNIFORM, Scalar(1), Scalar(HEIGHT));
In the second part we create data for both classes that is non-linearly separable, data that overlaps.
// Generate random points for the classes 1 and 2
trainClass = trainData.rowRange( nLinearSamples, 2
*
NTRAINING
_
SAMPLES-nLinearSamples);
// The x coordinate of the points is in [0.4, 0.6)
c = trainClass.colRange(0,1);
rng.fill(c, RNG::UNIFORM, Scalar(0.4
*
WIDTH), Scalar(0.6
*
WIDTH));
// The y coordinate of the points is in [0, 1)
c = trainClass.colRange(1,2);
rng.fill(c, RNG::UNIFORM, Scalar(1), Scalar(HEIGHT));
2. Set up SVMs parameters
See Also:
In the previous tutorial Introduction to Support Vector Machines there is an explanation of the atributes of
the class CvSVMParams that we congure here before training the SVM.
CvSVMParams params;
params.svm
_
type = SVM::C
_
SVC;
params.C = 0.1;
params.kernel
_
type = SVM::LINEAR;
params.term
_
crit = TermCriteria(CV
_
TERMCRIT
_
ITER, (int)1e7, 1e-6);
There are just two differences between the conguration we do here and the one that was done in the
previous tutorial that we use as reference.
CvSVM::C_SVC. We chose here a small value of this parameter in order not to punish too much the misclassication errors in the optimization. The idea of doing this stems from the will of obtaining a solution close to the one intuitively expected. However, we recommend to get a better insight of the problem by making adjustments to this parameter.
Note: Here there are just very few points in the overlapping region between classes, giving
a smaller value to FRAC_LINEAR_SEP the density of points can be incremented and the
impact of the parameter CvSVM::C_SVC explored deeply.
9.2. Support Vector Machines for Non-Linearly Separable Data 361
The OpenCV Tutorials, Release 2.4.3
Termination Criteria of the algorithm. The maximum number of iterations has to be increased
considerably in order to solve correctly a problem with non-linearly separable training data. In
particular, we have increased in ve orders of magnitude this value.
3. Train the SVM
We call the method CvSVM::train to build the SVM model. Watch out that the training process may take
a quite long time. Have patiance when your run the program.
CvSVM svm;
svm.train(trainData, labels, Mat(), Mat(), params);
4. Show the Decision Regions
The method CvSVM::predict is used to classify an input sample using a trained SVM. In this example we
have used this method in order to color the space depending on the prediction done by the SVM. In other
words, an image is traversed interpreting its pixels as points of the Cartesian plane. Each of the points is
colored depending on the class predicted by the SVM; in dark green if it is the class with label 1 and in
dark blue if it is the class with label 2.
Vec3b green(0,100,0), blue (100,0,0);
for (int i = 0; i < I.rows; ++i)
for (int j = 0; j < I.cols; ++j)
{
Mat sampleMat = (Mat
_
<float>(1,2) << i, j);
float response = svm.predict(sampleMat);
if (response == 1) I.at<Vec3b>(j, i) = green;
else if (response == 2) I.at<Vec3b>(j, i) = blue;
}
5. Show the training data
The method circle is used to show the samples that compose the training data. The samples of the class
labeled with 1 are shown in light green and in light blue the samples of the class labeled with 2.
int thick = -1;
int lineType = 8;
float px, py;
// Class 1
for (int i = 0; i < NTRAINING
_
SAMPLES; ++i)
{
px = trainData.at<float>(i,0);
py = trainData.at<float>(i,1);
circle(I, Point( (int) px, (int) py ), 3, Scalar(0, 255, 0), thick, lineType);
}
// Class 2
for (int i = NTRAINING
_
SAMPLES; i <2
*
NTRAINING
_
SAMPLES; ++i)
{
px = trainData.at<float>(i,0);
py = trainData.at<float>(i,1);
circle(I, Point( (int) px, (int) py ), 3, Scalar(255, 0, 0), thick, lineType);
}
6. Support vectors
We use here a couple of methods to obtain information about the support vectors. The method
CvSVM::get_support_vector_count outputs the total number of support vectors used in the problem and
with the method CvSVM::get_support_vector we obtain each of the support vectors using an index. We
have used this methods here to nd the training examples that are support vectors and highlight them.
362 Chapter 9. ml module. Machine Learning
The OpenCV Tutorials, Release 2.4.3
thick = 2;
lineType = 8;
int x = svm.get
_
support
_
vector
_
count();
for (int i = 0; i < x; ++i)
{
const float
*
v = svm.get
_
support
_
vector(i);
circle( I, Point( (int) v[0], (int) v[1]), 6, Scalar(128, 128, 128), thick, lineType);
}
Results
The code opens an image and shows the training examples of both classes. The points of one class are repre-
sented with light green and light blue ones are used for the other class.
The SVM is trained and used to classify all the pixels of the image. This results in a division of the image in
a blue region and a green region. The boundary between both regions is the separating hyperplane. Since the
training data is non-linearly separable, it can be seen that some of the examples of both classes are misclassied;
some green points lay on the blue region and some blue points lay on the green one.
Finally the support vectors are shown using gray rings around the training examples.
You may observe a runtime instance of this on the YouTube here.
9.2. Support Vector Machines for Non-Linearly Separable Data 363
The OpenCV Tutorials, Release 2.4.3
364 Chapter 9. ml module. Machine Learning
CHAPTER
TEN
GPU MODULE. GPU-ACCELERATED
COMPUTER VISION
Squeeze out every little computation power fromyour systemby using the power of your video card to run the OpenCV
algorithms.
Title: Discovering the human retina and its use for image processing
Compatibility: > OpenCV 2.4
Author: Alexandre Benoit
You will learn how to process images and video streams with a model of
retina lter for details enhancement, spatio-temporal noise removal, lumi-
nance correction and spatio-temporal events detection.
375
The OpenCV Tutorials, Release 2.4.3
11.1 Discovering the human retina and its use for image processing
Goal
I present here a model of human retina that shows some interesting properties for image preprocessing and enhance-
ment. In this tutorial you will learn how to:
discover the main two channels outing from your retina
see the basics to use the retina model
discover some parameters tweaks
General overview
The proposed model originates fromJeanny Heraults research at Gipsa. It is involved in image processing applications
with Listic (code maintainer) lab. This is not a complete model but it already present interesting properties that can
be involved for enhanced image processing experience. The model allows the following human retina properties to be
used :
spectral whitening that has 3 important effects: high spatio-temporal frequency signals canceling (noise), mid-
frequencies details enhancement and low frequencies luminance energy reduction. This all in one property
directly allows visual signals cleaning of classical undesired distortions introduced by image sensors and input
luminance range.
local logarithmic luminance compression allows details to be enhanced even in low light conditions.
decorrelation of the details information (Parvocellular output channel) and transient information (events, motion
made available at the Magnocellular output channel).
The rst two points are illustrated below :
In the gure below, the OpenEXR image sample CrissyField.exr, a High Dynamic Range image is shown. In order to
make it visible on this web-page, the original input image is linearly rescaled to the classical image luminance range
[0-255] and is converted to 8bit/channel format. Such strong conversion hides many details because of too strong local
contrasts. Furthermore, noise energy is also strong and pollutes visual information.
376 Chapter 11. contrib module. The additional contributions made available !
The OpenCV Tutorials, Release 2.4.3
In the following image, as your retina does, local luminance adaptation, spatial noise removal and spectral whitening
work together and transmit accurate information on lower range 8bit data channels. On this picture, noise in signi-
cantly removed, local details hidden by strong luminance contrasts are enhanced. Output image keeps its naturalness
and visual content is enhanced.
11.1. Discovering the human retina and its use for image processing 377
The OpenCV Tutorials, Release 2.4.3
Note : image sample can be downloaded from the OpenEXR website. Regarding this demonstration, be-
fore retina processing, input image has been linearly rescaled within 0-255 keeping its channels oat for-
mat. 5% of its histogram ends has been cut (mostly removes wrong HDR pixels). Check out the sample
opencv/samples/cpp/OpenEXRimages_HighDynamicRange_Retina_toneMapping.cpp for similar processing. The fol-
lowing demonstration will only consider classical 8bit/channel images.
The retina model output channels
The retina model presents two outputs that benet from the above cited behaviors.
The rst one is called the Parvocellular channel. It is mainly active in the foveal retina area (high resolution
central vision with color sensitive photo-receptors), its aim is to provide accurate color vision for visual details
remaining static on the retina. On the other hand objects moving on the retina projection are blurred.
The second well known channel is the Magnocellular channel. It is mainly active in the retina peripheral vision
and send signals related to change events (motion, transient events, etc.). These outing signals also help visual
system to focus/center retina on transient/moving areas for more detailed analysis thus improving visual scene
context and object classication.
NOTE : regarding the proposed model, contrary to the real retina, we apply these two channels on the entire input
images using the same resolution. This allows enhanced visual details and motion information to be extracted on all
the considered images... but remember, that these two channels are complementary. For example, if Magnocellular
channel gives strong energy in an area, then, the Parvocellular channel is certainly blurred there since there is a transient
event.
As an illustration, we apply in the following the retina model on a webcam video stream of a dark visual scene. In this
visual scene, captured in an amphitheater of the university, some students are moving while talking to the teacher.
378 Chapter 11. contrib module. The additional contributions made available !
The OpenCV Tutorials, Release 2.4.3
In this video sequence, because of the dark ambiance, signal to noise ratio is low and color artifacts are present on
visual features edges because of the low quality image capture tool-chain.
Below is shown the retina foveal vision applied on the entire image. In the used retina conguration, global luminance
is preserved and local contrasts are enhanced. Also, signal to noise ratio is improved : since high frequency spatio-
temporal noise is reduced, enhanced details are not corrupted by any enhanced noise.
11.1. Discovering the human retina and its use for image processing 379
The OpenCV Tutorials, Release 2.4.3
Below is the output of the Magnocellular output of the retina model. Its signals are strong where transient events
occur. Here, a student is moving at the bottom of the image thus generating high energy. The remaining of the image
is static however, it is corrupted by a strong noise. Here, the retina lters out most of the noise thus generating low
false motion area alarms. This channel can be used as a transient/moving areas detector : it would provide relevant
information for a low cost segmentation tool that would highlight areas in which an event is occurring.
380 Chapter 11. contrib module. The additional contributions made available !
The OpenCV Tutorials, Release 2.4.3
Retina use case
This model can be used basically for spatio-temporal video effects but also in the aim of :
performing texture analysis with enhanced signal to noise ratio and enhanced details robust against input images
luminance ranges (check out the Parvocellular retina channel output)
performing motion analysis also taking benet of the previously cited properties.
For more information, refer to the following papers :
Benoit A., Caplier A., Durette B., Herault, J., Using Human Visual System Modeling For Bio-Inspired Low
Level Image Processing, Elsevier, Computer Vision and Image Understanding 114 (2010), pp. 758-773. DOI
<http://dx.doi.org/10.1016/j.cviu.2010.01.011>
Please have a look at the reference work of Jeanny Herault that you can read in his book :
Vision: Images, Signals and Neural Networks: Models of Neural Processing in Visual Perception (Progress in Neural
Processing),By: Jeanny Herault, ISBN: 9814273686. WAPI (Tower ID): 113266891.
This retina lter code includes the research contributions of phd/research collegues from which code has been redrawn
by the author :
take a look at the retinacolor.hpp module to discover Brice Chaix de Lavarene phD color mosaicing/demosaicing
and his reference paper: B. Chaix de Lavarene, D. Alleysson, B. Durette, J. Herault (2007). Efcient demo-
saicing through recursive ltering, IEEE International Conference on Image Processing ICIP 2007
11.1. Discovering the human retina and its use for image processing 381
The OpenCV Tutorials, Release 2.4.3
take a look at imagelogpolprojection.hpp to discover retina spatial log sampling which originates from
Barthelemy Durette phd with Jeanny Herault. A Retina / V1 cortex projection is also proposed and originates
from Jeannys discussions. ====> more information in the above cited Jeanny Heraultss book.
Code tutorial
Please refer to the original tutorial source code in le opencv_folder/samples/cpp/tutorial_code/contrib/retina_tutorial.cpp.
To compile it, assuming OpenCV is correctly installed, use the following command. It requires the opencv_core
(cv::Mat and friends objects management), opencv_highgui (display and image/video read) and opencv_contrib
(Retina description) libraries to compile.
// compile
gcc retina
_
tutorial.cpp -o Retina
_
tuto -lopencv
_
core -lopencv
_
highgui -lopencv
_
contrib
// Run commands : add log as a last parameter to apply a spatial log sampling (simulates retina sampling)
// run on webcam
./Retina
_
tuto -video
// run on video file
./Retina
_
tuto -video myVideo.avi
// run on an image
./Retina
_
tuto -image myPicture.jpg
// run on an image with log sampling
./Retina
_
tuto -image myPicture.jpg log
Here is a code explanation :
Retina denition is present in the contrib package and a simple include allows to use it
#include "opencv2/opencv.hpp"
Provide user some hints to run the program with a help function
// the help procedure
static void help(std::string errorMessage)
{
std::cout<<"Program init error : "<<errorMessage<<std::endl;
std::cout<<"\nProgram call procedure : retinaDemo [processing mode] [Optional : media target] [Optional LAST parameter: \"log\" to activate retina log sampling]"<<std::endl;
std::cout<<"\t[processing mode] :"<<std::endl;
std::cout<<"\t -image : for still image processing"<<std::endl;
std::cout<<"\t -video : for video stream processing"<<std::endl;
std::cout<<"\t[Optional : media target] :"<<std::endl;
std::cout<<"\t if processing an image or video file, then, specify the path and filename of the target to process"<<std::endl;
std::cout<<"\t leave empty if processing video stream coming from a connected video device"<<std::endl;
std::cout<<"\t[Optional : activate retina log sampling] : an optional last parameter can be specified for retina spatial log sampling"<<std::endl;
std::cout<<"\t set \"log\" without quotes to activate this sampling, output frame size will be divided by 4"<<std::endl;
std::cout<<"\nExamples:"<<std::endl;
std::cout<<"\t-Image processing : ./retinaDemo -image lena.jpg"<<std::endl;
std::cout<<"\t-Image processing with log sampling : ./retinaDemo -image lena.jpg log"<<std::endl;
std::cout<<"\t-Video processing : ./retinaDemo -video myMovie.mp4"<<std::endl;
std::cout<<"\t-Live video processing : ./retinaDemo -video"<<std::endl;
std::cout<<"\nPlease start again with new parameters"<<std::endl;
std::cout<<"
****************************************************
"<<std::endl;
std::cout<<" NOTE : this program generates the default retina parameters file RetinaDefaultParameters.xml"<<std::endl;
std::cout<<" => you can use this to fine tune parameters and load them if you save to file RetinaSpecificParameters.xml"<<std::endl;
}
Then, start the main program and rst declare a cv::Mat matrix in which input images will be loaded. Also allocate a
cv::VideoCapture object ready to load video streams (if necessary)
382 Chapter 11. contrib module. The additional contributions made available !
The OpenCV Tutorials, Release 2.4.3
int main(int argc, char
*
argv[]) {
// declare the retina input buffer... that will be fed differently in regard of the input media
cv::Mat inputFrame;
cv::VideoCapture videoCapture; // in case a video media is used, its manager is declared here
In the main program, before processing, rst check input command parameters. Here it loads a rst input image
coming from a single loaded image (if user chose command -image) or from a video stream (if user chose command
-video). Also, if the user added log command at the end of its program call, the spatial logarithmic image sampling
performed by the retina is taken into account by the Boolean ag useLogSampling.
// welcome message
std::cout<<"
****************************************************
"<<std::endl;
std::cout<<"
*
Retina demonstration : demonstrates the use of is a wrapper class of the Gipsa/Listic Labs retina model."<<std::endl;
std::cout<<"
*
This demo will try to load the file RetinaSpecificParameters.xml (if exists).\nTo create it, copy the autogenerated template RetinaDefaultParameters.xml.\nThen twaek it with your own retina parameters."<<std::endl;
// basic input arguments checking
if (argc<2)
{
help("bad number of parameter");
return -1;
}
bool useLogSampling = !strcmp(argv[argc-1], "log"); // check if user wants retina log sampling processing
std::string inputMediaType=argv[1];
//////////////////////////////////////////////////////////////////////////////
// checking input media type (still image, video file, live video acquisition)
if (!strcmp(inputMediaType.c
_
str(), "-image") && argc >= 3)
{
std::cout<<"RetinaDemo: processing image "<<argv[2]<<std::endl;
// image processing case
inputFrame = cv::imread(std::string(argv[2]), 1); // load image in RGB mode
}else
if (!strcmp(inputMediaType.c
_
str(), "-video"))
{
if (argc == 2 || (argc == 3 && useLogSampling)) // attempt to grab images from a video capture device
{
videoCapture.open(0);
}else// attempt to grab images from a video filestream
{
std::cout<<"RetinaDemo: processing video stream "<<argv[2]<<std::endl;
videoCapture.open(argv[2]);
}
// grab a first frame to check if everything is ok
videoCapture>>inputFrame;
}else
{
// bad command parameter
help("bad command parameter");
return -1;
}
Once all input parameters are processed, a rst image should have been loaded, if not, display error and stop program
:
if (inputFrame.empty())
{
11.1. Discovering the human retina and its use for image processing 383
The OpenCV Tutorials, Release 2.4.3
help("Input media could not be loaded, aborting");
return -1;
}
Now, everything is ready to run the retina model. I propose here to allocate a retina instance and to manage the
eventual log sampling option. The Retina constructor expects at least a cv::Size object that shows the input data size
that will have to be managed. One can activate other options such as color and its related color multiplexing strategy
(here Bayer multiplexing is chosen using enum cv::RETINA_COLOR_BAYER). If using log sampling, the image
reduction factor (smaller output images) and log sampling strengh can be adjusted.
// pointer to a retina object
cv::Ptr<cv::Retina> myRetina;
// if the last parameter is log, then activate log sampling (favour foveal vision and subsamples peripheral vision)
if (useLogSampling)
{
myRetina = new cv::Retina(inputFrame.size(), true, cv::RETINA
_
COLOR
_
BAYER, true, 2.0, 10.0);
}
else// -> else allocate "classical" retina :
myRetina = new cv::Retina(inputFrame.size());
Once done, the proposed code writes a default xml le that contains the default parameters of the retina. This is useful
to make your own cong using this template. Here generated template xml le is called RetinaDefaultParameters.xml.
// save default retina parameters file in order to let you see this and maybe modify it and reload using method "setup"
myRetina->write("RetinaDefaultParameters.xml");
In the following line, the retina attempts to load another xml le called RetinaSpecicParameters.xml. If you created
it and introduced your own setup, it will be loaded, in the other case, default retina parameters are used.
// load parameters if file exists
myRetina->setup("RetinaSpecificParameters.xml");
It is not required here but just to show it is possible, you can reset the retina buffers to zero to force it to forget past
events.
// reset all retina buffers (imagine you close your eyes for a long time)
myRetina->clearBuffers();
Now, it is time to run the retina ! First create some output buffers ready to receive the two retina channels outputs
// declare retina output buffers
cv::Mat retinaOutput
_
parvo;
cv::Mat retinaOutput
_
magno;
Then, run retina in a loop, load new frames from video sequence if necessary and get retina outputs back to dedicated
buffers.
// processing loop with no stop condition
while(true)
{
// if using video stream, then, grabbing a new frame, else, input remains the same
if (videoCapture.isOpened())
videoCapture>>inputFrame;
// run retina filter on the loaded input frame
myRetina->run(inputFrame);
// Retrieve and display retina output
myRetina->getParvo(retinaOutput
_
parvo);
384 Chapter 11. contrib module. The additional contributions made available !
The OpenCV Tutorials, Release 2.4.3
myRetina->getMagno(retinaOutput
_
magno);
cv::imshow("retina input", inputFrame);
cv::imshow("Retina Parvo", retinaOutput
_
parvo);
cv::imshow("Retina Magno", retinaOutput
_
magno);
cv::waitKey(10);
}
Thats done ! But if you want to secure the system, take care and manage Exceptions. The retina can throw some
when it sees irrelevant data (no input frame, wrong setup, etc.). Then, i recommend to surround all the retina code by
a try/catch system like this :
try{
// pointer to a retina object
cv::Ptr<cv::Retina> myRetina;
[---]
// processing loop with no stop condition
while(true)
{
[---]
}
}catch(cv::Exception e)
{
std::cerr<<"Error using Retina : "<<e.what()<<std::endl;
}
Retina parameters, what to do ?
First, it is recommended to read the reference paper :
Benoit A., Caplier A., Durette B., Herault, J., Using Human Visual System Modeling For Bio-Inspired Low
Level Image Processing, Elsevier, Computer Vision and Image Understanding 114 (2010), pp. 758-773. DOI
<http://dx.doi.org/10.1016/j.cviu.2010.01.011>
Once done open the conguration le RetinaDefaultParameters.xml generated by the demo and lets have a look at it.
<?xml version="1.0"?>
<opencv
_
storage>
<OPLandIPLparvo>
<colorMode>1</colorMode>
<normaliseOutput>1</normaliseOutput>
<photoreceptorsLocalAdaptationSensitivity>7.0e-01</photoreceptorsLocalAdaptationSensitivity>
<photoreceptorsTemporalConstant>5.0e-01</photoreceptorsTemporalConstant>
<photoreceptorsSpatialConstant>5.3e-01</photoreceptorsSpatialConstant>
<horizontalCellsGain>0.</horizontalCellsGain>
<hcellsTemporalConstant>1.</hcellsTemporalConstant>
<hcellsSpatialConstant>7.</hcellsSpatialConstant>
<ganglionCellsSensitivity>7.0e-01</ganglionCellsSensitivity></OPLandIPLparvo>
<IPLmagno>
<normaliseOutput>1</normaliseOutput>
<parasolCells
_
beta>0.</parasolCells
_
beta>
<parasolCells
_
tau>0.</parasolCells
_
tau>
<parasolCells
_
k>7.</parasolCells
_
k>
<amacrinCellsTemporalCutFrequency>1.2e+00</amacrinCellsTemporalCutFrequency>
<V0CompressionParameter>9.5e-01</V0CompressionParameter>
<localAdaptintegration
_
tau>0.</localAdaptintegration
_
tau>
<localAdaptintegration
_
k>7.</localAdaptintegration
_
k></IPLmagno>
</opencv
_
storage>
11.1. Discovering the human retina and its use for image processing 385
The OpenCV Tutorials, Release 2.4.3
Here are some hints but actually, the best parameter setup depends more on what you want to do with the retina rather
than the images input that you give to retina. Apart from the more specic case of High Dynamic Range images
(HDR) that require more specic setup for specic luminance compression objective, the retina behaviors should be
rather stable from content to content. Note that OpenCV is able to manage such HDR format thanks to the OpenEXR
images compatibility.
Then, if the application target requires details enhancement prior to specic image processing, you need to know if
mean luminance information is required or not. If not, the the retina can cancel or signicantly reduce its energy thus
giving more visibility to higher spatial frequency details.
Basic parameters
The most simple parameters are the following :
colorMode : let the retina process color information (if 1) or gray scale images (if 0). In this last case, only the
rst channel of the input will be processed.
normaliseOutput : each channel has this parameter, if value is 1, then the considered channel output is rescaled
between 0 and 255. Take care in this case at the Magnocellular output level (motion/transient channel detection).
Residual noise will also be rescaled !
Note : using color requires color channels multiplexing/demultipexing which requires more processing. You can
expect much faster processing using gray levels : it would require around 30 product per pixel for all the retina
processes and it has recently been parallelized for multicore architectures.
Photo-receptors parameters
The following parameters act on the entry point of the retina - photo-receptors - and impact all the following processes.
These sensors are low pass spatio-temporal lters that smooth temporal and spatial data and also adjust there sensitivity
to local luminance thus improving details extraction and high frequency noise canceling.
photoreceptorsLocalAdaptationSensitivity between 0 and 1. Values close to 1 allow high luminance log
compression effect at the photo-receptors level. Values closer to 0 give a more linear sensitivity. Increased alone,
it can burn the Parvo (details channel) output image. If adjusted in collaboration with ganglionCellsSensitivity
images can be very contrasted whatever the local luminance there is... at the price of a naturalness decrease.
photoreceptorsTemporalConstant this setups the temporal constant of the low pass lter effect at the entry of
the retina. High value lead to strong temporal smoothing effect : moving objects are blurred and can disappear
while static object are favored. But when starting the retina processing, stable state is reached lately.
photoreceptorsSpatialConstant species the spatial constant related to photo-receptors low pass lter effect.
This parameters specify the minimum allowed spatial signal period allowed in the following. Typically, this
lter should cut high frequency noise. Then a 0 value doesnt cut anything noise while higher values start to cut
high spatial frequencies and more and more lower frequencies... Then, do not go to high if you wanna see some
details of the input images ! A good compromise for color images is 0.53 since this wont affect too much the
color spectrum. Higher values would lead to gray and blurred output images.
Horizontal cells parameters
This parameter set tunes the neural network connected to the photo-receptors, the horizontal cells. It modulates photo-
receptors sensitivity and completes the processing for nal spectral whitening (part of the spatial band pass effect thus
favoring visual details enhancement).
386 Chapter 11. contrib module. The additional contributions made available !
The OpenCV Tutorials, Release 2.4.3
horizontalCellsGain here is a critical parameter ! If you are not interested by the mean luminance and focus
on details enhancement, then, set to zero. But if you want to keep some environment luminance data, let some
low spatial frequencies pass into the system and set a higher value (<1).
hcellsTemporalConstant similar to photo-receptors, this acts on the temporal constant of a low pass temporal
lter that smooths input data. Here, a high value generates a high retina after effect while a lower value makes
the retina more reactive.
hcellsSpatialConstant is the spatial constant of the low pass lter of these cells lter. It species the lowest
spatial frequency allowed in the following. Visually, a high value leads to very lowspatial frequencies processing
and leads to salient halo effects. Lower values reduce this effect but the limit is : do not go lower than the value
of photoreceptorsSpatialConstant. Those 2 parameters actually specify the spatial band-pass of the retina.
NOTE after the processing managed by the previous parameters, input data is cleaned from noise and luminance in
already partly enhanced. The following parameters act on the last processing stages of the two outing retina signals.
Parvo (details channel) dedicated parameter
ganglionCellsSensitivity species the strength of the nal local adaptation occurring at the output of this details
dedicated channel. Parameter values remain between 0 and 1. Low value tend to give a linear response while
higher values enforces the remaining low contrasted areas.
Note : this parameter can correct eventual burned images by favoring low energetic details of the visual scene, even
in bright areas.
IPL Magno (motion/transient channel) parameters
Once image information is cleaned, this channel acts as a high pass temporal lter that only selects signals related to
transient signals (events, motion, etc.). A low pass spatial lter smooths extracted transient data and a nal logarithmic
compression enhances low transient events thus enhancing event sensitivity.
parasolCells_beta generally set to zero, can be considered as an amplier gain at the entry point of this pro-
cessing stage. Generally set to 0.
parasolCells_tau the temporal smoothing effect that can be added
parasolCells_k the spatial constant of the spatial ltering effect, set it at a high value to favor low spatial
frequency signals that are lower subject to residual noise.
amacrinCellsTemporalCutFrequency species the temporal constant of the high pass lter. High values let
slow transient events to be selected.
V0CompressionParameter species the strength of the log compression. Similar behaviors to previous de-
scription but here it enforces sensitivity of transient events.
localAdaptintegration_tau generally set to 0, no real use here actually
localAdaptintegration_k species the size of the area on which local adaptation is performed. Low values lead
to short range local adaptation (higher sensitivity to noise), high values secure log compression.
11.1. Discovering the human retina and its use for image processing 387
The OpenCV Tutorials, Release 2.4.3
388 Chapter 11. contrib module. The additional contributions made available !
CHAPTER
TWELVE
OPENCV IOS