Project Report On 2factor Authentication
Project Report On 2factor Authentication
Project Report On 2factor Authentication
Is approved for the Degree of Master of Engineering in Computer Science and Engineering
by
Shital Digamber Gheware
Dissertation Entitled
Is approved for the Degree of Master of Engineering in Computer Science and Engineering
of
Swami Ramanand Teerth Marathwada University, Nanded
2017-2018
Certificate
This is to certify that dissertation entitled “Accurate Text Detection in Natural Scene Images using
MSER Approach " is bona-fide record of dissertation carried out by Shital Digamber Gheware under
my supervision and guidance for the award of degree of "Master of Computer Science & Engineering"
as per the requirements of M. S. Bidve Engineering College, Latur, (M.S.), India. In my opinion, the
work presented in this dissertation report is of the standard required for the award of degree.
Submitted a satisfactory report about it as per the curriculum of Swami Ramanand Teerth
Marathwada University, Nanded.
I hereby declare that I have formed, completed and written the report entitled “Accurate
Text Detection in Natural Scene Images using MSER Approach”. It has not
previously submitted for the basis of the award of any degree or diploma or other similar
title of this for any other examining body or university.
Place: Latur
Date:
Shital D. Gheware
M.E. II Year
Dept. of Computer Science and
Engineering
ACKNOWLEDGEMENT
With profound sense of regards and gratitude. I thank Prof. N.G. Dharashive for his
valuable guidance incessant interest and constructive suggestions during the course of
dissertation. This dissertation wouldn’t have been possible without his zeal and interest
through the task, right from the beginning. I thank him for his valuable and immense
knowledge and timely help, which made this, project a reality.
I also take this opportunity to thank Prof. S.R. Tandle Head of Computer Science
and Engineering Department for his kind permission towards undertaking this
dissertation work and his keen encouragement during the course of this dissertation.
I also take this opportunity to thank Prof. N.J.Pathan ME Coordinator, for his
kind permission towards undertaking this dissertation work and his keen encouragement
during the course of this dissertation.
I feel grateful to record my pleasant thanks to all staff members of the Computer
Science and Engineering Department for their support, help and assistance which they
extend as and when required.
I deeply thankful to our Principal Prof. N.B. Khatod and the college for providing
us with a platform to excel in life.
Last but by no means the least, I am extremely grateful to my batch mates for their
consistent help.
ME(CSE)
I
Abstract
Text detection method discovers the presence of text in images, videos, etc. This technique
is very needful for many applications, based on content based image analysis, such as web
image search, map analysis, video information retrieval, etc. It is very challenging to
detect the text from natural scene images due to its complex background and noise. The
objective is to design a text detection system which will be able to detect maximum
characters from natural scene images.
In this text detection system, first, the input natural scene image is pre-processed. The
input color image is converted into grey scale image. Then, in the next stage, character
candidates are extracted from gray scale image using the MSERs region detector method.
MSER will extract various features from input image and then divide it into number of
different regions. Then stroke width transform is applied which computes the per pixel the
width of the most likely stroke containing the pixels. Then, morphological filter is applied
to remove noise and unwanted regions detected by MSER. After morphological operation
some heuristic rules are applied to these regions; which removes non-text candidates. The
probability of text is estimated by calculating features like height, width and area of
contours. Finally, text candidates corresponding to true texts are constructed and displayed
using rectangle and text is shown on the command prompt.
II
INDEX
Acknowledgement I
Abstract II
List of Figures VI
III
2.2.8 Natural scene text detection with 25
multi-layer segmentation and higher order
conditional random field based analysis
2.2.9 Text detection approach based on 25
IV
CHAPTER 4 PERFORMANANCE ANALYSIS 53
4.1 Datasets 54
4.1.1Sample Dataset 54
4.1.2ICDAR 2011 dataset 54
4.2 Snapshot of the text detection System 56
4.3 Accuracy of the system. 68
4.4 Comparative result of Different methods. 70
4.5 Chapter Summery. 71
V
List of Figures
Fig No Figure Name Page No
1.1 Document text image 3
1.2 Scene text image 3
labeling via graph cut. (a) The flowchart. (b) MSERs in the
Original image.(c) The construction of the graph 21
2.4 Flowchart of the proposed system 23
3.1 Mathematical model for text detection system 29
VI
Fig No Figure Name Page No
4.2 ICDAR 2011 dataset images 55
4.3 GUI of text detection system 56
4.5 When we press get image button the open window is going to open 57
4.17 The result of image having different font size of each character 63
VII
List of Tables
Table No Table Name Page No
2.1 Comparative analysis of literature survey 10
VIII
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
CHAPTER 1
INTRODUCTION
This chapter describes the introduction to Text detection in natural scene images
using MSER.
MSBECL Page 1
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
INTRODUCTION
1.2LITERATURE SURVEY
Many methods for text detection in natural scene images have been proposed over the
past years; we will briefly review some of these text detection methods and the
comparative study of these methods
1.2.1Image types
In general, the images are divided into three different types : document image, scene
image and born-digital images[1][20]. In this dissertation work only natural scene
image is considered for the detection of text.
A. Document images
Document images are nothing but image-format of the document. Image format of
MSBECL Page 2
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
B. Scene images
Scene images contain the text, such as the advertising boards, banners, which is
captured by the cameras; therefore scene text appears with the background part of the
scene. These types of images are very challenging to detect and recognize, because
the backgrounds are complex, containing the text in different sizes, styles and
alignments. Also, scene text is affected by lighting conditions and perspective
distortions.
MSBECL Page 3
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
C. Born-digital images
Born-digital images are generated by computer software and are saved as digital
images. Compared with document images and scene images, there are more defects in
born digital images, such as more complex foreground/background, low resolution,
compression loss, and severe edge softness. Therefore, during the text extraction from
born-digital images, it is difficult to distinct the text from the background.
D. Heterogeneous images
This type of image contains the combination of all above given images i.e. it can have
a digital image with scene text and document text.
MSBECL Page 4
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
C. Hybrid methods
To overcome the problems of sliding window and connected component methods, this
method is introduced, which combines the advantages of both for better result in text
detection[2].
Advantages:
1) Region-based information is very helpful for text component segmentation
and analysis;
2) This method differentiates text components from non-text components better
than other;
3) learning-based energy minimization method can group text components into
text lines (words) robustly.
Disadvantage: Complex text segmentation.
MSBECL Page 5
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
1.2.3 Factors that should be considered while detecting text from Scene Images
a) Font style, size (height, width) and thickness (stroke width)[14];
e) Alignment;
g) Illumination;
h) Language;
i) Resolution;
j) Contrast;
1. The text appeared in image that does not represent any important contents related to
image, that referred as scene text.
2. The text which produced separately from the image is good key to understand the
image, is called as an artificial text.
In contrast to scene text, artificial text is not only an important source of information
but also a significant entity for indexing and retrieval purposes. So it is very
challenging task to detect, segment, recognize and retrieve text from an image with
accuracy and robustness of the image contents.
MSBECL Page 6
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
3) Hybrid method:
The hybrid method presented by Pan et al exploits a region detector to detect text
candidates and extracts connected components as character candidates by local
binarization; non-characters are eliminated with a Conditional Random Fields model,
and characters can finally be grouped into text [1].
MSBECL Page 7
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
MSBECL Page 8
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
with a variety of orientations, and features of text can be extracted from combinations
and distributions of the stroke components. One feature that separates text from other
elements of a scene is its nearly constant stroke feature like stroke width. This can be
utilized to recover regions that are likely to contain text. For stroke- based methods,
text stroke candidates are extracted by segmentation, verified by feature extraction
and classification, and grouped together by clustering. These methods are easy to
implement on specific applications because of the intuition and simplicity. However,
complex backgrounds make text strokes hard to segment and verify.
2. Region size - MSER tended to detect many small regions, versus large regions
which not cover a planar part of the scene.
MSBECL Page 9
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
5. Blur - MSER proved to be the most sensitive to this type of change in image,
which is the only area that this type of detection is lacking in.
6. Light change - MSER showed the highest repeatability score for this type of
scene, with all the other having good robustness as well.
Table below shows the comparative analysis of different methods by their accuracy
and datasets used.
Author’s f- Dataset
Name Year Precision Recall Methodology
measure Used
Lukas
Neumann 2011 68.9 52.5 59.6 MSER ++ ICDAR
et.al.[1] 2011
Lukas ICDAR
2 stage
Neumann 2011,
2012 73.1 64.7 68.7 Algorithm for Street View
et.al.[18]
ERs pruning. Text
Dataset
Cunzhao Graph cut
Shi 2013 83.3 63.1 71.8 model ICDAR
et.al.[19] with MSER 2011
ICDAR
2011,
MSER as Multilingual
Xu-Cheng Yin 2013 86.29 68.26 76.22 Character DB, Street
et.al.[1] candidate view
DB,Multi-
orientation
DB.
MSBECL Page 10
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
Author’s f- Dataset
Use of
snooper text,
Rodrigo toggle- ITW, SVT,
Minetto 2014 0.74 0.63 0.68 Mapping EPS,
et.al.[24] image ICD DB.
segmentation,
HOG-based
descrip
tor
Obtaining ICDAR2003
B.H.Shekar 2015 0.84 0.79 0.82 skeleton ,
et.al.[21] using ICDAR
morphology 2011
hierarchicalcl USTB-
SV1K
ustering with DB,TD500
a unified MSRA-DB,
Xu- ChengYin distance
2015 0.81 0.63 0.71 ICDAR2011
et.al.[22] metric
,
learning
ICDAR
framework.
2013.
MSBECL Page 11
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
1.2.6 Applications.[3]
Text detection, segmentation and extraction from complex images can be applied to a
variety of fields where the information needs to be analyzed and understood. Some of
these applications are given below:
3) Super map: Text extraction technology can be applied to detect scene text from
images taken with laptops, phones and other equipments, so as to be applied to
maps, navigation, automatic translation, foreign-related tour guides, walking
robots and intelligent monitoring system [1] and also used as visual impaired
peoples assistance [9].
4) Vehicle testing: Vehicle license and scene subtitles have many features in
common, so text extraction can be used to supervise the traffic in real time.
After text extraction from highway video flow, the traffic situation can be
overseen and vehicle licenses can be recognized easily from traffic accidents,
which can improve the efficiency of the transportation systems [1].
5) Optical character reading: Reads text from paper and translates images into a
form that computer can manipulate (for example, into ASCII codes). An OCR
system enables to take a book, feed it directly into an electronic computer file,
and then edit the file using a word processor.
MSBECL Page 12
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
9) Online electric goods search: Online shopping applications using mobile phone
allows customer to type the name of goods and get required information about
it with images and descriptions [12].
MSBECL Page 13
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
_________________________________
CHAPTER 2
BACKGROUND AND RELATED WORK
In this chapter, we review some of the existing work in the field Text detection in
natural scene images by using MSER technique and without using MSER technique.
MSBECL Page 14
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
2.1 Overview
MSER-based methods have demonstrated very promising performance in many real
projects. However, current MSER-based methods still have some key limitations, i.e.,
they may suffer from detecting of repeating components and also insufficient text
candidates construction algorithms. In this section, we will review the MSER-based
methods focusing on these two problems1) MSER pruning problem2) Text candidate
construction problem. The main advantage of MSER-based methods over traditional
connected component based methods roots in the usage of the MSERs algorithm for
character extraction. The MSERs algorithm is able to detect most characters even
when the image is in low quality (low resolution, strong noises, low contrast, etc.).
However, one severe but not so obvious pitfall of the MSERs algorithm is that most of
the detected MSERs are in fact repeating with each other. Repeating MSERs are
problematic for the latter character candidates grouping algorithm, thus most of the
repeating MSERs, apart from the MSERs that most likely correspond to character,
need to be removed before being fed to the character grouping algorithm.
MSBECL Page 15
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
Let I denote an image of n pixels and let P(I) denote set of all sub regions of the
image I. Let sL denote an arbitrary sequence of non-repeating image sub regions in equation
2.1
(2.1)
sL=( , ) ri € P (I); ri ≠rj ˅ i; j of length L
1 2,……..,
(2.2 )
let SL = ⋃
=1
Denote set of all sequences of length L and let S denote set of all sequences of
lengths up to in equation 2.2
⋃ =1
SL =
Given a verification function v : S →{0; 1}, the set of estimates (words) E. The
*
methods for text localization aim to find an optimal verification function v (s) so
that f-measure of precision and recall is maximized, where T denotes set of words
in the ground truth.
we extend this approach by using whole tree of MSER lattice induced by the
inclusion relation, in contrast to [6] where only root nodes (i.e. supremums of the
MSER lattice) were considered which implied that a high MSER margin had to be
used to maximize the number of root nodes which correspond to letters. If a lower
margin is used, the MSER detector finds more regions but only certain regions
correspond to characters. As shown in Figure 2.1, smaller MSERs are embedded
into bigger ones, thus forming a tree where only certain combinations of nodes
can be selected as letters, because in a word one letter cannot be embedded into
another. We refer to individual nodes of the MSER tree as MSER++ to emphasize
that multiple projections (gray, red, green and blue channel) are used and the
internal tree structure is taken into account.
MSBECL Page 16
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
Figure 2.1 MSER lattice induced by the inclusion relation. Only certain nodes
correspond to characters
C .Exhaustive search
Let M denote the set of MSER++ in the image I. Even though the cardinality of M is
linear in number of pixels, the cardinality of the set S of all sequences is still
exponential.
^ ^ ^
Let v1; v2, , , , vn denote “upper-bound” verification functions which determine
L
whether s is a subsequence of a text
=1
This decomposition allows efficient pruning of the exhaustive search, because non-
text subsequences are excluded without a need to build a complete sequence, which
L
prevents from a combinatorial explosion of enumerating the S sets of all sequences
of length L.
C. Verification functions
The function ^v1(r) is a SVM character classifier, which determines whether the
region is a character or not based on a set of region measurements (height ratio,
compactness, etc.) -The function is scale invariant, but not rotation invariant so
possible rotations had to be included in the training set. On average, the ^v1 function
correctly includes 83% of text regions whilst it correctly excludes 93% of non text
regions such as plants, trees or other random textures.
MSBECL Page 17
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
In the first classification stage, the probability of each ER being a character is estimated using
novel features calculated with O (1) complexity per region tested. Only ERs with locally
maximal probability are selected for the second stage, where the classification is improved
using more computationally expensive features. A highly efficient exhaustive search with
feedback loops is then applied to group ERs into words and to select the most probable
character segmentation. Finally, text is recognized in an OCR stage trained using synthetic
fonts. The method was evaluated on two public datasets. On the ICDAR 2011 dataset, the
method achieves state-of-the- art text localization results amongst published methods and it is
the first one to report results for end-to-end text recognition. On the more challenging Street
View Text dataset, the method achieves state-of-the-art recall. The robustness of the proposed
method against noise and low contrast of characters is demonstrated by “false positives”
caused by detected watermark text in the dataset.
MSBECL Page 18
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
the system; the image intensities are linearly adjusted to enhance the contrast.
Subsequently, MSER regions are efficiently extracted from the image and enhanced
using canny edges obtained from the original gray-scale image. As a next step, the
resulting CCs are filtered using geometric constraints on properties like aspect ratio
and number of holes. The stroke width information is robustly computed using a
distance transform and objects with high variation in stroke width are rejected. Text
candidates are grouped pair wise and form text lines. Finally, words within a text line
are separated, giving segmented word patches at the output of our system.
MSBECL Page 19
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
The flowchart of the proposed method is shown in Fig. 2. 3 .The main contributions
of this paper include four aspects.
1. MSERs detected in the original image, which are shown to be suitable for text
detection in the experiment (Neumann and Matas, 2010; Chen et al., 2011; Neumann
and Matas, 2011b), are used as basic CCs.
2. Effective features specially designed for MSERs are used to train a classifier to
estimate the probability of MSERs being text.
MSBECL Page 20
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
Fig 2.3 The flowchart of the algorithm and the illustration of MSER labeling via
graph cut. (a) The flowchart. (b) MSERs in the Original image. (c) The construction
of the graph
Concretely, as shown in Fig. 2.3, first, two kinds of MSERs, dark on-light and light-
on-dark ones, are detected. Then, we focus on the MSERs labeling process which
removes the non-text MSERs while also preserving text ones. To this end, a graph
whose nodes are the MSERs is first constructed, and the MSERs are then labeled as
text or non-text regions by minimizing the carefully designed unary and pair wise cost
MSBECL Page 21
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
MSBECL Page 22
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
2.2.7 Text detection for Multi- Orientation Scene Images Using Adaptive
Clustering.[22]
Detection of text in camera-based images is a vital requirement for several computer
vision applications. Text detection task is frequently challenging due to difficulties
like composite backgrounds, dissimilarities of text orientations, font, size, color. The
aim is to recognize text in a combine manner by searching for words from the image
into text areas or single character candidates. Text captured in natural scenes is most
of the times with multiple orientations and point of distortions. Currently most
research efforts focuses on horizontal orientation from images. To address same issues
a novel approach unified distance metric learning framework is proposed an adaptive
hierarchical clustering, which learns weights of the character candidates once at a
time and adaptively integrate different feature similarities. An effective multi-
orientation text detection system, which constructs the text character candidates by
grouping characters based on an adaptive clustering.
A hierarchical structure based 2-dimensional proximity matrix is design with the help
of an hierarchical clustering and also arrange data into a hierarchical structure manner.
The outcomes are typically presented by a binary tree or dendrogram. From these
outcomes different clusters of the data formation is done.
T (2.4)
d (xi, xj: ω) = ω vec(xi, xj),
where weight vector ?, vec (xi, xj) is the similarity vector of two variables xi and xj.
MSBECL Page 23
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
In this, the aim of this framework is to forming the two sets of clusters i.e. set S for
same pair of points and set D for different pair of points. The distance of pair of points
in set D is maximized and the same is minimized in set S. this framework is also
capable of providing tough and indicative problems which are responsible for the
formation of number of representative part of the problem, i.e., given the labeled
cluster set {Ck}mk=1(with m clusters), the following strategy is used to compute D
and S
^ ^ m (2.5)
D = {( , ) = arg min d(x, y; ω)} k=1
x ϵ Ck ,y ϵ C-k
S = {( ,) = arg max min d(x, y; ω)}mk=1 (2.6)
x ϵ Ck y ϵ Ck
Clustering types.
1) Morphology clustering (morphology-based grouping via clustering). By the
character appearances (color, stroke width and location differences), character
candidates are clustered into Nmor groups using single-link clustering.
MSBECL Page 24
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
2.2.8 Natural scene text detection with multi-layer segmentation and higher
order conditional random field based analysis [23]
Text detection in natural scene images is a hot and challenging problem in pattern
recognition and computer vision .Considering the complex situation in natural scene
images ,they propose a robust two steps method in this paper based on multi layer
segmentation and higher order conditional random field .Given an input image ,the
method separates text from its background by using multi layer segmentation, which
decomposes the input image into nine layers .Then ,the connected components in
these different layers are obtained as candidate text .these candidate text CCs are
verified by higher order CRF based analysis. Inspired from the multistage information
integration mechanism of visual brains, features from three different levels, including
separate CCs, CC pairs and CC strings, are integrated by higher order CRF model to
distinguish text from non text. The remaining CCs are then grouped into words for
easy evaluation.
2.2.9 Text detection approach based on confidence map and context information
[25]
Text information plays a significant role in many applications for providing more
descriptive and abstract information than other objects. In this paper, an approach
based on the confidence map and context information is proposed to robustly detect
text in natural scenes. Most of the conventional methods design sophisticated texture
features to describe the text regions, while we focus on building a confidence map
model by integrating the seed candidate appearance and the relationships with
Its adjacent candidates to highlight the texts from the background, and the candidates
with low confidence value will be removed. In order to improve the recall rate, the
text context information is adopted to regain the missing text regions. Finally, the text
MSBECL Page 25
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
lines are formed and further verified, and the words are obtained by calculating the
threshold to separate the intra word letters from the inter word letters.
MSBECL Page 26
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
______________________________________________
CHAPTER 3
PROPOSED SYSTEM OF TEXT DETECTION IN NATURAL SCENE IMAGES
USING MSER TECHNIQUE
______________________________________________
This chapter describes the proposed system development objective, requirement of
the system and implementation
_____________________________________________________________________
MSBECL Page 27
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
To design a system for text detection in natural scene images using MSER to extract
Maximally Stable Extremal Regions (MSER) as character candidates.
3.4 OBJECTIVES
The objectives of this project are as follows:
a. To develop a text detection system.
b. The main objective of literature survey is to compare various detection
methods and there results.
c. Creation of the natural scene image dataset.
d. To detect text from natural scene images.
e. To detect maximum characters from blur images.
MSBECL Page 28
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
Hardware Requirements
1. Processor : Pentium IV. ( and onwards ).
2. Memory ( RAM ) : 256 MB RAM.
3. Hard disk : 40GB
4. Web Cam / Camera
MSBECL Page 29
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
MSBECL Page 30
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
As shown in the figure, there are six different use cases and one actor. The actor can
be end user or the system that will upload the natural scene image to detect the text in
it. The use cases are uploading Natural scene image to text detection system,
uploading image, pre-processing, extracting character candidates using MSER
Algorithm, non-text elimination, text construction with rectangle and display of text.
Figure below shows the Level 0 Data flow diagram for text detection system:
As shown in the figure, the text detection system takes the natural scene image as
an input and produce Output as detected text.
Figure 3.3: Level 0 data flow diagram for Text Detection System
Figure 3.4: Level 1 data flow diagram for Text Detection System
As shown in figure 3.4, the flow of the system is from user can upload the natural
scene image and pre-process on it. After that image pass through the extraction,
Elimination and construction phase. Finally, it displays detected text from input
image.
MSBECL Page 31
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
Figure 3.5 below shows the Sequence diagram for text detection system:
As shown in figure, the sequence of text detection is as follows: First the user
will upload the natural scene image in the system as an input. After that, the
system will pre-process on the input image. It converts the RGB image into gray
scale. Then the character candidates are extracted using MSER algorithm. The
non-text is eliminated and the final text is constructed and displayed using OCR.
System architecture shows the first stage is taking the natural scene image which
is taken by web cam or camera. The pre processing stage is the resize which takes
MSBECL Page 32
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
input as a RGB or Grayscale or binary image .this function returns image B that is
scale times the size of A. In the next stage rgb2gray (RGB) function is used to convert
RGB images to grayscale by eliminating the hue and saturation information while
retaining the luminance. Here we can also use the global thresholding binarization
method [7][17]. But here we have used the method detectMSERFeatures method
which can take only gray image as a input.
Extremal Region
Let Rl be an extremal region in the input image, then the branch of the tree rooted at R
will be as:
) (3.2)
B ( Rl ) = (Rl,Rl+1, ...,Rl+∆
Where, ∆ is a parameter for region R;the variation (instability) of Rlis defined as :
V (Rl) = | Rl+ ∆Rl| ÷ | Rl| (3.3)
Where, | R |= number of pixels in R.
MSBECL Page 33
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
Linear Reduction
When MSER tree has only one child then the linear reduction algorithm is applied.
This will select one of the character from parent and child, which has minimum
variation and removes the another one. This procedure is applied to the MSER tree
recursively.
The algorithm works as below.
Given the MSER node t, the linear reduction algorithm first calculate number of
children of t; if t has no children, then it return t; if t has only one child, it get the root
c of child tree by first applying the linear reduction procedure to the child tree; then it
checks the variations of both parent and child; if t has a lower variation compared to
c, it link c to t and return t; otherwise return c; if t has more than
one child, it process these children using linear reduction procedure and link the
resulting trees to t.
Tree Accumulation
When MSERs has more than one child, then the tree accumulation algorithm is used.
This procedure returns a set of disconnected nodes.
The algorithm for tree accumulation works as follows.
For a given node t, tree accumulation calculates the number of children’s of t; if t has
no children, then it returns t; if t has more than two children, it creates an empty set C
MSBECL Page 34
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
and append the result of applying tree accumulation to children of t to C; if one of the
nodes in C has a lower variation than variation of t, it return C, otherwise it removes
children of t and return t.
Step 1: Apply tree accumulation on the linear reduction tree
Step 2: if T has more than two children’s then
Create empty set as C
For each c in the children of tree T
Apply C = C ∪ TREE ACCUMULATION(c)
In our project we have used the method detectMSERFeatures for getting the
MSERRegions object. This object contains information about MSER features detected
in 2D grayscale input image. We retrieve the pixels from this. Then we use the canny
edge detector method, it is an image processing technique for finding the boundaries
of objects within images .It works by detecting discontinuities in brightness. Edge
detection is used for image segmentation and data extraction in areas such as image
processing, computer vision and machine vision. Then apply the image gradient of
image I which return magnitude and direction of intensities. HelperGrowEdges Grow
edges along or opposite to gradients helperGrowEdges
(Edges, GradientDirection, TextPolarity) asymmetrically dilates binary image
edges in the direction specified by gradient Direction. Text Polarity is a string
specifying whether to grow along or opposite the gradient Direction, 'Light Text On
Dark' or 'Dark Text On Light', respectively. Then we check for connected
components by using the following method. CC = bwconncomp (BW) returns the
connected components CC found in BW. The binary image BW can have any
dimension. CC is a structure with four fields, Connectivity, Image Size, Num Objects,
and Pixel Idx List.
D = bwdist(BW) computes the Euclidean distance transform of the binary image BW.
For each pixel in BW, the distance transform assigns a number that is the distance
between that pixel and the nearest nonzero pixel of BW. bwdist uses the Euclidean
distance metric by default. BW can have any dimension. D is the same size as BW.
helperStrokeWidth Transforms distance image into stroke width image
StrokeWidthImage = helperStrokeWidth (Distance Image);
returns a Stroke Width Image computed from Distance Image, containing a value for
stroke width at each non-zero pixel in the Distance Image. Distance Image is a
Euclidean distance transform of a binary image computed by bwdist.
Another useful discriminator for text in images is the variation in stroke width within
each text candidate. Characters in most languages have a similar stroke width or
thickness throughout. It is therefore useful to remove regions where the stroke width
exhibits too much variation [1]. The stroke width image below is computed using the
helperStrokeWidth helper function. Note that most non-text regions show a large
variation in stroke width. These can now be filtered using the coefficient of stroke
width variation.
The Stroke Width Transform In this section I will describe the Stroke Width
Transform algorithm as it is presented in [1], with several additions and
enhancements. These additions will be discussed in further extent in the next section
‘The Application’. The algorithm receives an RGB image and returns an image of the
same size, where the regions of Suspected text is marked. It has 3 major steps: the
stroke width transform, grouping the pixels into letter candidates based on their stroke
width, and finally, grouping letter candidates into regions of text.
First, all pixels are initialized with ∞ as their stroke width. Then, we calculate the
edge map of the image by using the canny edge detector. We consider the edges as
MSBECL Page 36
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
possible stroke boundaries, and we wish to find the width of such stroke. If p is an
edge pixel, the direction of the gradient is roughly perpendicular to the orientation of
the stroke boundary. Therefore, the next step is to calculate the gradient direction
gp of the edge pixels, and follow the ray r=p+n*gp (n>0) until we find another edge
pixel q. If the gradient direction gq at q is roughly opposite to gp, then each pixel in
the ray is assigned the distance between p and q as their stroke width, unless it already
has a lower value. If, however, an edge pixel q is not found, or gq is not opposite to
gp, the ray is discarded.
MSBECL Page 37
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
MSBECL Page 38
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
3.8.5 Output
Fig 3.10 the output of text detection system a) Natural scene image b) Gray image c)
MSER regions of input image d) character candidates’ extractions e) Stroke width
transform of I/p image f) Joining individual characters g) Text extractions h) output
text on command window
MSBECL Page 39
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
3.9.1 General
MATLAB (matrix laboratory) is a numerical computing environment and fourth-
generation programming language. Developed by Math Works, MATLAB allows
matrix manipulations, plotting of functions and data, implementation of algorithms,
creation of user interfaces, and interfacing with programs written in other languages,
including C, C++, Java, and Fortran.
For Implementation of this project we have used MATLAB version 2014 software
Tool .MATLAB® is a high-level technical computing language and interactive
environment for algorithm development, data visualization, data analysis, and
numerical computation. Using MATLAB, you can solve technical computing
problems faster than with traditional programming languages, such as C, C++, and
Fortran. Matlab is a data analysis and visualization tool which has been designed with
powerful support for matrices and matrix operations. As well as this, Matlab has
excellent graphics capabilities, and its own powerful programming language. One of
the reasons that Matlab has become such an important tool is through the use of sets
of Matlab programs designed to support a particular task. These sets of programs are
called toolboxes, and the particular toolbox of interest to us is the image processing
toolbox. Rather than give a description of all of Matlab‟s capabilities, we shall restrict
ourselves to just those aspects concerned with handling of images. We shall introduce
functions, commands and techniques as required. A Matlab function is a keyword
which accepts various parameters, and produces some sort of output: for example a
matrix, a string, a graph. Examples of such functions are sin, imread, imclose. There
are many functions in Matlab, and as we shall see, it is very easy (and sometimes
necessary) to write our own. Matlab’s standard data type is the matrix all data are
considered to be matrices of some sort. Images, of course, are matrices whose
elements are the grey values (or possibly the RGB values) of its pixels. Single values
are considered by Matlab to be matrices, while a string is merely a matrix of
characters; being the string’s length. When you start up Matlab , you have a blank
window called the _Command Window_ in which you enter commands. Given the
vast number of Matlab’s functions, and the different parameters they can take, a
command line style interface is in fact much more efficient than a complex sequence
of pull-down menus .You can use MATLAB in a wide range of applications,
MSBECL Page 40
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
including signal and image processing, communications ,control design, test and
measurement financial modeling and analysis. Add-on toolboxes (collections of
special-purpose MATLAB functions) extend the MATLAB environment to solve
particular classes of problems in these application areas. MATLAB provides a number
of features for documenting and sharing your work. You can integrate your MATLAB
code with other languages and applications, and distribute your MATLAB algorithms
and applications. Although MATLAB is intended primarily for numerical computing,
an optional toolbox uses the MuPADsymbolic engine, allowing access to symbolic
computing capabilities. An additional package, Simulink, adds graphical multi-
domain simulation and Model-Based Design for dynamic and embedded systems.
In 2004, MATLAB had around one million users across industry and academia.
MATLAB users come from various backgrounds of engineering, science, and
economics. MATLAB is widely used in academic and research institutions as well as
industrial enterprises.
MATLAB was first adopted by researchers and practitioners in control engineering,
Little's specialty, but quickly spread to many other domains. It is now also used in
education, in particular the teaching of linear algebra and numerical analysis, and is
popular amongst scientists involved in image processing. The MATLAB application
is built around the MATLAB language. The simplest way to execute MATLAB code
is to type it in the Command Window, which is one of the elements of the MATLAB
Desktop. When code is entered in the Command Window, MATLAB can be used as
an interactive mathematical shell. Sequences of commands can be saved in a text file,
typically using the MATLAB Editor, as a script or encapsulated into a function,
extending the commands available. MATLAB provides a number of features for
documenting and sharing your work. You can integrate your MATLAB code with
other languages and applications, and distribute your MATLAB algorithms and
applications.
MSBECL Page 41
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
Uses of MATLAB
MATLAB is widely used as a computational tool in science and engineering
encompassing the fields of physics, chemistry, math and all engineering streams. It is
used in a range of applications including:
a. Signal processing and Communications
b. Image and video Processing
c. Control systems
MSBECL Page 42
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
MSBECL Page 43
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
Interface), which should not be confused with the unrelated Java that is also called
JMI.
As alternatives to the MuPAD based Symbolic Math Toolbox available from Math
Works, MATLAB can be connected to Maple or Mathematical. Libraries also exist to
import and export MathML.
The M Files
MATLAB allows writing two kinds of program files: Scripts - script files are
program files with .m extension. In these files, you write series of commands, which
you want to execute together. Scripts do not accept inputs and do not return any
outputs. They operate on data in the workspace.
Functions - functions files are also program files with .m extension. Functions can
accept inputs and return outputs. Internal variables are local to the function.
You can use the MATLAB editor or any other text editor to create your .m files. In
this section, we will discuss the script files. A script file contains multiple sequential
lines of MATLAB commands and function calls. You can run a script by typing its
name at the command line.
Development Environment
execution. With the MATLAB language, you can program and develop algorithms
faster than with traditional languages because you do not need to perform low-level
administrative tasks, such as declaring variables, specifying data types, and allocating
memory. In many cases, MATLAB eliminates the need for „for‟ loops. As a result,
one line of MATLAB code can often replace several lines of C or C++ code.
At the same time, MATLAB provides all the features of a traditional programming
language, including arithmetic operators, flow control, data structures, data types,
object-oriented programming (OOP), and debugging features.
MATLAB lets you execute commands or groups of commands one at a time, without
compiling and linking, enabling you to quickly iterate to the optimal solution. For fast
execution of heavy matrix and vector computations, MATLAB uses processor-
optimized libraries. For general-purpose scalar computations, MATLAB generates
machine-code instructions using its JIT (Just-In-Time) compilation technology. This
technology, which is available on most platforms, provides execution speeds that rival
those of traditional programming languages.
Development Tools
MATLAB includes development tools that help you implement your algorithm
efficiently. These include the following:
MATLAB Editor
Provides standard editing and debugging features, such as setting breakpoints and
single stepping
Code Analyzer
Checks your code for problems and recommends modifications to maximize
performance and maintainability
MATLAB Profiler
Records the time spent executing each line of code
Directory Reports
Scan all the files in a directory and report on code efficiency, file differences, file
dependencies, and code coverage
Designing Graphical User Interfaces
By using the interactive tool GUIDE (Graphical User Interface Development
Environment) to layout, design, and edit user interfaces. GUIDE lets you include list
boxes, pull-down menus, push buttons, radio buttons, and sliders, as well as
MSBECL Page 45
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
MATLAB plots and Microsoft ActiveX® controls. Alternatively, you can create GUIs
programmatically using MATLAB functions.
Data Analysis
MATLAB provides interactive tools and command-line functions for data analysis
operations, including:
a. Interpolating and decimating
g. Matrix analysis
Data Access
MATLAB is an efficient platform for accessing data from files, other applications,
databases, and external devices. You can read data from popular file formats, such as
Microsoft Excel; ASCII text or binary files; image, sound, and video files; and
scientific files, such as HDF and HDF5. Low-level binary file I/O functions let you
work with data files in any format. Additional functions let you read data from Web
pages and XML.
Visualizing Data
All the graphics features that are required to visualize engineering and scientific data are
available in MATLAB. These include 2-D and 3-D plotting functions, 3-D volume
visualization functions, tools for interactively creating plots, and the ability to export
results to all popular graphics formats. You can customize plots by adding multiple
MSBECL Page 46
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
axes; changing line colors and markers; adding annotation, Latex equations, and
legends; and drawing shapes
2-D Plotting
Visualizing vectors of data with 2-D plotting functions that create:
a. Line, area, bar, and pie charts.
c. Histograms.
e. Scatter/bubble plots.
f. Animations.
b. Image plots.
MSBECL Page 47
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
MATLAB can perform arithmetic on a wide range of data types, including doubles,
singles, and integers.
% StrokeWidthImage = helperStrokeWidth(DistanceImage);
% returns a Stroke Width Image computed from DistanceImage, containing a
% value for stroke width at each non-zero pixel in the DistanceImage.
% DistanceImage is a Euclidean distance transform of a binary image
% computed by bwdist.
%
DistanceImage = round(DistanceImage); % bins distances into integer values for
comparison
MSBECL Page 48
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
PaddedDistanceImage = padarray(DistanceImage,[1,1]);
Dind = find (PaddedDistanceImage ~= 0);
sz=size(paddedDistanceImage);
% Compare whether eight neighbors are less than current pixel for all
% pixels in image
neighborIndices = repmat(Dind,[1,8]);
[I,J] = ind2sub(sz,neighborIndices);
I = bsxfun(@plus,I,connectivity(1,:));
J = bsxfun(@plus,J,connectivity(2,:));
neighborIndices = sub2ind(sz,I,J);
lookup =
bsxfun(@lt,paddedDistanceImage(neighborIndices),paddedDistanceImage(Dind));
lookup(paddedDistanceImage(neighborIndices) == 0) = false;
MSBECL Page 49
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
HelperGrowEdges function
Function GradientGrownEdgesMask =
helperGrowEdges(Edges,GradientDirection,TextPolarity)
% helperGrowEdges Grow edges along or opposite to gradients
%
% GradientGrownEdgesMask =
% helperGrowEdges(Edges,GradientDirection,TextPolarity) Asymmetrically
% dilates binary image edges in the direction specified by
% gradient Direction. Text Polarity is a string specifying whether to grow
% along or opposite the gradient Direction, 'LightTextOnDark' or
% 'DarkTextOnLight', respectively.
% Quantize to 8 cardinal and ordinal directions
GrowthDirection=round ((GradientDirection + 180) / 360 * 8);
GrowthDirection(GrowthDirection == 0) = 8;
MSBECL Page 50
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
0,1,1,1,1,1,0;...
zeros(3,7)];
NW = strel(northwestTemplate);
SW = strel(rot90 (northwestTemplate));
SE = strel(rot90(northwestTemplate,2));
NE = strel(rot90(northwestTemplate,3));
Strels = [NE,N,NW,W,SW,S,SE,E];
% Initialize mask
GradientGrownEdgesMask = false(size(Edges));
image IM with the structuring element SE. The argument SE must be a single
structuring element object, as opposed to an array of objects. The morphological open
operation is an erosion followed by a dilation, using the same structuring element for
both operations.
IM2 = imclose(IM,SE)
IM2 = imclose(IM,SE) performs morphological closing on the grayscale or binary
image IM, returning the closed image, IM2. The structuring element, SE, must be a
single structuring element object, as opposed to an array of objects. The
morphological close operation is a dilation followed by an erosion, using the same
structuring element for both operations.
OCR
Recognize text using optical character recognition.
txt = ocr(I) returns an ocrText object containing optical character recognition
information from the input image, I. The object contains recognized text, text
location, and a metric indicating the confidence of the recognition result.
txt = ocr(I, roi) recognizes text in I within one or more rectangular regions. The roi
input contains an M-by-4 matrix, with M regions of interest.
MSBECL Page 52
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
CHAPTER 4
PERFORMANANCE ANALYSIS
_____________________________________________________________________
_____________________________________________________________________
MSBECL Page 53
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
4.1 DATASETS
MSBECL Page 54
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
MSBECL Page 55
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
For checking the robustness of text detection system we have taken different natural
scene images to retrieve text. Fig 4.4 to Fig 4.22 shows the system with scene image
having font size changed as well as the oriented images means the images having
vertical text, the image having noise in it and the images having blurred in it, the
images which are having street view. And the detected text is also shown in the
command window.
MSBECL Page 56
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
Fig 4.5 when we press get image button the open window is going to open
MSBECL Page 57
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
Fig 4.7 the result of MSER button press is displayed in second axis
MSBECL Page 58
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
MSBECL Page 59
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
MSBECL Page 60
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
MSBECL Page 61
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
MSBECL Page 62
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
Fig 4.16 The image having different font size of each character
Fig 4.17 The result of image having different font size of each character
MSBECL Page 63
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
Fig
MSBECL Page 64
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
MSBECL Page 65
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
MSBECL Page 66
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
MSBECL Page 67
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
Table shows the results of two different datasets by using Precision, Recall and f
measure values. Number of texts in input image is considered as the relevant
text, and the number of text detected by system is defined as retrieved text. If the
system detects non text regions as text, then it is considered as the false positives
in the output. And if the system does not detect the text region available in input
image, then it will be considered as false negative. The value for the same is
calculated using the equations below:
Precision = (No. of text in input image ∩ No. of detected text in output image) ÷
No. of detected text in output image (4.1)
OR
The value of f-measure is defined using the value of Precision and Recall
calculated above, it can be as below [10] :
F-measure = 2× (Precision × Recall) ÷ (Precision + Recall) (4.5)
MSBECL Page 68
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
Table 4.1 Result analysis using relevant and retrieved values of sample dataset
and ICDAR 2011
No. of text
No. of text detected in out- False False
Image No in input image put image retrieved(Fa Negative
(Relevant) (True lse positives)
Retrieved)
1 42 42 0 0
2 7 7 0 0
3 13 13 0 0
4 5 5 0 0
5 17 17 0 0
6 12 12 0 0
7 5 5 0 0
8 12 12 0 0
9 20 18 06 02
10 6 6 0 0
11 33 31 05 02
12 5 5 0 0
13 3 3 0 0
14 11 11 02 0
15 32 32 04 0
16 12 11 0 01
17 8 8 0 0
18 10 10 0 0
19 14 14 0 0
20 25 25 0 0
21 18 17 0 01
22 39 38 0 01
23 53 50 0 03
24 34 34 01 0
25 20 20 03 0
26 19 18 02 01
27 10 10 0 0
28 118 111 10 07
29 55 55 0 0
30 8 8 2 0
31 20 20 01 0
32 43 43 06 0
33 64 62 0 02
34 16 16 0 0
35 11 11 0 0
36 15 15 01 0
37 25 25 02 0
38 25 25 0 0
39 26 26 2 0
40 10 10 06 0
41 09 08 02 01
42 13 13 0 0
43 13 11 01 02
44 37 33 0 04
45 23 19 0 04
46 5 5 0 0
47 09 06 01 02
MSBECL Page 69
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
120
100
80
60
Recall
40 Precision
F - measure
20
As shown in above graph, the result of this text detection system is better than other
methods. This system gives 94.23 precision, 96.17 recall and 95.19 f-measure, which
is very high compare to other methods and calculated from values of Table 4.1.
MSBECL Page 70
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
In this and in the preceding chapters we have seen how our method gives better
results as compare to the other text detection systems. In this chapter we have taken
screenshots of the system with different images for example noise added image, blur
image, vertical text i.e. multi oriented image, the rotated image; the image is having
different font size for each character. This many images give us accurate result. The
chart is also drawn for showing the performance of the different systems on ICDAR
2011 dataset.
MSBECL Page 71
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
_____________________________________________
CHAPTER 5
_________________________________
MSBECL Page 72
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
5.1 Conclusion.
We have developed a text detection system which detects text present in the
Natural scene images as well as display the maximum characters on the
command window, for getting such accurate result we have used the MSER
technique. The MSER method is proven better results from various techniques, because
it easily finds the extremal regions from the input image. It finds most of the characters
from natural scene images, although having noise, blur, etc. For non text elimination
we have used text measurement rules, stroke width transform and morphological
operations so that it can give more accurate result. Finally for displaying text we
used the OCR technique .
Our contribution is we have developed such a system which detects
text and also shows the text present in it on the command window more
accurately. We also retrieve the text which is having different font size in a
image. We can also retrieve the text which is oriented.
The future enhancement of this system is to retrieve multilingual text from the natural
scene images. Also to detect highly blurred texts in low-resolution natural scene
images.
MSBECL Page 73
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
References
[1] Xu-Cheng Yin, Xuwang Yin, Kaizhu Huang, and Hong-Wei Hao, “Robust Text
Detection in Natural Scene Images”, IEEE transaction on Pattern Analysis And
Machine Intelligence, Vol:36,2013, pp. 970-983.
[2] Yi-Feng Pan, Xinwen Hou, and Cheng-Lin Liu, “A Hybrid Approach to Detect
and Localize Texts in Natural Scene Images”, IEEE Transactions On Image
Processing, Vol. 20, 2011.
[4] Huizhong Chen, Sam S. Tsai, Georg Schroth, David M. Chen, Radek Grzeszczuk
and Bernd Girod, “Robust Text Detection In Natural Images With Edge-Enhanced
Maximally Stable Ex-tremal Regions”, 18th IEEE international conference on Image
Processing, pp.2609-2612, 2011.
[5] Thuy Ho, Ngoc Ly, “A Scene Text-Based Image Retrieval System”, IEEE
international symposium on Signal Processing and Information Tech., pp. 79-84,
2012.
[6] L. Neumann and J. Matas, “A method for text localization and recognition in real-
world images,” in ACCV 2010, ser.LNCS 6495, vol. IV, November 2010, pp. 2067–
2078.
MSBECL Page 74
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
[9] Xiaobing Wang, Yonghang Song, Yuanlin Zhang, “Natural scene text detection in
multi-channel connected component segmentation”, 12th International conf. on
Document Analysis and Recognition, pp. 1375-1379, 2013.
[10] Chucai Yi, Yingli Tian, “Text extraction from scene images by character
appearance and structure modeling”, Elsevier journal on Computer Vision and Image
Understanding, 2013,pp. 182-194
[12] Trung Quy Phan, Palaiahnakote Shivakumara, Souvik Bhowmick, Shimiao Li,
Chew Lim Tan,Umapada Pal, “Semiautomatic Ground Truth Generation for Text
Detection and Recognition in Video Images”, IEEE Trans. Circuits And Systems For
Video Technology, VOL. 24, NO. 8, 2014
[13] Rong-Chi Chang, “Intelligent Text Detection and Extraction from Natural Scene
th
Images”, 15 North- East Asia Symposium on Nano, Information tech. and reliability,
pp.23-28, 2011.
[14] Yao Li, Huchuan Lu, “Scene Text Detection via Stroke Width”, 21st International
Conference on Pattern Recognition, 2012. pp.681-684.
[15] Jonathan Fabrizio, Beatriz Marcotegui, Matthieu Cord , “Text detection in street
level images”,Journal on Pattern analysis applications, 2013, Vol 16, Issue 4, pp. 519-
533.
MSBECL Page 75
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
[17] Anand Mishra, Karteek Alahari, C.V. Jawahar, “An MRF Model for Binarization
of Natural Scene Text”, IEEE conf on Document Analysis and Recognition,
2011,pp.11-16.
[19] Cunzhao Shi, Chunheng Wang, Baihua Xiao, Yang Zhang, Song Gao, “Scene
text detection using graph model built upon maximally stable extremal region”. vol
34, issue 2, 2013, page no. 107-116.
[21] B.H.Shekar, Smitha M.L., “Skeleton Matching based approach for Text
Localization in Scene Images”, arXiv:1502.03913v1, 2015, pp. 1-12.
[22] Xu-Cheng Yin, Wei-Yi Pei, Jun Zhang, and Hong-Wei Hao, “Multi-Orientation
Scene Text Detection with Adaptive Clustering”, IEEE Transactions On Pattern
Analysis And Machine Intelligence, VOL. 37, NO. 9, 2015, pp. 1930-1937.
[23] Xiaobing Wang, Yonghong Song, Yuanlin Zhang, Jingmin Xin, “Natural scene
text detection with multi-layersegmentation and higher order conditional random field
based analysis”, Else-vier publication on Pattern Recognition Letters 6061, 2015, pp.
41-47.
[24] Rodrigo Minetto , Nicolas Thome , Matthieu Cord, Neucimar J. Leite , Jorge
Stolfi, “Snooper-Text: A text detection system for automatic indexing of urban
scenes”, Elsevier publication on Computer Vision and Image Understanding 122,
2014, pp. 92-104.
[25] Runmin Wang, Nong Sang, “Changxin Gao, Text detection approach based on
confidence map and context information”, Elsevier publication on Neurocomputing,
MSBECL Page 76
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
[26] Bowornrat Sriman, Lambert Schomaker, “Object Attention Patches for Text
Detection and Recognition in Scene Images using SIFT”, ICPRAM 2015.
MSBECL Page 77
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
Paper Published
MSBECL Page 78
ROBUST TEXT DETECTION IN NATURAL SCENE IMAGES USING MSER APPROACH
MSBECL Page 79