Autonomous Bot Using Machine Learning and Computer Vision: SN Computer Science July 2021
ORIGINAL RESEARCH
Abstract
Self-driving vehicles have the potential to revolutionize urban mobility by providing sustainable, safe, and convenient
transportation. In recent years, several companies have identified automation as a major area of research and are investing substantial financial resources in automating vehicles. Autonomous vehicles are now close to being capable of transporting us to our destinations without the aid of a driver. The current focus is on making vehicles more automated to provide a better driving experience. These vehicles are designed to drive with little or no human assistance by sensing their environment, which is achieved by combining sensors with data processing based on computer vision technology and machine learning.
Vehicle automation needs to be carried out with care, keeping in mind the challenges that can be faced during the process. Recognizing traffic signals, understanding signs, and identifying lane markings are some of the basic functions that the vehicle needs to perform. After gathering all this information, the next task is to follow the predefined protocols without any fault.
without any fault. This problem can be solved stepwise using some functions from image processing and computer vision
technology such as Haar transform, perspective mapping, perspective transformation, canny edge detection, and histogram
equalization. This solution is further enhanced by including machine learning, which improves performance with experience,
making it more reliable. It should be noted that, although the vehicles promoted by companies claim roughly 80% reliability, we are not yet ready to adopt the idea of automated vehicles completely. This paper therefore focuses on the shortcomings of the current approach and on making it reliable enough to pave the way for immediate implementation. In this paper the authors have used a microcontroller and a microprocessor: an Arduino Uno serves as the microcontroller and a Raspberry Pi B+ as the microprocessor. To detect the lanes, the authors have used image processing with the OpenCV library. To detect the traffic signs, the authors have used a supervised machine learning technique: images are captured with a Raspberry Pi camera (version 2), and cascade training is used to classify the positive images against the negative images.
exploring the concepts and effects of some basic functionalities in machine learning, computer vision, and other such fields. The final vision is to ideally accomplish all the necessary tasks that a basic self-driving vehicle needs to perform. Ideally here refers to a methodology that is uncomplicated to understand, easy to modify, and open to improvisation. The report details how computer vision technology, together with some image processing functions and machine learning, helps to study the environment of the vehicle and enables the vehicle to find a path and travel through it in the prescribed way.

Literature Survey

The process of automating vehicles is carried out by various methods like sensor fusion, computer vision, path planning, actuation, deep learning, and localization [1]. Computer vision deals with the process of making computers acquire a high level of understanding from digital images [2]. Sensor fusion deals with the process of combining sensors and analyzing the obtained sensory data as a combined result of two or more sensors, which yields a better understanding of the environment under observation [3]. Deep learning can be seen as a wider family of machine learning that includes various types of data representation, unlike task-specific algorithms [4, 5]. Path planning is a primitive step that identifies the path through which the vehicle is allowed to pass; efficient path planning can be done by plotting the shortest path between two points [6]. An actuator helps in moving or controlling the vehicle [7]. Navigation is the vehicle's capability to determine its position within its frame of reference and to plan the most effective path towards the destination. To navigate in its environment, the vehicle requires a representation of its surroundings, i.e. a map of the environment, and the capability to interpret that representation. Edge detection comprises a set of mathematical equations that identify the points within a digital image where there is a rapid change, i.e. where the image brightness has discontinuities [8]. Since the usual color of the road is black, which is the least-intensity color, and that of the lane markings is either white or yellow, both of which fall in the region of high-intensity colors, it is easy to differentiate the two regions, making the task of identifying the region of interest easier. The points where the image brightness changes sharply are grouped together and stored as a set of curved line segments [9, 10]. These line segments compose the edges of the region of interest [11]. Edge detection is a fundamental tool in image processing, machine learning, and computer vision, after which further functions are performed on the image [12].

The process of detection is done in three simple steps, since all traffic signals are designed in a common way so as to be easily understood by everyone. Since all three signal colors (red, yellow, and green) have high contrast ratios, this feature itself is used to separate the traffic signal from the rest of the objects in the image frame [13]. This separates the region of interest from the rest of the surroundings. Then, to identify the colors individually, their RGB pixel values are considered and the colors are classified [14]. For more precise performance, this process is carried out in six steps: taking an input frame from the video, color filtering, edge detection, contour detection, detecting the bounding rectangles of the contours, and saving candidate images for recognition. The data is then sent to the processor, which performs data set exploration, and the required action is taken [15].
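A minimal Python/OpenCV sketch of this six-step candidate-extraction pipeline is given below. The HSV color bounds, Canny thresholds, and minimum-area filter are illustrative assumptions, not values from the surveyed work.

```python
import cv2
import numpy as np

# Hypothetical HSV bounds for a red signal lamp (tune per camera/lighting).
LOWER_RED = np.array([0, 120, 120])
UPPER_RED = np.array([10, 255, 255])

def extract_candidates(frame):
    """Steps 1-6: input frame, color filter, edges, contours, boxes, save."""
    hsv = cv2.cvtColor(frame, cv2.COLOR_BGR2HSV)              # 1. input frame
    mask = cv2.inRange(hsv, LOWER_RED, UPPER_RED)             # 2. color filtering
    edges = cv2.Canny(mask, 50, 150)                          # 3. edge detection
    contours, _ = cv2.findContours(edges, cv2.RETR_EXTERNAL,
                                   cv2.CHAIN_APPROX_SIMPLE)   # 4. contour detection
    candidates = []
    for i, cnt in enumerate(contours):
        x, y, w, h = cv2.boundingRect(cnt)                    # 5. bounding rectangle
        if w * h > 100:                                       # ignore tiny regions
            cv2.imwrite(f"candidate_{i}.png", frame[y:y+h, x:x+w])  # 6. save
            candidates.append((x, y, w, h))
    return candidates
```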
The process of detection is followed by the process of recognition, which involves recognizing the various regions of interest on which the functions need to be performed. The first step in recognition is data set exploration. The data set used for training is GTSRB: approximately 1000 images are taken for each class, from different perspectives and in different sizes. Twenty percent of the training data set is held out for the validation process. The data set size is then increased artificially by a method called augmentation: random images are chosen from the existing images and random rotations and translations are performed on them [16]. The transformed set of pixels is then added to the original set of pixels. A sketch of this step follows.
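The following function applies one random rotation and translation to a training image, as the augmentation step above describes; the parameter ranges are assumptions for illustration.

```python
import cv2
import numpy as np

def random_augment(img, max_shift=5, max_angle=15):
    """Return a randomly rotated and translated copy of a training image."""
    h, w = img.shape[:2]
    angle = np.random.uniform(-max_angle, max_angle)          # rotation in degrees
    tx, ty = np.random.uniform(-max_shift, max_shift, size=2) # pixel shifts
    # Rotate about the image center, then add a small translation.
    m = cv2.getRotationMatrix2D((w / 2, h / 2), angle, 1.0)
    m[:, 2] += (tx, ty)
    return cv2.warpAffine(img, m, (w, h))
```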
The next step is training and model performance. Stochastic gradient descent is used as the optimizer; other optimizers could also be used to increase performance, since the work does not focus only on the optimizer. Another important performance factor is batch-size tuning, because small batch sizes result in slow convergence whereas large batch sizes cause memory problems; moderate batch sizes are therefore usually preferred. To conclude, the paper includes two main phases: detection and recognition [17]. First, the sign is detected from the real-time video stream using a CNN model. The detected sign is classified with an accuracy of 97.42%. However, when the video obtained by the RC car is streamed online, the accuracy instantly decreases to 87.36%. The reason behind this rapid fall is the low sensitivity of the color filtering method to lighting and other objects. Based on these results, the classic image processing methods are eliminated and recurrent neural networks are used for both the detection and recognition phases; the result then covers each object in the whole picture [18]. In this way, the decrease in performance can be counteracted. The theory of neural networks, autonomous vehicles, and the process by which a prototype with a camera as its only input can be used to design, test, and evaluate algorithm capabilities are covered in [19, 20]. The ANN is an efficient algorithm that helps in recognizing
We have used a Raspberry Pi as our microprocessor. To perform image processing on the objects we have used an open platform called OpenCV; there are many other platforms for image processing, but we have used OpenCV as it is an open-source platform. We have used the Pi camera (version 2) to capture video, because we perform processing at an image resolution of 480 × 360, which the Pi camera fully supports while being cheaper than other cameras. After capturing the required frame, we first convert the image to grayscale; before that, we have to change the image from the Raspberry Pi's default BGR format to RGB format. To detect the lanes, we then apply a perspective warp to the image.

To apply the perspective warp, we first create a region of interest around the working region. A perspective transform is then taken over that region to obtain a bird's-eye view of the image. A fresh frame of the same image is taken and the Canny edge detection algorithm is applied to it. Let f(x, y) denote the input image and G(x, y) the Gaussian function. By convolving G and f we form a smoothed image, denoted f_s. This is followed by computing the gradient magnitude and direction at every point, which estimate the edge strength and orientation and are together called the edge gradient. The equations of the process involved in Canny edge detection are given in Eqs. (1)–(7):

G(x, y) = e^{-\frac{x^2 + y^2}{2\sigma^2}},  (1)

f_s = G(x, y) \otimes f(x, y),  (2)

G_x = \frac{\partial f_s}{\partial x},  (3)

G_y = \frac{\partial f_s}{\partial y},  (4)

\text{Edge Gradient } (G) = \sqrt{G_x^2 + G_y^2},  (5)

\text{Angle } (\theta) = \tan^{-1}\!\left(\frac{G_y}{G_x}\right),  (6)

\text{Edge Gradient} = |G_x| + |G_y|.  (7)

Thresholding of the image is then done, and the result is added to the Canny edge-detected output. Thresholding is done to extract or enhance the image: one way to extract an object from an image is to separate the object and the background by using a threshold. Any point (x, y) at which f(x, y) > T is called an object point; otherwise it is called a background point. Equation (8) gives the mathematical form of the process:

t(x, y) = \begin{cases} 1, & \text{if } f(x, y) > 0.5 \\ 0, & \text{if } f(x, y) \le 0.5. \end{cases}  (8)

As our image is a grayscale image, we set T = 0.5. The warped image is now added to this frame and the lanes can be detected accurately. The actual positions of the lanes are found by dividing the region of interest equally and finding the maximum intensity level of each element. The array is then divided into two parts to detect the left and right parts of the lane; the array elements having the maximum intensity correspond to the lane positions. After finding the positions of the lanes, their midpoint is taken. Taking the center of the camera frame as a reference, the bot has to adjust its position: if the value of the distance is negative, the bot has to move left. The magnitude of the turn depends on the distance from the lane center.
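A compact Python/OpenCV sketch of this lane-detection pipeline, assuming the 480 × 360 resolution used above, is shown below; the region-of-interest corner points, Canny thresholds, and the 8-bit binary threshold of 127 (the analogue of T = 0.5) are illustrative assumptions rather than the authors' exact parameters.

```python
import cv2
import numpy as np

FRAME_W, FRAME_H = 480, 360  # processing resolution used in the paper

# Hypothetical region-of-interest corners and their bird's-eye destinations.
SRC = np.float32([[100, 220], [380, 220], [0, 340], [480, 340]])
DST = np.float32([[0, 0], [480, 0], [0, 360], [480, 360]])
M = cv2.getPerspectiveTransform(SRC, DST)

def lane_offset(frame):
    """Signed distance of the detected lane center from the frame center."""
    gray = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)
    warped = cv2.warpPerspective(gray, M, (FRAME_W, FRAME_H))  # bird's-eye view
    edges = cv2.Canny(warped, 50, 150)                         # Eqs. (1)-(7)
    _, thresh = cv2.threshold(warped, 127, 255, cv2.THRESH_BINARY)  # Eq. (8)
    combined = cv2.bitwise_or(edges, thresh)   # add threshold and edge outputs
    # Column-wise intensity sums over the lower half of the image.
    hist = np.sum(combined[FRAME_H // 2:, :], axis=0)
    mid = FRAME_W // 2
    left_lane = int(np.argmax(hist[:mid]))           # left-lane column
    right_lane = mid + int(np.argmax(hist[mid:]))    # right-lane column
    lane_center = (left_lane + right_lane) // 2
    return lane_center - mid  # negative -> steer left, positive -> steer right
```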
Master/Slave Communication

Here, parallel communication is set up between the microcontroller and the Raspberry Pi using the GPIO pins of the Raspberry Pi and four digital pins of the microcontroller. Conditions are applied for different distances between the frame center and the lane center; depending on the condition, the bot is moved left or right towards the frame center (Figs. 2, 3).

Fig. 2 Master–slave communication setup
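The paper does not spell out the pin mapping or the command encoding, so the following Raspberry Pi (master) sketch is an assumption: it drives four GPIO pins as a 4-bit parallel command bus that the Arduino reads on its four digital pins.

```python
import RPi.GPIO as GPIO

# Four GPIO pins wired to four Arduino digital inputs (pin numbers assumed).
COMMAND_PINS = [17, 27, 22, 23]

GPIO.setmode(GPIO.BCM)
for pin in COMMAND_PINS:
    GPIO.setup(pin, GPIO.OUT, initial=GPIO.LOW)

def send_command(offset):
    """Encode the steering decision as a 4-bit code on the parallel bus."""
    if offset == 0:
        code = 0b0001                              # forward
    elif offset < 0:
        code = 0b0010 if offset > -20 else 0b0011  # slight / hard left
    else:
        code = 0b0100 if offset < 20 else 0b0101   # slight / hard right
    for i, pin in enumerate(COMMAND_PINS):
        GPIO.output(pin, (code >> i) & 1)
```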
Machine Learning

To detect the traffic signals, obstacles, and traffic signs, labelled machine learning is used, i.e. the data set used by the authors is labelled, and these labelled images are compared with the real-time images taken by the camera. To classify the images we need a machine learning model, and to implement one we require a data set sufficient for the model to discriminate between the images. The authors take 400 samples of the object to be detected, called the positive images, and 300 negative images, i.e. regions that do not belong to the object to be detected. Histogram equalization is applied to all the images after converting each RGB image to a grayscale image. If we consider continuous intensity values and let r be the intensities of the image to
be processed, we focus attention on intensity mappings of the form s = T(r). The purpose of using histogram equalization is to distribute the gray values uniformly, by making the probability distribution function of the image intensity uniform. By creating an info file of the images, we store the exact location of the object in each image and also the number of objects in each image. The info file is created using the OpenCV integrated annotation tool. Using these images, a training system is developed to recognize the object. This training method involves cascading, after which an XML file of the learned model is created; this file has to be loaded into the program to apply the remaining operations. After creating the info file, cascade training is done on the images using opencv_traincascade. Given training examples (x1, y1), …, (xn, yn), where yi = 0, 1 for negative and positive examples respectively, the cascade training initializes the weights for each yi. For training examples from 0 to N, it normalizes the weights so that they form a probability distribution. For each feature, a classifier restricted to using a single feature is trained; the errors are evaluated, the classifier with the lowest error is selected, and the weights are updated. Finally, a classifier strong enough to discriminate between the two classes is built, as given by Eqs. (9)–(11), where h_t is a weak classifier and α and β are used for updating the weights; these values are chosen randomly:

h(x) = \begin{cases} 1, & \text{if } \sum_{t=1}^{N} \alpha_t h_t(x) \ge \frac{1}{2} \sum_{t=1}^{N} \alpha_t \\ 0, & \text{otherwise,} \end{cases}  (9)

\text{where } \alpha_t = \log \frac{1}{\beta_t}.  (10)

The best results of the cascade classifier consist of 38 stages.
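At runtime, the XML produced by cascade training can be loaded and applied with OpenCV's cascade detector. A minimal sketch follows; the file name and the detectMultiScale parameters are illustrative assumptions, and the preprocessing mirrors the grayscale conversion and histogram equalization applied to the training set.

```python
import cv2

# XML produced by opencv_traincascade (file name is an assumption).
sign_cascade = cv2.CascadeClassifier("stop_sign_cascade.xml")

def detect_signs(frame):
    """Return bounding boxes (x, y, w, h) of detected signs in a BGR frame."""
    gray = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)
    gray = cv2.equalizeHist(gray)  # same preprocessing as the training images
    return sign_cascade.detectMultiScale(gray, scaleFactor=1.1, minNeighbors=5)
```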
To train the detector with an image size of 240 × 240, a total of 30 min was taken. After detecting the image, the next step is to stop before the sign; for this, the distance from the bot to the detected sign must be known. The authors have used the Haar cascade transformation to implement the model. To find the distance we use a linear equation of the form y = mx + c, where the weight m and the intercept c are found manually. Once the sign is detected and the distance from it is obtained, a threshold is set to stop the bot when that distance is reached. The whole working flow of the system is shown in Fig. 4.
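A small sketch of this manually fitted linear distance model follows; the weight, intercept, and stopping threshold are hypothetical values, since the paper states only that they are found by hand.

```python
# Hypothetical hand-fitted parameters mapping detected sign width (pixels)
# to distance; the paper fits its weight and intercept manually.
M_WEIGHT = -0.8
C_INTERCEPT = 120.0
STOP_DISTANCE = 20.0

def distance_to_sign(box_width_px):
    """Linear estimate y = m*x + c of the distance to a detected sign."""
    return M_WEIGHT * box_width_px + C_INTERCEPT

def should_stop(box_width_px):
    return distance_to_sign(box_width_px) <= STOP_DISTANCE
```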
Results and Discussion

The results obtained in this paper are as follows. The first result is lane detection: in Fig. 5 the lane has been detected and the lane center is at a distance of − 18 from the frame center, which indicates a left turn, while in Fig. 6 the value is zero, which is the condition for the forward direction.

The next part of our paper was to detect the signs at the sides of the roads; we have taken 500 negative images of the stop
Table 1 Evaluation table for different samples/sample sizes. Columns: Video_Samples; Number of positive samples; Number of negative samples; Accuracy; Precision; Recall. The bold numbers specify the highest accuracy for a particular number of positive and negative samples.
References

1. Ballard DH, Brown CM. Computer vision. 1st ed. Prentice Hall; 1982.
15. Hough PVC. Machine analysis of bubble chamber pictures. In: Proc. Int. Conf. High Energy Accelerators and Instrumentation; 1959.

16. Nunes E, Conci A, Sanchez A. Robust background subtraction on traffic videos. In: 2011 18th international conference on systems, signals and image processing (IWSSIP); 2011. pp. 1–4.

17. Lucas BD, Kanade T. An iterative image registration technique with an application to stereo vision. In: IJCAI81; 1981. pp. 674–679.

18. Pang CCC, Lam WWL, Yung NHC. A novel method for resolving vehicle occlusion in a monocular traffic-image sequence. IEEE Trans Intell Transp Syst. 2004;5:129–41.

19. Chiu C, Ku M, Wang C. Automatic traffic surveillance system for vision-based vehicle recognition and tracking. J Inf Sci Eng. 2010;26:611–29.

20. Gordon RL, Tighe W. Traffic control systems handbook. Washington, DC, USA: U.S. Department of Transportation Federal Highway Administration; 2005.

21. Hsieh J-W, Yu S-H, Chen Y-S, Hu W-F. Automatic traffic surveillance system for vehicle tracking and classification. IEEE Trans Intell Transp Syst. 2006;7(2):175–87.

22. Jung Y-K, Ho Y-S. Traffic parameter extraction using video based vehicle tracking. In: 1999 IEEE/IEEJ/JSAI international conference on intelligent transportation systems, proceedings; pp. 764–769.

23. Cheung S-CS, Kamath C. Robust background subtraction with foreground validation for urban traffic video. EURASIP J Appl Signal Process. 2005;2005:2330–40.

Publisher's Note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.