DOC-20241121-WA0194.

Download as pdf or txt
Download as pdf or txt
You are on page 1of 7

11/21/2024

Introduction to
COMPUTER VISION
Submitted by,
Dwaipayan Saha
Class: 10
Section: A
Roll: 04
Date: 23.11.2024

Table of Contents

1 What is Computer Vision?

2 Application of Computer Vision

3 Concepts of Computer Vision

4 Python open CV

5 What is convolutional Operator?

6 What is CNN?

1
11/21/2024

What is Computer
Vision?

Computer vision is a field of artificial


intelligence (AI) that uses machine learning
and neural networks to teach computers
and systems to derive meaningful
information from digital images, videos and
other visual inputs—and to make
recommendations or take actions when they
see defects or issues.

Applications of Computer Vision:


Certainly! Computer vision has a wide range of applications across
various industries. Here are some of the key ones:

● Autonomous Vehicles: Used for object detection, lane


detection, and navigation to enable self-driving cars to
understand their environment.
● Healthcare: Assists in medical imaging analysis for detecting
diseases, such as tumors in MRI scans or diabetic
retinopathy in eye images.
● Security and Surveillance: Enhances the capabilities of
security cameras for facial recognition, threat detection, and
monitoring activities.
● Retail: Powers cashier-less stores, product recognition, and
customer behavior analysis for better service and inventory
management.

2
11/21/2024

● Agriculture: Helps in monitoring crop health, detecting


pests, and assessing yield through aerial imagery and plant
disease identification.
● Manufacturing: Used in quality control to inspect
products on assembly lines for defects and ensure high
manufacturing standards.
● Augmented Reality (AR): Enables AR applications to
understand and interact with the real world by recognizing
objects and tracking movements.
● Robotics: Enhances the perception of robots to navigate
and manipulate objects within their environment more
effectively.

Sports Analytics: Analyzes player movements and strategies


through video footage to provide insights for performance
improvement.

Document Digitization: Converts handwritten or printed


text into digital formats, improving data accessibility and
processing.

Fashion: Enables virtual try-ons, style recommendations, and


trend analysis by analyzing images of clothing and accessories.

Environmental Monitoring: Tracks wildlife, monitors


deforestation, and assesses environmental changes using
satellite imagery.

3
11/21/2024

Concepts of Computer Vision :

Computer vision is a field of artificial intelligence that enables machines to


interpret and make decisions based on visual data from the world. Here are
some core concepts within computer vision:

● Image Processing: Involves enhancing and manipulating images to


extract useful information, such as filtering, edge detection, and image
segmentation.
● Feature Extraction: Identifies important characteristics or features
within an image, like corners, edges, textures, or specific objects.
● Object Detection: Locates and classifies objects within an image or
video. Common techniques include YOLO (You Only Look Once)
and SSD (Single Shot MultiBox Detector).
● Image Classification: Assigns a label to an entire image, recognizing
what is depicted in the image. Techniques often involve
Convolutional Neural Networks (CNNs).

● Semantic Segmentation: Divides an image into regions


and classifies each region pixel-by-pixel. This helps in
identifying and understanding the context of different
regions within the image.
● Instance Segmentation: Similar to semantic
segmentation, but distinguishes between different
instances of the same object type within an image.
● Object Tracking: Monitors objects as they move across
frames in a video sequence, often used in surveillance and
autonomous vehicles.
● Optical Flow: Measures the motion of objects between
consecutive frames of video. It helps in understanding
movement and predicting future positions of objects.

4
11/21/2024

● 3D Vision: Constructs three-dimensional


representations from two-dimensional images, useful
in applications like 3D modeling, AR/VR, and
robotics.
● Pattern Recognition: Identifies patterns and
regularities in data, which can be used for
recognizing faces, handwriting, and more.
● Deep Learning: Utilizes neural networks,
particularly Convolutional Neural Networks
(CNNs) and Recurrent Neural Networks (RNNs),
to analyze and interpret complex visual data.
● Augmented Reality (AR): Integrates virtual objects
into real-world environments, requiring real-time
recognition and tracking of physical spaces.

What is Open CV ?
OpenCV is an open-source computer
vision and machine learning software
library. Various image processing
operations such as manipulating
images and applying tons of filters can
be done with the help of it. It is
broadly used in Object detection, Face
Detection, and other Image processing
tasks.

5
11/21/2024

What is convolutional Operator?


The convolution of and is written , denoting the
operator with the symbol . It is defined as the integral of
the product of the two functions after one is reflected
about the y-axis and shifted. Convolution is one of the
most important operations in signal and image
processing. It could operate 1D ( e.g. speech processing),
2D ( e.g. image processing), 3D ( video processing )
Convolution is only used for image editing,
it is also use in speech processing and even in
Convolutional Neural Network ( CNN ) to extract
image features, which are commonly used in computer
vision applications.

What is CNN?
Convolutional Neural Networks are a popular deep
learning model that has achieved state-of-the-art
performance in many different tasks [130–133]. A
convolutional layer is the major building block of a CNN
architecture where the convolution operation is performed.

Convolutional Neural Networks (CNN) are a


type of deep-learning based artificial neural network that
are mainly used to process and analyse visual imagery.
CNN's deep learning algorithm takes visual inputs
(images), extracts and identifies various image features,
assigns importance to features and other aspects of the
image and finally identifies image distinctly by
differentiating it from other

6
11/21/2024

Uses Of CNN :
There are multiple benefits of using
CNNs. These can be used in various fields
and can perform major tasks like facial
recognition, emotion detection, analysing
documents, understanding climate, image
recognition, auto translation, object
identification and biometric
authentication etc. Deep learning has
helped enormously in advancement of
CNNs as it attains the benefits of
providing maximum performance and
efficiency.

Conclusion
Computer vision is an area of research and
development within artificial intelligence that seeks
to simulate human vision to algorithms by
processing categorized images until classification
patterns can be effectively established.

You might also like