Lan Gauge
Lan Gauge
Lan Gauge
Learning?
• Machine Learning
• Study of algorithms that improve their performance
at some task with experience
• Optimize a performance criterion using example data or
experience.
• Role of Statistics: Inference from a sample
• Role of Computer science: Efficient algorithms to
• Solve the optimization problem
• Representing and evaluating the model for inference 1
Machine learning is a field which
focuses on the use of data and
algorithms to imitate the way that
humans learn, gradually improving its
accuracy.
Features
• A feature is one column of the data in your
input set.
Features Label
• A model is the relationship between features and the
Model label.
ML
Data ML Model
Algorithm
Steps in ML
7.
Make
6. predicti
5. Parameter ons
4. Train Evaluate tuning
the the model
3. Choose
a ML model
2. Data
1. Data preparati model
collecti on
on
Approaches to Modelling
Techniques
in ML
The main machine learning techniques include regression,
classification, clustering, decision tree, neural networks, and anomaly
detection.
Regression. (The first machine learning technique uses input data to predict
numerical value)
Classification
Clustering
Decision Tree
Neural Networks
Anomaly Detection
3 Essensitals
Data
• Quantity: Increasing the data used to train the model can improve its accuracy.
• Quality: Using accurate and high-quality data can improve the model’s overall
performance.
• Preprocessing: Cleaning and preprocessing data to handle missing values and
outliers can improve accuracy.
• Augmentation: Expanding the size of real data using data augmentation
techniques can help with smaller datasets and models that experience
overfitting.
Model
• Selection: Experimenting with different model selection techniques to find the best
model for the data.
• Tuning: Adjusting hyperparameters to optimize model performance.
• Ensemble: Combining multiple models for better performance.
Validation
• Cross validation: Testing the accuracy of the model on multiple and diverse subsets
of data.
Classifier:
A classifier in machine learning is an algorithm that automatically
orders or categorizes data into one or more of a set of “classes.” One
of the most common examples is an email classifier that scans emails
to filter them by class label: Spam or Not Spam
Regressior