DTEXP5
DTEXP5
DTEXP5
Experiment: 2.1
2. Tool Used:
Require the PyCaret libraries and Google Colab
𝐷 = {(𝑥𝑖𝑖, 𝑦𝑖𝑖)}𝑁
Where, D is called the training set, and N is the number of training examples.
In the simplest setting, each training input xi is a D-dimensional vector of numbers, representing, say, the
height and weight of a person. These are called features, attributes or covariates. In general, however, xi
could be a complex structured object, such as an image, a sentence, an email message, a time series, a
molecular shape, a graph, etc.
Similarly the form of the output or response variable can in principle be anything, but most methods
assume that yi is a categorical or nominal variable from some finite set, yi ∈ {1,...,C} (such as male or
female), or that yi is a real-valued scalar (such as income level). When yi is categorical, the problem is
known as classification or pattern recognition, and when yi is real- valued, the problem is known as
regression. Another variant, known as ordinal regression, occurs where label space Y has some natural
ordering, such as grades A–F.
University Institute of Engineering
Department of Computer Science & Engineering
The second main type of machine learning is the descriptive or unsupervised learning approach. Here we
are only given inputs, D = {xi}N i=1, and the goal is to find “interesting patterns” in the data. This is
sometimes called knowledge discovery. This is a much less well-defined problem, since we are not told
what kinds of patterns to look for, and there is no obvious error metric to use.
There is a third type of machine learning, known as reinforcement learning, which is somewhat less
commonly used. This is useful for learning how to act or behave when given occasional reward or
punishment signals.
4. Code:
!pip install pycaret &> /dev/null
print ("Pycaret installed sucessfully!!")
dataSets = get_data('index')
bostonDataSet = get_data("boston")
print(type(bostonDataSet))
cm = compare_models()
cm = compare_models()
cm = compare_models()
University Institute of Engineering
Department of Computer Science & Engineering
cm = compare_models()
# Select top 10 rows from boston dataset newDataSet = get_data("boston").iloc[:10] # Make prediction on
new dataset
2. Setting up Environment: How to setup an experiment in PyCaret and get started with building
regression models
3. Create Model: How to create a model, perform cross validation and evaluate regression metrics