CPSC540: Regularization, Nonlinear Prediction and Generalization


CPSC540

Regularization,
nonlinear prediction
and generalization

Nando de Freitas
January, 2013
University of British Columbia
Outline of the lecture
This lecture will teach you how to fit nonlinear functions by using
basis functions and how to control model complexity. The goal is for
you to:

• Learn how to derive ridge regression.
• Understand the trade-off between fitting the data and regularizing it.
• Learn polynomial regression.
• Understand that, if the basis functions are given, the problem of
  learning the parameters is still linear.
• Learn cross-validation.
• Understand the effects of the amount of data and the number of
  basis functions on generalization.
Regularization
Derivation
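The derivation on this slide is shown graphically; as a minimal sketch in LaTeX, assuming the same cost J(θ) that appears later in the lecture, setting the gradient to zero gives the familiar ridge estimate:

\begin{aligned}
J(\theta) &= (y - \Phi\theta)^\top (y - \Phi\theta) + \delta^2\,\theta^\top\theta \\
\nabla_\theta J(\theta) &= -2\,\Phi^\top(y - \Phi\theta) + 2\,\delta^2\theta = 0 \\
\hat{\theta}_{\mathrm{ridge}} &= (\Phi^\top\Phi + \delta^2 I)^{-1}\,\Phi^\top y
\end{aligned}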
Ridge regression as constrained optimization
Regularization paths
As δ increases, t(δ) decreases and each θi goes to zero.

[Figure: regularization paths, i.e. the coefficient profiles θ1, θ2, . . . , θ8 plotted against t(δ); from the Hastie, Tibshirani & Friedman book]

Going nonlinear via basis functions
We introduce basis functions φ(·) to deal with nonlinearity:

y(x) = φ(x)θ + ε

For example, φ(x) = [1, x, x²].

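Not from the slides, but as an illustration of this idea: a short numpy sketch that expands a scalar input into φ(x) = [1, x, x²] and fits θ by ordinary least squares. The function name and the toy data are my own.

import numpy as np

def poly_basis(x, degree=2):
    # Map a 1-D input array to the basis [1, x, x^2, ..., x^degree].
    return np.vander(x, N=degree + 1, increasing=True)

# Toy data from a noisy quadratic (illustrative values only).
rng = np.random.default_rng(0)
x = np.linspace(-1, 1, 30)
y = 1.0 + 2.0 * x - 3.0 * x**2 + 0.1 * rng.standard_normal(x.shape)

Phi = poly_basis(x, degree=2)                      # 30 x 3 design matrix
theta, *_ = np.linalg.lstsq(Phi, y, rcond=None)    # still linear in theta
print(theta)                                       # roughly [1, 2, -3]

Note that the nonlinearity lives entirely in φ; the parameters θ still enter linearly, so ordinary least squares (or ridge) applies unchanged.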

Going nonlinear via basis functions
y(x) = φ(x)θ + ε

φ(x) = [1, x1, x2]        φ(x) = [1, x1, x2, x1², x2²]


Example: Ridge regression with a polynomial of degree 14

y(xi) = 1 θ0 + xi θ1 + xi² θ2 + . . . + xi¹³ θ13 + xi¹⁴ θ14

The i-th row of the design matrix is Φi = [1  xi  xi²  . . .  xi¹³  xi¹⁴], and the cost is

J(θ) = (y − Φθ)ᵀ(y − Φθ) + δ² θᵀθ

[Figure: the fitted curve y plotted against x for small δ, medium δ, and large δ]
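A sketch of this example in numpy (my own illustration, not the lecture's code or data): fit the degree-14 polynomial with the closed-form ridge solution for several values of δ and watch the coefficient magnitudes shrink as δ grows.

import numpy as np

def ridge_fit(Phi, y, delta):
    # Closed-form ridge estimate: (Phi^T Phi + delta^2 I)^{-1} Phi^T y.
    d = Phi.shape[1]
    return np.linalg.solve(Phi.T @ Phi + delta**2 * np.eye(d), Phi.T @ y)

rng = np.random.default_rng(1)
x = np.linspace(0, 1, 21)
y = np.sin(2 * np.pi * x) + 0.2 * rng.standard_normal(x.shape)  # illustrative data

Phi = np.vander(x, N=15, increasing=True)          # columns 1, x, ..., x^14

for delta in (0.01, 1.0, 100.0):                   # small, medium, large
    theta = ridge_fit(Phi, y, delta)
    print(delta, float(np.abs(theta).max()))       # coefficients shrink as delta grows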
Kernel regression and RBFs
We can use kernels or radial basis functions (RBFs) as features:
φ(x) = [κ(x, µ1, λ), . . . , κ(x, µd, λ)],   e.g.   κ(x, µi, λ) = exp(−‖x − µi‖²/λ)

y(xi) = φ(xi)θ = 1 θ0 + κ(xi, µ1, λ) θ1 + . . . + κ(xi, µd, λ) θd


We can choose the locations µ of the basis functions to be the inputs,
that is, µi = xi. These basis functions are then known as kernels.
The choice of the width λ is tricky, as illustrated below.
[Figure: the kernels and the resulting fits for too small λ, the right λ, and too large λ]
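As a sketch (my own code and data, with hypothetical names such as rbf_features), build the RBF feature map with centres at the training inputs and fit θ by ridge, comparing a few widths λ:

import numpy as np

def rbf_features(x, centres, lam):
    # kappa(x, mu_i, lambda) = exp(-||x - mu_i||^2 / lambda), one column per centre.
    diffs = x[:, None] - centres[None, :]
    return np.exp(-(diffs**2) / lam)

rng = np.random.default_rng(2)
x = np.linspace(0, 1, 40)
y = np.cos(4 * x) + 0.1 * rng.standard_normal(x.shape)     # illustrative data

mu = x.copy()                 # centres at the training inputs: mu_i = x_i
delta = 0.1                   # fixed ridge regularizer

for lam in (1e-4, 1e-2, 1.0): # too small, moderate, too large width
    Phi = np.column_stack([np.ones_like(x), rbf_features(x, mu, lam)])
    A = Phi.T @ Phi + delta**2 * np.eye(Phi.shape[1])
    theta = np.linalg.solve(A, Phi.T @ y)
    print(lam, float(np.mean((Phi @ theta - y) ** 2)))      # training error for each width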
The big question is: how do we choose the regularization coefficient,
the width of the kernels, or the polynomial order?
One Solution: cross-validation
K-fold cross-validation

The idea is simple: we split the training data into K folds; then, for each
fold k ∈ {1, . . . , K}, we train on all the folds but the k’th, and test on the
k’th, in a round-robin fashion.

It is common to use K = 5; this is called 5-fold CV.

If we set K = N, then we get a method called leave-one-out
cross-validation, or LOOCV, since in fold i we train on all the data
cases except for i, and then test on i.
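A minimal numpy sketch of K-fold cross-validation for choosing δ in ridge regression (my own illustration; cv_error, the toy data, and the candidate grid of δ values are assumptions, not from the slides):

import numpy as np

def cv_error(Phi, y, delta, K=5):
    # Average held-out squared error of ridge regression over K folds.
    N = len(y)
    folds = np.array_split(np.random.default_rng(0).permutation(N), K)
    errors = []
    for k in range(K):
        test = folds[k]
        train = np.setdiff1d(np.arange(N), test)
        A = Phi[train].T @ Phi[train] + delta**2 * np.eye(Phi.shape[1])
        theta = np.linalg.solve(A, Phi[train].T @ y[train])
        errors.append(np.mean((Phi[test] @ theta - y[test]) ** 2))
    return float(np.mean(errors))

# Illustrative data and a degree-14 polynomial basis, as in the running example.
rng = np.random.default_rng(3)
x = np.linspace(0, 1, 50)
y = np.sin(2 * np.pi * x) + 0.2 * rng.standard_normal(x.shape)
Phi = np.vander(x, N=15, increasing=True)

deltas = (1e-3, 1e-2, 1e-1, 1.0, 10.0)
scores = [cv_error(Phi, y, d) for d in deltas]
print(deltas[int(np.argmin(scores))])   # delta with the lowest cross-validation error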
Example: Ridge regression with polynomial of degree 14
Effect of data when we have the right model
yi = θ0 + xi θ1 + xi² θ2 + N(0, σ²)
Effect of data when the model is too simple
yi = θ0 + xi θ1 + xi² θ2 + N(0, σ²)
Effect of data when the model is very complex
yi = θ0 + xi θ1 + xi² θ2 + N(0, σ²)
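These three slides are shown as plots; to reproduce their spirit, here is a sketch under my own choice of parameters: draw N points from the true quadratic model above and fit polynomials of degree 1 (too simple), 2 (right), and 14 (very complex) for increasing N, comparing held-out error.

import numpy as np

rng = np.random.default_rng(4)
theta_true = np.array([1.0, -2.0, 3.0])      # theta_0, theta_1, theta_2 (made-up values)
sigma = 0.5

def sample(N):
    # Draw N points from the true quadratic model with Gaussian noise.
    x = rng.uniform(-1, 1, N)
    y = np.vander(x, 3, increasing=True) @ theta_true + sigma * rng.standard_normal(N)
    return x, y

x_test, y_test = sample(1000)
for N in (15, 100, 1000):
    x, y = sample(N)
    for degree in (1, 2, 14):                # too simple, right, very complex
        Phi = np.vander(x, degree + 1, increasing=True)
        theta, *_ = np.linalg.lstsq(Phi, y, rcond=None)
        pred = np.vander(x_test, degree + 1, increasing=True) @ theta
        print(N, degree, float(np.mean((pred - y_test) ** 2)))   # held-out error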
Confidence in the predictions
Next lecture
In the next lecture, we introduce Bayesian inference, and show how it
can provide us with an alternative way of learning a model from data.
