Lab RNN Intro


Machine Learning

- Intro to Recurrent Neural Networks -


RNN Tasks

2/18
RNN Tasks

Vanilla RNNs

Source: CS231n Lecture 10

3/18
RNN Tasks

e.g. Image Captioning


Image → sequence of words
Source: CS231n Lecture 10

4/18
RNN Tasks

e.g. Sentiment Classification


Sequence of words → sentiment
Source: CS231n Lecture 10

5/18
RNN Tasks

e.g. Translation
Sequence of words → sequence of words
Source: CS231n Lecture 10

6/18
RNN Tasks

e.g. Video classification on frame level


Sequence of frames → label per frame
Source: CS231n Lecture 10

7/18
RNN Model

8/18
Vanilla RNN Model

x^(t): input, h^(t): hidden state, y^(t): output, with weights w_ih, w_hh, w_ho


Current state depends on current inputs and previous state

RNNs can yield outputs at each time step

h^(t) = f_whh(h^(t−1), f_wih(x^(t)))

y^(t) = f_who(h^(t)), ∀ t ∈ {1, ..., τ}
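As a rough illustration, a minimal NumPy sketch of this recurrence, assuming the common parameterization with f = tanh for the hidden update and a linear output map (the weight names mirror the diagram; the function itself is not from the slides):

```python
import numpy as np

# Sketch of the vanilla RNN recurrence above (assumed tanh hidden update,
# linear output map). Weight names mirror w_ih, w_hh, w_ho from the diagram.
def rnn_forward(xs, h0, W_ih, W_hh, W_ho, b_h, b_o):
    """xs: sequence of input vectors x^(t); h0: initial hidden state."""
    h = h0
    ys = []
    for x in xs:
        h = np.tanh(W_hh @ h + W_ih @ x + b_h)   # h^(t) from h^(t-1) and x^(t)
        ys.append(W_ho @ h + b_o)                # y^(t) from h^(t)
    return ys, h
```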

9/18
Unfolding RNN in time

Source: NN Lectures, Tudor Berariu, 2016

10/18
Unfolding RNN in time

Source: NN Lectures, Tudor Berariu, 2016

11/18
Unfolding RNN in time

Source: NN Lectures, Tudor Berariu, 2016

12/18
Backpropagation through time

Forward through the entire sequence to compute the loss, then backward through the entire sequence to compute the gradient.

Source: CS231n Lecture 10 (Fei-Fei Li, Ranjay Krishna, Danfei Xu, April 29, 2021)
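A minimal PyTorch sketch of full-sequence BPTT, assuming a toy per-step classification setup (model, data, and sizes are illustrative, not from the slides):

```python
import torch
import torch.nn as nn

# Full BPTT sketch: run the whole sequence forward, sum the per-step losses,
# then call backward() once so gradients flow through every time step.
rnn = nn.RNN(input_size=8, hidden_size=16, batch_first=True)
head = nn.Linear(16, 4)
criterion = nn.CrossEntropyLoss()
optimizer = torch.optim.Adam(list(rnn.parameters()) + list(head.parameters()))

x = torch.randn(2, 100, 8)               # dummy data (batch, time, features)
targets = torch.randint(0, 4, (2, 100))  # one label per time step

out, _ = rnn(x)                          # forward through the entire sequence
loss = criterion(head(out).reshape(-1, 4), targets.reshape(-1))
optimizer.zero_grad()
loss.backward()                          # backward through the entire sequence
optimizer.step()
```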


Truncated Backpropagation through time

Run forward and backward through chunks of the sequence instead of the whole sequence.

Source: CS231n Lecture 10


Truncated Backpropagation through time

Carry hidden states forward in time forever, but only backpropagate for some smaller number of steps.

Source: CS231n Lecture 10


Truncated Backpropagation through time

Source: CS231n Lecture 10


Truncated BPTT


Used in practice

Summary of the algorithm (see the sketch below):
– Present a sequence of k1 timesteps of input and output pairs to the network.
– Unroll the network, then calculate and accumulate errors across k2 timesteps.
– Roll up the network and update the weights.
– Repeat.
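A minimal PyTorch sketch of this loop, assuming a toy setup where k1 = k2 = k (all names and sizes are illustrative):

```python
import torch
import torch.nn as nn

# Truncated BPTT sketch: the hidden state is carried forward across chunks,
# but detached so gradients only flow back through the current k steps.
rnn = nn.RNN(input_size=8, hidden_size=16, batch_first=True)
head = nn.Linear(16, 4)
criterion = nn.CrossEntropyLoss()
optimizer = torch.optim.Adam(list(rnn.parameters()) + list(head.parameters()))

x = torch.randn(2, 1000, 8)               # long sequence (batch, time, features)
targets = torch.randint(0, 4, (2, 1000))
k = 50                                    # truncation length
h = None

for start in range(0, x.size(1), k):
    chunk = x[:, start:start + k]
    out, h = rnn(chunk, h)                # carry the hidden state forward
    loss = criterion(head(out).reshape(-1, 4),
                     targets[:, start:start + k].reshape(-1))
    optimizer.zero_grad()
    loss.backward()                       # backprop only through this chunk
    optimizer.step()
    h = h.detach()                        # cut the graph at the chunk boundary
```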

13/18
Teacher Forcing and Warm-start


When training an RNN to generate a sequence, the predictions (outputs y^(t)) of an RNN cell are often used as the input of the cell at the next timestep

Teacher Forcing: at training time, use the targets of the sequence, instead of the RNN's predictions, as inputs to the next step


Warm-start: when using an RNN to predict the next value conditioned on previous predictions, it is sometimes necessary to give the RNN some context (known ground-truth elements) before letting it predict on its own (see the sketch below)
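A minimal sketch of both ideas using an assumed nn.RNNCell regression setup (all names, sizes, and the loss are illustrative, not from the slides):

```python
import torch
import torch.nn as nn

# Assumed toy setup: 4-dimensional inputs/outputs, 16-dimensional hidden state.
cell = nn.RNNCell(input_size=4, hidden_size=16)
head = nn.Linear(16, 4)

def train_step_teacher_forcing(first_input, targets, h):
    """Teacher forcing: at each step, feed the ground-truth target as the next input."""
    loss = 0.0
    x = first_input                        # (batch, 4)
    for t in range(len(targets)):
        h = cell(x, h)
        pred = head(h)
        loss = loss + ((pred - targets[t]) ** 2).mean()
        x = targets[t]                     # next input = ground truth, not prediction
    return loss, h

def generate(warmup, steps, h):
    """Warm-start: feed known context first, then let the RNN predict on its own."""
    for x in warmup:                       # ground-truth context elements
        h = cell(x, h)
    preds = []
    x = head(h)                            # first free-running prediction
    for _ in range(steps):
        preds.append(x)
        h = cell(x, h)                     # feed previous prediction back in
        x = head(h)
    return preds
```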

14/18
LSTM

15/18
LSTM Cell

Img source: https://medium.com/@kangeugine/


Input Gate (i in (0, 1) – sigmoid) – scales input to cell (write)

Output Gate (o in (0, 1) – sigmoid) – scales output from cell (read)

Forget Gate (f in (0, 1) – sigmoid) – scales old cell values (reset mem)

16/18
LSTM Cell - Equations

i_t = σ(θ_xi x^(t) + θ_hi h^(t−1) + b_i)

f_t = σ(θ_xf x^(t) + θ_hf h^(t−1) + b_f)

o_t = σ(θ_xo x^(t) + θ_ho h^(t−1) + b_o)

g_t = tanh(θ_xg x^(t) + θ_hg h^(t−1) + b_g)

c_t = f_t ⊙ c_(t−1) + i_t ⊙ g_t
h_t = o_t ⊙ tanh(c_t), where ⊙ is elementwise multiplication
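A minimal NumPy sketch of one cell step following these equations (the parameter dictionary `theta` and its key names are an assumed layout, not from the slides):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x_t, h_prev, c_prev, theta):
    """One LSTM cell step; theta holds the weight matrices and bias vectors."""
    i = sigmoid(theta['xi'] @ x_t + theta['hi'] @ h_prev + theta['b_i'])  # input gate
    f = sigmoid(theta['xf'] @ x_t + theta['hf'] @ h_prev + theta['b_f'])  # forget gate
    o = sigmoid(theta['xo'] @ x_t + theta['ho'] @ h_prev + theta['b_o'])  # output gate
    g = np.tanh(theta['xg'] @ x_t + theta['hg'] @ h_prev + theta['b_g'])  # candidate values
    c = f * c_prev + i * g        # elementwise: forget old memory, write new
    h = o * np.tanh(c)            # gated read of the cell state
    return h, c
```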

17/18
LSTMs in practice


Sutskever et al, Sequence to Sequence Learning with Neural Networks, NIPS 2014
– Models are huge :-)
– 4 layers, 1000 LSTM cells per layer
– Input vocabulary of 160k
– Output vocabulary of 80k
– 1000 dimensional word embeddings

18/18
