The document contains a 9 question multiple choice quiz about natural language processing topics. The questions cover topics like tokenization, part-of-speech tagging, stemming algorithms, and more. Correct answers and short explanations are provided for each question.

Uploaded by: geetha megharaj

Original description: NPTEL NLP


Natural Language Processing

Assignment 1
Type of Questions: MCQ

Number of Questions: 9
Total Marks: 9

Question 1: How many tokens does the following sentence yield after
a) word-level tokenization by space, and b) character-level tokenization?

“All good things come to an end.”

1. 8, 25

2. 7, 31

3. 8, 31

4. 7, 25

Answer: 2
Solution: Character-level tokenization gives 31 tokens (each space and the final period count as tokens):
[‘A’, ‘l’, ‘l’, ‘ ’, ‘g’, ‘o’, ‘o’, ‘d’, ‘ ’, ‘t’,
‘h’, ‘i’, ‘n’, ‘g’, ‘s’, ‘ ’, ‘c’, ‘o’, ‘m’, ‘e’, ‘ ’,
‘t’, ‘o’, ‘ ’, ‘a’, ‘n’, ‘ ’, ‘e’, ‘n’, ‘d’, ‘.’]

Word-level tokenization by space gives 7 tokens (the period stays attached to ‘end.’):

[‘All’, ‘good’, ‘things’, ‘come’, ‘to’, ‘an’, ‘end.’]
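The two counts can be checked with a short Python sketch. It assumes that "tokenization by space" means splitting on single space characters and that character-level tokenization keeps spaces and punctuation as tokens; the variable names are illustrative only.

```python
sentence = "All good things come to an end."

# a) Word-level tokenization: split on single spaces only,
#    so the period stays attached to the last word.
word_tokens = sentence.split(" ")

# b) Character-level tokenization: every character, including
#    spaces and the final period, is its own token.
char_tokens = list(sentence)

print(len(word_tokens), len(char_tokens))  # 7 31
```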

Question 2: If we use the regular expression "\.[ ]+" (Python syntax) for sentence
tokenization, what problems may we face?

1. ‘.’ (dot) may be part of an abbreviation.

2. There may not be a space after the end of a sentence.

3. There may be more than one space after the end of a sentence.

4. A sentence may end with punctuation other than ‘.’ (dot).

Answer: 1, 2, 4
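Solution: The pattern matches a literal dot followed by one or more spaces. It therefore splits after abbreviations such as “Mr.” (option 1), misses a boundary that has no space after the dot (option 2), and misses sentences that end in ‘?’ or ‘!’ (option 4). Option 3 is not a problem, because the “+” quantifier already matches any number of spaces.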
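The behaviour of this pattern can be tried out with Python's standard `re` module. This is a minimal sketch: the helper name `split_sentences` and the example strings are made up for illustration and are not part of the assignment.

```python
import re

# The pattern from the question: a literal dot followed by one or more spaces.
SENTENCE_BOUNDARY = re.compile(r"\.[ ]+")

def split_sentences(text):
    """Split text wherever the pattern matches (the matched dot is dropped)."""
    return SENTENCE_BOUNDARY.split(text)

print(split_sentences("Mr. Bennet waited. He left."))
# ['Mr', 'Bennet waited', 'He left.']  -> wrong split after the abbreviation
print(split_sentences("He left.She stayed."))
# ['He left.She stayed.']  -> boundary without a space is missed
print(split_sentences("He left.   She stayed."))
# ['He left', 'She stayed.']  -> several spaces are handled by "+"
print(split_sentences("Really? Yes! Done."))
# ['Really? Yes! Done.']  -> '?' and '!' are not treated as boundaries
```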

Question 3: A text processing system found the following sentence in a document.
What are the most probable reasons for the two hyphens?

“With general-purpose computers becoming more and more power-ful, multimedia
devices like iPod or Walkman have become rare.”
1. Sententially determined hyphen, End-of-line hyphen
2. Sententially determined hyphen, Lexical hyphen
3. Sententially determined hyphen, Sententially determined hyphen
4. Lexical hyphen, Sententially determined hyphen
Answer: 1
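Solution: “general-purpose” is a compound modifier placed before the noun “computers”, so its hyphen is sententially determined. “Powerful” is a word without any hyphens, so the hyphen in “power-ful” is most likely an end-of-line hyphen left over from line breaking.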
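One simple way a system might flag end-of-line hyphens is to check whether the token with its hyphen removed is a known word. The sketch below is only a heuristic under that assumption: the function name `classify_hyphen` and the tiny `VOCAB` set are hypothetical, and a real system would use a full lexicon and would still need extra logic to tell lexical from sententially determined hyphens.

```python
# Hypothetical toy vocabulary; a real system would use a large word list.
VOCAB = {"powerful", "general", "purpose", "computers"}

def classify_hyphen(token, vocabulary):
    """Guess why a hyphen appears in a token.

    If removing the hyphen yields a known word, the hyphen was probably
    introduced by line breaking. Otherwise it is treated as a genuine
    (lexical or sententially determined) hyphen.
    """
    joined = token.replace("-", "").lower()
    if joined in vocabulary:
        return "end-of-line"
    return "lexical or sententially determined"

print(classify_hyphen("power-ful", VOCAB))        # end-of-line
print(classify_hyphen("general-purpose", VOCAB))  # lexical or sententially determined
```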

Question 4: Consider the imaginary words “starking” and “ylding”. If we pass them
through the Porter stemmer, what would the outputs be?
1. stark, yld
2. star, yld
3. starking, ylding
4. stark, ylding
Answer: 4
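Solution: Step 1b of the Porter stemmer removes “ing” only if the remaining stem contains a vowel. “stark” contains the vowel “a”, so “starking” becomes “stark”. In “yld”, the initial “y” is not preceded by a consonant and therefore counts as a consonant, so the stem has no vowel and “ylding” is left unchanged.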
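The relevant rule is "(*v*) ING" from Step 1b of the Porter stemmer, which removes "ing" only when the remaining stem contains a vowel. The sketch below implements just that one rule, using Porter's definition that "y" is a vowel only when preceded by a consonant. It is not a full stemmer: the function names are illustrative, and the clean-up that follows the rule in the real algorithm (for example undoubling "hopp" to "hop") is deliberately omitted.

```python
VOWELS = set("aeiou")

def is_consonant(word, i):
    """Porter's definition: a, e, i, o, u are vowels; 'y' is a vowel
    only when the preceding letter is a consonant."""
    ch = word[i]
    if ch in VOWELS:
        return False
    if ch == "y":
        return i == 0 or not is_consonant(word, i - 1)
    return True

def contains_vowel(stem):
    return any(not is_consonant(stem, i) for i in range(len(stem)))

def strip_ing(word):
    """Apply only the '(*v*) ING' rule from Step 1b."""
    if word.endswith("ing"):
        stem = word[:-3]
        if contains_vowel(stem):
            return stem
    return word

print(strip_ing("starking"))  # stark
print(strip_ing("ylding"))    # ylding
```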

Question 5: What is the order relation between the frequencies of any function word wf
and any content word wc?

1. freq(wf) > freq(wc)

2. freq(wf) < freq(wc)

3. freq(wf) ≈ freq(wc)

4. Not comparable

Answer: 1
Solution: Function words belong to a closed set of words and are limited in number.
Thus any function word is generally more frequent in a text than any content word.

Question 6: Consider the following text:

Mr. Bennet was among the earliest of those who waited on Mr. Bingley.
He had always intended to visit him, though to the last always assuring
his wife that he should not go; and till the evening after the visit was paid
she had no knowledge of it.

Find the running-average TTR for a window of length 40. (Assume that the text is
tokenized by spaces only.)

1. 0.7917

2. 0.8389

3. 1.2632

4. 0.7533

Answer: 2
Solution: Splitting on spaces gives 48 tokens, so there are 9 windows of length 40: [1, 40], [2, 41], ..., [9, 48].
Types are compared as exact strings, so, for example, “He” and “he” are different types. The TTR values of the windows are:
[0.825, 0.825, 0.825, 0.85, 0.825, 0.85, 0.85, 0.85, 0.85]
The average is 0.8389 (rounded to 4 decimal places).
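The calculation can be reproduced with a short Python sketch. It assumes the passage is joined into one line with single spaces, that tokens are obtained by splitting on spaces, and that types are compared as exact, case-sensitive strings; the function name `running_average_ttr` is illustrative.

```python
text = (
    "Mr. Bennet was among the earliest of those who waited on Mr. Bingley. "
    "He had always intended to visit him, though to the last always assuring "
    "his wife that he should not go; and till the evening after the visit was paid "
    "she had no knowledge of it."
)

def running_average_ttr(tokens, window):
    """Average the type-token ratio over every sliding window of the given length."""
    if len(tokens) < window:
        raise ValueError("text is shorter than the window")
    ratios = [
        len(set(tokens[start:start + window])) / window
        for start in range(len(tokens) - window + 1)
    ]
    return sum(ratios) / len(ratios), ratios

tokens = text.split(" ")
average, ratios = running_average_ttr(tokens, window=40)

print(len(tokens))        # 48
print(ratios)             # [0.825, 0.825, 0.825, 0.85, 0.825, 0.85, 0.85, 0.85, 0.85]
print(round(average, 4))  # 0.8389
```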

Question 7: Which of the following phenomena make word segmentation difficult
in the Sanskrit language?

1. Inflectional morphology

2. Ambiguity in the function of punctuation

3. Sandhi

4. None of these

Answer: 3
Solution: Refer to lecture 5

Question 8: Which of the following algorithms can be used for automatically creating
decision trees?

1. Gradient Descent

2. ID3

3. Adaboost

4. C4.5

Answer: 2, 4
Solution: Refer to lecture 5

Question 9: If the TTR (type-to-token ratio) value in a book after the first 500 tokens
is r500 and after the first 50000 tokens is r50k, then what is the expected order of these
two ratios?

1. r500 < r50k

2. r500 > r50k

3. r500 ≈ r50k

4. Not comparable

Answer: 2
Solution: Refer to lecture 4
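The expected ordering can be illustrated with a small simulation. The sketch below does not use a real book: it draws tokens from a made-up vocabulary of 5000 placeholder words with frequencies proportional to 1/rank, which is only a rough, Zipf-like assumption about how word frequencies behave. The vocabulary size, the seed, and all names are hypothetical choices.

```python
import random

def ttr(tokens):
    """Type-to-token ratio: distinct tokens divided by total tokens."""
    return len(set(tokens)) / len(tokens)

# Hypothetical synthetic "book": 5000 placeholder words whose sampling
# weights fall off as 1/rank (a Zipf-like assumption, not real data).
VOCAB_SIZE = 5000
vocabulary = [f"w{rank}" for rank in range(1, VOCAB_SIZE + 1)]
weights = [1.0 / rank for rank in range(1, VOCAB_SIZE + 1)]

rng = random.Random(0)  # fixed seed so the run is repeatable
tokens = rng.choices(vocabulary, weights=weights, k=50_000)

r500 = ttr(tokens[:500])
r50k = ttr(tokens)
print(r500, r50k)
print(r500 > r50k)
```

Because new word types appear more and more slowly as a text grows while the token count keeps increasing, the ratio after 50000 tokens comes out lower than after 500 in this setup, in line with answer 2.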
