Big Data Analytics (BDAG 19-5) : Quiz: GMP - 2019 Term V
For questions 1 to 10, identify the correct choice(s) for each question and write them on your
answer script. Each question carries 2 marks. Indicate the correct choice(s) on the question
paper itself and return the question paper after the quiz.
1. Suppose you fitted a logistic regression model to a given dataset and obtained a training
accuracy X and a testing accuracy Y. Now you want to add a few new features to the same data.
Select the option(s) which is/are correct in such a case. Assume all the other features remain
the same.
a. Training accuracy decreases
b. Training accuracy increases or remains the same
c. Testing accuracy decreases
d. Testing accuracy increases or remains the same
3. The logit function is the natural log of the odds. What could be the range of the logit
function over the domain x = [0, 1]?
a. (– ∞ , ∞)
b. (0, 1)
c. (0, ∞)
d. (– ∞, 0)
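
The definition in the question stem, logit(x) = ln(x / (1 - x)), can be evaluated numerically. The following is a minimal sketch using only the Python standard library; the function name `logit` and the sample values of x are assumptions chosen for illustration, and the endpoints 0 and 1 are excluded because the odds are zero or undefined there.

```python
import math


def logit(p: float) -> float:
    """Natural log of the odds p / (1 - p), for 0 < p < 1."""
    if not 0.0 < p < 1.0:
        # At p = 0 the odds are 0, and at p = 1 they are undefined (division by zero).
        raise ValueError("logit is finite only for 0 < p < 1")
    return math.log(p / (1.0 - p))


# Illustrative sample points (assumed, not taken from the quiz).
for p in (0.001, 0.25, 0.5, 0.75, 0.999):
    print(f"x = {p:<5}  logit(x) = {logit(p): .4f}")
```

Evaluating the function at values of x closer and closer to 0 and to 1 shows how it behaves near the ends of the domain.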
7. A neural network with multiple hidden layers and multiple nodes in each hidden layer using
a suitable activation function can form non-linear boundaries in a classification problem. The
statement is:
a. Always TRUE
b. Always FALSE
c. Depends on the data
d. Depends on the activation function being used
8. You have collected a dataset containing 10,000 rows of tweet text and no other information.
You have created a document-term matrix from the data, treating every tweet as a document.
Which of the following is correct with regard to the document-term matrix?
a. Removal of stopwords from the data will affect the dimensionality of the data
b. Normalization of words in the data will reduce the dimensionality of the data
c. Both statements a and b are correct
d. Neither statement a nor statement b is correct
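
A document-term matrix has one row per document and one column per distinct term, so its dimensionality is the size of the vocabulary. The sketch below builds such a matrix in plain Python so that the column count can be inspected under different preprocessing settings. The three toy tweets, the `STOPWORDS` set, and the helper names are hypothetical, and lowercasing is used here as just one simple form of word normalization.

```python
from collections import Counter

# Hypothetical toy tweets and stop-word list, chosen only for illustration.
tweets = [
    "The model is running",
    "the models ran on the data",
    "Data is the new oil",
]
STOPWORDS = {"the", "is", "on"}


def tokenize(text, remove_stopwords=False, lowercase=False):
    """Split on whitespace, optionally lowercasing and dropping stop words."""
    tokens = text.split()
    if lowercase:
        tokens = [t.lower() for t in tokens]
    if remove_stopwords:
        tokens = [t for t in tokens if t.lower() not in STOPWORDS]
    return tokens


def document_term_matrix(docs, **options):
    """Return (vocabulary, matrix) with one row per document, one column per term."""
    counts = [Counter(tokenize(d, **options)) for d in docs]
    vocabulary = sorted(set().union(*counts))
    matrix = [[c[term] for term in vocabulary] for c in counts]
    return vocabulary, matrix


vocab_raw, dtm_raw = document_term_matrix(tweets)
vocab_lower, dtm_lower = document_term_matrix(tweets, lowercase=True)
vocab_both, dtm_both = document_term_matrix(
    tweets, lowercase=True, remove_stopwords=True
)

for label, vocab in [("raw", vocab_raw), ("lowercased", vocab_lower),
                     ("lowercased, no stop words", vocab_both)]:
    print(f"{label}: {len(vocab)} columns")
```

Comparing the printed column counts shows how each preprocessing step changes the number of distinct terms in this toy corpus.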
9. Imagine you are solving a classification problem with two highly imbalanced classes. The
majority class is observed in 99% of the records in the training dataset. Your model has 99%
accuracy on the test data class prediction. Which of the following is TRUE in such a case?
a. Classification accuracy, Precision, and Recall are all good metrics
b. None of Classification accuracy, Precision, and Recall are good metrics
c. Classification accuracy is not a good metric, while Precision and Recall are
d. Classification accuracy is a good metric, while Precision and Recall are not
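
The three metrics named in the options can be computed by hand from confusion-matrix counts. The sketch below does so for an assumed scenario: a hypothetical test set of 1,000 records with the same 99:1 class ratio, scored against a baseline that always predicts the majority class. The data, the choice of the minority class as the positive class, and the helper names are all illustrative assumptions, not part of the quiz.

```python
# Hypothetical test set: 990 majority-class (0) and 10 minority-class (1) records.
y_true = [0] * 990 + [1] * 10
# Assumed baseline model: always predict the majority class.
y_pred = [0] * 1000


def confusion_counts(y_true, y_pred, positive=1):
    """Return (tp, fp, fn, tn), treating `positive` as the class of interest."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    tn = sum(t != positive and p != positive for t, p in pairs)
    return tp, fp, fn, tn


def safe_ratio(numerator, denominator):
    """Return 0.0 when the denominator is zero (the metric is then undefined)."""
    return numerator / denominator if denominator else 0.0


tp, fp, fn, tn = confusion_counts(y_true, y_pred)
accuracy = (tp + tn) / len(y_true)
precision = safe_ratio(tp, tp + fp)  # undefined here: no positive predictions
recall = safe_ratio(tp, tp + fn)

print(f"accuracy  = {accuracy:.2f}")
print(f"precision = {precision:.2f}")
print(f"recall    = {recall:.2f}")
```

Under these assumptions the baseline never identifies a minority-class record, which is worth keeping in mind when comparing the three printed values.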