Sample MCQ
Sample MCQ
Sample MCQ
a) Linear Regression.
b) K-Means Clustering.
c) Hierarchical Clustering.
d) Apriori Algorithm.
a) Decision Trees.
b) Naive Bayes.
c) K-Means Clustering.
d) Gradient Boosting.
c) To transform the features into a more suitable representation for a machine learning algorithm.
a) K-Means Clustering.
b) Hierarchical Clustering.
c) Box Plot.
d) Apriori Algorithm.
b) Precision measures the number of true positives, while recall measures the number of false
negatives.
c) Precision measures the number of false positives, while recall measures the number of true
negatives.
d) Precision measures the number of false negatives, while recall measures the number of true
positives.
a) Precision.
b) Recall.
c) F1 Score.
c) To ensure that the model is not biased towards the training data.
d) To ensure that the model is not biased towards the test data.
a) Linear Regression.
b) Random Forest.
c) K-Means Clustering.
a) A hypothesis that states there is a significant difference between a sample statistic and a
population parameter.
b) A hypothesis that states there is no significant difference between a sample statistic and a
population parameter.
c) A hypothesis that states there is a perfect correlation between two variables.
a) A hypothesis that states there is a significant difference between a sample statistic and a
population parameter.
b) A hypothesis that states there is no significant difference between a sample statistic and a
population parameter.
a) The probability of observing a sample statistic as extreme or more extreme than the one
observed, assuming the null hypothesis is true.
b) The probability of observing a sample statistic as extreme or more extreme than the one
observed, assuming the alternative hypothesis is true.
16. Which of the following measures of central tendency is most affected by outliers?
a) Mean
b) Median
c) Mode
a) Mean
b) Median
c) Mode
d) Standard deviation
18. The measure of central tendency that represents the most frequently occurring value in a
dataset is known as-
a) Mean
b) Median
c) Mode
d) Range
19. If a dataset has an even number of observations, what value is used as the median?
20. What is the measure of central tendency that is used to represent the typical value of a
dataset?
a) Mean
b) Median
c) Mode
a) Range
b) Standard deviation
c) Variance
d) Median
a) Mean
b) Median
c) Mode
25. What is the measure of spread that is equal to the square root of the variance?
a) Variance
b) Standard deviation
c) Range
29. What is the result of the expression `"Hello, " + "World!"` in Python?
a. "Hello,World!"
b. "Hello World!"
c. "Hello, World!"
d. Error
31. . To generate a random integer between 1 and 10 (inclusive), which `random` module
function should you use?
a. `randint(1, 10)`
b. `random(1, 10)`
c. `randrange(1, 10)`
d. `uniform(1, 10)`
32. In Python, how can you define a function that accepts an arbitrary number of positional
arguments?
a. Using the `*args` parameter
b. Using the `**kwargs` parameter
c. Using the `&args` parameter
d. Using the `#args` parameter
33. . Which of the following is the primary data structure in NumPy for working with arrays?
a. List
b. Tuple
c. ndarray
d. Dictionary
34. . How can you create a NumPy array containing integers from 0 to 9?
a. `np.array(0, 9)`
b. `np.array([0, 1, 2, 3, 4, 5, 6, 7, 8, 9])`
c. `np.arange(10)`
d. `np.linspace(0, 9, 10)`
35. . What is the data type of the elements in a NumPy array by default?
a. Integer
b. Float
c. String
d. None
36. What will be the result of the operation `np.array([1, 2, 3]) + np.array([4, 5, 6])` in
NumPy?
a. `[5, 7, 9]`
b. `[1, 2, 3, 4, 5, 6]`
c. `[1, 2, 3, 4, 5, 6]`
d. `[4, 5, 6]`
37. How can you access the element at the second row and third column of a NumPy array
`arr`?
a. `arr(2, 3)`
b. `arr[2, 3]`
c. `arr[1, 2]`
d. `arr[3, 2]`
39. . A sample of N observations are independently drawn from a normal distribution. The
sample variance follows
a. Normal distribution
b. Chi-square with N degrees of freedom
c. Chi-square with N − 1 degrees of freedom
d. t-distribution with N − 1 degrees of freedom
41. State whether the following statement is True or False.[1 mark] Covariance is a better
metric to analyze the association between two numerical variables than correlation.
a. True
b. Fasle
45. In Simple Linear Regression, if the equation of the regression line is given as `y = 2x +
3`, what is the predicted value of `y` when `x` is 5?
a. 13
b. 12
c. 16
d. None
49. When performing polynomial regression, what does increasing the degree of the
polynomial typically result in?
a. Improved model simplicity.
b. Decreased model flexibility.
c. Overfitting the datA)
d. Decreased model complexity.
50. Which type of regression is most suitable for handling multicollinearity among
independent variables?
a. Linear Regression.
b. Polynomial Regression.
c. Lasso Regression.
d. Logistic Regression.
3. Letf(x)=x3+3x2−24x+7.Selectthecorrectoptionsfromthefollowing:[3marks]
a. x=2willgivethemaximumforf(x).
b. x=2willgivetheminimumforf(x).
c. Thestationarypointsforf(x)are2and4.
d. None
4. Letf(x,y)=−3x2−6xy−6y2.Thepoint(0,0)isa
a. saddlepoint
b. maxima
c. Minima
d. None
5. You have a dataset with three variables: `X`, `Y`, and `Z`. The correlation coefficient
between `X` and `Y` is -0.6, and between `Y` and `Z` is 0.8. What is the correlation
coefficient between `X` and `Z`?
a. -0.48
b. -0.75
c. -1.33
d. None
a) Age
b) Gender
c) Income
d) Sales
a) Linear regression is used for predicting continuous variables, while logistic regression is used for
predicting categorical variables.
b) Linear regression is used for predicting categorical variables, while logistic regression is used for
predicting continuous variables.
a) R-squared
b) Odds ratio
c) P-value
d) Coefficient of determination
9. Which of the following is a measure of the strength and direction of the relationship between
two variables in regression analysis?
a) Coefficient of determination
b) R-squared
c) Correlation coefficient
d) Odds ratio
11. What is the difference between k-means clustering and hierarchical clustering?
a) K-means clustering forms clusters by iteratively assigning data points to the nearest centroid,
while hierarchical clustering forms clusters by iteratively merging or splitting clusters.
b) K-means clustering forms clusters by iteratively merging or splitting clusters, while hierarchical
clustering forms clusters by iteratively assigning data points to the nearest centroid.
15. Which of the following is a disadvantage of using a low learning rate in gradient descent?
3. ConsiderthefollowingconfusionmatrixfortheclassicationofHatchbackandSUV:
True
Hatchback SU
V
Prediction Hatchback 55 5
SUV 0 40
Findtheaccuracyofthemodel.
a. 0.95
b. 0.55
c. 0.45
d. 0.88
4. ConsiderthefollowingconfusionmatrixfortheclassicationofHatchbackandSUV:
True
Hatchback SU
V
Prediction Hatchback 55 5
SUV 0 40
Findthesensitivityofthemodel.
a. 0.95
b. 0.55
c. 1
d. 0.88
5. Given a Multiple Linear Regression model with three independent variables: `X1 = 4`, `X2
= 7`, and `X3 = 10`, and the coefficients: `β1 = 2`, `β2 = 3`, and `β3 = 1`, calculate the
predicted value of the dependent variable `Y`.
a. 31
b. 29
c. 23
d. 27
6. What is the difference between simple linear regression and multiple regression?
a) Simple linear regression involves only one independent variable, while multiple regression
involves more than one independent variable.
b) Simple linear regression involves more than one independent variable, while multiple regression
involves only one independent variable.
c) Simple linear regression and multiple regression are the same thing.
8. Which of the following is a method for finding the minimum of a function with multiple
variables without using derivatives?
a) Gradient descent
b) Newton's method
c) Levenberg-Marquardt algorithm
a) A technique used to reduce the size of a decision tree by removing unnecessary branches.
b) A technique used to increase the size of a decision tree by adding more branches.
a) To find the optimal decision boundary that separates two classes of data.