Dwm Question Bank Winter 2024

Download as pdf or txt
Download as pdf or txt
You are on page 1of 4

Rashtrasant Tukadoji Maharaj Nagpur University

Faculty of Science & Technology


Fifth Semester B.Tech. (Computer Science & Engineering (AI & ML) (Computer Science &
Engineering) (Computer Science) (C.B.C.S.) Examination 2024
DATA WAREHOUSING AND MINING
Elective – I
P. Page: 4 PRS/KW/24/2695
Time-Three Hours [Maximum Marks-70]

INSTRUCTIONS TO CANDIDATES
(1) All questions carry marks as indicated.
(2) Solve Question No. 1 OR Question No. 2.
(3) Solve Question No. 3 OR Question No. 4.
(4) Solve Question No. 5 OR Question No. 6.
(5) Solve Question No. 7 OR Question No. 8.
(6) Solve Question No. 9 OR Question No. 10.
(7) Due credit will be given to neatness and adequate dimension.
(8) Assume suitable data whenever necessary.

Unit 1
1. Define data warehouse. How is a data warehouse different from data base?
2. What is the differentiate between OLAP and OLTP.
3. What is data warehouse? Explain characteristics of data warehouse.
4. Explain the types of Multidimensional data model.
5. Draw and explain three tier architecture of data warehouse.
6. Draw and explain various schemas used to create data warehouse
7. What is OLAP? What are the different OLAP operations that can be performed on
multidimensional data model?
8. Define data warehouse? explain data warehouse architecture along with neat sketch.
9. Differentiate ROLAP, MOLAP and HOLAP server functionalities.
10. 1) Difference between data warehouse and data mart. 2) difference between database
and data warehouse
11. Define following: 1) data mart. 2) data warehouse. 3) data base management system.

Unit 2

1. Explain various data mining functionalities along with example.


2. Explain different data preprocessing techniques? What is the need of the data
preprocessing? Explain steps involved in data repressing.
3. Discuss major issues and challenges in data mining.
4. What is concept hierarchy and how it is useful in data mining. Draw concept hierarchy for
location dimension.
5. What is data mining task primitive? What do they specify? Explain in detail.
6. Explain following data mining functionalities. 1) Association and correlation analysis. 2)
clustering 3) classification.
7. What do you mean by preprocessing in data warehouse? Explain data reduction
strategies in brief.
8. Draw and explain knowledge discovery form database architecture with neat sketch.
What are the steps involved in KDD process?
9. Describe the typical architecture of data mining system? Write a application areas of data
mining.
10. Explain data integration and data transformation in data mining.
11. What is data mining? What are technical issues to be considered which designing and
implementing data warehouse environment

Unit 3

1. Explain classification by decision tree induction with explain. Given the steps involved in
decision tree algorithm states its advantages and disadvantage’s
2. Write short note on: 1) rule-based classification. 2) outlier analysis audits application. 3)
density-based clustering. 4) support vector machine (SVM). 5) Classification by back
propagation. 6) constraint-based clustering analysis. 7) k – means partitioned method.
8) agglomerative and decisive hierarchical clustering. 9) DBCAN clustering.
3. What is outlier? Why outlier is importance?
4. Give classification of clustering algorithm’s and explain partition-based clustering
algorithm namely k – means, starting its merits, demerits and application area.
5. Explain classification using naïve Bayesian classification.
6. Explain the difference between k- means and k- medoid algorithm.
7. Briefly outline the major ideas of naïve Bayesian classification.
8. Explain agglomerative and divisive algorithm in brief.
9. Write k- means algorithm apply it to classify the following data in 2- clusters data: 3, 3.5,
5, 3.3, 4.5, 5.2, 6.4, use Euclidean distance method.
10. Write a detailed note on split algorithm based on Gini index
11. What is clusters? Explain the types of clustering analysis.

Unit 4

1. Explain Apriori Algorithm for frequent item sets. what are the drawbacks of apriori
algorithm?
2. What is the process of generating association rules from frequent item sets? Explain with
example.
3. Explain the market- basket analysis using example.
4. Explain EP -Growth algorithm with the help of an example.
5. Generate FP tree for following transaction database. Assume min-sup=40%.
TID Item sets
T1 K, D, B, T
T2 D, G, T, W
T3 B, K, D, T
T4 K, W, G, D, T
T5 W, S, D, T

6. Explain mining various kinds of association rule.


7. A database has five transaction let min sup=2 .

TID Items bought


T1 A, B, C, D, E
T2 B, C, D
T3 B, C, D, T
T4 A, B, C, D, E
T5 B, C, D, E
Final all frequent item set using FP- Growth algorithm.
8. Consider transaction data for an all-electronic branch.
TID LIST OF ITEM IDS
T100 I1, I2, I5
T200 I2, I4
T300 I2, I3
T400 I2, I2,I4
T500 I1, I3
T600 I2, I3
T700 I1, I3
T800 I1, I2, I3, I5
T900 I1, I2,
There are nine transaction in this data base that is [D] = 9, apply the apriori algorithm
for finding frequent itemset is D consider min_ support = 2.
9. Explain following terms with the help of examples. 1) Frequent item sets. 2) confidence
3) support 4) closed item set. 5) association rules.
10. Write short notes on following: 1) constraint association mining. 2) multi-dimensional
association rules.
11. How FP growth algorithm work? Explain.
12. Discuss association rules mining versus correlation analysis.

Unit 5

1. Explain web mining with various types.


2. With the help of real-life example, Explain web structure mining and web usage in
detail.
3. With the help of a suitable real-life example, explain the significance of web data
mining.
4. Graph and network mining have become there a singly important and heavily
researched justify the same.
5. Discuss the basis measure for text retrieval explain retrieval method.
6. 1) Differentiate web context mining and web structure mining. 2) differentiate between
temporal and spatial data mining in detail.
7. Write short notes on: 1) graph mining 2) text mining 3) visual web data mining. 4) social
network analysis. 5) temporal data mining. 6) web content mining. 7) spatial data
mining.
8. What is web mining? Write the challenges that occur during knowledge discovery in
web data mining.
9. Explain in brief what are various text mining operations that can be performed on text.
10. Write short notes on : web usage mining.

You might also like