Big-Data-Koe097 2021 - 22 8th Sem

Download as pdf or txt
Download as pdf or txt
You are on page 1of 2

Printed Page: 1 of 2

Subject Code: KOE097


0Roll No: 0 0 0 0 0 0 0 0 0 0 0 0 0

BTECH
(SEM VIII) THEORY EXAMINATION 2021-22
BIG DATA

Time: 3 Hours Total Marks: 100


Note: Attempt all Sections. If you require any missing data, then choose suitably.

SECTION A

1. Attempt all questions in brief. 2*10 = 20


Q.no Questions Marks CO
(a) What do you mean by Apache Hadoop? 2 2
(b) Define the term Big Data. 2 1
(c) Why do we need Hadoop? 2 2
(d) What do you mean by Digital data? 2 1
(e) Define data replication in Hadoop Distributed File System. 2 3
(f) What do you mean by serialization in Hadoop Distributed File 2 3
System?
(g) Write down briefly about the schedulers. 2 4
(h) Briefly define NameNode under Hadoop Eco System. 2 4
90

1
(i) What is Zookeeper? 2 5

13
_2

(j) Mention the Execution Modes of Pig. 2 5

2.
P1

24
2E

SECTION B

5.
.5
P2

2. Attempt any three of the following: 10*3 = 30


17
Q.no Questions Marks CO
Q

|1

(a) What are views in HIVE? What is the difference between internal and 10 5
external tables in HIVE?
4

(b) Describe Map Reduce framework and Map Reduce working. 10 2


1
2:

(c) Describe Structured, Semi structured and Unstructured data? Explain 10 3


:3

with example.
13

(d) Describe about the “Shuffle & Sort” phase and “Reducer Phase” in 10 4
MapReduce.
2

(e) What are the benefits of Big Data? Discuss in detail about the 5V’s 10 1
02

characteristics of Big Data.


-2
06

SECTION C
1-

3. Attempt any one part of the following: 10*1 = 10


|1

Q.no Questions Marks CO


(a) Write short note on any two of the following. 10 5
i) Hadoop Eco System Frameworks.
ii) Joins and subqueries.
iii) IBM Big Data strategy.
(b) Write down the statement for developing a Map Reduce application. 10 2

QP22EP1_290 | 11-06-2022 13:32:14 | 117.55.242.131


Printed Page: 2 of 2
Subject Code: KOE097
0Roll No: 0 0 0 0 0 0 0 0 0 0 0 0 0

BTECH
(SEM VIII) THEORY EXAMINATION 2021-22
BIG DATA

4. Attempt any one part of the following: 10 *1 = 10


Q.no Questions Marks CO
(a) Explain the analytic processes and tools under Introduction to Big 10 1
Data.
(b) Mention about the Cluster specification? Describe how to Setting up a 10 3
Hadoop Cluster?

5. Attempt any one part of the following: 10*1 = 10


Q.no Questions Marks CO
(a) Discuss Master Slave and Peer-Peer replication in detail. 10 4
(b) Explain the Hbase concepts. How Zookeeper helps in monitoring a 10 5
Cluster?

6. Attempt any one part of the following: 10*1 = 10


Q.no Questions Marks CO
90

1
(a) Give the Anatomy of a Map Reduce job run in Map Reduce. 10 2

13
_2

(b) Give the detailed description about the Analysis vs. reporting while 10 1

2.
Introducing the Big Data.
P1

24
2E

5.
7. Attempt any one part of the following: 10*1 = 10
.5
P2

Q.no Questions Marks CO


17
Q

(a) Describe the Compression and Serialization under the Hadoop I/O 10 3
|1

Operation.
(b) Explain about the storage mechanism in HBase. Write a query to 10 4
4

create table in HBase.


1
2:
:3
13
2
02
-2
06
1-
|1

QP22EP1_290 | 11-06-2022 13:32:14 | 117.55.242.131

You might also like