Big Data

Download as pptx, pdf, or txt
Download as pptx, pdf, or txt
You are on page 1of 16

Big Data Definition

No single standard definition



Big Data is data whose scale, diversity, and complexity
require new architecture, techniques, algorithms, and
analytics to manage it and extract value and hidden
knowledge from it

Characteristics of Big Data: 1-Scale (Volume)
Data Volume
44x increase from 2009 2020
From 0.8 zettabytes to 35zb
Data volume is increasing exponentially

Characteristics of Big Data: Complexity (Varity)
Various formats, types, and structures
Text, numerical, images, audio, video,
sequences, time series, social media data,
multi-dim arrays, etc
Static data vs. streaming data
A single application can be
generating/collecting many types of data


To extract knowledge all these types
of data need to linked together
Characteristics of Big Data: Speed (Velocity)
Data is begin generated fast and need to be processed fast
Online Data Analytics
Late decisions missing opportunities
Examples
E-Promotions: Based on your current location, your purchase history, what you
like send promotions right now for store next to you

Healthcare monitoring: sensors monitoring your activities and body any
abnormal measurements require immediate reaction

Big Data: 3Vs
Some Make it 4Vs
Harnessing Big Data
OLTP: Online Transaction
Processing (DBMSs)
OLAP: Online Analytical
Processing (Data Warehousing)
RTAP: Real-Time Analytics
Processing (Big Data
Architecture & technology)

Whos Generating Big Data ?
Social media and
networks
(all of us are
generating data)
Scientific instruments
(collecting all sorts of data)
Mobile devices
(tracking all
objects all the
time)
Sensor technology
and networks
(measuring all kinds
of data)
The progress and innovation is no longer hindered by the ability to collect data

But, by the ability to manage, analyze, summarize, visualize, and discover knowledge from the collected
data in a timely manner and in a scalable fashion
The Model Has Changed
The Model of Generating/Consuming Data has Changed

Old Model: Few companies are generating data, all others are consuming data
New Model: all of us are generating data, and all of us are consuming data
Whats driving Big Data ?
- Ad-hoc querying and reporting
- Data mining techniques
- Structured data, typical sources
- Small to mid-size datasets
- Optimizations and predictive analytics
- Complex statistical analysis
- All types of data, and many sources
- Very large datasets
- More of a real-time
Value of Big Data Analytics
Big data is more real-time in
nature than traditional DW
applications
Traditional DW architectures (e.g.
Exadata, Teradata) are not well-
suited for big data apps
Shared nothing, massively
parallel processing, scale out
architectures are well-suited for
big data apps

Challenges in Handling Big Data
The Bottleneck is in technology
New architecture, algorithms, techniques are needed
Also in technical skills
Experts in using the new technology and dealing with big data

Thank You
Visit www.seminarlinks.blogspot.com to download

You might also like