AWS Ref Eren Ce Arch Itect Ure S: System
AWS Ref Eren Ce Arch Itect Ure S: System
AWS Ref Eren Ce Arch Itect Ure S: System
Amazon Web Services provides services and infrastructure to However, as the web application grows and the number of visitors
f e r company that is operating web applications. Analyzing logs can component of this architecture is Amazon Elastic MapReduce, a
Reectur
reveal information such as traffic patterns, user behavior, web service that enables analysts to process large amounts of
marketing profiles, etc. data easily and cost-effectively using a Hadoop hosted framework.
h it 2
Arc
C
E
Ma
n
zo
pR E
a
t
o
m
edlas 4
p
A
Fi Log
S
A
uc tic
m
2
les
a
C
S3
zo
E
a
e
n
zo
R
n
n
M
zo
E
a
m
n
zo
A
3
a
S
m
A
n
zo
a
m
t
n
ro
A
F
d
u
lo
ilesry p
C
oo r
S
on
F
D
g o d
az
Lo posit Hauste
R
m
n
A
zo
Fi Log
Re Cl
a
m
les
A
ro n
F o
b
t
d z
Wervers
u a
lo m
3
C A
Se
EC
A
m
a
2
zo
5
InSpo
n
7
st t
an
1 s ce
l y tice s
a
Antabas
Da 6 RD
A
m
S
a
zo
st
n
HT ly
TP 2 na
Re A
qu
es
ts
e ry
Cl
ou D eliv
dF t
A
tenk
m
a
ro n
zo
Vi
Cotwor
n
sit n
or t
Ne
s
The web front-end servers are running on Amazon An Amazon Elastic MapReduce cluster processes Data processing results are pushed back to a relational
System 1 Elastic Compute Cloud (Amazon EC2) instances. 4 the data set. Amazon Elastic MapReduce utilizes a
hosted Hadoop framework, which processes the data in a
6 database using tools like Apache Hive. The database
can be an Amazon Relational Database Service (Amazon
Overview 2
Amazon CloudFront is a content delivery network that
uses low latency and high data transfer speeds to
parallel job flow. RDS) instance. Amazon RDS makes it easy to set up,
operate, and scale a relational database in the cloud.
distribute static files to customers. This service also When Amazon EC2 has unused capacity, it offers EC2
generates valuable log information. 5 instances at a reduced cost, called the Spot Price. This 7 Like many services, Amazon RDS instances are
price fluctuates based on availability and demand. If your priced on a pay-as-you-go model. After analysis, the
Log files are periodically uploaded to Amazon Simple workload is flexible in terms of time of completion or required database can be backed-up into Amazon S3 as a database
3 Storage Service (Amazon S3), a highly available and capacity, you can dynamically extend the capacity of your snapshot, and then terminated. The database can then be
reliable data store. Data is sent in parallel from multiple web cluster using Spot Instances and significantly reduce the recreated from the snapshot whenever needed.
servers or edge locations. cost of running your job flows.