Mobile Phones Data Analysis: Introduction To Data Management Project Report
Mobile Phones Data Analysis: Introduction To Data Management Project Report
Mobile Phones Data Analysis: Introduction To Data Management Project Report
August-December 2020
Submitted by-
Dilshad Ahmad
Registration No- 11810636
Section- KM068
Course Code- INT217
Discipline of CSE/IT
Lovely School of Computer Science and Engineering
Lovely Professional University, Phagwara
CERTIFICATE
This is to certify that Dilshad Ahmad bearing Registration no. 11810636 has
completed INT 217 project titled, MOBILE PHONES data analysis under my
guidance and supervision. To the best of my knowledge, the present work is the
result of his original development, effort and study.
Date: 02/12/2020
DECLARATION
I, Dilshad Ahmad student of P132 (Program name) under CSE/IT Discipline at,
Lovely Professional University, Punjab, hereby declare that all the information
furnished in this project report is based on my own intensive work and is
genuine.
Date: 02/12/2020
Signature
I would like to express my deepest appreciation to all those who provided me the possibility
to complete this report. A special gratitude I give to my mentor whose contribution in
stimulating suggestions and encouragement helped me to coordinate my project especially in
writing this report. I express my thanks to my institution Lovely Professional University for
giving me an opportunity to learn this interesting topic. I also convey my regards to my
faculty assistance all through this training named “Data Science Project on MOBILE
PHONES Data analysis”. Once again, I would like to thank all my supporters from the core
of my heart.
Index
Topic Page
Cover 1
Certificate 2
Declaration 3
Acknowledgement 4
Index 5
Introduction 6
ETL Process 8
Analysis 9
Reference 13
INTRODUCTION
A simple database, such as Excel, holds all information about one subject in a single table.
Relational databases, on the other hand, consist of many tables with each one containing
information about different, but related topics.
RECORDS
In database terminology, a record holds all the information or data about one specific object
in the database. In Excel, each cell in a worksheet contains one item of information or value.
FIELDS
Each item of information in a database record, such as a telephone number or street number,
is referred to as a field. In Excel, the individual cells of a worksheet serve as fields, since
each cell can contain a single piece of information about an object.
FIELD NAMES
It is vital to organize a database so that you can sort or filter the data to find specific
information. Adding column headings, known as field names, makes it easier to enter data in
the same order for each record.
Additionally, Microsoft has several data tools to make it easier to work with vast amounts of
data stored in Excel tables and to help keep it in good condition.
One of those tools is the data form. You can use it to find, edit, enter, or delete records, in
tables containing up to 32 fields or columns.
The default form includes a list of field names in the order they are arranged in the table, to
ensure that users enter records correctly. Next to each field name is a text box for entering or
editing the individual fields of data.
While it is possible to create custom forms, creating and using the default form is often all
that is needed.
A common problem with all databases is data errors. In addition to simple spelling mistakes
or missing fields of data, duplicate data records can be a concern as a data table grows in size.
Another of Excel's data tools can be used to remove these duplicate records — either exact
or partial duplicates.
SORTING DATA IN EXCEL
Sorting means to reorganize data according to a specific property, such as sorting a table
alphabetically by last names or chronologically from oldest to youngest.
Excel's sort options include sorting by one or more fields, custom sorting, such as by date or
time, and sorting by rows which makes it possible to reorder the fields in a table.
Objective:
The objective of the project is to analyze the data set and find the solutions for different
queries like:
1. Top prices of mobiles.
2. Highest number of Camera.
3. Ratings of Mobile phones.
4. Different ranges of batteries.
5. Count of mobile names by Display.
Source of data:
The dataset has taken from:
https://www.kaggle.com/shreyaspj/android-devices-and-mobiles
ETL PROCESS:
Extraction: Data extraction is a process that involves retrieval of data from various sources.
Frequently, companies extract data in order to process it further, migrate the data to a data
repository (such as a data warehouse or a data lake) or to further analyze it. It’s common
to transform the data as a part of this process. For example, you might want to perform
calculations on the data — such as aggregating sales data — and store those results in the
data warehouse. If you are extracting the data to store it in a data warehouse, you might want
to add additional metadata or enrich the data with timestamps or geolocation data. Finally,
you likely want to combine the data with other data in the target data store.
The dataset on which I am working it is the structured data.
Structured data:
If the data is structured, the data extraction process is generally performed within the source
system. It is common to perform data extraction using one of the following methods
Full extraction:
Data is completely extracted from the source, and there is no need to track changes.
Transformation:
Transform is the process of converting the extracted data from its previous form into the form it needs
to be in so that it can be placed into another database. Transformation occurs by using rules or lookup
tables or by combining the data with other data.
Analysis of dataset
Objective I: Top Prices of Mobile
A pivot chart was made describing the top 10 sum of prices of the mobile phones arranged in
increasing order with their price lists. It was found that top prices of mobile were of Samsung
only.
20
18
16
14
12
10
8
6
4
2
0
Gome C7…
POCO M2…
Realme X…
Realme X…
Realme 2…
Realme 3…
Realme 3…
Realme 5…
Realme 5…
Realme C2…
Realme C2…
Realme C3…
Realme C3…
Vivo Y91i…
Vivo Z1Pro…
Vivo Z1Pro…
Redmi Note…
Redmi Note…
Redmi Note…
Realme X2…
Realme X2…
Realme X2…
Realme 5i…
Realme 5i…
Realme 5i…
Realme 5i…
Realme 6i…
Realme 6i…
Vivo Y12…
Realme XT…
Realme XT…
This pivot chart was created by taking different ranges or capacity of battery showing their
power of functionality with respect to their prices. We can choose the best battery in our
compatible price given by the data.
Tota
DIFFERENT RANGE OF BATTERIES l
120000
100000
80000
60000
40000
20000
0
Lithium-ion
Lithium-ion
Lithium-ion
Lithium-ion
Lithium-ion
Lithium-ion
Lithium-ion
Lithium-ion
Lithium-ion
Lithium-ion
4300 mA h
4 5 0 0 mA h
4 5 0 0 mA h
4 5 0 0 mA h
4 5 0 0 mA h
5 0 0 0 mA h
5 0 0 0 mA h
4 5 0 0 mA h
4 5 0 0 mA h
4 5 0 0 mA h
B at t e ry
B at t e ry
B at t e ry
B at t e ry
B at t e ry
B at t e ry
B at t e ry
B at t e ry
B at t e ry
B at t e ry
Sams u n gSams
G al axy
u n gN
Sams
Goal
t eaxy
u20
n gN
( Sams
Mys
G
o taleaxy
t20
iucnBUl
gNro
oG
Sams
t tra
nal
ezaxy
e20
5G
,u nUl
(NgSams
Mys
otGra
t al
e t10
5G
axy
iucnLg(iN
Mys
tSa
Geoal
ms
t( eAaxy
t 10
iuucra
n Sams
LgS20
iBtGleac
aUl
(luaA
k,nxy
tura
graS20
GSams
(Ral
Co
eaxy
dsUlmi
,utnra
S20+
cg BSams
G(lCo
al
ac(axy
sk,
Clmi
uonucgS20+
dGGB
ra
all uyaxy
( ,eCo
, 128
sS20+
mi cGBB
( lCo
)acsk,mi128
c G ray, 128
256 GBBl)ac k, 256 B roGnBz e) , 256 G128 B ) G B ) 128 G B ) 128 G B ) 128 G B ) GB) GB)
Objective 5: Count of Name by Display
A pivot chart was made to view the display properties counting the numbers of names using the
same display for a better view. It represents the number of persons using the same display.
Total
Reference: https://www.kaggle.com/shreyaspj/android-devices-and-mobiles