TM351 Data Management and Analysis: Prepared by Eng. A.Samy Tel: 99941566
TM351
Data management and analysis
Part 1
Metadata
Data about the dataset itself. Three kinds:
1. Descriptive: supporting identification and discovery: for example, the name of a dataset, or a
description of its contents
2. Structural: relating to the structure of the dataset: for example, the column headings in a tabular dataset
3. Administrative: recording the means by which the dataset came into being and how it may be, or may
have been, used.
• Metadata can itself be subject to data analysis
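The three kinds of metadata can be sketched as a small Python record. This is only an illustration: the dataset name, column names, and every value below are hypothetical and not taken from the module materials.

```python
# Hypothetical metadata record for an imaginary dataset.
# All names and values are made up for illustration.
metadata = {
    # Descriptive: supports identification and discovery
    "descriptive": {
        "name": "city_temperatures",
        "description": "Daily mean temperatures for a set of cities",
    },
    # Structural: describes how the dataset is laid out
    "structural": {
        "columns": ["city", "date", "mean_temp_c"],
    },
    # Administrative: how the dataset came into being and how it may be used
    "administrative": {
        "collected_by": "example weather service",
        "created": "2020-01-01",
        "permitted_use": "teaching only",
    },
}

# Metadata can itself be analysed, for example by counting
# how many fields are recorded under each kind.
field_counts = {kind: len(fields) for kind, fields in metadata.items()}

for kind, count in field_counts.items():
    print(kind, count)
```

Keeping the three kinds in separate sub-records makes it easy to ask questions about the metadata itself, such as which datasets lack administrative information.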
Stakeholders
• A dataset or database may have a very broad range of stakeholders
• For example, if data about an individual is being analyzed, then:
• That individual is a stakeholder
Kuwait – Salmiya – Salem Al-Mubarak St. North Salmiya Market Complex 2nd floor.
(965) 2572 6686 - 2571 4343 (965) 2571 0775 [email protected]
Big data
• Big data aims to gather, analyze, link, and compare large datasets to identify patterns
Data handling
• Comprises two distinct sets of activities, roles and responsibilities: data management and data analysis.
• Two ways to characterise data handling:
• as a cycle or life cycle (which mainly emphasizes data management)
• or as a pipeline (which combines data management with the way the data is used).
Repurposing (reuse)
Trust
Trust can relate to several aspects of data analysis:
• Trust in the data itself: in its origins, documentation, security, and curation and in the quality of its
maintenance.
• Trust in the processing applied to the data
• Trust in the data managers and analysts themselves: their competence, their understanding of procedures and
processes, and of concepts of fitness for purpose, data quality, appropriate interpretation of results and
requirements.
Bias
• Human bias
• Bias in data capture
• Bias in data cleaning
• Bias in data handling
• Data engineering has been defined as ‘extracting information partly through the analysis of data’
TM351
Data management and analysis
Part 2
Numerous problems can arise when trying to represent data, both in the world (output) and on a computer
(coding). Resolving them may require difficult decisions.
2.2 In the beginning was the bit: data and data types
Every typed programming language supplies a different set of atomic primitive types.
• Java offers:
• boolean (true or false; represents 1 bit of information)
• byte (1-byte signed)
• char (2-byte unsigned)
• short (2-byte signed)
• int (4-byte signed)
• double (8-byte floating point).
• Python offers a richer set of built-in types, including complex numbers and various collection types.
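A short sketch of the Python types mentioned above. The variable names and values are illustrative assumptions only; the point is that Python integers are not fixed at 4 bytes, that complex numbers are built in, and that several collection types are available without any import.

```python
# Built-in numeric types
whole = 42            # int: arbitrary precision, not a fixed 4 bytes
real = 3.5            # float: double precision on typical builds
z = complex(3, 4)     # complex: real part 3, imaginary part 4
flag = True           # bool

# An int can grow far beyond what fits in 8 bytes
big = 2 ** 100

# The magnitude of 3+4j is 5.0 (a 3-4-5 right triangle)
magnitude = abs(z)

# Built-in collection types holding the same values
as_list = [1, 2, 2, 3]        # ordered, mutable, allows duplicates
as_tuple = tuple(as_list)     # ordered, immutable
as_set = set(as_list)         # unordered, duplicates removed
as_dict = {"module": "TM351", "part": 2}   # key-value mapping

for value in (whole, real, z, flag, as_list, as_tuple, as_set, as_dict):
    print(type(value).__name__, value)
```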
3. Interval: numbers on an interval scale can be ranked, and we know how far apart values are,
such as on a temperature scale, but the scale has no meaningful fixed origin (zero point).
Example: temperature in Celsius or Fahrenheit scale
4. Ratio: numbers are on an interval scale, but with a meaningful, known, fixed origin
Example 1: Mass of an object
Example 2: Height of an object
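The difference between interval and ratio scales can be checked with a little arithmetic. The sketch below uses made-up readings: on an interval scale (Celsius, Fahrenheit) differences convert consistently between units but ratios do not, whereas on a ratio scale (mass) the ratio is the same in any unit.

```python
def celsius_to_fahrenheit(c):
    """Convert a Celsius temperature to Fahrenheit."""
    return c * 9 / 5 + 32

# Hypothetical temperature readings (interval scale)
t1_c, t2_c = 10.0, 20.0
t1_f, t2_f = celsius_to_fahrenheit(t1_c), celsius_to_fahrenheit(t2_c)

# Differences are meaningful: 10 Celsius degrees is always 18 Fahrenheit degrees
diff_c = t2_c - t1_c
diff_f = t2_f - t1_f

# Ratios are not meaningful: "twice as hot" depends on the unit chosen
ratio_c = t2_c / t1_c
ratio_f = t2_f / t1_f

# Hypothetical masses (ratio scale): zero really means "no mass"
m1_kg, m2_kg = 2.0, 4.0
m1_g, m2_g = m1_kg * 1000, m2_kg * 1000

# The ratio is the same whichever unit is used
ratio_kg = m2_kg / m1_kg
ratio_g = m2_g / m1_g

print("temperature ratios:", ratio_c, ratio_f)
print("mass ratios:", ratio_kg, ratio_g)
```

Because the temperature ratio changes with the unit, a statement such as "20 °C is twice as hot as 10 °C" carries no physical meaning, while "4 kg is twice as heavy as 2 kg" does.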
5 Transporting data
• XML is popular as a message passing format in the delivery of web services.
Other common formats for transporting data:
• CSV – comma-separated values file (sometimes referred to as a comma-separated variable file).
• JSON – JavaScript Object Notation
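As a rough illustration, the same record can be written in all three formats using only the Python standard library. The record and its field names are hypothetical; the sketch also shows one practical difference, namely that CSV carries no type information, so numbers come back as strings.

```python
import csv
import io
import json
import xml.etree.ElementTree as ET

# Hypothetical record to transport
record = {"module": "TM351", "part": 2}

# CSV: a header row followed by one data row
buffer = io.StringIO()
writer = csv.DictWriter(buffer, fieldnames=list(record), lineterminator="\n")
writer.writeheader()
writer.writerow(record)
csv_text = buffer.getvalue()

# JSON: keeps the distinction between strings and numbers
json_text = json.dumps(record)

# XML: one child element per field, all values stored as text
root = ET.Element("record")
for key, value in record.items():
    ET.SubElement(root, key).text = str(value)
xml_text = ET.tostring(root, encoding="unicode")

# Reading the data back
from_json = json.loads(json_text)
from_csv = next(csv.DictReader(io.StringIO(csv_text)))

print(csv_text)
print(json_text)
print(xml_text)
print(from_json, dict(from_csv))
```

After the round trip, the JSON version still holds the integer 2, while the CSV version holds the string "2" and would need an explicit conversion.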