L1 Intro Data Analytics

Download as pdf or txt
Download as pdf or txt
You are on page 1of 2

L1: Introductory Concepts in Data Analytics Data Analytics

• data analytics can be applied in various


What is Statistics? domains, including business, healthcare,
• a branch of mathematics that deals with the finance, marketing, sports, and more.
collection, analysis, interpretation, • plays a crucial role in helping organizations
presentation, and organization of data. make informed decisions, optimize processes,
• provides methods for drawing conclusions and gain a competitive advantage.
about the characteristics of a population based • additionally, with the increasing volume of data
on a limited set of observations or a sample generated in today's digital world, data analytics
from that population. has become a vital tool for extracting valuable
• the goal of statistics is to uncover patterns, information from large datasets.
trends, and relationships within it.
Statistics
What is Biostatistics? Descriptive Statistics
• branch of statistics that deals specifically with • Concerns about the collection, organization,
the application of statistical methods to and presentation of information under studied.
biological and health-related data. • The researcher tries to describe a situation
• it involves the design, analysis, and under study.
interpretation of experiments and observational Infernal Statistics
studies in fields such as medicine, biology, • Concerns about the analysis and interpretation
epidemiology, public health, and environmental of information under studied
science. • This implies before carrying out an inference,
appropriate and correct descriptive measures
What is Data Analytics? are employed to bring out good results
• the primary goal of data analytics is to leverage
data to gain valuable insights and make Data
informed decisions. • The raw materials/facts which the statistician’s
• the process of examining, cleaning, work.
transforming, and modeling data to extract • Scales of measurement
useful information, draw conclusions, and Quantitative - Measures the value or counts of data.
support decision-making. Answers the question, “how many or how much?”
• it involves the use of various techniques and Example: weight in KG, age, scores in quizzes.
tools to analyze data and uncover patterns, Qualitative - Describes the data as to categories or
trends, correlations, and insights. groups. Answers the question, “what type?”
Example: courses (BSBIO, BSPSYCH, etc,)
Key Components of Data Analytics
• Data Collection: Gathering relevant data from Nominal - Characterized by data that consist of names,
various sources, which can include databases, labels, or categories only.
spreadsheets, sensors, logs, and more. Example: religion, civil status, and other qualitative
• Data Cleaning: Ensuring that the collected data.
data is accurate, complete, and free from Ordinal - Involves data that may be arranged in some
errors. This may involve handling missing order, but differences between data values either
values, removing duplicates, and addressing cannot be determined or are meaningless.
inconsistencies. Example: job position, military rank, year level, etc.
• Data Transformation: Preparing and Interval - Like the ordinal, with the additional property
organizing the data for analysis. This can that meaningful amounts of differences between data
include converting data into a suitable format, can be determined. However, no inherent zero starting
aggregating or summarizing information, and point is used.
creating new variables. Example: IQ score, monthly salary of working students,
• Data Analysis: Applying statistical and and age.
mathematical techniques, as well as using Ratio - At this level, inherent zero starting point is
specialized tools and software, to explore and important, and differences and ratios are meaningful.
interpret the data. Common methods include Example: number of siblings in a family, width.
descriptive statistics, inferential statistics,
machine learning, and data visualization. Population
• Data Interpretation: Drawing meaningful • The totality of all the objects of a certain class
conclusions from the analyzed data and under consideration
identifying actionable insights. This step often • A complete set of individuals, objects or
involves making predictions or measurements having some common
recommendations based on the findings. observable characteristics.
• Data Visualization: Representing data in • Example: the students enrolled this SY 23-24
graphical or visual formats, such as charts,
graphs, and dashboards, to make it easier for
stakeholders to understand complex patterns
and trends.
Sample
• A finite number of objects taken from the
population
• Example: 1st year students taken from the
enrolled list this SY 23-24
Parameter – describes the whole population while
statistics describes a sample of a given population.

Variables
• In any research, the very first thing that a
researcher does is to define the variables in his
study.
• Variable is any quantity that make different
values
• A characteristic of data observed or measured
that may vary from person to person
• Example: a researcher would like to study
about the study habits of Dean’s List students
and correlate these to their GWA.
(Variables – study habits and GWA which may
vary depending on the individual students)

Classification of Variables
According to Functional Relationships:
1. Independent – predictor variable
2. Dependent - criterion variable

According to Continuity of Values


1. Continuous – measurable (this may be
expressed in decimals)
2. Discrete – countable (this may not be
expressed in decimals, instead whole numbers)

You might also like