AST 110 - Data Analytics (Reviewer)
Role – increases efficiency and improves performance by discovering patterns in data.
Steps in Data Analytics Process
1. Data Mining – extracting data from unstructured data sources.
2. Data Warehousing – designing and implementing databases that allow easy access to data mining results.
3. Statistical Analysis – creates insights from data.
4. Data Presentation – allows insights to be shared with stakeholders.
Importance of Data Analytics – analyzing data can optimize efficiency in many different industries.
Classification of Variables according to Continuity of Values
1. Continuous Variable – can take decimal values.
2. Discrete Variable – takes whole-number values only, not decimals.
Classification of Variables according to Levels of Measurement
1. Nominal – measurement that distinguishes responses into attributes.
2. Ordinal – measurement whose values are arranged from low to high.
3. Interval – no true value of 0.
4. Ratio – has a true value of 0; sameness/differences.
Finite – counted immediately.
Sample – a cross-section of elements drawn from a population.
Randomization – the process of getting a sample.
Sampling – the process of getting the number of individuals from a population.
Tabulation – the process of condensing data and arranging them in a table.
Classification – the process of putting together similar items.
Mean: $\bar{x} = \frac{\sum fx}{n}$
Median: $\tilde{x} = LL + \left(\frac{\frac{n}{2} - cf}{f}\right) i$
Mode: $\hat{x} = LL + \left(\frac{d_1}{d_1 + d_2}\right) i$
Above 3 – Extremely Above Normal
3 – Normal
Below 3 – Extremely Below Normal
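The grouped-data formulas for the mean, median, and mode can be checked with a short Python sketch. The notes do not define every symbol, so the code assumes the usual textbook reading: LL is the lower boundary of the median (or modal) class, cf is the cumulative frequency before the median class, d1 and d2 are the differences between the modal class frequency and the frequencies of the classes before and after it, and i is the class size. The names ClassRow, grouped_mean, grouped_median, and grouped_mode, and the sample table, are made up for illustration.

```python
from dataclasses import dataclass


@dataclass
class ClassRow:
    lower: float  # LL: lower boundary of the class (assumed meaning)
    upper: float  # upper boundary of the class
    freq: int     # f: frequency of the class


def grouped_mean(rows):
    """Mean = sum(f * x) / n, where x is the class mark (midpoint)."""
    n = sum(r.freq for r in rows)
    return sum(r.freq * (r.lower + r.upper) / 2 for r in rows) / n


def grouped_median(rows):
    """Median = LL + ((n/2 - cf) / f) * i for the class containing n/2."""
    n = sum(r.freq for r in rows)
    cf = 0  # cumulative frequency BEFORE the current class
    for r in rows:
        if r.freq > 0 and cf + r.freq >= n / 2:
            i = r.upper - r.lower  # class size
            return r.lower + ((n / 2 - cf) / r.freq) * i
        cf += r.freq
    raise ValueError("frequency table is empty")


def grouped_mode(rows):
    """Mode = LL + (d1 / (d1 + d2)) * i for the class with the highest f."""
    k = max(range(len(rows)), key=lambda j: rows[j].freq)  # first modal class
    f_modal = rows[k].freq
    # Assumption: a missing neighbour class counts as frequency 0.
    d1 = f_modal - (rows[k - 1].freq if k > 0 else 0)
    d2 = f_modal - (rows[k + 1].freq if k < len(rows) - 1 else 0)
    i = rows[k].upper - rows[k].lower
    return rows[k].lower + (d1 / (d1 + d2)) * i


# Hypothetical frequency table: four classes of size 10, n = 20.
table = [
    ClassRow(10, 20, 2),
    ClassRow(20, 30, 5),
    ClassRow(30, 40, 8),
    ClassRow(40, 50, 5),
]

print(grouped_mean(table))    # 33.0
print(grouped_median(table))  # 33.75
print(grouped_mode(table))    # 35.0
```

For this table the median class is 30 to 40 (cf = 7 before it), so the median is 30 + ((10 - 7) / 8)(10) = 33.75. The same class is modal with d1 = d2 = 3, which puts the mode at the class mark, 35.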
Frequency Distribution – Mean (Average), Median (Middle), and Mode (Repeated)
Range – the difference between the highest and lowest scores. R = HS − LS
Class Interval – a category defined by a lower limit and an upper limit.
Class Boundaries – the true limits between the upper limit of one class and the lower limit of the next.
Class Mark (x) – the middle value of a class interval.
Class Size (i) – the difference between the upper and the lower class boundary. CS = R/k, where k is the number of classes.
Relative Frequency – percentage distribution in every class interval.
Hypothesis Testing – its purpose is not to question the computed value but to make a judgement; it generalizes to a population from relatively small samples.
Hypothesis – explanation for a certain event; measurable and testable.
Null Hypothesis (Ho) – empty; statement of equality indicating that no relationship exists.
Alternative Hypothesis (HA) – statement of the expectation; derived from the theory under study.
1. Non-Directional Alternative Hypothesis – states the existence of a difference.
2. Directional Alternative Hypothesis – specifies that one group performs better than the other.
Critical Value – depends on the nature of the null hypothesis; depends on the level of significance.
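How the critical value moves with the kind of hypothesis and the level of significance can be seen in a small Python sketch. It rests on two assumptions that the notes do not state: a directional hypothesis is treated as a one-tailed test and a non-directional one as two-tailed, and the standard normal (z) distribution is used because it is available in Python's standard library. A t-test would instead need the t distribution with its degrees of freedom. The function name z_critical is hypothetical.

```python
from statistics import NormalDist


def z_critical(alpha, directional):
    """Critical z value at significance level alpha (assumes a z test).

    directional=True  -> one-tailed: all of alpha sits in one tail
    directional=False -> two-tailed: alpha is split across both tails
    """
    tail_area = alpha if directional else alpha / 2
    return NormalDist().inv_cdf(1 - tail_area)


print(round(z_critical(0.05, directional=False), 3))  # 1.96
print(round(z_critical(0.05, directional=True), 3))   # 1.645
print(round(z_critical(0.01, directional=False), 3))  # 2.576
```

A smaller level of significance or a two-tailed test pushes the critical value further from zero, so the computed statistic has to be more extreme before the null hypothesis is rejected.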
2. Chance of failing to reject the null hypothesis when it is false
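$t = \dfrac{\sum d}{\sqrt{\dfrac{n\sum d^2 - (\sum d)^2}{n - 1}}}$, where d is the difference between each pair of scores and n is the number of pairs.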
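The t formula built from difference scores can be sketched in Python. The code uses the form t = Σd / sqrt((nΣd² − (Σd)²) / (n − 1)), which is algebraically the same as the mean difference divided by its standard error. The function name paired_t and the before/after scores are made up for illustration, and the sketch takes d as after minus before.

```python
from math import sqrt


def paired_t(before, after):
    """t statistic for paired scores, computed from the differences d."""
    if len(before) != len(after):
        raise ValueError("both lists must have the same number of scores")
    d = [a - b for a, b in zip(after, before)]  # difference per pair
    n = len(d)
    if n < 2:
        raise ValueError("at least two pairs are needed")
    sum_d = sum(d)                    # Σd
    sum_d2 = sum(x * x for x in d)    # Σd²
    radicand = (n * sum_d2 - sum_d ** 2) / (n - 1)
    if radicand == 0:
        raise ValueError("all differences are equal; t is undefined")
    return sum_d / sqrt(radicand)


# Hypothetical scores for five subjects measured twice.
before = [10, 12, 9, 11, 13]
after = [12, 15, 10, 13, 14]

print(round(paired_t(before, after), 3))  # 4.811
```

Here d = [2, 3, 1, 2, 1], so Σd = 9 and Σd² = 19, which gives t = 9 / sqrt((95 − 81) / 4) = 9 / sqrt(3.5). The computed t is then compared with the critical value for n − 1 = 4 degrees of freedom.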