Stats Notes
Stats Notes
Stats Notes
STATISTICS
Is matter of scientific method that deals with COSPA
Collection
Organizing
Summarizing
Presenting
Analyzing ……. DATA
DATA
Is a set of facts and figures
Are facts and figures collected, analyzed, and summarized for presentation and interpretation.
KINDS OF DATA
Alphanumeric Data- number, letters, special character (), *
Image Data - Graphic Image , Pictures
Audio Data- sound, noises, tones
Video Data - Movable pictures
Computer Data - Binary digits (Password)
numeric , special character, bites? (Di ko maintindiahn si sir hehe)
2. Inferential Statistics
Utilizes sample data to make estimates decisions, predictions, or other generalization about
larger set of data
Prediction
QUALITATIVE (Categorical)
> non numerical
> includes label or names used to identify an attribute of each elements
> can be used to classify / categorize
(Take categories as their values such as “True”, “False” , “Yes” ,“No” or “Blue” , “Red”
> Text File
Examples:
Small, Medium, Large (Sizes)
Blue, Red and Orange (Color)
QUANTITATIVE
> Referring to a number
> Values that indicate how much or how many of something
Examples
120kg
150 degrees fahrenheit
1 dozen egg
*weight / time / temperature / mass
1. Discrete Data
- counting process, finite set / involves whole number (sakto)
Ex.
Fingers
1 dozen
Number of enrollee
Basketball scored
Number of pets
2. Continues Data
Measurements / measuring process/ any value in a certain range
Ex.
Temperature
Height
Weight
Area of a house
>Categorical and numerical are the two classification of variables
1. Categorical
2. Numerical (Discrete and Continuous)
2. Ordinal Scale
Ordered attributes / can perform mathematical computations/ Likert Scale
Example:
First, second, third
1= Good
2= Very Excellent
3= Excellent
4= Superior
Attributes can be Rank Ordered
3. Interval Scale
Ordinal Ranking / Added attribute of equal spacing
Example:
Temperature in Fahrenheit
30 degrees is more than 20 degrees
20 degrees is more than 10 degrees
4. Ratio Scale
Ordinal Ranking / Equal Spacing / Absolute Zero
Example:
Physical Measurement
Height Depth
Weight Speed
Qualitative:
1. Nominal
2. Ordinal
Quantitative:
3. Interval
4. Ratio
Classification of Data
Primary Data
> refers to the data collected by the researcher, for the very first time, from different sources, with a particular
problem, question or specific purpose in mind (Original Data)
Secondary Data
> refers to the data that is collected by someone other than the primary users
Example:
Newspaper, Magazines, Websites, Publications (Government), Records, Journals and Books
Internal Data
Bio data resume
External Data
NSO total population
secondary data
1. Survey (Sample)
An activity in which many people are asked a question or a series of questions in order to gather
information about what most people do or think about something.
2. Questionnaire/ Indirect Method
Is a research instrument consisting of a series of questions for the purpose of gathering
information from respondents
3. Interview / Direct Method
This is done through a direct and personal contact of the researchers with the person from
whom data will be collected.
4. Registration
This method is done through the gathering of data from concern offices
5. Observation Method
This method is purely based on the subjective remarks of the observer
6. Experimental Method
It is the method that determines the cause and effect relationship of a certain
constraint or event under a controlled situation.
CLASSIFICATION OF VARIABLE
According to functional relationship
Type of Research
> Deductive Research
> Inductive Research
STATISTICAL INFERENCE
Many situations require data for groups of elements (individuals, companies, households, voters,
products, customers and so on). Because of time, cost and other concerns, data can be collected from only a
small portion of the group.
The larger group of elements in a particular study is called population (Universe) , and the smaller
group is called sample.
TECHNIQUE
PROBABILITY SAMPLING EXAMPLES
2 Types of SRS
1. Probability sampling- involves random selection. (unbias) masteral
2. Non-Probability Sampling- The sampling technique that do not involve random selection of data. (bias)
student
Clustering
In cluster sampling, researchers divide a population into smaller groups known as clusters. They then
randomly select among these clusters to form a sample. It is a method of probability sampling that is
often used to study large populations, particularly those that are widely geographically dispersed.
A sampling technique in which the researcher selects samples based on the subjective judgment of the
researcher rather than random selection. It is a less stringent method.
Availability / Convenience Sampling
Is a type of non-probability sampling that involves the sample being drawn from that part of the
population that is close to hand. This type of sampling is most useful for pilot testing.
Samples are selected from the population only because they are conveniently available to the
researcher . Researcher choose these samples just because they are easy to recruit, and the
researchers did not consider selecting a sample that represents the entire population. (involves
selecting samples based on convenience.)
Consecutive Sampling
Very similar to convenience sampling, with a slight variation,
The researcher picks a single person or a group of sample, conducts research over a period, analyzes
the results, and then moves on to another subject or group if needed.
It gives the researcher a chance to work with many topics and fine-tune his/ her research by collecting
results that have vital insights. (total enumeration sampling)
Snowball Sampling
Selecting participants by finding one or two participants and then asking them to refer you to others.
Examples:
Meeting homeless person, interviewing that person, and then asking him/ her to introduce you
to other homeless people you might interview