Lesson 1 (Obtaining Data)


Republic of the Philippines

CAMARINES NORTE STATE COLLEGE


F. Pimentel Avenue, Brgy. 2, Daet, Camarines Norte – 4600, Philippines

COLLEGE OF ENGINEERING

ENGINEERING DATA ANALYSIS

Data engineering is the aspect of data science that focuses on practical applications of data collection
and analysis. For all the work that data scientists do to answer questions using large sets of information,
there have to be mechanisms for collecting and validating that information. In order for that work to
ultimately have any value, there also have to be mechanisms for applying it to real-world operations in
some way. Those are both engineering tasks: the application of science to practical, functioning
systems.

1. OBTAINING DATA

1.1 Methods of Data Collection

Simple Random Sampling

A simple random sample is a set of individuals selected at random from a population, so that every
individual has an equal chance of being placed into the sample. Random selection aims to produce a
sample that is an unbiased representation of the population. However, a simple random sample is less
advantageous when the members of the population vary widely, because a single random draw may
under-represent some groups.
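
As a minimal sketch of simple random sampling, the Python snippet below draws 10 members from a population of 100. The population of student ID numbers, the function name simple_random_sample, and the seed are illustrative assumptions, not part of the lesson.

```python
import random


def simple_random_sample(population, k, seed=None):
    """Draw k distinct members; every member is equally likely to be chosen."""
    rng = random.Random(seed)  # a fixed seed makes the draw repeatable
    return rng.sample(list(population), k)


# Hypothetical population: ID numbers of 100 students.
population = list(range(1, 101))
sample = simple_random_sample(population, 10, seed=42)
print(sample)
```

Sampling is done without replacement, so no individual appears twice in the sample.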

Stratified Random Sampling

Stratified random sampling is a method of sampling that involves dividing a population into smaller
groups called strata. The strata are formed based on the shared characteristics or attributes of their
members, and a random sample is then drawn from each stratum. The process of classifying the
population into groups is called stratification.
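
The stratification step and the per-stratum draw can be sketched in Python as follows. The student records, the program codes, and the choice of proportional allocation (the same fraction from every stratum) are assumptions made for illustration; other allocation rules are possible.

```python
import random
from collections import defaultdict


def stratified_sample(records, key, fraction, seed=None):
    """Group records into strata by key, then sample the same fraction of each."""
    rng = random.Random(seed)
    strata = defaultdict(list)
    for record in records:  # stratification: classify every record
        strata[key(record)].append(record)

    sample = []
    for members in strata.values():
        # Proportional allocation, with at least one member per stratum.
        k = max(1, round(len(members) * fraction))
        sample.extend(rng.sample(members, k))
    return sample


# Hypothetical population: (program, student number) pairs.
students = (
    [("CE", i) for i in range(60)]
    + [("EE", i) for i in range(30)]
    + [("ME", i) for i in range(10)]
)
chosen = stratified_sample(students, key=lambda s: s[0], fraction=0.1, seed=1)
print(chosen)
```

With a fraction of 0.1, the sample contains 6 CE, 3 EE, and 1 ME student, mirroring the makeup of the population.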

1.2 Planning and Conducting Surveys


Surveys are widely used in development research to provide quantitative data on a variety of topics.
Survey research is a branch of applied statistics that allows us to:
• Gather accurate and quantifiable information on a certain phenomenon and population;
• Answer research questions using rigorous statistical tools;
• Collect data that can be used to measure the impact of projects and other development
interventions.
Surveys are most suited to answering questions in the form of ‘what’, ‘how many’ and ‘how often’.
They can also answer ‘why’ questions, although it is advisable to combine these with qualitative
research in order to do so most effectively. Surveys are typically undertaken using data collection
techniques, such as questionnaires, to record structured and standardized responses to questions that
can then be analysed statistically.

1.3 Planning and Conducting Experiments: Introduction to Design of Experiments


Experiments play many roles in science. One important role is to test theories and to provide
the basis for scientific knowledge. An experiment can also hint at the structure or mathematical
form of a theory, and it can provide evidence for the existence of the entities involved in our theories.
The practical steps needed for planning and conducting an experiment include: recognizing the
goal of the experiment, choosing the factors, choosing the response, choosing the design, analyzing
the results, and then drawing conclusions.

EN MATH 4 – Engineering Data Analysis Page 1 of 7



Variables
Variables are any characteristics that can take on different values, such as height, age, species, or
exam score.

Independent Variable
The independent variable is the cause. Its value is independent of other variables in your study.

Dependent Variable
The dependent variable is the effect. Its value depends on changes in the independent variable.

Experimental Units
The experimental unit is "the smallest division of experimental material such that any two units
may receive different treatments in the actual experiment". For some experiments, the experimental
unit may be larger than the unit of observation or unit of randomization, and it often implies the
appropriate unit of analysis.
The experimental unit for a study refers to an object or entity expected to produce a change in
some outcome about which a researcher wishes to generalize. The experimental unit must account for
important assumptions, such as independence of errors required by analysis methods. The
experimental unit plays a large role in the design of a research study.

For example, if individual students were randomly assigned to receive a small-group intervention, the
small group would become the experimental unit because it is the smallest division in which students
can receive a different intervention.

Random Assignment
Random assignment is a procedure used in experiments to create multiple study groups that
include participants with similar characteristics so that the groups are equivalent at the beginning of the
study.

In experiments, researchers manipulate an independent variable to assess its effect on a
dependent variable, while controlling for other variables. To do so, they often use different levels of
an independent variable for different groups of participants.
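
Random assignment to groups that receive different levels of an independent variable can be sketched in Python as below. The participant labels, the three group names, and the function name randomly_assign are hypothetical; shuffling and then dealing participants out in turn is one simple way to obtain groups of equal size.

```python
import random


def randomly_assign(participants, groups, seed=None):
    """Shuffle the participants, then deal them out to the groups in turn."""
    rng = random.Random(seed)
    shuffled = list(participants)
    rng.shuffle(shuffled)  # chance alone decides the order

    assignment = {group: [] for group in groups}
    for i, participant in enumerate(shuffled):
        assignment[groups[i % len(groups)]].append(participant)
    return assignment


# Hypothetical study: 12 participants, three levels of the independent variable.
participants = [f"P{i:02d}" for i in range(1, 13)]
levels = ["control", "low dose", "high dose"]
result = randomly_assign(participants, levels, seed=7)
for level, members in result.items():
    print(level, members)
```

Because chance alone decides who lands in which group, participant characteristics tend to balance out across groups.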

THE MEAN, MEDIAN AND MODE

MEAN
The mean, often called the average, of a numerical set of data, is simply the sum of the data
values divided by the number of values. This is also referred to as the arithmetic mean. The mean is
the balance point of a distribution.


\[ \text{Mean} = \frac{\text{sum of the values}}{\text{the number of values}} \]
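
The mean formula can be checked with a short Python example. The five exam scores are made-up illustrative data, and the function name mean is an assumption of this sketch.

```python
def mean(values):
    """Arithmetic mean: the sum of the values divided by how many there are."""
    if not values:
        raise ValueError("mean of an empty data set is undefined")
    return sum(values) / len(values)


# Hypothetical exam scores.
scores = [70, 80, 85, 90, 100]
print(mean(scores))  # 425 / 5 = 85.0
```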


MEDIAN
The median is the number that falls in the middle position once the data has been organized.
Organized data means the numbers are arranged from smallest to largest or from largest to smallest.
The median for an odd number of data values is the value that divides the ordered data into two
halves. If n represents the number of data values and n is an odd number, then the median will be
found in the \(\frac{n+1}{2}\) position.
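
A minimal Python sketch of the median follows. The odd case uses the (n + 1)/2 position described above. The even case, averaging the two middle values, is the usual convention but is an assumption here because the lesson only covers odd n. The data values are illustrative.

```python
def median(values):
    """Middle value of the ordered data."""
    if not values:
        raise ValueError("median of an empty data set is undefined")
    ordered = sorted(values)  # organize from smallest to largest
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        # Position (n + 1) / 2 counting from 1 is index n // 2 counting from 0.
        return ordered[mid]
    # Assumed convention for even n: average the two middle values.
    return (ordered[mid - 1] + ordered[mid]) / 2


print(median([7, 3, 9, 1, 5]))  # ordered: 1 3 5 7 9, position 3 -> 5
print(median([1, 3, 5, 7]))     # (3 + 5) / 2 = 4.0
```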


MODE
The mode of a set of data is simply the value that appears most frequently in the set.
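
As a short Python sketch, the mode can be found by counting how often each value occurs. Returning a list is a design choice of this example (a data set may have more than one mode), and the sample data are made up.

```python
from collections import Counter


def modes(values):
    """Return every value that occurs with the highest frequency."""
    if not values:
        raise ValueError("mode of an empty data set is undefined")
    counts = Counter(values)
    highest = max(counts.values())
    return [value for value, count in counts.items() if count == highest]


print(modes([2, 4, 4, 5, 7, 7, 7, 9]))  # 7 appears three times -> [7]
print(modes([1, 1, 2, 2, 3]))           # two modes -> [1, 2]
```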
