Primary Data Collection Methods
Primary Data Collection Methods
Primary Data Collection Methods
While collecting the data, the researcher must identify the type of data to be
collected, source of data, and the method to be used to collect the data.
Also, the answers to the questions that who, when and where the data is to
be collected should be well addressed by the researcher.
Primary Data Collection Methods: The primary data are the first-hand
data, collected by the researcher for the first time and is original in nature.
The researcher collects the fresh data when the research problem is
unique, and no related research work is done by any other person. The
results of the research are more accurate when the data is collected
directly by the researcher but however it is costly and time-consuming.
Thus, the researcher can obtain data from either of the sources depending
on the nature of his study and the pursued research objective.
Definition: When the data are collected by someone else for a purpose
other than the researcher’s current project and has already undergone the
statistical analysis is called as Secondary Data.
The secondary data are readily available from the other sources and as
such, there are no specific collection methods. The researcher can obtain
data from the sources both internal and external to the organization. The
internal sources of secondary data are:
- Sales Report
- Financial Statements
- Customer details, like name, age, contact details, etc.
- Company information
- Reports and feedback from a dealer, retailer, and distributor
- Management information system
There are several external sources from where the secondary data can be
collected. These are:
- Government censuses, like the population census, agriculture
census, etc.
- Information from other government departments, like social
security, tax records, etc.
- Business journals
- Social Books
- Business magazines
- Libraries
- Internet, where wide knowledge about different areas is easily
available.
In-Person Interviews
Pros: In-depth and a high degree of confidence on the data
Cons: Time consuming, expensive and can be dismissed as anecdotal
Mail Surveys
Pros: Can reach anyone and everyone – no barrier
Cons: Expensive, data collection errors, lag time
Phone Surveys
Pros: High degree of confidence on the data collected, reach almost anyone
Cons: Expensive, cannot self-administer, need to hire an agency
Web/Online Surveys
Pros: Cheap, can self-administer, very low probability of data errors
Cons: Not all your customers might have an email address/be on the internet,
customers may be wary of divulging information online.
In-person interviews always are better, but the big drawback is the trap you
might fall into if you don’t do them regularly. It is expensive to regularly
conduct interviews and not conducting enough interviews might give you false
positives. Validating your research is almost as important as designing and
conducting it. We’ve seen many instances where after the research is
conducted – if the results do not match up with the “gut-feel” of upper
management, it has been dismissed off as anecdotal and a “one-time”
phenomenon. To avoid such traps, we strongly recommend that data-
collection be done on an “ongoing and regular” basis. This will help you in
comparing and analyzing the change in perceptions according to marketing
done for your products/services. The other issue here is sample size. To be
confident with your research you have to interview enough people to weed out
the fringe elements.
A couple of years ago there was quite a lot of discussion about online surveys
and their statistical validity. The fact that not every customer had internet
connectivity was one of the main concerns. Although some of the discussions
are still valid, the reach of the internet as a means of communication has
become vital in the majority of customer interactions. According to the
US Census Bureau, the number of households with computers has doubled
between 1997 and 2001.
5 useful methods of collecting primary data in statistics
1. Surveys: This method involves cost, as it cost money to design and implement
a survey. Also, data collected from surveys may contain lots of missing data,
data in incorrect format for example, someone could enter their age as ‘twenty
eight’ instead of 28, so lots of work is needed to preprocess, organize, clean,
and reshape data collected from surveys.
2. From websites: Sometimes you can scrape data from websites, lots of work
has to be done to clean, organize and reshape the data. However, some
websites contain data in a clean and structured format.
3. Purchase raw data from organizations or companies: This method is also
costly. But it saves time as sometimes data purchased from a company or
organization may already be in a structured format that can be used directly
for analysis without cleaning and reshaping the data.
4. Simulate data: This method is mostly used for stochastic processes. For
example, you can use Monte-Carlo simulation to simulate data that follows a
given probability distribution like Poison distribution or Normal distribution.
This method of generating raw data is free.
Data Collection Methods: What To
Know for Statistics
Introduction
When faced with a research problem, you need to collect, analyze and interpret data to
answer your research questions. Examples of research questions that could require you
to gather data include how many people will vote for a candidate, what is the best
product mix to use and how useful is a drug in curing a disease. The research problem
you explore informs the type of data you’ll collect and the data collection method you’ll
use. In this article, we will explore various types of data, methods of data collection and
advantages and disadvantages of each. After reading our review, you will have an
excellent understanding of when to use each of the data collection methods we discuss.
Types of Data
Quantitative Data
Data that is expressed in numbers and summarized using statistics to give meaningful
information is referred to as quantitative data. Examples of quantitative data we could
collect are heights, weights, or ages of students. If we obtain the mean of each set of
measurements, we have meaningful information about the average value for each of
those student characteristics.
Qualitative Data
When we use data for description without measurement, we call it qualitative data.
Examples of qualitative data are student attitudes towards school, attitudes towards
exam cheating and friendliness of students to teachers. Such data cannot be easily
summarized using statistics.
Primary Data
When we obtain data directly from individuals, objects or processes, we refer to it
as primary data. Quantitative or qualitative data can be collected using this approach.
Such data is usually collected solely for the research problem to you will study. Primary
data has several advantages. First, we tailor it to our specific research question, so
there are no customizations needed to make the data usable. Second, primary data is
reliable because you control how the data is collected and can monitor its quality. Third,
by collecting primary data, you spend your resources in collecting only required data.
Finally, primary data is proprietary, so you enjoy advantages over those who cannot
access the data.
Despite its advantages, primary data also has disadvantages of which you need to be
aware. The first problem with primary data is that it is costlier to acquire as compared to
secondary data. Obtaining primary data also requires more time as compared to
gathering secondary data.
Secondary Data
When you collect data after another researcher or agency that initially gathered it makes
it available, you are gathering secondary data. Examples of secondary data are
census data published by the US Census Bureau, stock prices data published by CNN
and salaries data published by the Bureau of Labor Statistics.
One advantage to using secondary data is that it will save you time and money,
although some data sets require you to pay for access. A second advantage is the
relative ease with which you can obtain it. You can easily access secondary data from
publications, government agencies, data aggregation websites and blogs. A third
advantage is that it eliminates effort duplication since you can identify existing data that
matches your needs instead of gather new data.
Despite the benefits it offers, secondary data has its shortcomings. One limitation is that
secondary data may not be complete. For it to meet your research needs, you may
need to enrich it with data from other sources. A second shortcoming is that you cannot
verify the accuracy of secondary data, or the data may be outdated. A third challenge
you face when using secondary data is that documentation may be incomplete or
missing. Therefore, you may not be aware of any problems that happened in data
collection which would otherwise influence its interpretation. Another challenge you may
face when you decide to use secondary data is that there may be copyright restrictions.
Now that we’ve explained the various types of data you can collect when conducting
research, we will proceed to look at methods used to collect primary and secondary
data.
Methods Employed in Primary Data Collection
When you decide to conduct original research, the data you gather can be quantitative
or qualitative. Generally, you collect quantitative data through sample surveys,
experiments and observational studies. You obtain qualitative data through focus
groups, in-depth interviews and case studies. We will discuss each of these data
collection methods below and examine their advantages and disadvantages.
Sample Surveys
A survey is a data collection method where you select a sample of respondents from a
large population in order to gather information about that population. The process of
identifying individuals from the population who you will interview is known as sampling.
To gather data through a survey, you construct a questionnaire to prompt information
from selected respondents. When creating a questionnaire, you should keep in mind
several key considerations. First, make sure the questions and choices are
unambiguous. Second, make sure the questionnaire will be completed within a
reasonable amount of time. Finally, make sure there are no typographical errors. To
check if there are any problems with your questionnaire, use it to interview a few people
before administering it to all respondents in your sample. We refer to this process as
pretesting.
Using a survey to collect data offers you several advantages. The main benefit is time
and cost savings because you only interview a sample, not the large population.
Another benefit is that when you select your sample correctly, you will obtain
information of acceptable accuracy. Additionally, surveys are adaptable and can be
used to collect data for governments, health care institutions, businesses and any other
environment where data is needed.
A major shortcoming of surveys occurs when you fail to select a sample correctly;
without an appropriate sample, the results will not accurately generalize the population.
Ways of Interviewing Respondents
Once you have selected your sample and developed your questionnaire, there are
several ways you can interview participants. Each approach has its advantages and
disadvantages.
In-person Interviewing
When you use this method, you meet with the respondents face to face and ask
questions. In-person interviewing offers several advantages. This technique has
excellent response rates and enables you to conduct interviews that take a longer
amount of time. Another benefit is you can ask follow-up questions to responses that
are not clear.
In-person interviews do have disadvantages of which you need to be aware. First, this
method is expensive and takes more time because of interviewer training, transport,
and remuneration. A second disadvantage is that some areas of a population, such as
neighborhoods prone to crime, cannot be accessed which may result in bias.
Telephone Interviewing
Using this technique, you call respondents over the phone and interview them. This
method offers the advantage of quickly collecting data, especially when used with
computer-assisted telephone interviewing. Another advantage is that collecting data via
telephone is cheaper than in-person interviewing.
One of the main limitations with telephone interviewing it’s hard to gain the trust of
respondents. Due to this reason, you may not get responses or may introduce bias.
Since phone interviews are generally kept short to reduce the possibility of upsetting
respondents, this method may also limit the amount of data you can collect.
Online Interviewing
With online interviewing, you send an email inviting respondents to participate in an
online survey. This technique is used widely because it is a low-cost way of interviewing
many respondents. Another benefit is anonymity; you can get sensitive responses that
participants would not feel comfortable providing with in-person interviewing.
When you use online interviewing, you face the disadvantage of not getting a
representative sample. You also cannot seek clarification on responses that are
unclear.
Mailed Questionnaire
When you use this interviewing method, you send a printed questionnaire to the postal
address of the respondent. The participants fill in the questionnaire and mail it back.
This interviewing method gives you the advantage of obtaining information that
respondents may be unwilling to give when interviewing in person.
The main limitation with mailed questionnaires is you are likely to get a low response
rate. Keep in mind that inaccuracy in mailing address, delays or loss of mail could also
affect the response rate. Additionally, mailed questionnaires cannot be used to interview
respondents with low literacy, and you cannot seek clarifications on responses.
Focus Groups
When you use a focus group as a data collection method, you identify a group of 6 to 10
people with similar characteristics. A moderator then guides a discussion to identify
attitudes and experiences of the group. The responses are captured by video recording,
voice recording or writing—this is the data you will analyze to answer your research
questions. Focus groups have the advantage of requiring fewer resources and time as
compared to interviewing individuals. Another advantage is that you can request
clarifications to unclear responses.
One disadvantage you face when using focus groups is that the sample selected may
not represent the population accurately. Furthermore, dominant participants can
influence the responses of others.
Summary
This article introduced you to the various types of data you can collect for research
purposes. We discussed quantitative, qualitative, primary and secondary data and
identified the advantages and disadvantages of each data type. We also reviewed
various data collection methods and examined their benefits and drawbacks. Having
read this article, you should be able to select the data collection method most
appropriate for your research question. Data is the evidence that you use to solve your
research problem. When you use the correct data collection method, you get the right
data to solve your problem.