Statistics
Statistics
Statistics
Statistics Theory
4) Analysis: Presented data are analyzed in this step. Methods used in analyzing the presented data are
numerous ranging from simple observation to highly mathematical techniques.
5) Interpretations: The last stage in statistical method is interpretation i.e. drawing conclusions from
the data collected & analyzed. The interpretation of data is a difficult task and requires a high degree
of skills and experience
Qus: What do you mean by data? (2014-1.a; 2011-1.b)
Data are considered as the raw materials of statistics.
Data are numerical measurement that are collected in scientific/ systematic way and are related to the
predetermined objectives.
Statistical observations are called data.
Qus: Data collection Techniques and Tools (2014-1.a; 2011-1.b)
Data collection Techniques
Data Collection Tools
1. Using available information
1. Checklist, data, compilation form
2. Observing
2. Eyes and ears, pen and paper, watch, tape or video
3. Interviewing
or recorder etc.
4. Administering written questionnaires
3. Interview schedule, checklist, questionnaire,
5. Organizing Focus Group Discussions
4. Tape recorder
5. Questionnaire
6. Discussion guide, tape recorder
Qus: Types of Data and Data Sources
Ans: In statistics, data are classified into two broad categories: quantitative data and qualitative data.
A. Quantitative data are those that can be quantified in definite units of measurement. Depending on
the nature of the variable observed for measurement, quantitative data can be further categorized as
continuous and discrete data.
1. Continuous data represent the numerical values of a continuous variable. A continuous variable
is the one that can assume any value between any two points on a line segment, thus representing
an interval of values. All characteristics such as weight, length, height, thickness, velocity,
temperature, tensile strength, etc., represent continuous variables.
2. Discrete data are the values assumed by a discrete variable. A discrete variable is the one whose
outcomes are measured in fixed numbers. The number of customers visiting a departmental store
every day, the incoming flights at an airport, and the defective items in a consignment received
for sale, are all examples of discrete data.
B. Qualitative data refer to qualitative characteristics of a subject or an object. A characteristic is
qualitative in nature when its observations are defined and noted in terms of the presence or absence
of a certain attribute in discrete numbers. These data are further classified as nominal and rank data.
1. Nominal data are the outcome of classification into two or more categories of items or units
comprising a sample or a population according to some quality characteristic. Classification of
students according to sex (as males and females), of workers according to skill (as skilled, semiskilled, and unskilled), and of employees according to the level of education (as matriculates,
undergraduates, and post-graduates), all result into nominal data. Given any such basis of
Md. Ali Reza
classification, it is always possible to assign each item to a particular class and make a summation
of items belonging to each class. The count data so obtained are called nominal data.
2. Rank data, on the other hand, are the result of assigning ranks to specify order in terms of the
integers 1, 2, 3... n. Ranks may be assigned according to the level of performance in a test. a
contest, a competition, an interview, or a show. The candidates appearing in an interview, for
example, may be assigned ranks in integers ranging from I to n, depending on their performance
in the interview. Ranks so assigned can be viewed as the continuous values of a variable
involving performance as the quality characteristic.
Data sources could be seen as of three types:
A. Secondary data: They already exist in some form: published or unpublished - in an identifiable
secondary source. They are, generally, available from published source(s), though not necessarily in
the form actually required.
B. Primary data: Those data which do not already exist in any form, and thus have to be collected for
the first time from the primary source(s). By their very nature, these data require fresh and first-time
collection covering the whole population or a sample drawn from it.
C. Internal Data: Internal data refer to the measurements that are the by-product of routine business
record keeping like accounting, finance, production, personnel, quality control, sales, R&D. Internal
data emerges in the process from payroll, sales amount from a sales journal & the units of finished
products produced from production records, cash receipts from cash records.
Types of Data classification based on basis:
A. Geographical area wise, Cities, districts etc
B. Chronological on basis of time
C. Qualitative according to some attributes
D. Quantitative in terms of some measurable quantity (magnitude)
5) Statistics & Natural Science: Statistical techniques have proved to be extremely useful in the study
of all natural sciences like biology, medicine, zoology & botany.
6) Statistics & Research: Statistics is indispensable in research work. Most of the advancement in
knowledge has taken place because of experiments conducted with the help of statistical methods. In
fact, there is hardly any research work today that one can find complete without statistical methods.
7) Statistics & Other uses: Besides the above, statistics are useful to various institutions such as
bankers, brokers, insurance companies, auditors, social workers, labour unions, trade associations &
chamber of commerce. Statistics are immensely useful to politicians & supporters.
8) Statistics & Personnel Management:
In personnel management there are frequent use of personnel statistics e.g. sex ratio, stillness
ratio, turnover rate, absenteeism rate, accident rate, I.Q. test etc are used in different branches of
personnel management, statistics are used in job evaluation, job analysis and wage classification,
fixation and changes.
It is also used in merit rating & other personnel evaluation for future man management &
personnel development in manpower planning.
In fact, the applications of statistics are so numerous that statistics today has risen from the science of
statecraft to the science of universal applicability. It is instrumental in enhancing human welfare & is such
a master key that enables to solve the problems of mankind almost in all fields.
Qus: Function of Statistics:
1. Definiteness: Numerical expressions are convincing & therefore, one of the most important
functions of statistics is to present general statements in a precise & definite form.
2. Condensation: Not only does statistics present facts in a definite form but it also helps in
condensing mass of data into a few significant figures.
3. Comparison: Unless figures are compared with others of the same kind they are often said to carry
any meaning. The statement that there has been an increase of per capita income from US$ 250 in
1985 to US$ 450 in 2005 definitely indicates the increasing trend in per capita income.
4. Prediction: Plans and policies of organizations are invariably formulated well in advance of the time
of their implementation. Knowledge of the future trends is very helpful in framing suitable policies
and plans. Statistical methods provide helpful means of forecasting future trends.
5. Formulating & testing Hypothesis: Statistical methods are extremely useful in formulating &
testing hypothesis & to develop new theories. For example hypothesis like whether students have
benefited from the extra coaching etc can be tested by appropriate statistical tools.
6. Formulation of policies: Statistics provide the basic material for framing suitable policies. For
example, it may be necessary to decide how much oil a nation should import in the next year.
Qus: Statistics in Business and Management
1) Marketing: Statistical analysis are frequently used in providing information for making decision in
the field of marketing it is necessary first to find out what can be sold and the to evolve suitable
strategy, so that the goods which to the ultimate consumer. A skill full analysis of data on production
purchasing power, man power, habits of compotators, habits of consumer, transportation cost should
be consider to take any attempt to establish a new market.
Md. Ali Reza
2) Production: In the field of production statistical data and method play a very important role. The
decision about what to produce? How to produce? When to produce? For whom to produce is based
largely on statistical analysis.
3) Finance: The financial organization discharging their finance function effectively depend very
heavily on statistical analysis of peat and tigers.
4) Banking: Banking institute have found if increasingly to establish research department within their
organization for the purpose of gathering and analysis information, not only regarding their own
business but also regarding general economic situation and every segment of business in which they
may have interest.
5) Investment: Statistics greatly assists investors in making clear and valued judgment in his investment
decision in selecting securities which are safe and have the best prospects of yielding a good income.
6) Purchase: The purchase department in discharging their function makes use of statistical data to
frame suitable purchase policies such as what to buy? What quantity to buy? What time to buy?
Where to buy? Whom to buy?
7) Accounting: Statistical data are also employer in accounting particularly in auditing function, the
technique of sampling and destination is frequently used.
8) Control: The management control process combines statistical and accounting method in making the
overall budget for the coming year including sales, materials, labour and other costs and net profits
and capital requirement.
Qus: Statistics and HRM (2012-1.a)
Ans: Now we will see why statistics is included in HRM.
1) Recruitment and placement: HR manager has to deal with job analysis, HR planning, recruitment,
employee testing, selection, and interviewing. For job analysis, information's are collected from job
related people through questionnaires and interview. For HR planning and recruiting new employees,
trend analysis is required to know the firms past employees and future estimated demand of staff. In
testing and selections, we can see relationship between test scores and job performance for a group
of people.
2) Training and Development: In this section, employees performance is measured and analyzed
through job related performance data such as productivity, absenteeism, grievance, waste, product
quality, and customer complain. All these are done with the help of statistical tools and technique.
3) Compensation: In this section, HR Manager has to collect data from competitive firms of the same
industry regarding compensation package and employee benefit. Here standard of living and price
index are analyzed by using statistical tools.
Finally we can conclude that statistics is inevitable in the field of HRM for employee relations, demand
forecasting, training needs assessment, salary survey, performance measurement, employee satisfaction
survey and training feedback. So we cannot imagine HRM without the help of statistical tools and
techniques.
Qus: Define Mean, Median, and Mode With Examples. Advantage and Disadvantages (2014-2.a; 2012-3; 2011-4.a)
Mean:
The sum of a set of data divided by the number of data. (Note :- the observations have to be
quantitative and not qualitative))
Median: The middle value or the mean of the middle two values, when the data is arranged in
numerical order. Think of a "median" being in the middle of a highway.
Mode:
The value (number) that appears the most. It is possible to have more than one mode, and it is
possible to have no mode. If there is no mode-write "no mode", do not write zero (0).
Mean
Data
Mean
Mode
Median
12.2
14
13
Advantages
Disadvantages
1. All the data is used to find the answer
1. Very large or very small numbers can
2. It is unique - there is only one answer.
distort the answer
3. Useful when comparing sets of data.
2. Affected by extreme values (outliers)
Qus: How does Mode help HR manager to measure the employee satisfaction of his/her
organization? (2013-3.a)
Ans: Mode contains the highest frequency in any data. Its applications are as follows:1. It is used to influx of the public transport.
2. The no. of games succeeded by any team of players.
3. The frequency of the need of infants.
4. Used to find the number of the mode is also seen in calculation of the wages, in the patients going to
the hospitals, the mode of travel etc.
Md. Ali Reza
Class Mark (Midpoint): The midpoint of each class. This is midway between the upper and lower class
limits. It is found by adding the upper and lower limits and dividing by two. It can also be found by
adding the upper and lower boundaries and dividing by two.
Qus: Why should we calculate standard deviation? (2014-4.a; 2011-4.b)
Ans: Standard Deviation (SD) is the measure of spread of the numbers in a set of data from its mean
value. Also called as SD, Standard Deviations symbol (sigma). This can also be said as a measure of
variability or volatility in the given set of data.
The standard deviation is a value used frequently in the social sciences and statistics, especially when
discussing data printed in research papers or journals. The standard deviation can be useful in determining
how to continue research or a course of action depending on how much variance exists in the data. For
example, a teacher who finds there is a large value for the standard deviation of test scores, indicating
there is great variance, may choose to adjust his teaching method to accommodate students of various
backgrounds and aptitudes. When test scores indicate there is little variance, represented by a small
standard deviation, and when they're consistently high, there may be little concern over how to instruct
the class or make up the lesson plans. There are two types of standard deviations: population standard
deviation and sample deviation.
Qus: What is Index Number? (2012-7)
Ans: Index numbers are statistical measures designed to show changes in a variable or group of related
variables with respect to time, geographic location or other characteristics such as income, profession, etc.
A collection of index numbers for different years, locations, etc., is sometimes called an index series.
1. Simple Index Number: A simple index number is a number that measures a relative change in a
single variable with respect to a base.
2. Composite Index Number: A composite index number is a number that measures an average relative
changes in a group of relative variables with respect to a base.
3. Price index Numbers: Price index numbers measure the relative changes in prices of a commodities
between two periods. Prices can be either retail or wholesale.
4. Quantity Index Numbers: These index numbers are considered to measure changes in the physical
quantity of goods produced, consumed or sold of an item or a group of items.
Qus: What do you mean by cost of living index? Describe the main problems for the construction of
index number (2014-6.a; 2013-6.a)
Ans: An inflationary indicator that measures the change in the cost of a fixed basket of products and
services, including housing, electricity, food, and transportation. The cost-of-living index is published
monthly. Also called cost-of-living index/Consumer Price Index (CPI)
Problems in the Construction of Index Numbers
1. The purpose of the index
2. Selection of a base period
a) The base period should be normal one
b) The base period should not be too distant in the past
c) Fixed base or chain base
3. Selection of number of items
4. Selection of source of data
5. Collection of data
6. Price quotations
7. Choice of an Average
8. Selection of appropriate weights
9. Selection of an appropriate formula
10. International Comparisons not Possible
Qus: Define probability of an event (2014-7.a)
Ans: An Event is a one or more outcome of an experiment. Example of Event-When a number cube is
rolled, 1, 2, 3, 4, 5, or 6 is a possible event. When you roll a number cube and toss a coin at the same
time, a possible event is a 3 and a tail respectively.
Probability Of An Event
The Number Of Ways Event A Can Occur
P(A) =
The total number Of Possible Outcomes
Qus: What is Regression Analysis? (2012-6)
Ans: In statistics, regression analysis is a statistical process for estimating the relationships among
variables. Statistical approach to forecasting change in a dependent variable (sales revenue, for example)
on the basis of change in one or more independent variables (population and income, for example). The
general form of each type of regression is:
Linear Regression: Y = a + bX + u
Multiple Regression: Y = a + b1X1 + b2X2 + B3X3 + ... + BtXt + u
Where:
Y= the variable that we are trying to predict
X= the variable that we are using to predict Y
a= the intercept
b= the slope
u= the regression residual.
Md. Ali Reza
It is also good to have a working knowledge of all of these kinds of samples. Some situations call for
something other than a simple random sample. We must be prepared to recognize these situations.
Qus:Variable and attribute [2015-7.a]
Ans: In science and research, attribute is a characteristic of an object (person, thing, etc.). Attributes are
closely related to variables. Variable is a logical set of attributes. Variables can "vary" - for example, be
high or low. How high, or how low, is determined by the value of the attribute (and in fact, an attribute
could be just the word "low" or "high"). (For example see: Binary option)
While an attribute is often intuitive, the variable is the operationalized way in which the attribute is
represented for further data processing. In data processing data are often represented by a combination of
items (objects organized in rows), and multiple variables (organized in columns).
Values of each variable statistically "vary" (or are distributed) across the variable's domain. Domain is a
set of all possible values that a variable is allowed to have. The values are ordered in a logical way and
must be defined for each variable. Domains can be bigger or smaller. The smallest possible domains have
those variables that can only have two values, also called binary (or dichotomous) variables. Bigger
domains have non-dichotomous variables and the ones with a higher level of measurement. (See also
domain of discourse.)