Statistics Preamble
Statistics The art and science of collecting, analyzing, presenting and interpreting data to make more
effective decisions. A collection of numerical information is also called statistics.
Boring The science of collecting, organizing, summarizing and presenting data, planning and
performing studies and experiments, analyzing results, interpreting results, drawing
conclusions, and presenting results.
Exciting Being able to discover something about a large population by only taking a relatively small sample
from that population.
Data The facts and figures that are collected, analyzed and interpreted. Data are collections of any
number of related observations.
Data Set All the data collected in a particular study.
Elements The entities (for example, a person, firm, state or country) on which data are collected.
Variable A characteristic of an element. For example, sales (variable) of General Motors
(element).
Population The set of all elements of interest in a particular study.
Why sample?
▪ Easy to collect
▪ Time and cost effective
▪ Convenient and manageable
▪ Availability of data
Descriptive Statistics Organizing, picturing and summarizing data from samples or populations. Tabular,
graphical and numerical methods used to summarize data.
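
As a minimal sketch of the tabular and numerical methods mentioned above, the following uses Python's standard `statistics` module. The `sales` values are hypothetical and chosen only for illustration.

```python
import statistics
from collections import Counter

# Hypothetical sample: monthly sales (in units) for ten stores.
sales = [12, 15, 15, 18, 20, 21, 21, 21, 25, 32]

# Numerical summary of the sample.
summary = {
    "n": len(sales),
    "mean": statistics.mean(sales),
    "median": statistics.median(sales),
    "mode": statistics.mode(sales),
    "stdev": statistics.stdev(sales),  # sample standard deviation (divides by n - 1)
    "min": min(sales),
    "max": max(sales),
}

# Tabular summary: how often each value occurs.
freq = Counter(sales)

for name, value in summary.items():
    print(f"{name:>6}: {value}")

# A crude text "graph" of the frequency table.
for value in sorted(freq):
    print(f"{value:>3} | {'*' * freq[value]}")
```

The same three views (table, picture, numbers) apply whether the data come from a sample or from a whole population.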
Inferential Statistics The process of using data obtained from a sample to make estimates and draw
conclusions about the characteristics of a population.
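
One way to picture inference is to simulate it. The sketch below assumes a made-up population of incomes (the mean of 50,000 and the spread of 12,000 are illustrative assumptions, not real data), draws a simple random sample of 400, and builds an approximate 95% confidence interval for the population mean from the sample alone.

```python
import random
import statistics

random.seed(42)  # fixed seed so the sketch is reproducible

# Hypothetical population: 100,000 incomes (illustrative values only).
population = [random.gauss(50_000, 12_000) for _ in range(100_000)]

# Draw a simple random sample of 400 elements without replacement.
sample = random.sample(population, 400)

n = len(sample)
sample_mean = statistics.mean(sample)
std_error = statistics.stdev(sample) / n ** 0.5

# Approximate 95% confidence interval using the normal critical value (about 1.96).
z = statistics.NormalDist().inv_cdf(0.975)
ci_low = sample_mean - z * std_error
ci_high = sample_mean + z * std_error

print(f"sample mean:     {sample_mean:.0f}")
print(f"95% CI:          ({ci_low:.0f}, {ci_high:.0f})")
print(f"population mean: {statistics.mean(population):.0f}")
```

In practice the population mean is unknown; it is printed here only because the population was simulated, which makes it possible to see how close the estimate is.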
Statistics as data (plural sense)
Statistical Data “By statistics we mean aggregates of facts affected to a marked extent by
multiplicity of causes, numerically expressed, enumerated or estimated according to
reasonable standards of accuracy, collected in a systematic manner for a
pre-determined purpose and placed in relation to each other.” – Horace Secrist
• Presentation
• Analysis
• Interpretations
Functions & limitations
Functions
• It presents facts in a definite form.
• It simplifies masses of figures.
• It facilitates comparison.
Limitations
• It is only a means.
• It can be misused.
▪ Misleading graphs (scale).
More Limitations
• Statistics usually produces probabilities; conclusions are provisional.
• The provisional conclusions have errors and error rates. At the commonly used 5% significance level,
about 5% of tests in which the null hypothesis is actually true will wrongly reject it.
• Statisticians are not in complete agreement on ideal methods.
• Statistical methods are based on assumptions which are seldom fully met.
• Data gathering is usually limited by ethical, practical and financial constraints.
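
The error-rate point can be checked by simulation. The sketch below is an illustration under stated assumptions: every test is run on data for which the null hypothesis is true (mean 0, known standard deviation 1), and a two-sided z-test at the 5% level is used. The constants `N_TESTS` and `SAMPLE_SIZE` are arbitrary choices.

```python
import random
import statistics

random.seed(0)  # fixed seed so the sketch is reproducible

ALPHA = 0.05       # significance level
N_TESTS = 2000     # number of simulated studies (arbitrary)
SAMPLE_SIZE = 30   # observations per study (arbitrary)

# Two-sided critical value for the chosen significance level.
z_crit = statistics.NormalDist().inv_cdf(1 - ALPHA / 2)


def rejects_true_null() -> bool:
    """Run one two-sided z-test on data where the null (mean 0, sd 1) is true."""
    sample = [random.gauss(0, 1) for _ in range(SAMPLE_SIZE)]
    z = statistics.mean(sample) * SAMPLE_SIZE ** 0.5  # known sd = 1
    return abs(z) > z_crit


false_rejections = sum(rejects_true_null() for _ in range(N_TESTS))
rate = false_rejections / N_TESTS
print(f"false rejection rate: {rate:.3f}")
```

The printed rate should land near 0.05. Note that this is the share of true null hypotheses wrongly rejected, not the share of all significant findings that are wrong.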
Should we use statistics?
• Used properly, statistics help in making prudent decisions; misused, they can do more harm than good.
• It depends on how you use statistics. The fault does not lie with the science of statistics as such.
ANALYZE
• Graph the data – we have a separate class on it
• Explore the data – are there outliers? What is the distribution? Is there missing data?
Is there non-response?
• Apply statistical methods – build confidence intervals, conduct hypothesis tests,
calculate probabilities etc.
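
The exploration questions above can be sketched in a few lines. The `incomes` list is hypothetical, `None` stands for a missing response, and the 1.5 × IQR fences are one common rule of thumb for flagging outliers rather than the only possible choice.

```python
import statistics

# Hypothetical survey responses (in thousands); None marks a missing value.
incomes = [32, 35, 36, None, 38, 40, 41, None, 43, 45, 120]

# Is there missing data? Separate observed values and measure the gap.
observed = [x for x in incomes if x is not None]
missing_share = (len(incomes) - len(observed)) / len(incomes)

# What is the distribution? Quartiles give a quick picture of centre and spread.
q1, q2, q3 = statistics.quantiles(observed, n=4)
iqr = q3 - q1

# Are there outliers? Flag values outside the 1.5 * IQR fences.
low_fence = q1 - 1.5 * iqr
high_fence = q3 + 1.5 * iqr
outliers = [x for x in observed if x < low_fence or x > high_fence]

print(f"missing share: {missing_share:.0%}")
print(f"quartiles: {q1}, {q2}, {q3}")
print(f"outliers: {outliers}")
```

A flagged value is a prompt to investigate, not an automatic reason to delete the observation.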
CONCLUDE
• Statistical significance
• Practical significance
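
The gap between the two kinds of significance shows up most clearly with very large samples. The sketch below uses hypothetical summary figures (two groups of one million, a difference of 0.1 on a scale with an assumed known standard deviation of 10) and a two-sided z-test.

```python
import statistics

# Hypothetical summary figures for two large groups (illustrative, not real data).
n_per_group = 1_000_000
mean_a, mean_b = 100.0, 100.1
sd = 10.0  # assumed common, known standard deviation

diff = mean_b - mean_a
std_error = sd * (2 / n_per_group) ** 0.5  # standard error of the difference in means
z = diff / std_error
p_value = 2 * (1 - statistics.NormalDist().cdf(abs(z)))

# Standardized difference (Cohen's d): how large the effect is in sd units.
effect_size = diff / sd

print(f"z = {z:.2f}, p-value = {p_value:.2e}")
print(f"effect size = {effect_size:.3f}")
```

The p-value is far below 0.05, so the difference is statistically significant, yet the effect is only about one hundredth of a standard deviation, which may be too small to matter in practice.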
Bias or error in statistical analysis
Loaded questions Questions worded in such a way that they are misleading or confusing, or that
steer the respondent toward a desired response.
Order of questions People tend to choose whichever option they hear first. The order of questions and
options should be randomized so that order is not a factor.
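
A minimal sketch of such randomization, assuming a single question with four hypothetical options: each respondent gets an independent random ordering, so over many respondents every option appears first about equally often.

```python
import random

# Hypothetical answer options for one survey question.
OPTIONS = ["Option A", "Option B", "Option C", "Option D"]


def options_for_respondent(rng: random.Random) -> list[str]:
    """Return the options in an independent random order for one respondent."""
    return rng.sample(OPTIONS, k=len(OPTIONS))


rng = random.Random(7)  # fixed seed so the sketch is reproducible

# Count how often each option is shown first across 4,000 simulated respondents.
first_counts = {opt: 0 for opt in OPTIONS}
for _ in range(4000):
    first_counts[options_for_respondent(rng)[0]] += 1

print(first_counts)
```

Randomizing does not remove the tendency to pick the first option; it spreads that tendency evenly so it does not favour any one answer.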
Missing data Sample data values can be missing because of random factors or special (systematic) factors.
For example, low-income people may decline to report their income.