RM File Deep Mehrotra Bba B& I 417
41714901818
TABLE OF CONTENTS
S.NO MODULE NAME DATE
1. INTRODUCTION TO SPSS
2. MANAGE DATA IN SPSS
3. CODING AND RECODING IN SPSS
4. SELECTING, SORTING AND ANALYSING DATA IN SPSS
5. MISSING VALUES, RECODING THE SAME VARIABLE
6. DESCRIPTIVE STATISTICS
7. CORRECTING DATA PROBLEMS
8. ONE WAY ANOVA
9. CHI - SQUARE TEST
10. REGRESSION
11. CORRELATION
12. ONE SAMPLE T TEST
13. INDEPENDENT SAMPLE T TEST
14. PAIRED SAMPLE T TEST
MODULE 1
INTRODUCTION TO SPSS
SPSS stands for "Statistical Package for the Social Sciences" and was first launched in 1968.
Since SPSS was acquired by IBM in 2009, it has officially been known as IBM SPSS Statistics, but
most users still refer to it simply as "SPSS".
SPSS is software for editing and analyzing all sorts of data. These data may come from
almost any source: scientific research, a customer database, Google Analytics or even the
server log files of a website.
SPSS can open all file formats that are commonly used for structured data, such as:
• Spreadsheets from MS Excel or OpenOffice
• Plain text files (.txt or .csv)
• Relational (SQL) databases
• Stata and SAS data files
Launching SPSS
To start SPSS go to
START>Programs>SPSS
A dialog box will open listing several options to choose from. The following options are
available.
▪ Run the tutorial
▪ Type in data
▪ Run an existing query
▪ Open an existing data source
▪ Open another type of file
DATA VIEW
VARIABLE VIEW
The first step is to open the Variable View window of the Data Editor and define the variables. Let
us consider an example where the employee data of an organization need to be analyzed. The
objective is to create a small data file for the employees that consists of six variables.
RECODE COMMAND
SPSS provides the Transform menu to convert one variable into another. A variable can be
converted using either of the following tools:
• Recode into Same Variables
• Recode into Different Variables
FOR EX: To create a new variable "gender_group", use the following steps:
• Go to Transform on the top menu. A drop-down menu will appear. Select
"Recode into Different Variables" to create the new variable from an existing one.
• Click on the GENDER variable to highlight it, then click the arrow
button to move it across to the "Numeric Variable -> Output Variable" box.
• Under Output Variable, give the variable a new name and label.
• Click on the Change button.
• Click the "Old and New Values" button and specify the new value for each old value.
Transform > Recode into Different Variables > new name and label > Change.
This will create a new variable whose data will appear in the Data Editor window.
VALUE LABEL
• After the recode command, a new variable "gender_group" is created.
• In the Variable View, click on the Values column corresponding to gender_group.
• Click on the dots and the "Value Labels" dialog box will open.
• Add each value and its label in the respective fields, e.g. value 1 represents MALE. Add
the other values similarly.
• Now click on "OK" to continue.
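The recode and value label steps above can be sketched outside SPSS as a simple mapping. This is a minimal Python illustration, not SPSS syntax. The sample records and the code 2 = FEMALE are assumptions made for the example; only "1 represents MALE" comes from the text.

```python
# Hypothetical sample data; the source does not list the actual records.
gender = ["M", "F", "F", "M", "F"]

# Coding scheme: 1 = MALE is from the text, 2 = FEMALE is an assumption.
recode_map = {"M": 1, "F": 2}
value_labels = {1: "MALE", 2: "FEMALE"}

# "Recode into different variables": build a new list, leave the old one unchanged.
gender_group = [recode_map[g] for g in gender]

# Value labels only change how the stored codes are displayed.
labelled = [value_labels[code] for code in gender_group]

print(gender_group)
print(labelled)
```

Keeping the original list untouched mirrors the reason for choosing "Recode into Different Variables" over "Recode into Same Variables": the source variable stays available for checking.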
FREQUENCY DISTRIBUTION
The Analyze menu is used to convert the data into a frequency distribution in SPSS.
FOR EX: To create a frequency distribution based on a variable, i.e. GENDER, these steps
should be followed.
• From the top menu, go to ANALYZE > DESCRIPTIVE STATISTICS > FREQUENCIES.
• Click on the variable, i.e. "GENDER", in the left column of the dialog box, move
it to the right-hand "Variables" box and press OK.
• For a cross-tabulation, go to ANALYZE > DESCRIPTIVE STATISTICS > CROSSTABS. In the
Crosstabs dialog box, add YEAR in ROWS and GENDER in COLUMNS and then click OK.
• Click on CASTE and YEAR to highlight them, use the arrow to put them into the "Variables"
box and then click OK.
• The descriptive statistics will appear as output in the Output window.
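What the Frequencies and Crosstabs procedures count can be illustrated with a short Python sketch. The sample values below are hypothetical and the layout of the printed table is only an approximation of the SPSS output.

```python
from collections import Counter

# Hypothetical sample data: one entry per student.
gender = ["MALE", "FEMALE", "FEMALE", "MALE", "FEMALE"]
year = [1, 1, 2, 2, 2]

# Frequency distribution: count and percentage of each GENDER value.
freq = Counter(gender)
total = sum(freq.values())
for value, count in freq.items():
    print(f"{value:<8}{count:>3}{100 * count / total:>8.1f}%")

# Cross-tabulation: count of each (YEAR, GENDER) combination.
crosstab = Counter(zip(year, gender))
for (y, g), n in sorted(crosstab.items()):
    print(y, g, n)
```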
GRAPHS
The Graphs menu is used to make a graph for any variable.
• To make a bar graph, go to GRAPHS > LEGACY DIALOGS > BAR.
• A new dialog box will appear on the screen. Select the appropriate option under
"Data in Chart Are":
✓ Summaries for groups of cases.
✓ Summaries of separate variables.
✓ Values of individual cases.
• After making the selection, click on the DEFINE button.
• Now transfer the variable to be represented in the bar graph from the left column to the
right column, along with the variable whose values should appear as category labels.
• Click OK to generate the graph.
• In the Output window, a graph will appear representing NAME OF STUDENTS on the
horizontal axis and GENDER OF STUDENTS on the vertical axis.
PIE CHART
• To insert a pie chart for GENDER, go to GRAPHS > LEGACY DIALOGS > PIE.
• Define the slices in the dialog box by selecting the variable.
• Click OK to generate the pie chart in the Output window.
MODULE 2
MANAGE DATA IN SPSS
FINDING OUT THE CASE SUMMARY
To understand the nature of the data, one can use case summaries.
The Output Viewer shows case summaries of final year marks based on caste.
COMPUTING NEW VARIABLE
• Go to Transform > Compute Variable.
• Type midterm as the target variable.
• Select Mean from "Functions and Special Variables".
• Add midterm 1 to midterm 5 to the numeric expression.
• Press OK.
• Click on "Old and New Values" and check "Output variables are strings".
• Label the ranges: "0 thru 49" as 'C', "50 thru 59" as 'B', "60 thru 75" as 'A'.
• Another column named Grade appears, showing grades on the basis of the final score.
• A "Summarize Cases" dialog box will appear. Select "marks in final" in the Variables
column and "gender" in the Grouping Variable column, and limit the cases to 120.
• Go to "Statistics". The "Summary Report: Statistics" dialog box will appear. Choose
mean, median etc. in the Cell Statistics column and click Continue.
• In the Output Viewer, the marks obtained in the final exams by males will appear.
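The compute and grading steps in this list can be sketched in Python as follows. The score values are hypothetical. The three ranges are the ones given in the text, and any score outside them is left ungraded, which is an assumption about how unlisted values should be handled.

```python
from statistics import mean

def grade(score):
    """Band a final score using the ranges listed in the text."""
    if 0 <= score <= 49:
        return "C"
    if 50 <= score <= 59:
        return "B"
    if 60 <= score <= 75:
        return "A"
    return None  # outside the listed ranges: left ungraded (assumption)

# Compute Variable: mean of midterm 1 to 5 for one student (hypothetical scores).
midterm_scores = [6, 7, 5, 8, 9]
midterm = mean(midterm_scores)

# Recode: turn final marks into letter grades (hypothetical marks).
final_marks = [45, 52, 68, 80]
grades = [grade(m) for m in final_marks]

print(midterm)
print(grades)
```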
SORT CASES
• Go to the Data tab > Sort Cases from the drop-down menu.
• Select "marks in final" as the variable to sort by.
• Select Descending under Sort Order and click OK.
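As a rough equivalent of the Sort Cases steps above, the following Python sketch orders a few hypothetical student records by final marks in descending order. The record layout and field names are illustrative only.

```python
# Hypothetical records; "final" stands for "marks in final".
students = [
    {"name": "A", "final": 62},
    {"name": "B", "final": 75},
    {"name": "C", "final": 48},
]

# Descending sort on the chosen variable; the original list is not modified.
sorted_students = sorted(students, key=lambda s: s["final"], reverse=True)

for s in sorted_students:
    print(s["name"], s["final"])
```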
MISSING VALUES
• Go to the Analyze tab > Descriptive Statistics > Frequencies from the drop-down menu.
• The "Frequencies" dialogue box will appear. Choose "marks in final exam" in the variable
column and then click on 'Statistics'.
• Choose Mean from the options in the "Frequencies: Statistics" dialogue box and click
Continue.
• Now click "OK".
• Go to the Transform tab > Recode into Same Variables from the
drop-down menu.
• The "Recode into Same Variables" dialog box will appear. Select '4-year resale value'
under Numeric Variables as shown.
• Click on the "Old and New Values" option.
• Select "System-missing" in the Old Value box and enter '60.134' in the New Value box. Then click
Continue as shown.
Then, in the Data Editor, all the missing values of the data will be replaced by a common mean,
i.e. 60.134, as shown.
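The two-stage procedure above (find the mean, then recode missing values to it) amounts to mean imputation. A minimal Python sketch is given below. The data are hypothetical, so the fill value here is not the 60.134 reported in the text, and None stands in for a system-missing value.

```python
from statistics import mean

# Hypothetical marks; None plays the role of a system-missing value.
marks = [55.0, None, 70.0, None, 61.0]

# Stage 1: mean of the observed (non-missing) values only.
observed = [m for m in marks if m is not None]
fill_value = round(mean(observed), 3)

# Stage 2: replace every missing value with that mean.
imputed = [fill_value if m is None else m for m in marks]

print(fill_value)
print(imputed)
```

Replacing missing values with the mean keeps the mean unchanged but reduces the variance of the variable, which is worth remembering when reading later output.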
SPLITTING FILE
• Go to the Data tab > Split File from the drop-down menu as shown.
• The "Split File" dialogue box will appear. Select "gender" in the "Groups
Based on" box and click "OK" as shown.
• In the Output Viewer, the data will appear in different splits based on "gender of students"
with their mean values as shown.
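Splitting a file by gender and reporting the mean per split is a group-by operation. The Python sketch below shows the idea on hypothetical (gender, marks) pairs.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical (gender, marks in final) pairs.
records = [("MALE", 60), ("FEMALE", 70), ("MALE", 50), ("FEMALE", 80)]

# Split: collect the marks separately for each gender.
groups = defaultdict(list)
for gender, marks in records:
    groups[gender].append(marks)

# Analyze each split on its own: here, the mean per group.
group_means = {gender: mean(values) for gender, values in groups.items()}

print(group_means)
```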
RECODING INTO SAME VARIABLE: OLD AND NEW
VARIABLE (SELECTING THE PARTICULAR BRAND NAME
AFTER SPLITTING THE FILE)
• Go to the Transform tab > Recode into Same Variables from
the drop-down menu.
• The "Recode into Same Variables" dialog box will appear. Select ‘score out of 10 in
midterm’ under Numeric Variables as shown.
• Press "OK".
• In the Data Editor, all the missing values of Ford will be replaced by a common
mean, i.e. 6.04, as shown.
MEAN VALUE OF FORD, ONCE MISSING VALUE IS
REMOVED
• Once the missing value is replaced by 6.04, go to the Transform tab > Recode into Same
Variables from the drop-down menu.
• The "Recode into Same Variables" dialog box will appear as shown.
• Choose the 'If' option from the dialogue box. A new dialog box named “Recode into Same
Variables: If Cases" will appear. Select the option “Include if case satisfies condition”, type
name = "liza" in the blank area and click "Continue" as shown.
In the Output Viewer, the mean, frequency, percentage and cumulative percentage are changed.
MODULE 6
DESCRIPTIVE STATISTICS
The Analyze menu is used to produce descriptive statistics in SPSS. These steps
should be followed:
1. Click on Analyze, select Descriptive Statistics from the drop-down menu, then select
Descriptives.
2. The 'Descriptives' dialogue box will appear. Move all the variables except Name into the
variable list.
3. Click on Options and select mean, standard deviation, kurtosis and skewness.
4. Now click Continue and then OK.
5. In the Output Viewer, the selected statistics are reported for each variable.
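The four statistics selected in step 3 can be computed by hand as in the Python sketch below. The data are hypothetical. The skewness and kurtosis formulas are the usual sample-adjusted estimators (kurtosis reported as excess kurtosis); it is assumed here, not confirmed by the text, that these match the values SPSS prints.

```python
from statistics import mean, stdev

def skewness(xs):
    """Sample-adjusted skewness; needs at least 3 values."""
    n, m, s = len(xs), mean(xs), stdev(xs)
    return n / ((n - 1) * (n - 2)) * sum(((x - m) / s) ** 3 for x in xs)

def kurtosis(xs):
    """Sample-adjusted excess kurtosis; needs at least 4 values."""
    n, m, s = len(xs), mean(xs), stdev(xs)
    fourth = sum(((x - m) / s) ** 4 for x in xs)
    return (n * (n + 1) / ((n - 1) * (n - 2) * (n - 3)) * fourth
            - 3 * (n - 1) ** 2 / ((n - 2) * (n - 3)))

scores = [1, 2, 3, 4, 5]  # hypothetical scores
print(mean(scores), stdev(scores), skewness(scores), kurtosis(scores))
```

A skewness near 0 indicates a roughly symmetric distribution, and a negative excess kurtosis indicates a flatter shape than the normal curve.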
MODULE 7
CORRECTING DATA PROBLEMS
EXPLORE
• Go to the Analyze tab > Descriptive Statistics > Explore from the drop-down menu.
• The "Explore" dialogue box will appear. Choose "marks in midterm 1" in the Dependent List
and then click on 'Statistics'.
• The "Explore: Statistics" dialogue box will appear.
• Select Descriptives.
• Click Continue.
• Now click on Plots. The "Explore: Plots" dialogue box will appear.
• Click None under Boxplots and Histogram under Descriptive.
MODULE 8
ONE WAY ANOVA
3. After this, the One-Way ANOVA dialogue box appears. The marks in
midterm 1 are put in the Dependent List and the year in school of the
students in the Factor box.
4. Once the dialogue box appears, click on Post Hoc.
5. The output of the post hoc test is shown below.
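The F statistic that the One-Way ANOVA procedure reports can be reproduced by hand. The Python sketch below uses hypothetical midterm 1 marks for three year groups. It computes only F and its degrees of freedom; the p-value and the post hoc comparisons are not covered because they need the F distribution and further procedures.

```python
from statistics import mean

def one_way_anova(groups):
    """Return (F, df_between, df_within) for a list of groups of values."""
    all_values = [x for g in groups for x in g]
    grand_mean = mean(all_values)
    # Variation of the group means around the grand mean.
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    # Variation of each value around its own group mean.
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)
    df_between = len(groups) - 1
    df_within = len(all_values) - len(groups)
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within

# Hypothetical marks in midterm 1, grouped by year in school.
year_groups = [[4, 5, 6], [6, 7, 8], [8, 9, 10]]
f_stat, df_between, df_within = one_way_anova(year_groups)
print(f_stat, df_between, df_within)
```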
MODULE 10
REGRESSION
1. After the data is entered, go to Graphs and select Chart Builder. The Chart
Builder dialogue box appears.
2. Select Scatter/Dot, then click OK. The output is as follows.
3. After this, select Analyze > Regression > Linear.
4. The Linear Regression dialogue box appears. Score in midterm 1 of the students is
taken as the dependent variable and year in school of the students as the independent variable.
The output is as below.
5. A significance of .000 (that is, p < .001) is below the usual .05 level, which means that the
relationship between the year in school of the students and the score obtained by them in
midterm 1 is statistically significant.
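The coefficients in the Linear Regression output come from an ordinary least squares fit, which can be sketched in Python as below. The data are hypothetical. The sketch returns the slope, intercept and R squared only; the significance value shown by SPSS needs the t or F distribution and is not computed here.

```python
from statistics import mean

def fit_line(x, y):
    """Least squares fit of y = intercept + slope * x."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    slope = sxy / sxx
    intercept = my - slope * mx
    r_squared = sxy ** 2 / (sxx * syy)
    return slope, intercept, r_squared

# Hypothetical data: year in school (independent) and midterm 1 score (dependent).
year = [1, 2, 3, 4, 5]
score = [2, 4, 5, 4, 5]
slope, intercept, r_squared = fit_line(year, score)
print(slope, intercept, r_squared)
```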
MODULE 11
CORRELATION
To determine the correlation between number of sessions and level of
satisfaction.
• In the Graphs menu, click on Scatter/Dot.
• Click on Simple Scatter, then click on Define.
• Move the dependent variable (MARKS IN FINAL) to the y-axis and the independent variable
(YEAR IN SCHOOL) to the x-axis. Click OK.
Then go to Analyze > Correlate > Bivariate. Move the two variables into the Variables box. Tick the
Pearson correlation coefficient. Then click OK.
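The Pearson correlation coefficient requested above is the covariance of the two variables scaled by their spreads. A minimal Python sketch on hypothetical data:

```python
from math import sqrt
from statistics import mean

def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length lists."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

# Hypothetical data: year in school and marks in final.
year_in_school = [1, 1, 2, 2, 3, 3]
marks_in_final = [50, 54, 60, 58, 70, 68]
r = pearson_r(year_in_school, marks_in_final)
print(round(r, 3))
```

The value of r lies between -1 and +1. Values close to either end indicate a strong linear relationship, and values near 0 indicate a weak one.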
OUTPUT
Objective: To compare the correlation coefficient for satisfaction among male and female
groups separately.
• Go to Data in the top menu, then select Split File.
• Tick "Compare groups".
MODULE 12
ONE SAMPLE T TEST
1. Click on Analyze, then Compare Means, and select One-Sample T Test.
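The t value reported by the One-Sample T Test compares the sample mean with a chosen test value. The Python sketch below uses hypothetical marks and a hypothetical test value of 50. It returns the t statistic and degrees of freedom only; the significance value needs the t distribution.

```python
from math import sqrt
from statistics import mean, stdev

def one_sample_t(xs, test_value):
    """Return (t, df) for testing whether the mean of xs equals test_value."""
    n = len(xs)
    standard_error = stdev(xs) / sqrt(n)
    t = (mean(xs) - test_value) / standard_error
    return t, n - 1

marks = [52, 58, 61, 49, 60]   # hypothetical marks
t_one, df_one = one_sample_t(marks, 50)  # 50 is a hypothetical test value
print(round(t_one, 3), df_one)
```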
MODULE 13
INDEPENDENT SAMPLE T TEST
1. Go to Analyze, click on Compare Means and then select Independent-Samples T Test.
2. The Independent-Samples T Test dialogue box appears, where final marks of students is the
test variable and gender of students is the grouping variable.
3. Now click on Define Groups and enter '1' for group 1 (female) and '2' for group 2
(male).
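The comparison of the two gender groups can be sketched as a pooled-variance t statistic, which corresponds to the "equal variances assumed" case. The marks below are hypothetical, and the choice of the pooled version rather than the unequal-variances version is an assumption made for the example.

```python
from math import sqrt
from statistics import mean, variance

def independent_t(group1, group2):
    """Return (t, df) using the pooled variance of the two groups."""
    n1, n2 = len(group1), len(group2)
    pooled_var = ((n1 - 1) * variance(group1)
                  + (n2 - 1) * variance(group2)) / (n1 + n2 - 2)
    standard_error = sqrt(pooled_var * (1 / n1 + 1 / n2))
    t = (mean(group1) - mean(group2)) / standard_error
    return t, n1 + n2 - 2

female_marks = [60, 65, 70]  # group 1 (hypothetical final marks)
male_marks = [50, 55, 60]    # group 2 (hypothetical final marks)
t_ind, df_ind = independent_t(female_marks, male_marks)
print(round(t_ind, 3), df_ind)
```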
MODULE 14
PAIRED SAMPLE T TEST
2. After clicking on the Paired-Samples T Test, the dialogue box appears.
3. Select the two pairs of variables: score in midterm 1 of the students and mean score of the
students as pair 1, and score in midterm 2 and midterm 3 as pair 2.
The significance in pair 1, of the score in midterm 1 of students and the mean score of students, is
.240. This is greater than .05, so the result for pair 1 is not statistically significant at the
5% level. The significance in pair 2, of score in midterm 3 and midterm 4, is 0.581, which is also
greater than .05, so the result for pair 2 is not statistically significant either.
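A paired t statistic is a one-sample t test on the differences within each pair. The Python sketch below shows this on hypothetical scores for two midterms taken by the same students. It gives t and the degrees of freedom only, not the significance values quoted above.

```python
from math import sqrt
from statistics import mean, stdev

def paired_t(first, second):
    """Return (t, df) for paired observations of equal length."""
    diffs = [a - b for a, b in zip(first, second)]
    n = len(diffs)
    standard_error = stdev(diffs) / sqrt(n)
    return mean(diffs) / standard_error, n - 1

# Hypothetical scores of the same five students in two midterms.
midterm_a = [6, 7, 5, 8, 9]
midterm_b = [7, 7, 6, 9, 8]
t_pair, df_pair = paired_t(midterm_a, midterm_b)
print(round(t_pair, 3), df_pair)
```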