Analytics
Analytics
Analytics
(PGDM- FT)
Area: Marketing
Course Title Marketing Analytics
Course Description:
In todays markets, the big story is BIG DATA. Technological advances (e.g., web,
communications, and devices) are expanding customer-to-firm and consumer-to-consumer contact
points, and enabling capture of rich and diverse transaction data at nearly every contact. Marketing
data stocks are exploding at a rate that defies Moores law (double every two years). Yet, on their
own, big data do not reveal big insights. Insights from big data require smart analytics.
Unfortunately, data stocks are increasing at a rate that is not matched by our capacity for smart
analysis and deep insights. The marketing analytics course is integrates theory (e.g., analytical
concepts) and practice (e.g., real life data based problems) with focus on skills for smart analytics.
It is meant for students who want to build analytics competency for managerial decision making,
and study applications of analytics for customer and marketing related problems in management.
Some of the typical application areas include consumer credit, new products, market segmentation,
and customer value.
Learning Objectives:
(a) Developing skills for analysing marketing data for making intelligent decisions about
marketing investments that create value and build competitive advantage.
(b) Build capabilities for marketing analytics using current concepts and tools.
Pedagogy:
The course will blend theoretical concepts with practical problems to provide a broad analytics base
for skill building. The course will also focus on informed use of marketing tools and on writing
syntax for those tools.
Every course day will involve Classroom Seminar and Lab/Tutorial sessions, delivered twice a
week on Tuesday and Fridays (on specified dates) from 10.30 AM to 1.15 PM. The class will begin
promptly at 10.30am.
deducted. During the lab/tutorial session, students will start work on their homework assignment for
the next class.
Evaluation:
There will be two exams and six assignments for the course, with the following weights and
schedule::
Components of Evaluation
Class Participation
Mid-Term
Assignments
Final
Total
Date
Weightage of Marks
(in percentage)
Note: No evaluation will be repeated. Missing an assignment deadline or a test will lead to the
student being awarded zero points for that evaluation.
Grading Notes:
Class Participation and Attendance. The following aspects will be considered in evaluating
students grade:
a. Preparedness and Attendance. Regular attendance and diligent preparation for the class is
indicative of student commitment to participate in a learning environment.
b. Active Engagement. Active participation in the class with thoughtful questions, sharing
alternative perspectives, and responding critically to class discussion
*Bonus Points for Presentation. You will earn bonus points if you are selected for presentation to
the class. To be invited for presentation, the student must perform significantly better than the
others in terms of (a) quality, (b) depth or (c) innovativeness demonstrated in the analytics
assignment. Bonus points could be upto 10% of the assignment grade.
4. Feedback on your draft submission will be provided in the Friday class to address common
questions and errors. Students are encouraged to bring questions, concerns and problems
they encountered in developing their initial report to the class for discussion.
5. Final assignments will be due on Sunday 9.00 a.m.. No extensions permitted (see Late
Submissions below).
6. Feedback on the final draft of your assignment will be provided usually before Tuesday
class.
7. Based on the quality and creativity of your submission, the instructor may ask you to make a
presentation based on your assignment in the Tuesday class with the following guidelines:
a. 10 minute presentation
b. Focus on points highlighted in the feedback. Revise and sharpen your presentation
accordingly.
c. Polished and professional presentation. Assume you are presenting to a client.
Late Submissions: Late submission will result in a 1.5-point penalty (out of 10). That penalty is
one and a half point for each day (or part thereof) that the submission is late. If a submission must
be late due to circumstances beyond your control, contact the instructor. At his discretion and
based on his assessment of the actual degree of uncontrollability of the situation, he may permit a
special arrangement. The most typical special arrangement is for students who must miss class due
to extreme circumstances. They are often permitted to submit the assignment early. It is extremely
rare for the instructor to permit an extension of the due date.
Redos: Students are encouraged to go back to fix problems identified on their graded assignments
as this ensures that students complete the learning loop. In some instances, a redo may earn you a
portion of the original points, but often it may not. Redos that show students effort in addressing
the problem on their own, the nature of the problem and timely submission are all factors that go
into this decision. Additional points for redos are at the instructors discretion. The extra credit
will be upto 1 point maximum (10% of grade) and usually lower. Redos are intended to promote
learning and not as opportunities for making up the grade.
Changes: The instructor reserves the right to make changes to any aspect of syllabus that, to his
judgment, are needed to achieve the learning objectives of the course.
Academic Integrity:
All students in this course are expected to adhere to IMTs standards of academic integrity.
Cheating, plagiarism, and other forms of academic dishonesty will not be tolerated. This includes,
but is not limited to, consulting with another person during an exam, turning in written work that
was prepared by someone other than you, and making minor modifications to the work of someone
else and turning it in as your own. Unless properly referenced or quoted, submitted work is
assumed to be original contribution of the student.
Ignorance will not be permitted as an excuse. If you are not sure whether something you plan to
submit would be considered either cheating or plagiarism, it is your responsibility to ask for
clarification.
All submitted material will be run through anti-plagiarism software by the APOs office. A high
level of plagiarism detected by the software will lead to a failing grade in the course.
Course Dataset for Homework Assignments
Datasets and structured problems would be provided to the students for most of the course
assignments. The problems/case studies and dataset are provided for the purposes of coursework
and are copyrighted materials. You are asked to delete the materials and all associated data files
upon completion of the course.
Review material covered in a first-level graduate statistics course. Online materials are
available, usually free, for this purpose (e.g., http://davidmlane.com/hyperstat/;
http://onlinestatbook.com/2/index.html).
Familiarize yourself with the SPSS statistical software. SPSS has a useful tutorial (under
the Help tab in the main data screen). Many online resources are available
(http://www.hmdc.harvard.edu/projects/SPSS_Tutorial/spsstut.shtml,
http://www.ats.ucla.edu/stat/spss/modules/default.htm), and you can download a free
guidebook by visiting http://www.spsstools.net/ (use the link under purposes).
30 minutes
Introduction to the
course
30 minutes
Conceptual
background
to
crosstabs and Chisquare test.
30 minutes
Tutorial:
Performing
Crosstabs and Chisquare test: Tutorial
30 minutes
Conceptual
background to factor
Analysis
30 minutes
Tutorial: Performing
Factor Analysis
Readings:
Chapters 2 and 3 of Text Book
30 minutes
Summary feedback on
assignments
submitted
after session I
45 minutes
Student queries based on the
feedback
75 minutes
Discussion on additional concepts and issues about
crosstabs, Chi-square test and factor Analysis
GUEST: Alumni Perspectives
Readings:
Chapters 2 and 3 of Text Book
30 minutes
Presentation of best student assignment
on
Crosstabs-Chi-square/Factor
Analysis
45 minutes
Conceptual
background
regression analysis
to
Tutorial:
analysis
75 minutes
Performing
regression
Readings:
Chapter 4 of the text-book
30 minutes
Summary feedback on draft submission
on predictive analytics
45 minutes
Addressing queries raised by
students on feedback
Readings:
Chapter 4 of the text-book
75 minutes
Discussion on advanced topics of
regression.
30 minutes
Presentation of best student
assignment
on
predictive
analytics
45 minutes
Introduction to big data, web
analytics
and
attribution
modeling
75 minutes
Tutorial: Practical applications
Interactive analytics
of
30 minutes
Summary feedback on
submission
draft
45 minutes
More on big data, web
analytics
and
attribution
modeling
75 minutes
GUEST: Anshuman Gupta, Head,
Data Science Program, Cognizant
60 minutes
Introduction to conjoint analysis
90 minutes
Tutorial: Performing conjoint analysis
Readings:
Chapter 6 of the text-book
30 minutes
Summary feedback on draft
submission of conjoint analysis
assignment
45 minutes
Addressing student queries based
on summary feedback
75 minutes
Discussion on advanced topics.
GUEST: Ujjyaini Mitra, Head
Analytics
and
Marketing
Intelligence, Bharti Airtel
Readings:
Chapter 6 of the text-book
Multi-Dimensional Scaling
Sessions 10 (150 minutes)
30 minutes
Presentation of best student
assignment on conjoint analysis
45 minutes
Conceptual background to multidimensional scaling
Readings:
Chapters 10 of Text Book.
Multi-Dimensional Scaling
75 minutes
Tutorial:
Performing
dimensional scaling
multi-
45 minutes
Addressing student queries based on
summary feedback
75 minutes
Discussion on advanced topics.
GUEST: TBD
Readings:
Chapters 10 of Text Book.
30 minutes
Presentation of best student
assignment on multi-dimensional
scaling
45 minutes
Introduction to segmentation using
cluster analysis
75 minutes
Tutorial: Performing cluster analysis
Readings:
Chapters 7 and 9 of Text Book
30 minutes
Summary
feedback
on
draft
submission of assignment on cluster
analysis
Readings:
Chapters 7 and 9 of Text Book
45 minutes
Addressing student queries based on
summary feedback
75 minutes
Discussion on advanced topics.
GUEST: TBD
Course Assignments:
Assignment #1: Data Mining Analytics
This assignment involves two mini-assignments. Each indicates a different approach to dataminingone deductive, driven by a priori thinking, and the other inductive, driven by exploring
without any a-priori constraints.
Mini-Assignment 1A: Inductive Data Mining
An insurance company received more than usual number of fraudulent claims in 2008, and the
company is interested in examining possible causes. Claim history, background characteristics
(e.g., income) and past behavior of insurers are potential explanations but the company has no apriori data or industry evidence to provide insight. Ultimately, the company seeks insights to help
reduce the number of frauds and see if they should make any changes in their insurance policies.
Data: Insurance_Claims.SAV
In developing your client presentation (as a consultant), consider the following issues:
1. Do the fraudulent and non-fraud claims differ by types of claim, town size, or amount of
coverage?
2. Explore if there is any variable(s) that might influence the relationship between fraudulent/nonfraud claims and amount of coverage.
3. Challenge: Identify other patterns in the data that can provide insights into fraudulent claims.
Integrate results from your explorations to provide a clear recommendation to the insurance
company. You may need to conduct additional 3-way or 4-way cross-tabulations.
*** Hint: You will need to recode Amount of coverage into 3 categories:
a. Low coverage: Lower than 107
b. Moderate coverage: From 107 to 358
c. High coverage: Higher than 358
Mini-Assignment 1B: Deductive Data Mining
Bencare Insurance Company, a long standing firm for consumer insurance products worldwide, is
recently facing stiff global competition that is chipping away at its loyal customer base. Insurance
industry is tough business where price competitiveness of insurance products is matched by
customer needs for trusting relationships with firms of strong reputation (see
http://www.ey.com/GL/en/Industries/Financial-Services/Insurance/ey-2014-global-customerinsurance-survey).
To commission a survey of its customers, Bencare conducts a series of internal discussions among
its customer facing teams to key variables that should be include in the survey. These discussions
converge on 5 key variables which are later refined and reworded for inclusion in the customer
survey (specific survey question in parentheses:
1. Customers Trust in the Company Representative (Rep17 to Rep 20),
2. Customers Trust in the Company (Prac17 to Prac20),
3. Customer Satisfaction with Company products (Inter1 to Inter3),
4. Customers Value (benefits relative to costs) Assessment (Val1 to Val6)
5. Customer Loyalty (Loy1 to Loy8).
Subsequent to data collection, Bencare is curious if their conceptions of customer variables align
with how customers view the company, its products and their relationship with it. Use factor
analysis to identify, refine (as needed) and validate these key variables. Develop your report for
client presentation with a strong and complete technical appendix that supports your conclusions.
Data: Bencaref_raw.sav
Survey: Bencare_survey_excerpts_from_full_version.pdf
In developing your client presentation (as a consultant), consider the following issues:
4. What decision choices in Factor Analysis procedure are appropriate for this problem and data?
5. Obtain, tabulate and interpret the evidence for the validity of Bencares conceptions of key
customer variables.
6. Challenge: Are these variables useful managerially? Do they capture variations in different
customer segments (based on demographics)? Think about this last question again.
Little solid evidence exists for either view as credible explanation of customer decision rules.
Potentially decision rules likely change by segments and categories.
To understand how customers make trade-offs between fashion and price, a full profile conjoint
study is designed with the following attributes:
1. Fashion: two levels (Modern versus Traditional)
2. Quality: two levels (Low versus High)
3. Price: two levels (Lower versus Higher)
400 randomly selected customers from target segment are invited to complete the conjoint task by
rating each profile on a 1-5 likelihood to buy scale (1 = very unlikely, 5 = very likely).
Develop a report based on high quality analytics for the fashion company client to help them
develop marketing programs that take advantage of your insights to gain competitive advantage.
Data: Rating_Fashion.Sav
In preparing your report consider the following:
1. Outline how you will tackle the marketing problem to yield meaningful insights that go
beyond obvious statements about customer trade-offs. What makes your approach
distinctive (from other consultants)?
2. Execute your approach. Show your analytics, summarize, tabulate and plot your findings,
and interpret your interesting insights. What evidence would you provide to demonstrate
that your results are robust and valid?
3. Do your results support common rules of consumer trade-offs? If so, do they support
conventional or contemporary view? If not, how do consumer trade-off rules vary?
4. Challenge: Are there clear customer segments that differ in their trade-off rules? To what
extent are age and gender meaningful in defining these segments? Think again about these
questions.
Area- Analytics
Course Title:
Course Description
Today the business organisations are is dealing with gigabytes of information due to the development of mathematical
and computer science technology. The human brain is not able to handle and examine more than a few dozens of
records manually at a time to solve a problem even from well-organized data. But the algorithms of predictive
modelling can analyse millions of data points with hundreds of variables to make a diagnosis or a prediction of the
stochastic variables. The objective of predictive modelling course is to provide an overview of statistical and
econometrics modelling techniques with the help of the most advance softwares such as SAS or R. Predictive
modelling is useful in the area of Finance, marketing, economics, healthcare etc.
The students of predictive modelling will gain practical experience of developing predictive models based on routinely
collected data using a standard statistical computer package such as SAS or R.
Course outcomes: After this course participants will be able to:
1.
2.
3.
4.
5.
6.
Identify the statistical models to be applied in the given situation. Understand the predictive modelling process
Prepare data for the purpose of predictive modelling
Appreciate the use and importance of multiple data sources, e.g. CMIE, Bloomberg etc.
Compare and contrast the underlying predictive modelling algorithms, e.g. Logistic Regression and
Classification Trees
Select appropriate predictive modelling approaches to identify those cases that are at high risk of an event
Apply these approaches using a suitable package such as SAS or R
Learning Objectives
(a)
(b)
(c)
(d)
To develop the analytical skills using statistical analysis using statistical softwares.
To develop smart presentation skills for persuasive communication across stakeholder relationships.
To critically analyse situations for forward thinking through innovation.
To demonstrate techno-savvy capabilities to manage integrated environments.
Teaching Pedagogy
The pedagogy will be a mix of lectures, critique of readings/articles, case discussion, assignments and data analysis
using SAS software. The course uses data and cases as the means to drive the contextual suitability of predictive
modeling. Besides the readings in the course pack, additional reading material will be distributed in the class from time
to time.
Course Requirements
i.
Prerequisite to this course is the learning derived from Business Statistics and Business Research
Methods courses.
ii.
Since much of the course material requires class discussion, it is important that you are prepared for the class
and present and defend your ideas. Preparation for class includes having read the assigned material.
iii.
The class will be divided into a number of groups and each group will have 4-5 students. For all projects and
assignments, each group will submit analysis report and power point presentation one day before scheduled
class.
iv.
Evaluation for this assignment will be done based on individual as well as group performance.
v.
Class participation will be based on the value you add to the class through your creating learning environment
in class by raising intelligent questions, relevant statements on questions raised by others, and maintaining
class decorum.
Evaluation
Students will be evaluated on the basis of case analysis, project, assignments, class participation, mid-term and endterm exam. The weightages given to each of these units is listed below:
Course Evaluation
Class Participation
Assignments (4)
Project report (1)
Quiz (best 1 out of 2)
End Term Exam
10
10
30
10
40
Project Evaluation
Specific
Method
Assessment
Group
Assessment
Methods
Group Project Proposal
Presentation
Group
Project
Presentation
Group Project Report
Submission
Group Case analysis
submission
Group Assignments
Individual Assessment
Methods
Mid Term Exam
End-term Exam
30%
30%
40%
15%
15%
30%
40%
Project proposal
Data Collection
Final Report along with data used (if applicable)
Presentation
The presentation should cover both the analysis of primary and secondary data, as applicable.
Each presentation will be of 10 minutes with 8 minutes given to the group to highlight the key findings and 2
minutes for open discussion in the class.
Additional assignments would be given during the course.
Introduction
predictive
analytics
20 minutes
20 minutes
20 minutes
Session 3 & 4: Linear Regression, Regression means going to the past. So what is it doing in the business?
Examples of different verticals requiring prediction, How does one determine factors affecting a business,
Difference between BI and Predictive modeling, Relating the data available with the business problem, How
to effectively determine a business problem and relate it to Analytics? Overview of Observation Window and
Performance Window
Linear
regression
20 minutes
20 minutes
of
20 minutes
Session 5 & 6: Regression Diagnostics: The Regression problem, Finding the best-fit straight line - OLS
Method, Sum of Square Errors, Base Line model, Co-efficient of Determination, Normality of residual,
Violations of the CLRM assumptions : Multicollinearity, Tolerance, VIF (to be discussed in Detail),
Homoscedasticity (White Test), Autocorrelation of residual (Durbin Watson Statistic), Correlation between
Independent variable and residual etc., Implications of Violation of BLUE properties, Overview of different
methods to alleviate Hateroscedasticity, Box Cox Transformation Residual Analysis, Its importance and
Implications: Cooks-D statistic, standardized residual values etc.
15 minutes
Problems
regression
analysis
Session 7 & 8: Capping and Missing Imputations: Introduction to imputation techniques, A simple case
study wherein the values of the variables are missing and needs imputation, Incorporating Categorical Data:
Dummy Variable Coding, Dummy Variable Trap, Interpretation of Dummy Variable Coefficients, Structural
Breaks and testing of Seasonal behaviour anomaly, Model Building Variable selection, Interpretation of the
model, Prediction and scoring new data Model Performance (Goodness of Fit) & Tracking R2, Adjusted R2,
Performance Tracking through Split Samples.
Session 9 & 10: Principal components analysis and remedies of regression problems: Unit root test, Principal
component analysis, KMO and Bartlett test, Varimax rotations, Eigen values, Factor loadings and factor
scores.
15 minutes
Remedies
of
regression problems,
unit root problem,
combining variables,
identifying
and
removing outliers
Sessions 9 and 10
20 minutes
Principal components
analysis and remedies
of
regression
problems: Unit root
test,
Principal
component analysis,
(150 minutes)
20 minutes
20 minutes
Session 11 & 12: Logistic Regression-1: The Classification problem, The joint probability prediction,
Sigmoid function, The logistic transformation, Likelihood function and Maximum Likelihood estimation,
Difference between OLS & MLE, Linear Probability Model, Variable selection, Interpretation of the model
through maximum likelihood estimates, Interpretation of the model through odds ratio estimates. Wald test,
Exponential Beta, Concordants and Sommer's D, Hosmer Lemeshow Test, Error/Confusion matrix, Receiver
operating characteristics
15 minutes
The
Classification
problem, The joint
probability
prediction, Sigmoid
function, The logistic
transformation,
Likelihood function
and
Maximum
Likelihood
estimation
.
Session 13 & 14: Univariate forecasting using ARIMA forecasting, AR and MA components, identification
stages, Forecasting, ARIMAX, integrated series, forecasting accuracy, model testing. Making static and
dynamic predictions.
Sessions 13 and 14 (150 minutes)
20 minutes
20 minutes
20 minutes
Identification stages,
Developed
forecasted
Forecasting,
Univariate
forecasting
using ARIMAX, integrated model testing. Making Case study
forecasting static and dynamic
ARIMA forecasting, series,
predictions.
AR
and
MA accuracy,
components,
15 minutes
Course Description:
Few Quick questions:
How many people knew Make interesting and meaning out of big Data before?
Do companies measure data? How many times? In TB? In Zeta/Zotta /Bronto Bytes? What are the
methods
What are knowledge knuggetts for firms? How do you measure and build intelligence in software?
Is BIG Data the next big thing? How it is different? What is Machine learning, MapReduce and
Hadoop?
What type of business rules Business analysts write? Are they relevant? What is a data scientist?
RBI, Shoppers Stop, LIC, ICICI, AIIMS, Airtel adopted BI why and how?
Where are Indian Analytics companies doing? Does the 1990 IT bubble is forming again in India in
analytics?
What is OLAP market? What opportunities exist? Have you heard of real time data warehouse, it is
required in big data? Why should we know data warehouse, when we read Big data?
Do you we do De-Normalization for using analytics to discover new information?
What is prescriptive, predictive, diagnostic and descriptive analytics?
What is the architecture for big data technology? Examples.
What is Python, PMML, ETL tools, ODS tools.
First and foremost and very important, the term analytics started from data bases and data mining. A decade ago,
data base specialists used to call BIG DATA AS VERY LARGE DATA BASES. You predict almost everything
if you have historical data, say starting from customer/employee churn analysis to life time value analysis, RFM,
retention techniques to CART or CHAID or conjoint analysis etc. So this course is a dissemination and
culmination of Data warehousing methods, data mining, BI, Business analytics and big data.
MOST IMPORTANTLY HANDS ON TRAINING ON STATISTICA WOULD BE GIVEN 6-8 HOURS
on model building.
Please note,
It is not theoretical
It is not about programming, dont bunk classes.
The business scenarios and problems look strange but interesting
You need to spend long topics/research discussions after class hours with me
I have to get my hands very dirty to make you business data analysts (!!??!!!)
The course is tough: assignment + exams + Research project;
Analytics- Promising in business, research and development
Big data Big money as said by McKinsey: many well paid positions (median $100k)
Business Intelligence (BI) and Data mining looks cool. For me, to you too !!!
Learning Objectives:
To exactly understand what Indian Analytical industries are currently working on BIG data, or
Hadoop big data. For example, companies like MuSigma, Nabler, Absolute Data, and
Manthan Systems are all the hot Indian Analytics cases that we study
This is a unique course which prepares you for the world of work in analytics in companies. Students
having opted for this course in the previous years have found placement offers from leading companies,
the designations include Data analyst, Business development Manager, and Business Analyst and
research analyst. The course gives you a blend of industry knowledge, concepts and experiential
learning through collaborative teaching by industry experts.
To develop an understanding of the nature of BI, BD, BA and its role in business decisionmaking.
To prepare, comprehend and analyze business cases demonstrating both technical and
analytical capabilities.
To develop a working knowledge of and to demonstrate analytical skills
Pedagogy:
The pedagogy will be a mix of lectures, experience sharing, real life case discussion, assignments and
industry/research based projects. The course is focused on strategic issues with cases as the primary vehicle
for learning. In addition to the reading materials, additional readings and cases will be distributed in the
class from time to time. Students are also expected to prepare and analyze all the cases as class participation
is very important. There are four main pillars in the course pedagogy, namely, (a) lectures-cum-PPTs to
share the conceptual frameworks; (b) experience sharing through collaborative teaching by industry experts;
(c) hands-on Statistica Data mining tools, computer-lab based; and (d) case studies of leading organizations
selected from Harvard Cases and other sources. The course requires a high degree of interactions in the class
on part of the students and feedback on their hands-on work is provided by industry experts.
Course Requirements:
vi.
vii.
viii.
ix.
x.
xi.
Since much of the course material requires class discussion, it is important that you are prepared for
the class and present and defend your ideas. Preparation for class includes having read the assigned
material.
You must have basic statistics and mathematics background. Basic databases knowledge is must.
We will use a number of cases in this course. Cases are a great educational experience but they
absolutely require that you be ready for the class. When cases are assigned, you should come to
class ready to discuss the issues of the case. For each case, you should: a) identify the major
problem, b) give your recommendation to resolve this problem (both short and long term), and when
appropriate d) provide supporting analysis (both qualitative and quantitative) for your
recommendations. Answering the questions provided for each case should help in this effort.
The class will be divided into a number of groups and each group will have 5 students. For all
cases, each group will submit case analysis report and power point presentation one day before
scheduled class.
There will be several home assignments assigned throughout the Term along with a number of
unannounced quizzes. Late homework will not be accepted and there will be no make-up quizzes.
Class participation will be based on the value you add to the class through your questions,
statement, and comments. It is the quality of these that is more important than the quantity.
Evaluation
Course duration: 20 sessions plus Statistica hands-on with Statistica certification
Evaluation
End Term
Project
PPT
:
:
:
35%
40%
25%
CAUTION: I use EPHORUS and Turnitin software for Plagiarism detection. All your project
submissions should come to me after verification plagiarism. IMTG has procured Turnitin, FYI.
Check with our IT /APO department.
SpecificAssessmentMethod
GroupProject
Weight
35%
5*
CasesStudyAssignments*
EndTermExamination
40%
25%
Course Requirements:
Building Data Mining Applications for CRM : Alex Berson, Stephen Smith, Kurt Thearling, TMH
Data Mining : Tools and Techniques by Han and Kamber
Too Big to ignore: Big Data : John Wiley Indian edition
All articles books by Davenport.
Purba Rao books on Analytics.
20 minutes
10 minutes
30 minutes
20 minutes
20 minutes
30 minutes
10 minutes
10 minutes
20 minutes
20 minutes
Sessions
1-2
20 minutes
40 minutes
Mining:
concepts
and
Techniques Pages 7-39
Book: Han and Kamber Data
Mining:
concepts
and
Techniques Pages105-49
5-6
7-8
data
Ref. my
materials
latest
Xeroxed
TEXTMINING TOOLS.
9-10
12-13
14-15
CRISP-DM Methodology
Cross-Industry Standard Process (CRISP) for Data Mining:
Introduction and applications
FACEBOOK Success through BI discovered: case
Guest lectures Computer lab Building Model : for marketing Basket
analysis
Using Statistica Software
Guest lectures Computer lab Building Model : for decision trees
Using Statistica Software
Guest lectures Computer lab Building Model : for predictive modeling
Using Statistica Software
Guest lectures Computer lab Building Model : for CHAID, Clustering
methods
Using Statistica Software
16-19
extra Project discussions, descriptions and presentations
sessions
for
presentations
Computer lab
Computer lab
Computer lab
Computer lab
.
As a business student when you join companies especially in BI arena, you need to sometimes unlearn some
thing which you have done theoretically in the class as there are several advanced tools emerging and your
company may just use that. For this sake, last 2 sessions would be :
BI Consultancy: consulting, training, development in Analytics.
Data Mart building
ROI of DWH, Size of DWH: need
PMML,CRISP-DM
Data de-normalization, ETL, Data cleaning tools, meta data tools.
Big data companies : career profiles understanding
The schedule for tests, submission of projects, presentations will be intimated time to time in as per rules of
IMT. You are free to contact, discuss and clear your doubts with me between office hours. I expect complete
discipline, dedication and sincerity from each student. My office phone number is 0120-3002219.
Group Project
The group project will be one of the most important learning tools of the course. Each group will comprise
of 5 students. Each team will develop a live project. This is a highly interactive real life project, which
requires a high degree of analysis and tangible recommendations. Your group is required to identify a
company as well as project.
Project will be based on topics given below. These topics are only suggestive. Students will go to the
various organizations depending on the need of project and gather the information. According they will plan
for development of project. It is expected that project will involve additional time outside of class and lab
time to complete.
Project Ideas
The dates for submitting the topics, phase wise progress report & final submission will be decided in the
class. The presentation time per group will be 15 minutes with not more than 7-10 slides. If the time exceeds
negative marks will be assigned. The marks will be awarded on group performance basis. The presentation &
code will be e-mailed to the professor in form of a write-up at the following e mail : [email protected]
within 48 hours of the presentation.
The deliverables of the project include:
5.
6.
7.
8.
Project proposal
Interim Report
Final Report
Presentation
3.
4.
5.
6.
7.
8.
9.
References
Appendix
Ground Rules
The Final Report format as mentioned in the heading Contents is not sacrosanct. It is subject to
change depending on the sector specific requirements that need to be incorporated and highlighted so
as to improve the understanding of BA/BI. Please use Summer Project Guidelines as Style Manual
for writing the report.
Grading of the report will be done on the strength of the analysis and explanation therein.
No graphs, which is basically a reproduction of the data provided, are to be included as part of the
report.
Please follow the guidelines of the summer project in terms of report submission.
Area- Finance/Analytics
Course Description
The importance of the treasury analytics is growing in business transactions due to
exposure of various kind of risk involved in it. Risk mitigation, exposure management and
the uses of hedging strategies are all now considered essential. Treasury managers must
be able to use and apply financial products in order to maximize profit along with
minimizing risk. With the ever-increasing range and complexity of available financial
instruments and derivatives, treasury analytics professionals must constantly update their
skills in order to effectively undertake their crucial duties. This course prepare students in
enhancing their skills to deal with key functions in foreign exchange management, risk
management, asset liability management and financing activities. This course includes the
discussion of various risks involves in treasury function and their management with the
help of case studies, practical data etc.
Learning Objectives
This course aims at providing the detailed application of various tools in finance to manage
the different types of risk associated with financial assets and a comprehensive
understanding of how to manage treasury affairs with a detailed look at the various
financial markets and instruments that can be traded.
After this course the participants will be able to:
1.
2.
3.
4.
Teaching Pedagogy
The pedagogy will be a mix of lectures, critique of readings/articles, case discussion,
assignments and data analysis using software. The course uses data and cases as the
means to drive the contextual suitability of treasury analytics. Besides the readings in the
course pack, additional reading material will be distributed in the class from time to time.
The 16 sessions of 1 hr 15 minutes each would be conducted.
Course Requirement
xii.
Prerequisite to this course is the learning derived from Derivatives and Security
Analysis and Portfolio Management courses.
xiii.
Since much of the course material requires class discussion, it is important that you
are prepared for the class and present and defend your ideas. Preparation for
class includes having read the assigned material.
xiv.
The class will be divided into a number of groups and each group will have 4-5
students. For all projects and assignments, each group will submit analysis report
and power point presentation one day before scheduled class.
xv.
xvi.
Class participation will be based on the value you add to the class through your
creating learning environment in class by raising intelligent questions, relevant
statements on questions raised by others, and maintaining class decorum.
Quiz (2)
Assignments (3)
Project report (3)
Mid Term Exam
End Term Exam
10%
10%
20%
10%
50%
Project Evaluation
Group
presentation
Group
Assessment
Methods
Group
Project
Proposal
Presentation
Group
Project
Presentation
Group
Project
Report
Submission
Group
Case
analysis
proposal
Specific
Assessment
Method
Project
30%
1.2
1.3
30%
30%
40%
15%
submission
Group
Assignments
Individual
Assessment
Methods
Mid Term Exam
End-term Exam
15%
30%
40%
20 minutes
Types of financial
risks, Market risk,
Introduction
to
treasury Credit risk, liquidity
risk, financial risk.
Analytics
Functions
of
Treasurer
and
Controller.
20 minutes
Quantification
of
different type of risk,
Participants
in
Treasury Function
20 minutes
Application of
analytics
in
risk
management,
Problem
solving
Session 2 and 3: Market Risk Value At Risk: Methods of calculating VaR Historical,
Parametric VAR, Simulation, Monte Carlo Simulation, Bootstrap VAR, Liquidity VAR, Credit
VAR
40 minutes
Liquidity VaR,
Credit
VaR,
Portfolio VaR,
RAROC
Session 4 and 5: Valuations of Equity and Equity Index Options using Black Scholes
Model, Binomial Tree Method and Taylors Series Method. KMV Model, Merton Model,
Real options.
Black
Scholes
Model
40 minutes
40 minutes
40 minutes
Model
Estimating Probability KMV
of default using Black Merton Model
Scholes model
and Business
Valuations using
Black Scholes
Model
Session 6 and 7: Option Greeks, Developing immunized portfolio using option Greeks
Sessions 6 and 7 (150 minutes)
30 minutes
40 minutes
Introduction
to
option Features
Greeks
Greeks
of
40 minutes
Portfolio
immunization
Option option Greeks
40 minutes
Exercise
using
Session 8 and 9: : Introduction to the Credit Derivatives Market: . CLO, CBO, CDO etc,
SPVs, CDS, Players in the CDS market, Interpretation of CDS Spreads/Index, Valuations
of CDS, Total Return Swap
Sessions 8 and 9 (150 minutes)
30 minutes
40 minutes
Credit
derivative CDS valuation
market, CDS
40 minutes
40 minutes
Copula
Functions
Session 10 & 11: Multi Period options, caps, floors, captions, swaptions
Sessions 10 and 11 (150 minutes)
30 minutes
Introduction to
multi
period
options
40 minutes
Caps and Floors
40 minutes
Captions
40 minutes
Swaptions
Session 12 and 13: Evolution of risk measures- Standard Deviation, Semi deviation, VaR,
CVaR, etc, Risk modeling using performance analytics package in R.
Sessions 12 and 13 (150 minutes)
30 minutes
Introduction to R
software
40 minutes
40 minutes
Risk Modelling in R
Performance
package
analytics
40 minutes
Altman z score,
Practice
Session 14 & 15: Simulating Financial Asset Price, Arithmetic and Geometric Brownian
motion, Introduction to Martingales.
Sessions 14 and 15 (150 minutes)
30 minutes
40 minutes
Simulations
Asset Prices
40 minutes
40 minutes
Martingales
The presentation should cover both the analysis of primary and secondary data, as
applicable.
Each presentation will be of 10 minutes with 8 minutes given to the group to
highlight the key findings and 2 minutes for open discussion in the class.
Additional assignments would be given during the course.
Financial Econometrics
Teaching Pedagogy
The pedagogy will be a blend of lectures, critical articles, case discussion and
assignments. A well formed course pack will be given. The course employs chapters of
the course pack as the means to link the theory with practices. Every theoretical session
will be followed by hands-on session to appreciate the concepts by applying them. Data
sets for the hands-on exercises are to be downloaded from the folder FE folder within the
Hands-Out available in the common server.
Basic Text Book
Basic Econometrics by Damodar N. Gujrati & Sangeetha
Recommended Books
Introduction to Econometrics by G S Maddala
Evaluation Criteria
Quiz
End Term
Final project Presentation
Class participation
20%
50%
20%
10%
Assignment Evaluation
Final assignment is to be done and presented in a group of 4-5 members. The group should
be formed after the second session. Each group has to select a
commodity/index/stock/currency etc. which may be given by the instructor. Each group is
expected to present all the concepts and analysis discussed in the class. Assignment
presentation would be evaluated based on the concepts and coverage, depth of analysis,
presentation skill and question & answer.
Class Participation
As this course is centred around both theory and hands-on, so active classroom
participation is expected from every student. Those who will present would definitely be
categorised as active class participation.
Evaluation Type
Weightage
Group Project
Group project
End term
20%
50%
Programme
Objective-3.1
Programme
Objective-3.2
ii.
The
rationale
for
choosing
commodity/stock/currency/index etc. and the country should be justified.
iii.
the
iv.
Session Plan
Session 1 & 2: Overview of Classical Linear Regression Model (CLRM) and Hands-on
exercise on data sets
Session 1 & 2 (150 minutes)
30 minutes
30 minutes
15 minutes
75 minutes
How
CLRM
is Hands on experience
What
is Different
characteristics performed?
with Eviews & R
classical
of CLRM
Linear
Regression
Model?
Session 3 & 4: Further Developments and Diagnostic Tests of CLRM (Programme obj3.1) and Hands-on exercise on the data sets
Session 3 & 4 (150 minutes)
30 minutes
15 minutes
30 minutes
75 minutes
What
are Importance of Various
necessary Hands on experience
further
diagnostic
diagnostic checks after with Eviews & R
developments checks
performing CLRM
of CLRM?
Session 5 & 6: Problem of Heteroscedasticity (when the error variance is non-constant) &
Autocorrelation (when the error terms are correlated) (Programme obj-3.1) and Hands-on
exercise on the data set (Programme obj-3.1)
30 minutes
What
is
heteroscedasticity?
Why it has to be
taken care of?
Session 7 & 8: Concepts of Time Series Tests of stationarity, unit root, Univariate Time
series modelling (Programme obj-3.1) and Hands-on exercise on the data sets
(Programme obj-3.2)
30 minutes
What is time
series?
What
are
different
components
of a time
series?
Session 9 & 10: Modelling Volatility (Programme obj-3.1) and Hands-on exercise on the
data sets (Programme obj-3.2)
Session 9 & 10 (150 minutes)
30 minutes
30 minutes
15 minutes
75 minutes
Hands on experience
What
is Discussion on Diagnostic checks
with Eviews & R
various
volatility?
What
is volatility
modelling
conditional
volatility?
Session 11 & 12: Multivariate Time Series Modelling- Vector Auto Regression (VAR), and
Hands-on exercise on the data sets (Programme obj-3.2)
Session 11 & 12 (150 minutes)
30 minutes
30 minutes
15 minutes
75 minutes
Test
of Diagnostic checks
Hands on experience
Discussion
with Eviews & R
on multivatite causality
time
series
modelling
(VAR)
Lag
length
selection
Session 13 & 14: Co-integration, Error Correction Modelling (ECM). (Programme obj-3.1)
and Hands-on exercise on the data sets (Programme obj-3.2)
Session 13 & 14 (150 minutes)
30 minutes
30 minutes
15 minutes
75 minutes
Error correction Diagnostic checking on Hands on experience
Discussion
models formulated
with Eviews & R
on long term modelling
relationship
(cointegration)
Error
correction
modelling
Session 15 & 16: Course recap, final and project presentation, and feed back
Area: Operations
Course Title Operations Analytics
Course Description:
This course is aimed at improving the students skills in solving Operations and supply chain
problems through development of modeling and analytical skills
The primary goal of this course is to introduce the concepts, modeling, and dynamics of Supply
Chains and their design considerations. The specific objectives are as follows:
1. To introduce the concepts of Supply Chain Dynamics and its impact on Design of SCs
2. To introduce techniques to model the supply chain mathematically
3. To introduce the appropriate mathematical techniques to analyze it
4. To introduce the principles on which to design suitable system controls appropriate for the
situation at hand
5. To provide a deeper understanding of the underlying dynamic factors affecting SC operation.
The above modeling and analytical skills will prove useful and effective in the design of supply
chain systems, as well as their operation
Learning Objectives:
Learning Outcomes:
At the end of the course, the students would be able to:
1. Conceptualize a dynamic supply chain and identify factors in a dynamic environment that
affect it.
2. Model the supply chain mathematically in a dynamic environment.
3. Apply appropriate mathematical techniques to analyze it
4. Design suitable system controls appropriate to the objectives of the chain.
5. Obtain a deeper understanding of the dynamics and factors affecting the supply chain.
Pedagogy:
Pedagogy:TheMathematicaltechniquesusedformodeling,analysisanddesignofSupplyChains
inadynamicenvironmentwillbethefocusofthiscourse.TheoryandClassroomlectureswillbe
themainstayforthecourse.Therewillbeseveralgroupassignments,whichwillservetoelucidate
thetheoreticaldiscussions.Atermpaperisalsoenvisaged,ifpossible,whichwouldhelpthe
studenttoprobedeeperintothetopic.
Course Requirements:
Note: There is no text-book available for the latter Applications part of this course, and hence
students are advised to take down notes in class. Course notes will not be supplied, rather
students are required to take down their own notes in class to supplement their
understanding.
The course will be mathematical and a number of mathematical techniques will be developed
sequentially. Hence students are requested not to miss any class whatsoever to maintain their
learning continuity
Evaluation
Evaluation Scheme:
Sr. No.
1
2
Components
Weightages (in %)
20%
Mid-term Exam
Assignments
20%
3
20%
EndTerm
Examination
40 %
Assignments
A number of group assignments are envisaged to elucidate the mathematical concepts
covered in class
Topic
Motivation,DynamicSupplyChain
examples,andControlPrinciples:
Reference material
Classnotes
4,5
6,7,8
913/14
AnaircraftMaintenancehangarexample,
increasinginventoryfluctuationsalongthe
chain,andincreasingdisturbance
propagationbackwardsinthechain.Likely
Bullwhipeffect,Control/mitigationofthe
bullwhipeffect,controlobjectives:basedon
ResponsivenessandEfficiency
SupplyChainStrategiesandtheir
PerformanceMetrics:Twobasicstrategies:
ResponsivenessversusEfficiency,flexibility,
theirperformancemetrics,measurement,
implications,andoperationalsignificance
VarioustypesofControlsforSCs:Inventory
triggeredanddemandtriggeredcontrols,
proportional,proportionalintegral,and
proportionalintegralderivativetypeof
schemes,movingaveragetypeofschemes,
andtheirbasiccharacteristics
IntrotoLinearDifferenceEquations;
historicaldevelopmentandoccurrenceof
differenceequations,examplesfromclassical
mathematics,examplesformdiversefields:
physics,engineering,biology,etc.,examples
inmanagementdisciplines,relationshipwith
andoccurrenceinotherareasof
mathematicsandmathematicalmodeling,
combinatorialproblems,recurrencerelations
andinduction,closerelationshipwith
differentialequations
DifferenceCalculusanditsdevelopment:
Thedeltaoperator,theshiftoperatorE,
operatormechanics,differentiationcalculus,
deltacalculusofsimpleandcommon
functions,thefallingfactorialnotationandits
centralroleindifferencecalculus,product
rule,quotientrule,parallelwithdifferential
calculus,thesummationoperator,asthe
inverseofthedeltaoperator,summation
rules,summationbyparts,theconstantof
summation(akintotheconstantof
integrationincontinuousvariablecalculus),
indefiniteanddefinitesummation
SolutionMethodsforLDEs:
LinearDifferenceEquationsandcanonical
forms,firstorderequations,homogeneous
andnonhomogeneousequations,second
andhigherorderequations,InitialConditions
anduniquenessofsolution,shigherorder
LDEswithconstantcoefficients,
homogeneoussolutionandparticular
do
do
Ch1
Ch2
Ch3
14/1518
19
20
integral,theannihilatormethod,factoring
theLHSOperator,formofthesolution,
linearlyindependentsolutions,the
Casoratianmatrix,repeatedroots,complex
roots,specialmethodsforLDEs,series
expansionmethods,thegeneratingfunction
ApplicationsandWarehousesystem
Models:
Designparametersanddisturbancetypes,
TheP(I),P(D)andP(ID)schemesandtheir
characteristics,controlofoffsetand
permanentdepletionofinventory,thePI(I)
scheme,thePID(I)scheme,theirdesign
parametersandfluctuationlevels,methods
forcontroloffluctuationsandrapid
restorationtonormalcy,theMA(ID)scheme,
comparativeperformanceanddesign
parametersettings
TermpaperPresentations
Review,Wrapup,andscopeforfurther
reading
ClassNotes
Sessions 3 & 4
Topic
How and Why HR Analytics is
Strategic.
Linking
HR
Analytics
to
Business
Performance
Sessions 5 & 6
Topic
Foundations of Measurement
Readings/Cases
3) Making HR Measurement Strategic
4) Toward Analytics and Prediction
Readings/Cases
5) Analytics Foundations of HR Measurement
6) Big Data and Talent Management: Using Hard
Data to make the Soft Stuff easy
Sessions 7 & 8
Topic
HR Metrics for Effectiveness
Evaluation
Readings/Cases
7) Predictive Analytics in Action
8) The Analytics Era
Sessions 9 & 10
Topic
Tools and Techniques for
Analytics
Sessions 11 & 12
Topic
Metrics to Optimize HR Cost
Reduction
Readings/Cases
10) Apply Business Judgement to Data Science
11) Drive End-to-User Ownership of Talent
Analytics
Readings/Cases
12) Using HR Metrics to identify opportunities for
Cost Reduction
13) Business Driven HR Analytics Case of Novo
Nordisk
Sessions 13 & 14
Topic
Evidence Based
Support System
Decision
Readings/Cases
14) Implication Based Decision Support Case of
Seagate
15) Analytics Prioritization Principles Case of
Gap Inc.
Sessions 15 & 16
Students Presentations
Course Pack
A Compilation of selection readings, cases, etc will be given to the course participants.
Evaluation Scheme
Class Participation
Group Assignment
Individual Assignment
Case Discussion
End Term
:10%
:20%
:10%
:10%
: 40%
References Books:
Jac Fitz-Enz- The New HR Analytics- Predicting the Economic Value of Your Companys
Human Capital Investment-, American Management Association.
Becker, B., Huselid, M, and Ulrich, D. The HR Scorecard: Linking People, Strategy, and
Performance. HBS Press, 2001.
Cascio, W & John Boudreau- Investing in People: Financial Impact of Human Resource.
Robert S. Kaplan and David P. Norton, The Balanced Scorecard: Translating Strategy into
Action (Boston, MA: Harvard Business School Press, 1996).
Area- Analytics
Course Title:
Course Description
1. Introduction
Quantitative models in marketing plays a crucial role in providing decision makers with timely and objective
information to help them make informed decision in many areas. QMM links the consumer, customer and the
public to the marketer through information. The information is used to identify marketing opportunities and
problems (market demand determination, market segments identification, product/ service use studies,
competitive analysis), in generating, refining and evaluating potential marketing actions (proposed
marketing-mix evaluation testing, concept tests of proposed new products or services, new-product
prototype testing, pricing tests, advertising pretesting, in-store promotion effectiveness studies, distribution
effectiveness studies), in monitoring marketing performance (image analysis, customer-satisfaction and
distributor-satisfaction studies, web site evaluations) and improving marketing as a process (consumer
behaviour differences in e-business transactions, Predictors of new product success, The impact of longterm advertising on consumer choice). Thus, QMM is highly indispensable in generating the information
necessary for assisting in the strategic decision making.
The course of QMM is designed with a special emphasis on multivariate data analysis which is the lifeblood
of marketing research. Students taking the course will gain an expertise in different multivariate data analysis
tools like discriminant analysis for classification and prediction, cluster analysis for market segmentation,
conjoint analysis for product design and multidimensional scaling for brand positioning etc. They will also
gain hands-on experience in different statistical software like SPSS and AMOS.
Pre-requisites to take this course include successful completion of Business Research Methods with an
expertise in the statistical tools and techniques. The students should also have clarity of marketing
management and should possess sound analytical skills.
Course outcomes: After successful completion of the course, the students should be able to:
Understand the role of quantitative models in marketing research in taking better decisions.
Apply the research skills in a team in solving marketing problems.
Decide on the appropriate statistical tool to be used.
Analyse the collected data with the statistical tool.
Interpret the output of the analysis generated by the software.
Teaching Pedagogy
The pedagogy will be a mix of lectures, critique of readings/articles, case discussion, assignments and data
analysis using SAS software. The course uses data and cases as the means to drive the contextual
suitability of predictive modeling. Besides the readings in the course pack, additional reading material will be
distributed in the class from time to time.
Course Requirements
xvii.
Prerequisite to this course is the learning derived from Business Statistics and Business
Research Methods courses.
xviii.
Since much of the course material requires class discussion, it is important that you are prepared
for the class and present and defend your ideas. Preparation for class includes having read the
assigned material.
xix.
The class will be divided into a number of groups and each group will have 4-5 students. For all
projects and assignments, each group will submit analysis report and power point presentation one
day before scheduled class.
xx.
Evaluation for this assignment will be done based on individual as well as group
performance.
xxi.
Class participation will be based on the value you add to the class through your creating learning
environment in class by raising intelligent questions, relevant statements on questions raised by
others, and maintaining class decorum.
Evaluation
Students will be evaluated on the basis of case analysis, project, assignments, class participation, mid-term
and end-term exam. The weightages given to each of these units is listed below:
Course Evaluation
Class Participation
Assignments (4)
Project report (1)
Quiz (best 1 out of 2)
End Term Exam
10
10
30
10
40
Project Evaluation
Group Project presentation
Group Project Report submission
Specific
Method
Assessment
Group Assessment
Methods
Group
Project
Proposal Presentation
Group
Project
Presentation
Group Project Report
Submission
Group Case analysis
submission
Group Assignments
Individual
Assessment
Methods
Mid Term Exam
End-term Exam
Weightage
20
10
30%
30%
40%
15%
15%
30%
40%
Project proposal
Data Collection
Final Report along with data used (if applicable)
Presentation
The presentation should cover both the analysis of primary and secondary data, as applicable.
Each presentation will be of 10 minutes with 8 minutes given to the group to highlight the key
findings and 2 minutes for open discussion in the class.
Additional assignments would be given during the course.
20 minutes
20 minutes
20 minutes
Market
response Estimation of different Practice
response
exponential market
Introduction to models,
models, semi log and models
quantitative
models
in double log models, Log
marketing area models
Session 3 & 4: Forecasting of a new product using Bass diffusion models.
20 minutes
20 minutes
20 minutes
15 minutes
20 minutes
Image mapping
Introduction to
brand analysis
Session 7, 8, 9 and 10: Structural equation Modelling, construct analysis, path analysis