Regression Analysis - ISYE 6414
Regression Analysis - ISYE 6414
Regression Analysis - ISYE 6414
Class Schedule
12:05pm - 1:25pm Tuesday and Thursday (IC 105)
Honor Code For any questions involving Academic Honor Code issues, please consult me, my
teaching assistants, or www.honor.gatech.edu.
Course prerequisites: A sound familiarity with undergraduate or graduate statistics and prob-
ability.
Textbook: The course material will be based on a set of lecture notes being prepared by the
instructor, but two primary textbooks are highly recommended:
1. G. A. F. Seber, Alan J. Lee (2003) Linear Regression Analysis, Wiley Series in Probability
and Statistics.
2. L. Wasserman (2010) All of Statistics, Springer Series in Statistics.
Other recommended books:
1. P. McCullagh, J.A. Nelder (1989), Generalized Linear Models, Chapman & Hall.
2. L. Wasserman (2010) All of Nonparametrics, Springer Series in Statistics.
1
Midterms: There will be two midterm exams with problems reviewing the material (lectures
and assignments) provided in this course throughout the full semester. The exams are close notes
(including homeworks) and books but a two (one-sided) pages with formulas will be allowed. The
midterms are designed to help students grasp standard regression analysis methodology which will
further facilitate a deeper understanding in the application context. All students will take the
midterm in the same day at the same time.
Dates:
Midterm 1: October 11th
Midterm 2: November 16th
Assignments: Assignments will include both theoretical and computer problems; the latter prob-
lems will ask you carry out analysis of data sets and simulations using computer software. Keep in
mind that you should not hand in raw computer output. Conclusions and interpretation of results
are more important than good printouts. These assignments are intended to help you prepare for
the midterm exams and final project. You are allowed (and encouraged) to work together with
other students on homework, as long as you write up and turn in your own solutions. You are
also allowed (and encouraged) to ask me questions, although you should try to think about the
problems before asking. Late Homework will not be accepted.
Project: This project is a requirement you must fulfill in order to pass this course. The general
goal of the project is to provide you with experience in applying regression analysis methodology
to real data. For this project, you and your team must find a data set on your own. The data
cannot be a data set found in a textbook or been analyzed in detail and results published. This
project will serve as a means for students to demonstrate what they understand and can do with
the content of the course. There will be an oral presentation of the project (≈ 15 minutes). In
grading, I will primarily look for a sensible approach to the problem, and clearly-made connections
between your analyses and the substantive questions. You can use any computing equipment and
any computing resources in the school, any written source material you can find, in or out of the
school. However, replicating results which have been already published without referencing to the
source of publication is subject to plagiarism. Plagiarizing is defined by Websters as “to steal and
pass off (the ideas or words of another) as one’s own : use (another’s production) without crediting
the source.” Be sure to document carefully your project work.
Deadline to submit an abstract of the project: October 23rd, 2012.
Deadline to submit the project work (report): December 7th, 2012.
Class presentation dates: December 5th and December 7th, 2012.
2
Objectives: A tentative list of specific topics in this course is as follows:
Part 1: Simple Linear Regression
1. Introduction
2. Estimation
3. Inference
2. Estimation
3. Inference
4. Diagnostics
5. Misc Topics
1. Bias-Variance Decomposition
1. Model Estimation
1. Models
3. Model Validation
1. Smoothing
2. Kernel Regression
3. Local Polynomials