Training Schedule: Basic Data Manipulation

Download as pptx, pdf, or txt
Download as pptx, pdf, or txt
You are on page 1of 46

Training schedule

Basic Data Manipulation


Week 1: Chapters 1 & 2

Introduction to SAS

Week 2: Chapters 3 & 4 (except 3.9 through


3.14)

Creating permanent SAS datasets


SAS Libraries
Reading raw data from external files
Proc Print / Proc Contents

Training schedule
Basic Data Manipulation (cont.)
Week 3: Chapters 5 & 6 (and 3.9 through 3.14)

Creating formats and labels


Reading and writing data from an Excel
spreadsheet
Proc Import with a twist
Proc Datasets

Week 4: Chapters 7 & 10 (except 10.6, 10.13)

Conditional processing
Subsetting and combining SAS datasets

Training schedule
Basic Data Manipulation (cont.)
Week 5: Chapters 9, 11, & 12 (and
10.13)

Working with dates


About SAS Date, Time, and Datetime values
Anydate informat
Working with numeric functions
Working with character functions
Put and Input

Training schedule
Basic Reporting
Week 6: Chapters 14 & 19

Displaying your data (PROC Print)


Global statements (titles, footnotes, system
options)
Introducing the Output Delivery System

Week 7: Chapters 16 & 17 (and 10.6)

Summarizing your data (PROC Means)


Counting frequencies (PROC Freq)

Training schedule
Basic Reporting (cont.)
Week 8: Chapter 15

Creating customized reports (PROC Report)

Week 9: Chapter 18

Creating Tabular Reports (PROC Tabulate)

Training schedule
Advanced Topics
Week 10: Chapter 26

Introduction to SQL

Week 11: Chapters 8, 13 & 24

Iterative processing and looping


Working with multiple observations per
subject
By group processing
Working with arrays

Training schedule
Advanced Topics (cont.)
Week 12: Chapter 25

Introduction to SAS Macro Language


%LET
CALL SYMPUT

Week 13: Chapter 20

Generating high quality graphics

Training schedule
Advanced Topics (cont.)
Week 14: Chapters 21, 22 & 23

The Leftovers!
Restructuring SAS datasets (PROC
Transpose)
Using advanced features of user-defined
formats and informats
Using advanced input techniques
Saving and storing macros

SAS Display Manager

Explorer
Window:
See libraries
and SAS
datasets

Log:
Details about jobs youve
run, including error
messages

Enhanced Editor:
Where you write your SAS
code

Results tab:
List of
previously run
results

Output Window:
Basic view of results and
output
(Also known as Listing
Destination)

Run:
Submits your SAS code
(The Running Man)

You can submit the


program as a whole
or
Run only a portion of
highlighted code

Chunks of code will


be processed in the
order you RUN them,
not necessarily in the
order you wrote them

The elements of a SAS


program

DATA steps:
Read and write data,
manipulate data, perform
calculations, etc.
Every DATA step creates a new
dataset

Every SAS
program can
have
multiple
DATA and
PROC steps

PROC steps:
Produce reports, summarize data, sort
data,
create formats, generate graphs, etc. etc.
etc.

Global statements:
Remain in effect until changed by
another global statement or SAS
session ends
Examples:
Titles, Footnotes, Options, Macros,
Libname

Comment statements:
Ignored by SAS when code is run
Used to write comments to yourself or
others or
comment out code you dont want to
run
*comment statement;
or
/* comment statement */

Programming in SAS

SAS is generally forgiving

Code can be written on a single line or


multiple lines
Code and variable names not case sensitive
Even accepts some misspellings
Fi The Doo

Semicolons are super important


Colors of Enhanced Editor
Always check your log!!!
Often many ways to do the same thing

DATA step examples

DATA step examples

DATA step examples

DATA demographic
DATA demographic creates a dataset called
Demographic
Infile statement reads the text file with the data
Input statement tells SAS what to name the new variables.
$ sign after Gender indicates that this is a character
variable. The remaining variables are numeric.

DATA step examples

DATA newdemo
DATA newdemo creates a dataset called newdemo
SET demographic references the previous dataset
demographic as the basis for the new dataset
BMI = creates a new variable called BMI that is
calculated as shown. This is called an assignment
statement.

PROC step examples

PROC step examples

PROC step examples


PROC freq
Creates a basic
frequency table with:
Frequency
Percent
Cumulative
Frequency
Cumulative Percent
DATA=newdemo
references the newdemo
dataset
TABLE statement
indicates which
variable(s) to include
Can use multiple
variables and even create
cross-tab tables

PROC step examples


PROC means
Creates a basic
summary table with:
N
Mean
Std Dev
Minimum
Maximum
Can specify which
statistics to use in table
DATA=newdemo
references the newdemo
dataset
VAR statement indicates
which variable(s) to
include

Other notes
Step boundaries:
Each step is executed when a step
boundary is encountered
Explicit step boundary: RUN or
QUIT statement

Good SAS
programmer
s use
EXPLICIT
step
boundaries

Implicit step boundary: Beginning


of a new PROC or DATA step

Other notes
Referencing datasets:
Most procedures and DATA steps
reference an existing dataset
Explicit dataset reference: DATA=
or SET statement

Good SAS
programmer
s use
EXPLICIT
dataset
references

Implicit dataset reference: If not


explicitly referenced, SAS will use
the last referenced dataset

SAS Help

Where to learn more about SAS

Technical papers
SAS User Groups
Websites
Books
Classes

http://www.sascommunity.org/wiki/
Learning_SAS_-_Resources_You_Cant_Live_Without
Popular Websites

www.google.com
Popular Websites

www.ats.ucla.edu/stat/
Popular Websites
sas

Explains
both the
stats and
the SAS
code

www.ats.ucla.edu/stat/
Popular Websites
sas

www.lexjansen.com
Popular Websites

SAS Help and Documentation

Popular Websites

How to read SAS


Documentation
DATA output-SAS-data-set
(DROP=variable(s) | KEEP=variable(s));
SET SAS-data-set <options>;
BY variable(s);
RUN;

How to read SAS


Documentation
DATA output-SAS-data-set
(DROP=variable(s) | KEEP=variable(s));
SET SAS-data-set <options>;
BY variable(s);
RUN;

DATA, DROP=, KEEP=, SET, BY, and RUN are in


uppercase bold because they must be spelled as
shown.

How to read SAS


Documentation
DATA output-SAS-data-set
(DROP=variable(s) | KEEP=variable(s));
SET SAS-data-set <options>;
BY variable(s);
RUN;

output-SAS-data-set, variable(s), SAS-data-set, and


options are in italics because each represents a
value that you supply.

How to read SAS


Documentation
DATA output-SAS-data-set
(DROP=variable(s) | KEEP=variable(s));
SET SAS-data-set <options>;
BY variable(s);
RUN;

<options> is enclosed in angle brackets because it


is optional syntax.

How to read SAS


Documentation
DATA output-SAS-data-set
(DROP=variable(s) | KEEP=variable(s));
SET SAS-data-set <options>;
BY variable(s);
RUN;

DROP= and KEEP= are separated by a vertical bar (


| ) to indicate that they are mutually exclusive.

Sample datasets

Original zip file in:


R:\Training\SAS\Learning SAS by Example
(Cody)\60864

SAS Datasets:
Permanent location of all SAS
Datasets

Text and CSV:


Text and CSV data files used to
create the SAS Datasets

For next week


Read chapters 3 & 4
(skip sections 3.9 through 3.14)

You might also like