Mhos Pedsf


Documentation for the Patient Entitlement and Diagnosis Summary File (PEDSF)

March 28, 2017

MHOS-PEDSF.Docx for MHOS Survey Level File

Selected SEER variables were copied from the Patient Entitlement and Diagnosis File
(PEDSF). These variables are available for MHOS patients found in the SEER file and linked
to Medicare. Not all variables were written to the MHOS survey file because they are either
restricted variables or not applicable to MHOS patients.

SEER Cases diagnosed 1973-2013

PEDSF File created on October 4, 2016

SEER data extracted from November, 2015 SEER Submission

NOTE1: Census data is not included on the PEDSF file but will be sent out in separate files
that can be linked by census tract and zip code.

The SEER information on the PEDSF file has been rearranged to mimic the SEER Research
File and the site variable is now 5 digits long instead of 2.

NOTE2: The SEER program has completed a review of PSA values for prostate cases from
2010-2013 and PSA errors have been corrected for the SEER data release in April 2016.
Prostate cases from 2004-2009 are currently being reviewed. Their PSA data will be added
after the completion of this review. For more information, visit
https://seer.cancer.gov/data/psa-values.html.

COL FIELD LENGTH SOURCE NOTES


22 Date of Death Flag (dod_flg) 1 Created at IMS
    Shows the level of agreement between SEER and MEDICARE on the patient's
    month of death. Dates of death after 12/13 were treated as not dead for
    comparison purposes.
    0 = Not dead by 12/13.
    1 = Dead, both files agree.
    2 = Dead, off by 1-3 months.
    3 = Dead, off by 4-6 months.
    4 = Dead in MEDICARE only.
    5 = Dead in SEER only.
    6 = Dead, but the number of months could not be calculated because the
        month was missing for either SEER or MEDICARE.
23 Date of Birth Flag (dob_flg) 1 Created at IMS
    Shows the level of agreement between SEER and MEDICARE on the patient's
    month of birth.
    0 = Both files agree on birth date.
    1 = Birth date off by 1-3 months.
    2 = Birth date off by 4-6 months.
    3 = Birth date off by 7-11 months.
    4 = Birth date off by one year.
    5 = Birth date off by 13-23 months.
    6 = Birth date off by 2 years.
    7 = Birth date off by 25-35 months.
    8 = Birth date off by 3 years.
    9 = Birth date off by 37+ months.
    Blank = Birth date is missing.
54 First Chronic Renal Disease Year (first_esrd_yr) 4 EDB
    First occurrence of Chronic Renal Disease regardless of age.
64 Valid Date of Death (vrfydth) 1 EDB
    N = No
    Y = Yes
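The Date of Birth Flag bands can be read as a lookup on the gap, in months, between the SEER and Medicare birth dates. The Python sketch below illustrates that reading only. The function names and the year/month gap calculation are assumptions made for illustration; this documentation does not describe how the flag was actually derived at IMS.

```python
from typing import Optional


def months_apart(year_a: int, month_a: int, year_b: int, month_b: int) -> int:
    """Absolute gap in whole months between two year/month pairs."""
    return abs((year_a - year_b) * 12 + (month_a - month_b))


def dob_flag(gap_months: Optional[int]) -> str:
    """Return the dob_flg code for a month gap (None means the date is missing)."""
    if gap_months is None:
        return " "   # Blank = birth date is missing
    if gap_months == 0:
        return "0"   # both files agree
    if gap_months <= 3:
        return "1"   # off by 1-3 months
    if gap_months <= 6:
        return "2"   # off by 4-6 months
    if gap_months <= 11:
        return "3"   # off by 7-11 months
    if gap_months == 12:
        return "4"   # off by one year
    if gap_months <= 23:
        return "5"   # off by 13-23 months
    if gap_months == 24:
        return "6"   # off by 2 years
    if gap_months <= 35:
        return "7"   # off by 25-35 months
    if gap_months == 36:
        return "8"   # off by 3 years
    return "9"       # off by 37+ months
```

For example, `dob_flag(months_apart(1940, 1, 1942, 2))` falls in the 25-35 month band.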
Geographic Information
SEER program data are from the SEER record corresponding to the first diagnosis of
cancer at age 65 or older. Zip codes are from the Medicare enrollment file in the year of the
first diagnosis at age 65 or older, or of the last diagnosis if all diagnoses occurred before age 65.
65 State (state) (renamed to sr_state for the MHOS survey file) 2 SEER
    FIPS State Code
67 County (county) (renamed to sr_county for the MHOS survey file) 3 SEER
    FIPS County Code
70 Zip Code (zip5) 5 EDB
    Encrypted zip code, 5 digits.
    * Special permission required for unencrypted zip code.
75 Zip Code (zip4) 4 EDB
    Last four digits of zip code. Blanked out when encrypted zip code is given.
    * Special permission required for unencrypted zip code.
79 Census Tract Flag (code_sys) 1 SEER
    Use in conjunction with Census Tract (tract).
    0 = Not tracted
    1 = 1970 Census tract definitions
    2 = 1980 Census tract definitions
    3 = 1990 Census tract definitions
80 Census Tract (tract1990) 6 SEER
    1970/80/90 encrypted census tract
    * Special permission required for unencrypted census tract.
86 Census Tract 2000 (tract2000) 6 SEER
    2000 encrypted census tract
    * Special permission required for unencrypted census tract.
92 Census Tract 2010 (tract2010) 6 SEER
    2010 encrypted census tract
    * Special permission required for unencrypted census tract.
98 HSA (hsa) 3 ARF
    Health Service Area. Taken from the 2004 Area Resource File (ARF).
101 Urban/Rural recode (urbrur) 1 ARF
    Urban/Rural Code
    1 = Big Metro (Urban = 00 or 01)
    2 = Metro (Urban = 02 or 03)
    3 = Urban (Urban = 04 or 05)
    4 = Less Urban (Urban = 06 or 07)
    5 = Rural (Urban = 08 or 09)
    9 = Unknown (Urban = 99)
102 Urban/Rural code (urban) 2 ARF
    01-09, 99; see table at end of this document
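The Urban/Rural recode (urbrur) is a grouping of the two-digit Urban/Rural code (urban). The Python sketch below reproduces that grouping as a lookup table. The names `URBRUR_BY_URBAN`, `URBRUR_LABELS`, and `urbrur_from_urban` are hypothetical, and returning `None` for a value not listed in the table is an assumption.

```python
from typing import Optional

# urban (2 characters) -> urbrur (1 character), as listed for the urbrur field.
URBRUR_BY_URBAN = {
    "00": "1", "01": "1",
    "02": "2", "03": "2",
    "04": "3", "05": "3",
    "06": "4", "07": "4",
    "08": "5", "09": "5",
    "99": "9",
}

URBRUR_LABELS = {
    "1": "Big Metro",
    "2": "Metro",
    "3": "Urban",
    "4": "Less Urban",
    "5": "Rural",
    "9": "Unknown",
}


def urbrur_from_urban(urban: str) -> Optional[str]:
    """Return the urbrur code for an urban code, or None if it is not listed."""
    return URBRUR_BY_URBAN.get(urban.strip())
```

A call such as `URBRUR_LABELS[urbrur_from_urban("07")]` gives the label for a county coded 07.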

Demographic Information
The following information comes from the SEER record corresponding to the first diagnosis
at age 65 or over, or the latest diagnosis if all are prior to turning age 65.
104 Sex (s_sex) 1 SEER See Attachment A
105 Race Recode B (rac_recb) (renamed to sr_race for the MHOS survey file) 2 SEER
    Race Recode B from SEER
    01,11 = White
    01 = Caucasian, NOS
    02 = Black
    03 = American Indian/Alaska Native
    04 = Chinese
    05 = Japanese
    06 = Filipino
    07 = Hawaiian
    08 = Other Asian or Pac. Islander
    09 = Unknown
    11 = Caucasian, Spanish origin or surname
    12 = Other unspecified (1991+)
107 Race Recode Y (rac_recy) 1 SEER See Attachment A
108 Race Recode A (rac_reca) 1 SEER See Attachment A
109 ICD Code Cause of Death (icd_code) 1 SEER
    0 = Patient is alive at last follow-up
    1 = Tenth ICD revision
    8 = Eighth ICD revision
    9 = Ninth ICD revision
110 Cause of Death ICD-8 or 9 (cod89v) 4 SEER
    0000 = Alive at last contact
    7777 = State Death Certificate not available.
    7797 = Death Certificate available, no COD listed.
114 Cause of Death ICD-10 (cod10v) 4 SEER
    A020-Y891
    0000 = Alive at last contact
    7777 = State Death Certificate not available.
    7797 = Death Certificate available, no COD listed.

118 Cause of Death to site recode KM (codkm) 5 SEER
    This is a recode based on underlying cause of death that assigns cause of
    death to groups similar to the incidence site recode with KS and
    mesothelioma. Study cutoff date has been applied, i.e. coded as alive if
    death occurred after study cutoff. Go to the end of this document for a
    listing of codes and their definitions.
123 Cause of Death to site recode (codpub) 5 SEER
    This recode was introduced to account for several newly valid ICD-10 codes
    and includes both cancer and non-cancer causes of death. Go to the end of
    this document for a listing of codes and their definitions.
128 NHIA Derived Hispanic Origin (nhiade) 1 SEER See Attachment A
129 SEER Month of Death (ser_dodm) 2 SEER
    00 = Alive
    blank = Unknown month
    Date complete through 12/31/13
131 SEER Year of Death (ser_dody) 4 SEER
    0000 = Alive
    2053 = Unknown year
    Date complete through 12/31/13
135 Date Flag for Follow Up (deathflag) 2 SEER
    0 = Alive
137 SEER Race/Ethnicity (srace) 2 SEER See Attachment A
139 SEER Hispanic Surname (origin) 1 SEER See Attachment A
140 Origin recode NHIA (Hispanic, Non-Hisp) (origrecb) 1 SEER See Attachment A
141 Filler 1
142 Vital Status Recode (stat_rec) 1 SEER See Attachment A
143 Census Tract Certainty (cen_cert) 1 SEER
    Associated with 1970/80/90 Census data
    1 = Census tract based on complete and valid street address of residence
    2 = Census tract based on residence ZIP+4
    3 = Census tract based on residence ZIP+2
    4 = Census tract based on residence ZIP code only
    5 = Census tract based on ZIP code of post office box
    6 = Census tract/BNA based on residence city where city has only one
        census tract, or based on residence ZIP code where ZIP code has only
        one census tract
    9 = Unable to assign census tract based on available information
144 Census Tract Certainty 2000 (ctcer2k) 1 SEER
    Associated with 2000 Census data
    1 = Census tract based on complete and valid street address of residence
    2 = Census tract based on residence ZIP+4
    3 = Census tract based on residence ZIP+2
    4 = Census tract based on residence ZIP code only
    5 = Census tract based on ZIP code of post office box
    6 = Census tract/BNA based on residence city where city has only one
        census tract, or based on residence ZIP code where ZIP code has only
        one census tract
    9 = Unable to assign census tract based on available information
145 Census Tract Certainty 2010 (ctcer2010) 1 SEER
    Associated with 2010 Census data
    1 = Census tract based on complete and valid street address of residence
    2 = Census tract based on residence ZIP+4
    3 = Census tract based on residence ZIP+2
    4 = Census tract based on residence ZIP code only
    5 = Census tract based on ZIP code of post office box
    6 = Census tract based on residence city where city has only one census
        tract, or based on residence ZIP code where ZIP code has only one
        census tract
    9 = Unable to assign census tract based on available information
146 Census Tract Poverty Indicator (census_pov_ind) 1 SEER
    1 = 0% to <5% poverty
    2 = 5% to <10% poverty
    3 = 10% to <20% poverty
    4 = 20% to 100% poverty
    9 = Unknown
147 SEER Year of Birth (yr_brth) 4 SEER See Attachment A
151 Date of Birth Flag (dbrflag) 2 SEER
    12 = A proper value is applicable but not known.
153 Number of SEER records (count) (renamed to sr_count) 2 Created at IMS
    Number of eligible SEER records. SEER records prior to 1992 for Rural
    Georgia and records prior to 2000 for Greater California, New Jersey,
    Kentucky, and Louisiana were not included in PEDSF.
157 Diagnosis indicator (resnrec) 1 Created at IMS
    0 = Last Dx; Patient always less than 65.
    1 = First Dx at age 65 or later.
1747 State (state1991-state2015) 2 EDB
    State code from EDB file.
1749 County (cnty1991-cnty2015) 3 EDB
    County code from EDB file.
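The COL and LENGTH columns read like a fixed-width record layout: consecutive rows are consistent with COL being a 1-based starting position (for example, State at 65 with length 2 is followed by County at 67). The Python sketch below slices a record under that assumption. `LAYOUT`, `parse_record`, and `build_record` are hypothetical names, the sample values are made up, and the positions describe the PEDSF layout, so the MHOS survey file may be laid out differently.

```python
# (variable name, COL, LENGTH) copied from the layout table.
# Assumption: COL is the 1-based starting position of the field.
LAYOUT = [
    ("dod_flg", 22, 1),
    ("dob_flg", 23, 1),
    ("first_esrd_yr", 54, 4),
    ("vrfydth", 64, 1),
    ("state", 65, 2),
    ("county", 67, 3),
    ("zip5", 70, 5),
    ("urbrur", 101, 1),
    ("urban", 102, 2),
    ("s_sex", 104, 1),
    ("rac_recb", 105, 2),
]


def parse_record(line: str) -> dict:
    """Slice one fixed-width line into a dict of raw (unstripped) strings."""
    return {name: line[col - 1:col - 1 + length] for name, col, length in LAYOUT}


def build_record(values: dict, width: int = 200) -> str:
    """Build a synthetic fixed-width line; used here only to exercise the parser."""
    buf = [" "] * width
    for name, col, length in LAYOUT:
        text = values.get(name, "").ljust(length)[:length]
        buf[col - 1:col - 1 + length] = text  # same-length slice assignment
    return "".join(buf)


# Made-up values, not real patient data.
sample = build_record({"dod_flg": "1", "state": "06", "county": "037", "urban": "01"})
fields = parse_record(sample)
```

Values are kept as strings so that leading zeros and blanks, which carry meaning in several fields, are preserved.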

Repeated SEER Cancer Information

A patient can have up to 10 diagnoses in SEER, so the variables below are repeated for each
cancer diagnosis the patient has in SEER. If the patient has more than 10, only the first 10 are
retained for this file.
Variable names appear in parentheses, followed by the SEER variable name (in italics).
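Because each cancer variable is repeated with a numeric suffix (for example reg1-reg10), a record can be turned into one row per diagnosis. The Python sketch below shows one way to do that. The helper names are hypothetical, and treating an all-blank slot as "no diagnosis" is an assumption not stated in this documentation. Stems that end in an underscore can be passed as written (for example `"hist2_"`); zero-padded names such as dajcc7_01 are not handled.

```python
MAX_DIAGNOSES = 10  # only the first 10 diagnoses are retained


def repeated_names(stem: str) -> list:
    """Expand a stem such as 'reg' into ['reg1', ..., 'reg10']."""
    return [f"{stem}{i}" for i in range(1, MAX_DIAGNOSES + 1)]


def diagnoses(record: dict, stems: list) -> list:
    """Return one dict per diagnosis slot that has any non-blank value."""
    rows = []
    for i in range(1, MAX_DIAGNOSES + 1):
        row = {stem: record.get(f"{stem}{i}", "") for stem in stems}
        # Assumption: a slot with only blanks means no diagnosis in that position.
        if any(value.strip() for value in row.values()):
            rows.append({"dx_index": i, **row})
    return rows


# Made-up record with one populated diagnosis slot.
record = {"yrdx1": "2005", "site1": "509", "yrdx2": "    ", "site2": "   "}
long_rows = diagnoses(record, ["yrdx", "site"])
```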

2202 SEER Registry Code at Diagnosis (reg1-reg10) (reg) 2 SEER
    01 = San Francisco (1973+)
    02 = Connecticut (1973+)
    20 = Detroit (1973+)
    21 = Hawaii (1973+)
    22 = Iowa (1973+)
    23 = New Mexico (1973+)
    25 = Seattle (1974+)
    26 = Utah (1973+)
    27 = Atlanta (1975+)
    31 = San Jose (1988+)
    35 = Los Angeles (1988+)
    37 = Rural Georgia (1992+)
    41 = Greater California (2000+)
    42 = Kentucky (2000+)
    43 = Louisiana (2000+)
    44 = New Jersey (2000+)
    47 = Greater Georgia (2000+)
2204 Marital Status at 1 SEER See Attachment A
Diagnosis
(marst1-marst10)
(mar_stat)
2205 Age at Diagnosis 3 SEER See Attachment A
(agedx1-agedx10)
(age_dx)
2208 Sequence Number 2 SEER See Attachment A
(seq1-seq10)
(seq_num)
2210 Month of Diagnosis 2 SEER See Attachment A
(modx1-modx10)
(date_mo)
2212 Year of Diagnosis 4 SEER See Attachment A
(yrdx1-yrdx10)
(date_yr)
2216 Primary Site 3 SEER See Attachment A
(site1-site10)
(site02v)
Note: This variable is length 4 in the SEER file. The first character (which is
always C) was not written to the PEDSF.
2219 Laterality 1 SEER See Attachment A
(lat1-lat10)
(lateral)
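Since Primary Site is stored without its leading C, comparing it with 4-character SEER site codes means putting the C back. The Python sketch below does that. The function name is hypothetical, and both the all-digits check and the handling of blank values are assumptions.

```python
from typing import Optional


def full_primary_site(site3: str) -> Optional[str]:
    """Rebuild the 4-character site code from the 3-character PEDSF value."""
    code = site3.strip()
    if not code:
        return None  # assumption: a blank slot means no diagnosis in that position
    if len(code) != 3 or not code.isdigit():
        # Assumption: the three stored characters are always digits.
        raise ValueError(f"unexpected primary site value: {site3!r}")
    return "C" + code
```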

2220 Histology ICD-O-2 4 SEER See Attachment A
(1992-2000)
(hist2_1-hist2_10)
(hist02v)
2224 Behavior code 1 SEER See Attachment A
ICD-O-2 (1992-2000)
(beh2_1-beh2_10)
(beh02v)
2225 Histology ICD-O-3 4 SEER See Attachment A
(hist1-hist10)
(hist03v)
2229 Behavior code 1 SEER See Attachment A
ICD-O-3
(beh1-beh10)
(beh03v)
2230 Grade 1 SEER See Attachment A
(grade1-grade10)
(grade)
2231 Diagnostic Confirmation 1 SEER See Attachment A
(dxconf1-dxconf10)
(dx_conf)
2232 Type of Reporting 1 SEER See Attachment A
Source
(src1-src10)
(rept_src)
2233 EOD 10 - Tumor Size 3 SEER See Attachment A
(1988-2003)
(e10sz1-e10sz10)
(eod10_sz)
2236 EOD 10 Tumor Extent 2 SEER See Attachment A
(1988-2003)
(e10ex1-e10ex10)
(eod10_ex)
2238 EOD 10 Prostate 2 SEER See Attachment A
pathology ext
(1995-2003)
(e10pe1-e10pe10)
(eod10_pe)

2240 EOD 10 - Nodes 1 SEER See Attachment A
(1988-2003)
(e10nd1-e10nd10)
(eod10_nd)
2241 EOD 10 - # positive 2 SEER See Attachment A
Nodes (1988+)
(e10pn1-e10pn10)
(eod10_pn)
2243 EOD 10 - # Nodes 2 SEER See Attachment A
Examined (1988+)
(e10ne1-e10ne10)
(eod10_ne)
2245 EOD OLD 13 Digit 13 SEER See Attachment A
(1973-1982)
(eod13_1-eod13_10)
(eod13)
2258 EOD OLD 2 Digit 2 SEER See Attachment A
(1973-1982)
(eod2_1-eod2_10)
(eod2)
2260 EOD 4 size 2 SEER See Attachment A
(1983-1987)
(e4siz1-e4siz10)
(eod4)
2262 EOD 4 extent 1 SEER See Attachment A
(1983-1987)
(e4ext1-e4ext10)
(eod4)
2263 EOD 4 nodes 1 SEER See Attachment A
(1983-1987)
(e4nod1-e4nod10)
(eod4)
2264 Coding system extent 1 SEER See Attachment A
of disease
(1973-2003)
(eod_cd1-eod_cd10)
(eodcode)
2265 Tumor Marker 1 1 SEER See Attachment A
(1990-2003)
(tumor1_1-tumor1_10)
(tumor_1v)

2266 Tumor Marker 2 1 SEER See Attachment A
(1990-2003)
(tumor2_1-tumor2_10)
(tumor_2v)
Note: Prostate data has been removed from this field.
2267 Tumor Marker 3 1 SEER See Attachment A
(1998-2003)
(tumor3_1-tumor3_10)
(tumor_3v)
2268 CS Tumor Size 3 SEER See Attachment A
(2004+)
(cstum1-cstum10)
(cs_size)
2271 CS Extension 3 SEER See Attachment A
(2004+)
(csex1-csex10)
(cs_ext)
2274 CS Lymph Nodes 3 SEER See Attachment A
(2004+)
(cslym1-cslym10)
(cs_node)
2277 CS Mets at Dx 2 SEER See Attachment A
(2004+)
(csmet1-csmet10)
(cs_mets)
2279 CS Site-Specific Factor 3 SEER See Attachment A
1 (2004+)
(cs1st1-cs1st10)
(cs_ssf1)
Note: Prostate data has been removed from this field.
2282 CS Site-Specific Factor 3 SEER See Attachment A
2 (2004+)
(cs2st1-cs2st10)
(cs_ssf2)
Note: Prostate data has been removed from this field.
2285 CS Site-Specific Factor 3 SEER See Attachment A
3 (2004+)
(cs3st1-cs3st10)
(cs_ssf3)
2288 CS Site-Specific Factor 3 SEER See Attachment A
4 (2004+)
(cs4st1-cs4st10)
(cs_ssf4)

2291 CS Site-Specific Factor 3 SEER See Attachment A
5 (2004+)
(cs5st1-cs5st10)
(cs_ssf5)
2294 CS Site-Specific Factor 3 SEER See Attachment A
6 (2004+)
(cs6st1-cs6st10)
(cs_ssf6)
2297 CS Site-Specific Factor 3 SEER See Attachment A
25 (2004+)
(cs25st1-cs25st10)
(cs_ssf25)
2300 Derived AJCC T, 6th 2 SEER See Attachment A
edition (2004+)
(dajcct1-dajcct10)
(d_ajcc_t)
2302 Derived AJCC N, 6th 2 SEER See Attachment A
edition (2004+)
(dajccn1-dajccn10)
(d_ajcc_n)
2304 Derived AJCC M, 6th 2 SEER See Attachment A
edition (2004+)
(dajccm1-dajccm10)
(d_ajcc_m)
2306 Derived AJCC Stage 2 SEER See Attachment A
Group, 6th edition
(2004+)
(dajccstg1-
dajccstg10)
(d_ajcc_s)
2308 Derived SS1977 1 SEER See Attachment A
(2004+)
(dss77s1-dss77s10)
(d_ssg77)
2309 Derived SS2000 1 SEER See Attachment A
(2004+)
(dss00s1-dss00s10)
(d_ssg00)
2310 Derived AJCC-Flag 1 SEER See Attachment A
(dajcflg1-dajcflg10)
(d_ajcc_f)

2311 Derived SS1977 - Flag 1 SEER See Attachment A
(dss77f1-dss77f10)
(d_ssg77f)
2312 Derived SS2000 Flag 1 SEER See Attachment A
(dss00f1-dss00f10)
(d_ssg00f)
2313 CS Version 1st 6 SEER See Attachment A
(2004+)
(csvf1-csvf10)
(csv_org)
2319 CS Version Latest 6 SEER See Attachment A
(2004+)
(csvl1-csvl10)
(csv_der)
2325 CS version input 6 SEER See Attachment A
current (2004+)
(cscurrent1-
cscurrent10)
(csv_cur)
2331 RX Summ Surgery of 2 SEER See Attachment A
Primary Site (1998+)
(sxprif1-sxprif10)
(surgprim)
2333 RX Summ Scope Reg 1 SEER See Attachment A
LN Sur (2003+)
(sxscof1-sxscof10)
(scope)
2334 RX Summ--Surg Oth 1 SEER See Attachment A
Reg/Dis (2003+)
(sxsitf1-sxsitf10)
(surgoth)
2335 Number of regional 2 SEER See Attachment A
lymph nd exam
(1998-2002)
(numnd1-numnd10)
(surgnode)
2338 Reason no cancer- 1 SEER See Attachment A
directed surgery
(nosrg1-nosrg10)
(no_surg)

2339 Radiation 1 SEER See Attachment A
(rad1-rad10)
(radiatn)
2340 Radiation to Brain and 1 SEER See Attachment A
/or CNS
(radbrn1-radbrn10)
(rad_brn)
2341 Radiation sequence 1 SEER See Attachment A
with surgery
(radsurg1-radsurg10)
(rad_surg)
2342 Site Specific Surgery 2 SEER See Attachment A
(1983-1997)
(sssurg1-sssurg10)
(ss_surg)
2344 Scope of regional 1 SEER See Attachment A
lymph node surgery
(1998-2002)
(sxscop1-sxscop10)
(scope02)
2345 Surgery of other 1 SEER See Attachment A
reg/dist sites
(1998-2002)
(sxsite1-sxsite10)
(srgoth02)
2348 Age-site edit override 1 SEER See Attachment A
(ositage1-ositage10)
(o_sitage)
2349 Sequence number-dx 1 SEER See Attachment A
conf override
(oseqcon1-
oseqcon10)
(o_seqcon)
2350 Site-type-lat-seq 1 SEER See Attachment A
override
(oseqlat1-oseqlat10)
(o_seqlat)
2351 Surgery-diagnostic conf 1 SEER See Attachment A
override
(osurcon1-osurcon10)
(o_surcon)

2352 Site-type edit override 1 SEER See Attachment A
(osittyp1-osittyp10)
(o_sittyp)
2353 Histology edit override 1 SEER See Attachment A
(hbenign1-hbenign10)
(h_benign)
2354 Report source 1 SEER See Attachment A
sequence override
(orptsrc1-orptsrc10)
(o_rptsrc)
2355 Seq-ill-defined site 1 SEER See Attachment A
override
(odfsite1-odfsite10)
(o_dfsite)
2356 Leuk-Lymph dx 1 SEER See Attachment A
confirmation override
(oleukdx1-oleukdx10)
(o_leukdx)
2357 Site-behavior override 1 SEER See Attachment A
(ositbeh1-ositbeh10)
(o_sitbeh)
2358 Site-EOD-dx date 1 SEER See Attachment A
override
(oeoddt1-oeoddt10)
(o_eoddt)
2359 Site-laterality-EOD 1 SEER See Attachment A
override
(ositeod1-ositeod10)
(o_siteod)
2360 Site-laterality-morph 1 SEER See Attachment A
override
(ositmor1-ositmor10)
(o_sitmor)
2361 Type of follow-up 1 SEER See Attachment A
expected
(typefu1-typefu10)
(typefup)
2362 Age at Diagnosis 2 SEER See Attachment A
recode (ager1-ager10)
(age_rec)

2364 Site Recode ICD-O-3/ 5 SEER See Attachment A
WHO 2008
(siterwho1-siterwho10)
(siterwho)
* This variable is used to select patients for data requests.
2369 Filler 4
2373 Recode ICD-O-2 to 9 4 SEER See Attachment A
(icdot09_1-
icdot09_10)
(icdot09v)
2377 Recode ICD-O-2 to 10 4 SEER See Attachment A
(icdot10_1-
icdot10_10)
(icdot10v)
2381 ICCC site recode 3 SEER See Attachment A
ICD-O-3/WHO 2008
(ICCC3WHO1-
ICCC3WHO10)
(ICCC3WHO)
2384 ICCC site recode 3 SEER See Attachment A
extended ICD-O-3/
WHO 2008
(ICC3XWHO1-
ICC3XWHO10)
(ICCC3XWHO)
2387 Behavior recode for 1 SEER See Attachment A
analysis (behtrend1-
behtrend10)
(behanal)
2388 Histology recode - 2 SEER See Attachment A
broad groupings
(histrec1-histrec10)
(histrec)
2390 Histology recode 2 SEER See Attachment A
Brain groupings
(hisrcb1-hisrcb10)
(brainrec)
2392 CS Schema v 0204 3 SEER See Attachment A
(cs04sch1-cs04sch10)
(cs0204schema)
2395 SEER Historic Stage A 1 SEER See Attachment A
(hstst1-hstst10)
(hst_stga)

2396 AJCC Stage 3rd Edition 2 SEER See Attachment A
(1988-2003)
(ajccstg1-ajccstg10)
(ajcc_stg)
2398 SEER Modified AJCC 2 SEER See Attachment A
Stage 3rd edition
(1988-2003)
(aj3sr1-aj3sr10)
(aj_3seer)
2400 SEER Summary Stage 1 SEER See Attachment A
1977 (1995-2000)
(sss77v1-sss77v10)
(ssg77)
2401 SEER summary stage 1 SEER See Attachment A
2000 (2001-2003)
(sssm2Z1-sssm2Z10)
(ssg2000)
2402 First Malignant Primary 1 SEER See Attachment A
Indicator
(frstprm1-frstprm10)
(firstprm)
2403 FIPS State Code 2 SEER See Attachment A
(statecd1-statecd10)
(stcounty)
2405 FIPS County Code 3 SEER See Attachment A
(cnty1-cnty10)
(stcounty)
2408 IHS Code 1 SEER See Attachment A
(ihscd1-ihscd10)
2409 Summary Stage 2000 1 SEER See Attachment A
(1998+)
(summ2k1-
summ2k10)
(hist_ssg_2000)
2410 AYA site recode/WHO 2 SEER See Attachment A
2008
(ayawho1-ayawho10)
(aya_recode)
2412 Lymphoma subtype 2 SEER See Attachment A
recode/WHO 2008
(lymwho1-lymwho10)
(lymphoma_recode)

2414 SEER cause-specific 1 SEER See Attachment A
death classification
(vsrtdx1-vsrtdx10)
(dth_class)
2415 SEER other cause of 1 SEER See Attachment A
death classification
(odthclass1-
odthclass10)
(o_dth_class)
2416 CS Tumor Size/Ext 1 SEER See Attachment A
Eval (2004+)
(csts1-csts10)
(exteval)
2417 CS Reg Nodes Eval 1 SEER See Attachment A
(2004+)
(csrg1-csrg10)
(nodeeval)
2418 CS Mets Eval (2004+) 1 SEER See Attachment A
(csmt1-csmt10)
(metseval)
2419 Primary by International 1 SEER See Attachment A
Rules
(intprim1-intprim10)
(intprim)
2420 ER Status Recode 1 SEER See Attachment A
Breast Cancer (1990+)
(erstat1-erstat10)
(erstatus)
2421 PR Status Recode 1 SEER See Attachment A
Breast Cancer (1990+)
(prstat1-prstat10)
(prstatus)
2422 CS Schema 2 SEER See Attachment A
(cssch1-cssch10)
(csschema)
2424 CS Site-Specific Factor 3 SEER See Attachment A
8 (2004+)
(cs8st1-cs8st10)
(cs_ssf8)

2427 CS Site-Specific Factor 3 SEER See Attachment A
10 (2004+)
(cs10st1-cs10st10)
(cs_ssf10)
2430 CS Site-Specific Factor 3 SEER See Attachment A
11 (2004+)
(cs11st1-cs11st10)
(cs_ssf11)
2433 CS Site-Specific Factor 3 SEER See Attachment A
13 (2004+)
(cs13st1-cs13st10)
(cs_ssf13)
2436 CS Site-Specific Factor 3 SEER See Attachment A
15 (2004+)
(cs15st1-cs15st10)
(cs_ssf15)
2439 CS Site-Specific Factor 3 SEER See Attachment A
16 (2004+)
(cs16st1-cs16st10)
(cs_ssf16)
2442 Lymph-vascular 1 SEER See Attachment A
Invasion (2004+)
(vasinv1-vasinv10)
(vasinv)
2443 Survival Months 4 SEER See Attachment A
(srvm1-srvm10)
(srv_time_mon)
2447 Survival Months Flag 1 SEER See Attachment A
(srvmflag1-
srvmflag10)
(srv_time_mon_flag)
2448 Filler 5
2453 Insurance Recode 1 SEER See Attachment A
(2007+)
(insrecpb1-
insrecpb10)
(insrec_pub)
2454 Derived AJCC T 7th ed 3 SEER See Attachment A
(dajcc7t1-dajcc7t10)
(dajcc7t)
2457 Derived AJCC N 7th ed 3 SEER See Attachment A
(dajcc7n1-dajcc7n10)
(dajcc7n)
2460 Derived AJCC M 7th ed 3 SEER See Attachment A
(dajcc7m1-dajcc7m10)
(dajcc7m)
2463 Derived AJCC 7 Stage 3 SEER See Attachment A
Group
(dajcc7_01-dajcc7_10)
(dajcc7stg)
2466 Breast Adjusted AJCC 2 SEER See Attachment A
6th T (1988+)
(adjajc6t1-adjajc6t10)
(adjtm_6value)
2468 Breast Adjusted AJCC 2 SEER See Attachment A
6th N (1988+)
(adjajc6n1-adjajc6n10)
(adjnm_6value)
2470 Breast Adjusted AJCC 2 SEER See Attachment A
6th M (1988+)
(adjajc6m1-adjajc6m10)
(adjm_6value)
2472 Breast Adjusted AJCC 2 SEER See Attachment A
6th Stage (1988+)
(adjajc6_01-
adjajc6_10)
(adjajccstg)
2474 CS Site-Specific Factor 3 SEER See Attachment A
7 (cs7st1-cs7st10)
(cs7site)
2477 CS Site-Specific Factor 3 SEER See Attachment A
9 (cs9st1-cs9st10)
(cs9site)
2480 CS Site-Specific Factor 3 SEER See Attachment A
12 (cs12st1-cs12st10)
(cs12site)
2483 Derived HER2 Recode 1 SEER See Attachment A
(2010+)
(her2rec1-her2rec10)
(her2)

2484 Breast Subtype (2010+) 1 SEER See Attachment A
(brstsub1-brstsub10)
(brst_sub)
2485 Lymphoma Ann Arbor 1 SEER See Attachment A
Stage (1983+)
(annarbor1-
annarbor10)
(annarbor)
2486 CS Mets at DX-Bone 1 SEER See Attachment A
(2010+)
(csmetsdxb_pub1-
csmetsdxb_pub10)
(csmetsdxb_pub)
2487 CS Mets at DX-Brain 1 SEER See Attachment A
(2010+)
(csmetsdxbr_pub1-
csmetsdxbr_pub10)
(csmetsdxbr_pub)
2488 CS Mets at DX-Liver 1 SEER See Attachment A
(2010+)
(csmetsdxliv_pub1-
csmetsdxliv_pub10)
(csmetsdxliv_pub)
2489 CS Mets at DX-Lung 1 SEER See Attachment A
(2010+)
(csmetsdxlung_pub1-
csmetsdxlung_pub10)
(csmetsdxlung_pub)
2490 T value-based on AJCC 2 SEER See Attachment A
3rd (1988-2003)
(t_value1-t_value10)
(t_value)
2492 N value-based on 2 SEER See Attachment A
AJCC 3rd (1988-2003)
(n_value1-n_value10)
(n_value)
2494 M value-based on 2 SEER See Attachment A
AJCC 3rd (1988-2003)
(m_value1-m_value10)
(m_value)

2496 Diagnosis Date Flag (ddxflag1-ddxflag10) 2 SEER
    12 = Date is unknown
    Missing = A valid date is provided, or the date was not expected to have
        been transmitted
2498 Therapy date flag (dthflag1-dthflag10) 2 SEER
    10 = Unknown if therapy was administered
    11 = Therapy was not administered
    12 = Therapy was administered, but the date is unknown
    Missing = A valid date is provided, or was not expected to have been
        transmitted
2500 Month Therapy Started (monrx1-monrx10) (monthrx) 2 SEER
    Month of Therapy
    00 = No Therapy
    01-12 = Valid month
    blank = Unknown
2502 Year Therapy Started (yearrx1-yearrx10) (year_rx) 4 SEER
    Year of Therapy. If year is blank, refer to Therapy Date Flag (dthflag)
    for more information.
2506 Other Therapy (other_tx1-other_tx10) (othr_rx) 1 SEER
    0 = No other cancer-directed therapy
    1 = Other cancer-directed therapy
    2 = Other experimental cancer-directed therapy
    3 = Double blind study, code not yet broken
    6 = Unproven therapy (including Laetrile, Krebiozen, etc)
    7 = Refused therapy 1-3 above
    8 = Recommended, unknown if administered
    9 = Unknown
2507 ICD-O coding scheme (icdo1-icdo10) (icdover) 1 SEER
    2 = Originally coded in ICD-O-2 (1973-2000)
    3 = Originally coded in ICD-O-3 (2001+)
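The therapy month, year, and date flag are meant to be read together: a blank year sends the reader to the flag for the reason. The Python sketch below combines the three fields for one diagnosis. The names `TherapyStart`, `FLAG_NOTES`, and `therapy_start` are hypothetical, and the precedence used here (flag first, then month 00, then the date itself) is an assumption rather than a documented rule.

```python
from typing import NamedTuple, Optional

# Therapy date flag values as listed for dthflag.
FLAG_NOTES = {
    "10": "unknown if therapy was administered",
    "11": "therapy was not administered",
    "12": "therapy was administered, but the date is unknown",
}


class TherapyStart(NamedTuple):
    year: Optional[int]
    month: Optional[int]
    note: str


def therapy_start(monrx: str, yearrx: str, dthflag: str) -> TherapyStart:
    """Combine month, year, and flag for one diagnosis into a single reading."""
    flag, mon, yr = dthflag.strip(), monrx.strip(), yearrx.strip()
    if flag in FLAG_NOTES:          # assumption: a set flag overrides the date
        return TherapyStart(None, None, FLAG_NOTES[flag])
    if mon == "00":                 # 00 = No Therapy
        return TherapyStart(None, None, "no therapy")
    year = int(yr) if yr.isdigit() else None
    month = int(mon) if mon.isdigit() and 1 <= int(mon) <= 12 else None
    if year is None:
        note = "year missing"
    elif month is None:
        note = "month unknown"
    else:
        note = "date provided"
    return TherapyStart(year, month, note)
```

For instance, `therapy_start("03", "2009", "  ")` yields a year of 2009 and a month of 3.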

2508 NAPIIA Derived API Race (napiia1-napiia10) (napiia) 2 SEER
    The NAACCR Asian Pacific Islander Identification Algorithm version 1
    (NAPIIA v1) uses a combination of NAACCR variables to classify cases
    directly or indirectly as Asian Pacific Islander for analytic purposes.
    This version of the algorithm is focused on coding cases with a race code
    of Asian NOS (race code 96) to a more specific Asian race category, using
    the birthplace and name fields (first, last, and maiden names). Birthplace
    can be used to indirectly assign a specific race to one of eight Asian race
    groups (Chinese, Japanese, Vietnamese, Korean, Asian Indian, Filipino,
    Thai, and Cambodian). Names can be used to indirectly assign a specific
    race to one of seven Asian groups (Chinese, Japanese, Vietnamese, Korean,
    Asian Indian, Filipino, and Hmong). Future versions of NAPIIA will
    incorporate Pacific Islanders and will potentially incorporate name lists
    for Thai, Cambodian, and Laotians.

    Codes are the same as for Race/Ethnicity.

    Blanks in field mean the algorithm was not run.
2510 Primary Payer at DX (payer_dx1-payer_dx10) (payerdx) 2 SEER
    Primary Payer at Diagnosis identifies the patient's primary insurance
    carrier or method of payment at the time of initial diagnosis and/or
    treatment.

    01 = Not insured
    02 = Not insured, self pay
    10 = Insurance, NOS
    20 = Private Insurance: Managed care, HMO, or PPO
    21 = Private Insurance: Fee-for-Service
    31 = Medicaid
    35 = Medicaid - Administered through a Managed Care plan
    60 = Medicare/Medicare, NOS
    61 = Medicare with supplement, NOS
    62 = Medicare - Administered through a Managed Care plan
    63 = Medicare with private supplement
    64 = Medicare with Medicaid eligibility
    65 = TRICARE
    66 = Military
    67 = Veterans Affairs
    68 = Indian/Public Health Service
    99 = Insurance status unknown
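The payer codes lend themselves to a small lookup table. The Python sketch below copies the labels from the list above. `PAYER_DX`, `payer_label`, and `is_medicare` are hypothetical names, and grouping codes 60-64 as "Medicare" simply follows their labels rather than a documented recode.

```python
# Primary Payer at DX codes and labels as listed for payer_dx1-payer_dx10.
PAYER_DX = {
    "01": "Not insured",
    "02": "Not insured, self pay",
    "10": "Insurance, NOS",
    "20": "Private Insurance: Managed care, HMO, or PPO",
    "21": "Private Insurance: Fee-for-Service",
    "31": "Medicaid",
    "35": "Medicaid - Administered through a Managed Care plan",
    "60": "Medicare/Medicare, NOS",
    "61": "Medicare with supplement, NOS",
    "62": "Medicare - Administered through a Managed Care plan",
    "63": "Medicare with private supplement",
    "64": "Medicare with Medicaid eligibility",
    "65": "TRICARE",
    "66": "Military",
    "67": "Veterans Affairs",
    "68": "Indian/Public Health Service",
    "99": "Insurance status unknown",
}


def payer_label(code: str) -> str:
    """Return the documented label, or a marker for codes not in the list."""
    return PAYER_DX.get(code.strip(), "Undocumented code")


def is_medicare(code: str) -> bool:
    """True for the codes whose label begins with 'Medicare' (60-64)."""
    return code.strip() in {"60", "61", "62", "63", "64"}
```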


Codes for the variables:

-Cause of Death to site recode KM (codkm)

-Cause of Death to site recode (codpub)

00000 = Alive
20010 = Lip
20020 = Tongue
20030 = Salivary Gland
20040 = Floor of Mouth
20050 = Gum and Other Mouth
20060 = Nasopharynx
20070 = Tonsil
20080 = Oropharynx
20090 = Hypopharynx
20100 = Other Oral Cavity and Pharynx
21010 = Esophagus
21020 = Stomach
21030 = Small Intestine
21040 = Colon excluding Rectum
21050 = Rectum and Rectosigmoid Junction
21060 = Anus, Anal Canal and Anorectum
21071 = Liver
21072 = Intrahepatic Bile Duct
21080 = Gallbladder
21090 = Other Biliary
21100 = Pancreas
21110 = Retroperitoneum
21120 = Peritoneum, Omentum and Mesentery
21130 = Other Digestive Organs
22010 = Nose, Nasal Cavity and Middle Ear
22020 = Larynx
22030 = Lung and Bronchus
22050 = Pleura
22060 = Trachea, Mediastinum and Other Respiratory Organs
23000 = Bones and Joints
24000 = Soft Tissue including Heart
25010 = Melanoma of the Skin
25020 = Other Non-Epithelial Skin
26000 = Breast
27010 = Cervix Uteri
27020 = Corpus Uteri
27030 = Uterus, NOS
27040 = Ovary
27050 = Vagina
27060 = Vulva
27070 = Other Female Genital Organs

28010 = Prostate
28020 = Testis
28030 = Penis
28040 = Other Male Genital Organs
29010 = Urinary Bladder
29020 = Kidney and Renal Pelvis
29030 = Ureter
29040 = Other Urinary Organs
30000 = Eye and Orbit
31010 = Brain and Other Nervous System
32010 = Thyroid
32020 = Other Endocrine including Thymus
33010 = Hodgkin Lymphoma
33040 = Non-Hodgkin Lymphoma
34000 = Myeloma
35011 = Acute Lymphocytic Leukemia
35012 = Chronic Lymphocytic Leukemia
35013 = Other Lymphocytic Leukemia
35021 = Acute Myeloid Leukemia
35031 = Acute Monocytic Leukemia
35022 = Chronic Myeloid Leukemia
35023 = Other Myeloid/Monocytic Leukemia
35041 = Other Acute Leukemia
35043 = Aleukemic, Subleukemic and NOS
36010 = Mesothelioma (ICD-10 only) (not in CODPUB)
36020 = Kaposi Sarcoma (ICD-10 only) (not in CODPUB)
37000 = Miscellaneous Malignant Cancer
38000 = In situ, benign or unknown behavior neoplasm
50000 = Tuberculosis
50010 = Syphilis
50030 = Septicemia
50040 = Other Infectious and Parasitic Diseases including HIV
50050 = Diabetes Mellitus
50051 = Alzheimer's (ICD-9 and 10 only)
50060 = Diseases of Heart
50070 = Hypertension without Heart Disease
50080 = Cerebrovascular Diseases
50090 = Atherosclerosis
50100 = Aortic Aneurysm and Dissection
50110 = Other Diseases of Arteries, Arterioles, Capillaries
50120 = Pneumonia and Influenza
50130 = Chronic Obstructive Pulmonary Disease and Allied Cond
50140 = Stomach and Duodenal Ulcers
50150 = Chronic Liver Disease and Cirrhosis
50160 = Nephritis, Nephrotic Syndrome and Nephrosis
50170 = Complications of Pregnancy, Childbirth, Puerperium

50180 = Congenital Anomalies
50190 = Certain Conditions Originating in Perinatal Period
50200 = Symptoms, Signs and Ill-Defined Conditions
50210 = Accidents and Adverse Effects
50220 = Suicide and Self-Inflicted Injury
50230 = Homicide and Legal Intervention
50300 = Other Cause of Death
41000 = State DC not available or state DC available but no COD
99999 = Unknown/missing/invalid COD
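The code list above falls into a few numeric blocks, which makes a coarse grouping straightforward. The Python sketch below groups a codkm or codpub value by block. The group labels and function names are assumptions drawn from reading the list, not an official SEER grouping, and the range checks do not verify that an individual code actually appears in the list.

```python
# Codes flagged in the list as "(not in CODPUB)".
CODKM_ONLY = {"36010", "36020"}


def cod_group(code: str) -> str:
    """Coarse grouping of a 5-digit cause of death recode by numeric block."""
    value = int(code)
    if value == 0:
        return "alive"
    if 20010 <= value <= 37000:
        return "malignant cancer"
    if value == 38000:
        return "in situ, benign or unknown behavior neoplasm"
    if value == 41000:
        return "death certificate unavailable or no COD"
    if 50000 <= value <= 50300:
        return "non-cancer cause"
    if value == 99999:
        return "unknown"
    return "undocumented"


def listed_for_codpub(code: str) -> bool:
    """False for the two codes the list marks as not used in codpub."""
    return code not in CODKM_ONLY
```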

Documentation for the Patient Entitlement and Diagnosis Summary File (PEDSF)

April 24, 2015

Note:

Rural/Urban Continuum as Defined in the 2004 ARF file


(urban/rural code)

The 2003 Rural/Urban Continuum Codes are from Economic Research Service (ERS),
Department of Agriculture. The codes form a classification scheme that distinguishes
metropolitan (metro) counties by the population size of their metro area and nonmetropolitan
(nonmetro) counties by degree of urbanization and adjacency to a metro area or areas.
All U.S. counties and county equivalents are grouped according to the official metro status
announced by the Office of Management and Budget (OMB) in June 2003, when the population
and worker commuting criteria used to identify metro counties were applied to results of the 2000
Census.

Metro counties are distinguished by population size of the Metropolitan Statistical Area of
which they are part. Nonmetro counties are classified according to the aggregate size of their
urban population. Within the three urban size categories, nonmetro counties are further identified
by whether or not they have some functional adjacency to a metro area or areas. A nonmetro
county is defined as adjacent if it physically adjoins one or more metro areas, and has at least 2
percent of its employed labor force commuting to central metro counties. Nonmetro counties that
do not meet these criteria are classed as nonadjacent.
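
The adjacency rule described above combines two conditions. It can be sketched as follows; the function and parameter names are hypothetical, and the inputs (a physical-adjacency flag and commuting counts) are assumed to be available from some other source, since they are not fields on this file.

```python
def is_adjacent(adjoins_metro_area: bool,
                commuters_to_central_metro: int,
                employed_labor_force: int) -> bool:
    """Return True if a nonmetro county meets both adjacency criteria."""
    # Criterion 1: the county must physically adjoin one or more metro areas.
    if not adjoins_metro_area:
        return False
    # Guard against an empty labor force (assumption: treat as nonadjacent).
    if employed_labor_force <= 0:
        return False
    # Criterion 2: at least 2 percent of the employed labor force commutes
    # to central metro counties. Integer arithmetic avoids rounding issues.
    return commuters_to_central_metro * 100 >= 2 * employed_labor_force
```

A county that adjoins a metro area but sends only 1 percent of its workers to central metro counties would be classed as nonadjacent under this sketch.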

In concept, the 2003 version of the Rural-Urban Continuum Codes is comparable with that of
earlier decades. However, OMB made major changes in its metro area delineation procedures
for the 2000 Census, and the Census Bureau changed the way in which rural and urban are
measured. Therefore, the new Rural-Urban Continuum Codes are not fully comparable with
those of earlier years. OMB's changes added some additional metro areas by no longer requiring
that a metro area must have at least 100,000 population if its urbanized area has no place of at
least 50,000 people. More importantly, simplifying the worker commuting criteria that determine
outlying metro counties had the effect of adding numerous new outlying counties to metro
status while deleting a smaller number that were previously metro.

The Census Bureau made a radical shift in determining rural-urban boundaries by changing
and liberalizing the procedures for delineating urbanized areas of 50,000 or more people, and
abandoning place boundaries in measuring urban or rural population. The procedures used in
defining Urbanized Areas were extended down to clusters of 2,500 or more people, based solely
on population density per square mile.

In earlier versions of the Rural-Urban Continuum Codes, metro areas with 1 million
population or more were subdivided between central counties (Code 0) and fringe counties (Code
1). The Code 1 group has become much less meaningful in the last two censuses as more and
more counties of large metro areas have been rated as central counties by OMB procedures. In
2000, only 1.6 percent of the population of large metro areas was in fringe counties. Therefore,
this distinction has been dropped. Codes 0 and 1 have been combined, and the new code 1
represents all counties in metro areas of 1 million or more population.
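
When working with data coded under an earlier version of the scheme, the merging of Codes 0 and 1 can be sketched as below. The function name `to_2003_code` is hypothetical. Passing codes 2 through 9 through unchanged is an assumption made for illustration; as noted above, the 2003 codes are not fully comparable with those of earlier years, so this aligns the labels only and does not make the underlying classifications equivalent.

```python
def to_2003_code(earlier_code: int) -> int:
    """Map an earlier-version continuum code onto the 2003 numbering."""
    # Codes 0 (central) and 1 (fringe) of large metro areas were combined.
    if earlier_code in (0, 1):
        return 1
    # Assumption: the remaining codes keep their numbers.
    if 2 <= earlier_code <= 9:
        return earlier_code
    raise ValueError(f"not a valid continuum code: {earlier_code}")
```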

The 2003 Rural/Urban Continuum Codes are defined as follows:

CODE METROPOLITAN COUNTIES (1-3)


01 Counties in metro areas of 1 million population or more
02 Counties in metro areas of 250,000 - 1,000,000 population
03 Counties in metro areas of fewer than 250,000 population

CODE NONMETROPOLITAN COUNTIES (4-9)


04 Urban population of 20,000 or more, adjacent to a metro area
05 Urban population of 20,000 or more, not adjacent to a metro area
06 Urban population of 2,500-19,999, adjacent to a metro area
07 Urban population of 2,500-19,999, not adjacent to a metro area
08 Completely rural or less than 2,500 urban population, adjacent to a metro area
09 Completely rural or less than 2,500 urban population, not adjacent to a metro area
99 Missing Value
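
The code table above can be held in a simple lookup when decoding the urban/rural field. This is a sketch under stated assumptions: the names `RUCC_2003` and `metro_status` are hypothetical, and the field is assumed to arrive as a text value that may or may not carry its leading zero.

```python
# Labels copied from the 2003 Rural/Urban Continuum Code table.
RUCC_2003 = {
    "01": "Counties in metro areas of 1 million population or more",
    "02": "Counties in metro areas of 250,000 - 1,000,000 population",
    "03": "Counties in metro areas of fewer than 250,000 population",
    "04": "Urban population of 20,000 or more, adjacent to a metro area",
    "05": "Urban population of 20,000 or more, not adjacent to a metro area",
    "06": "Urban population of 2,500-19,999, adjacent to a metro area",
    "07": "Urban population of 2,500-19,999, not adjacent to a metro area",
    "08": "Completely rural or less than 2,500 urban population, "
          "adjacent to a metro area",
    "09": "Completely rural or less than 2,500 urban population, "
          "not adjacent to a metro area",
    "99": "Missing Value",
}


def metro_status(code: str) -> str:
    """Classify a continuum code as 'metro', 'nonmetro', or 'missing'."""
    key = code.strip().zfill(2)  # accept "7" as well as "07"
    if key not in RUCC_2003:
        raise ValueError(f"not a 2003 continuum code: {code!r}")
    if key == "99":
        return "missing"
    # Codes 1-3 are metropolitan; codes 4-9 are nonmetropolitan.
    return "metro" if int(key) <= 3 else "nonmetro"
```

For instance, `metro_status("02")` gives `"metro"` and `metro_status("7")` gives `"nonmetro"`; the full label is available as `RUCC_2003["07"]`.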
