4.2. Individual Characteristics Files - Residential Geography

4.2.1. Overview

This file contains annual records with residential geography for individuals found in the wage data (workers). All records from the source tables are kept for workers, though workers may not have residential geography or jobs in every year. Information is available beginning in 1999. The target reference date for the residence is considered to be April 1 of the year.

  • CPR-Sourced Residence (ICF_US_RESIDENCE_CPR)

    For 1999-2010, residential geography is sourced from the Composite Person Record (CPR) file. Though the formal CPR was not available in 2011, locations for that year are included in this table using an alternate data source.

    • Scope: National

    • Key: PIK ADDRESS_YEAR

  • RCF-Sourced Residence (ICF_US_RESIDENCE_RCF)

    For 2012 onward, residential geography is sourced from the Residence Candidates File (RCF).

    • Scope: National

    • Key: PIK ADDRESS_YEAR

4.2.2. User Guidance

Composite Person Record (CPR)

The CPR was the original source of residence data used in LEHD. The CPR file series, which begins in 1999, contains fields that provided a linkage between a unique person record and a place of residence for each year. The CPR was produced until the file was discontinued in 2011 (after the production of data year 2010). In the following year, the MAF-ARF (Master Address File-Auxiliary Reference File) was used as a replacement for the CPR for data year 2011. The MAF-ARF was found to differ from the CPR in a number of ways, including a difference in coverage and a lack of deduplication among PIKs. LEHD was able to produce a deduplicated version of the MAF-ARF by defining some very basic business rules. Some additional history of the CPR and MAF-ARF can be found in Graham et al. [2017].

The county codes on the CPR (five-digit state+county FIPS code) are contemporaneous. The set of valid county codes and the boundaries of the counties can change over time. For more information on major county changes over time, see the Census Bureau’s County Changes page.

Residence Candidates File (RCF)

The RCF combines a set of federal administrative source files containing residence information for a person at a time into a file with preference weights for each person/location by reference period and with no remaining source information. The complete methodology is described in Graham et al. [2017].

This extract uses only the most preferred location for each PIK from each year’s RCF, so that there is only one record per PIK-year in the ICF.

If additional geographical information is needed (beyond the state, county, and census tract provided in this extract), researchers can lookup a records MAFID in the MAF/TIGER Extract (MAFX). The MAFX is not part of the LEHD Snapshot and must be requested separately for projects. The most recent MAFX will contain the vast majority of MAFIDs found in this extract; however, in some cases researchers may need to find individual MAFIDs in older MAFX files.

4.2.3. Codebook: The ICF_US_RESIDENCE_CPR File

Table Metadata for Residence Geography, 1999-2011 (ICF_US_RESIDENCE_CPR)

Access Requirements for ICF_US_RESIDENCE_CPR
State Approval Required IRS Approval Required SSA Approval Required
Access Requirements
Description

Residence geography prior to 2012 for individuals found in the wage data.

Scope

National

Key

PIK ADDRESS_YEAR

Sort Order

PIK

File Format

SAS Data Table

Download Codebook

CSV

Variable Information

Variable Information for ICF_US_RESIDENCE_CPR
Variable Name Type Length Description
PIK char 9 PIK - Protected Identification Key
ADDRESS_YEAR num 4 Year address is found on source data
GEOCODEFULL char 15 FIPS State(2) ||FIPS County(3) ||Tract(6)||Block(4)
LATITUDE_LIVE num 8 Latitude of residence, 6 implied decimal places
LONGITUDE_LIVE num 8 Longitude of residence, 6 implied decimal places

FLAG_LATLONG

num 3

Flag quality of latitude/longitude of residence (See details below)

Details for variable FLAG_LATLONG on ICF_US_RESIDENCE_CPR

Back

Description

Flag quality of latitude/longitude of residence

Codebook

Download as CSV

Value Label
-1 Lat/Long quality not available (2011 data)
1 Location interpolated from house number
2 House number outside of road segment address range; snapped to end of range
3 Complex house number: location interpolated from house number
4 Missing house number: location interpolated from other number in address
5 Used midpoint of road segment
6 Location based upon match to ZCTA
7 Location based upon match to county

4.2.4. Codebook: The ICF_US_RESIDENCE_RCF File

Table Metadata for Residence Geography, 2012 forward (ICF_US_RESIDENCE_RCF)

Access Requirements for ICF_US_RESIDENCE_RCF
State Approval Required IRS Approval Required SSA Approval Required
Access Requirements
Description

Residence geography from 2012 forward for individuals found in the wage data.

Scope

National

Key

PIK ADDRESS_YEAR

Sort Order

PIK

File Format

SAS Data Table

Download Codebook

CSV

Variable Information

Variable Information for ICF_US_RESIDENCE_RCF
Variable Name Type Length Description
PIK char 9 Protected Indentification Key PIK
ADDRESS_YEAR num 4 Year address is found on source data

COUNTY_LIVE

char 5

Current state/county FIPS as of year indicated in year variable - concatenates CURSTATE and CURCOUNTY (See details in appendix)

TRACT char 6 Current Census Tract
MAFID char 9 MAFID from best source
LATITUDE_LIVE num 8 Internal point latitude, 6 implied decimals
LONGITUDE_LIVE num 8 Internal point longitude, 6 implied decimals

FLAG_RCF

num 3

Flag for geographic precision (See details below)

Details for variable FLAG_RCF on ICF_US_RESIDENCE_RCF

Back

Description

Flag for geographic precision

Codebook

Download as CSV

Value Label
11 State/county and tract from BMF in current geography
12 State/county from current MAF, no precise GEOID
13 MAF record in earlier tab geography, county from BMF valid in current geography
14 MAF record in earlier tab geography, county from BMF not valid in current geography
15 MAF record in earlier tab geography, no GEOID, MAF county valid in current geography
16 MAF record in earlier tab geography, no GEOID, MAF county not valid in current geography