Important
This document is not an official Census Bureau publication. It is compiled from publicly accessible information by Lars Vilhuber (Labor Dynamics Institute, Cornell University). Feedback is welcome. Please write us at lars.vilhuber@cornell.edu. |
1. Scope
The public-use data from the Longitudinal Employer-Household Dynamics Program, including the Quarterly Workforce Indicators (QWI) and Job-to-Job Flows (J2J), are available for download according to structural and file naming schema. The data themselves are available as Comma-Separated Value (CSV) files through the LEHD website’s Data page at http://lehd.ces.census.gov/data/ as well as through the LED Extraction Tool.
Shapefiles are used to provide mapping functionality in QWI Explorer and Job-to-Job Explorer (Beta). They are created by transforming input shapefiles sourced from TIGER/Line. New TIGER/Line shapefiles are typically released by the Census Bureau’s Geography Division in August of each year, which are then processed by the LEHD program as new tabulation areas for the QWI[http://lehd.ces.census.gov/data/#qwi] and J2J[http://lehd.ces.census.gov/data/#j2j] data products. The LEHD shapefiles will be made available in the data schema in coordination with the public release of QWI and J2J data products, usually in November or December of each year.
2. Sources
Files are derived from TIGER/Line 2016 shapefiles:
-
tl_2016_(ST)_place (for creation of WIA/WIB shapefile)
-
tl_2016_(ST)_cousub (for creation of WIA/WIB shapefile)
3. Transformations
The following major transformations are applied to the input files:
-
All geographies are reprojected to WGS-1984 Geographic Coordinate System
-
Shoreline water has been clipped out to provide a more recognizable depiction of the coastlines.
-
Each layer is given internal point coordinates (stored as double) based on the WGS-1984 projection (decimal degrees).
-
Each layer is run through a "simplify polygon" procedure to remove unnecessary complexity from the features.
-
Features from Guam, American Samoa, and the Northern Mariana Islands have been removed because they are not used in current LEHD tabulations.
-
Each shapefile’s attribute table has been updated to conform to the standard LEHD output format, defined in Format section
4. Outputs
Output shapefiles – grouped by paired products – are listed below. Each shapefile includes specific notes on its preparation.
5. Basic Naming Schema
All files follow the following naming convention:
[type]]_[geocat].zip
where [type] = lehd_shp and geocat contains
type | Description |
---|---|
gb |
Metropolitan (complete) |
gc |
Counties |
gm |
Metropolitan/Micropolitan (state parts) |
gn |
National (50 States + DC) |
gs |
States |
gw |
Workforce Investment Areas |
5.1. FORMAT
Files are distributed as ESRI Shapefiles, packaged as ZIP files. The SHP component of these archives is described here. Other components (dbf, prj, shx) files are not documented here, we refer users to http://www.digitalpreservation.gov/formats/fdd/fdd000280.shtml .
column | label | description | type |
---|---|---|---|
STUSPS |
State USPS code |
FIPS State Postal Code as per https://www.census.gov/geo/reference/codes/cou.html |
string |
GEOGRAPHY |
Nationally unique identifier |
Derived from Nationally Unique Federal Information Processing Series (FIPS) Code as per https://www.census.gov/geo/reference/ansi.html (see notes) |
string |
NAME |
Feature Name |
Full Census Name of Geography Feature |
string |
LABEL |
Feature Label |
Shorter Census Name of Geography Feature for Thematic Mapping |
string |
INTPTLAT |
Internal Point Latitude |
Internal Point Latitude in WGS-1984 Decimal Degrees as per http://spatialreference.org/ref/epsg/wgs-84/ |
double |
INTPTLON |
Internal Point Longitude |
Internal Point Longitude in WGS-1984 Decimal Degrees as per http://spatialreference.org/ref/epsg/wgs-84/ |
double |
5.2. Values
5.2.1. STUSPS
( label_stusps.csv )
FIPS State Postal Code as per https://www.census.gov/geo/reference/codes/cou.html
5.2.2. GEOGRAPHY
( label_geography.csv ) The valid codes correspond to those listed on label_geography.csv and label_geography_cbsa.csv.
5.2.3. NAME
This is a string that corresponds in general to the label field on label_geography.csv and label_geography_cbsa.csv. Minor deviations for ease of exposition are possible.
5.3. Common files
5.3.1. State
( lehd_shp_gs.zip )
No transformations occur to this layer other than those listed above.
5.4. QWI Geographies
5.4.1. County
( lehd_shp_gc.zip )
-
STUSPS is appended to the NAME field so that county names are nationally unique. Example: "Cook, IL"
5.4.2. CBSA - within State
( lehd_shp_gm.zip )
-
All features are split into state-specific CBSA features by intersecting each feature with the state shapefile features.
-
The STUSPS field is added during the intersect with the state shapefile.
-
STFIPS (i.e. FIPS State Code as per https://www.census.gov/geo/reference/ansi_statetables.html) is prepended to the CBSA code (https://www.census.gov/population/metro/data/def.html) to create the GEOGRAPHY field to distinguish state-parts of the same CBSA (i.e. make them nationally unique).
-
The text "([STUSPS] part)" is appended to the NAME field only for those CBSA features that are split by state lines.
5.4.3. Workforce Investment Board Areas
( lehd_shp_gw.zip )
The WIA/WIB shapefiles are built from the Place, County Subdivision, and County shapefiles from TIGER/Line based on definitions provided by the LED state partners.
5.5. Job-to-Job Flow Geographies
5.5.1. Metropolitan (complete)
( lehd_shp_gb.zip )
-
Micropolitan areas are removed and state remainder areas are added as new features. State remainders are assigned unique codes ([STUSPS]+999) and names ("Not in metropolitan area, [STUSPS]").
6. Versioning
Versioning rules follow Semantic Versioning V2.0.0, which states that
Given a version number MAJOR.MINOR.PATCH, increment the:
-
MAJOR version when you make incompatible API changes,
-
MINOR version when you add functionality in a backwards-compatible manner, and
-
PATCH version when you make backwards-compatible bug fixes.