****** LEHD Public Use Schema V4.3.1 - File and Directory Naming Convention ****** Lars Vilhuber <> 30 October 2018 Table of Contents JavaScript must be enabled in your browser to display the table of contents. ( Printable_version ) ***** 1. Purpose ***** The public-use data from the Longitudinal Employer-Household Dynamics Program, including the Quarterly Workforce Indicators (QWI) and Job-to-Job Flows (J2J), are available for download with the following data schema. These data are available as Comma-Separated Value (CSV) files through the LEHD website’s Data page at http://lehd.ces.census.gov/data/ and through LED Extraction Tool at http://ledextract.ces.census.gov/. This document describes the file and directory naming schema for LEHD-provided CSV files. ***** 2. Schema for Data File Contents ***** The contents (schema) for LEHD data files are described in lehd_public_use_schema.html. The contents (schema) for shapefiles are described in lehd_shapefiles.html. ***** 3. Extends ***** This version modifies a portion of the structure of the metadata. Many files compliant with LEHD or QWI Schema v4.2 or lower will also be compliant with this schema, but compatibility is not guaranteed. ***** 4. Supersedes ***** This version supersedes V4.2, for files released as of R2018Q3 ***** 5. Basic Filename Schema ***** All files are preceded by a file type definition, followed by additional information on the aggregation level of the file, or some other identifier. [TYPE]_[DETAILS].[EXT] ( naming_convention.csv ) type Used By Description [type]_[id] qwix CSV files queried from the LED Extraction Tool at http:// ledextract.ces.census.gov/. [type]_[geohi]_[demo]_ qwi qwir qwiv qwivr j2j Common convention for [fas]_[geocat]_[indcat]_ j2jr j2jod currently provided files. [ownercat]_[sa] lehd_shp_[geocat].zip shapefiles ZIP files containing data for shapefiles version\_[type].txt metadata Machine-readable text files with metadata on geographic and temporal coverage j2jod_[geography]_avail.csv metadata Complex metadata (used only in J2JOD) - see below **** 5.1. QWIPU from the LED Extraction Tool **** Files downloaded through the LED Extraction Tool at http:// ledextract.ces.census.gov/ follow the following naming convention: [type]_[id].[EXT] where [id] is the Request ID (a unique string of characters) generated every time Submit Request'' is clicked. The ID references each query submission made to the database. 5.2. Metadata for QWI and J2J data files (version.txt) Metadata accompanies the data files, identifying provenance, geographic and temporal coverage. These files follow the following naming convention: version_[type].txt where each name component is described in more detail below. 5.3. Metadata for J2JOD files Because the origin-destination (J2JOD) data link two regions, we provide an auxiliary file with the time range for which cells containing data for each geographic pairing may appear in a data release. The reference region will always be either the origin or the destination. These files follow the following naming convention: j2jod_[geography]_avail.csv where each name component is described in more detail below. 6. Directory Paths Downloadable data files are organized on the download server (at the time of this writing: https://lehd.ces.census.gov/data/) in directories organized as follows: SERVER/data/[PRODUCT]/[RELEASE]/[GEOHI][ /[TYPE] ]/{files} where [PRODUCT], [RELEASE], [GEOHI], and [TYPE] are each described below. [TYPE] is optional. 7. Description of Naming Components 7.1. Release A release is defined by the calendar year quarter when data production occurs. It is thus generically constructed as R[YEAR]Q[QUARTER] where [YEAR] is the 4-digit year and [QUARTER] the single-digit calendar year quarter (1-4). 7.2. Types and Products ( naming_type.csv ) type product Description Explanation url j2j j2j Job-to-Job Flows Counts of hires and http:// Counts (J2J) separations, to and lehd.ces.census.gov/ from employment and data/#j2j non-employment, summed across all establishments j2jod j2j Job-to-Job Flows Origin and http:// Origin-Destination destination flows lehd.ces.census.gov/ and earnings, data/#j2j tabulated by the destination establishment j2jr j2j Job-to-Job Flows Hiring and http:// Rates separation rates, to lehd.ces.census.gov/ and from employment data/#j2j and nonemployment, summed across all establishments lodes lodes LEHD Origin- LEHD Origin- http:// Destination Destination lehd.ces.census.gov/ Employment Employment data/#lodes Statistics (LODES) Statistics (LODES) used by OnTheMap. Version 7 and higher of LODES was enumerated by 2010 census blocks. Version 6 and lower of LODES were enumerated with 2000 census blocks. qwi qwi Quarterly Workforce The Quarterly http:// Indicators Public Workforce Indicators lehd.ces.census.gov/ Use Counts (QWIPU) (QWI) are a set of data/#qwi economic indicators including employment, job creation, earnings, and other measures of employment flows. The QWI are reported using detailed firm characteristics (geography, industry, age, size) and worker demographics information (sex, age, education, race, ethnicity). qwir qwi Quarterly Workforce Rates for flow tbd Indicators Public variables in the Use Rates Quarterly Workforce Indicators (QWI) qwirv qwi Quarterly Workforce Variability measures tbd Indicators for rates in the Variability Measures Quarterly Workforce for Rates Indicators (QWI) qwiv qwi Quarterly Workforce Variability measures tbd Indicators for Quarterly Variability Measures Workforce Indicators (QWI) lehd_shp all LEHD Shapefiles Shapefiles are used lehd_shapefiles.html to provide mapping functionality in QWI Explorer and Job-to- Job Explorer 7.3. geohi ( naming_geohi.csv ) The geohi component in filenames is based on one of two possible code sets: * the lower-case alphabetic FIPS or postal state code based on FIPS_PUB_5- 2 (see also label_stusps.csv for upper-case variants). * the numeric CBSA code corresponding to the metropolitan areas (see label_geography_metro.csv) * It is expanded to encompass additional codes denoting national coverage, or a collection of states. For directories, files at the CBSA level are collected under a single metro directory. type Description all Collection of all available states us National data (50 states + DC) metro Indicates collection of CBSA-level files (directory names only) [st] Any legal 2-character lower-case state postal code [NNNNN] Any valid CBSA-derived code listed in label_geography_metro.csv **** 7.4. demo **** ( naming_demo.csv ) type Description d No demographic detail rh Race by Ethnicity tabulations sa Sex by Age tabulations se Sex by Education tabulations sarhe Collection of sa-rh-se tabulations **** 7.5. fas **** ( naming_fas.csv ) type Description f No firm size or age detail fa Tabulations by firm age fs Tabulations by firm size **** 7.6. geocat **** ( naming_geocat.csv ) type Description gb Metropolitan (complete) gc Counties gm Metropolitan/Micropolitan (state parts) gn National (50 States + DC) gs States gw Workforce Investment Areas **** 7.7. indcat **** ( naming_indcat.csv ) type Description n all industries ns NAICS sectors n3 NAICS subsectors n4 NAICS industry groups **** 7.8. owncat **** ( naming_owncat.csv ) type Description of Federal government (QWI Code A01) op All Private (QWI Code A05) oslp State, local, and private ownership categories (QWI Code A00) **** 7.9. sa **** ( naming_sa.csv ) type Description s Seasonally adjusted u Not seasonally adjusted **** 7.10. ext **** ( naming_ext.csv ) type Description csv Comma-Separated file pdf PDF file (printable) xls Excel file (only select output) xlsx Excel file (not available for larger files) 8. Changes For a description of how schema files are versioned, see main_directory. **** 8.1. This version (revisions) **** * 2018-10-30: Initial release **** 8.2. Version 4.3.1 from 4.3.0 **** * 2018-10-30: Due to a change of the geographic boundaries for a WIB in Iowa, the shape files need to be updated. * Adjust the "supersedes" definition to correctly reference the prior version. **** 8.3. Version 4.3.0 from 4.2.0 **** * Changed the format of the version.txt file. This does not affect data files, but it does affect the metadata, and thus requires an increase in the minor version. * Changed the description of the download locations of the QWI files, as part of the "naming" schema. [Important] Important Some of the data products noted above do not exist yet. This revision: Tue Oct 30 14:24:35 EDT 2018 =============================================================================== Last updated 2018-10-30 14:24:35 EDT