= LEHD Public Use Data Schema V4.0.4 Lars Vilhuber 15 March 2016 // a2x: --dblatex-opts "-P latex.output.revhistory=0 --param toc.section.depth=3" ( link:QWIPU_Data_Schema.pdf[Printable version] ) [IMPORTANT] .Important ============================================== This document is not an official Census Bureau publication. It is compiled from publicly accessible information by Lars Vilhuber (http://www.ilr.cornell.edu/ldi/[Labor Dynamics Institute, Cornell University]). Feedback is welcome. Please write us at link:mailto:lars.vilhuber@cornell.edu?subject=LEHD_Schema_v4[lars.vilhuber@cornell.edu]. ============================================== Purpose ------- The public-use Quarterly Workforce Indicators (QWI) data from the Longitudinal Employer-Household Dynamics Program are available for download with the following data schema. These data are available as Comma-Separated Value (CSV) files through the LEHD website’s Data page at http://lehd.ces.census.gov/data/ and through LED Extraction Tool at http://ledextract.ces.census.gov/. This document describes the data schema for QWI files. For each variable, a set of allowable values is defined. Definitions are provided as CSV files, with header variable definitions. Changes relative to the original v4.0 version are listed <>. File naming ----------- The naming conventions of the data files is documented in link:lehd_csv_naming.html[]. Extends ------- This version extends v4.0. Any file compliant with LEHD or QWI Schema v4.0 will also be compliant with this schema. Basic Schema ------------ Each data file is structured as a CSV file. The first columns contain <>, subsequent columns contain <>, followed by <>. === Generic structure [width="30%",format="csv",cols="<2",options="header"] |=================================================== Column name [ Identifier1 ] [ Identifier2 ] [ Identifier3 ] [ ... ] [ Indicator 1 ] [ Indicator 2 ] [ Indicator 3 ] [ ... ] [ Status Flag 1 ] [ Status Flag 2 ] [ Status Flag 3 ] [ ... ] |=================================================== Note: A full list of indicators for each type of file are shown below in the <> section. While all indicators are included in the CSV files, only the requested indicators will be included in data outputs from the LED Extraction Tool. <<< === [[identifiers]]Identifiers Records, unless otherwise noted, are parts of time-series data. Unique record identifiers are noted below, by file type. Identifiers without the year and quarter component can be considered a series identifier. ==== Identifiers for qwi ( link:lehd_identifiers_qwi.csv[] ) [width="100%",format="csv",cols="2*^1,<3",options="header"] |=================================================== include::lehd_identifiers_qwi.csv[] |=================================================== <<< <<< === [[indicators]]Indicators The following tables and associated mapping files list the indicators available on each file. The ''Indicator Variable'' is the short name of the variable on the CSV files, suitable for machine processing in a wide variety of statistical applications. When given, the ''Alternate name'' may appear in related documentation and articles. The ''Status Flag'' is used to indicate publication or data quality status (see <>). The ''Indicator Name'' is a more verbose description of the indicator. ( link:variables_qwipu.csv[variables_qwipu.csv] ) [width="95%",format="csv",cols="3*^2,<5",options="header"] |=================================================== include::variables_qwipu.csv[] |=================================================== <<< == [[catvars]]Categorical Variables Categorical variable descriptions are displayed above each table, with the variable name shown in parentheses. Unless otherwise stated, every possible value/label combination for each categorical variable is listed. Please note that not all values will be available in every table. === agegrp ( link:label_agegrp.csv[] ) [width="60%",format="csv",cols="^1,<4",options="header"] |=================================================== include::label_agegrp.csv[] |=================================================== === education ( link:label_education.csv[] ) [width="60%",format="csv",cols="^1,<4",options="header"] |=================================================== include::label_education.csv[] |=================================================== === ethnicity ( link:label_ethnicity.csv[] ) [width="60%",format="csv",cols="^1,<4",options="header"] |=================================================== include::label_ethnicity.csv[] |=================================================== === firmage ( link:label_firmage.csv[] ) [width="60%",format="csv",cols="^1,<4",options="header"] |=================================================== include::label_firmage.csv[] |=================================================== === firmsize ( link:label_firmsize.csv[] ) [width="60%",format="csv",cols="^1,<4",options="header"] |=================================================== include::label_firmsize.csv[] |=================================================== === ownercode ( link:label_ownercode.csv[] ) [width="60%",format="csv",cols="^1,<4",options="header"] |=================================================== include::label_ownercode.csv[] |=================================================== === periodicity ( link:label_periodicity.csv[] ) [width="60%",format="csv",cols="^1,<4",options="header"] |=================================================== include::label_periodicity.csv[] |=================================================== === race ( link:label_race.csv[] ) [width="60%",format="csv",cols="^1,<4",options="header"] |=================================================== include::label_race.csv[] |=================================================== === seasonadj ( link:label_seasonadj.csv[] ) [width="60%",format="csv",cols="^1,<4",options="header"] |=================================================== include::label_seasonadj.csv[] |=================================================== === sex ( link:label_sex.csv[] ) [width="60%",format="csv",cols="^1,<4",options="header"] |=================================================== include::label_sex.csv[] |=================================================== <<< === Industry === [[ind_level]] ==== Industry levels ( link:label_ind_level.csv[] ) [width="60%",format="csv",cols="^1,<4",options="header"] |=================================================== include::label_ind_level.csv[] |=================================================== ==== Industry ( link:label_industry.csv[] ) Only a small subset of available values shown. The 2012 NAICS (North American Industry Classification System) is used for all years. QWI releases prior to R2015Q3 used the 2007 NAICS classification (see (../V4.0.1)[Schema v4.0.1]). For a full listing of all valid 2012 NAICS codes, see http://www.census.gov/cgi-bin/sssd/naics/naicsrch?chart=2012. [width="90%",format="csv",cols="^1,<4",options="header"] |=================================================== include::tmp2.csv[] |=================================================== <<< === Geography === [[geo_level]] ==== Geographic levels [[label_geo_level.csv]] ( link:label_geo_level.csv[] ) [width="40%",format="csv",cols="^1,<3",options="header"] |=================================================== include::label_geo_level.csv[] |=================================================== Geography labels are provided in separate files, in directories by state. Note that cross-state CBSA will have state-specific parts, and thus will appear in multiple files. A separate link:label_fipsnum.csv[label_fipsnum.csv] contains values and labels for all entities of <> 'n' or 's', and is a summary of separately available files. For convenience ==== State-level values ==== ( link:label_fipsnum.csv[] ) [width="40%",format="csv",cols="^1,<3",options="header"] |=================================================== include::tmp.csv[] |=================================================== ==== Detailed state and substate level values For a full listing of all valid geography codes (except for WIA codes), see http://www.census.gov/geo/maps-data/data/tiger.html. Note about geography codes: Four types of geography codes are represented with this field, depending on the value of <>. Each geography has its own code structure: - State is the 2-digit http://quickfacts.census.gov/qfd/meta/long_fips.htm[FIPS] code (<> = 's') - County is the 5-digit FIPS code (<> = 'c') - Metropolitan/Micropolitan codes are constructed from the 2-digit state FIPS code and the 5-digit http://www.census.gov/population/metro/[CBSA] code provided by the Census Bureau’s Geography Division. (<> = 'm') ** In the QWI, the metropolitan/micropolitan areas are the state parts of the full CBSA areas. - The WIA code is constructed from the 2-digit state FIPS code and the 6-digit WIA identifier provided by LED State Partners. (<> = 'w') The 2015 vintage of Census TIGER/Line geography is used for all tabulations as of the R2015Q4 release. For convenience, a composite file containing all geocodes is available as link:us/label_geography.csv[]. [format="csv",width="50%",cols="^1,^3",options="header"] |=================================================== State,Format file AK,link:ak/label_geography.csv[] AL,link:al/label_geography.csv[] AR,link:ar/label_geography.csv[] AZ,link:az/label_geography.csv[] CA,link:ca/label_geography.csv[] CO,link:co/label_geography.csv[] CT,link:ct/label_geography.csv[] DC,link:dc/label_geography.csv[] DE,link:de/label_geography.csv[] FL,link:fl/label_geography.csv[] GA,link:ga/label_geography.csv[] HI,link:hi/label_geography.csv[] IA,link:ia/label_geography.csv[] ID,link:id/label_geography.csv[] IL,link:il/label_geography.csv[] IN,link:in/label_geography.csv[] KS,link:ks/label_geography.csv[] KY,link:ky/label_geography.csv[] LA,link:la/label_geography.csv[] MA,link:ma/label_geography.csv[] MD,link:md/label_geography.csv[] ME,link:me/label_geography.csv[] MI,link:mi/label_geography.csv[] MN,link:mn/label_geography.csv[] MO,link:mo/label_geography.csv[] MS,link:ms/label_geography.csv[] MT,link:mt/label_geography.csv[] NC,link:nc/label_geography.csv[] ND,link:nd/label_geography.csv[] NE,link:ne/label_geography.csv[] NH,link:nh/label_geography.csv[] NJ,link:nj/label_geography.csv[] NM,link:nm/label_geography.csv[] NV,link:nv/label_geography.csv[] NY,link:ny/label_geography.csv[] OH,link:oh/label_geography.csv[] OK,link:ok/label_geography.csv[] OR,link:or/label_geography.csv[] PA,link:pa/label_geography.csv[] RI,link:ri/label_geography.csv[] SC,link:sc/label_geography.csv[] SD,link:sd/label_geography.csv[] TN,link:tn/label_geography.csv[] TX,link:tx/label_geography.csv[] UT,link:ut/label_geography.csv[] VA,link:va/label_geography.csv[] VT,link:vt/label_geography.csv[] WA,link:wa/label_geography.csv[] WI,link:wi/label_geography.csv[] WV,link:wv/label_geography.csv[] WY,link:wy/label_geography.csv[] |=================================================== <<< == [[statusflags]]Status flags ( link:label_flags.csv[] ) Each status flag in the tables above contains one of the following valid values. The values and their interpretation are listed in the table below. [width="80%",format="csv",cols="^1,<4",options="header"] |=================================================== include::label_flags.csv[] |=================================================== <<< == [[changes]] Changes === This version (revisions) - 2015-12-22: Initial release - 2016-03-15: Added better cross-links between CSV naming schame, and datafile schema - 2016-03-15: Corrected label_geo_level.csv to include the national level value. === Version 4.0.1 from 4.0 - 2015-02-24: removed obsolete flag values - 2015-04-01: updated IL, NE geography definitions === Version 4.0.2 from 4.0.1 - 2015-04-01: switched NAICS coding from 2007 to 2012 === Version 4.0.3 from 4.0.2 - 2015-09-14: switched Geovintage to 2014, updated AK and SD files, added MA. === Version 4.0.4 from 4.0.3 - 2015-11-30: updated OR. - 2015-12-10: Added consolidated geography label file label_geography_all.csv - 2015-12-22: Updated the identification of the correct geo vintage - 2015-12-22: Added a link to NAICS 2012 tables - 2015-12-22: Removing the 99 row in industry values - only used for internal error checking <<< ******************* This revision: Tue Mar 15 16:06:20 EDT 2016 *******************