-
The Use of Administrative Records and the American Community Survey to Study the Characteristics of Undercounted Young Children in the 2010 Census
May 2018
Working Paper Number:
carra-2018-05
Children under age five are historically one of the most difficult segments of the population to enumerate in the U.S. decennial census. The persistent undercount of young children is highest among Hispanics and racial minorities. In this study, we link 2010 Census data to administrative records from government and third party data sources, such as Medicaid enrollment data and tenant rental assistance program records from the Department of Housing and Urban Development, to identify differences between children reported and not reported in the 2010 Census. In addition, we link children in administrative records to the American Community Survey to identify various characteristics of households with children under age five who may have been missed in the last census. This research contributes to what is known about the demographic, socioeconomic, and household characteristics of young children undercounted by the census. Our research also informs the potential benefits of using administrative records and surveys to supplement the U.S. Census Bureau child population enumeration efforts in future decennial censuses.
View Full
Paper PDF
-
Evaluating the Use of Commercial Data to Improve Survey Estimates of Property Taxes
August 2016
Working Paper Number:
carra-2016-06
While commercial data sources offer promise to statistical agencies for use in production of official statistics, challenges can arise as the data are not collected for statistical purposes. This paper evaluates the use of 2008-2010 property tax data from CoreLogic, Inc. (CoreLogic), aggregated from county and township governments from around the country, to improve 2010 American Community Survey (ACS) estimates of property tax amounts for single-family homes. Particularly, the research evaluates the potential to use CoreLogic to reduce respondent burden, to study survey response error and to improve adjustments for survey nonresponse. The research found that the coverage of the CoreLogic data varies between counties as does the correspondence between ACS and CoreLogic property taxes. This geographic variation implies that different approaches toward using CoreLogic are needed in different areas of the country. Further, large differences between CoreLogic and ACS property taxes in certain counties seem to be due to conceptual differences between what is collected in the two data sources. The research examines three counties, Clark County, NV, Philadelphia County, PA and St. Louis County, MO, and compares how estimates would change with different approaches using the CoreLogic data. Mean county property tax estimates are highly sensitive to whether ACS or CoreLogic data are used to construct estimates. Using CoreLogic data in imputation modeling for nonresponse adjustment of ACS estimates modestly improves the predictive power of imputation models, although estimates of county property taxes and property taxes by mortgage status are not very sensitive to the imputation method.
View Full
Paper PDF
-
Coverage and Agreement of Administrative Records and 2010 American Community Survey Demographic Data
November 2014
Working Paper Number:
carra-2014-14
The U.S. Census Bureau is researching possible uses of administrative records in decennial census and survey operations. The 2010 Census Match Study and American Community Survey (ACS) Match Study represent recent efforts by the Census Bureau to evaluate the extent to which administrative records provide data on persons and addresses in the 2010 Census and 2010 ACS. The 2010 Census Match Study also examines demographic response data collected in administrative records. Building on this analysis, we match data from the 2010 ACS to federal administrative records and third party data as well as to previous census data and examine administrative records coverage and agreement of ACS age, sex, race, and Hispanic origin responses. We find high levels of coverage and agreement for sex and age responses and variable coverage and agreement across race and Hispanic origin groups. These results are similar to findings from the 2010 Census Match Study.
View Full
Paper PDF
-
Within and Across County Variation in SNAP Misreporting: Evidence from Linked ACS and Administrative Records
July 2014
Working Paper Number:
carra-2014-05
This paper examines sub-state spatial and temporal variation in misreporting of participation in the Supplemental Nutrition Assistance Program (SNAP) using several years of the American Community Survey linked to SNAP administrative records from New York (2008-2010) and Texas (2006-2009). I calculate county false-negative (FN) and false-positive (FP) rates for each year of observation and find that, within a given state and year, there is substantial heterogeneity in FN rates across counties. In addition, I find evidence that FN rates (but not FP rates) persist over time within counties. This persistence in FN rates is strongest among more populous counties, suggesting that when noise from sampling variation is not an issue, some counties have consistently high FN rates while others have consistently low FN rates. This finding is important for understanding how misreporting might bias estimates of sub-state SNAP participation rates, changes in those participation rates, and effects of program participation. This presentation was given at the CARRA Seminar, June 27, 2013
View Full
Paper PDF
-
2010 American Community Survey Match Study
July 2014
Working Paper Number:
carra-2014-03
Using administrative records data from federal government agencies and commercial sources, the 2010 ACS Match Study measures administrative records coverage of 2010 ACS addresses, persons, and persons at addresses at different levels of geography as well as by demographic characteristics and response mode. The 2010 ACS Match Study represents a continuation of the research undertaken in the 2010 Census Match Study, the first national-level evaluation of administrative records data coverage. Preliminary results indicate that administrative records provide substantial coverage for addresses and persons in the 2010 ACS (92.7 and 92.1 percent respectively), and less extensive though substantial coverage, for person-address pairs (74.3 percent). In addition, some variation in address, person and/or person-address coverage is found across demographic and response mode groups. This research informs future uses of administrative records in survey and decennial census operations to address the increasing costs of data collection and declining response rates.
View Full
Paper PDF
-
A COMPARISON OF PERSON-REPORTED INDUSTRY TO EMPLOYER-REPORTED INDUSTRY
IN SURVEY AND ADMINISTRATIVE DATA
September 2013
Working Paper Number:
CES-13-47
The Census Bureau collects industry information through surveys and administrative data and creates associated public-use statistics. In this paper, we compare person-reported industry in the American Community Survey (ACS) to employer-reported industry from the Quarterly Census of Employment and Wages (QCEW) that is part of the Census Bureau's Longitudinal Employer-Household Dynamics (LEHD) program. This research provides necessary information on the use of administrative data as a supplement to survey data industry information, and the findings will be useful for anyone using industry information from either source. Our project is part of a larger effort to compare information on jobs from household survey data to employer-reported information. This research is the first to compare ACS job data to firm-based administrative data. We find an overall industry sector match rate of 75 percent, and a 61 percent match rate at the 4-digit Census Industry Code (CIC) level. Industry match rates vary by sector and by whether industry sector is classified using ACS or LEHD industry information. The educational services and health care and social assistance sectors have among the highest match rates. The management of companies and enterprises sector has the lowest match rate, using either ACS-reported or LEHD-reported sector. For individuals with imputed industry data, the industry sector match rate is only 14 percent. Our findings suggest that the industry distribution and the sample in a particular industry sector will differ depending on whether ACS or LEHD data are used.
View Full
Paper PDF
-
Estimating Measurement Error in SIPP Annual Job Earnings: A Comparison of Census Bureau Survey and SSA Administrative Data
July 2011
Working Paper Number:
CES-11-20
We quantify sources of variation in annual job earnings data collected by the Survey of Income and Program Participation (SIPP) to determine how much of the variation is the result of measurement error. Jobs reported in the SIPP are linked to jobs reported in an administrative database, the Detailed Earnings Records (DER) drawn from the Social Security Administration's Master Earnings File, a universe file of all earnings reported on W-2 tax forms. As a result of the match, each job potentially has two earnings observations per year: survey and administrative. Unlike previous validation studies, both of these earnings measures are viewed as noisy measures of some underlying true amount of annual earnings. While the existence of survey error resulting from respondent mistakes or misinterpretation is widely accepted, the idea that administrative data are also error-prone is new. Possible sources of employer reporting error, employee under-reporting of compensation such as tips, and general differences between how earnings may be reported on tax forms and in surveys, necessitates the discarding of the assumption that administrative data are a true measure of the quantity that the survey was designed to collect. In addition, errors in matching SIPP and DER jobs, a necessary task in any use of administrative data, also contribute to measurement error in both earnings variables. We begin by comparing SIPP and DER earnings for different demographic and education groups of SIPP respondents. We also calculate different measures of changes in earnings for individuals switching jobs. We estimate a standard earnings equation model using SIPP and DER earnings and compare the resulting coefficients. Finally exploiting the presence of individuals with multiple jobs and shared employers over time, we estimate an econometric model that includes random person and firm effects, a common error component shared by SIPP and DER earnings, and two independent error components that represent the variation unique to each earnings measure. We compare the variance components from this model and consider how the DER and SIPP differ across unobservable components.
View Full
Paper PDF
-
Estimating Measurement Error in SIPP Annual Job Earnings: A Comparison of Census Survey and SSA Administrative Data
September 2002
Working Paper Number:
tp-2002-24
The third chapter investigates measurement error in SIPP annual job
earnings data linked to SSA administrative earnings data. The multiple
earnings measures provided by the survey and administrative data enable
the identification of components of true variation and variation due to
measurement error. We find that 18% of the variation in SIPP annual job
earnings can be attributed to measurement error. We also find that in
both the SIPP and the DER, measurement error is persistent over time.
A lower level of auto-correlation in the SIPP measurement error than in
the economic error component leads to a lower reliability ratio of .62 for
first-differenced earnings.
View Full
Paper PDF
-
Estimating the Relationship between Employer-Provided Health Insurance, Worker Mobility, and Wages
September 2002
Working Paper Number:
tp-2002-23
In this paper, a joint model of wages, hazard of a job ending, and
probability of holding employer-provided health insurance is estimated,
taking account of unobservable person and job characteristics. A unique
data source, the 1990 and 1996 SIPP Panels linked to SSA administrative
job histories, enables the identification of random person and job effects
and the correlation of these effects across the three equations. The explicit
modeling of this correlation produces consistent estimates of the
effect of tenure on wages and the effect of health insurance on mobility.
Substantial levels of job-lock and significant annual returns to seniority
are found. Increasing the job-specific probability of obtaining employerprovided
health insurance from 60% to 63%, or increasing the job-specific
hourly wage rate by $.80, are both associated with an equivalent decrease
in the hazard of the job ending. However, the dollar value of the wage
benefit is substantially higher.
View Full
Paper PDF