CREAT: Census Research Exploration and Analysis Tool

Air Quality, Human Capital Formation and the Long-term Effects of Environmental Inequality at Birth

May 2017

Written by: John Voorheis

Working Paper Number:

carra-2017-05

Abstract

A growing body of literature suggests that pollution exposure early in life can have substantial long term effects on an individual's economic well-being as an adult, however the mechanisms for these effects remain unclear. I contribute to this literature by examining the effect of pollution exposure on several intermediate determinants of adult wages using a unique linked dataset for a large sample of individuals from two cohorts: an older cohort born around the 1970, and a younger cohort born around 1990. This dataset links responses to the American Community Survey to SSA administrative data, the universe of IRS Form 1040 tax returns, pollution concentration data from EPA air quality monitors and satellite remote sensing observations. In both OLS and IV specifications, I find that pollution exposure at birth has a large and economically significant effect on college attendance among 19-22 year olds. Using conventional estimates of the college wage premium, these effects imply that a 10 'g/m3 decrease in particulate matter exposure at birth is associated with a $190 per year increase in annual wages. This effect is smaller than the wage effects in the previous literature, which suggests that human capital acquisition associated with cognitive skills cannot fully explain the long term wage effects of pollution exposure. Indeed, I find evidence for an additional channel working through non-cognitive skill -pollution exposure at birth increases high school non-completion and incarceration among 16-24 year olds, and that these effects are concentrated within disadvantaged communities, with larger effects for non-whites and children of poor parents. I also find that pollution exposure during adolescence has statistically significant effects on high school non-completion and incarceration, but no effect on college attendance. These results suggest that the long term effects of pollution exposure on economic well-being may run through multiple channels, of which both non-cognitive skills and cognitive skills may play a role.

Document Tags and Keywords

Keywords Keywords are automatically generated using KeyBERT, a powerful and innovative keyword extraction tool that utilizes BERT embeddings to ensure high-quality and contextually relevant keywords.

By analyzing the content of working papers, KeyBERT identifies terms and phrases that capture the essence of the text, highlighting the most significant topics and trends. This approach not only enhances searchability but provides connections that go beyond potentially domain-specific author-defined keywords.
:
econometric, emission, pollutant, environmental, pollution, concentration, intergenerational, mortality, pollution exposure, exposure

Tags Tags are automatically generated using a pretrained language model from spaCy, which excels at several tasks, including entity tagging.

The model is able to label words and phrases by part-of-speech, including "organizations." By filtering for frequent words and phrases labeled as "organizations", papers are identified to contain references to specific institutions, datasets, and other organizations.
:
Internal Revenue Service, Social Security Administration, Ordinary Least Squares, Current Population Survey, Environmental Protection Agency, National Ambient Air Quality Standards, American Community Survey, Social Security Number, Longitudinal Employer Household Dynamics, Protected Identification Key, National Center for Health Statistics, Centers for Disease Control and Prevention, Person Validation System, Person Identification Validation System, SSA Numident, CAAA, Center for Administrative Records Research and Applications

Similar Working Papers Similarity between working papers are determined by an unsupervised neural network model know as Doc2Vec.

Doc2Vec is a model that represents entire documents as fixed-length vectors, allowing for the capture of semantic meaning in a way that relates to the context of words within the document. The model learns to associate a unique vector with each document while simultaneously learning word vectors, enabling tasks such as document classification, clustering, and similarity detection by preserving the order and structure of words. The document vectors are compared using cosine similarity/distance to determine the most similar working papers. Papers identified with 🔥 are in the top 20% of similarity.

The 10 most similar working papers to the working paper 'Air Quality, Human Capital Formation and the Long-term Effects of Environmental Inequality at Birth' are listed below in order of similarity.