CREAT: Census Research Exploration and Analysis Tool

Changes in Metropolitan Area Definition, 1910-2010

February 2021

Written by: Todd Gardner

Working Paper Number: CES-21-04

Abstract

The Census Bureau was established as a permanent agency in 1902, as industrialization and urbanization were bringing about rapid changes in American society. The years following the establishment of a permanent Census Bureau saw the first attempts at devising statistical geography for tabulating statistics for large cities and their environs. These efforts faced several challenges owing to the variation in settlement patterns, political organization, and rates of growth across the United States. The 1910 census proved to be a watershed, as the Census Bureau offered a definition of urban places, established the first census tract boundaries for tabulating data within cities, and introduced the first standardized metropolitan area definition. It was not until the middle of the twentieth century, however, that the Census Bureau, in association with other statistical agencies, established a flexible standard metropolitan definition and a more consistent means of tabulating urban data. Since 1950, the rules for determining the cores and extent of metropolitan areas have been largely regarded as comparable. In the decades that followed, however, a number of rule changes were put into place that accounted for metropolitan complexity in differing ways, and these have been the cause of some confusion. Changes put into effect with the 2000 census represent a consensus of sorts for how to handle these issues.

Document Tags and Keywords

Keywords: census data, metropolitan, measure, geographically, population, urban, town, urbanization, city, geography, district, neighborhood, suburb, research census, resident, geographic

Keywords are automatically generated using KeyBERT, a keyword extraction tool that uses BERT embeddings to select contextually relevant keywords. By analyzing the content of working papers, KeyBERT identifies terms and phrases that capture the essence of the text, highlighting the most significant topics and trends. This approach not only enhances searchability but also provides connections that go beyond potentially domain-specific author-defined keywords.

Tags: Metropolitan Statistical Area, Office of Management and Budget, New England County Metropolitan, Consolidated Metropolitan Statistical Areas, American Community Survey, Core Based Statistical Area, 2010 Census

Tags are automatically generated using a pretrained language model from spaCy, which handles several tasks, including entity tagging. The model labels words and phrases by part of speech and by entity type, including "organizations." Filtering for frequent words and phrases labeled as "organizations" identifies the specific institutions, datasets, and other organizations that a paper references.

Similar Working Papers

Similarity between working papers is determined by an unsupervised neural network model known as Doc2Vec.

Doc2Vec is a model that represents entire documents as fixed-length vectors, allowing for the capture of semantic meaning in a way that relates to the context of words within the document. The model learns to associate a unique vector with each document while simultaneously learning word vectors, enabling tasks such as document classification, clustering, and similarity detection by preserving the order and structure of words. The document vectors are compared using cosine similarity/distance to determine the most similar working papers. Papers identified with 🔥 are in the top 20% of similarity.
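The comparison step described above can be illustrated with a minimal sketch. It assumes the document vectors have already been produced by a Doc2Vec model and are available as plain lists of floats. The function names (`cosine_similarity`, `rank_similar`), the paper identifiers, and the reading of "top 20%" as the top fifth of candidates ranked against the target paper are all assumptions made for illustration, not details confirmed by this page.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        # A zero vector has no direction; treat it as dissimilar.
        return 0.0
    return dot / (norm_u * norm_v)


def rank_similar(target_id, vectors, top_n=10, hot_fraction=0.2):
    """Rank every other document by cosine similarity to the target.

    Returns up to top_n tuples of (doc_id, score, is_hot), where is_hot
    marks documents in the top hot_fraction of all candidates. That
    interpretation of the 20% threshold is an assumption.
    """
    target = vectors[target_id]
    scored = [
        (doc_id, cosine_similarity(target, vec))
        for doc_id, vec in vectors.items()
        if doc_id != target_id
    ]
    # Highest similarity first.
    scored.sort(key=lambda item: item[1], reverse=True)
    hot_count = math.ceil(hot_fraction * len(scored))
    return [
        (doc_id, score, rank < hot_count)
        for rank, (doc_id, score) in enumerate(scored[:top_n])
    ]


if __name__ == "__main__":
    # Toy two-dimensional vectors with hypothetical paper identifiers.
    toy_vectors = {
        "paper-A": [1.0, 0.0],
        "paper-B": [1.0, 0.1],
        "paper-C": [0.0, 1.0],
        "paper-D": [2.0, 1.0],
    }
    for doc_id, score, is_hot in rank_similar("paper-A", toy_vectors, top_n=3):
        print(doc_id, round(score, 3), "hot" if is_hot else "")
```

Real Doc2Vec vectors typically have tens to hundreds of dimensions, but the ranking logic is the same; cosine similarity depends only on the direction of the vectors, not on their length.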

The 10 most similar working papers to the working paper 'Changes in Metropolitan Area Definition, 1910-2010' are listed below in order of similarity.