CREAT: Census Research Exploration and Analysis Tool

The Center for Economic Studies 1982-2007: A Brief History

October 2009

Written by: B.K. Atrostic

Working Paper Number:

CES-09-35

Abstract

More than half a century ago, visionaries representing both the Census Bureau and the external research community laid the foundation for the Center for Economic Studies (CES) and the Research Data Center (RDC) system. They saw a clear need for a system meeting the inextricably related requirements of providing more and better information from existing Census Bureau data collections while preserving respondent confidentiality and privacy. CES opened in 1982 to house new longitudinal business databases, develop them further, and make them available to qualified researchers. CES and the RDC system evolved to meet the designers' requirements. Research at CES and the RDCs meets the commitments of the Census Bureau (and, recently, of other agencies) to preserving confidentiality while contributing paradigm-shifting fundamental research in a range of disciplines and up-to-the-minute critical tools for decision-makers.

Document Tags and Keywords

Keywords Keywords are automatically generated using KeyBERT, a powerful and innovative keyword extraction tool that utilizes BERT embeddings to ensure high-quality and contextually relevant keywords.

By analyzing the content of working papers, KeyBERT identifies terms and phrases that capture the essence of the text, highlighting the most significant topics and trends. This approach not only enhances searchability but provides connections that go beyond potentially domain-specific author-defined keywords.
:
data, researcher, information census, enterprise, database, data census, report, census data, microdata, survey, disclosure, study, agency, respondent, confidentiality, research, information, privacy, business data, research census, datasets

Tags Tags are automatically generated using a pretrained language model from spaCy, which excels at several tasks, including entity tagging.

The model is able to label words and phrases by part-of-speech, including "organizations." By filtering for frequent words and phrases labeled as "organizations", papers are identified to contain references to specific institutions, datasets, and other organizations.
:
American Economic Association, Characteristics of Business Owners, Standard Statistical Establishment List, Internal Revenue Service, Standard Industrial Classification, Bureau of Labor Statistics, Social Security Administration, Small Business Administration, American Statistical Association, Longitudinal Research Database, National Science Foundation, Center for Economic Studies, Securities and Exchange Commission, Harvard University, Bureau of Economic Analysis, Permanent Plant Number, University of Maryland, Federal Reserve Bank, Organization for Economic Cooperation and Development, Current Population Survey, Longitudinal Business Database, Department of Energy, Pollution Abatement Costs and Expenditures, Survey of Industrial Research and Development, WECD, Decennial Census, University of California Los Angeles, Medical Expenditure Panel Survey, Census of Manufacturing Firms, Survey of Income and Program Participation, Boston College, Department of Education, National Employer Survey, Georgetown University, Social Security, Economic Census, Research Data Center, North American Industry Classification System, American Community Survey, Longitudinal Employer Household Dynamics, Agency for Healthcare Research and Quality, American Housing Survey, Kauffman Foundation, Integrated Longitudinal Business Database, ASEC, Federal Tax Information

Similar Working Papers Similarity between working papers are determined by an unsupervised neural network model know as Doc2Vec.

Doc2Vec is a model that represents entire documents as fixed-length vectors, allowing for the capture of semantic meaning in a way that relates to the context of words within the document. The model learns to associate a unique vector with each document while simultaneously learning word vectors, enabling tasks such as document classification, clustering, and similarity detection by preserving the order and structure of words. The document vectors are compared using cosine similarity/distance to determine the most similar working papers. Papers identified with 🔥 are in the top 20% of similarity.

The 10 most similar working papers to the working paper 'The Center for Economic Studies 1982-2007: A Brief History' are listed below in order of similarity.