Section 3 Data Requirements
This section details how to obtain all the necessary data for this project. Data can be broken into three types:
- Employment
- USAspending
- Additional Data
Please note, there is no guarantee that the data sources described below will exist in this form indefinitely.
3.1 Employment Data
This report focused on two types of employment data:
- Civilian
- Military
These employment data types were sourced from two separate websites.
Civilian employment is available from the Office of Personnel Management’s (OPM) FedScope website, which provides federal workforce data. The report uses September data to align with the federal fiscal year. To obtain the data:
- Load the Office of Personnel Management’s (OPM) FedScope website.
- Click on “Employment” under the “Status Data” bullet point to access quarterly employment data cubes.
- Click September of the desired year.
Filter the data by agency type (“Cabinet Level”) and location (“United States” and a state of one’s choice) in order to get the civilian employment numbers for the Departments of Defense (which is an aggregate of the Air Force, Army, Defense, and Navy cabinet agencies), Homeland Security, Veterans Affairs, and Energy. This data can be exported out as a PDF, CSV, or Excel file.
Military employment is available from the Defense Manpower Data Center (DMDC) website. As with civilian employment, the report uses September data to align with the federal fiscal year. To obtain the data:
- Load the Defense Manpower Data Center (DMDC) website.
- Hover over to Department of Defense (DoD) Data/Reports and select “Statistics & Reports.”
- On the linked page, scroll to the “DoD Personnel, Workforce Reports & Publications” text, and click on its hyperlink. This opens up a new window to access the military employment data.
- Scroll to the section “Military and Civilian Personnel by Service/Agency by State/Country (Updated Quarterly)” and download the Excel file for September of the desired year.
3.2 USAspending Data
This report utilizes USAspending.gov for the majority of its direct spending data. The spending types obtained from this site include contracts and grants for the U.S. Departments of Defense, Homeland Security, Veterans Affairs, and Energy, as well as direct payments from Veterans Affairs (i.e., veterans’ benefits). Data is available from the “Custom Awards Data” webpage. Data is available by award type (contracts, grants, and/or direct payments), awarding agency (Defense, Homeland Security, Veterans Affairs, Energy), recipient location (United States for “Country,” and, optionally, a specific state), action date range (a fiscal year “FY 20XX”), and file format. This download process must be done once per agency.
Code that automates this entire process has been developed to capture all USAspending data at once. Please refer to Section 6.3 in the Methods section for more details.
3.3 Additional Data
3.3.1 SmartPay Data via FOIA
The remainder of direct spending data used in these reports was SmartPay, a charge card program for federal employees. In order to obtain this data, Freedom of Information Act requests (commonly known as FOIAs) were filed to the federal agencies.
For more information about SmartPay, please visit the General Services Administration’s (GSA) SmartPay website.
3.3.2 Data Obtained Online
A variety of data sources are obtained online in order to assist with the data processing for this report. This includes:
Federal Real Public Property data from the GSA. The GSA provides an Excel file that details federal property by location. This data is used to distribute statewide Homeland Security and Veterans Affairs civilian employment across counties and districts.
American Community Survey data from the Census Bureau. Type “DP03” into the search bar at the center of the webpage. This returns a table of “Selected economic characteristics.” From this table, filter to the appropriate geography (state, county, congressional district, etc.) and year to download an Excel file. This data is used to distribute statewide military employment across counties and districts. To address issues of small sample size, our team used five-year estimates of this data.
NAICS to IMPLAN crosswalk from IMPLAN. IMPLAN provides a variety of crosswalks that are used to relate data from other sources to IMPLAN’s 546 industries codes. On this page, go to the second heading titled “2017 NAICS to IMPLAN 546 Industries” and download the Excel file. This crosswalk is utilized to help relate the USAspending contracts data to IMPLAN sectors in order to run that spending data through the IMPLAN software.
3.3.3 Data Provided Raw/Self-Made
The repository contains raw data files that have been built for this project. Specific instructions on how to access this repository and utilize these raw data sources are explained in the next section.