The data downloaded from Census came as a series of .xls, .accdb, .dbf, .txt, .csv files. In order to get this data in a manageable format, I used Stata, my preferred statistical/mathematical software program. I wrote Stata code to do this for each dataset.
First, I wrote a Stata code to read in the American Community Survey (ACS) data. Census provides several .csv files that include variable labels and descriptions for each of the segments and the geography files. This takes several steps reading the geography files for each state and using the .csv template to label data, reading the segment .txt file for each state and using the template to label the data, linking segment files to geography files using the LOGRECNO variable, creating national county, block group, and tract segment files, and linking segments to create complete county, block group, and tract estimates. In the code, the user needs to set/revise the path where they are storing log files and revised data. This Stata code (02-manageACS12_20180731.do) is available by clicking here.
Next, I wrote a Stata code to read in the Decennial Census 2010 Summary File One data. Census does not provide variable labels and descriptions for each of the segments and the geography files. Instead, it provides an Access 1999 file with templates for each of the segments. These templates are accessible in .csv format by clicking here. For the Stata code to work, these templates must be saved in the same location as the original data downloaded in the previous step. While it also has a template for the geography file, the template does not work because the state geography files do not have common separators such as commas. As a result, I had to write dictionaries for each state's geography file. For the Stata code to work, the dictionary files must be saved in the same location as the original data downloaded in the previous step. These dictionaries are accessible by clicking here. I then wrote the Stata code that goes through several steps reading the geography files for each state, reading the segment text file for each state and using the template to label the data, linking segment files to geography files, creating national block group, tract, and county segment files, and linking segments to create complete block, block group, tract, and county estimates. In the code, the user needs to set/revise the path where they are storing log files and revised data. This Stata code (03-manageDC00SF1_20180720.do) is available by clicking here.
Then, I wrote a Stata code to read in the Decennial Census 2000 Summary File One data. Census does not provide variable labels and descriptions for each of the segments and the geography files. Instead, it provides an Access 2007 file with templates for each of the segments. These templates are accessible by clicking here. For the Stata code to work, these templates must be saved in the same location as the original data downloaded in the previous step. While it also has a template for the geography file, the template does not work because the state geography files do not have common separators such as commas. As a result, I had to write dictionaries for each state's geography file. For the Stata code to work, the dictionary files must be saved in the same location as the original data downloaded in the previous step. These dictionaries are accessible by clicking here. I then wrote the Stata code that goes through several steps reading the geography files for each state, reading the segment text file for each state and using the template to label the data, linking segment files to geography files, creating national block group and tract segment files, and linking segments to create complete block, block group, and tract estimates. In the code, the user needs to set/revise the path where they are storing log files and revised data. This Stata code (04-manageDC00SF1_20180720.do) is available by clicking here.
Finally, I went through similar processes as described above for the Decennial Census 2000 Summary File Three data. Click here for the templates for each of the segments. Click here for the geography dictionary files. Click here to access the Stata code (05-manageDC00SF3_20180723.do).