#!/usr/bin/env python
# coding: utf-8
# # Geographical Information
#
# Taking into account everything that has already been said, we will specify any notable points or differences within this particular theme of the data.
# **Major Challenges:**
# * Transcribers often neglected to include complete geographical information, even when it was available (e.g. when an ad specifies that the owner lived in the locale of Washington, D.C. but the county column is left blank)
#
#
# # Mapping Computational Thinking into Library and Archival Science Practices (CT-LAS)
# (Data Practices and Systems Thinking Practices), focused on age and sex
#
#
# ## Data Practices
#
# * **Collecting Data** (Maryland State Archives and crowdsourced transcribers)
# * **Creating Data / Manipulating Data** - cleaning data to ensure typos of locales were corrected
# * **Analyzing / Visualizing Data** - in Tableau, queried relationships between county of origin, the number of entries for each county, and the predicted destinations to begin understanding more about the movements of the enslaved population described
# ## Systems Thinking Practices
#
# * **Investigating Complex System as a Whole** and **Understanding the Relationships within a System** - spent first day getting comfortable with the dataset and understanding the distinct and interconnected information included (runaway data and owner data, newspaper data, etc.) - they are all linked.
# * Explicity compared runaway data and owner data to attempt to trace path of runaways
# * **Thinking in Levels** - cleaned data of county typos first to minimize the potential to lose information, as counties have the highest potential to compare to other factors.
#
# # Frequency of Runaways in the Record by County
#
#
#