Which route would the three amigos choose?

Background and Motivation:

Our motivation for this project was to analyze data that is easy to relate to.

So begins the journey of the three amigos. Staying at the St. Regis in downtown San Francisco, each of them had a favourite destination in mind for a quick jog, but would only go if the other two were interested in the same spot. Hmm ... a conflict of interest. How would one convince the others to agree with their choice? They tried flipping coins and rolling dice, but neither gave them the answer they sought. Here comes data analytics to the rescue.

With San Francisco crime data readily available, we wanted to see whether we could identify relationships among the likely variables and come up with a practical recommendation as to which route would be the safest.

Project Objectives:

Analyze the San Francisco Crime data set provided by Kaggle.com to predict the likelihood of certain crime occurrences and suggest the better path among the three chosen routes, which share a common starting point at the St. Regis and end at the Aquarium of the Bay, the Painted Ladies, or Cupid's Span, respectively.

Approach:

Examine the San Francisco Crime data set and wrangle it down to what is relevant to our end objective, i.e., tidy the data and keep only the observations that affect our mode of transportation, which is WALKING/JOGGING. We then analyze the data and finally visualize it to determine relationships between variables (if any). We conclude by defining one to three walking paths, overlaying the statistical likelihood of crime occurrences on them, and suggesting the better path. We intend to use the ‘training’ data set, as it is the only full-featured data set.
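
To make the overlay idea concrete, here is a minimal sketch in Python of how incidents could be filtered and counted near each route. It is an illustration under stated assumptions, not the analysis itself: it assumes each record has a `Category` field plus `X` (longitude) and `Y` (latitude) fields, as we understand the Kaggle training file to provide. The category set, the 150 m radius, the function names, and the idea of representing a route as a list of waypoints are all hypothetical choices. A raw count near waypoints is only a crude proxy for the statistical likelihood of crime.

```python
import math

# Hypothetical choice: categories assumed relevant to someone walking or jogging.
PEDESTRIAN_CATEGORIES = {"ASSAULT", "ROBBERY", "LARCENY/THEFT"}


def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance in metres between two latitude/longitude points."""
    earth_radius_m = 6371000.0
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlambda = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlambda / 2) ** 2)
    return 2 * earth_radius_m * math.asin(math.sqrt(a))


def count_incidents_near_route(incidents, waypoints, radius_m=150.0):
    """Count relevant incidents within radius_m of any waypoint on one route.

    incidents: iterable of dicts with 'Category', 'X' (longitude), 'Y' (latitude).
    waypoints: list of (latitude, longitude) tuples describing the route.
    """
    count = 0
    for incident in incidents:
        # Wrangling step: drop categories that do not affect pedestrians.
        if incident["Category"] not in PEDESTRIAN_CATEGORIES:
            continue
        lat, lon = float(incident["Y"]), float(incident["X"])
        # Overlay step: an incident counts once if it is near any waypoint.
        if any(haversine_m(lat, lon, w_lat, w_lon) <= radius_m
               for w_lat, w_lon in waypoints):
            count += 1
    return count


def rank_routes(incidents, routes, radius_m=150.0):
    """Return (route name with the fewest nearby incidents, counts per route)."""
    incidents = list(incidents)  # allow reuse across several routes
    counts = {name: count_incidents_near_route(incidents, waypoints, radius_m)
              for name, waypoints in routes.items()}
    return min(counts, key=counts.get), counts
```

In practice, `incidents` could come from `csv.DictReader` over the training file, and each route's waypoints would be sampled along the actual walking path from the St. Regis to its destination. Normalizing the counts (for example by route length or by time of day) would be a natural next step before drawing any conclusion.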

Analysis: