Exploratory Data Analysis - My Uber🚘 Ride History from unstructured data | GODWIN Nwalozie

Exploratory Data Analysis - My Uber🚘 Ride History from unstructured data

May 20, 2022

This exercise was born out of curiosity. 🤔 I drive a lot, but I was curious to find out how much money I had spent, and what time of day had the most rides on Uber, since my first ride in December 2016.

However, according to Uber’s policy, only the past 30 days’ worth of ride history data can be provided.



Link to notebook on github github



(1) Getting my Uber ride data or the trip history from Uber online.

  1. So I logged on to my uber account online

  2. After login, click the Trips tab.

My Image



(2) Copying the data and pasting into excel

  1. Then select to highlight your report from top to botton.

  2. Copy this and paste in an excel workbook. Save as csv file format.

My Image



(3) Importing with pandas

  1. Import with pandas into your IDE or prefered code editor such as Jupyter notebook or Visual Studio code pandas. Here I am using VSC.

My Image

You will get a dataframe that has a single column, unstructured data lumped together. My Image



(4) Preprocessing data to get the needed features.

  1. You will have to apply all forms of preprocessing methods, looping through the column to extract the required features.

This was my final dataframe. The 0.00 km and NGN0.00 (Nigerian Naira) indicates cancelled trips.

My Image



(5) Findings from data exploration and analysis

Spent N322,000 Nigerian Naira so far == USD 631 , by todays exhange rate
My Image

My Image


I have done more trips diring the morning and night My Image


Price and distance are positively(high) correlated My Image



(6) Short video illustrating the steps

This short video will give you a glimpse on the preprocessing steps to final dataframe


Thank you for your time 🙏