Based on the project’s business use-case, EDA was performed using the processed bike share ridership data in order to provide the following
- insights about user behavior based on temporal and geospatial bike share ridership patterns
- recommendations for which and when stations should be used during the campaign
🔂 High-Level Network Performance
The following exploration of the processed data was performed in order to understand the high-level historical performance of the network and its footprint
-
yearly totals for
- trips (bike share ridership)
- number of stations
- number of bikes
used in historical bike share ridership (2018 to 2022) and during the planned expansion window (2023 to 2025).
👤 Get Insights About User Behavior from Ridership Patterns
In order to address both requirements of the ‣ for this project, the following EDA was performed
- Identify top-performing stations
- the aggregated station demand (total bike share ridership) metrics used to classify stations as top-performers is discussed in the next section
- Extract insights about the attributes of both types of stations (top-performing stations and other stations)
- fraction of stations located in downtown Toronto, which dominated the original footprint of the bike sharing service in Toronto when it was launched
- fraction of stations located in
- downtown and adjacent neighborhoods (immediately East and West of downtown)
- other neighborhoods
- fraction of stations that accept a credit card as a method of payment
- breakdown of stations based on their physical configuration
- regular
- charging (supports e-bikes)
- stations with and without a dedicated check-in and check-out kiosk
- Extract insights from temporal patterns in user ridership trends for both types of bike share stations and for both types of bike share members (Annual and Casual), over the period from January 1, 2018 to March 31, 2023
- by month of year
- by hour of day per month of year
- by day of the week per month of the year
- relationship between daily maximum temperature and daily bike share ridership
- Extract insights from geospatial patterns in both types of stations
- by proximity of top-performing stations to neighborhoods within Downtown Toronto
These insights were used to understand attributes of the top-performing stations and to use the temporal and geospatial patterns to recommend which stations to select for use in the campaign and when displaying of digital ads should be prioritized at the selected stations.
🔝 Identifying Top-Performing Stations
đź’ Metrics for Identifying Top-Performing Stations
In order to recommend top-performing stations which should be prioritized for displaying digital ads, the following station performance metrics were used
- total departures during the most recent full year of historical ridership (2022)
- total arrivals during the most recent full year of historical ridership (2022)
- total departures during all full years of historical ridership combined (2018 - 2022)
- total arrivals during all full years of historical ridership combined (2018 - 2022)