The raw datasets used for this project are listed here. A data dictionary for each dataset is listed in the π Data Dictionaries section. A hyper-link to the data dictionary is provided for each dataset listed below.
π±οΈ Manually Downloaded
- The following datasets were manually downloaded
- Raw Bike Share Ridership Data
- The raw bike share ridership data for Bike Share Toronto contains a record for every trip. Data is posted to the Toronto Open Data Portal every month as a zipped folder with CSV files. This dataset was manually downloaded from the portal for 2018, 2019, 2020, 2021, 2022 and 2023.
- Census Tracts
- Administrative, statistical and non-standard boundaries (or subdivisions) across Canada are used to analyze data collected during the national census. Census tracts are a type of statistical boundary. The geodata file with geospatial boundaries for census tracts across Canada was manually downloaded from Statistics Canada.
β¨οΈ Programmatically Retrieved or Generated Using Python
The following datasets were programmatically retrieved from a web URL or generated based on publicly available data, using Python
- Neighbourhoods (retrieved)
- geospatial data for neighbourhoods within city of Toronto
- Bike Share Station Info (retrieved)
- contains attributes of bike share stations
- the neighbourhood containing each station was programmatically determined using the stationβs
lat
-lon
co-ordinates and appended in a column named Neighbourhood
- Colleges and Universities (created)
- contains the list of college and university campus locations (
lat
-lon
co-ordinates) within the city of Toronto, excluding MCU
- Daily Weather Data (retrieved)
- Downtown Neighbourhoods (created)
- names of neighbourhoods that are located in Downtown Toronto
- Bike Share Expansion Plans (created)
- Public Transit Routes (retrieved)
- Public Holidays (created)
- Library Branches (retrieved)
- Cultural Hotspots (retrieved)
- Places of Interest (retrieved)
- Public Transit Stops (retrieved)
- Cycle Paths (retrieved)