| Title | Author | Date | Output |
|---|---|---|---|
Cyclistic Bike-Share: How Does a Bike-Share Navigate Speedy Success? |
Sharon Ng |
2021/9/6 |
html_document |
- How do annual members and casual riders use Cyclistic bikes differently?
- Why would casual riders buy Cyclistic annual memberships?
- How can Cyclistic use digital media to influence casual riders to become members?
- What is the frequency of usage among annual members and casual riders respectively?
- What actions can be done to convert casual riders to annual members?
- Lily Moreno - the director of marketing responsible for the development of campaigns and initiatives to promote the bike-share program.
- Cyclistic Executive Team - holds the decision to approve the recommended marketing program
- Cyclistic bike-share users - Cyclistic's program change will affect the users' preferences
The downloaded files are stored in a local drive but not the cloud. The zip files containing the Cyclistic Bike-Share data is then extracted in CSV format. A folder is created to store this original data. An additional folder is created to store XLSX files converted from the CSV files. These two folders act as the original datasets and any modifications to that will not occur.
- Reliable: The data is downloaded from Google Capstone Project via an anonymised company. Since Google is a large, well-known and credible corporate, it is expected and assumed that the data is trustworthy. The files are stored in static form (zip file) instead of dynamic form (cloud), so the data is reliable and does not change without prior notice.
- Original: The data is second-hand but has not been processed by any third-party.
- Comprehensive: Most data files conatin all critical information needed to find the solution. Some data files are missing in start station and its ID, and end station and its ID. This will affect our consideration in how distance and location affect a user's preference in joining the membership.
- Current: The data is current, starting from September 2020 to August 2021.
- Cited: Google cited its source.
The data is sorted in ascending order by start_date, then end_date, then rideable_type in Excel.
- Entries that have 0 difference in both longitude and latitude are deemed invalid because it is abnormal that the location spots didn’t change slight a bit thus are removed.
- Entries that have ride_length = 00:00:00 or ######## are deemed invalid even if the longitudes and latitudes have changed because the persons have not travelled so they is using the service improperly thus are removed.
Excel for data cleaning, R for data transformation, data analysis and data visualisation
Click here for the R script.
In this graph, casual and member users ride longer on weekends.
In this graph, casual users ride more frequently on weekends, while member users ride more on Wednesdays.
In this graph, casual and member users ride longer on docked bikes.
In this graph, casual and member users ride more frequently on classic bikes.
In this graph, the top three months that casual users ride the longest are May, February and September in descending order. Casual members ride the shortest in January and December. Member users ride the longest in February and September.
In this graph, the top three months that casual users ride the most are July, August and June (all summer months) in descending order. Casual members ride the least in January and February. Member users ride the most in July and August.
- Provide incentives like discounts on weekends.
- Provide more docked bikes and classic bikes since they are the top two most used bikes in terms of ride length and frequency respectively.
- Provide incentives like free-hours on the ride time in February, May and September, and giveaways for the frequent use of bikes in June, July and August.





