New-York-Citi-Bike-Trip-Duration-2016
OpenML dataset with id 43573
No author found.
Full work available at URL: https://api.openml.org/data/v1/download/22102398/New-York-Citi-Bike-Trip-Duration-2016.arff
Upload date: 23 March 2022
Copyright license: Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International
Dataset Characteristics
Number of features: 8 (numeric: 6, symbolic: 0 and in total binary: 0 )
Number of instances: 4,500,000
Number of instances with missing values: 0
Number of missing values: 0
Context
Inspired by the New York City Taxi Trip Duration playground I created a dataset using the publicly available data from this link). Citi Bike is a bike sharing service available in New York City, that permits easy and affordable bike trips. They regularly release data about such trips, including starting and ending stations, starting and ending time, duration of the trip and few others variables.
It closely resembles the data available about taxi trips and I think it could be interesting to compare the two datasets. Let me know if you have any comment.
Content
The dataset covers 4.5M Citi Bike trips from the first 6 months of 2016. The data has been anonymized and the content has been arranged to follow the Taxi Trip dataset categories and nomenclature.
Notice that the starting and ending point of each trip correspond to one of the 500 Citi Bike stations spread around NYC, most of them in Manhattan, with a substantial subset in Brooklyn.
Acknowledgements
This dataset is the property of NYC Bike Share, LLC and Jersey City Bike Share, LLC (Bikeshare) operates New York Citys Citi Bike bicycle sharing service for TC click here
Inspiration
Is there a correlation between the duration of bike rides and taxi rides? Weather or traffic conditions could affect both in a similar way.
Is it always faster to get a cab?