New-York-Citi-Bike-Trip-Duration-2016

From MaRDI portal
Dataset:6036670



OpenML ID: 43573
MaRDI QID: Q6036670

OpenML dataset with id 43573

No author found.

Full work available at URL: https://api.openml.org/data/v1/download/22102398/New-York-Citi-Bike-Trip-Duration-2016.arff

Upload date: 23 March 2022
Copyright license: Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International


Dataset Characteristics

Number of features: 8 (numeric: 6, symbolic: 0, binary: 0)
Number of instances: 4,500,000
Number of instances with missing values: 0
Number of missing values: 0
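
The feature counts above can be checked against the ARFF file itself without loading all 4.5M rows, because an ARFF file declares its columns in a header that ends at the `@data` marker. The following is a minimal sketch using only the Python standard library. The function names `read_arff_attributes` and `count_numeric` are hypothetical helpers, and the `SAMPLE` header is made up for illustration; it does not reflect the actual column names of this dataset.

```python
import io
from typing import Iterable, List, Tuple


def read_arff_attributes(lines: Iterable[str]) -> List[Tuple[str, str]]:
    """Return (name, type) pairs declared in an ARFF header.

    Reading stops at the @data marker, so the data section is never scanned.
    """
    attributes = []
    for raw in lines:
        line = raw.strip()
        if not line or line.startswith("%"):  # skip blank lines and comments
            continue
        lowered = line.lower()
        if lowered.startswith("@data"):
            break
        if lowered.startswith("@attribute"):
            # Declaration format: @attribute <name> <type>; names may be quoted.
            rest = line[len("@attribute"):].strip()
            if rest[0] in "'\"":
                quote = rest[0]
                end = rest.index(quote, 1)
                name, type_ = rest[1:end], rest[end + 1:].strip()
            else:
                name, type_ = rest.split(None, 1)
            attributes.append((name, type_.strip()))
    return attributes


def count_numeric(attributes: List[Tuple[str, str]]) -> int:
    """Count attributes whose declared ARFF type is numeric."""
    numeric_types = {"numeric", "real", "integer"}
    return sum(1 for _, t in attributes if t.lower() in numeric_types)


# Hypothetical header for illustration only; NOT the real file's columns.
SAMPLE = """% toy example
@relation toy
@attribute trip_id string
@attribute 'start time' string
@attribute duration numeric
@attribute latitude REAL
@data
a,b,1,2.0
"""

attrs = read_arff_attributes(io.StringIO(SAMPLE))
print(attrs)
print(count_numeric(attrs))
```

To apply this to the real dataset, download the ARFF file from the URL listed above and pass the open file object to `read_arff_attributes` in place of the `io.StringIO` sample.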

Context

Inspired by the New York City Taxi Trip Duration playground, I created a dataset from publicly available data. Citi Bike is a bike sharing service in New York City that permits easy and affordable bike trips. It regularly releases data about these trips, including starting and ending stations, starting and ending times, trip duration, and a few other variables. The data closely resembles the data available about taxi trips, and I think it could be interesting to compare the two datasets. Let me know if you have any comments.

Content

The dataset covers 4.5M Citi Bike trips from the first 6 months of 2016. The data has been anonymized and arranged to follow the Taxi Trip dataset categories and nomenclature. Note that the starting and ending points of each trip each correspond to one of the 500 Citi Bike stations spread around NYC, most of them in Manhattan, with a substantial subset in Brooklyn.

Acknowledgements

This dataset is the property of NYC Bike Share, LLC and Jersey City Bike Share, LLC ("Bikeshare"), which operate New York City's Citi Bike bicycle sharing service.

Inspiration

Is there a correlation between the duration of bike rides and taxi rides? Weather or traffic conditions could affect both in a similar way. Is it always faster to get a cab?