Virginia Tech® home

Modeling bike availability in a bike-sharing system using machine learning

Leanna L. House

Abstract

This paper models the availability of bikes at San Francisco Bay Area Bike Share stations using machine learning algorithms. Random Forest (RF) and Least-Squares Boosting (LSBoost) were used as univariate regression algorithms, and Partial Least-Squares Regression (PLSR) was applied as a multivariate regression algorithm. The univariate models were used to model the number of available bikes at each station. PLSR was applied to reduce the number of required prediction models and reflect the spatial correlation between stations in the network. Results clearly show that univariate models have lower error predictions than the multivariate model. However, the multivariate model results are reasonable for networks with a relatively large number of spatially correlated stations. Results also show that station neighbors and the prediction horizon time are significant predictors. The most effective prediction horizon time that produced the least prediction error was 15 minutes.

People

Publication Details

Date of publication: August 17, 2017

Conference: IEEE International Conference on Models and Technologies for Intelligent Transportation Systems

Page number(s): 374-378

Volume:

Issue Number:

Publication Note: Huthaifa I. Ashqar, Mohammed Elhenawy , Mohammed H. Almannaa , Ahmed Ghanem, Hesham A. Rakha , Leanna House: Modeling bike availability in a bike-sharing system using machine learning. MT-ITS 2017: 374-378