
Ruibin Bai
Director of Lab
Computer Science and Operations Research

Taxi trajectory data (GPS data collected for 15,000 taxis at intervals of 30 seconds across three million journeys over eight days) is used to generate a spatio-temporal prediction of shopping behaviours in the emerging metropolitan city of Shenzhen, China. Two approaches are compared: time-series forecasting using ARIMA; and a gravity model approach, using the Huff model calibrated with Geographical Weighted Regression. Results demonstrate that ARIMA performs with significantly higher accuracy than the more traditional Huff model method. Further, it is demonstrate that while the accuracy of the Huff model is constrained by model assumptions, applying time-series methods to the underlying data directly (i.e., the ARIMA method) has no such constraints, and is limited only by the amount of data available. This suggests that, as richer data sets become available, spatio-temporal modelling of this kind will become more accurate.