| Literature DB >> 34138884 |
Xin Liu1, Konstantinos Pelechrinis1.
Abstract
One of the most crucial elements for the long-term success of shared transportation systems (bikes, cars etc.) is their ubiquitous availability. To achieve this, and avoid having stations with no available vehicle, service operators rely on rebalancing. While different operators have different approaches to this functionality, overall it requires a demand-supply analysis of the various stations. While trip data can be used for this task, the existing methods in the literature only capture the observed demand and supply rates. However, the excess demand rates (e.g., how many customers attempted to rent a bike from an empty station) are not recorded in these data, but they are important for the in-depth understanding of the systems' demand patterns that ultimately can inform operations like rebalancing. In this work we propose a method to estimate the excess demand and supply rates from trip and station availability data. Key to our approach is identifying what we term as excess demand pulse (EDP) in availability data as a signal for the existence of excess demand. We then proceed to build a Skellam regression model that is able to predict the difference between the total demand and supply at a given station during a specific time period. Our experiments with real data further validate the accuracy of our proposed method.Entities:
Year: 2021 PMID: 34138884 PMCID: PMC8211247 DOI: 10.1371/journal.pone.0252894
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
A list of notations used through the paper.
| Symbol | Description |
|---|---|
| actual bike departure rate by total demand | |
|
| estimated bike departure rate by total demand |
| actual bike departure rate by excess demand | |
|
| estimated bike departure rate by excess demand |
| λ | actual bike arrival rate by total demand |
|
| estimated bike arrival rate by total demand |
| λ | actual bike arrival rate by excess demand |
|
| estimated bike arrival rate by excess demand |
| number of available bikes | |
| EDP length | |
| the average of multiple | |
| the average of inter-supply intervals | |
| the end time stamp of the availability curve | |
| specific time stamps of the availability curve | |
| total bike demand volume | |
| total dock demand volume | |
| net total demand volume | |
|
| observed bike demand volume |
|
| excess bike demand volume |
|
| observed dock demand volume |
|
| excess dock demand volume |
|
| duration length for bike excess demand in a 30-minute interval |
|
| duration length for dock excess demand in a 30-minute interval |
Fig 1Bike departure and arrival event flows at a bike station.
Fig 2This bike availability curve indicates possible excess demand for t ∈ (0, t1).
Fig 3This bike availability curve indicates no excess demand for t ∈ (0, t).
Fig 4A segment of bike availability curve to illustrate the estimation of excess demand.
Fig 5Histogram of estimated excess demand rate.
Fig 6Average p-values from the K-S test for all stations for departures (left) and arrivals (right).
The K-S test cannot reject the hypothesis that the observed data follow a Poisson distribution.
Fig 7Cumulative bike excess demand rate for different stations.
Reprinted from [40] under a CC BY license, with permission from OpenStreetMap, original copyright 2021.
Fig 8Cumulative dock excess demand rate for different stations.
Reprinted from [40] under a CC BY license, with permission from OpenStreetMap, original copyright 2021.
Independent variable list.
The first three variables are numerical, and the remaining are categorical.
| Name | Description |
|---|---|
| temperature | temperature (unit: Kelvins) |
| cloud percentage | percentage of clouds in the sky |
| wind speed | wind speed (unit: meter/sec) |
| day of a week | day index of a week: Mon—Sun |
| interval index | 30-minute interval index of a day (e.g., 6:00—6:30, 6:30—7:00 etc.) |
| holiday indicator | binary indicator of whether the record falls in weekend or federal holidays (1) or not (0) |
| cloud indicator | binary indicator of the weather being “cloud” (1) or not (0) |
| rain indicator | binary indicator of the weather being “rain” (1) or not (0) |
| mist indicator | binary indicator of the weather being “mist” (1) or not (0) |
| snow indicator | binary indicator of the weather being “snow” (1) or not (0) |
| thunderstorm indicator | binary indicator of the weather being “thunderstorm” (1) or not (0) |
MSE of different time periods.
| Model type | Excess | All records | Excess | All records | Excess | All records |
|---|---|---|---|---|---|---|
| (7–9:30) | (7–9:30) | (16–18:30) | (16–18:30) | (non-peak) | (non-peak) | |
| Skellam | 42.6 | |||||
| Two-Poisson | 37.6 | 6.7 | 37.2 | 10.6 | 45.3 | 2.8 |
| Neural | 40.1 | 6.8 | 39.6 | 10.8 | 43.1 | 2.8 |
| XGBoost | 36.3 | 9.3 | 43.1 | 16.2 | 3.4 | |
| Constant | 44.6 | 8.8 | 68.2 | 16.7 | 67.0 | 3.1 |
Fig 9Skellam probability distribution with parameters , .
.
MSE of different time periods under Skellam model.
| Model type | Excess | All records | Excess | All records | Excess | All records |
|---|---|---|---|---|---|---|
| (7–9:30) | (7–9:30) | (16–18:30) | (16–18:30) | (non-peak) | (non-peak) | |
| Observed+Excess | 36.2 | 6.4 | 36.4 | 10.3 | 42.6 | 2.7 |
| Observed | 47.5 | 10.0 | 52.2 | 11.9 | 45.8 | 2.9 |