- 1Key Laboratory of Virtual Geographic Environment (Ministry of Education of PRC), Nanjing Normal University, Nanjing, China
- 2State Key Laboratory Cultivation Base of Geographical Environment Evolution (Jiangsu Province), Nanjing Normal University, Nanjing, China
- 3Jiangsu Center for Collaborative Innovation in Geographical Information Resource Development and Application, Nanjing Normal University, Nanjing, China
Shared bicycles are currently widely welcomed by the public due to their flexibility and convenience; they also help reduce chemical emissions and improve public health by encouraging people to engage in physical activities. However, during their development process, the imbalance between the supply and demand of shared bicycles has restricted the public's willingness to use them. Thus, it is necessary to forecast the demand for shared bicycles in different urban regions. This article presents a prediction model called QPSO-LSTM for the origin and destination (OD) distribution of shared bicycles by combining long short-term memory (LSTM) and quantum particle swarm optimization (QPSO). LSTM is a special type of recurrent neural network (RNN) that solves the long-term dependence problem existing in the general RNN, and is suitable for processing and predicting important events with very long intervals and delays in time series. QPSO is an important swarm intelligence algorithm that solves the optimization problem by simulating the process of birds searching for food. In the QPSO-LSTM model, LSTM is applied to predict the OD numbers. QPSO is used to optimize the LSTM for a problem involving a large number of hyperparameters, and the optimal combination of hyperparameters is quickly determined. Taking Nanjing as an example, the prediction model is applied to two typical areas, and the number of bicycles needed per hour in a future day is predicted. QPSO-LSTM can effectively learn the cycle regularity of the change in bicycle OD quantity. Finally, the QPSO-LSTM model is compared with the autoregressive integrated moving average model (ARIMA), back propagation (BP), and recurrent neural networks (RNNs). This shows that the QPSO-LSTM prediction result is more accurate.
Introduction
Shared bicycle systems have been widely adopted in cities as a promising solution to the last mile issues in public transportation (1–3). Apart from the immediate advantage for city commuters, it can also provide benefits to the environment and public health (4–8). In particular, due to its advantages of reducing resource consumption and chemical emissions, the use of shared bicycles is more environmentally friendly than motorized transportation (9). There is little doubt that using shared bicycles rather than automobiles benefits the environment. Meanwhile, the application of shared bicycles has the potential to promote public health. Physical inactivity has been linked to increased morbidity and mortality in numerous studies (10). Shared bicycles can provide means for people to exercise and improve their overall health (5, 11, 12).
At present, there are two types of shared bicycles in Nanjing city, docked shared bicycles and dockless shared bicycles. The docked shared bicycles require unified management with hardware devices, such as parking piles and paying devices. However, the docked shared bicycle user experience is not good due to the cumbersome certification and registration process (13). Moreover, the high capital and space costs of its supporting equipment also limit its development to a certain extent. Based on the mobile internet technology, dockless shared bicycles have rapidly developed in China and have become the main means of shared bicycles in the market due to their better user experience, simple registration and certification process, and convenience afforded when borrowing and returning bikes (14, 15). Dockless shared bicycles provide residents with more convenient services due to their stop-on-ride, flexibility, ease of use and low price. However, there are also many factors that are not conducive to the development of urban transportation, such as disorderly parking and the imbalance between supply and demand (16–18). These problems might decrease the opportunity and willingness of the public to use shared bicycles, but they have occurred in almost every city where shared bicycles are deployed in Asia, Europe and the Americas, including Beijing, Shanghai and Nanjing in China (17, 19, 20).
Short-term forecasts for the origin and destination (OD) number of shared bicycles in different areas can help in the discovery of behavioral patterns and help to solve the above possible future problems in advance (21). With better prediction results, users can be informed of the distribution of bikes in an area sometime in the future to better plan their itinerary. Operators can plan to place and reallocate bikes to improve the customer experience according to short-term demand forecasts. This will further promote the travel efficiency of urban residents and accelerate the improvement of the city's green travel layout. Therefore, it is beneficial not only for users and operators but also for the environment and public health. Geographic modeling is a useful way to discover geographic patterns and predict geographic processes (22, 23). Many scholars have utilized different models for shared bicycle prediction, including linear models based on mathematical statistics, intelligent theoretical models represented by neural networks, and combined models, which combine more than two types of models. The more representative linear model is the autoregressive integrated moving average (ARIMA) model. The ARIMA model was applied to predict the flow number and trip duration of bicycles (24). Although ARIMA is the most common statistical model for time series prediction, it has extremely high data requirements and requires time series data to be stable. It only captures the linear relationship of the data. The intelligent theoretical models used for bike-sharing prediction mainly include back propagation (BP), recurrent neural networks (RNNs), and long short-term memory (LSTM). BP has been used to predict shared bicycle demand and the number of public bicycle rented (25, 26). However, its learning speed is very slow, and network training is more likely to fail. RNNs are particularly good at capturing the temporal and spatial evolution of traffic flow, quantity and speed, so they are often used to predict short-term traffic volumes (27, 28). Although traditional RNNs perform well in non-linear time series data modeling, there are still several issues to be addressed, such as the inability to train time series with long time lags and the difficulty of automatically finding the optimal time window size (29, 30). LSTM makes up for the gradient disappearance and gradient explosion of RNNs and the lack of long-term memory ability so that the recurrent neural network can make full use of the long-term sequence information (31). Thus, LSTM can be applied to predict traffic flow and the demand of dockless shared bicycles (32). Due to the complexity of the traffic system and the shortcomings of various models, scholars have combined multiple models in recent studies to make full use of the advantages of different models for shared bicycle prediction. Combined models usually combine several different intelligent theoretical models and are based on mathematical statistics and intelligent theoretical models. For example, convolutional neural networks (CNNs) and LSTM were combined to predict the short-term distribution of dockless shared bicycles (33); a combination of CNNs and LSTM in a deep learning model was applied to predict the travel distance and OD distribution of shared bicycles (34).
The smart prediction of shared bicycles in this article is based on a deep learning algorithm. The first condition for accurate analysis using machine learning is to determine the appropriate model structure, including the number of stacked layers, the number of layer nodes, the activation function, the batch size and other hyperparameters. Determining the optimal hyperparameter combination in a high-dimensional space is a complicated problem. The traditional method for determining hyperparameters is the manual parameter adjustment method, which mainly relies on the experience of researchers, has strong subjectivity, and requires a long time for conducting experiments. In addition, other hyperparameter optimization methods, such as grid search, random search and Bayesian optimization, have high time complexity. There have been many studies on optimizing hyperparameters by using swarm intelligence algorithms due to their excellent parameter optimization performance. Commonly used swarm intelligence algorithms include ant colony optimization (ACO), particle swarm optimization (PSO), and quantum particle swarm optimization (QPSO) (35, 36). They can be applied to various types of deep learning hyperparameter optimization, but they were mainly applied to optimize common models such as support vector machines (SVMs) and BP (37–42).
Most short-term demand predictions were based on regular grids. Few studies use swarm intelligence algorithms to optimize models with more parameters, such as LSTM, and even fewer use them for shared bicycle prediction. This article aims to build a comprehensive model, called QPSO-LSTM, for predicting the OD quantity of dockless shared bicycles by combining QPSO and LSTM models. By taking AOI (area of interest) as the basic analysis unit, the model attempts to consider the influence of different urban function types on bicycle distribution. In addition, QPSO is applied to optimize LSTM to obtain better predictions. By taking the main urban areas of Nanjing city as an example, the QPSO-LSTM model is applied to predict the OD number of dockless shared bicycles in different types of AOIs. It can help determine possible problems in advance, such as bike aggregation or an insufficient number of bikes in the future period, to help with the bike scheduling and promote bike distribution rebalancing. Finally, the model is compared with other commonly used models, including BP, ARIMA and RNN, to verify its accuracy.
Study Area and Data Processing
Study Area and Data Acquisition
Nanjing is an important gateway city for the development of the Yangtze River Delta. According to the 7th census, it has a permanent population of ~9 million (2021). Since 2015, shared bicycles have experienced rapid development in Nanjing and have become a new travel choice for residents. Dockless shared bicycles were born as a new form of the sharing economy at the end of 2016, including bike share systems such as Ofo, Mobike and Hello Bike. Dockless shared bicycles quickly swept across the country due to their higher flexibility. As reported by Jiangsu Sina (2019), in 2018 the number of shared bicycles in Nanjing reached a record high of ~600,000–700,000. Then the total number of shared bicycles was limited to ~317,000 to regulate the bike-sharing market under the control of the government. According to the Hello Bike use report, in Nanjing (2017), 40,000 shared bicycles have been used 31 million times in 3 months, covering a distance of 54.81 million km, equivalent to cycling 1,367 laps around the equator. In this article, the study area is the downtown area of Nanjing, including Xuanwu, Gulou, Jianye and Qinhuai Districts, as shown in Figure 1.
Bicycle share ridership is affected by different weather variables, socio-demographic attributes, land use and built environment, among which the impact of several weather variables on bicycle trips has been investigated in many studies (43–49). This article obtains the datasets of dockless shared bicycles, road networks, traffic facility points, AOI data and administrative zoning data for Nanjing, as shown in Table 1. The dockless shared bicycle dataset includes the bike location data, including the date, bike ID, longitude and latitude of each bike from March 9 to April 8, 2018. The location data of 105,901 shared bicycles in the study area over 32 days include ~100 million records. The meteorological data, including temperature, wind speed, and precipitation from March 8 to April 2, 2018, with an interval of 1 h, were added as factors affecting bicycle trips. AOI is the area data used to represent each geographical entity, such as separate residential areas, independent commercial areas, and scenic zones. It is a carrier of all the social and economic activities of residents and reflects different types of urban functions. The attributes of each AOI include the name, address, category and latitude and longitude. The AOI is taken as the basic analysis unit in the study.
Periodicity of Shared Bicycle Trips
Time Interval of Periodic Series
The time interval for shared bicycle location data is 15 min. The number of bicycles in different time intervals was analyzed to determine the appropriate time interval for our study. Figure 2 shows the curves of bicycle numbers at 15, 30, 60, and 120 min intervals at the original points of residential areas of the study area on March 14. Figure 2 shows that the curve of the 1-h time interval can express the regularity of bicycle changes, and the curve is smoother than the time interval of 15 and 30 min. When the interval is >60 min, there are some difficulties in discovering the number changes of bicycle trips in 1 day. Therefore, this article takes 1 h as the time interval during the experiment to alleviate the impact of possible data loss on the overall law and effectively reduce the calculation rate.
Figure 2. The curves of bicycle numbers at different intervals. (A) 15 min. (B) 30 min. (C) 60 min. (D) 120 min.
Similarities in Different Types of AOIs
The possible trend similarity is explored to reflect the similar trends of cycling in different types of AOIs in the same time period. The cosine similarity can measure the trend similarity by comparing two vectors in a vector space. It evaluates the similarity between two vectors by calculating the cosine of the angle between them; thus it is not related to the specific value and is only affected by the angle of the two vectors. Formula (1) is used to calculate the cosine similarity cosθ (50).
where cosθ is the cosine similarity; Ai and Bi represent the components of vectors A and B, respectively; and n is the number of samples.
Table 2 lists the calculated cosine similarities among the AOIs of residential areas, commercial buildings and scenic areas. A higher cosine similarity indicates that the variation trend in the number of bicycle trips is similar within the same type of AOI. In contrast, a lower cosine similarity indicates that the variation trend in the number of bicycle trips is different in various types of AOIs, which are located in different urban function regions.
Daily Correlation
The daily changes in the number of bicycle trips on weekdays and weekends are considered to be different. The Pearson correlation coefficient is applied to calculate the daily correlation of the numbers of bicycle rides every 2 days in a week from March 12 to March 18. Formula (2) can calculate the Pearson correlation coefficient ρX, Y (51).
where X, Y are 2 days; n is the number of hours in each day, which is 24; Xi, Yi are the number of bicycles at the ith hour corresponding to X, Y; and are the average values of X, Y. When ρX, Y is −1, it means that X, Y are completely negatively correlated; when ρX, Y is 1, it means that X, Y are completely positively correlated; when ρX, Y is 0, it means that X, Y have no correlation.
As shown in Table 3, the correlation coefficients between weekdays are higher than 0.9. However, the correlation coefficients between weekdays and weekends are lower, ranging from 0.8 to 0.9. The correlation coefficient between Saturday and Sunday is 0.973, showing a high correlation. The results show that the daily cycling regularity is similar among weekdays and between weekends, but it is slightly different between weekends and weekdays.
Above all, the time interval for shared bicycle location data is determined to be 1 h in the constructed QPSO-LSTM model. In addition, due to the different regularity of changes, the number of bikes on weekdays and that on weekends need to be predicted separately, as does that in different AOIs.
Model Construction
The QPSO-LSTM model for predicting the OD quantity of shared bicycles is constructed as shown in Figure 3. The model is trained by using a training set and then is used to predict the number of shared bicycle trips in the future by using a test set. The model implementation process consists of three parts: data processing, the prediction model based on LSTM, and QPSO optimization.
First, the acquired cycling data were classified according to different AOI types to form cycling data with an interval of 1 h. Temperature, wind speed and precipitation were selected as the influencing factors of cycling and incorporated into the cycling data. The data underwent normalization, supervised learning and dataset division processing before model training and prediction. In particularly, 80% of the data were used as the training set and 20% were used as the validation set (52). Then, as a deep learning algorithm for sequential data prediction, LSTM is the core algorithm of the prediction model QPSO-LSTM. It was used to predict the OD quantity of shared bicycles by building a multilayer LSTM network. Finally, QPSO was applied to solve the optimization problem of the hyperparameters of the prediction model. By using QPSO, the hyperparameter combination suitable for the prediction model can be determined quickly to effectively improve the accuracy of the model.
Prediction Network Based on LSTM
The structure of the LSTM network of QPSO-LSTM is shown in Figure 4 below. Its input data are bicycle and meteorological data, and the output data are the number of bicycles needed in the future. The horizontal line through the entire memory cell is similar to a conveyor belt, indicating the state of the cell. There are three types of gates in LSTM: the forget gate, input gate and output gate. The forget gate determines how much information should be removed from the cell state at the last moment, as shown in the red box in Figure 4. The new input information is determined by the input gate in the cell state, as shown in the yellow box. The parts of the cell state that are utilized to generate the final output of the memory cell are decided by the output gate, as shown in the green box (54–56). In addition, the pink circle in the memory cell represents the point operation, while the yellow rectangle represents the activation function, and they are all used to compute the gates and the memory cell output.
Figure 4. Prediction network of QPSO-LSTM: Redrawn and extended from Li et al. (53).
QPSO Optimization
QPSO hyperparameter optimization mainly includes determining the parameters to be optimized, selecting the fitness function, and determining and updating the optimal positions of particles.
As shown in Figure 5, the time step, batch size and the numbers of nodes in the network hidden layers are selected as the parameters to be optimized in the prediction network, and each has an arrow pointing to it. The time step parameter determines the number of previous data moments in the model input at a certain moment. The batch size can effectively improve memory utilization and model training speed after optimization. A reasonable number of hidden layer nodes can avoid as much of the model over fitting phenomenon during training as possible. The change in the number of shared bicycle trips has a daily characteristic, so the value range of the time step is determined to be [1, 24]. The number of hidden layer nodes is determined according to the empirical rule (25), as shown in Formula (3).
Figure 5. Parameters to be optimized in the prediction network: 4 parameters are selected to be optimized.
where m, n are the number of input and output layers of the network, respectively, and a is any integer between 3 and 10. The number of hidden layers ranges from [5, 50], and the range of the batch size is set to [1, 30].
QPSO needs to determine the objective function to determine the pros and cons of the current particle. Commonly used evaluation indicators for regression models include root mean square error (RMSE), mean absolute error (MAE) and mean absolute percentage error (MAPE) (57). Formulas (4–6) below are used to calculate these evaluation indicators.
where m, h(xi), and yi are the length, predicted value and true value of the verification set, respectively.
When using QPSO to optimize the hyperparameters of our prediction network, MAPE is selected as the fitness function of the particle, which is defined by Formula (6).
According to the fitness function, the objective function value, the MAPE between the predicted value and the true value of the model, at each position is calculated. The smaller the objective function value is, the better the position.
In the whole learning process, the optimal position found by individual particles is pbesti, and the mean value of the best position of individual particles is mbest. The objective function values of all particles in the population are calculated many times during the learning process to determine the global optimal position gbest of each particle in all the learning processes. Formulas (7–9) can be used to calculate these values (35).
where M is the size of the particle group; xi is the position of the ith particle; α is the innovation parameter; and ∅ and u are uniformly distributed values on (0, 1).
Model Application and Verification
Model Results and Typical Case Analysis
The prediction models of different AOIs obtained after model training can be applied to predict the OD quantity of bicycles for all types of AOIs in the whole study area. According to the QPSO optimization results, the time step is 4, which means that the model uses the number of bicycles in the previous 4 moments to predict the number of bicycles in the next moment. In addition, the batch size and the node numbers of the first hidden layer and the second hidden layer are set to 28, 10, and 16, respectively.
For each AOI, the model to which it belongs is applied to predict the number of bicycles in each hour. For example, the prediction results of the OD quantities of bicycle rides at 8 am on March 14 are shown in Figure 6. At 8 A.M. in the study area, the bicycle OD quantities of the different regions are slightly different, which fully reflects the characteristics of residents' trips and verifies the effectiveness of the prediction model. As shown in the Figure 6, there are more origin points than destination points in residential areas. The AOI area of scenic areas accounts for the largest proportion of the total area of the study area, ~40.2%. Due to their larger area, there are more bikes in scenic areas, such as Purple Mountain and Xuanwu Lake, which are generally shown in red in the Figure 6.
Two typical areas, including residential areas and commercial buildings, are selected to use the corresponding model to predict the OD quantity of bicycles. The typical residential area selected is Huaxincheng, which is located near Yuantong station of subway Line 1 and Line 2, and leisure and entertainment facilities such as Hexi Central Park and Nanjing Famous Taiwan Goods City are around it. Taking it as an example, its bicycle OD data on the weekday of March 14 are selected to predict the supply and demand of bicycles by using the prediction model of residential area AOIs. The prediction of bicycle OD numbers in Huaxincheng is shown in Figure 7. The results show that bicycle OD numbers are higher in the morning and evening peak hours, but their changes are different. The peak value of the number of origin points in the morning peak period is greater than that in the evening peak period, while the peak value of the destination points in the evening peak period is higher than that in the morning peak period. This prediction conforms to the daily regularity of cycling for residents commuting to work.
The typical commercial building selected is Fenghuang Square located at No. 1, Hunan Road and is an innovative commercial mall. The cycling OD data of this area on March 14 are selected to predict the supply and demand of shared bicycles by using O and D models of commercial building AOIs. The predicted bike OD numbers of Fenghuang Square are shown in Figure 8. The result also reflects the characteristics of the morning and evening peaks. The number of bicycle destination points in the morning peak hours was slightly higher than that of origin points, while the number of origin points in the evening peak hours was much higher. This prediction is in line with the regularity of the use of bicycles near commercial buildings.
Model Evaluation
Bicycles have different cycling patterns in different types of AOIs. Using the constructed prediction model, bicycle supply and demand prediction models suitable for different types of AOIs, including residential areas, commercial buildings, scenic spot areas, science, education and cultural areas, and other service areas, are obtained after model training. The data used in model training are the series data of each type of AOI in the whole study area, with a total of 32 days of data, 80% of which is 25 days in total as the training set, and 20% of which is 7 days in total as the test set.
Figure 9 verifies the prediction accuracy of the OD distribution models of the residential areas. When verifying the model, the data from the week from March 12th to March 18th are selected as input. Each day consists of 24 h of data, with a total of 7*24, 168 pieces of input data. Since the time step set in the training model is 4, a total of 168*4 inputs are formed after conversion to supervised learning, and the range of predicted results is from 3 o'clock on March 12 to 23 o'clock on March 18. The results show that the predicted value and the true value of the QPSO-LSTM models are relatively consistent, as well as the change trend in the values. The peak times appearing each day are consistent, and the peak values are similar. In addition, residents have strong cycling regularity on weekdays but no obvious cycling regularity on weekends. There are several obvious morning peaks and evening peaks within a week, but the use of bicycles on weekends is greatly reduced. Specifically, for the origin points of the residential areas, as shown in Figure 9A, morning peaks are significantly higher than evening peaks, while for the destination points, as shown in Figure 9B, the comparison results are the opposite. The above conclusions are in line with the cycling characteristics of urban residents and verify the performance of the QPSO-LSTM prediction model.
Figure 9. Accuracy evaluation: true value and prediction of QPSO-LSTM in residential areas. (A) Origin points number. (B) Destination points number.
The BP network, RNN network and ARIMA model are applied to predict the number of bicycle origin points of residential areas from 4:00 on March 12 to 23:00 on March 18. The comparison results between the true value and prediction of these three models are shown in Figure 10. Comparing the prediction results of the three models with the true value, the change trends are roughly the same, and they all perform well at predicting the number of bicycles between peaks. However, there are some differences in the predictions for the periods around the peaks. The predicted values of ARIMA during these periods are closer to the true values than the other two models, especially in the periods near the low peaks. For example, from 0:00 to 6:00 every day, the ARIMA prediction is closer to the true value, while the prediction values of RNN and BP are significantly higher. In addition, compared with the predictions of BP and RNN, BP performs slightly better than RNN. As a result, ARIMA has the best prediction effect, followed by BP and finally RNN.
Figure 10. Accuracy evaluation: True value and prediction results of 3 models for the residential area bicycle origin point number from 4:00 on March 12 to 23:00 on March 18.
To verify the accuracy of the QPSO-LSTM model, a verification set is used to compare and verify the prediction accuracy of the other models constructed, including the BP network, RNN network, and ARIMA model, as shown in Table 4. Whether RMSE, MAE, or MAPE is used as the evaluation index, the error of the QPSO-LSTM model is smaller than that of RMSE, MAE, or MAPE, which means that the prediction accuracy of the QPSO-LSTM model is higher than that of theirs. Taking MAPE as an example, the QPSO-LSTM model has the lowest error value, which is 0.087. Conversely, it has the highest accuracy, which is up to ~91%. The error values of the other three models are all >0.1, which means that their accuracy is <90%. As a result, the QPSO-LSTM model is confirmed to have better accuracy and to be able to reasonably predict the supply and demand of shared bicycles of different AOIs.
Conclusions
The travel mode is a factor influencing environmental and public health that cannot be ignored (58, 59). Shared bicycles, as a healthy and environmentally sustainable travel mode, should be conveniently accessed by the public. Thus, it is necessary to predict the demand for shared bicycles and optimize the supply of shared bicycles in different urban regions. In this article, a bicycle prediction model called QPSO-LSTM is established, which aims to predict the number of bicycles at OD points in different regions. It is trained with the same dataset as BP, RNN, and ARIMA, and the result shows that QPSO-LSTM significantly outperforms the other models. Furthermore, the model is also applied to predict the bicycle numbers in two typical areas, and the prediction results validate the availability of the model.
Since QPSO-LSTM can predict the future supply and demand for shared bicycles of each analysis unit (AOI), the future distribution of shared bicycles can be rebalanced based on the prediction results. By designing scheduling schemes that optimize resource allocation, idle bikes can be dispatched in a timely manner to areas with large demand, so that the number of shared bicycles tends to be reasonable. Reasonable scheduling optimization can rebalance the distribution of shared bicycles, meet the needs of users for using bikes, improve user satisfaction, and ensure that the shared bicycle system is in a state of dynamic balance. Optimization of bicycle layout according to the actual demand and time series regularity can effectively guide the planning of urban green travel.
Although the QPSO-LSTM has been verified to be a useful method to predict the demand for shared bicycles, some limitations remain.
(1) The current QPSO-LSTM mainly used cycling data, and the weather data (e.g., temperature, wind speed and precipitation) were incorporated into the cycling data. However, other factors such as road networks, visibility in foggy weather, and seasonal changes in the weather might also affect the public's willingness to use bicycles and cause differences in OD distributions. Thus, these factors should also be taken into account in the prediction model.
(2) The performance of QPSO-LSTM was evaluated by using data from Nanjing. However, whether the model is a good choice for other similar cities still needs to be tested. In addition, the characteristics of dockless shared bicycle OD trips need to be analyzed to discover the driving mechanism of these characteristics.
(3) The prediction of the OD distribution provides opportunities for optimizing shared bicycle allocation. However, there is still a long way to go to keep the demand and supply of shared bicycles in a dynamic balance. The optimization of the layout of shared bicycles based on the prediction results is expected to be explored in further studies.
Data Availability Statement
The data and codes that support the findings of this study are available in GitHub with the identifier: https://github.com/SharingBikeNNU/QPSO-LSTM.
Author Contributions
MC: conceptualization, investigation, writing, and funding acquisition. YL: methodology and validation. YZ: methodology. GL: project administration. ZM: investigation and writing—review and editing. All authors contributed to the article and approved the submitted version.
Funding
This study was supported by the National Natural Science Foundation of China (Grant Nos. 41671385, 41622108, and 41871178), the Priority Academic Program Development of Jiangsu Higher Education Institutions (No. 164320H116).
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher's Note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Acknowledgments
We thank the editor and reviewers who provided helpful suggestions on ways to improve the paper.
References
1. DeMaio P. Bike-sharing: History, impacts, models of provision, and future. J Public Transport. (2009) 12:41–56. doi: 10.5038/2375-0901.12.4.3
2. Fu C, Guo Q. Road traffic injuries in shared bicycle riders in China. Lancet Public Health. (2018) 3:e111. doi: 10.1016/S2468-2667(18)30024-0
3. Beairsto J, Tian Y, Zheng L, Zhao Q, Hong J. Identifying locations for new bike-sharing stations in Glasgow: an analysis of spatial equity and demand factors. Ann GIS. (2021). doi: 10.1080/19475683.2021.1936172
4. Fishman E. Bikeshare: A review of recent literature. Transport Rev. (2016) 36:92–113. doi: 10.1080/01441647.2015.1033036
5. Ding D, Jia Y, Gebel K. Mobile bicycle sharing: the social trend that could change how we move. Lancet Public Health. (2018) 3:e215. doi: 10.1016/S2468-2667(18)30066-5
6. de Chardon CM. The contradictions of bike-share benefits, purposes and outcomes. Transport Res Part A: Policy Pract. (2019) 121:401–19. doi: 10.1016/j.tra.2019.01.031
7. Maas S, Attard M, Caruana MA. Assessing spatial and social dimensions of shared bicycle use in a Southern European island context: The case of Las Palmas de Gran Canaria. Transport Res Part A: Policy Pract. (2020) 140:81–97. doi: 10.1016/j.tra.2020.08.003
8. Kang Y, Zhang F, Gao S, Lin H, Liu Y. A review of urban physical environment sensing using street view imagery in public health studies. Ann GIS. (2020) 26:261–75. doi: 10.1080/19475683.2020.1791954
9. Murphy E, Usher J. The role of bicycle-sharing in the city: Analysis of the Irish experience. Int J Sustain Transport. (2015) 9:116–25. doi: 10.1080/15568318.2012.748855
10. Lee IM, Shiroma EJ, Lobelo F, Puska P, Blair SN, Katzmarzyk PT, et al. Effect of physical inactivity on major non-communicable diseases worldwide: an analysis of burden of disease and life expectancy. Lancet. (2012) 380:219–29. doi: 10.1016/S0140-6736(12)61031-9
11. Woodcock J, Tainio M, Cheshire J, O'Brien O, Goodman A. Health effects of the London bicycle sharing system: health impact modelling study. BMJ. (2014) 348:g425. doi: 10.1136/bmj.g425
12. Heinen E, Kamruzzaman M, Turrell G. The public bicycle-sharing scheme in Brisbane, Australia: Evaluating the influence of its introduction on changes in time spent cycling amongst a middle-and older-age population. J Transport Health. (2018) 10:56–73. doi: 10.1016/j.jth.2018.07.003
13. Fishman E, Washington S, Haworth N. Barriers and facilitators to public bicycle scheme use: A qualitative approach. Transport Res Part F: Traffic Psychol Behav. (2012) 15:686–98. doi: 10.1016/j.trf.2012.08.002
14. Zhang Y, Mi Z. Environmental benefits of bike sharing: A big data-based analysis. Appl Energy. (2018) 220:296–301. doi: 10.1016/j.apenergy.2018.03.101
15. Chen Z, van Lierop D, Ettema D. Exploring dockless bikeshare usage: A case study of Beijing, China. Sustainability. (2020) 12:1238. doi: 10.3390/su12031238
16. Hua M, Chen J, Chen X, Gan Z, Wang P, Zhao D. Forecasting usage and bike distribution of dockless bike-sharing using journey data. IET Intelligent Trans Syst. (2020) 14:1647–56. doi: 10.1049/iet-its.2020.0305
17. Bao J, Yu H, Wu J. Short-term FFBS demand prediction with multi-source data in a hybrid deep learning framework. IET Intelligent Transport Syst. (2019) 13:1340–7. doi: 10.1049/iet-its.2019.0008
18. Mooney SJ, Hosford K, Howe B, Yan A, Winters M, Bassok A, et al. Freedom from the station: Spatial equity in access to dockless bike share. J Transport Geography. (2019) 74:91–96. doi: 10.1016/j.jtrangeo.2018.11.009
19. Shaheen SA, Guzman S, Zhang H. Bikesharing in Europe, the Americas, and Asia: past, present, and future. Transport Res Rec. (2010) 2143:159–67. doi: 10.3141/2143-20
20. Jiang J, Lin F, Fan J, Lv H, Wu J. A destination prediction network based on spatiotemporal data for bike-sharing. Complexity. (2019) 2019:7643905. doi: 10.1155/2019/7643905
21. Shen J, Liu X, Chen M. Discovering spatial and temporal patterns from taxi-based Floating Car Data: A case study from Nanjing. GIScience Remote Sensing. (2017) 54:617–38. doi: 10.1080/15481603.2017.1309092
22. Chen M, Voinov A, Ames DP, Kettner AJ, Goodall JL, Jakeman AJ, et al. Position paper: Open web-distributed integrated geographic modelling and simulation to enable broader participation and applications. Earth-Sci Rev. (2020) 207:103223. doi: 10.1016/j.earscirev.2020.103223
23. Chen M, Lv G, Zhou C, Lin H, Ma Z, Yue S, et al. Geographic modeling and simulation systems for geographic research in the new era: Some thoughts on their development and construction. Sci China Earth Sci. (2021) 64:1207–23. doi: 10.1007/s11430-020-9759-0
24. Salih-Elamin R, Al-Deek H. Short-term prediction for bike share systems' travel time under the effects of weather conditions. Adv Transport Stud. (2020) 50:81–94. doi: 10.4399/97888255317326
25. Liu Q, Xue Y, Wu J. Forecast of bicycle sharing demand using BP neural network based on improved brain storm optimization. In: EITCE 2019: 3rd International Conference on Electronic Information Technology and Computer Engineering; 2019 Oct 18; Xiamen, China. IEEE (2019). doi: 10.1109/EITCE47263.2019.9095056
26. Lin F, Wang S, Jiang J, Fan W, Sun Y. Predicting public bicycle rental number using multi-source data. In: IJCNN 2017: International Joint Conference on Neural Networks; 2017 May 14; Anchorage, AK, USA. IEEE. (2017). doi: 10.1109/IJCNN.2017.7966030
27. Lingras P, Sharma S, Zhong M. Prediction of recreational travel using genetically designed regression and time-delay neural network models. Transport Res Rec. (2002) 1805:16–24. doi: 10.3141/1805-03
28. Van Lint JW, Hoogendoorn SP, van Zuylen HJ. Freeway travel time prediction with state-space neural networks: modeling state-space dynamics with recurrent neural networks. Transport Res Rec. (2002) 1811:30–9. doi: 10.3141/1811-04
29. Gers FA, Schmidhuber J, Cummins F. Learning to forget: Continual prediction with LSTM. Neural Comput. (2000) 12:2451–71. doi: 10.1162/089976600300015015
30. Gers F. Long Short-Term Memory in Recurrent Neural Networks. [dissertation]. [Lausanne]: École Polytechnique Fédérale De Lausanne (2001).
31. Hochreiter S, Schmidhuber J. Long short-term memory. Neural Comput. (1997) 9:1735–80. doi: 10.1162/neco.1997.9.8.1735
32. Xu C, Ji J, Liu P. The station-free sharing bike demand forecasting with a deep learning approach and large-scale datasets. Transport Res Part C: Emerg Technol. (2018) 95:47–60. doi: 10.1016/j.trc.2018.07.013
33. Ai Y, Li Z, Gan M, Zhang Y, Yu D, Chen W, et al. A deep learning approach on short-term spatiotemporal distribution forecasting of dockless bike-sharing system. Neural Comp App. (2019) 31:1665–77. doi: 10.1007/s00521-018-3470-9
34. Li Y, Shuai B. Origin and destination forecasting on dockless shared bicycle in a hybrid deep-learning algorithms. Multimedia Tools App. (2020) 79:5269–80. doi: 10.1007/s11042-018-6374-x
35. Kennedy J, Eberhart R. Particle swarm optimization. In: Proceedings of ICNN'95-International Conference on Neural Networks; 1995 Nov 27; Perth, WA, Australia. IEEE. (1995).
36. Sun J, Feng B, Xu W. Particle swarm optimization with particles having quantum behavior. In: Proceedings of the 2004 Congress on Evolutionary Computation; 2004 Jun 19; Portland, OR, USA. IEEE (2004).
37. Wang M, Wan Y, Ye Z, Lai X. Remote sensing image classification based on the optimal support vector machine and modified binary coded ant colony optimization algorithm. Inform Sci. (2017) 402:50–68. doi: 10.1016/j.ins.2017.03.027
38. Fan Y, Wang G, Lu X, Wang G. Distributed forecasting and ant colony optimization for the bike-sharing rebalancing problem with unserved demands. PLoS ONE. (2019) 14:e0226204. doi: 10.1371/journal.pone.0226204
39. Luo C, Huang C, Cao J, Lu J, Huang W, Guo J, et al. Short-term traffic flow prediction based on least square support vector machine with hybrid optimization algorithm. Neural Process Lett. (2019) 50:2305–22. doi: 10.1007/s11063-019-09994-8
40. Zhang ZG, Yin JC, Wang NN, Hui ZG. Vessel traffic flow analysis and prediction by an improved PSO-BP mechanism based on AIS data. Evolving Syst. (2019) 10:397–407. doi: 10.1007/s12530-018-9243-y
41. Zhang SQ, Lin KP. Short-term traffic flow forecasting based on data-driven model. Mathematics. (2020) 8:152. doi: 10.3390/math8020152
42. Zeng C, Hua C, Lei T, Xiao X. Short-term traffic flow prediction on campus based on modified PSOBP neural network. In: ICPMS2020: The Third International Conference on Physics; 2020 May 20-22; Kunming, China. IOP Publishing. (2020). doi: 10.1088/1742-6596/1592/1/012071
43. García-Palomares JC, Gutiérrez J, Latorre M. Optimizing the location of stations in bike-sharing programs: A GIS approach. Appl Geography. (2012) 35:235–46. doi: 10.1016/j.apgeog.2012.07.002
44. Imani AF, Eluru N. Analyzing destination choice preferences in bicycle sharing systems: an investigation of Chicago's Divvy System. J Transport Geography. (2015) 44:53–64. doi: 10.1016/j.jtrangeo.2015.03.005
45. Wang X, Lindsey G, Schoner JE, Harrison A. Modeling bike share station activity: Effects of nearby businesses and jobs on trips to and from stations. J Urban Plann Dev. (2016) 142:04015001. doi: 10.1061/(ASCE)UP.1943-5444.0000273
46. Zhang Y, Thomas T, Brussel M, Van Maarseveen M. Exploring the impact of built environment factors on the use of public bikes at bike stations: case study in Zhongshan, China. J Transport Geography. (2017) 58:59–70. doi: 10.1016/j.jtrangeo.2016.11.014
47. El-Assi W, Mahmoud MS, Habib KN. Effects of built environment and weather on bike sharing demand: a station level analysis of commercial bike sharing in Toronto. Transportation. (2017) 44:589–613. doi: 10.1007/s11116-015-9669-z
48. Du M, Cheng L. Better understanding the characteristics and influential factors of different travel patterns in free-floating bike sharing: Evidence from Nanjing, China. Sustainability. (2018) 10:1244. doi: 10.3390/su10041244
49. Scott DM, Ciuro C. What factors influence bike share ridership? An investigation of Hamilton, Ontario's bike share hubs. Travel Behav Soc. (2019) 16:50–8. doi: 10.1016/j.tbs.2019.04.003
50. Bhopale AP, Tiwari A. Swarm optimized cluster based framework for information retrieval. Expert Syst App. (2020) 154:113441. doi: 10.1016/j.eswa.2020.113441
51. Bommisetty RM, Prakash O, Khare A. Keyframe extraction using Pearson correlation coefficient and color moments. Multimedia Syst. (2019) 26:267–99. doi: 10.1007/s00530-019-00642-8
52. Sohrabi S, Ermagun A. Dynamic bike sharing traffic prediction using spatiotemporal pattern detection. Transport Res Part D: Transport Environ. (2021) 90:102647. doi: 10.1016/j.trd.2020.102647
53. Li YY, Van Do T, Nguyen HT. A comparison of forecasting models for the resource usage of MapReduce applications. Neurocomputing. (2020) 418:36–55. doi: 10.1016/j.neucom.2020.07.059
54. Moon T, Ahn TI, Son JE. Long short term memory for a model free estimation of macronutrient ion concentrations of root zone in closed loop soilless cultures. Plant methods. (2019) 15:59. doi: 10.1186/s13007-019-0443-7
55. Zhao Y, Liu Q, Zhang Z, Bi Z, Bai Y, Zhou Y, et al. Estimating the Q-marker concentrations of Salvia miltiorrhiza via a long short-term memory algorithm using climatic factors and metabolic profiling. Indus Crops Products. (2020) 156:112883. doi: 10.1016/j.indcrop.2020.112883
56. Lipton ZC, Berkowitz J, Elkan C. A critical review of recurrent neural networks for sequence learning. arXiv[Preprint].arXiv:1506.00019. (2015). doi: 10.48550/arXiv.1506.00019
57. Peng Y, Xiang W. Short-term traffic volume prediction using GA-BP based on wavelet denoising and phase space reconstruction. Phys A: Statist Mech App. (2020) 549:123913. doi: 10.1016/j.physa.2019.123913
58. Wu L, Wang W, Jing P, Chen Y, Zhan F, Shi Y, et al. Travel mode choice and their impacts on environment—a literature review based on bibliometric and content analysis, 2000–2018. J Clean Product. (2020) 249:119391. doi: 10.1016/j.jclepro.2019.119391
Keywords: dockless shared bicycles, origin and destination (OD), OD distribution, QPSO, LSTM
Citation: Cao M, Liang Y, Zhu Y, Lü G and Ma Z (2022) Prediction for Origin-Destination Distribution of Dockless Shared Bicycles: A Case Study in Nanjing City. Front. Public Health 10:849766. doi: 10.3389/fpubh.2022.849766
Received: 06 January 2022; Accepted: 28 February 2022;
Published: 08 April 2022.
Edited by:
Hongtao Yi, The Ohio State University, United StatesReviewed by:
Li He, Xi'an Jiaotong University, ChinaXintao Liu, Hong Kong Polytechnic University, Hong Kong SAR, China
Copyright © 2022 Cao, Liang, Zhu, Lü and Ma. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Zaiyang Ma, enlfbWEzMjdAMTI2LmNvbQ==