A Novel Travel Adviser Based on Improved Back-propagation Neural Network

Similar documents
Basketball field goal percentage prediction model research and application based on BP neural network

Neural Nets Using Backpropagation. Chris Marriott Ryan Shirley CJ Baker Thomas Tannahill

The Application of Pedestrian Microscopic Simulation Technology in Researching the Influenced Realm around Urban Rail Transit Station

B. AA228/CS238 Component

EVOLVING HEXAPOD GAITS USING A CYCLIC GENETIC ALGORITHM

CALIBRATION OF THE PLATOON DISPERSION MODEL BY CONSIDERING THE IMPACT OF THE PERCENTAGE OF BUSES AT SIGNALIZED INTERSECTIONS

The Incremental Evolution of Gaits for Hexapod Robots

LOCOMOTION CONTROL CYCLES ADAPTED FOR DISABILITIES IN HEXAPOD ROBOTS

1.1 The size of the search space Modeling the problem Change over time Constraints... 21

SIDT 2017 XXII SEMINARIO SCIENTIFICO DELLA SOCIETÀ ITALIANA DOCENTI DI TRASPORTI

Neural Networks II. Chen Gao. Virginia Tech Spring 2019 ECE-5424G / CS-5824

Evolving Gaits for the Lynxmotion Hexapod II Robot

EE 364B: Wind Farm Layout Optimization via Sequential Convex Programming

Ocean Fishing Fleet Scheduling Path Optimization Model Research. Based On Improved Ant Colony Algorithm

An approach for optimising railway traffic flow on high speed lines with differing signalling systems

Research Article Genetic Algorithm for Multiple Bus Line Coordination on Urban Arterial

Analysis of Pressure Rise During Internal Arc Faults in Switchgear

FORECASTING OF ROLLING MOTION OF SMALL FISHING VESSELS UNDER FISHING OPERATION APPLYING A NON-DETERMINISTIC METHOD

Predicting the Total Number of Points Scored in NFL Games

Chapter 23 Traffic Simulation of Kobe-City

Heart Rate Prediction Based on Cycling Cadence Using Feedforward Neural Network

Advanced Materials Research Vol

Dynamic configuration of QC allocating problem based on multi-objective genetic algorithm

Gait Evolution for a Hexapod Robot

Comparative Study of VISSIM and SIDRA on Signalized Intersection

Roundabouts along Rural Arterials in South Africa

Detection of Proportion of Different Gas Components Present in Manhole Gas Mixture Using Backpropagation Neural Network

A Novel Gear-shifting Strategy Used on Smart Bicycles

Calibration and Validation of the Simulation Model. Xin Zhang

Three New Methods to Find Initial Basic Feasible. Solution of Transportation Problems

ROSE-HULMAN INSTITUTE OF TECHNOLOGY Department of Mechanical Engineering. Mini-project 3 Tennis ball launcher

A Novel Approach to Predicting the Results of NBA Matches

On-Road Parking A New Approach to Quantify the Side Friction Regarding Road Width Reduction

Influence of Forecasting Factors and Methods or Bullwhip Effect and Order Rate Variance Ratio in the Two Stage Supply Chain-A Case Study

Gas-liquid Two-phase Flow Measurement Using Coriolis Flowmeters Incorporating Neural Networks

A Study on Weekend Travel Patterns by Individual Characteristics in the Seoul Metropolitan Area

Optimizing Cyclist Parking in a Closed System

An Empirical Analysis of the Impact of Renewable Portfolio Standards and Feed-in-Tariffs on International Markets.

intended velocity ( u k arm movements

Environmental Science: An Indian Journal

Cycling Volume Estimation Methods for Safety Analysis

Predicting National Gas Consumption in Iran using a Hierarchical Combination of Neural Networks and Genetic Algorithms

DOI /HORIZONS.B P23 UDC : (497.11) PEDESTRIAN CROSSING BEHAVIOUR AT UNSIGNALIZED CROSSINGS 1

Active Travel and Exposure to Air Pollution: Implications for Transportation and Land Use Planning

Evacuation Time Minimization Model using Traffic Simulation and GIS Technology

Torpedoes on Target: EAs on Track

A Brief History of the Development of Artificial Neural Networks

#19 MONITORING AND PREDICTING PEDESTRIAN BEHAVIOR USING TRAFFIC CAMERAS

Journal of Emerging Trends in Computing and Information Sciences

Traffic simulation of Beijing West railway station North area

Analysis and Research of Mooring System. Jiahui Fan*

Keywords: multiple linear regression; pedestrian crossing delay; right-turn car flow; the number of pedestrians;

Optimization and Search. Jim Tørresen Optimization and Search

EFFICIENCY OF TRIPLE LEFT-TURN LANES AT SIGNALIZED INTERSECTIONS

Access BART: TOD and Improved Connections. October 29, 2008

Journal of Chemical and Pharmaceutical Research, 2014, 6(3): Research Article

Pedestrian Behaviour Modelling

ORF 201 Computer Methods in Problem Solving. Final Project: Dynamic Programming Optimal Sailing Strategies

MIKE NET AND RELNET: WHICH APPROACH TO RELIABILITY ANALYSIS IS BETTER?

Examining Potential Travel Time Savings Benefits due to Toll Rates that Vary By Lane

A HYBRID METHOD FOR CALIBRATION OF UNKNOWN PARTIALLY/FULLY CLOSED VALVES IN WATER DISTRIBUTION SYSTEMS ABSTRACT

67. Sectional normalization and recognization on the PV-Diagram of reciprocating compressor

Calculation of Trail Usage from Counter Data

BICYCLE SHARING SYSTEM: A PROPOSAL FOR SURAT CITY

MICROSIMULATION USING FOR CAPACITY ANALYSIS OF ROUNDABOUTS IN REAL CONDITIONS

ARTIFICIAL NEURAL NETWORK BASED DESIGN FOR DUAL LATERAL WELL APPLICATIONS

Road Data Input System using Digital Map in Roadtraffic

Utilization of the spare capacity of exclusive bus lanes based on a dynamic allocation strategy

Simulation Study of Dedicated Bus Lanes on Jingtong Expressway in Beijing

Determining Occurrence in FMEA Using Hazard Function

Transformer fault diagnosis using Dissolved Gas Analysis technology and Bayesian networks

Emergent walking stop using 3-D ZMP modification criteria map for humanoid robot

Building an NFL performance metric

The Development and Policy Recommendations for Dockless Bike Share (DBS) in China

Traffic circles. February 9, 2009

Design of a Pedestrian Detection System Based on OpenCV. Ning Xu and Yong Ren*

Queue analysis for the toll station of the Öresund fixed link. Pontus Matstoms *

Neural Network in Computer Vision for RoboCup Middle Size League

An Analysis of the Travel Conditions on the U. S. 52 Bypass. Bypass in Lafayette, Indiana.

Chapter 12 Practice Test

Demonstration of Possibilities to Introduce Semi-actuated Traffic Control System at Dhanmondi Satmasjid Road by Using CORSIM Simulation Software

Optimal Signal Timing Method of Intersections Based on Bus Priority

Application of Bayesian Networks to Shopping Assistance

Planning Daily Work Trip under Congested Abuja Keffi Road Corridor

Global Journal of Engineering Science and Research Management

Traveling Salesperson Problem and. its Applications for the Optimum Scheduling

Evaluation and further development of car following models in microscopic traffic simulation

Robot Walking with Genetic Algorithms

AIS data analysis for vessel behavior during strong currents and during encounters in the Botlek area in the Port of Rotterdam

At each type of conflict location, the risk is affected by certain parameters:

Models for Pedestrian Behavior

The Role of Olive Trees Distribution and Fruit Bearing in Olive Fruit Fly Infestation

Analysis and realization of synchronized swimming in URWPGSim2D

u = Open Access Reliability Analysis and Optimization of the Ship Ballast Water System Tang Ming 1, Zhu Fa-xin 2,* and Li Yu-le 2 " ) x # m," > 0

Open Research Online The Open University s repository of research publications and other research outputs

CS472 Foundations of Artificial Intelligence. Final Exam December 19, :30pm

Legendre et al Appendices and Supplements, p. 1

EXPERIMENTÁLNÍ ANALÝZA MLP SÍTÍ PRO PREDIKCI POHYBU PLIC PŘI DÝCHÁNÍ Experimental Analysis of MLP for Lung Respiration Prediction

Blocking time reduction for level crossings using the genetic algorithm

Traffic Impact Study. Westlake Elementary School Westlake, Ohio. TMS Engineers, Inc. June 5, 2017

Transcription:

216 7th International Conference on Intelligent Systems, Modelling and Simulation A Novel Travel Adviser Based on Improved Back-propagation Neural Network Min Yang Department of Electronic Engineering Tsinghua University Beijing, China m-yang13@mails.tsinghua.edu.cn Xuedan Zhang Department of Electronic Engineering Tsinghua University Beijing, China zhangxuedan@sz.tsinghua.edu.cn Abstract Nowadays, public bicycle system has emerged as a good solution to terminal traffic. Compared with other public transportation systems (like bus or taxi), public bicycle system is clean, cheap, flexible and convenient. In particular, public bicycle doesn t need to follow fixed schedule. However, this flexibility of public bicycle system also brings in a problem: we don t know whether there exist available bikes when we get to a certain bike station. This paper proposes a novel method to predict the number of available bikes after a given period of time so as to optimize users travel choice. In our paper, we use improved backpropagation neural network as our prediction algorithm and a novel prediction model considering the impact of surrounding stations. Experimental results show that our novel method can properly handle this non-linear problem. Keywords - public bicycle system; back-propagation neural network; terminal traffic; time pattern I. INTRODUCTION With the rapid expansion of urban population and extreme lack of fossil energy, building a green and efficient public transportation system has nowadays became a big challenge. Generally speaking, there exist 3 public transportation systems: bus system, taxi system and subway system. Bus system and subway system will undoubtedly be the first choice in terms of cleanness. However, neither subway nor bus system can completely handle the problem of terminal traffic: people who choose these two means of transportation still need to walk to their final destinations after they get off at corresponding stations. In terms of convenience, taxi is relatively a better choice. But taxi will cost the users more and pollute the air because of the use of gas. Public bicycle system has drawn great attention for its advantages on cost, environmental protection, user experience and convenience. In general, public bicycle system has following strong points: (1) cleanness. Public bicycle system can effectively reduce urban air pollution, improve urban air quality, save fossil energy, serve as an exercise mode, enhance people's physical fitness and promote the city's image; (2) flexibility. Public bicycle system doesn t need to follow fixed schedule. So the users can start their journey whenever they want, which consequently promote users travel efficiency greatly; (3) convenience. Users can ride a bike to their final destinations. This means public bicycle system can properly handle the problem of terminal traffic; (4) low cost. Public bicycle system has a relatively lower cost and saves road resources. It can effectively relieve the lack of parking space and traffic jam. As discussed above, public bicycle system doesn t need to follow fixed schedule. This flexibility brings high efficiency as well as uncertainty [1]. We don t know whether there are available bikes or bike stands when we get to a certain bike station. Fig. 1 describes this situation. Suppose the user s name is Jack. There are many nearby bike stations to choose. But he doesn t know whether there are bikes left when he gets to the corresponding station. So to us the biggest challenge is to accurately predict the number of available bikes or bike stands after a given period of time. Many talented scholars have already done meaningful research in this field. Froehlich et al. [2] use Bayesian Networks for short-term and long-term (5- minute and 2-hour-ahead) predictions of bike availability in Barcelona s Bicing system. Their prediction model incorporates the time of the day, currently available bikes and prediction window. Experimental results show remarkable improvements when compared with last view and historic trend algorithm. Kaltenbrunner et al. [3] use another quiet different algorithm. Considering the time sequence of available bikes, they use Autoregressive Moving Average (ARMA) time series model. Experimental results show that the number of input surrounding stations has an optimum. Based on the instability of available bikes number, Yoon et al. [4] use Autoregressive Integrated Moving Average (ARIMA) time series model to promote prediction accuracy. Chen et al. [1] not only consider internal, but also the impact of external factors (such as weather) on the prediction results. By applying Generalized Additive Model (GAM) [5], they can predict the number of available bikes as well as provide estimates of the waiting time distribution when there are not available bikes. Labadi et al. [14] develop an original discrete event approach for modelling and performance evaluation of public bicycle system by using Petri nets with time, inhibitor arcs and variable arc weights. 2166-67/16 $31. 216 IEEE DOI 1.119/ISMS.216.15 283

In this paper, we mainly address the problem mentioned above: choosing the fittest bike station at the very beginning. In other words, we provide a novel method to predicting the number of available bikes after a given period of time. In our paper, we use a kind of improved back-propagation neural network as our prediction algorithm and provide a novel prediction model. After that, we discuss factors that impact the performance of our prediction. At last, we will say something about our future work. Station B Station C Station A 6 min 8 min 4 min Jack 7 min Station F 5 min 4 min Station D Station E Figure 1. The situation of choosing the right station II. DATASET AND PARAMETER DEFINITION A. Bicing System Bicing is the name of a public bicycle system in Barcelona established on March 22, 27. As a thirdgeneration public bicycle system, it is similar to the Vélo'v service in Lyon or SmartDC system in Washington, DC. Its purpose is to handle the problem of terminal traffic and support a new means of public transportation. In general, there are about 421 stations in this system and the government of Barcelona also maintain a website from which we can get the real-time data (includes the number of available bikes and bike slots) of all stations. In fact, the data used in our paper was crawled from this website. B. set We got the data used in this paper by crawling the website of Bicing system. The website offers real-time data of 421 bicycle stations. Our program collects station ID, available bikes and available bike slots of each of the 421 bicycle stations in Barcelona every 5 minutes. We separate the collected data into 2 parts. The first part is the data we collected from June 1st to July 17th, and we use it as our training set; the second part is the data we collected from September 1st to September 3th, and we use it as test set. Some of the data we collected is useless (because of the disconnection of network) and it may do harm to our prediction accuracy. So we reject the useless data and set its value to the previous value to get better prediction result. C. Parameter Definition In this part, I will give the definition of some parameters that will be used in our paper. Normalized available bikes (NAB): ΝΑΒ = α / ( α + β ). (1) In this equation, α stands for the number of available bikes at time t, β stands for the number of available bike stands at time t. Since the sum of available bikes and available bike slots is not a constant (some bikes or bike slots may be damaged, so we cannot use them), NAB can effectively reflect the percentage of available bikes. Normalized activity score (NAS): NAS = γ α / ( α + β ). (2) In this equation, γ stands for the number of available bikes at time t-1. NAS can effectively indicate how active a station is at a given time t. In this paper, we use the parameter to show the activeness of a station. D. Time pattern analysis Before we start our prediction, we must first discuss the time pattern of bike stations to optimize our prediction result [13]. In our paper, we use the two parameters defined in the last part (NAB and NAS) to indicate the time pattern of bike stations. Fig. 2 compares weekday NAB with weekend NAB over 24 hours at Station47; Fig. 3 compares weekday NAS with weekend NAS over 24 hours at Station47; According to these two pictures, we can see that both NAB and NAS differ greatly on weekdays and weekends. So we must discuss these two situations separately. In the following part of this article we only discuss weekday situation and we can handle weekend situation with the same method. normalized available bikes(station 47).9.8.7.6.5.4.3.2.1 weekday weekend 5 1 15 2 25 time(h) Figure 2. Comparison of weekday and weekend NAB normalized activity score(station 88).12.1.8.6.4.2 weekday weekend 5 1 15 2 25 time(h) Figure 3. Comparison of weekday and weekend NAS 284

III. PREDICTION METHODOLOGY A. Back-propagation Neural Network and Genetic Algorithm Back-propagation (BP) neural network [6] is a kind of multi-layer feed-forward neural network. There are connections between neurons of adjacent layers while the errors of the output layer provide feedback to the input layer. To be exactly, BP neural network consists of 3 lays: the input layer, the middle layer (often called the hidden layer ) and the output layer (we can see the structure clearly in Fig. 4). Inputs propagate from input layer to output layer while errors propagate in the opposite direction. It is a common method of training artificial neural networks used in conjunction with an optimization method such as gradient descent. The method calculates the gradient of a loss function with respect to all the weights in the network. The gradient is fed to the optimization method which in turn uses it to update the weights, in an attempt to minimize the loss function. Genetic algorithm (GA) [7] is a search heuristic that mimics the process of natural selection. This heuristic is often used to figure out optimal solutions to optimization and search problems. In fact, Genetic algorithm belongs to evolutionary algorithm (EA), which generates solutions to optimization problems using techniques inspired by natural evolution, such as inheritance, mutation, selection, and crossover. In other words, when we want to solve optimization problems, we don t need to think about exactly what we should do. On the contrary, we just need to figure out the fitness function and apply it to GA. In this way, we can easily get the final result without excessive deduction. In this paper, we propose a novel prediction model which considers the effect of surrounding bicycle stations (Fig. 6). In contrast to ordinary prediction model (Fig. 5), the bicycle number of surrounding stations can be explicitly factored into prediction model, resulting in significant gains in terms of prediction accuracy. In the ordinary model, we don t consider the impact of surrounding stations on the prediction result. This model has 3 inputs: (1) time point. It represents time of the day and has 24 possible values corresponding to each of the hours of the day; (2) available bikes. It is the value of NAB at time t and is discretized into five categories, where a value of one corresponds to ~2%, two corresponds to 2%~4%, etc.; (3) prediction window. The size of the prediction window, with six possible values corresponding to 1, 2, 3, 6, 9 and 12 minutes. This model also has an output named delta, which corresponds to the change of NAB from time t to time t+pw. So the final prediction result is made by adding the value of delta to current NAB. In our new prediction model, we consider the impact of surrounding stations (in this paper we only consider the impact of their NABs). Compared with the ordinary model, the new model has more inputs. They are the NABs of the chosen surrounding stations at time t. Fig. 7 and Fig. 8 shows the performance curves of ordinary and new prediction model (use improved BP as prediction algorithm). Table 1 compares their performances in detail. As shown in table 1, our new model obtains about 38.2% improvement in performance result. X1 Input 1 Hidden 1 Output 1 X2 Input 2 Hidden 2 Output 2 XN Input n Hidden m Output l Figure 4. The structure of back-propagation neural network Figure 5. Ordinary prdiction model B. Prediction Model 1) A novel prediction model 285

.5.45 Prediction error (use real value as unit).4.35.3.25.2.15.1.5 Figure 6. New prediction model 2 4 6 8 1 12 14 Number of surrounding stations.4 Ordinary model.4 New model Figure 9. The determination of optimal number Prediction=.52* +.23.3.2.1 -.1 -.2 -.2 -.1.1.2.3.4 Figure 7. The fitted curve of ordinary model TABLE I. Prediction=.71* + -.27.3.2.1 -.1 -.2 -.2 -.1.1.2.3.4 Figure 8. The fitted curve of new model PREDICTION ERROR OF DIFFERENT MODELS 3) the selection strategy of surrounding stations In the last section, we have discussed the optimal number of considered surrounding stations. And in that part, we choose these stations by distance. In other words, we choose the nearest surrounding stations. And in this part, we will choose surrounding stations that share the same time pattern (we use NAB) with our target station. Fig. 1 shows the fitted curve of considering 1 surrounding stations that have approximate time pattern. Compared with Fig. 8, we can see that when we consider stations that have approximate time pattern, we can get better prediction accuracy. Stations has approximate time pattern ARMA [8] GRNN Improved BP.4.3 ordinary model new model(consider surrounding nearest 1 stations) 3.12(PW=2min) 3.89(PW=2h) 2.1(PW=2min) 2.32(PW=2h) 1.95(PW=2min) 2.24(PW=2h) 1.2(PW=2min) 1.38(PW=2h) 1.86(PW=2min) 2.14(PW=2h) 1.15(PW=2min) 1.32(PW=2h) Prediction=.8* +.21.2.1 -.1 2) the number of surrounding stations According to our experimental results, we know that the number of surrounding stations considered in our prediction model also has a great influence on the final prediction result. As shown in Fig. 9, when the number of surrounding stations considered in our model is too big or too small, the prediction accuracy will descend. In this case, the best number of surrounding stations is 1. In other words, when we determine our prediction model, we must first calculate the fittest number of considered surrounding stations. -.2 -.2 -.1.1.2.3.4 Figure 1. The fitted curve of considering 1 surrounding stations that have approximate time pattern C. Improved BP Neural Network In this paper, we choose a kind of improved BP neural network as our prediction algorithm. As we know, the performance of BP neural network is determined by 3 factors: the structure of neural network (in this case we mean the number of hidden layers and their corresponding 286

neurons), the initial weights and biases and its training method. As to the number of hidden layers and the number of neurons each layer has, we often follow empirical equations listed below: m= n+ l + α (3) m= log n (4) m= nl (5) In the above 3 equations, stand for the number of neurons in hidden layers, input layer and output layer, respectively. stands for a constant ranging from zero to one. And we often choose gradient descent method as training algorithm. However, this training algorithm doesn t guarantee global optimization solution. In fact, we often get local optima because of random initial weights and biases. To solve this problem, we use GA to get the optimum initial weights and biases. The improved algorithm based on genetic algorithm mainly fix following steps: 1) Use the fittest empirical equation to determine the structure of BP neural network(often we only need one hidden layer); 2) Choose gradient descent method as training algorithm; 3) Determine the fitness function. As to the fitness function, the fittest chromosome has the biggest value, in our program, we use square of the difference between output computed with initial weights and biases and true output as our fittest function; 4) Initiation. The population size depends on the nature of the problem, but typically contains several hundreds or thousands of possible solutions. Often, the initial population is generated randomly, allowing the entire range of possible solutions (the search space). 5) Selection. Each generation we choose chromosomes that have bigger fitness; 6) Mutation and crossover. Use mutation and crossover to get the next generation; 7) Repeat step(5)~step(6) until the number of generation get its maximal value; 8) Use the initial weights and biases for the training of BP neural network; Fig. 11 indicate the training procedures of improved BP neural network. From this figure we can see that the neural network gets its target after 15 iterations. Compared with the original BP neural network (as displayed in Fig. 12), the improved one has a faster convergence velocity. Fig. 13 and Fig. 14 show the performance curves of improved BP neural network and original BP neural network (they share the same structure and training method). And table 2 compared their performance clearly. From this table we can see that the improved BP neural network obtains 24.3% improvement 2 in performance. In other words, the improved algorithm has better generalization ability. Mean Squared Error (mse) 1 2 1 1 1 1-1 1-2 Best Training Performance is.92792 at epoch 15 1-3 5 1 15 15 Epochs Figure 11. training procedures of improved BP neural network Prediction=.43* + -.91.4.3.2.1 -.1 -.2 Original BP neural network -.2 -.1.1.2.3.4 Figure 13. The fitted curve of ordinary BP neural network TABLE II. The style of neural network Train Best Goal Mean Squared Error (mse) 1 6 1 4 1 2 1 1-2 Best Training Performance is.52959 at epoch 5 Train Best Goal 5 1 15 2 25 3 35 4 45 5 5 Epochs Figure 12. training procedures of ordinary BP neural network Prediction~=.71* + -.27.4.3.2.1 -.1 -.2 Advanced BP neural network -.2 -.1.1.2.3.4 Figure 14. The fitted curve of improved BP neural network PREDICTION ERROR OF ORDINARY AND IMPROVED BP NEURAL NETWORK The number of neurons in input layer The number of neurons in hidden layer Standard BP 13 25 Improved BP 13 25 Prediction errors 1.52(PW=3min) 2.8(PW=12min) 1.15(PW=3min) 1.32(PW=12min) D. Factors that impact prediction result In this part, we use the novel model and improved BP neural network to illustrate factors that impact prediction result [13]. 1) Prediction Window As shown in Fig. 14, the prediction value is about.71 times of real value. That is to say, the error is about.29 times of real value. As we know, real value (real 287

delta) will increase with the increase of prediction window. So we can conclude that the error will increase as a consequence. On the other hand, real value (real data) will decrease with the decrease of prediction window, then the error will decrease consequently. In a word, bigger prediction window will contribute to bigger error. 2) Time Point of Prediction We can see from Fig. 2 that NAS gets its first peak on 9: a.m. and first valley on 5: a.m. This tells us that the bicycle station is active around 9: a.m. and relatively inactive around 5: a.m. To explore the impact of time point we can make our prediction on 9: a.m. and 5: a.m. and compare their prediction results. Fig. 15 and Fig. 16 show the prediction results in these two situations. From these two pictures, we know that prediction result will be more accurate when the station is relatively not so active. Prediction = 1.1* + -.58.2.15.1.5 -.5 5 a.m. -.1 -.1 -.5.5.1.15.2 Figure 15. The fitted curve on 5: a.m. Prediction = 1.3* + -.12.2.15.1.5 -.5 9 a.m. -.1 -.1 -.5.5.1.15.2 Figure 16. The fitted curve on 9: a.m. IV. CONCLUSION AND FUTURE WORK Public bicycle system nowadays has emerged as a good solution to terminal traffic [12]. Compared with other public transportation systems (like bus or taxi), public bicycle system is clean, cheap, flexible and convenient. However, the flexibility of public bicycle also brings in a problem: we don t know whether there exist available bikes when we get to a certain bike station. In this paper we have handled this problem. We use improved back-propagation neural network as our prediction algorithm and a novel prediction model considering the impact of surrounding stations to predict the number of available bikes at the given station after a given period of time. Prediction results show that our novel method can get a relatively better prediction result than previous prediction model with ordinary backpropagation neural network. As we know, the distribution of bike stations is also a critical problem in public bicycle system [9~11]. In the future, we will mainly do some research on the optimal distribution of bike stations to promote global efficiency. REFERENCES [1] Chen, Bei, et al. "Uncertainty in urban mobility: Predicting waiting times for shared bicycles and parking lots." proc. of ITSC. Vol. 13. 213. [2] Froehlich, Jon, Joachim Neumann, and Nuria Oliver. "Sensing and Predicting the Pulse of the City through Shared Bicycling." IJCAI. 29. [3] Kaltenbrunner, Andreas, et al. "Urban cycles and mobility patterns: Exploring and predicting trends in a bicycle-based public transport system." Pervasive and Mobile Computing 6.4 (21): 455-466. [4] Yoon, Ji Won, Fabio Pinelli, and Francesco Calabrese. "Cityride: a predictive bike sharing journey advisor." Mobile Management (MDM), 212 IEEE 13th International Conference on. IEEE, 212. [5] Hastie, Trevor J., and Robert J. Tibshirani. Generalized additive models. Vol. 43. CRC Press, 199. [6] Goh, A. T. C. "Back-propagation neural networks for modeling complex systems." Artificial Intelligence in Engineering 9.3 (1995): 143-151. [7] Houck, Christopher R., Jeff Joines, and Michael G. Kay. "A genetic algorithm for function optimization: a Matlab implementation." NCSU-IE TR 95.9 (1995). [8] Kutner, Michael H. Applied linear statistical models. Vol. 4. Chicago: Irwin, 1996. [9] Lin, Jenn-Rong, and Ta-Hui Yang. "Strategic design of public bicycle sharing systems with service level constraints." Transportation research part E: logistics and transportation re-view 47.2 (211): 284-294 [1] Girardin, Fabien, et al. "Digital footprinting: Uncovering tourists with user-generated con-tent." Pervasive Computing, IEEE 7.4 (28): 36-43. [11] Krykewycz, Gregory R., et al. "Defining a primary market and estimating demand for major bicycle-sharing program in Philadelphia, Pennsylvania."Transportation Research Record: Journal of the Transportation Research Board 2143.1 (21): 117-124. [12] Liu, Zhili, Xudong Jia, and Wen Cheng. "Solving the last mile problem: Ensure the success of public bicycle system in beijing." Procedia-Social and Behavioral Sciences 43 (212): 73-78. [13] Yang, Min, Yingnan Guang, and Xuedan Zhang. "Public Bicycle Prediction Based on Generalized Regression Neural Network." Internet of Vehicles-Safe and Intelligent Mobility. Springer International Publishing, 215. 363-373. [14] Labadi, Karim, et al. "Stochastic Petri Net Modeling, Simulation and Analysis of Public Bicycle Sharing Systems." (214). 288