Abstract
With the continuous increase in the number of flights, the use of airport collaborative decision-making(A-CDM) systems has been more and more widely spread. The accuracy of the taxi time prediction has an important effect on the A-CDM calculation of the departure aircraft’s take-off queue and the accurate time for the aircraft block-out. The spatial-temporal-environment deep learning (STEDL) model is presented to improve the prediction accuracy of departure aircraft taxi-out time. The model is composed of time-flow sub-model (airport capacity, number of taxiing aircraft, and different time periods), spatial sub-model (taxiing distance) and environmental sub-model (weather, air traffic control, runway configuration, and aircraft category). The STEDL model is used to predict the taxi time of departure aircraft at Hong Kong Airport and the results show that the STEDL method has a prediction accuracy of 95.4%. The proposed model also greatly reduces the prediction error rate compared with the other machine learning methods.
Keywords
air transportation; taxi time; deep learning; surface movement; convolutional neural network (CNN)
With the development of civil aviation transportation industry, the number of take-off and landing flights in China continues to grow, but the punctuality rate of flights continues to decline with the increase of the number of flights. The decrease in punctuality rate is not only due to the limitation of airspace capacity, but also the impact of airport operations. In busy airports, departure and landing aircraft need to share some taxiways due to the complex layout of the airport, which may cause airports’ high load operation for a long time. Some factors like runway configuration, boarding gate assignment, taxiing path planning and taxi time prediction will directly affect the operation efficiency of the airport. However, current airport collaborative decision-making (A‑CDM) only uses airport average taxi time as the prediction taxi time of all aircraft in the airport. It neglects the factors such as stands, runway configuration, number of taxiing aircraft and weather, which leads to the low prediction accuracy of the aircraft taxi-out time and take-off time, resulting in flight delays and increased fuel-burn costs. Therefore, the accuracy of aircraft taxi time prediction plays an important role in optimizing flight pushback time and improving the efficiency of departure time sequence. At the same time, it can provide a theoretical reference for airlines to accurately calculate fuel-burn costs and reduce emissions.
The study of aircraft taxi time is based on the historical data of airport operation, using statistics and data mining algorithms to predict and analyze the taxi time of aircraft. Shumsk
From the literatures above, the following problems were discovered:
(1) The above studies rarely involve the impact of weather and runway configuration changes on taxi time prediction, leading to the problem of reduced prediction accuracy of the model in the event of changes in the external environment such as weather and runway configuration. Airport ground operations are susceptible to severe weather, major activities, and other factors, which can cause a sudden drop in ground operation efficiency. However, many studies or simulation models focus on ground operation efficiency under normal operation, and lack of research under abnormal operating conditions.
(2) Traditional machine learning models are not suitable for the model fitting problem with too many feature samples, which may decrease the converge speed of the model and calculation efficiency.
Therefore, according to the above two problems, we propose a spatial-temporal-environment deep learning (STEDL) model that overcomes the drawbacks of the existing machine learning methods. The proposed model includes the actual capacity of airport surface, the number of taxiing aircraft, weather, taxi distance, and other factors, and joints the convolutional neural network and deep neural network model to predict the departure aircraft taxi time. It not only balances the generalization and representation abilities in one model but also improves the convergence speed of the model and the calculation efficiency.
Combining the research purpose of this article and the actual operation of Chinese airports, we define the number of taxiing aircraft, airport capacity, airport runway configuration, and air traffic control information.
The A-CDM’s definition of taxi time is as follows:
Due to the complex layout of the taxiway and runways in busy airports, the landing and take-off aircraft have to share some taxiways, which induces the conflict and affects the taxi time of aircraft
① The value counts the number of landing taxiing aircraft if the landing time of the aircraft is after the departure aircraft ’s block-out time and before the time of aircraft ’s departure time, as shown in Eq.(3).
② The value counts the number of departure taxiing aircraft if the block-out time of the departure aircraft is later than the block-out time of the aircraft and earlier than the departure time of the aircraft , as shown in Eq.(4).
(3) |
(4) |
Correlation analysis is conducted out on traffic flow (average) and taxi time (average), and the correlation coefficient is 0.43. At the same time, the correlation between the number of taxiing aircraft and the taxi time is analyzed, and the correlation is 0.62, indicating that although the traffic flow has an impact on the taxi time, the effect is not as large as the number of surface taxiing aircraft. Therefore, the number of taxiing aircraft is used to express the traffic flow.
Airport capacity is defined as the sum of the number of aircraft take-off and landing within n min before the estimate take-off time() of the departure aircraft i, as shown in Eq.(5). The value counts the number of departure aircraft if take-off time() of the aircraft is n min earlier than the estimate take-off time of the aircraft . The value counts the number of landing aircraft if the arrival time() of the aircraft is n min earlier than the estimate take-off time of the aircraft . The n is derived from the average taxi time at the airport.
(5) |
where
Traffic flow varies with time, the taxi time will be different in different time periods. With reference to Clewlow’s metho
In the actual operation of the airport, the taxi path is not the shortest path between the stand and the runway entrance. First, the automatic dependent surveillance-broadcast (ADS-B) monitoring data was analyzed to obtain the taxi path of the aircraft from the stand to the take-off runway. Then, statistics were made to take the most frequently used taxi path between each stand and the runway as the mainstream taxi pat
According to the definition and classification of severe weather in the civil aviation meteorological forecast specifications and the operation of Hong Kong Airport, thunderstorms, tropical cyclones, typhoons, advection fog, and heavy precipitation are classified as bad weather in this paper. At the same time, the visibility of the airport, the wind direction, and wind speed directly affect the aircraft’s operating speed and waiting time outside the runway, and the taxi time. The meteorological report of aerodrome conditions (METAR) every hour by the Air Traffic Management Bureau Meteorological Center is as the data source. The numerical variables are wind direction (WD), wind speed (WS), visibility (VIS), and cloud ceiling (CC). The dummy variables are used to describe the overall weather conditions. If it is severe weather, it is set to 1; otherwise, it is 0.
Air traffic flow control is a type of external environment restriction. If the amount of flights entering or leaving the airspace sector is too large or the air route cannot meet the required flow due to weather conditions, the air traffic control management center will release the flow control information. The type of flow control information release is mainly the airspace flow control. If the flow control information is released, the aircraft flying in this airspace will be delayed. The dummy variable is used to describe the flow control information. If the aircraft is affected by flow control, it is set to 1; otherwise, it is 0.
The runway configuration is mainly determined by the wind direction and air traffic flow of the airport at the current time. The change of the runway configuration includes the change of runway operation direction and multi-runway combination take-off and landing modes. Different runway configurations may result in different taxi time. Runway configuration is described in the form of “A1, A2 | D1, D2”, where A1 and A2 are landing runways, and D1 and D2 are take⁃off runways. For example, “07R | 07L” indicates that the runway 07R is used as the landing runway and 07L runway is used as the take-off runway during this period. In actual operation, three kinds of runway configuration are used. The dummy variable is used to describe the runway configuration. If the corresponding runway configuration is used during takeoff, it is set to 1; otherwise, it is 0.
Although the taxi distance of some flights is the same, the taxiing speed and holding short of runway time may be different due to the influence of aircraft departure wak
STEDL model is divided into three sub-models, which are composed of a spatial-temporal model and environment model based on two convolutional neural networks, and a fully-connected spatial model. Based on these three sub-models, the predicted weights are obtained, and the prediction results of the three models are fused to obtain the predicted value of the final departure aircraft taxi time.
The input variables are composed of the number of taxiing aircraft ,airport capacity, and traffic flow of different time(). The main structure of the model is convolutional neural network (CNN). The model framework is as follows:. The model definition is .
The input variable is the taxiing distance . The model structure is fully-connection. The model framework is as follows: .
The input variable are composed of airport weather, runway configuration, aircraft category, and air traffic control. The main structure of the model is CNN. The model framework is as follows:
As shown in
(6) |

Fig.1 Model of STEDL
Since the model still solves the regression problem, the mean square error (MSE) is taken as the loss function of the model.
(7) |
In STEDL model, its CNN sub-model consists of two convolutional layers, a pooling layer, and a fully connected layer. The pooling layer uses maxpolling. The use of Maxpolling makes the model shield the unimportant parameters while maintaining the data characteristics, and solves the problem of excessive model data redundancy. The sub-sampling window value of the pooling layer is set to 2 to reduce the original data length to half of the origin. In terms of convolution layer settings, the research data in this paper is discrete and is not sensitive to periodic changes in time. Therefore, the horizontal sliding value and vertical sliding value of the two convolutional layers of the sub-model are set to 1, and when performing the convolution operation, padding of all 0 s of same convolution type is used. In terms of activation function, we use the Relu function (Eq.(8)) as the activation function of the sub-model, which can avoid the problem of gradient explosion and disappearance of the model.
(8) |
The three sub-models analyze spatial-temporal characteristics, external environmental characteristics, and spatial characteristics of the model, which can show the effects of spatial-temporal correlation, environmental differences, and spatial changes in the taxi time of the departure aircraft.
This data was based on observations at Hong Kong Airport from 1 July,2019 to 10 February,2020. The period of interest was from 0:00 to 24:00. The data information consisted of flight number, aircraft type, parking stand, block-in time, block-out time, runway configuration, take‑off time, and landing time. Among them, the data affected by extreme conditions such as strong convection weather and typhoons accounted for 5% of the total data. The specific information including departure aircraft is shown in
After data cleaning, a total of 77 360 valid data were obtained.
Figs.

Fig.2 Frequency chart of departure taxi-out time

Fig.3 Residual analysis plot
STEDL model was implemented in Python and run by using the TensorFlow framework. The model used Adam as the optimization parameter, the activation function was the Relu function, the number of training iteration was 1 000, and the learning rate was 0.01.
To assess the overall model fit, we assessed three indices, including R-square (
According to

Fig.4 Plot of prediction vs. actual value
In order to study the impact of the three sub-models on taxi time, three sub-models are used to predict the taxi time, as shown in
According to MSE, the time-flow sub-model error is only 0.27, the external environmental sub-model error is 0.71, and the spatial sub-model error is the largest, 0.73. The
In order to investigate the prediction effect of the STEDL model on the taxi-out time in different weather conditions, the test samples are classified according to weather types. Thunderstorms, heavy precipitation, fog (with visibility less than 1 km), and typhoons (including three days before and after transit) are classified as severe weather, and the rest are classified as normal weather. The three days of 19 April, 31 July, and 1 August, 2019 are as severe weather samples. The day of 19 April, 2019 is a severe thunderstorm and strong precipitation, and the days of 31 July and 1 August, 2019 are typhoon weather. The comparison of the taxi time in different weather is shown in
From

Fig.5 Different weather forecast results
To improve the accurate calculation of flight departure and delay time, a deep learning model (STEDL) based on time-space-environment data is proposed to predict the taxi time of the departure aircraft. Some conclusions can be drawn as follows.
(1) The STEDL model can effectively reflect the impact of airport surface space attributes, environmental changes and surface traffic flow changes on aircraft taxi time, but the most important factor affecting aircraft taxi time is still the change in traffic flow.
(2) The accuracy of the STEDL model for taxi time prediction of departure aircraft is 95.4%. Its model-fitting prediction capability is higher than other machine learning algorithms such as SVM and RF, and it can be used to predict the actual taxi time of large airports.
(3) Severe weather such as strong thunderstorms and typhoons have a great impact on the taxi time of the aircraft and exhibit lagging and continuity characteristics.
(4) Due to the limitation of data acquisition, only the taxi time of Hong Kong Airport is analyzed. It is planned to add other large airports to the proposed model for prediction and comparison in the future to improve the universality of the model.
(5) Comparing the prediction results of the STEDL with the research results of other scholars. It is found that the fitting value (= 0.90) after using the STEDL model is higher than using the traditional machine learning mode
Contributions Statement
Ms. LI Nan designed the study, provided the cases and idea, conducted the analysis. Mr. JIAO Qingyu complied the models and wrote the manuscript. Dr. ZHU Xinhua contributed to the discussion and background of the study. Mr. WANG Shaocong contributed to the data collection and data analysis. All authors commented on the manuscript draft and approved the submission.
Acknowledgements
This work was supported by the National Natural Science Foundation of China (Nos.U1833103,71801215); the China Civil Aviation Environment and Sustainable Development Research Center Open Fund(No.CESCA2019Y04).
Conflict of Interest
The authors declare no competing interests.
References
SHUMSKY R A. Real-time forecasts of aircraft departure queues[J]. Air Traffic Control Quart, 1997, 5(4): 281-308. [百度学术]
IDRIS H, CLARKE J P, BHUVA R, et al. Queuing model for taxi-out time estimation[J]. Air Traffic Control Quarterly, 2007, 10(1): 1-22. [百度学术]
FLORIS H, RICHARD C, HENDRIKUS V, et al. Taxi-out time prediction model at Charles de Gaulle Airport[J]. Journal of Aerospace Information Systems, 2018, 15(3): 1-11. [百度学术]
CLEWLOW R L, SIMAIAKIS I, BALAKRISHNAN H. Impact of arrivals on departure taxi operations at airports[C]//Proceedings of AIAA Guidance, Navigation, and Control Conference. Toronto, Ontario Canada: AIAA, 2010: 1-21. [百度学术]
RAVIZZA S, ATKIN J A D, MAATHUIS M H, et al. A combined statistical approach and ground movement model for improving taxi time estimations at airports[J]. Journal of the Operational Research Society, 2013, 64(9): 1347-1360. [百度学术]
CHEN J, RAVIZZA S, ATKIN J A D, et al. Aircraft taxi time prediction: Comparisons and insights[J]. Applied Soft Computing Journal, 2014, 14(1): 397-406. [百度学术]
LEE H, MALIK W A, JUNG Y C. Taxi-out time prediction for departures at Charlotte Airport using machine learning techniques[C]//Proceedings of the 16th AIAA Aviation Technology, Integration, and Operations Conference. [S.l.]: AIAA, 2016. [百度学术]
ZHANG Y. Methods for determining airport unimpeded taxi times[C]//Proceedings of Transportation Research Board 90th Annual Meeting. Washington DC: Transportation Research Board, 2011: 1-19. [百度学术]
ZHANG Y, WANG Q. Methods for determining unimpeded aircraft taxi time and evaluating airport taxiing performance[J]. Chinese Journal of Aeronautics, 2017, 30(2): 523-537. [百度学术]
DIANA T. Can machines learn how to forecast taxi-out time? A comparison of predictive models applied to the case of Seattle/Tacoma International Airport[J]. Transportation Research Part E-Logistics and Transportation Review, 2018,119: 149-164. [百度学术]
LIU J, YIN M. Study on the influencing factors of departure aircraft taxi time[J]. Journal of Wuhan University of Technology, 2018, 42(2): 195-200. [百度学术]
YAO M F. Research on key techniques of aircraft surface trajectory prediction and path planning in airport [D]. Chengdu: University of Electronic Science and Technology of China, 2018. (in Chinese) [百度学术]
LIANG G, ZHANG Y P, XING Z W, et al. A new dynamic pushback control method for reducing fuel-burn costs: Using predicted taxi-out time[J]. Chinese Journal of Aeronautics, 2019, 32(3): 660-673. [百度学术]
YIN J N, HU M H, MA Y, et al. Airport taxi situation awareness with a macroscopic distribution network analysis[J]. Networks and Spatial Economics, 2019,19: 669-695. [百度学术]
XIA Z, ZHENG B, WAN J, et al. Recognition algorithm and risk assessment of airport hotspots[J]. Journal of Shanghai Jiaotong University(Science), 2019, 24(10): 769-774. [百度学术]
LI N, LIU P, JING H H. Research on aircraft speed anomaly detection in maneuvering area[J]. Computer Simulation, 2019, 36(1): 45-50. [百度学术]
Kim S H, FERON E, CLARKE J P, et al. Airport gate scheduling for passengers, aircraft, and operation[J]. Journal of Air Transportation, 2017, 25(4): 109-114. [百度学术]
LEE H, COUPE J, JUNG Y C. Prediction of pushback times and ramp taxi times for departures at Charlotte Airport[C]//AIAA Aviation 2019 Forum. Dallas: AIAA, 2019(2933): 1-13. [百度学术]
LIU T C. Hong Kong aviation safety program[R]. [S.l.]: Hong Kong Aviation Department,2018: 3-4. [百度学术]
LIU T C. Controlling officer’s reply[R]. [S.l.]: Hong Kong Aviation Department, 2019: 2-11. [百度学术]