| Literature DB >> 34764615 |
Zhichao Zhao1,2, Jinguo You1,2, Guoyu Gan1,2, Xiaowu Li1,2, Jiaman Ding1,2.
Abstract
Airfare price prediction is one of the core facilities of the decision support system in civil aviation, which includes departure time, days of purchase in advance and flight airline. The traditional airfare price prediction system is limited by the nonlinear interrelationship of multiple factors and fails to deal with the impact of different time steps, resulting in low prediction accuracy. To address these challenges, this paper proposes a novel civil airline fare prediction system with a Multi-Attribute Dual-stage Attention (MADA) mechanism integrating different types of data extracted from the same dimension. In this method, the Seq2Seq model is used to add attention mechanisms to both the encoder and the decoder. The encoder attention mechanism extracts multi-attribute data from time series, which are optimized and filtered by the temporal attention mechanism in the decoder to capture the complex time dependence of the ticket price sequence. Extensive experiments with actual civil aviation data sets were performed, and the results suggested that MADA outperforms airfare prediction models based on the Auto-Regressive Integrated Moving Average (ARIMA), random forest, or deep learning models in MSE, RMSE, and MAE indicators. And from the results of a large amount of experimental data, it is proven that the prediction results of the MADA model proposed in this paper on different routes are at least 2.3% better than the other compared models.Entities:
Keywords: Attention mechanism; Civil airline fare prediction; LSTM; Time series
Year: 2021 PMID: 34764615 PMCID: PMC8331096 DOI: 10.1007/s10489-021-02602-0
Source DB: PubMed Journal: Appl Intell (Dordr) ISSN: 0924-669X Impact factor: 5.019
Summary of airline fare prediction models and correlation time series forecasting models
| Category | Ref. | Addressed problem | Method | Performance result |
|---|---|---|---|---|
| Tim et al. [ | Predict the lowest ticket price before departure | Linear quantile mixed regression model | Short-term performance is reasonable, but long-term performance is inefficient. | |
| Tziridis et al. [ | Find the optimal fare prediction model from the regression algorithm. | Eight regression machine learning models | Bagging Regression:87.42% accuracy and Random Forest Regression Tree:85.91%. | |
| Gordiievych et al. [ | Prediction whether the price of ticket drops in the future. | ARIMA | Not given | |
| Machine Learing | Wohlfarth et al. [ | Predict the best time to buy tickets | CART and RF | CART and RF should be used for preregistered purchase periods to give a first coarse advice to the customer. |
| Ren et al. [ | Predict the lowest ticket price before departure | Ensemble model that uses LR, Naive Bayes, Softmax Regression, and SVM. | The training error of Naive Bayes and Softmax Regression reduced to 24.88% and 20.22% .SVM is also reduced by approximately 1%. | |
| GuanGui et al. [ | Flight delay prediction | Ensemble model that relevant random tree models with deep learning | LSTM is capable of handling the obtained aviation sequence data, RF (90.2% for the binary classification) can overcome the overfitting problem. | |
| Guokun et al. [ | Multivariate time series prediction | Deep learning: CNN and RNN to extract short-term local dependency patterns among variables. | The results show that three of the four experimental data sets have the best performance. | |
| Shih et al. [ | Multivariate time series prediction | Deep learning: Attention mechanism for selecting important time series and multivariate forecasting using frequency domain information. | The proposed TPA-LSTM performs best in experiments. | |
| Deep Learning | Deng et al. [ | Prediction of flight passenger load factors. | Deep learning: CNN using multi-granularity temporal attention mechanism(MTA-RNN). | The proposed MTA-RNN performs best in experiments. |
| YaoQin et al. [ | Stock time series prediction | Deep learning: Dual-stage attention-based RNN. | The DA-RNN model has the best performance in the data set SML 2010 and NASDAQ 100 compared with other models. | |
| TongChen et al. [ | Forecast sales volume in real-life commercial scenario. | Deep learning: Dual-stage attention-based RNN trend alignment with dual-attention, multi-task RNNs for sales prediction. | The results show that TADA prediction result is the best. |
Input attributes
| Feature attribute | Description |
|---|---|
| Airln_cd | Airline |
| AirCrft_Typ | Aircraft type |
| Dpt_AirPt_Cd | Departure airfield |
| Arrv_Airpt_Cd | Arrival airport |
| Air_route | Route |
| Flt_nbr | Flight no. |
| Flt_Schd_Dpt_Tm | Flight take-off time |
| Weekday | Weekly attributes |
| Holiday | Holiday |
| Pax_Qty_y | Number of flights |
| Fare | Air ticket price |
Fig. 1The trend of ticket price changes for a specific flight
Fig. 2Trends of fare changes in the first quarter
Fig. 3Trends of fare changes in the second quarter
Fig. 4Trend of fare changes in the third quarter
Fig. 5Important factors affecting fares
Fig. 6MADA Model
Comparison of MADA and various models
| Model | MSE | RMSE | MAE |
|---|---|---|---|
| ARIMA [ | 0.286090 ± 0.003 | 0.534880 ± 0.04 | 0.461500 ± 0.004 |
| RF [ | 0.03893 ± 00.002 | 0.19730 ± 0.003 | 0.14092 ± 0.003 |
| XGBoost [ | 0.00973 ± 0.001 | 0.09865 ± 0.002 | 0.09365 ± 0.004 |
| LSTM-CNN [ | 0.08562 ± 0.003 | 0.29261 ± 0.003 | 0.16951 ± 0.005 |
| CNN-LSTM [ | 0.07019 ± 0.001 | 0.26493 ± 0.002 | 0.13326 ± 0.003 |
| CNN-LSTM + Attn [ | 0.07812 ± 0.001 | 0.27950 ± 0.003 | 0.12005 ± 0.004 |
| Seq2Seq [ | 0.00028 ± 0.0002 | 0.01663 ± 0.002 | 0.01161 ± 0.003 |
| Seq2Seq+Attn [ | 0.0011 ± 0.0002 | 0.03390 ± 0.005 | 0.02544 ± 0.005 |
Bold entries signify the model proposed in this article
Fig. 7Evaluation index of different models
Fig. 8Visual comparison of different models, where T represents the time width, and D represents the number of days predicted in the future
Fig. 9Visual comparison of different models, where T represents the time width, and D represents the number of days predicted in the future
Fig. 10Compare different hidden layer models
Comparison of the variant MADA models
| Model | RMSE | MSE | MAE |
|---|---|---|---|
| MADA_nEx | 0.00976 | 0.00010 | 0.00753 |
| MADA_nAttn_allData | 0.02422 | 0.00059 | 0.01768 |
| MADA_sAttn_allData | 0.01771 | 0.00031 | 0.0146 |
Bold entries signify the model proposed in this article