Lei Ren, Lingna Yang, Yaqi Wang, Peng Yao, Jun Wei, Fan Yang, Fearghal O’ Donncha. Forecasting of surface current velocities using ensemble machine learning algorithms for the Guangdong-Hong Kong-Macao Greater Bay Area based on the High Frequency radar data[J]. Acta Oceanologica Sinica. doi: 10.1007/s13131-024-2363-2
Citation:
Lei Ren, Lingna Yang, Yaqi Wang, Peng Yao, Jun Wei, Fan Yang, Fearghal O’ Donncha. Forecasting of surface current velocities using ensemble machine learning algorithms for the Guangdong-Hong Kong-Macao Greater Bay Area based on the High Frequency radar data[J]. Acta Oceanologica Sinica. doi: 10.1007/s13131-024-2363-2
Lei Ren, Lingna Yang, Yaqi Wang, Peng Yao, Jun Wei, Fan Yang, Fearghal O’ Donncha. Forecasting of surface current velocities using ensemble machine learning algorithms for the Guangdong-Hong Kong-Macao Greater Bay Area based on the High Frequency radar data[J]. Acta Oceanologica Sinica. doi: 10.1007/s13131-024-2363-2
Citation:
Lei Ren, Lingna Yang, Yaqi Wang, Peng Yao, Jun Wei, Fan Yang, Fearghal O’ Donncha. Forecasting of surface current velocities using ensemble machine learning algorithms for the Guangdong-Hong Kong-Macao Greater Bay Area based on the High Frequency radar data[J]. Acta Oceanologica Sinica. doi: 10.1007/s13131-024-2363-2
Forecasting of surface current velocities using ensemble machine learning algorithms for the Guangdong-Hong Kong-Macao Greater Bay Area based on the High Frequency radar data
School of Ocean Engineering and Technology, Sun Yat-sen University, China, and Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Zhuhai 519082 China
2.
Key Laboratory of Comprehensive Observation of Polar Environment (Sun Yat-sen University), Ministry of Education, Zhuhai 519082 China
3.
The National Key Laboratory of Water Disaster Prevention, Hohai University, Nanjing 210024 China
4.
School of Atmospheric Sciences, Sun Yat-sen University, Zhuhai 519082 China
5.
Zhuhai Marine Environmental Monitoring Central Station of the State Oceanic Administration, Zhuhai 519082 China
6.
International Business Machines Corporation (IBM) Research, Dublin D15 HN66 Ireland
Funds:
The Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai) under contract No. SML2020SP009; the National Basic Research and Development Program of China under contract Nos 2022YFF0802000 and 2022YFF0802004; the “Renowned Overseas Professors” Project of Guangdong Provincial Department of Science and Technology under contract No. 76170-52910004; the Belt and Road Special Foundation of the National Key Laboratory of Water Disaster Prevention under contract No. 2022491711; the National Natural Science Foundation of China under contract No. 51909290; the Key Research and Development Program of Guangdong Province under contract No. 2020B1111020003.
Forecasting of ocean currents is critical for both marine meteorological research and ocean engineering and construction. Timely and accurate forecasting of coastal current velocities offers a scientific foundation and decision support for multiple practices such as search and rescue, disaster avoidance and remediation, and offshore construction. This research established a framework to generate short-term surface current forecasts based on ensemble machine learning trained on High Frequency radar observation. Results indicate that an ensemble algorithm that used random forests to filter forecasting features by weighting them, and then used the AdaBoost method to forecast can significantly reduce the model training time, while ensuring the model forecasting effectiveness, with great economic benefits. Model accuracy is a function of surface current variability and the forecasting horizon. In order to improve the forecasting capability and accuracy of the model, the model structure of the ensemble algorithm was optimized, and the random forest algorithm was used to dynamically select model features. The results show that the error variation of the optimized surface current forecasting model has a more regular error variation, and the importance of the features varies with the forecasting time-step. At ten-step ahead forecasting horizon the model reported RMSE, MAE and correlation coefficient by 2.84 cm/s, 2.02 cm/s, and 0.96, respectively. The model error is affected by factors such as topography, boundaries, and geometric accuracy of the observation system. This paper demonstrates the potential of ensemble-based machine learning algorithm to improve forecasting of ocean currents.
Figure 1. Deployment location of HFR stations and observational area. SC: Shangchuan radar station, WS: Wanshan radar station.
Figure 2. Flowchart of random forest algorithm.
Figure 3. Representative points P1–P5 and sub-areas Z1–Z8. Observation points with data acquisition rate greater than 90%.
Figure 4. Relative importance of main features in the transition model for single time-step forecasting for surface current zonal component at point P1 (partial).
Figure 5. Surface current synthetic vectors during June 27, 2021 at P1.a. HFR data and b. application model.
Figure 6. Flow chart for single time-step forecasting model of surface current fields.
Figure 7. Surface current fields at 02:10 June 24, 2021. a. Model results and b. HFR data.
Figure 8. Rolling forecasting of application model.
Figure 9. RMSE values of surface velocity components for multi time-step forecasting at points P1–P5. a. Zonal component and b. meridional component.
Figure 10. Comparison of surface current vector fields. Model: the reconstruction model; HFR: observation. All times are in UTC+8.
Figure 11. Absolute errors of surface velocity components between forecasting and HFR data under different forecasting windows. Left subfigures are the zonal component; right subfigures are the meridional component.