A hybrid signal decomposition-machine learning benchmarking framework for multi-station precipitation prediction in the Kébir Rhumel basin (Algeria)


Erol A., Rehamnia I., Citakoglu H.

Acta Geophysica, cilt.74, ss.1-40, 2026 (SCI-Expanded, Scopus)

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 74
  • Basım Tarihi: 2026
  • Doi Numarası: 10.1007/s11600-026-01888-3
  • Dergi Adı: Acta Geophysica
  • Derginin Tarandığı İndeksler: Scopus, Science Citation Index Expanded (SCI-EXPANDED), Compendex, Geobase, INSPEC
  • Sayfa Sayıları: ss.1-40
  • Erciyes Üniversitesi Adresli: Evet

Özet

Reliable multi-station precipitation forecasting is challenging due to nonstationarity, noise, and spatial heterogeneity. This paper introduces a hybrid signal decomposition-machine learning benchmarking framework that integrates four decomposition methods (TQWT, MODWT, EWT, VMD) with three learners (Bagging, LSBoost, KNN), yielding twelve hybrid models. These models were rigorously tested across twelve stations in the Kébir Rhumel Basin using eight statistical metrics and distributional diagnostics to assess accuracy, stability, and generalization. Two dominant families emerged: TQWT-based hybrids achieved localized accuracy at four stations, while MODWT-Bagging led at eight stations and delivered the most consistent cross-station performance. MODWT-Bagging achieved R2 = 0.984–0.993 and NSE = 0.981–0.993, with RMSE ranging from 2.64 to 6.34, demonstrating strong predictive skill under varying hydro-climatic conditions. In noise-rich environments, it substantially reduced errors; for example, at El Milia, RMSE dropped from 12.57 (VMD-LSBoost) to 6.03, a ≈ 52% reduction, and improvements of up to 63% were observed at other stations. Its superiority stems from MODWT’s shift-invariance and noise robustness combined with Bagging’s variance reduction. Taylor diagrams and violin plots confirmed centered, compact error structures, while scatter plots verified accurate phase and magnitude tracking. By clarifying how decomposition structure and learner characteristics interact across heterogeneous regimes, this framework fills a key gap in signal decomposition-machine learning model selection. The findings support adaptive hybrid design for early warning, water resource management, and precipitation-driven forecasting systems. Overall, MODWT-Bagging is established as a robust default for complex precipitation modeling, and the proposed framework provides a scalable foundation for next-generation hybrid predictive tools.