Numerical and Experimental Investigation of Meteorological Data Using Adaptive Linear M5 Model Tree for the Prediction of Rainfall

Authors

DOI:

https://doi.org/10.18488/76.v9i1.2961

Abstract

Predicting a class with a continuous numeric value encounters many problems when applying machine learning to the data. Only a few machine-learning techniques can do this, but it is still considered one of the most complex tasks to perform. In this study, we demonstrate one of the techniques called the M5 Model Tree, which can handle continuous numeric data. This technique is a stepwise algorithm and uses linear functions at the leaf nodes of any decision tree inducer (like CART) constructed. These M5 model trees generate simple practical formulas like standard deviation (SD), standard deviation reduction (SDR), cost-complexity pruning (CCP), etc., which can be easily applied by another user to some other benchmark data. This work assesses the abilities of the M5 Model Tree algorithm for the assessment of rainfall data across the Kashmir province of the Union Territory of Jammu & Kashmir, India. The construction of the M5 model tree developed using (70–30) % training and test ratio, respectively, was considered one of the best fit models, predicting an RMSE of 2.593, an MAE of 1.68, and a correlation coefficient (R2) of 0.478. Moreover, M5 model trees use a small number of trails to develop the models and thus need less computational time and are therefore more convenient to use.

Keywords:

Linear regression, Meteorological data, M5 model tree, Smoothing, Splitting nodes, Linear model functions.

Abstract Video

Downloads

Crossref
Scopus
27
Partani S. (2024)
Enhancing nutrient absorption through the influence of mangrove ecosystem on flow rate and retention time in salt marshes. Science of the Total Environment, 924,
10.1016/j.scitotenv.2024.171518
Manafian J. (2024)
New solutions to a generalized fifth-order KdV like equation with prime number p=3 via a generalized bilinear differential operator. Partial Differential Equations in Applied Mathematics, 9,
10.1016/j.padiff.2023.100600
Farahmand Nejad M. (2024)
A novel linear algebra-based method for complex interval linear systems in circuit analysis. Heliyon, 10(4),
10.1016/j.heliyon.2024.e25786
Kheimi M. (2024)
Data-driven approaches for estimation of sediment discharge in rivers. Earth Science Informatics, 17(1), 761-781.
10.1007/s12145-023-01191-5
Sureh F.S. (2024)
Meteorological Drought Assessment and Prediction in Association with Combination of Atmospheric Circulations and Meteorological Parameters via Rule Based Models. Tarim Bilimleri Dergisi, 30(1), 61-78.
10.15832/ankutbd.1067486
Modaresi F. (2024)
A novel approach to predictor selection among large-scale climate indices for seasonal rainfall forecasting in small catchments. Hydrological Sciences Journal, 69(4), 488-505.
10.1080/02626667.2024.2313572
Bandhu D. (2024)
Recycling of agro-industrial waste by fabricating laminated Al-metal matrix composites: a numerical simulation and experimental study. International Journal on Interactive Design and Manufacturing,
10.1007/s12008-024-01759-5
Chen G. (2024)
Analysis of the effect of rainfall center location on the flash flood process at the small basin scale. Journal of Water and Climate Change, 15(2), 652-668.
10.2166/wcc.2023.526
Husein I. (2024)
Predictive Equations for Estimation of the Slump of Concrete Using GEP and MARS Methods. Journal of Soft Computing in Civil Engineering, 8(2), 1-18.
10.22115/SCCE.2023.389726.1619
Latif S.D. (2023)
Assessing rainfall prediction models: Exploring the advantages of machine learning and remote sensing approaches. Alexandria Engineering Journal, 82, 16-25.
10.1016/j.aej.2023.09.060
Gu Y. (2023)
New soliton waves and modulation instability analysis for a metamaterials model via the integration schemes. International Journal of Nonlinear Sciences and Numerical Simulation, 24(4), 1493-1519.
10.1515/ijnsns-2021-0443
Alizadeh S.M. (2023)
Application of soft computing and statistical methods to predict rock mass permeability. Soft Computing, 27(9), 5831-5853.
10.1007/s00500-022-07586-8
Kaul N. (2023)
Analogical Study of Activation Concept in Neural Networks with Neat- Python Module. Revue d'Intelligence Artificielle, 37(2), 249-256.
10.18280/ria.370201
Fayaz S.A. (2023)
How Machine Learning is Redefining Agricultural Sciences: An Approach to Predict Apple Crop Production of Kashmir Province. Revue d'Intelligence Artificielle, 37(2), 501-507.
10.18280/ria.370227
Fayaz S.A. (2023)
Tree-Based Approach’s to Mitigate the Heterogeneity Concerns among Different file Systems: A Possible Solution. Revue d'Intelligence Artificielle, 37(1), 231-237.
10.18280/ria.370129
Saravani M.J. (2023)
Investigating the Accuracy of Hybrid Models with Wavelet Transform in the Forecast of Watershed Runoff. Journal of Water Management Modeling, 31,
10.14796/JWMM.C499
Mir A.Y. (2022)
An Adaptive Classification Framework for Handling the Cold Start Problem in Case of News Items. Revue d'Intelligence Artificielle, 36(6), 889-896.
10.18280/ria.360609
Gu Y. (2022)
Variety interaction between k-lump and k-kink solutions for the (3+1)-D Burger system by bilinear analysis. Results in Physics, 43,
10.1016/j.rinp.2022.106032
Pan Y. (2022)
N-Lump Solutions to a (3+1)-Dimensional Variable-Coefficient Generalized Nonlinear Wave Equation in a Liquid with Gas Bubbles. Qualitative Theory of Dynamical Systems, 21(4),
10.1007/s12346-022-00658-y
Banday I.R. (2022)
Big Data in Academia: A Proposed Framework for Improving Students Performance. Revue d'Intelligence Artificielle, 36(4), 589-595.
10.18280/ria.360411
Fayaz S.A. (2022)
How M5 Model Trees (M5-MT) on Continuous Data Are Used in Rainfall Prediction: An Experimental Evaluation. Revue d'Intelligence Artificielle, 36(3), 409-415.
10.18280/ria.360308
Altaf I. (2022)
HARD VOTING META CLASSIFIER FOR DISEASE DIAGNOSIS USING MEAN DECREASE IN IMPURITY FOR TREE MODELS. Review of Computer Engineering Research, 9(2), 71-82.
10.18488/76.v9i2.3037
Fayaz S.A. (2022)
An Adaptive Gradient Boosting Model for the Prediction of Rainfall Using ID3 as a Base Estimator. Revue d'Intelligence Artificielle, 36(2), 241-250.
10.18280/ria.360208
Fayaz S.A. (2022)
HOW MACHINE LEARNING ALGORITHMS ARE USED IN METEOROLOGICAL DATA CLASSIFICATION: A COMPARATIVE APPROACH BETWEEN DT, LMT, M5-MT, GRADIENT BOOSTING AND GWLM-NARX MODELS. Applied Computer Science, 18(4), 16-27.
10.35784/acs-2022-26
Fayaz S.A. (2022)
A Super Ensembled and Traditional Models for the Prediction of Rainfall: An Experimental Evaluation of DT Versus DDT Versus RF. Lecture Notes in Networks and Systems, 461, 619-635.
10.1007/978-981-19-2130-8_48

Published

2022-04-13

How to Cite

Fayaz, S. A. ., Zaman, M. ., & Butt, M. A. . (2022). Numerical and Experimental Investigation of Meteorological Data Using Adaptive Linear M5 Model Tree for the Prediction of Rainfall . Review of Computer Engineering Research, 9(1), 1–12. https://doi.org/10.18488/76.v9i1.2961

Issue

Section

Articles