Unsupervised Identification of Post-Rainfall Behavior in a Medium Reservoir Using Feature-Engineered Clustering
Received: 6 October 2025 | Revised: 31 October 2025 | Accepted: 15 November 2025 | Online: 9 February 2026
Corresponding author: Anyawee Chiwachirakhamporn
Abstract
Understanding reservoir behavior after rainfall is essential for effective water management, especially in medium-sized reservoirs with limited data. This study uses a daily time-series dataset comprising 2,922 days of observations from 2014 to 2023 and proposes a novel unsupervised learning approach to identify operational patterns during post-rainfall periods, which are critical for deciding whether to store or release water. The method integrates rainfall-regime segmentation with feature engineering to emphasize hydrologically significant periods. Real-world data from a medium-sized reservoir were analyzed, focusing on inflow, storage, outflow, and usable water variables. A post-rainfall subset was derived to capture the system's recovery behavior following rainfall. Clustering of these post-rainfall days revealed three distinct operational modes: (1) flood response with high inflow and release, (2) recharge with moderate inflow and retention, and (3) idle or drought with minimal activity. Internal validation demonstrated strong performance (Silhouette coefficient = 0.565, Davies–Bouldin Index (DBI) = 0.550), outperforming traditional approaches. These findings highlight the value of context-aware clustering in revealing interpretable, operation-relevant patterns, providing a practical decision-support tool for medium reservoirs under data limitations.
Keywords:
reservoir operation, unsupervised learning, rainfall regime, clustering analysis, feature engineeringDownloads
References
J. W. Labadie, "Optimal Operation of Multireservoir Systems: State-of-the-Art Review," Journal of Water Resources Planning and Management, vol. 130, no. 2, pp. 93–111, Mar. 2004. DOI: https://doi.org/10.1061/(ASCE)0733-9496(2004)130:2(93)
J. Zhang, X. Cai, X. Lei, P. Liu, and H. Wang, "Real-time reservoir flood control operation enhanced by data assimilation," Journal of Hydrology, vol. 598, July 2021, Art. no. 126426. DOI: https://doi.org/10.1016/j.jhydrol.2021.126426
D. Li, Y. Chen, L. Lyu, and X. Cai, "Uncovering Historical Reservoir Operation Rules and Patterns: Insights From 452 Large Reservoirs in the Contiguous United States," Water Resources Research, vol. 60, no. 8, Aug. 2024, Art. no. e2023WR036686. DOI: https://doi.org/10.1029/2023WR036686
W. W.-G. Yeh, "Reservoir Management and Operations Models: A State-of-the-Art Review," Water Resources Research, vol. 21, no. 12, pp. 1797–1818, Dec. 1985. DOI: https://doi.org/10.1029/WR021i012p01797
E. Aytaç, "Unsupervised learning approach in defining the similarity of catchments: Hydrological response unit based k-means clustering, a demonstration on Western Black Sea Region of Turkey," International Soil and Water Conservation Research, vol. 8, no. 3, pp. 321–331, Sept. 2020. DOI: https://doi.org/10.1016/j.iswcr.2020.05.002
A. Wunsch, T. Liesch, and S. Broda, "Feature-based Groundwater Hydrograph Clustering Using Unsupervised Self-Organizing Map-Ensembles," Water Resources Management, vol. 36, no. 1, pp. 39–54, Jan. 2022. DOI: https://doi.org/10.1007/s11269-021-03006-y
D. Li, G. Sang, W. Liu, Y. Liu, and M. Zhao, "Cluster analysis-based hydrological similarity assessment in small watersheds of Shandong Province’s hilly areas," Hydrology Research, vol. 56, no. 2, pp. 93–107, Dec. 2024. DOI: https://doi.org/10.2166/nh.2024.017
I. Ebtehaj, H. Bonakdari, M. Zeynoddin, B. Gharabaghi, and A. Azari, "Evaluation of preprocessing techniques for improving the accuracy of stochastic rainfall forecast models," International Journal of Environmental Science and Technology, vol. 17, no. 1, pp. 505–524, Jan. 2020. DOI: https://doi.org/10.1007/s13762-019-02361-z
Y. Gao, M. Taie Semiromi, and C. Merz, "Efficacy of statistical algorithms in imputing missing data of streamflow discharge imparted with variegated variances and seasonalities," Environmental Earth Sciences, vol. 82, no. 20, Sept. 2023, Art. no. 476. DOI: https://doi.org/10.1007/s12665-023-11139-z
V. Sharma and K. Yuden, "Imputing Missing Data in Hydrology using Machine Learning Models," International Journal of Engineering Research & Technology, vol. 10, no. 1, pp. 78–82, Jan. 2021. DOI: https://doi.org/10.17577/IJERTV10IS010011
A. H. Blasi, M. A. Abbadi, and R. Al-Huweimel, "Machine Learning Approach for an Automatic Irrigation System in Southern Jordan Valley," Engineering, Technology & Applied Science Research, vol. 11, no. 1, pp. 6609–6613, Feb. 2021. DOI: https://doi.org/10.48084/etasr.3944
A. E. Sikorska-Senoner, "Clustering model responses in the frequency space for improved simulation-based flood risk studies: The role of a cluster number," Journal of Flood Risk Management, vol. 15, no. 1, Mar. 2022, Art. no. e12772. DOI: https://doi.org/10.1111/jfr3.12772
F. Ghobadi and D. Kang, "Application of Machine Learning in Water Resources Management: A Systematic Literature Review," Water, vol. 15, no. 4, Feb. 2023, Art. no. 620. DOI: https://doi.org/10.3390/w15040620
S. Lee et al., "Clustering of Time Series Water Quality Data Using Dynamic Time Warping: A Case Study from the Bukhan River Water Quality Monitoring Network," Water, vol. 12, no. 9, Sept. 2020, Art. no. 2411. DOI: https://doi.org/10.3390/w12092411
C.-M. Forke and M. Tropmann-Frick, "Feature Engineering Techniques and Spatio-Temporal Data Processing," Datenbank-Spektrum, vol. 21, no. 3, pp. 237–244, Nov. 2021. DOI: https://doi.org/10.1007/s13222-021-00391-x
G. Papacharalampous et al., "Global-scale massive feature extraction from monthly hydroclimatic time series: Statistical characterizations, spatial patterns and hydrological similarity," Science of The Total Environment, vol. 767, May 2021, Art. no. 144612. DOI: https://doi.org/10.1016/j.scitotenv.2020.144612
A. M. Rushworth, A. W. Bowman, M. J. Brewer, and S. J. Langan, "Distributed Lag Models for Hydrological Data," Biometrics, vol. 69, no. 2, pp. 537–544, June 2013. DOI: https://doi.org/10.1111/biom.12008
G. Papacharalampous, H. Tyralis, Y. Markonis, and M. Hanel, "Hydroclimatic time series features at multiple time scales," Journal of Hydrology, vol. 618, Mar. 2023, Art. no. 129160. DOI: https://doi.org/10.1016/j.jhydrol.2023.129160
M. Sit, B. Z. Demiray, Z. Xiang, G. J. Ewing, Y. Sermet, and I. Demir, "A comprehensive review of deep learning applications in hydrology and water resources," Water Science and Technology, vol. 82, no. 12, pp. 2635–2670, Aug. 2020. DOI: https://doi.org/10.2166/wst.2020.369
S. Shamshirband et al., "Clustering project management for drought regions determination: A case study in Serbia," Agricultural and Forest Meteorology, vol. 200, pp. 57–65, Jan. 2015. DOI: https://doi.org/10.1016/j.agrformet.2014.09.020
P. Yang, Z. Xiao, J. Yang, and H. Liu, "Characteristics of clustering extreme drought events in China during 1961–2010," Acta Meteorologica Sinica, vol. 27, no. 2, pp. 186–198, Apr. 2013. DOI: https://doi.org/10.1007/s13351-013-0204-x
X. Wang, K. Smith, and R. Hyndman, "Characteristic-Based Clustering for Time Series Data," Data Mining and Knowledge Discovery, vol. 13, no. 3, pp. 335–364, Nov. 2006. DOI: https://doi.org/10.1007/s10618-005-0039-x
E. G. D. Barros, F. K. Yap, E. Insuasty, P. M. J. van den Hof, and J. D. Jansen, "Clustering Techniques for Value-of-information Assessment in Closed-loop Reservoir Management," in ECMOR XV - 15th European Conference on the Mathematics of Oil Recovery, Amsterdam, Netherlands, 2016. DOI: https://doi.org/10.3997/2214-4609.201601858
T. Lan et al., "A Clustering Preprocessing Framework for the Subannual Calibration of a Hydrological Model Considering Climate-Land Surface Variations," Water Resources Research, vol. 54, no. 12, pp. 10034-10052, Dec. 2018. DOI: https://doi.org/10.1029/2018WR023160
P. Berkhin, "A Survey of Clustering Data Mining Techniques," in Grouping Multidimensional Data: Recent Advances in Clustering, J. Kogan, C. Nicholas, and M. Teboulle, Eds. Berlin, Heidelberg, Germany: Springer, 2006, pp. 25–71. DOI: https://doi.org/10.1007/3-540-28349-8_2
Z. Hao et al., "GRDL: A New Global Reservoir Area-Storage-Depth Data Set Derived Through Deep Learning-Based Bathymetry Reconstruction," Water Resources Research, vol. 60, no. 1, Jan. 2024, Art. no. e2023WR035781. DOI: https://doi.org/10.1029/2023WR035781
X. Jia et al., "Physics-Guided Machine Learning for Scientific Discovery: An Application in Simulating Lake Temperature Profiles," ACM/IMS Transactions on Data Science, vol. 2, no. 3, May 2021, Art. no. 20. DOI: https://doi.org/10.1145/3447814
J. MacQueen, "Some methods for classification and analysis of multivariate observations," in Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, Volume 1: Statistics, Berkeley, CA, USA, 1965, pp. 281–297.
P. J. Rousseeuw, "Silhouettes: A graphical aid to the interpretation and validation of cluster analysis," Journal of Computational and Applied Mathematics, vol. 20, pp. 53–65, Nov. 1987. DOI: https://doi.org/10.1016/0377-0427(87)90125-7
D. L. Davies and D. W. Bouldin, "A Cluster Separation Measure," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. PAMI-1, no. 2, pp. 224–227, Apr. 1979. DOI: https://doi.org/10.1109/TPAMI.1979.4766909
T. Caliński and J. Harabasz, "A dendrite method for cluster analysis," Communications in Statistics, vol. 3, no. 1, pp. 1–27, Jan. 1974. DOI: https://doi.org/10.1080/03610917408548446
Downloads
How to Cite
License
Copyright (c) 2025 Sakchan Luangmaneerote, Anyawee Chiwachirakhamporn, Jeeranut Tasuntia

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain the copyright and grant the journal the right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) after its publication in ETASR with an acknowledgement of its initial publication in this journal.
