doi: 10.56294/dm2024.371
ORIGINAL
Distribution of Rain Intensity: Daily Maximum Rainfall Data in The Province of South Sumatera and West Sumatera
Distribución de la intensidad de la lluvia: datos de precipitación máxima diaria en la provincia de Sumatra del Sur y Sumatra Occidental
Selpa Dewi1,2 *, M Giatman3 *, Rusnardi Rahmat Putra3 *, Ambiyar3 *
1Indonesia Graduate Student, Graduate School of Education of Vocational Engineering, Faculty of Engineering, Universitas Negeri Padang, Padang, Indonesia.
2Department of Civil Engineering, Faculty of Engineering, Universitas Muhammadiyah Sumatera Barat, Bukittinggi, Indonesia.
3Graduate School of Education of Vocational Engineering, Faculty of Engineering, Universitas Negeri Padang, Padang, Indonesia.
Cite as: Dewi S, Giatman M, Rahmat Putra R, Ambiyar A. Distribution of Rain Intensity: Daily Maximum Rainfall Data in The Province of South Sumatera and West Sumatera. Data and Metadata. 2024; 3:.371. https://doi.org/10.56294/dm2024.371
Submitted: 23-01-2024 Revised: 16-04-2024 Accepted: 31-08-2024 Published: 01-09-2024
Editor: Adrián Alejandro Vitón-Castillo
Corresponding author: Selpa Dewi *
ABSTRACT
The high intensity of rain in May 2024 caused flooding in the Sumatra region, especially Padang City and Palembang City. The region of West Sumatra is also known as the ethalae of disasters with frequent earthquakes, floods, and landslides. In Indonesia, many frequency analyses are conducted using the Gumbel distribution without testing data and clear hydrological reasons. It is feared that this method is considered as a routine way, thus causing the risk of unwanted deviations, resulting in many disasters and errors in the planning of water buildings. This research aims to determine and know the type of distribution that is representative for the maximum daily rainfall frequency in West Sumatra and South Sumatra Provinces as a reference for development and disaster mitigation in the region. The research location is in West Sumatra and South Sumatra. The data used for this study were taken from daily maximum rainfall data from rain stations for 35 to 40 years from 24 rain stations in West Sumatra Province, and 16 rain stations for South Sumatra region. This type of research is development research with the 4D development model (Define, Design, Develop, Disseminate). The development method consists of four different phases: Initial Investigation, Design, Realisation and Development. The data of each station was then organised into two types of data series, namely the annual maximum data series and the annual minimum data series. The results of this data series test are expected to follow one or more types of distributions commonly used in Hydrology and Drainage. The distribution results obtained for the South Sumatra region are Gama-III and LP-III, while for the West Sumatra region the distribution types are Gama-III, LP-III Normal Log and Gembel. The conclusion obtained is that West Sumatra does not have only one type of distribution. For the type of test using the SPSS.22 application, the results obtained are goodness of fit test, parametric test, Chi-Square test, Kolmogorov Smirnov test and Anderson-Darling test and Histrogram (visual) method.
Keywords: Distribution; Maxsimum Rainfall Data; Annual Exceedances; Goodness of Fit; South Sumatra and West Sumatera Provinces.
RESUMEN
La gran intensidad de las lluvias de mayo de 2024 provocó inundaciones en la región de Sumatra, especialmente en las ciudades de Padang y Palembang. La región de Sumatra Occidental también es conocida como la etala de los desastres, con frecuentes terremotos, inundaciones y corrimientos de tierra. En Indonesia, muchos análisis de frecuencia se realizan utilizando la distribución de Gumbel sin datos de prueba ni razones hidrológicas claras. Se teme que este método se considere una forma rutinaria, lo que provoca el riesgo de desviaciones no deseadas, que dan lugar a muchas catástrofes y errores en la planificación de las construcciones hidráulicas. El objetivo de esta investigación es determinar y conocer el tipo de distribución representativa de la frecuencia máxima de precipitaciones diarias en las provincias de Sumatra Occidental y Sumatra Meridional, como referencia para el desarrollo y la mitigación de catástrofes en la región. El lugar de la investigación se encuentra en Sumatra Occidental y Sumatra Meridional. Los datos utilizados para este estudio se tomaron de los datos de precipitaciones máximas diarias de las estaciones pluviométricas durante 35 a 40 años de 24 estaciones pluviométricas de la provincia de Sumatra Occidental, y 16 estaciones pluviométricas de la región de Sumatra Meridional. Este tipo de investigación es una investigación de desarrollo con el modelo de desarrollo 4D (Definir, Diseñar, Desarrollar, Difundir). El método de desarrollo consta de cuatro fases diferentes: Investigación inicial, Diseño, Realización y Desarrollo. A continuación, los datos de cada estación se organizaron en dos tipos de series de datos, a saber, la serie de datos máximos anuales y la serie de datos mínimos anuales. Se espera que los resultados de esta prueba de series de datos sigan uno o varios tipos de distribuciones comúnmente utilizadas en Hidrología y Drenaje. Los resultados de distribución obtenidos para la región de Sumatra Meridional son Gama-III y LP-III, mientras que para la región de Sumatra Occidental los tipos de distribución son Gama-III, LP-III Normal Log y Gembel. La conclusión obtenida es que Sumatra Occidental no tiene un solo tipo de distribución. En cuanto al tipo de prueba mediante la aplicación SPSS.22, los resultados obtenidos son la prueba de bondad de ajuste, la prueba paramétrica, la prueba de Chi-cuadrado, la prueba de Kolmogorov Smirnov y la prueba de Anderson-Darling y el método Histrograma (visual).
Palabras clave: Distribución; Datos de precipitaciones máximas; Excedencias anuales; Bondad de ajuste; Provincias de Sumatra Meridional y Sumatra Occidental.
INTRODUCTION
Palembang is one of the metropolitan cities in Indonesia that is geographically located between 2052’-305’LS and 104052’BT. The area of the city of Palembang is 400,61 km2 with an average height of 8 meters above sea level. With the northern, eastern and western boundaries is Banyuasin district, and the southern boundary is Muara Enim district. While the city of Padang is the largest city located on the west coast of the island of Sumatra, this city is the western gateway of Indonesia from the Indian Ocean. The area of the city of Padang is 694,96 km2 with geographical conditions bordering the sea and surrounded by hills with an altitude of 1,853 meters above sea level. Padang has a coastline of 68,126 km on the mainland of Sumatra. The author’s interest in this study is due to the fact that the area studied is the same island but has a different georological position.(12)
Harto (1993)1 states that the selection of inappropriate distribution types can invite considerable errors, both ‘over-estimated’ and ‘under-estimated’, both of which are undesirable. Harto (1993) also stated that in Indonesia, frequency analysis using Gumbel distribution is carried out without data testing and clear hydrological reasons. It is feared that this method is considered a routine method, resulting in the risk of unwanted deviations. Previously, there had been research on the problem of this type of distribution, including.(11,25) The author’s interest in this study is due to the fact that the area studied is the climate line of West Sumatra in general is tropical with a fairly high air temperature, which is between 22,6 °C to 31,5 °C.
This province is also traversed by the Equator, precisely in Bonjol, Pasaman. In this province, there are a number of large rivers that drain into the east coast of Sumatra such as Batang Hari, Siak, Inderagiri (referred to as Batang Kuantan in the upstream part), and Kampar. Meanwhile, the rivers that drain into the west coast are Batang Anai, Batang Arau, and Batang Tarusan5.Because of the many tributaries that stretch out, this is a characteristic for the province of West Sumatra.The goodness of fit test used is the histogram method, parametric test, Chi-Squared test, Kolmogorov-Smirnov (K-S) test and Anderson-Darling (A-D) test.(32)
METHOD
The process of conducting this research is divided into three parts, namely data collection, data processing and output in the form of conclusions and recommendations from the research results. 8 research steps are outlined as follows :
1. Collect maximum rainfall data in South Sumatra Province through the PSDA Office of South Sumatra Province and maximum rainfall data in West Sumatra Province through the PSDA of West Sumatra Province.
2. Compiling the maximum rainfall data series in South Sumatra and West Sumatra by means of annual maxima and annual exceedances;
3. Calculating the base ten logarithm or natural logarithm of the annual maxima and annual exceedances data series from the discharge data at each station. Mera (2011)3 states that data before logging is called original data, while data after logging is called unoriginal data;
4. Create histograms of the annual maxima and annual exceedances discharge data series at each station to visually determine the distribution;
5. Calculate the statistical parameters of each annual maxima and annual exceedances data series at each station;
6. Match the calculated statistical parameters with the parameters that characterise several distributions namely Normal distribution, Normal-log distribution, Gumbel distribution and LP-III distribution;
7. Testing the distribution of data statistically by Chi-square, K-S and A-D methods. the types of distributions tested are Normal distribution, Normal-log distribution, Gumbel distribution, LP-III distribution, Gama II distribution and Gama III distribution;
8. Compare and determine the representative distribution of the Annual Maxima and Annual Exceedances data series of maximum rainfall data in South Sumatra and West Sumatra.
The procedure of the research is illustrated in the flowchart below:
Figure 1. Research flowchart
The results obtained from this study are to determine the representative type of distribution for rainfall obtained from rainfall measuring stations in South Sumatra and West Sumatra Provinces. For this reason, the secondary objectives of this study are:
a. Compiling a series of monthly maximum rainfall data in South Sumatera and in West Sumatera with the annual maximum series and annual exceedance series.
b. Determine the distribution of daily maximum rainfall data in South Sumatera Province and in West Sumatera, visually by making a histogram.
c. Determine the distribution by parametric test, namely matching the skew coefficient of the Normal distribution, the Normal-log distribution, the Gumbel distribution, the Gama-II distribution, the Gama-III distribution and the LP-III distribution with the skew coefficient obtained from the data series in South Sumatra and West Sumatra.
d. Determine the distribution of daily maximum rainfall data in South Sumatra and West Sumatra based on Chi-Square test, K-S test and A-D test of Normal distribution, Normal-log distribution, Gumbel distribution, Gama-II distribution, Gama-III distribution, and LP-III distribution
The data table below taken is the daily maximum rainfall for 20 years to 40 years from 1974 to 2015 in West Sumatra Province obtained from the West Sumatra Provincial PSDA Office as many as 24 (Twenty-four) stations in accordance with figure 2. based on the availability of existing data.(20)
Figure 2. Map of the Research Location
The limitations of the problem of this study are:
1. The regions studied are South Sumatra and West Sumatra
2. Topographic map and map of the location of rain stations in the provinces of South Sumatra and West Sumatra (figure 1)
3. The data taken is the data on the maximum daily rainfall for 20 years from 1994 to 2014 in South Sumatra Province which was obtained from the South Sumatra Provincial PSDA Office as many as 16 (Sixteen) stations.
4. The data taken is the maximum daily rainfall for 20 years to 40 thousand from 1973 to 2014 in West Sumatra Province which was obtained from the West Sumatra Provincial PSDA Office as many as 24 (Twenty-four) stations in accordance with Table 2. based on the availability of existing data.The following is an explanation of the table 1.1 The data taken is the daily maximum rainfall data for 20years from 1995 to 2015 in South Sumatra Province which is obtained from the PSDA Office of South Sumatra Province as many as 16 (Sixteen).
RESULTS
In compiling a data series to determine the representative distribution of planned rainfall in the provinces of South Sumatra and West Sumatra by using maximum rainfall data, the data is compiled into two types of data series, namely the annual maximum data series (series of annual maxima) and the series of annual larger data (series of annual exceedances). A series of annual maxima is a series of data obtained from a data set with a maximum value every year, where each year one data with a maximum value is taken. Meanwhile, a series of annual exceedances is a series of data obtained from a collection of n amounts of data that have a value of n years larger. This second series includes data that has a second, third, and subsequent large value in a given year that is larger than the data that has a first large value in another year.
In other words, this second data series could have taken more than one data in a given year and not in another year (Mera, 2011). Many people make histograms as an initial assumption to determine the type of distribution to the data being studied. Mera (2011) stated that the height and general shape of the histogram are useful for seeing data characteristics such as whether the data is symmetrical or tilted/tilted/skewed. Data is considered normally distributed if the histogram formed by the data is symmetrical. Meanwhile, the data is considered not normally distributed if the histogram formed by the data is skewed as shown in figure 3.
Figure 3. Tistogram formed by the data is skewed
In figure 3, it is explained that if the graphic image is skewed to the right or left, the data is not evenly distributed if it is in the middle, the distribution spreads evenly, if the pattern is skewed to the right. This means that the data follows Normal-log, Gumbel, Gama II, Gama III and LP-III distributions.
a. Data Series Matching Test with Parametric Test
The determination of distribution based on parametric tests is to calculate statistical parameters from the data and match them with the required statistical characteristics of each distribution. The distributions tested were Normal, Normal-Log, Gumbel, Gama-II, Gama-III distributions and LP-III distributions. Because in general, the data used for frequency analysis in the field of engineering hydrology is relatively small, so the coefficient of inclination is the most priority.
Figure 4. Histogram shape
Based on figure 4, the research results are obtained for the Histogram Shape of the Annual Maxima and Annual Exceedances Debit Data Series of Rivers in West Sumatra for Tarusan station.
Determination of distribution based on parametric test is to calculate the statistical parameters of the data and match them with the required statistical characteristics of each distribution. The distributions tested are Normal, Normal-Log, Gumbel, Gama-II, Gama-III and LP-III distributions. Since in general, the data used for frequency analysis in the field of engineering hydrology is relatively small, the skewness coefficient is the most prioritized. Figure 5 is the statistical parameters required in the parametric test.(32)
Figure 5. Statistical Parameters required by some distributions (Mera, 2011; Bedient and Huber, 1992)
Recapitulation of the distribution type of the annual exceedances discharge data series from West Sumatra Province based on parametric tests can be seen in figures 6, 7, 8 and 9 as shown below. Figure 6 shows recapitulation of the type of distribution of the annual maxima annual discharge data series in South Sumatera based on parametric tests.
Figure 6. Recapitulation of the distribution type of annual excedances discharge data series in South Sumatra based on parametric tests
Figure 7. Recapitulation of the type of distribution of the maxima annual discharge data series from West Sumatra Province based on parametric tests
Figure 8. Recapitulation of the distribution type of annual exceedances discharge data series from West Sumatera Province based on parametric tests
b. Data Series Matching Test with Chi-Squared Test
Chi-squared tests are applied to data that has been grouped into class intervals. Usman and Akbar (2011) stated that in the Chi-squared test, a distribution is determined based on a significant difference between the observation frequency and the theoretical frequency of a data. The distribution is acceptable if the value of Chi-squared is calculated (χ2) < Chi-squared is critical (χ2critical). According to Sudjana (2005), the equations used in calculating the value of χ2 are:
|
(1) |
Figure 9. Normal Distribution Chi-square Test of the annual maxima discharge data series of Muaro Labuah, Indonesia
c. Data Series Compatibility Test with Anderson-Darling Test
The determination of distribution by the A-D test method for the annual maxima and annual exceedances data series in West Sumatra and South Sumatra is the same process. The author will explain the procedure for the annual maxima discharge data series of Muaro Labuah station. The process of determining the value of A2 for the Normal.
d. Recapitulation of the Suitability Test
Recapitulation of the goodness of fit test on the annual maxima discharge data series in West Sumatra and in South Sumatra. While figure 9 is a comparison chart of distribution types in West Sumatra and in South Sumatera. A goodness of fit test against the annual Exsedences data series while Figures 10 and 11 are a comparison chart of distribution types in West Sumatra and in South Sumatra.
Figure 9. Recapitulation of goodness of fit test results based on the Annual maxsima data series of West Sumatra Province and South Sumatera Province
From the picture above, it can be concluded that the results of the research are for the distribution results of the Rain Stations in West Sumatra and South Sumatra Provinces.
Figure 10. Comparison graph of the distribution type of goodness of fit test against the Maxsima annual data series in West Sumatera
Figure 11. Comparison graph of the distribution type of goodness of fit test against the Maxsima annual data series in South Sumatera
The above is a recapitulation of the goodness of fit test of the annual maxima discharge data series in West Sumatra and South Sumatra. While figures 10 and 11 are a comparison chart of distribution types in West Sumatra and in South Sumatra.
CONCLUSION
Visually, the histogram shape of the annual maxima and annual exceedances data series in South Sumatra Province and in West Sumatra Province has a right-leaning pattern. This means that the data follows the distribution of Normal-log, Gumbel, Gama II, Gama III and LP-III distribution. Based on parametric tests in South Sumatra and in West Sumatra, the majority of data follows the Normal-Log and LP-III distributions. Based on the Chi-square test, the annual maxima data series for South Sumatra Province follows the Gamma III and LP-III distributions, and for the annual exceeded data series follows LP III, and for West Sumatra Province the annual maxima data series follows the distribution of Gumbel, Gamma-III and Log-Person Type III (LP-III), and for the annual exceeded data series follows the Normal Log.
Meanwhile, the majority of the annual exceedances data series in South Sumatra followed the LP-III distribution and in West Sumatra Province for the maxima annual data series, the majority followed the Normal-log distribution, and for the annual exceeded followed the Gama-III and LP-III distributions. Meanwhile, in West Sumatra Province, there is no representative distribution of all of these data.
REFERENCES
1. APHA.Standard Methods For The Examination of Water and Wastewater, 23rd Edition (A.D.E.EW. Rice, R.B. Baird (ed.)). American Water Works Association.2003
2. EPA. SW- 846 Method 9081: CationExchange Capacity of Soils (Sodium Acetate). 1986.
3. Agus, I.,Determination of Distribution Type and Smirnov Conformity TestKolmogorov Rain Data Taratak Timbulun Watershed Pesisir Selatan Regency, Civil Engineering, pages 42-51 http://ojs.polinpdg.ac.id/index.php/JRS/arti cle/view/221 (accessed on June 26, 2014).2010
4. Agus, I., Hartati,Chikuadrat Suitability Test of Rain Data of Batang Kuranji Watershed in Padang City,Civil Engineering,pages99111. http://journal.polinpdg.ac.id/index.php?option=com_content&view=article&id=9&Itemid=1&article_id=240 (accessed on June 26, 2014).2011
5. Anderson, T. W., D. A. Darling 1954, A Test of Gooness of Fit, Journal of the American Statistical Association, Vol. 49, No. 268 pages 765-769. (accessed on July 15, 2019).
6. Bedient, P. B., and Huber W. C. 1992, Hydrology and Floodplain Analysis, Addison-Wesley Publishing Company, 712p.
7. Badan Pusat Statistik Provinsi Sumatera Barat”. sumbar.bps.go.id. Archived from the original version on 2023-02-10. Accessed on 2024-02-10.
8. Frank, J., Massey, Jr.,The Kolmogorov-Smirnov test for goodness of fit, Journal of the American Statistical Association, Vol 46, No. 253, Pages 68-78.1951
9. Harto, S., Br., Hydrologic Analysis, Publisher PT Gramedia Pustaka Utama, Jakarta, 303p.1993
10. Haan, C. T.Statistical Methods in Hydrology, The Lowa University Press.1979
11. Machairiyah,Rainfall Analysis for Estimation of Peak Discharge by Rational Method in Percut Watershed, Deli Serdang Regency, Final Project S1, University of North Sumatra, Medan.2007
12. Mera, M.,Engineering Hydrology, Publisher CV. Ferila Padang, 179p.2011
13. Mera, M., Givi, M., Februarman, F., & Daulay, A. Flood Investigation of Safa and Marwa Housing in Reality and Model. Journal of Civil Engineering, 17(2), 176-185.2021
14. PSDA,Publication of Rainfall Data for 2000-2010, Water Resources Management Office of West Sumatra Province, Padang.2009
15. PSDA,Rainfall Data Publication Year.2010
16. Water Resources Management Office of West Sumatra Province, Padang. 2012-2013,
17. PSDA, Publication of 2014-2015 Rainfall Data, Water Resources Management Office of West Sumatra Province, Padang.2011
18. PSDA,Publication of 2014-2015 Rainfall Data of West Sumatra Province Water Resources Management Office, Padang. 2011
19. Pratikno, A. S., Prastiwi, A. A., & Ramahwati, S. Continuous Random Chance Distribution, Normal Distribution, Standard Normal Distribution, T Distribution, Chi Square Distribution, and F Distribution. Osf Preprints, 27(3), 1-5.2020
20. Putra, Perdana. Gabrillin, Abba, ed. “1 October 1945 Established as the Anniversary of West Sumatra”. Kompas.com. Archived from the original version on 2019-12-26. Accessed 2019-12-26.
21. Rodrik, T., K. S. Lubis, Supriadi,Assessment of Rainfall Distribution at Several Rainfall Gauge Stations in Padang Watershed, Online Journal of Agroecotechnology, pages 373-379. http://jurnal.usu.ac.id/index.php/agroekotek nology/article/view/5838 (accessed on June 18, 2014).2013
22. Ruhiat, Y.,Daily Rainfall Distribution with Chi-square, Kolmogorov-Smirnov, and Anderson Darling Tests (Case Study: Rainfall Distribution of Serang Station), Proceedings of National Seminar on Mathematics and Mathematics Education 2013, University of Andalas Padang, pages 133-139. 2013
23. Scholz, F. W., M. A. Stephens, 1987, KSample Anderson-Darling Test, Journal of the American Statistical Association, Vol. 82, No. 399, pages 918-924. 1987
24. Stephens, M., A., An appreciation of Kolmogorov’s 1993 Paper, Technical Report No. 453, Dept.of Statistics Stanford 22p. https://statistics.stanford.edu/sites/default/fi/SOL%20ONR%20453.pdf (accessed on October 07, 2014). Sudjana, 2005, Statistical Methods, Tarsito Publisher, Bandung, 508p. 1992
25. Sutikno, and A. Bey,Probability Analysis to Predict Extreme Climate Events “Case Study: Rainfall Distribution in Karawang, Indramayu and Subang Regencies”, J. Agromet, pages 62-72. http://journal.ipb.ac.id/index.php/agromet/a rticle/view/3592 (accessed on June 18, 2014). 2003
26. Usman, H., P. S. Akbar, introduction to Statistics, Publisher PT Bumi Aksara, Second Edition, Jakarta, 363p. 2011
27. Upomo, T. C., & Kusumawardani, R.Selection of probability distribution in rainfall analysis with goodness of fit test method. Journal of Civil Engineering and Planning, 18(2), 139-148. 2016
28. Usman, H., P. S. Akbar, Introduction to Statistics, Publisher PT Bumi Aksara, Second Edition, Jakarta, 363p. 2011
29. Upomo, T. C., & Kusumawardani, R. (2016). Selection of probability distribution in rainfall analysis with goodness of fit test method. Journal of Civil Engineering and Planning, 18(2), 139-148. 2016
30. Widyasari, T., Intensity Duration Frequency (IDF) Curve of Mononobe Equation in Sleman Regency, Janateknika, pages 85-94. http://jurnalteknik.janabadra.ac.id/wp.../Bu _titik_09.pdf (accessed on July 5, 2014).2009
31. Widhiarso, W.Procedure for Testing Normal Distribution through Kai Kuadrat Test in SPSS.2008
32. Yani, R. N., Yanuar, F., & Yozza, H. BAYESIAN INFERENCE FOR 2 FROM THE NORMAL DISTRIBUTION WITH VARIOUS PRIOR DISTRIBUTIONS. UNAND Journal of Mathematics, 7(2), 132-139. http://journal.ipb.ac.id/index.php/agromet/a rticle/view/3592 (diakses pada tanggal 18 Juni 2014).2018
FINANCING
None.
CONFLICT OF INTEREST
No conflicts of interest are disclosed by the writers.
ACKNOWLEDGEMENTS
Many researchers collaborated to conduct this research. Selpa Dewi improved this publication by assisting with data collection, papers and research data processing. M Giatman, Ambiyar, and Adi Fitra Andikos participated in instrument validation and gave approval to the final draft of the paper. The final draft was approved by all authors.
AUTHORSHIP CONTRIBUTION
Conceptualization: Selpa Dewi.
Data curation: Selpa Dewi.
Formal analysis: Selpa Dewi.
Research: Rusnardi Rahmat Putra.
Methodology: M Giatman.
Project management: Ambiyar.
Validation: M Giatman & Ambiyar.
Drafting - original draft: Rusnardi Rahmat Putra.
Writing - proofreading and editing: M Giatman & Rusnardi Rahmat Putra.
Co Autor: Ambiyar.