초록 열기/닫기 버튼

Effective nonpoint source (NPS) pollution management requires frequent water quality monitoring, which is, however, often costly to be implementedin practice. Statistical techniques and machine learning methods allow us to identify and focus on fundamental environmental variables that have closerelationships with NPS pollutants of interest. This study developed surrogate models to predict the concentrations of suspended sediment (SS) and totalphosphorus (T-P) from turbidity and runoff discharge rates using multiple linear regression (MLR) and random forest (RF) methods. The RF modelsprovided acceptable performance in predicting SS and T-P, especially when runoff discharge rates were high. The RF models outperformed the MLRmodels in all the cases. Such finding highlights the potential of RF techniques and models as a tool to identify fundamental environmental variablesthat are measured in relatively inexpensive ways or freely available but still able to provide information required to quantify the concentrations o f NPSpollutants. The analysis of relative importance rates showed that the temporal variations of SS and T-P concentrations could be more effectivelyexplained by that of turbidity than runoff discharge rate. This study demonstrated that the advanced statistical techniques such as machine learning couldhelp to improve the efficiency of NPS pollutants monitoring.