Skip to main navigation Skip to search Skip to main content

Assessing the effectiveness of non-point source pollution models in data-limited urban areas

  • Fangze Shang
  • , Sijie Tang
  • , Hantao Wang
  • , Ruiyi Yang
  • , Zhiqiang Hou
  • , Yang Ping
  • , Zhenzhou Zhang
  • , Huayu Chen
  • , Yange Yu
  • , Ashantha Goonetilleke
  • , Changhyun Jun
  • , Xin Tian
  • , Shuo Wang
  • , Ying Wan*
  • , Jiping Jiang
  • *Corresponding author for this work
  • Ltd.
  • Southern University of Science and Technology
  • Hong Kong Polytechnic University
  • Queensland University of Technology
  • Korea University
  • KWR Watercycle Research Institute
  • Ministry of Ecological Environment
  • Harbin Institute of Technology

Research output: Contribution to journalArticlepeer-review

Abstract

Non-point source (NPS) pollution from stormwater runoff has become a major threat to urban water bodies. Rapid and reliable pollution profiling is essential for effective mitigation, yet early-stage stormwater management often lacks detailed drainage data and long-term monitoring, complicating model selection. This study evaluates the performance and practical utility of three widely used NPS modeling approaches—statistical regression, machine learning, and physical process-based models—using a large-scale field monitoring dataset. Improved Export Coefficient Method models achieved high accuracy for TN and COD (R2 > 0.7) but showed overfitting risks due to collinearity. Random Forest Regression predicted COD, TN, NH3-N, and TP well (R2 > 0.6) but struggled with predicting TSS loads. In contrast, SWMM models failed to deliver reliable predictions, even after auto-calibration, underscoring their limitations without prior user expertise. Factor contribution analysis highlighted antecedent dry period, rainfall depth, and land use as key predictors. Nitrogen-related pollutants were more influenced by dry deposition, while phosphorus was more affected by rainfall-triggered wash-off. Finally, a practical multi-criteria evaluation framework, considering accuracy, generalizability, robustness, and cost-efficiency, is proposed to guide model selection under data-limited conditions. This study is expected to promote the utility of machine learning models in practice and provide theoretical support for NPS pollution mitigation in urban areas.

Original languageEnglish
Article number133636
JournalJournal of Hydrology
Volume661
DOIs
StatePublished - Nov 2025
Externally publishedYes

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 11 - Sustainable Cities and Communities
    SDG 11 Sustainable Cities and Communities
  2. SDG 15 - Life on Land
    SDG 15 Life on Land

Keywords

  • Export coefficient method
  • Non-point source
  • Random forest
  • SWMM
  • Stormwater quality
  • Urban runoff

Fingerprint

Dive into the research topics of 'Assessing the effectiveness of non-point source pollution models in data-limited urban areas'. Together they form a unique fingerprint.

Cite this