Input keywords, title, abstract, author, affiliation etc..
Journal Article An open access journal
Journal Article

Research on Water Quality Prediction Based on Machine Learning

by Xudong Chen 1,# Fengtian Pei 2,# Minghao Liu 3,# Zejun Chen 4,# Keqin Li 5,#  and  Jingcheng Xie 6,#
1
College of Computer Science and Engineering, Chongqing University of Technology, Chongqing, China
2
School of Information and Intelligent Manufacturing, Jiangyang City Construction College, Sichuan, China
3
College of Water&Architectural Engineering, Shihezi University, Shihezi, China
4
Faculty of Social Sciences, University of Macau, Macao, China
5
Saint-Petersburg Institute for Shipbuilding & MarineTechnology of Guangdong Ocean University, Guangdong Ocean University, Zhanjiang, China
6
School of Software Engineering, Tongji University, Shanghai, China
#
Co-first author
*
Author to whom correspondence should be addressed.
Received: 3 July 2024 / Accepted: 26 July 2024 / Published Online: 12 August 2024

Abstract

At present, some urban water plants in China have started using chloramine disinfection. So how to determine whether the disinfected water is drinkable? This article collected a water quality prediction data, including indicators such as chloramine and trihalomethanes. Firstly, descriptive statistics and Pearson correlation analysis were conducted between the data of chloramine and trihalomethanes and the target variable (whether it is drinkable). It is known that water quality cannot be judged solely based on these two indicators, so more indicators such as pH value will be used. In order to establish a more accurate prediction model, the dataset is first preprocessed, including statistical analysis of missing values, determination of box plot outliers, and filling with KNN algorithm. Then, feature engineering is performed, including Yeo Johnson transformation, correlation analysis, and calculation of Shap values. Subsequently, the processed data was input into the established Stacking, Voting, and attention based CNN-LSTM classification prediction models. Random search and cross validation were used to train each model, resulting in the optimal hyperparameters for each model. The relevant evaluation indicators for each model were calculated to measure its accuracy.


Copyright: © 2024 by Chen, Pei, Liu, Chen, Li and Xie. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY) (Creative Commons Attribution 4.0 International License). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
Show Figures

Share and Cite

ACS Style
Chen, X.; Pei, F.; Liu, M.; Chen, Z.; Li, K.; Xie, J. Research on Water Quality Prediction Based on Machine Learning. Journal of Globe Scientific Reports, 2024, 6, 105. doi:10.69610/j.gsr.202408121
AMA Style
Chen X, Pei F, Liu M et al.. Research on Water Quality Prediction Based on Machine Learning. Journal of Globe Scientific Reports; 2024, 6(4):105. doi:10.69610/j.gsr.202408121
Chicago/Turabian Style
Chen, Xudong; Pei, Fengtian; Liu, Minghao; Chen, Zejun; Li, Keqin; Xie, Jingcheng 2024. "Research on Water Quality Prediction Based on Machine Learning" Journal of Globe Scientific Reports 6, no.4:105. doi:10.69610/j.gsr.202408121

Article Metrics

Article Access Statistics

References

  1. Huang Wei, Yang Wanli, Mei Yuqin, etc Control analysis of disinfection byproduct trihalomethanes in chloramine disinfection [J] Occupational and Health, 2019, 35 (22): 3071-3074.
  2. Julie Jie, He Kai, Huang Sheng, etc Development and Future Prospects of Water Quality Monitoring Technology Driven by Data Model Coupling [J/OL] People's the Pearl River: 1-15 [2014-06-10].
  3. Shang Xudong, Duan Zhongxing, Chen Bingsheng, etc Water quality prediction based on bidirectional long short-term memory network combination model [J/OL] Journal of Environmental Science: 1-10 [June 10, 2024].
  4. Xiao Yanglan, Shen Huirou, Xu Yihan, etc Water quality prediction of Minjiang River Basin based on GBDT-LSTM [J] Journal of Ecological Environment, 2024, 33 (04): 597-606.
  5. Fu Dunkai, Zhang Yunhui, Xu Xiaojun, etc Progress and Trends in Water Quality Prediction Research Based on Bibliometrics [J] East China Geology, 2024, 45 (01): 88-100.
  6. Zhang Shuyan, Chen Qibing, Cai Yijie Research on ARIMA Model Prediction of River Water Quality Based on Exponential Smoothing [J] Guangdong Chemical Industry, 2024, 51 (06): 95-98.
  7. Xiang Xinjian, Xu Honghui, Xie Jianli, etc Research on Water Quality Prediction Based on VMD-TCN-GRU Model [J] People's Yellow River, 2024, 46 (03): 92-97.
  8. Yang Zhenjian, Pang Ying Water quality prediction model based on GAT-BILSTM Res [J] Journal of Tianjin Chengjian University, 2024, 30 (01): 60-65.
  9. Xiao Mingjun, Zhu Yichun, Gao Wenyuan, etc Comparison of Water Quality Prediction Methods Based on Different Artificial Neural Networks [J/OL] Environmental Science: 1-10 [June 10, 2024].
  10. Xu Shengqiang Research on Water Quality Evaluation and Prediction of Handan Yuecheng Reservoir Based on BP Neural Network [J] Shaanxi Water Resources, 2024 (02): 104-105+108.
  11. Niu Jinghui Key data prediction algorithm for industrial wastewater quality based on GWO XGBoost [J] Industrial Water Treatment, 2024, 44 (01): 184-190.