Multivariate Statistical Techniques to Identify the Source of Pollution and Assessment of Surface Water Quality

Mohammed Amjed Hossain, I. m. Sujaul, M. A. Nasly


Principal component analysis (PCA) and multiple linear regression (MLR) analysis were applied on the data for 14 physico-chemical parameters of surface waters from Tunggak River adjacent to the Gebeng Industrial Estate, Pahang, Malaysia during February 2012-January 2013 with the objective of identifying sources of pollution and their contribution to the variation in water quality. Physico-chemical parameters were determined for a period of 12 months by following standard methods of analysis. Results revealed that most of the parameters including BOD, COD, conductivity, NH4-N and phosphorus were in concentrations greater than the national standard of Malaysia. PCA was applied to identify the source and MLR analysis was done to determine their contribution. PCA yielded five VFs; which extracted 74.72% of total variance that established its validation. Results showed that, surface water quality was strongly influenced by ionic groups of salts, soil erosion and agricultural runoff, organic and nutrient pollutions from domestic wastewater, industrial sewage and wastewater treatment plants. Vicinity of industrial park resulted in low DO concentration all over the basin. MLR showed the contribution of every variable to be highly significant (p <0.01).


Principal Component Analysis[ Varimax Rotation; Multiple Linear Regression; Dissolved Oxygen; Anthropogenic Activities; Industrial Effluents


Adamu, M. and Abdu, A. 2012. Application of principal component analysis & multiple regression models in surface water quality assessment. Journal of Environment and Earth Science. 22), 16-23.

APHA. 2005. Standard methods for the examination of water and wastewater. Washington, USA: American Public Health Association. 21st edition. 1368 pages.

Fukasawa, E. 2005. Determination of origin of nitrate nitrogen in Fuefuki river using stable isotope method. Bachelor thesis (unpublished). Department of Ecosocial System Engineering, University of Yamanashi, Japan.

HACH. 2005. Water analysis guide. HACH Company, USA.

Hinton, P.; Charlotte, B. and Isabella, M. M. 2004. SPSS Explained. Routledge, Taylor and Francis group. 400 Pages

Huang, F.; Wang X.; Liping, L.; Zhiqing, Z. and Jiaping, W. 2010. Spatial variation and source apportionment of water pollution in Qiantang River China) using statistical techniques. Water Research 44: 1562-1572.

Juahir, H.; Zain, M.S.; Yusoff, M.K.; Hanidza, T.I T.; Mohd Armi, A. S.; Toriman, M.E. and Mokhtar, M. 2011. Spatial water quality assessment of Langat River Basin (Malaysia) using environ-metric techniques. Environmental Monitoring and Assessment 173(1-4): 625-641.

Koklu, R.; Sengorur, B. and Topal, B. 2010. Water quality assessment using multivariate statistical methods, a case study: Melen River system. Water Resource management 24 (5): 959-978.

Liu, C.W.; Lin, K.H. and Kuo, Y.M. 2003. Application of factor analysis in the assessment of groundwater quality in a Blackfoot disease area in Taiwan. Science of the Total Environment 313(1-3): 77-89.

Milovanovic, M. 2007. Water quality assessment and determination of pollution sources along the Axios/Vardar River, South-eastern Europe. Desalinization 213 (1-3): 159-173.

Nasly, M.A.; Hossain, M.A. and Islam, Mir Sujaul. 2013. Water quality index of Sungai Tunggak: an analytical study. In the proceedings of 3rd International Conference on Chemical, Biological and Environment Sciences (ICCEBS'2013) January 8-9, 2013 Kuala Lumpur (Malaysia): 40-44.

Nathans, L.; Oswald, F. and Nimon, K. 2012. Interpreting Multiple Linear Regression: A Guidebook of Variable Importance. Practical Assessment, Research and Evaluation 17 (9). Available at:

Onojake, M. C.; Ukerun, S. O. and Iwuoha, G. 2011. A statistical approach for evaluation of the effect of industrial and municipal wastes on Warri Rivers, Niger Delta, Nigeria. Water quality Exposure and Health 3(2): 91-99.

Pejman, A.H.; Nabi Bidhendi, G. R.; Karbassi, A.R.; Mehrdadi, N. and Esmaeili, Bidhendi M. 2009. Evaluation of spatial and seasonal variations in surface water quality using multivariate statistical techniques. International Journal of Environmental Science and Technology 6(3): 467-476.

Praveena, S.M.; Kwan, O.I. and Aris, A.Z. 2011. Effects of data pre-treatment procedures on principal component analysis: a case study for mangrove surface sediment datasets. Environmental Monitoring and Assessment. doi: 10.1007/s10661-011-2463-2.

Ragno, G.; De Luca, M. and Ioele, G. 2007. An application of cluster analysis and multivariate classification methods to spring water monitoring data. Micro-chemical Journal. 87: 119–127.

Saad, M.; Farah, N.; Nik, Abdul R.; Nik, Norulaini; Abdul Kadir; Mohd Omar and Mohd Omar, Fatehah. 2008. Project Report: Identification of Pollution Sources within the Sungai Pinang River Basin. Universiti Sains Malaysia. Available at:

Satheeshkumar, P. B. and Anisa, K. 2011. Identification of mangrove water quality by multivariate statistical analysis methods in Pondicherry coast, India. Environmental Monitoring and Assessment. doi: 10.1007/s10661-011-2222-4.

Shrestha, S. and Kazama, F. 2007. Assessment of surface water quality using multivariate statistical techniques: A case study of the Fuji river basin, Japan. Environmental Modelling & Software 22: 464-475.

Simeonov, V.J.; Stratis, C.J.; Samara, G.J.; Zachariadis, D.; Voutsa, A.; Anthemidis, M.; Sofriniou, T. and Koumtzis, T. 2003. Assessment of the surface water quality in Northern Greece. Water Resources 37(17): 4119–4124.

Singh, K.P.; Malik, A.; Sinha, S.; Vinod, K. and Murthy, R. C. 2005. Estimation of source of heavy metal contamination in sediments of Gomti River (India) using principal components analysis. Water Air and Soil Pollution 166 (1-4): 321-341.

Sujaul, I.M.; Hossain, M. A.; Nasly M A, and Sobahan M A. 2013. Effect of industrial pollution on the spatial variation of surface water quality. American Journal of Environmental Science 9 (2): 120-129.

Varol Memet, Bülent Gökot, Aysel Bekleyen and Bülent Şen 2012. Spatial and temporal variations in surface water quality of the dam reservoirs in the Tigris River basin, Turkey. Catena. doi:10.1016/j.catena.2011.11.013.

Wang, X.; Yong, Y.L.; Jing-yi, H.; Gui-zhen, H. and Tie-yu, W.2007. Identification of anthropogenic influences on water quality of rivers in Taihu watershed. Journal of Environmental Manage-ment 19: 475-481.

Wu, M.L.; Wang, Y.S.; Sun, C.C.; Wang, H.L. and Dong, J.D. 2009. Using chemometrics to identify water quality in Day Bay, China. Oceanologia 52: 217–232.

Xiao-long, W.; Yong-long, L. U.; Jing-yi, HAN, Gui-zhen, HE and Tie-yu, W.2007. Identification of anthropogenic influences on water quality of rivers in Taihu watershed. Journal of Environmental Sciences 19: 475–481.

Zhang, Y.; Yin, Y.; Feng, L.; Zhu, G.; Shi, Z.; Liu, X. and Zhang, Y. 2011. Characterizing chromophoric dissolved organic matter in Lake Tianmuha and its catchment basin using excitation-emission matrix fluorescence and parallel factor analysis. Water Research 45: 5110–5122.

Zhao, Jian, Guo, Fu, Kun, Lei and Yanwu, Li 2011. Multivariate analysis of surface water quality in the three Gorges area of China and implication for water management. Journal of Environmental Sciences. 23(9): 1460–1471.

Zheng, B.H., Cao, C.J., Qin, Y.W. and Huang, M.S. 2008. Analysis of nitrogen distribution characters and their sources of the major input rivers of three Gorges Reservoir. Environmental Science. 29: 1–6.

Full Text: PDF


  • There are currently no refbacks.

Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.

COPYRIGHT of this Journal vests fully with the National Instional Institute of Ecology. Any commercial use of the content on this site in any form is legally prohibited.