Assessment of Spatial Water Quality Variations in Shallow Wells Using Principal Component Analysis in Half London Ward, Tanzania ()
1. Introduction
1.1. Background
Growing urban areas in Africa increasingly depend on groundwater as a primary source for domestic and commercial water use [1]. Groundwater is typically accessed via shallow or deep wells [2]. In regions where surface water supply systems are insufficient to meet demand, urban populations often turn to shallow wells due to their affordability and accessibility [3] and [4]. Approximately 85% of public water consumption in these areas comes from groundwater sources [5], which is generally perceived as clean and safe, particularly when drawn from deep, confined aquifers [6] and [7]. Groundwater’s slower response to climate change, compared to surface water, further enhances its appeal. However, shallow wells often face contamination risks due to their proximity to sources like pit latrines, small farms, and solid waste dumps, introducing pollutants from physical, chemical, and microbial origins [2] and [8]. Studies in various African towns have consistently reported microbial contamination in shallow wells near pit latrines, especially in densely populated areas where wells are located downslope of sanitation facilities ([9], [10] and [11]). As a result, managing the quality of groundwater from shallow wells remains a critical challenge. Groundwater quality is often degraded by factors such as small-scale agriculture, urbanization, and industrial activities [12] and [13], while climate change and natural stream movement further impact groundwater chemistry and flow dynamics [14] and [15]. Therefore, regular assessment of groundwater quality is crucial for sustainable water resource management [16] [17].
Water quality assessment involves considering multiple parameters, making it a multidimensional process that requires a robust analytical approach. Principal Component Analysis (PCA) has become an effective tool for this purpose, reducing data complexity while preserving essential information. PCA identifies patterns and correlations among water quality parameters, simplifying multivariate datasets and aiding in better management and decision-making. [18] applied PCA and Cluster Analysis to groundwater data from 20 boreholes, identifying five principal factors that explained 78.69% of the total variance. Significant factors included total hardness, total dissolved solids, and electrical conductivity, which were linked to anthropogenic activities and natural processes. PCA revealed the significant contribution of human actions, such as waste disposal, to groundwater quality degradation. Similarly, [19] used PCA to analyse water samples from a tropical Ramsar wetland near seafood processing facilities. Six principal components were identified, accounting for 65.79% of the variance, with parameters like alkalinity, BOD, and COD indicating organic pollution from seafood waste. This demonstrates the utility of PCA in isolating major pollution contributors in complex ecosystems. Furthermore, [20] used PCA to reduce the dimensionality of water quality data, identifying six significant components that explained 65.40% of the variance. These components were subsequently used as inputs for Artificial Neural Network (ANN) models, which accurately predicted the Water Quality Index (WQI) with a high coefficient of determination (R2 = 0.9999). These studies exemplify the effectiveness of PCA in groundwater quality analysis, making it an essential tool for assessing water quality in areas like Half London, Tunduma Town, Tanzania.
1.2. Description of the Study Area
Tunduma Town, located in the Southern Highlands of Songwe Region at the border of Tanzania and Zambia, spans an area of 87.5 km2 and supports a population of 219,309, with a rapid annual growth rate of 13% [21]. The town’s elevation varies from just below 1500 meters to above 1600 meters above mean sea level, reflecting a varied topography (Figure 1). The study area, Half-London Ward, experiences a unimodal rainy season, which lasts from November to mid-May, with the heaviest rainfall typically occurring in January and February. The region receives an annual average of 1006 mm of rainfall. The average temperature is 20.5˚C, with extremes recorded as low as 6.5˚C in October and as high as 29.0˚C in July. These climatic conditions, along with the town’s geographic and demographic characteristics, make it an ideal setting for water resource studies, particularly related to groundwater from shallow wells.
Figure 1. Location map of the study area (Half-London Ward) in Tunduma Town, Tanzania.
2. Methods
2.1. Study Design and Data Sampling
Water quality sampling was conducted by randomly selecting five commonly used shallow wells from the Half-London Ward in Tunduma, Tanzania. These wells were coded as WW1, WW2, WW3, WW4, and WW5. The selection followed the guidelines provided by the Environmental Protection Agency (EPA) as revealed by [22], which recommend a minimum sample size of five to ten shallow wells when studying groundwater within a specific community. Considering the temporal variation in rainfall patterns, water sampling was carried out monthly over 12 months, from June to May, to capture seasonal changes. Water was collected from the designated points of the five shallow wells, as shown in Figure 1. The position and elevation of the wells were measured using a Global Positioning System (GPS), as detailed in Table 1, and the depth of each well was recorded.
Water analysis was performed for biological (faecal coliform and total coliform), selected physical (pH, electrical conductivity, turbidity, total suspended solids, total dissolved solids, and colour), and chemical (nitrate, phosphate, total iron, and BOD) parameters. The parameters were selected based on their public health importance [23] and their relevance to public water consumption [24]. Samples for microbial tests were collected in sterilized glass bottles, which were rinsed three times with the source water to minimize the risk of external contamination. Since all the shallow wells were open, the sample bottle was held by a bottle holder and submerged to a depth of 0.4 m below the water level to avoid collecting floating debris. For bacterial counts (total coliforms and faecal coliforms), the membrane filtration technique was applied, with results expressed as counts per 100 ml for each sample. All procedures for water sample collection and analysis followed the methods prescribed by the American Public Health Association [25], and the results were compared against the World Health Organization [26] and [27] for water quality to determine the suitability of the water for domestic use.
2.2. Application of Principal Component Analysis
In addition to traditional water quality assessments, Principal Component Analysis (PCA) was employed as a robust statistical tool to manage and interpret the multidimensional water quality data collected over the year. PCA helps in reducing the complexity of the dataset by identifying key variables that contribute most to the variance in water quality. This method was used to transform the original water quality parameters into a set of principal components, which simplified the data while preserving essential information. The PCA was perceived to be capable of revealing the most significant factors affecting water quality, including biological contamination and chemical pollutants, thus providing clearer insights into the primary sources of contamination. Principal Component Analysis (PCA) was conducted using the Jamovi software [28], following the method outlined by [29].
3. Results and Discussion
In this study all water quality parameters which include physical, chemical and biological parameters that were tested are considered in the analysis in order to create an insight of the water quality issues in Half London Ward in Tunduma Tanzania. The descriptive statistics follows next.
3.1. Descriptive Statistics
In spite of the water quality from shallow wells being site specific, it also depends on the various parameters. The pollution parameters of the 5 shallow wells have been summarized by the calculation of minimum and maximum, mean (average), median, standard deviation, skewness, kurtosis, and Shapiro-Wilk as shown in Tables 1-5. When the skewness is considered for all the wells, the values ranged from −2.2 to 11.8 with no zero-value suggesting that the data are skewed. The Kurtosis values for all the wells suggest that the colour for WW1, total dissolved salts for WW2, total dissolved salts and Salinity for WW4 and total dissolved salts for WW5 have their values greater than 3 suggesting that the data are from distribution that has sharper peak and fatter tails compared to a normal distribution [30].
Table 1. Descriptive statistics for analytical measurements of pollution parameters for WW1.
|
Mean |
Minimum |
Maximum |
Median |
SD |
Skewness |
Kurtosis |
Shapiro-Wilk (p) |
pH |
6.35 |
6.20 |
6.70 |
6.30 |
0.16 |
1.15 |
0.72 |
0.03 |
EC |
264.48 |
240.94 |
283.88 |
265.01 |
14.21 |
−0.31 |
−1.20 |
0.48 |
Temp |
23.67 |
22.30 |
25.20 |
23.90 |
1.00 |
0.07 |
−0.85 |
0.30 |
Col |
1.36 |
0.95 |
2.50 |
1.34 |
0.40 |
2.32 |
6.90 |
0.00 |
Turb. |
7.65 |
5.50 |
10.50 |
7.24 |
1.27 |
0.74 |
1.54 |
0.53 |
TSS |
0.49 |
0.12 |
1.80 |
0.23 |
0.53 |
1.78 |
2.68 |
0.00 |
TDS |
61.89 |
52.00 |
69.20 |
60.77 |
5.62 |
−0.50 |
−0.40 |
0.20 |
Hd |
177.33 |
160.00 |
194.10 |
177.50 |
12.24 |
−0.13 |
−1.55 |
0.28 |
Alk |
180.04 |
165.02 |
194.10 |
183.17 |
8.75 |
−0.27 |
−0.88 |
0.61 |
Sal |
0.15 |
0.01 |
0.35 |
0.17 |
0.12 |
0.17 |
−1.12 |
0.17 |
Cl− |
11.99 |
8.10 |
14.50 |
12.27 |
1.92 |
−0.68 |
−0.07 |
0.66 |
|
0.10 |
0.02 |
0.22 |
0.07 |
0.07 |
0.81 |
−0.94 |
0.03 |
|
7.55 |
4.20 |
9.64 |
7.88 |
1.47 |
−1.01 |
1.27 |
0.33 |
|
2.32 |
0.48 |
4.25 |
2.22 |
1.07 |
−0.01 |
−0.17 |
0.86 |
BOD |
2.93 |
0.89 |
4.78 |
3.01 |
1.22 |
−0.19 |
−0.78 |
0.92 |
Ca2+ |
12.92 |
9.60 |
15.00 |
13.25 |
1.60 |
−0.73 |
−0.08 |
0.39 |
Mg2+ |
29.09 |
24.90 |
32.30 |
29.75 |
2.08 |
−0.82 |
0.38 |
0.21 |
Fe2+ |
0.46 |
0.02 |
1.20 |
0.09 |
0.52 |
0.51 |
−1.95 |
0.00 |
FC |
1.25 |
0.00 |
3.00 |
1.00 |
1.06 |
0.52 |
−0.64 |
0.07 |
TC |
5.08 |
2.00 |
9.00 |
5.00 |
2.35 |
0.28 |
−1.45 |
0.16 |
Table 2. Descriptive statistics for analytical measurements of pollution parameters for WW2.
|
Mean |
Minimum |
Maximum |
Median |
SD |
Skewness |
Kurtosis |
Shapiro-Wilk (p) |
pH |
6.60 |
6.30 |
6.80 |
6.70 |
0.17 |
−0.53 |
−1.20 |
0.06 |
EC |
182.75 |
165.30 |
195.80 |
185.35 |
8.67 |
−0.69 |
0.20 |
0.57 |
Temp |
22.51 |
20.40 |
23.90 |
22.88 |
1.19 |
−0.54 |
−1.20 |
0.14 |
Col |
0.87 |
0.32 |
2.30 |
0.57 |
0.59 |
1.47 |
2.05 |
0.02 |
Turb. |
11.97 |
9.00 |
19.20 |
10.97 |
3.18 |
1.30 |
1.22 |
0.05 |
TSS |
0.24 |
0.00 |
2.26 |
0.05 |
0.64 |
3.43 |
11.82 |
<0.001 |
TDS |
298.33 |
240.90 |
350.30 |
297.73 |
31.16 |
−0.13 |
−0.41 |
0.98 |
Hd |
193.24 |
182.30 |
209.40 |
191.04 |
8.56 |
0.77 |
−0.47 |
0.23 |
Alk |
181.45 |
160.70 |
196.75 |
181.60 |
9.23 |
−0.66 |
1.47 |
0.55 |
Sal |
0.39 |
0.11 |
1.50 |
0.25 |
0.37 |
2.83 |
8.72 |
<0.001 |
Cl− |
17.85 |
12.33 |
24.56 |
17.38 |
3.13 |
0.50 |
1.26 |
0.74 |
|
0.38 |
0.09 |
0.60 |
0.41 |
0.17 |
−0.52 |
−0.77 |
0.33 |
|
5.04 |
2.60 |
7.22 |
5.38 |
1.73 |
−0.38 |
−1.47 |
0.11 |
|
1.83 |
0.47 |
4.20 |
1.33 |
1.09 |
1.09 |
0.47 |
0.07 |
BOD |
2.25 |
1.07 |
3.20 |
2.25 |
0.66 |
−0.24 |
−0.96 |
0.74 |
Ca2+ |
24.05 |
19.50 |
28.82 |
23.60 |
2.32 |
0.17 |
1.43 |
0.83 |
Mg2+ |
19.49 |
13.75 |
23.55 |
19.89 |
2.69 |
−0.80 |
1.04 |
0.23 |
Fe2+ |
0.16 |
0.03 |
0.45 |
0.11 |
0.13 |
0.99 |
0.40 |
0.06 |
FC |
0.58 |
0.00 |
2.00 |
0.00 |
0.79 |
0.99 |
−0.46 |
0.00 |
TC |
1.83 |
0.00 |
6.00 |
1.00 |
2.12 |
0.67 |
−0.81 |
0.01 |
Table 3. Descriptive statistics for analytical measurements of pollution parameters for WW3.
|
Mean |
Minimum |
Maximum |
Median |
SD |
Skewness |
Kurtosis |
Shapiro-Wilk (p) |
pH |
6.57 |
6.30 |
6.80 |
6.55 |
0.17 |
−0.21 |
−0.64 |
0.28 |
EC |
261.25 |
240.00 |
274.65 |
265.00 |
12.76 |
−0.65 |
−0.87 |
0.10 |
Temp |
22.58 |
21.30 |
23.80 |
22.50 |
0.65 |
−0.03 |
0.67 |
0.65 |
Col |
1.32 |
0.55 |
3.00 |
0.86 |
0.84 |
1.09 |
−0.16 |
0.02 |
Turb. |
7.61 |
4.20 |
12.70 |
7.58 |
2.11 |
0.96 |
2.69 |
0.26 |
TSS |
0.47 |
0.18 |
1.30 |
0.27 |
0.39 |
1.23 |
0.16 |
0.00 |
TDS |
203.34 |
163.69 |
245.50 |
200.30 |
18.98 |
0.17 |
2.93 |
0.13 |
Hd |
222.28 |
190.25 |
250.00 |
222.65 |
20.83 |
−0.23 |
−1.51 |
0.24 |
Alk |
191.43 |
179.40 |
203.25 |
190.70 |
8.76 |
−0.07 |
−1.42 |
0.26 |
Sal |
0.58 |
0.23 |
0.99 |
0.57 |
0.19 |
0.45 |
1.54 |
0.78 |
Cl− |
18.40 |
10.78 |
24.55 |
18.57 |
3.85 |
−0.52 |
0.47 |
0.62 |
|
0.95 |
0.33 |
1.80 |
1.04 |
0.44 |
0.31 |
−0.53 |
0.60 |
|
5.38 |
3.00 |
8.50 |
5.64 |
1.58 |
0.16 |
0.12 |
0.68 |
|
1.14 |
0.49 |
2.54 |
0.99 |
0.61 |
1.08 |
1.18 |
0.13 |
BOD |
2.77 |
1.20 |
5.00 |
2.93 |
1.01 |
0.61 |
0.99 |
0.54 |
Ca2+ |
30.18 |
27.50 |
36.82 |
30.25 |
2.59 |
1.55 |
3.37 |
0.02 |
Mg2+ |
27.08 |
16.90 |
31.83 |
27.55 |
4.61 |
−1.39 |
1.40 |
0.02 |
Fe2+ |
0.04 |
0.02 |
0.09 |
0.04 |
0.02 |
1.10 |
2.33 |
0.05 |
FC |
2.25 |
1.00 |
5.00 |
2.00 |
1.29 |
0.98 |
0.37 |
0.06 |
TC |
12.75 |
8.00 |
18.00 |
12.00 |
3.60 |
0.19 |
−1.77 |
0.08 |
Table 4. Descriptive statistics for analytical measurements of pollution parameters for WW4.
|
Mean |
Minimum |
Maximum |
Median |
SD |
Skewness |
Kurtosis |
Shapiro-Wilk (p) |
pH |
6.53 |
6.20 |
6.90 |
6.40 |
0.27 |
0.29 |
−1.78 |
0.05 |
EC |
419.96 |
368.53 |
442.00 |
425.40 |
21.56 |
−1.28 |
1.78 |
0.08 |
Temp |
23.09 |
20.50 |
24.10 |
23.23 |
1.04 |
−1.46 |
2.54 |
0.04 |
Col |
2.03 |
0.50 |
4.20 |
1.85 |
1.08 |
0.41 |
−0.04 |
0.80 |
Turb. |
11.28 |
7.56 |
18.70 |
10.21 |
3.30 |
1.38 |
1.26 |
0.03 |
TSS |
0.85 |
0.10 |
7.20 |
0.20 |
2.01 |
3.40 |
11.68 |
<0.001 |
TDS |
237.21 |
200.55 |
266.50 |
243.99 |
23.62 |
−0.28 |
−1.65 |
0.14 |
Hd |
121.17 |
16.50 |
201.30 |
185.00 |
87.69 |
−0.38 |
−2.24 |
<0.001 |
Alk |
178.11 |
155.00 |
189.40 |
179.58 |
8.71 |
−1.78 |
4.48 |
0.02 |
Sal |
0.39 |
0.03 |
0.76 |
0.38 |
0.20 |
0.22 |
0.10 |
0.97 |
Cl− |
31.90 |
25.66 |
37.93 |
31.90 |
3.65 |
0.04 |
−0.40 |
0.96 |
|
1.29 |
0.48 |
2.51 |
1.22 |
0.53 |
0.87 |
1.38 |
0.52 |
|
4.91 |
3.90 |
6.94 |
4.68 |
0.91 |
1.09 |
0.75 |
0.15 |
|
2.18 |
1.20 |
3.20 |
2.04 |
0.62 |
0.30 |
−0.63 |
0.70 |
BOD |
2.86 |
0.97 |
4.69 |
2.84 |
1.22 |
0.01 |
−1.08 |
0.83 |
Ca2+ |
22.32 |
16.30 |
26.81 |
22.98 |
2.81 |
−0.54 |
0.73 |
0.57 |
Mg2+ |
30.01 |
25.46 |
34.66 |
30.29 |
2.53 |
−0.12 |
0.17 |
0.86 |
Fe2+ |
1.37 |
0.95 |
1.85 |
1.33 |
0.32 |
0.31 |
−1.55 |
0.12 |
FC |
0.67 |
0.00 |
2.00 |
0.00 |
0.89 |
0.80 |
−1.27 |
0.00 |
TC |
4.58 |
2.00 |
6.00 |
5.00 |
1.38 |
−0.58 |
−0.83 |
0.10 |
Table 5. Descriptive statistics for analytical measurements of pollution parameter for WW5.
|
Mean |
Minimum |
Maximum |
Median |
SD |
Skewness |
Kurtosis |
Shapiro-Wilk (p) |
pH |
6.51 |
6.10 |
6.80 |
6.55 |
0.24 |
−0.34 |
−1.35 |
0.25 |
EC |
302.51 |
274.25 |
325.80 |
308.27 |
18.73 |
−0.28 |
−1.41 |
0.20 |
Temp |
21.95 |
20.50 |
23.90 |
21.70 |
1.08 |
0.83 |
−0.12 |
0.18 |
Col |
1.10 |
0.23 |
3.86 |
0.58 |
1.11 |
1.74 |
2.54 |
0.00 |
Turb. |
12.00 |
7.50 |
18.50 |
11.68 |
3.48 |
0.41 |
−0.59 |
0.65 |
TSS |
0.52 |
0.00 |
4.25 |
0.10 |
1.22 |
3.08 |
9.76 |
<0.001 |
TDS |
366.33 |
320.89 |
390.16 |
369.53 |
20.43 |
−0.98 |
0.77 |
0.31 |
Hd |
235.01 |
209.20 |
253.55 |
233.00 |
15.16 |
−0.47 |
−0.76 |
0.22 |
Alk |
214.57 |
168.30 |
245.00 |
216.76 |
19.41 |
−1.01 |
2.38 |
0.31 |
Sal |
0.08 |
0.03 |
0.17 |
0.08 |
0.04 |
0.63 |
−0.28 |
0.34 |
Cl− |
25.97 |
18.60 |
29.50 |
26.75 |
3.44 |
−1.23 |
0.80 |
0.06 |
|
1.10 |
0.12 |
2.30 |
1.24 |
0.67 |
0.18 |
−0.97 |
0.46 |
|
3.61 |
1.59 |
5.48 |
3.51 |
1.18 |
−0.12 |
−0.80 |
0.96 |
|
0.77 |
0.09 |
1.50 |
0.75 |
0.48 |
0.15 |
−1.13 |
0.56 |
BOD |
2.41 |
0.59 |
3.90 |
2.51 |
1.06 |
−0.21 |
−0.68 |
0.60 |
Ca2+ |
19.65 |
17.50 |
21.89 |
19.80 |
1.30 |
−0.08 |
−0.68 |
0.86 |
Mg2+ |
24.17 |
17.52 |
27.57 |
24.63 |
2.99 |
−1.10 |
0.82 |
0.12 |
Fe2+ |
0.39 |
0.09 |
0.69 |
0.39 |
0.24 |
0.09 |
−1.71 |
0.10 |
FC |
0.17 |
0.00 |
1.00 |
0.00 |
0.39 |
2.06 |
2.64 |
<0.001 |
TC |
1.00 |
0.00 |
5.00 |
0.00 |
1.65 |
1.60 |
1.94 |
<0.001 |
3.2. Correlation Matrix for Different Shallow Wells Water Quality
The correlation matrices for various water quality parameters were analysed using Pearson’s correlation coefficients, as detailed in Tables 6-10. Pearson’s correlation coefficient (r) values were used to assess the strength and direction of the relationships between variables. According to [25] and [28] correlation values are interpreted as follows: |r| ≥ 0.5 is considered a strong correlation, 0.3 ≤ |r| < 0.5 is moderate, 0.1 ≤ |r| < 0.3 is weak, and 0 indicates no correlation. A positive r indicates that as one variable increases, the other also increases, while a negative r suggests an inverse relationship between the variables.
In WW1 (Table 6), pH showed strong correlations with temperature (Temp), total suspended solids (TSS), nitrate (
), and faecal coliforms (FC). Electrical conductivity (EC) correlated with
and biochemical oxygen demand (BOD). Total dissolved solids (TDS) showed strong correlations with
, iron (Fe2+), FC, and total coliforms (TC), highlighting the connection between runoff and microbial contamination. Salinity (Sal) correlated with phosphate (
), BOD, and magnesium (Mg2+). Chloride (Cl−) was negatively correlated with TC, suggesting reduced microbial contamination with higher Cl− concentrations. Other significant correlations included sulphate (
) with calcium (Ca2+), and
with Fe2+, FC, and TC, pointing to potential contamination from fertilizers and human waste sources [14] [31].
For WW2 (Table 7), EC was negatively correlated with coliforms (Col), while temperature (Temp) was positively correlated with Col and Ca2+ but negatively correlated with BOD. TDS showed significant correlations with hardness (Hd),
, Mg2+, Fe2+, and TC. Alkali (Alk) was correlated with
,
, BOD, and Mg2+, reflecting the geological influence on water chemistry. Similar patterns were observed for Cl−, which was positively correlated with
, BOD, and Mg2+. These relationships indicate that both geological strata and human activities, such as pit latrine use and agricultural runoff, significantly influence groundwater quality [32] [33].
In WW3 (Table 8), pH correlated with hardness (Hd), while EC correlated with Alk, reflecting mineral dissolution. Col was positively correlated with turbidity (Turb) and Alk. Strong correlations were found between TSS, Cl−, and Fe2+, indicating contamination from runoff and potential industrial activities. Additionally,
and
were strongly correlated with Ca2+, Mg2+, FC, and TC, pointing to contamination from fertilizers and human waste sources. These findings align with studies highlighting the role of agricultural runoff and sanitation practices in groundwater contamination [7] [34] and [35].
For WW4 (Table 9), EC was negatively correlated with Col and positively correlated with Alk. Temperature was negatively correlated with TSS and Hd but positively correlated with Sal and Fe2+, indicating a complex relationship between water temperature and chemical composition. Turb was correlated with
,
, and other dissolved solids, indicating pollution from both agricultural and geological sources. The correlation between
, BOD, and Mg2+ further supported the role of mineral dissolution and organic pollution in influencing water quality [35] [37].
In WW5 (Table 10), pH was significantly correlated with several variables, including Turb, TDS, Hd, Alk, Sal,
,
,
, BOD, and Mg2+. Negative correlations were observed between EC and Col, while temperature was correlated with Col but negatively correlated with Hd, Mg2+, and BOD. TDS, Hd, and Alk were positively correlated with
,
, and
, indicating pollution from fertilizers and waste sources. These results demonstrate the influence of human activities and natural processes on groundwater quality in WW5 [29].
Table 6. Correlation matrix for different water quality parameters in WW1.
Table 7. Correlation matrix for different water quality parameters in WW2.
Table 8. Correlation matrix for different water quality parameters in WW3.
Table 9. Correlation matrix for different water quality parameters in WW4.
Table 10. Correlation matrix for different water quality parameters in WW5.
3.3. Component Numbers and Eigenvalue Relations
The selection of the component numbers in the analysis was based on the Kaiser criterion, where only eigenvalues greater than 1 were considered significant [38]. Consequently, wells WW1, WW2, WW3, WW4, and WW5 had 5, 6, 4, 5, and 6 principal components (PCs), respectively, in the principal component analysis (PCA) as indicated in Figure 2. These PCs were deemed sufficient to capture the essential variation in the original water quality variables from the shallow wells. The analysis allowed for a comprehensive representation of the water quality characteristics, with a detailed explanation of the significant PCs for each well provided to ensure a clear understanding of the underlying factors influencing water quality [39].
(a)
(b)
(c)
(d)
(e)
Figure 2. Scree plot of the eigenvalue for each component for the 5 shallow wells.
3.3.1. WW1
The water quality from WW1 can be effectively described by five principal components (PC1, PC2, PC3, PC4, and PC5), accounting for 87.53% of the total variability in the dataset (Table 11). The variance contributions for each component are: 33.05% for PC1, 25.97% for PC2, 14.56% for PC3, 8.44% for PC4, and 5.50% for PC5, indicating PC1 and PC2 are the most influential. The Sum of Squared Loadings (SS Loadings) also explain 87.53% of the variability, with PC1, PC2, PC3, PC4, and PC5 contributing 29.47%, 18.47%, 17.96%, 14.15%, and 7.47%, respectively. These results indicate that the first two components capture the majority of the water quality variability, consistent with findings in similar environmental studies [15] [40].
Table 11. Percentage variances and cumulative variance of the first 5 PCs’ eigenvalues and SS loadings.
PC No. |
Eigenvalue |
% of Variance |
Cumulative % |
SS Loadings |
% of Variance |
Cumulative % |
1 |
6.61 |
33.05 |
33.05 |
5.89 |
29.47 |
29.47 |
2 |
5.19 |
25.97 |
59.03 |
3.69 |
18.47 |
47.95 |
3 |
2.91 |
14.56 |
73.59 |
3.59 |
17.96 |
65.91 |
4 |
1.69 |
8.44 |
82.03 |
2.83 |
14.15 |
80.05 |
5 |
1.10 |
5.50 |
87.53 |
1.49 |
7.47 |
87.53 |
Figure 3 presents the factor scores for the five principal components (PCs) that explain 87.53% of the total variance in the water quality data from WW1. These scores indicate the contribution of each variable to the variance explained by each component, providing insight into the underlying factors driving water quality variations.
PC1 (33.05%) is strongly influenced by pH, electrical conductivity (EC), temperature, total suspended solids (TSS), total dissolved solids (TDS), hardness (Hd), phosphate (
), nitrate (
), iron (Fe2+), faecal coliforms (FC), and total coliforms (TC). This suggests that surface runoff is a major factor, contributing to increased EC, Temp, TSS, and TDS, while contamination from pit latrines is indicated by high FC and TC levels. The presence of agrochemical fertilizers is highlighted by the elevated levels of
and
. Interestingly, chloride (Cl−) has a negative effect, implying an inverse relationship with the contaminants driving PC1. Variables such as alkalinity (Alk), biochemical oxygen demand (BOD), calcium (Ca2+), coliforms (Col), magnesium (Mg2+), salinity (Sal), sulphate (
), and turbidity (Turb) do not significantly influence PC1. This aligns with studies showing that microbial and chemical contaminants are frequently associated with surface runoff and poor sanitation practices [14] and [41].
PC2 (25.97%) is characterized by the positive contribution of EC, hardness (Hd), Alk, Sal, BOD, Ca2+, Mg2+, and FC, pointing to the influence of geological formations and pit latrine contamination. High alkalinity, hardness, and salinity often result from water interacting with geological strata, such as limestone or dolomite, which release calcium and magnesium ions into the water [42]. The presence of faecal coliforms further suggests the proximity of contamination sources like pit latrines. Variables such as Cl−, Col, Fe2+,
, pH,
,
, TC, TDS, Temp, TSS, and Turb do not significantly contribute to PC2, reinforcing the idea that geological processes dominate in this component [43].
PC3 (14.56%) is dominated by coliforms (Col), turbidity (Turb), TSS, Sal, and Cl⁻, suggesting that surface runoff and microbial contamination play a key role. The positive loadings of TSS and Turb indicate the presence of suspended particles typically carried by runoff from urban or agricultural lands. The strong association between salinity and chloride further suggests influence from both surface runoff and geological sources, as these ions can enter the water through the dissolution of rock salts or human activities like road de-icing [41]. The negative contribution of EC and
may suggest areas where high microbial contamination correlates with lower concentrations of these chemicals, potentially due to dilution or other local environmental factors.
PC4 (8.44%) is defined by the positive loadings of hardness (Hd),
,
, Ca2+, and Mg2+, which are indicative of mineral dissolution from geological sources. The significant presence of sulphate and phosphate suggests the influence of sulphate-bearing minerals like gypsum or anthropogenic sources such as agricultural fertilizers. The negative effect of
points to the possibility of competing interactions between geological sources and human activities for phosphorus levels in the water. These findings align with studies indicating that areas with high sulphate and hardness often reflect regions where groundwater interacts with mineral-rich bedrock [42].
PC5 (5.50%) reflects the influence of pH, temperature (Temp), Cl−, and
, suggesting a combination of geological and human influences. Human activities such as water treatment or industrial discharges can impact pH and temperature, while the presence of sulphate and chloride may indicate inputs from both natural sources (e.g., mineral dissolution) and urban activities. The negative contributions of salinity and Mg2+ suggest that these factors are less influenced by surface interactions and may reflect deeper groundwater characteristics. This component aligns with findings from studies where industrial or urban activities have a significant impact on water temperature and chemical composition [14].
The factor scores reveal a complex interplay between surface runoff, agricultural practices, pit latrine contamination, and geological processes. PC1 emphasizes the impact of surface runoff and microbial contamination, PC2 and PC4 highlight the role of geological formations, while PC3 and PC5 show the combined effects of human activities and natural processes. These findings are consistent with previous research that demonstrates the significant contributions of both natural and anthropogenic factors to water quality variability [41] and [42].
(a)
(b)
(c)
(d)
(e)
Figure 3. Factor score coefficient for different water quality parameter for WW1 for PCs.
3.3.2. WW2
The water quality from WW2 can be effectively described by six principal components (PCs): PC1, PC2, PC3, PC4, PC5, and PC6, which together account for 92.55% of the total variability in the original dataset (Table 12). The contribution of each PC to the variance is as follows: 36.17% for PC1, 22.72% for PC2, 11.17% for PC3, 9.35% for PC4, 8.13% for PC5, and 5.00% for PC6. Additionally, the Sum of Squared Loadings (SS Loadings) for these PCs explain 92.55% of the total variance, with each PC contributing 25.97%, 18.08%, 17.19%, 14.85%, 9.13%, and 7.34%, respectively. These results indicate that the first two components, PC1 and PC2, capture the majority of the variability, highlighting their importance in explaining the water quality variation in WW2 [29].
Table 12. Percentage variances and cumulative variance of the first 5 PCs’ eigenvalues and SS loadings.
PC No. |
Eigenvalue |
% of Variance |
Cumulative % |
SS Loadings |
% of Variance |
Cumulative % |
1 |
7.23 |
36.17 |
36.17 |
5.19 |
25.97 |
25.97 |
2 |
4.54 |
22.72 |
58.89 |
3.62 |
18.08 |
44.04 |
3 |
2.23 |
11.17 |
70.06 |
3.44 |
17.19 |
61.23 |
4 |
1.87 |
9.35 |
79.41 |
2.97 |
14.85 |
76.08 |
5 |
1.63 |
8.13 |
87.55 |
1.83 |
9.13 |
85.21 |
6 |
1.00 |
5.00 |
92.55 |
1.47 |
7.34 |
92.55 |
Figure 4 illustrates the factor scores for the six principal components (PCs) that explain 92.55% of the total variability in water quality data. These PCs highlight the dominant variables influencing water quality, providing insight into both natural processes and anthropogenic activities that impact the study area.
PC1, accounting for 36.17% of the total variance, is driven by pH, electrical conductivity (EC), temperature (Temp), coliforms (Col), turbidity (Turb), total suspended solids (TSS), total dissolved solids (TDS), salinity (Sal), chloride (Cl−), sulphate (
), magnesium (Mg2+), and iron (Fe2+). These variables indicate the influence of surface runoff (EC, Temp, Col, TSS, TDS), which contributes to elevated suspended solids and microbial contamination, and geological formations (pH, Sal, Cl−,
, Mg2+, Fe2+), which affect water chemistry through mineral dissolution. Variables such as hardness (Hd), alkalinity (Alk), phosphate (
), nitrate (
), and faecal coliforms (FC) show no significant effect on PC1, aligning with studies that link surface runoff and geological strata as major drivers of water quality variation [42].
PC2, contributing 22.72% of the variance, is influenced by Col, Turb, TSS, TDS, Hd, Alk, Sal, Cl−, and Mg2+. This reflects surface runoff (Col, Turb, TSS, TDS) and geological processes (Hd, Alk, Sal, Cl−, Mg2+). Runoff from urban or agricultural areas increases suspended solids, while mineral dissolution contributes to hardness and alkalinity in the groundwater. Variables such as pH, EC, Temp,
, and
show no effect. These patterns are consistent with studies of water quality where runoff and mineral dissolution affect the chemical composition of water sources [41].
PC3, explaining 11.17% of the variance, is dominated by TSS, TDS, Alk,
,
, and
, indicating surface runoff (TSS, TDS), agrochemical influences (
,
), and geological factors (Alk,
). The presence of phosphates and nitrates suggests contamination from agricultural fertilizers, while alkalinity and sulphates point to mineral dissolution. Chloride (Cl−) and Fe2+ contribute negatively, possibly indicating dilution or lower concentrations of these ions in areas dominated by agricultural inputs. These results are in line with research showing the impact of fertilizers on water quality in regions with high agricultural activity [43].
PC4, which accounts for 9.35% of the total variance, is influenced by Turb, TDS, Cl−, BOD, Mg2+, and Fe2+. Surface runoff (Turb, TDS) and contamination from pit latrines (BOD) are key factors, while Mg2+ and Fe2+ suggest mineral dissolution from geological strata. The negative contributions from Ca2+ and
imply less influence from these variables in this component, which may be explained by their absence in areas of higher pit latrine contamination or geological settings less rich in these minerals. Such findings are consistent with studies in regions where sanitation and runoff significantly affect water quality [14].
PC5 contributes 8.13% of the variance and is dominated by Temp, Sal,
, and Mg2+, indicating a combination of agrochemical (
) and geological influences (Mg2+, Sal). The presence of
suggests contamination from agricultural sources or pit latrines, while the influence of salinity and magnesium reflects interactions with geological formations. The negative contribution of FC implies lower microbial contamination in areas influenced by geological and agrochemical sources, potentially due to deeper groundwater sources being less exposed to surface contamination [41].
PC6, explaining 5.00% of the variance, is primarily driven by
and FC, indicating contamination from both agrochemical runoff and pit latrines. The strong presence of nitrates and faecal coliforms in this component suggests areas where agricultural and sanitation practices significantly affect water quality. Other variables have minimal influence, consistent with studies that highlight nitrate and microbial contamination as major issues in regions affected by human and agricultural waste [14].
The six principal components reflect a complex interplay of surface runoff, geological factors, agrochemical pollution, and sanitation-related contamination. PC1 and PC2 highlight the dominant roles of surface runoff and geological strata, while PC3 and PC5 emphasize the contributions of agrochemicals and sanitation practices. These results align with previous studies, confirming that both natural processes and human activities significantly impact water quality in the study area [42].
(a)
(b)
(c)
(d)
(e)
(f)
Figure 4. Factor score coefficient for different water quality parameter for WW2 for PCs.
3.3.3. WW3
The quality of the water from the WW2 can be sufficiently described by 4 PCs namely PC1, PC2, PC3 and PC4. These PCs contribute 84.61% of the variability of the original results (Table 13). The variances of the PCs are 39.63%, 18.79%, 14.15% and 12.04% for PC1, PC2, PC3 and PC4 respectively. On the other hand, the sum of the square loading (SS Loading) for the PCs are shown and the variance contribute to 84.61% of the variability of the SS loading. The variances of the PCs are 34.27%, 19.84%, 15.4% and 15.09% for PC1, PC2, PC3 and PC4 respectively.
Table 13. Percentage variances and cumulative variance of the first 5 PCs’ eigenvalues and SS loadings.
PC No. |
Eigenvalue |
% of Variance |
Cumulative % |
SS Loadings |
% of Variance |
Cumulative % |
1 |
7.93 |
39.63 |
39.63 |
6.85 |
34.27 |
34.27 |
2 |
3.76 |
18.79 |
58.42 |
3.97 |
19.84 |
54.11 |
3 |
2.83 |
14.15 |
72.57 |
3.08 |
15.4 |
69.52 |
4 |
2.41 |
12.04 |
84.61 |
3.02 |
15.09 |
84.61 |
Figure 5 provides the factor scores for the five principal components (PCs) that explain water quality variability. Each component reflects a combination of factors such as surface runoff, geological formations, sanitation, and agrochemical use. The higher the score value of a variable, the greater its contribution to the variability of the corresponding component.
PC1 contributes 36.17% of the variance and is driven by pH, electrical conductivity (EC), temperature (Temp), total dissolved solids (TDS), hardness (Hd), salinity (Sal), chloride (Cl−), phosphate (
), calcium (Ca2+), magnesium (Mg2+), faecal coliforms (FC) and total coliforms (TC). This component indicates a blend of influences: Geological strata (pH, Hd, Sal, Cl−, Ca2+, Mg2+). These parameters are typically associated with natural mineral dissolution as groundwater interacts with subsurface formations. Elevated hardness and salinity often reflect regions rich in minerals like limestone or gypsum, contributing to higher concentrations of calcium, magnesium, and salts in water [43]. Surface runoff (TDS, Cl−): Runoff from urban or agricultural areas can introduce dissolved solids and chloride, often due to fertilizers. This results in elevated levels of TDS and chloride in surface and shallow groundwater. Sanitation-related contamination (FC, TC). The presence of faecal coliforms and total coliforms suggests contamination from pit latrines or poorly maintained sanitation systems, which can leach into groundwater. This is consistent with studies showing microbial contamination in areas with inadequate waste management [32]. Agrochemical influence (
,
). Phosphates and nitrates are commonly associated with fertilizer runoff, highlighting the impact of agricultural activities on water quality. Interestingly, EC and Temp have negative effects on PC1, suggesting that in areas with high contributions from other variables (e.g., TDS and coliforms), conductivity and temperature may have less of an impact. Variables such as coliforms (Col), turbidity (Turb), TSS, alkalinity (Alk), sulphate (
), BOD, and Iron (Fe²⁺) do not contribute significantly to PC1.
PC2 explains 22.72% of the total variance and is influenced by Temp, Col, Turb, TSS, Sal, Cl−,
, BOD, Ca2+ and Fe2+. This component suggests a mix of surface and subsurface influences: Surface runoff (Temp, Col, Turb, TSS): Runoff from urban areas or agricultural lands can introduce suspended solids, turbidity, and coliform bacteria into the water, increasing contamination. High temperature often indicates surface water interaction or shallow groundwater exposed to climatic variations [15]. Geological influence (Sal, Cl−, Ca2+, Fe2+): These variables point to mineral dissolution processes, particularly in groundwater rich in iron and calcium due to the presence of minerals such as calcite or dolomite. Agrochemical contamination (
). Elevated phosphate levels suggest fertilizer runoff, indicating agricultural activities as a major influence on water quality. Phosphates in groundwater are often linked to excessive use of chemical fertilizers, leading to nutrient pollution. Variables such as pH, EC, TDS, Hd, Alk,
,
, Mg2+, FC, and TC show no significant effect, highlighting that PC2 is primarily dominated by runoff and specific geological interactions.
PC3, accounting for 11.17% of the variance, is driven by Temp, TDS,
, and Mg2+, while Col, Sal,
, and Ca2+ show negative effects: Surface runoff (TDS, Temp): This highlights the influence of dissolved solids from runoff, where temperature plays a role in the rate of contamination. Geological strata (Mg2+): The presence of magnesium reflects mineral interactions within groundwater aquifers, pointing to natural sources of contamination. Agrochemical inputs (
): Again, phosphates are a strong indicator of agricultural runoff, emphasizing the impact of farming activities on groundwater quality. Negative contributions from Col, Sal,
, and Ca2+ may suggest that in regions with high phosphate or magnesium concentrations, these variables are less influential. Other variables such as pH, EC, Turb, TSS, Hd, Alk,
, BOD, Fe2+, FC, TC, and Cl− do not show a significant impact, implying a more localized influence of surface runoff and geological features in PC3.
PC4 explains 9.35% of the variance, with positive contributions from EC, Alk, and BOD, and negative contributions from Temp, Col,
, and TC: Geological influence (EC, Alk): The presence of alkalinity and conductivity suggests mineral dissolution, particularly in regions where groundwater interacts with calcareous formations, contributing to higher alkalinity. Pit latrine contamination (BOD): Elevated BOD indicates organic pollution, typically associated with human waste leaching into water sources, often from pit latrines. Negative contributions from temperature, coliform bacteria, and nitrates suggest that microbial contamination may be less impactful in areas dominated by mineral dissolution or deeper groundwater sources. This finding reflects regions where organic pollution from pit latrines is more significant than surface contamination. Variables such as pH, Turb, TSS, TDS, Hd, Sal, Cl−,
,
, Ca2+, Mg2+, Fe2+, and FC do not show significant influence, emphasizing geological and sanitation-related processes as dominant in this component.
The four principal components reveal critical influences on water quality, including surface runoff, geological interactions, agricultural activities, and sanitation-related contamination. PC1 highlights the role of runoff and geology, while PC2 and PC3 emphasize the impact of agrochemical runoff and geological processes. PC4 points to contamination from pit latrines. The analysis underscores the complex interaction between natural processes and human activities in shaping groundwater quality [42].
(a)
(b)
(c)
(d)
Figure 5. Factor score coefficient for different water quality parameter for WW3 for PCs.
3.3.4. WW4
The water quality of WW4 can be adequately described by six principal components (PC1-PC6), which together explain 90.83% of the total variability in the original dataset (Figure 6). The variance contributions for each PC are 36.17% for PC1, 22.72% for PC2, 11.17% for PC3, 9.35% for PC4, 8.13% for PC5, and 5.00% for PC6, making them sufficient to represent the underlying water quality patterns. The Sum of Squared Loadings (SS Loadings) for these PCs explain 92.55% of the total variance, with each PC contributing 25.97%, 18.08%, 17.19%, 14.85%, 9.13%, and 7.34%, respectively, indicating a substantial representation of the original data [29] (Table 14).
As shown in Table 12, PC1 has an eigenvalue of 8.71, accounting for 43.53% of the variance, followed by PC2 at 3.89 (19.44%), PC3 at 2.81 (14.04%), PC4 at 1.73 (8.66%), and PC5 at 1.03 (5.17%). These components cumulatively explain 90.83% of the variance, indicating that the majority of the variability in water quality across WW4 can be captured by these five components [44].
Table 14. Percentage variances and cumulative variance of the first 5 PCs’ eigenvalues and SS loadings.
PC No. |
Eigenvalue |
% of Variance |
Cumulative % |
SS Loadings |
% of Variance |
Cumulative % |
1 |
8.71 |
43.53 |
43.53 |
5.86 |
29.32 |
29.32 |
2 |
3.89 |
19.44 |
62.96 |
5.72 |
28.61 |
57.93 |
3 |
2.81 |
14.04 |
77.00 |
2.9 |
14.51 |
72.44 |
4 |
1.73 |
8.66 |
85.66 |
2.09 |
10.46 |
82.9 |
5 |
1.03 |
5.17 |
90.83 |
1.59 |
7.93 |
90.83 |
Figure 6 shows the factor scores for the five PCs. The higher the score value of the variable, the more the variable contributes to the variability of the particular PC. PC1 (43.53% variance). 12 variables contribute positively, including Turb, TDS, Alk, Sal, Cl−,
,
,
, BOD, Ca2+, Mg2+ and FC. This indicates influences from surface runoff (Turb, TDS), pit latrines (
,
, BOD, FC), and geological strata (Alk, Sal, Cl−, Ca2+, Mg2+), as well as agrochemical fertilizers (
,
). Variables such as pH, EC, Temp, Col, TSS, Hd, Fe2+, and TC do not contribute significantly [42].
(a)
(b)
(c)
(d)
(e)
Figure 6. Factor score coefficient for different water quality parameter for WW4 for PCs.
PC2 (19.44% variance): Positive contributors include Temp, TDS, Sal,
,
, Fe2+, FC, and TC, while Hd has a negative effect. This suggests that surface runoff (Temp, TDS), pit latrine contamination (FC, TC), and geological factors (Sal, Fe2+) are key drivers, along with agrochemical inputs (
,
) [36].
PC3 (14.04% variance): Positive contributions from pH, Col, Turb,
,
, and
suggest influences from surface runoff (Col, Turb), pit latrines (
,
), geological factors (pH), and agrochemical use. Negative effects from EC and TC indicate areas where these variables are less influential [14].
PC4 (8.66% variance): Only pH and TSS contribute positively, while Temp and Cl⁻ have negative effects. This suggests the role of surface runoff (TSS) and geological influence (pH), with other variables showing minimal impact.
PC5 (5.17% variance): pH, EC, and
contribute positively, indicating influences from surface runoff (
), geological strata (pH, EC), and agricultural fertilizers. Other variables, including Temp, Col, Turb, TSS, Hd, Sal, and Fe2+, have no significant contribution.
The principal component analysis reveals that water quality in WW4 is strongly influenced by surface runoff, geological factors, pit latrine contamination, and agrochemical inputs. The first three components capture the majority of variability, with PC1 highlighting the combined effect of geological and human influences, particularly from sanitation and agricultural practices [36] [45].
3.3.5. WW5
The water quality of WW2 can be effectively described by six principal components (PC1-PC6), accounting for 92.51% of the total variability in the original dataset (Figure 7). The variance contributions of each PC are 42.73% for PC1, 18.26% for PC2, 12.17% for PC3, 8.46% for PC4, 5.71% for PC5, and 5.18% for PC6, making these components sufficient to capture the underlying water quality variation. The Sum of Squared Loadings (SS Loadings) further explain 92.55% of the variance, with contributions of 35.21%, 15.04%, 14.92%, 11.06%, 8.79%, and 7.50% for the respective PCs (Table 15) [29].
As presented in Table 15, PC1 has the highest eigenvalue of 8.55, explaining 42.73% of the variance, followed by PC2 at 3.65 (18.26%), PC3 at 2.43 (12.17%), PC4 at 1.69 (8.46%), PC5 at 1.14 (5.71%), and PC6 at 1.04 (5.18%). The cumulative variance explained by these components is 92.51%, indicating that the six PCs sufficiently capture the essential patterns in the water quality data [44].
Table 15. Percentage variances and cumulative variance of the first 5 PCs’ eigenvalues and SS loadings.
PC No. |
Eigenvalue |
% of Variance |
Cumulative % |
SS Loadings |
% of Variance |
Cumulative % |
1 |
8.55 |
42.73 |
42.73 |
7.04 |
35.21 |
35.21 |
2 |
3.65 |
18.26 |
60.98 |
3.01 |
15.04 |
50.26 |
3 |
2.43 |
12.17 |
73.15 |
2.98 |
14.92 |
65.17 |
4 |
1.69 |
8.46 |
81.62 |
2.21 |
11.06 |
76.23 |
5 |
1.14 |
5.71 |
87.33 |
1.76 |
8.79 |
85.02 |
6 |
1.04 |
5.18 |
92.51 |
1.50 |
7.50 |
92.51 |
PC1 (42.73% variance): 11 variables contribute positively, including pH, turbidity (Turb), total dissolved solids (TDS), hardness (Hd), alkalinity (Alk), salinity (Sal), phosphate (
), sulfate (
), nitrate (
), biochemical oxygen demand (BOD), and magnesium (Mg2+). Temperature (Temp) has a negative effect. This indicates influences from surface runoff (Turb, TDS), pit latrines (
,
, BOD), geological strata (pH, Hd, Alk, Sal), and agrochemical fertilizers (
,
,
). EC, Col, TSS, Cl−, Ca2+, Fe2+, FC, and TC show no significant effect [42].
PC2 (18.26% variance): Positive contributors include TDS, Sal,
,
,
, Ca2+, FC, and TC, suggesting influences from surface runoff (TDS), pit latrines (
,
, FC, TC), geological strata (Ca2+, Sal), and agrochemical fertilizers (
,
,
). Variables such as pH, EC, Temp, Col, Turb, TSS, Hd, Alk, and BOD have no significant effect [41].
PC3 (12.17% variance): Four variables—Turb, Cl−, BOD, and Fe2+—contribute positively, while TDS, Sal,
,
, and TC negatively affect this component. This suggests the influence of surface runoff (Turb), pit latrine connections (BOD), and geological strata (Cl−, Fe2+) [14].
PC4 (8.46% variance): Positive contributions from Turb, Col, and Temp suggest the impact of surface runoff (Turb, Col, Temp). Ca2+ has a negative effect, with no significant influence from other variables [15].
PC5 (5.71% variance): EC, Alk, Cl−, and
contribute positively, while Col and Mg2+ have negative effects. This suggests influences from geological strata (EC, Alk, Cl−,
) and surface runoff (Turb, Col) [42].
PC6 (5.18% variance): Positive contributors are TSS, Ca2+, and Mg2+, suggesting influences from surface runoff (TSS) and geological strata (Ca2+, Mg2+). pH negatively affects this component.
The analysis of the six principal components (PCs) highlights the complex interactions among natural and human factors influencing the water quality of WW2. The first three PCs explain the majority of the variance, underscoring the significant role of surface runoff, pit latrines, geological strata, and agro-chemical usage. Surface runoff acts as a key vector for transporting natural sediments and anthropogenic pollutants, while pit latrines contribute to microbial and nutrient contamination, especially in proximity to water sources [46] and [47]. Geological formations regulate water quality through mineral leaching and filtration, mediating the impacts of agricultural and human activities [38] and [48]. Agro-chemical application further exacerbates nutrient loading, contributing to water quality degradation. These findings align with predictive models demonstrating the long-term effects of runoff and leaching on water systems, emphasizing the need for comprehensive land-use and sanitation management [42]. Together, these components reveal the necessity of integrating natural geological characteristics with mitigation strategies to address anthropogenic impacts sustainably.
![]()
(a)
(b)
(c)
(d)
(e)
(f)
Figure 7. Factor score coefficient for different water quality parameter for WW5 for PCs.
4. Conclusion and Recommendation
The water quality of shallow wells in Half London Ward, Tunduma, Tanzania, reveals significant variability across the five wells (WW1 to WW5), with principal component analysis (PCA) demonstrating the dominant roles of surface runoff, pit latrine contamination, agrochemical inputs, and geological factors. In all wells, surface runoff, particularly after rainfall, contributes significantly to microbial contamination, with faecal and total coliforms indicating poor sanitation practices. Contamination from agricultural fertilizers, notably nitrates (
) and phosphates (
), further degrades water quality, highlighting the impact of small-scale farming on groundwater. Geological processes, including the dissolution of minerals like magnesium, calcium, and sulphates, contribute to the water’s hardness and salinity. The study reveals that both human activities (such as sanitation and agricultural practices) and natural processes (like mineral dissolution) significantly influence the water quality, underscoring the vulnerability of shallow wells to contamination in densely populated and agriculturally active regions. The study recommends on implementation of improved sanitation practices, proper use of agrochemicals, regular monitoring of Water Quality, Safe Well Construction public awareness campaigns enforcement of regulations on safe distances between wells and latrines, as well as agricultural fields.
Acknowledgements
The authors extend their heartfelt gratitude to the Management of Mbeya University of Science and Technology, particularly Prof. Aloys Mvuma, for their invaluable support and granting permission to pursue this research. We deeply appreciate their unwavering guidance and encouragement throughout the preparation of this research article.