Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Application of a combined approach including contamination indexes, geographic information system and multivariate statistical models in levels, distribution and sources study of metals in soils in Northern China

  • Kuixian Huang,

    Roles Formal analysis, Investigation, Methodology, Software, Visualization, Writing – original draft

    Affiliation Department of Environmental Science & Engineering, Fudan University, Shanghai, China

  • Xingzhang Luo,

    Roles Funding acquisition, Validation, Writing – review & editing

    Affiliation Department of Environmental Science & Engineering, Fudan University, Shanghai, China

  • Zheng Zheng

    Roles Conceptualization, Data curation, Project administration, Resources

    zzhenghj@fudan.edu.cn

    Affiliation Department of Environmental Science & Engineering, Fudan University, Shanghai, China

Abstract

The purpose of this study is to recognize the contamination characteristics of trace metals in soils and apportion their potential sources in Northern China to provide a scientific basis for basic of soil environment management and pollution control. The data set of metals for 12 elements in surface soil samples was collected. The enrichment factor and geoaccumulation index were used to identify the general geochemical characteristics of trace metals in soils. The UNMIX and positive matrix factorizations (PMF) models were comparatively applied to apportion their potential sources. Furthermore, geostatistical tools were used to study the spatial distribution of pollution characteristics and to identify the affected regions of sources that were derived from apportionment models. The soils were contaminated by Cd, Hg, Pb and Zn to varying degree. Industrial activities, agricultural activities and natural sources were identified as the potential sources determining the contents of trace metals in soils with contributions of 24.8%–24.9%, 33.3%–37.2% and 38.0%–41.8%, respectively. The slightly different results obtained from UNMIX and PMF might be caused by the estimations of uncertainty and different algorithms within the models.

Introduction

It is well known that heavy metals in soils present risks for human health due to their toxicity, persistence, and non-biodegradable natures [1]. Urban and industrial areas are generally considered a sink of trace metals from various pollution sources [2]. In China, vehicular emissions, industrial waste, and atmospheric deposition are the most important sources of heavy metals in urban topsoil, whereas wastewater irrigation and land fertilization with sludge contribute most to heavy metal contamination of topsoil in rural areas [3]. Due China’s rapid urbanization and industrialization, the establishment of industrial operations and fast urban expansion have drastically increased industrial and municipal wastewater discharges and other pollutant emissions nationwide [4]. Consequently, strengthening the prevention and control of soil pollution has become China's current key strategic task [4].

Tianjin City, located in Northern China, is an important industrial base. Since the 1980s, Tianjin has undergone rapid agricultural and industrial development, so heavy metals are more or less constantly emitted into the soil via atmospheric deposition, solid waste emissions, application of pesticides and fertilizers and wastewater irrigation from sewage and industrial effluents [5]. In particular, wastewater irrigation for agriculture has been used for nearly 50 years, to some extent, leading to the accumulation of metals in soils [6]. Studies have found a significant enrichment of trace metals in the topsoil of Tianjin [7]. Elevated soil trace metal concentrations in Tianjin have attracted attention from both the government and the public. The Work Plan of Soil Pollution Prevention and Control of Tianjin was published by the Tianjin Municipal Government in January 2017 [8]. Therefore, understanding the contamination characteristics of trace metals in soils and identifying their potential sources are important for developing appropriate pollution prevention and control regulations [9, 10]. However, few reports have focused on the source apportionment of trace metals in the soils of Tianjin.

Source apportionment of heavy metals in soil uses quantitative methods to accurately identify natural and anthropogenic sources and apportion their contribution [10]. The receptor model is a general approach for apportioning the contribution from all major sources based on the concentration profiles observed at sampling sites [10, 11]. Several receptor models, such as the chemical mass balance (CMB) model, principal component analysis/absolute principal component scores (PCA/APCS), APCS/multiple linear regression, positive matrix factorization (PMF), and UNMIX, have been proposed over the past several decades [12]. Although these models have their own characteristics and limitations, they have been widely used and have shown good applicability in source apportionment of pollutants in various environmental media, including the atmosphere, sediments, and water.

Among them, the UNMIX and PMF models can apportion potential sources without prior knowledge of source profiles, and they incorporate non-negative constraints into their calculation procedures, which makes their results more interpretable [10]. In addition, the UNMIX and PMF models have been recommended by the United States Environmental Protection Agency (U.S. EPA) as general modeling tools for source apportionment studies [10, 12]. However, the source apportionment of soil trace metals in large-scale regions is challenging because of the high spatial variability of heavy metal contents in topsoil that is caused by both heterogeneous parent materials and widespread human activities. In these cases, the spatial variability of the trace metal concentrations in soils is basic information for identifying the possible sources of contamination. Mapping based on geographical information systems (GIS) and spatial analysis is widely used to understand spatial distribution patterns and identify the likely sources of metals in surface soils. Therefore, receptor models combined with GIS may be an effective method that can improve the accuracy of source apportionment of soil trace metals.

In this study, a detailed investigation was conducted to comprehensively understand the levels and spatial distribution in soils of the Tianjin area and to identify their potential sources. Geochemical methods, including the geoaccumulation index and enrichment factor, were used to analyze the contamination characteristics of trace metals in soils. Receptor models, including the UNMIX and PMF models, were applied to identify trace metal sources and calculate source contributions. The systematic research presented here provides significant information for understanding the applications and challenges of PMF and UNMIX in the source apportionment of soil heavy metals at regional scale. In addition, the results will be useful and helpful for managers of soil environments to select and implement suitable measures for soil pollution prevention and control in Tianjin.

Materials and methods

Study area

Tianjin (N: 38°34′–40°15′; E: 116°43′–118°04′), in the lower reaches of the Haihe River System, is an important industrial metropolitan area in northern China. Major industries include metal smelting and pressing, machine manufacturing, petrochemical industry, electronic equipment manufacturing, automotive manufacturing, chemical industry, pharmaceutical manufacturing and production of electric and heat power; there are several other industrial activities [1315]. Tianjin covers a total area of 11,900 km2 and had a population of 12.99 million in 2010 [13]. This region, close to Bohai Bay, which is affected by the ocean climate and has a semi-humid monsoon climate, that is characterized by wide seasonal variation in annual rainfall of approximately 600 mm [13]. The landform of its north is higher in elevation than is that in the south [13]. The altitude ranges from 3.5 to 1052 m [13]. The main soil types are endoaquept, haplaguept and hapludalf [13].

Sample collection and analysis

A total of 171 surface soil samples (0–20 cm) were collected in May 2013, which reasonably represent soils of the entire study area. Sample collection and preparation, analytical methods and quality control were conducted according to the Technical Specification for Soil Environmental Monitoring of China [16].

Before field work, sampling coding system and sampling cell were made on a 1:250,000 digital maps in national fundamental geographic information system. The size of sampling grid (sampling density) varied with land use patterns, with an 8 km×8 km grid for cultivated land, a 16 km×16 km grid for forested land and grasslands, and a 40 km×40 km grid for unutilized land. Each total sample was a composite material taken from several sub-sample (0–20 cm) over a 50 m × 50 m patch of land. Ten to thirty sub-samples were collected at sites that did not have a not flat landscape and had heterogeneity in soils by using the S-shaped method. Five to nine sub-samples were collected at sites in cropland by using diagonal method. Five sub-samples were collected at sites with flat landscapes and homogeneous soil using the quincunx method. Samples for measuring organic compounds were collected with a pre-cleaned stainless steel scoop in a pre-cleaned brown glass bottle with a Teflon cap and were immediately transported to the laboratory and stored at -4°C until analysis.

Soil sample preparation for element analysis was performed as follows [17]. Each soil sample (10–20 mg) was digested in 1 mL of 60% (w/w) HNO3 and 1 mL of 60% (w/w) HClO4 in a stainless steel high-pressure digestion bomb at 140°C for 6 h. After completely cooling the system, the open vial was transferred to a hot plate (about 190°C) to evaporate the solution until the volume had decreased to several hundred micro-liters, then 0.5 mL of 49.5% (w/w) HF was added and the sample was evaporated again. The HF treatment was repeated several times until the silicate minerals had been completely dissolved. Finally, the residual solution was diluted to 6 mL with 1% (w/w) HNO3, filtered through a syringe filter (0.45 μm). The total concentrations of As, Hg and Se were analyzed using atomic fluorescence spectrometry (AFS); the total concentrations of the other elements were determined using inductively coupled plasma mass spectrometry (ICP-MS). The detection limits were 0.2, 0.1, 0.7, 5.0, 0.5, 0.005, 5.0, 0.5, 0.5, 5, 1.0 and 5.0 mg kg-1, for As, Cd, Co, Cr, Cu, Hg, Mn, Ni, Pb, V, Se and Zn, respectively. Quality assurance and quality control procedures were performed in conjunction with laboratory analyses by analyzing standard reference materials GSS-1, GSS-2, GSS-3, and GSS-4 soil (National Research Center for Geoanalysis of China). The results showed that the precision and bias of the analysis were generally below 5%. Recoveries of samples spiked with standards ranged from 95 to 105%.

Geochemical methods

Enrichment factor.

An enrichment factor (EF) is a commonly used geochemical criterion to evaluate the degree of heavy metal pollution [18]. It is defined as follows: (1) where (Cn/Cref)sample is the concentration ratio of the examined metal and the reference element in soil samples, and (Bn/Bref)background is the natural background value of the examined metal to the reference element ratio. The elements that are most often employed as reference elements include several conservative elements, such as Sc, Mn, Al, and Fe. [19]. In this study, Mn was used as the reference element. The background values of soil trace metals in the Tianjin area (CNEMC 1990) were used for calculating the EF values. The five-category system proposed by Sutherland [20] was used to classify samples: no or minimal enrichment (EF<2), moderate enrichment (2≤EF<5), significant enrichment (5≤EF<20), very high enrichment (20≤EF<40), and extremely high enrichment (EF≥40).

Geoaccumulation index.

The geoaccumulation index (Igeo), was first proposed by Müller [21] to assess the pollution levels of bottom sediments, and has been widely applied to the assessment of contamination by trace metals in soil. It is calculated as follows: (2) where Cn is the concentration of the examined trace element (n) in the soil samples, and Bn is the background value for the corresponding trace element. A constant of 1.5 is used to compensate for possible variations in the background data because of lithological variations as well as very small anthropogenic influences [19]. Igeo was calculated using the background values in order to compare the calculation results of EF and Igeo. The Igeo consists of 7 classes [18]: Class 0 (Igeo≤0), uncontaminated; Class 1 (0<Igeo≤1), uncontaminated to moderately contaminated; Class 2 (1<Igeo≤2), moderately contaminated; Class 3 (2<Igeo≤3), moderately to heavily contaminated; Class 4 (3<Igeo≤4), heavily contaminated; Class 5 (4<Igeo≤5), heavily to very heavily contaminated; Class 6 (Igeo≥5), very heavily contaminated.

Source apportionment models

Both UNMIX and PMF are advanced multivariate receptor models that are based on factor analysis, and constrain all of the elements in the factorized matrices to nonnegative. They determine the number of sources as well as their chemical compositions and contributions without the source profile data. In the models, the sample data matrix is decomposed into two matrices (factor contributions and factor profiles), with a residual matrix [5, 22]. The matrix equation can be expressed as: (3) where xij is the jth measured element concentration in the ith sample, p is the number of factors that contribute to the samples, gik is the relevant contribution of factor k to the ith sample, fkj is the concentration of element j in factor k profile, and eij is the residual error matrix. Details of the two models are described in the following sub-sections.

UNMIX.

The UNMIX program includes two key algorithms, that are linked by the matrix operation of Singular Value Decomposition (SVD). Given a data matrix with n samples and m species, the first step is to perform the SVD to reduce the dimensionality of the data space from m to p, the number of sources. The number of sources (p) is estimated by the NUMFACE algorithm based on signal-to-noise considerations [23]. The next algorithm is to find data "edges" (more generally, hyperplanes) in a (p−1)-dimensional space, which are then used to determine the compositions and contributions of the p sources [24]. It should be noted that UNMIX retains only the species that contribute to the improvement of the model's signal-to-noise ratio. In this work, U.S. EPA UNMIX version 6.0 was used.

PMF.

Compared to UNMIX, the PMF model considers the uncertainty of each data point individually and is built on a completely different algorithm [25, 26]. The core of the PMF algorithm is to minimize the object function Q in the following way [27]: (4) where μij is the uncertainty of the jth species concentration for the ith sample, n is the number of samples, and m is the number of species.

In this study, U.S. EPA PMF version 5.0 was employed based on the multilinear engine-2 (ME-2) algorithm. ME-2 solves the PMF equation iteratively using the conjugate gradient algorithm, minimizing the Q function [28]. The model was used in robust mode to avoid the impact of outliers on the PMF model results. Additionally, the application of the PMF model depends on the uncertainty of each data point to reduce the influence of noise in the environment data. In this study, for data above the method detection limit (MDL), the uncertainty matrix (μij) was calculated to be 10% of xij plus MDL divided by three (μij = 0.1 × xij + MDL/3). If xij is below or equal to MDL, xij values below MDL were replaced with MDL/2, and the uncertainty matrix is estimated to be 20% of xij plus MDL divided by three (μij = 0.2 × xij + MDL/3) [29].

Geostatistical tools

Kriging and inverse distance weighting (IDW) are the two most commonly used geostatistical interpolation methods for characterizing spatial patterns of soil contamination. Kriging is a linear interpolation technique that exhibits the best linear unbiased estimates for spatial variables [30]. However, the relatively easy method of IDW is more appropriate for the purposes of this study of assessing the spatial variations of soil trace metals contamination based on EF and Igeo values and for identifying the affected regions of pollution sources obtained from UNMIX and PMF models. It uses a specific number of nearest points, which are then weighted according to their distance from the point being interpolated [31]. The IDW interpolation maps were produced using ArcGIS 10.2 software.

Results and discussion

Contamination characteristics

The soil pH values varied from 5.9 to 9.0 with a mean value of 8.24. Because 93.6% of soil pH levels were above 7.5, this indicated that most soil of this region is alkaline soil. The basic statistics for the concentrations of measured trace metals in surface soils from the Tianjin area are listed in Table 1. From Table 1, it apparent that only that the median and mean concentrations of only Cd, Pb and Zn were higher than their corresponding average background values (ABVs). Approximately 76.6% of Cd samples, 74.3% of Pb samples and 54.4% of Zn samples exceed their corresponding ABVs. In the Chinese soil quality guidelines [32], Grade I represents the average natural levels for uncontaminated soil and Grade II indicates that intervention should occur to protect human health [33]. Compared with the Chinese soil quality guidelines [32], 24.6%, 15.8%, 13.5%, 15.2%, 32.8% of the Cd, Cu, Hg, Ni and Zn concentrations exceed their corresponding Grade I values. Additionally, the highest concentrations of these five trace metals even exceeded their Grade II values. It was also found that the mean concentrations of most trace metals, except Co and V, were lower than the target values that are recommended by in the Netherlands soil contamination guidelines [34].

thumbnail
Table 1. Summary statistics of trace metal concentrations in soil samples and some reference values (mg kg-1).

https://doi.org/10.1371/journal.pone.0190906.t001

To evaluate the extent of trace metals contamination and anthropogenic inputs, EF and Igeo values of different elements were calculated separately for all sampling sites (Fig 1; Table 2; S1 Table). EF values showed that Cd was the most serious metal contaminant in soils, with 47.9% of samples classified above the moderate contamination level (Fig 1(A); Table 2(A)). Both Pb and Zn had the same average EF of 1.38, with 16.2% of samples for Pb and 22.0% of samples for Zn with EF >2. Although the mean EF value for Hg was below zero, 17.1% of EF values for Hg were in the range of 2–5 and 2.6% of EF values were in the range of 5–20, that influences Hg was also an important pollutant influencing soil quality in Tianjin. At least 93% of EF values of the other trace metals showed no or minimal contamination, with only a few individual sampling sites with EF values >2.

thumbnail
Fig 1. Boxplots of enrichment fator (EF) and geoaccumulation index (Igeo) for trace metals in soil samples.

https://doi.org/10.1371/journal.pone.0190906.g001

thumbnail
Table 2. Percentages of class distribution for pollution assessment of trace metals in soil samples using enrichment factor index (a) and geoaccumulation index (b) (n = 171; %).

https://doi.org/10.1371/journal.pone.0190906.t002

The mean Igeo values for these trace metals, except for Cd (mean Igeo = 0.19), were lower than 0, indicating no soil contamination. The Igeo values of Cd varied the most, ranging from uncontaminated to heavily contaminated. The proportions of these elements in soil samples with Igeo > 0 decreased in the following order: Cd (58.5%) > Zn (25.1%) > Pb (22.8%) > Hg (14.6%) > As (10.5%) > Cu (8.2%) > Co (5.3%) > Ni (4.7%) > V (4.1%) >Se (3.0%) > Cr (1.8%) (Fig 2(B), Table 2(B)). The maps of EF and Igeo for these elements exhibited very similar spatial distributions, confirming the interpolation results (S2 Fig). The classification of samples by EF was more severe than that by Igeo, which was consistent with the report by Chabukdhara and Nema [35]. Overall, Cd, Hg, Pb and Zn showed the relatively high contamination levels. In particular, Cd was the most significant pollutant in Tianjin soils.

thumbnail
Fig 2. Concentration (mg/kg) and corresponding percentage (%) of the species in the different factors obtained by Unmix (Left) and PMF (Right).

https://doi.org/10.1371/journal.pone.0190906.g002

According to Zhang and Liu [36], EF values >1.5 suggeste that the metal was derived mainly from anthropogenic sources. From S1 Fig, the estimated maps of the spatial distribution of trace metal EF values showed that the soils in different districts of Tianjin were affected by human activities to varying degrees. Cd contamination was observed over a wide area, which might be associated with agricultural activities, such as fertilizer use and sewage irrigation. Although Pb and Zn showed similar spatial distributions, the hot-spot areas for Zn were wider than those for Pb, which mainly distributed in the suburban and the northern areas of Tianjin. The high EF values of Hg mainly occurred surrounding these districts that cross the Dagu Ditch and the Ji district in the north of Tianjin, demonstrating obvious regional characteristics. The Dagu Wastewater Drainage River, located in the south of Haihe River, is an important urban drainage channel that has received municipal sewage and industrial waste water for a long period of time [37]. The main industries that types draining waste into this channel include electroplated metal products industry, chlor-alkali industry, paper manufacturing industry, the leather industry, and textile industry [38]. Previous studies have noted that Hg in agricultural soils near the Dagu Ditch was one of the most serious pollutants because the Dagu wastewater is used for irrigation [39]. The high EF values for As, Co, Cr, Cu, Ni, Se and V were distributed only sporadically in the study area, indicating that these seven elements were less influenced by anthropogenic emissions.

Comparison of source profiles

To apportion the sources of soil trace metals in the Tianjin area, both the UNMIX and PMF models were comparatively applied. First, the optimum number of sources needs to be determined when running these two models. Generally, we can use the Min. R2 value and Min S/N value as the diagnostic basis for the UNMIX model. Here, Min. R2 means the minimal explained variance of each species under the specified number of sources, not the overall r-squared of the fit [40]. Min S/N refers to the smallest estimated signal-to-noise ratio of any of the factors included in the model [40]. A value of Min. R2 > 0.80 and a value of Min. S/N > 2.0 usually indicate that the UNMIX model fits well [40]. Table 3 shows the Min. R2 and Min. S/N values under different source numbers obtained from the UNMIX model in this study. Obviously, three-factor solution was the optimal choice for UNMIX with the Min. R2 value of 0.85 and Min. S/N value of 2.1 (S2 Table). Unlike UNMIX, no strict rules are set to find the optimum number of sources for the PMF model. The best value for the number of sources (p) is determined by repeating the analysis with different p values [40]. An optimal PMF solution has obtained that had a robust Q value near the theoretical Q value. The theoretical Q value (Qt) was 1503, which was approximated using the equation Qtm × np × (m + n), where m is the number of species and n is the number of samples [40]. Consequently, the three-factor solution was chosen. Source profiles for the two models are presented in Fig 2 and details of source composition are listed in Tables 4 and 5. Qualitatively, it apparent that the source profiles identified by the two models showed a high degree of similarity in terms of concentration and percentage (S3 Table; S4 Table).

thumbnail
Table 3. Min. R2 and Min. S/N values for different source numbers obtained from UNMIX model.

https://doi.org/10.1371/journal.pone.0190906.t003

To further evaluate how well the two methods fit the data, the performance parameters of explained variance (r2) and correlation coefficients for the UNMIX factor profiles and corresponding PMF factor profiles were calculated (Table 6). The results showed that the three-factor solution explained 92.4% and 91.5% of the total variances for UNMIX and PMF, respectively. The minor difference in the r2 values might be attributed to the processing methods for noisy analytical data in UNMIX and PMF. Measurement errors were assumed to be identical in UNMIX, whereas PMF considered the noise structure of the analytical data to be heterocedastic. In most of the cases of source apportionment with noisy environmental data, the assumption regarding error structure in UNMIX is unrealistic. In comparison, the uncertainty estimate in PMF plays an important role because it allows each data point to be individually weighed. In this study, most of trace metals in soil samples had relatively high coefficients of variation, indicating a relatively high noise variation. Thus, the higher r2 values may not indicate that UNMIX explained the larger amount of meaningful information than PMF. This might incorporate noise in the UNMIX solution. In general, the resolved solutions by these two models provided sufficient information to represent the majority of the entire dataset. From Table 6, the correlation coefficients between composition (gk) / contribution profiles (fk) from UNMIX and the corresponding profiles from PMF were high in all cases, ranging from 0.848 to 0.999. In particular, the correlation coefficients of composition profiles for each factor obtained from the two models were higher than 0.98. Generally, uncertainty estimation of the analytical data would affect the results of source apportionment, because it reflects the quality and reliability of each measured data point. Different algorithms for the receptor models would also usually produce incompletely consistent factor score matrix and factor loading matrix in the matrix decomposition process. Therefore, the difference of source profiles from UNMIX and PMF might be caused by the uncertainty parameters of the models and their different algorithms. Both models showed good correlation for the same identified factor profiles and produced good results, with minor differences between them, which mutually verified the reliability of the source apportionment.

thumbnail
Table 6. Contribution and fits for different factors obtained by UNMIX and PMF.

https://doi.org/10.1371/journal.pone.0190906.t006

Source identification

Because similar source profiles for the trace metals in soils of Tianjin were derived from both UNMIX and PMF models (Fig 2), only the PMF solution is further discussed and interpreted here. The spatial distributions of EF and Igeo were provided as a useful aid to identify hot-spot areas with high levels of contamination and to infer the possible sources of trace metals in soils at the regional scale. Estimated maps of EF and Igeo in the whole area of Tianjin are presented in S1 Fig and S2 Fig, respectively. A scatter diagram of enrichment factor (EF) for soil trace metals of study area is presented in S3 Fig. The sampling points were plotted in different quadrants indicating that the trace metals were controlled by different sources to varying degree.

The first factor (F1) was characterized by high loading of Hg. As mentioned above, the spatial distribution maps of EF and Igeo for Hg indicated that high values were mainly concentrated near the Dagu Wastewater Drainage River. Mercury-containing wastewater generated in industrial activities is discharged into this river for agricultural irrigation, causing an accumulation of Hg in the surrounding soil [39]. Additionally, it has been reported that HgCl has been used as a catalyst for producing polyvinyl chloride by the calcium carbide method in a Dagu chemical plant in recent decades, which has led to increasing Hg emission to the environment [12]. The accumulation of Hg was also reported to be associated with coal combustion in industrial activities [41, 42]. Thus, the factor might represent an anthropogenic source of industry activities from waste emission and coal combustion.

The second factor (F2) was predominantly characterized by high loading of Cd, followed by As, Zn, Co, Cr, Cu and Ni. As analyzed in the previous sections, the calculated results of EF and Igeo values revealed that a wide range of agricultural soils in Tianjin were highly contaminated with Cd. The application of chemical fertilizers was usually regarded as an important source of Cd in surface soils [43, 44]. Cadmium can be present in relatively large amounts in phosphate fertilizers ranging from near 0 to more than 150 mg/kg, depending on the provenance of the phosphate rock [45]. Arsenic can be easily emitted into the environment by the application of compound fertilizer and pesticides containing As [46]. Extensively application of fungicides for crops and vegetables is one of the main soil zinc sources in China [46]. In addition to the impact of pesticides and fertilizers, the use of sewage irrigation and sewage sludge is also an important source of trace metals entering agricultural soils [47]. Wastewater from drainage rivers, mainly receiving municipal sewage and industrial wastewater, has been used for irrigation in agricultural areas in Tianjin for decades [48]. Previous studies had shown that heavy metal pollution existed in the wastewater-irrigated area of Tianjin, especially for Cd accumulation [910, 39]. From S1 Fig, we identified several hotspots of high Cd and Zn concentration across the study area. Compared to spatial distribution of Cd and Zn, the hotspots of Hg were only distributed near sewage irrigation area. We can distinguish the difference between Cd (or Zn) and Hg in agricultural region far away from the sewage irrigation area. Cumulatively, the sources of Cd and Zn in soil were main originated from agricultural sources, even though the impact of industrial activity on soil Cd and Zn in sewage irrigation area. Together, this factor might indicate an anthropogenic source associated with agricultural activities (e.g., the application of pesticides and fertilizers).

The third factor (F3) was characterized by high loading of V and Mn and slight loading of Co, Cr, Cu, Ni, Pb, Se and Zn. The mean concentrations of V, Mn, Co, Cr, Cu, Ni and Se in soils were below their corresponding background concentrations in Tianjin (Table 1). Approximately 94.7% of Igeo values for Co, 98.3% of Igeo values for Cr, 91.8% of Igeo values for Cu, 95.3% of Igeo values for Ni, 97.1% of Igeo values for Se and 95.9% of Igeo values for V were less than zero (Table 2(B)), indicating that these elements were probably predominantly controlled by natural sources. Previous studies indicated that soil Co, Cr, Mn, Ni and V were highly dependent on soil parent materials [4951]. Thus, these elements were mainly derived from natural sources. It is worth noting that, the contributions of Co, Cr, Cu and Ni in F2 were similar to their F3. One explanation is that concentrations were a result of varying inputs due to spatial differences in influence factors. For instance, some samples contaminated by agricultural activities were identified as agricultural sources. While, some samples with higher concentrations for Co, Cr, Cu and Ni were controlled by the parent material. From the study area as a whole, Co, Cr, Cu and Ni in soil across the study area originated from both natural and agricultural sources.

To test the potential mixing relationships among mixing endmembers. Correlation analysis were done by using commercial statistics software package SPSS for Windows (SPSS Inc. Quarry Bay, HK). Tables 7 and 8 presents pearson correlation coefficients and their significance levels. The natural sources exhibited significantly negative correlations (p<0.01) with industrial sources in PMF (r = -0.243**) and UNMIX (r = -0.517**). The natural sources also exhibited significantly negative correlations (p<0.01) with agriculture sources in PMF (r = -0.410**) and UNMIX (r = -0.518**). These results indicate the natural source was much different from anthropogenic sources (agricultural sources and/or industrial sources). This conclusion confirmed the third factor (F3) was mainly derived from natural source, whereas the first factor (F1) and second factor (F2) were derived from anthropogenic sources.

thumbnail
Table 7. Pearson correlation coefficients of different sources obtained from UNMIX model.

https://doi.org/10.1371/journal.pone.0190906.t007

thumbnail
Table 8. Pearson correlation coefficients of different sources obtained from PMF model.

https://doi.org/10.1371/journal.pone.0190906.t008

Source contributions

The percent contribution from each source was calculated based on the average of the individual percent contributions and the results are presented in Table 6. The first common source was mainly generated by coal combustion and waste emission in industrial activities, with relative contributions of 24.9% for UNMIX and 24.8% for PMF. The secondary source (agricultural activities) identified by UNMIX and PMF models, which was mainly associated with the application of pesticides and fertilizers and sewage irrigation, accounted for 33.3% and 37.2%, respectively, of the total contributions. Natural sources were responsible for the most significant contribution extracted by UNMIX (41.8%) and PMF (38.0%) models. Both models provided virtually the same contributions and similar spatial distributions for all the explained sources. In addition, both UNMIX and PMF models have their special features with different algorithms and parameter settings. Therefore, it is essential to use multiple methods to estimate potential sources and contributions to confirm the validity of source apportionment.

The spatial variation of different source contributions obtained from UNMIX and PMF results was shown in Fig 3, clearly showing the respective regions that were affected by the three sources. The spatial distribution maps of F1 that were identified by the two models showed almost identical results. The main area under the influence of this factor was in the region with high concentrations of Hg. The affected regions of F2 and F3 identified by the two models showed similar results with minor differences. High values of F2 were concentrated in the urban and suburban areas with dense population and frequent agricultural activities. The distributions of F3 demonstrated that most of the high values were located in the outer suburbs and coastal areas less affected by human activities. The apportionment results suggested that the impact of human activities on trace metals in soils of the Tianjin area cannot be ignored because of the rapid development over past decades.

thumbnail
Fig 3. Spatial distribution of factor scores calculated from UNMIX (Upper) and PMF (Below) across study area.

https://doi.org/10.1371/journal.pone.0190906.g003

Conclusions

In this study, EF, Igeo, PMF, and UNMIX, combined with a GIS tool, were used to identify contamination characteristics and provide a quantitative source apportionment of soil trace metals in Tianjin. The results showed that Cd was the most significant pollutant that covered a wide range of contaminated areas. Low levels of pollution by Hg, Pb and Zn were distributed in certain regions of Tianjin. The other measured metals (As, Co, Cr, Cu, Mn, Ni, Se and V) showed no or minimal contamination and were sporadically distributed across the study area. Industrial activities, agricultural activities and natural sources were apportioned as potential sources, with relative contributions of 24.8–24.9%, 33.3–37.2% and 38.0–41.8% for soil trace metals, respectively. Spatially, the regions under the influence of industrial activities and agricultural activities mainly occurred in the urban and suburban areas of Tianjin, whereas the outer suburbs and coastal areas were less affected by human activities. In addition, both the UNMIX and PMF models produced comparable results with good agreement between respective composition and contribution profiles among the three factors, mutually confirming the source apportionment. Therefore, the results of this study demonstrate that the combination of the contamination index, geographic information systems, and multivariate receptor models, which account for spatial variability, can be a good approach to use for the source apportionment of soil trace metals on a regional scale.

The application of these models in soil trace metals has not been well studied to date, and it can be difficult to test because of the complexities of soil geochemistry [51, 52]. Recently, some researchers have successfully used PMF and UNMIX to trace the source of heavy metals in soils [5359]. Most of their study areas were of limited size, having similar geology and rainfall, thereby providing a relatively homogenous site for the application of these models. Chen at al. performed a source apportionment for surface soils of the Beijing metropolitan area, which was an exception [60]. The receptor model assumes, in theory, that the concentration of a chemical element is the linear summation of the contributors, with no reaction between the emissions from different sources and any formation or elimination of the substance during transmission. The modeling errors were caused by using a model in which the true physical-chemical phenomena were simplified. The instability of a receptor model due to nearly collinear sources is often worsened by a large number of unknown sources. When a study area becomes larger, the variability of the values increases, and the pollutants from unknown sources increase [61]. The similarity of these source profiles usually affects the results of the receptor models. If such data are apportioned by the PMF model or the UNMIX model, one factor extracted may contain these nearly collinear sources. There have been concerns that using PMF and UNMIX alone for source apportionment cannot reliably identify and apportion pollution sources, as it ignores the geochemical relevance such as pollutants from the sources that usually do not migrate to the entire area.

Increasing the number of fitting species and samples would enhance the accuracy of the model outcome. However, doing so is very costly and often prohibited by the available resources. Thus, simplifying the assumption of sources and contamination pathways in soil is inevitable but provides a potential application of the receptor model in source apportionment. In view of this study of source apportionment for soil trace metals in Tianjin, the results still show relatively large error. Therefore, the application of multivariate receptor modeling to apportioning sources of soil heavy metals at a regional scale remains a challenge.

Supporting information

S1 Fig. Spatial distribution maps of enrichment factor (EF) for soil trace metals of study area.

https://doi.org/10.1371/journal.pone.0190906.s001

(TIF)

S2 Fig. Spatial distribution maps of geoaccumulation index (Igeo) for soil trace metals of study area.

https://doi.org/10.1371/journal.pone.0190906.s002

(TIF)

S3 Fig. Scatter diagram of enrichment factor (EF) for soil trace metals of study area.

https://doi.org/10.1371/journal.pone.0190906.s003

(TIF)

S1 Table.

Percentages of class distribution for pollution assessment of trace metals in soil samples using enrichment factor index (a) and geoaccumulation index (b).

https://doi.org/10.1371/journal.pone.0190906.s004

(DOCX)

S2 Table. Min. R2 and Min. S/N values for different source numbers obtained from UNMIX model.

https://doi.org/10.1371/journal.pone.0190906.s005

(DOCX)

S3 Table. Source composition (mg kg-1) from UNMIX model.

https://doi.org/10.1371/journal.pone.0190906.s006

(DOCX)

S4 Table. Source composition (mg kg-1) from PMF model.

https://doi.org/10.1371/journal.pone.0190906.s007

(DOCX)

S5 Table. Details of input data from UNMIX and PMF models.

https://doi.org/10.1371/journal.pone.0190906.s008

(DOCX)

Acknowledgments

We would like to acknowledge the contributions of the advisory board of the study (Chenbo Li, Liumei Qin, Yanna Xiong, and Zhi Li) and our colleagues at Department of Environmental Science & Engineering, Fudan University.

References

  1. 1. Maas S, Scheifler R, Benslama M, Crini N, Lucot E, Brahmia Z, et al. Spatial distribution of heavy metal concentrations in urban, suburban and agricultural soils in a Mediterranean city of Algeria. Environmental pollution, 2010, 158(6): 2294–2301. pmid:20307922
  2. 2. Yu S, Zhu Y, Li X. Trace metal contamination in urban soils of China. Science of the total environment, 2012, 421: 17–30. pmid:21575982
  3. 3. MEP and MLR (Ministry of Environmental Protection and Ministry of Land and Resource of the People’s Republic of China), 2014. The Bulletin of Nationwide Soil Pollution Status Survey (April, 2014. Index No. 000014672/2014-00351) (in Chinese).
  4. 4. SCC (the State Council of the People’s Republic of China), 2016. The National Soil Pollution Prevention and Treatment Action Plan of China (May, 2016. Index No. 000014349/2016-00104) (in Chinese).
  5. 5. Zhao L, Xu Y, Hou H, Shangguan Y, Li F. Source identification and health risk assessment of metals in urban soils around the Tanggu chemical industrial district, Tianjin, China. Science of the Total Environment, 2014, 468: 654–662. pmid:24061056
  6. 6. Shi RG, Lv JG, Cai YM, Liu Y, Wang Z, Feng J, et al. Levels, spatial distribution and possible sources of heavy metals contamination of suburban soils in Tianjin, China. Bulletin of Environmental Contamination and Toxicology, 2010, 85: 287–290. pmid:20640845
  7. 7. Meng W, Wang Z, Hu B, Wang Z, Li H, Goodman RC. Heavy metals in soil and plants after long-term sewage irrigation at Tianjin China: A case study assessment. Agricultural Water Management, 2016, 171: 153–161.
  8. 8. Tianjin environmental protection bureau, The Work Plan of Soil Pollution Prevention and Control of Tianjin, 2016.
  9. 9. Qiao M, Cai C, Huang YZ, Liu Y, Lin A, Zheng Y. Characterization of soil heavy metal contamination and potential health risk in metropolitan region of northern China. Environmental Monitoring and Assessment, 2011, 172: 353–365. pmid:20135216
  10. 10. Taiwo AM, Harrison RM, Shi Z. A review of receptor modelling of industrially emitted particulate matter. Atmospheric Environment, 2014, 97: 109–120.
  11. 11. Wang XL, Sato T, Xing BS, Tao S. Health risks of heavy metals to the general public in Tianjin, China via consumption of vegetables and fish. Science of the Total Environment, 2005, 350: 28–37. pmid:16227070
  12. 12. Zhao L, Xu YF, Hou H, Shangguan Y, Li F. Source indentification and health risk assessment of metals in urban soils around the Tanggu chemical industrial district, Tianjin, China. Science of the Total Environment, 2014, 469: 654–662.
  13. 13. TMBS (Tianjin Municipal Bureau of Statistics), 2014. Tianjin Statistical Yearbook. (in Chinese).
  14. 14. Zuo Q, Duan YH, Yang Y, Wang XJ, Tao S. Source apportionment of polycyclic aromatic hydrocarbons in surface soil in Tianjin, China, Environmental Pollution, 2007, 147 (2): 303–310 pmid:16828945
  15. 15. Tao S, Cui YH, Xu FL, Li BG, Cao J, Liu WX, et al, Polycyclic aromatic hydrocarbons (PAHs) in agricultural soil and vegetables from Tianjin, Science of The Total Environment, 2004, 320 (1): 11–24. pmid:14987923
  16. 16. SEPAC (State Environmental Protection Agency of China), 2004. The Technical Specification for Soil Environmental Monitoring (HJ/T 166–2004). (in Chinese).
  17. 17. Chen SC, Liao CM. Health risk assessment on human exposed to environmental polycyclic aromatic hydrocarbons pollution sources. Science of the Total Environment, 2006, 366(1): 112–123 pmid:16307791
  18. 18. Loska K, Wiechula D, Barska B, Cebula E, Chojnecka A. Assessment of arsenic enrichment of cultivated soils in Southern Poland. Polish Journal of Environmental Studies, 2003, 12: 187–192.
  19. 19. Loska K, Cebula J, Pelczar J, Wiechuła D, Kwapuliński J. Use of enrichment, and contamination factors together with geoaccumulation indexes to evaluate the content of Cd, Cu, and Ni in the Bybnik water reservoir in Poland. Water, Air, & Soil Pollution, 1997, 93: 347–365.
  20. 20. Sutherland RA. Bed sediment-associated trace metals in an urban stream, Oahu, Hawaii. Environmental Geology, 2000, 39: 611–627.
  21. 21. Müller G. Index of geoaccumulation in sediments of the Rhine River. Geological Journal, 1969, 2: 108–118.
  22. 22. Zhang Y, Guo CS, Xu J, Tian Y Z, Shi G L, Feng YC. Potential source contributions and risk assessment of PAHs in sediments from Taihu Lake, China: comparison of three receptor models. Water Research, 2012, 46: 3065–3073. pmid:22459329
  23. 23. Henry RC, Park ES, Spiegelman CH. Comparing a new algorithm with the classic methods for estimating the number of factors. Chemometrics and Intelligent Laboratory Systems, 1999, 48: 91–97.
  24. 24. Henry RC. Multivariate receptor modeling by N-dimensional edge detection. Chemometrics and Intelligent Laboratory Systems, 2003, 65: 179–189.
  25. 25. Chueinta W, Hopke PK, Paatero P. Investigation of sources of atmospheric aerosol at urban and suburban residential areas in Thailand by positive matrix factorization. Atmospheric Environment, 2000, 34: 3319–3329.
  26. 26. Zheng TH, Ran Y, Chen LG. Polycyclic aromatic hydrocarbons (PAHs) in rural soils of Dongjiang River Basin: occurrence, source apportionment, and potential human health risk. Journal of Soils and Sediments, 2014, 14: 110–120.
  27. 27. Paatero P, Tapper U. Positive matrix factorization: A non-negative factor model with optimal utilization of error estimates of data values. Environmatrics, 1994, 5: 111–126.
  28. 28. Pekey H, Karakaş D, Bakoglu M. Source apportionment of trace metals in surface waters of a polluted stream using multivariate statistical analyses. Marine Pollution Bulletin, 2004, 49(9): 809–818.
  29. 29. Reff A, Eberly SI, Bhave PV. Receptor modeling of ambient particulate matter data using positive matrix factorization: review of existing methods. Journal of the Air & Waste Management Association, 2007, 57: 146–154.
  30. 30. Franco C, Soares A, Delgado J. Geostatistical modelling of heavy metal contamination in the topsoil of Guadiamar river margins (S Spain) using a stochastic simulation technique. Geoderma, 2006, 136: 852–864.
  31. 31. Carr R, Zhang CS, Moles N, Harder M. Identification and mapping of heavy metal pollution in soils of a sports ground in Galway City, Ireland, using a portable XRF analyser and GIS. Environ Geochem Health, 2008, 30: 45–52. pmid:17610027
  32. 32. CEPA (Chinese Environmental Protection Administration), 1995. Environmental Quality Standard for Soils (GB15618-1995). CMEP, Beijing. (In Chinese).
  33. 33. CNEMC (China National Environmental Monitoring Center), 1990. The background values of elements in Chinese soils (in Chinese). Environmental Science Press of China, Beijing.
  34. 34. VROM (2000). Circular on target values and intervention values for soil remediation Annex A target values, soil remediation intervention values and indicative levels for serious contamination. Dutch Ministry of Housing: Spatial Planning and Environment.
  35. 35. Chabukdhara M, Nema AK. Assessment of heavy metal contamination in Hindon River sediments: A chemometric and geochemical approach. Chemosphere, 2012, 87: 945–953. pmid:22406241
  36. 36. Zhang J, Liu CL. Riverine composition and estuarine geochemistry of particulate metals in China-weathering features, anthropogenic impact and chemical fluxes. Estuarine Coastal and Shelf Science, 2000, 54: 1051–1070.
  37. 37. Wu C, Zhang ZJ, Fei YH, Wu GQ, Qian Y, Meng SH. Characteristics of mercury form in soil-rice system and food security assessment in wastewater-irrigated paddy fields of Tianjin. Transactions of the Chinese Society of Agricultural Engineering, 2016, 16(18): 207–212. (in Chinese).
  38. 38. Xu FX, Guo P, 2009. Study on industrial distribution and structure adjustment plan of Dagu basin in Tianjin. New Rural Construction and Environmental Protection–Outstanding papers of the 16th academic annual meeting of China Environmental Science Society. (in Chinese).
  39. 39. Wu GH, Cao SS. Mercury and cadmium contamination of irrigation water, sediment, soil and shallow groundwater in a wastewater-irrigated field in Tianjin, China. Bulletin of Environmental Contamination and Toxicology, 2010, 84: 336–341. pmid:20111947
  40. 40. Byrne B M. Factor analytic models: Viewing the structure of an assessment instrument from three perspectives. Journal of personality assessment, 2005, 85(1): 17–32. pmid:16083381
  41. 41. Streets DG, Hao JM, Wu Y, Jiang J, Chan M, Tian H, et al. Anthropogenic mercury emissions in China. Atmospheric Environment, 2005, 39: 7789–7806.
  42. 42. Zhang CB, Wu LH, Luo YM, Zhang H, Christie P. Identifying sources of soil inorganic pollutants on a regional scale using a multivariate statistical approach: Role of pollutant migration and soil physicochemical properties. Environmental Pollution, 2008, 151: 470–476. pmid:17604890
  43. 43. Li XD, Poon CS, Liu PS. Heavy metal contamination of urban soils and street dusts in Hong Kong. Applied Geochemistry, 2001, 16: 1361–1368.
  44. 44. Nziguheba G, Smolders E. Inputs of trace elements in agricultural soils via phosphate fertilizers in European countries. Science of the Total Environment, 2008, 390: 53–57. pmid:18028985
  45. 45. Lambert R, Grant C, Sauve S. Cadmium and zinc in soil solution extracts following the application of phosphate fertilizers. Science of the Total Environment, 2007, 378: 293–305. pmid:17400282
  46. 46. Luo L, Ma YB, Zhang SZ, Wei D, Zhu YG. An inventory of trace element inputs to agricultural soils in China. Journal of Environmental Management, 2009, 90: 2524–2530. pmid:19246150
  47. 47. Micó C, Recatalá L, Peris M, Sánchez J. Assessing heavy metal sources in agricultural soils of an European Mediterranean area by multivariate analysis. Chemosphere, 2006, 65: 863–872. pmid:16635506
  48. 48. Wang YC, Qiao M, Liu YX, Zhu Y. Health risk assessment of heavy metals in soils and vegetables from wastewater irrigated area, Beijing-Tianjin city cluster, China. Journal of Environmental Sciences, 2012, 24: 690–698.
  49. 49. Davidson CM, Urquhart GJ, Ajmone-Marsan F, et al. Fractionation of potentially toxic elements in urban soils from five European cities by means of a harmonised sequential extraction procedure. Analytica Chimica Acta, 2006, 565: 63–72.
  50. 50. Facchinelli A, Sacchi E, Mallen L. Multivariate statistical and GIS-based approach to identify heavy metal sources in soils. Environmental Pollution, 2001, 114: 313–324. pmid:11584630
  51. 51. Thurston GD, Spengler JD. A quantitative assessment of source contributions to inhalable particulate matter pollution in metropolitan Boston. Atmospheric Environment, 1985, 19: 9–25.
  52. 52. Lee E, Chan CK, Paatero P. Application of positive matrix factorization in source apportionment of particulate pollutants in Hong Kong. Atmospheric Environment, 1999, 33: 3201–3212.
  53. 53. Ramadan Z, Eickhout B, Song XH, Buydens LMC, Hopke PK. Comparison of positive matrix factorization and multilinear engine for the source apportionment of particulate pollutants. Chemometrics and Intelligent Laboratory Systems, 2003, 66:15–28.
  54. 54. Kim M, Deshpande SR, Crist KC. Source apportionment of fine particulate matter (PM2.5) at a rural Ohio River Valley site. Atmospheric Environment, 2007, 41:9231–9243.
  55. 55. Soonthornnonda P, Christensen ER. Source apportionment of pollutants and flows of combined sewer wastewater. Water Research, 2008, 42: 1989–1998. pmid:18164048
  56. 56. Vaccaro S, Sobiecka E, Contini S, Locoro G, Free G, Gawlik BM. The application of positive matrix factorization in the analysis, characterisation and detection of contaminated soils. Chemosphere, 2007, 69: 1055–1063. pmid:17544480
  57. 57. Xue JL, Zhi YY, Yang LP, Shi JC, Zeng LZ, Wu LS. Positive matrix factorization as source apportionment of soil lead and cadmium around a battery plant (Changxing County, China). Environmental Science and Pollution Research, 2014, 21: 7698–7707. pmid:24622990
  58. 58. Parra S, Bravo MA, Quiroz W, Moreno T, Karanasiou A, Font O. Source apportionment for contaminated soils using multivariate statistical methods. Chemometrics and Intelligent Laboratory systems, 2014, 138: 127–132.
  59. 59. Han L, Gao B, Lu J, Zhou Y, Xu D, Gao L, et al. Pollution characteristics and source identification of trace metals in riparian soils of Miyun Reservoir, China. Ecotoxicology and environmental safety, 2017, 144: 321–329 pmid:28646737
  60. 60. Jiang Y, Chao S, Liu J, Yang Y, Chen Y, Zhang A. Source apportionment and health risk assessment of heavy metals in soil for a township in Jiangsu Province, China. Chemosphere, 2017, 168: 1658–1668.
  61. 61. Zhi YY, Li P, Shi JC, Zeng L, Wu L. Source identification and apportionment of soil cadmium in cropland of Eastern China: a combined approach of models and geographic information system. Journal of Soils and Sediments, 2016, 16: 467–475.