Next Article in Journal
Bacterial Diversity in a Dynamic and Extreme Sub-Arctic Watercourse (Pasvik River, Norwegian Arctic)
Next Article in Special Issue
Quantification and Regionalization of the Interaction between the Doumen Reservoir and Regional Groundwater in the Urban Plains of Northwest China
Previous Article in Journal
A Multiscale Framework for Sustainable Management of Tufa-Forming Watercourses: A Case Study of National Park “Krka”, Croatia
Previous Article in Special Issue
Characterization of Surface Evidence of Groundwater Flow Systems in Continental Mexico
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Data-Driven Approach to Assess Spatial-Temporal Interactions of Groundwater and Precipitation in Choushui River Groundwater Basin, Taiwan

1
Taiwan International Graduate (TIGP), Earth System Science, Academia Sinica and National Central University, Taipei 112, Taiwan
2
Graduate Institute of Applied Geology, National Central University, Taoyuan City 32001, Taiwan
3
Education and Culture Board, Medan City Government, Sidorame Barat II, Medan Perjuangan, Medan City, North Sumatra 20233, Indonesia
4
Research and Development Center, The Agency for Meteorology, Climatology, and Geophysics of the Republic of Indonesia (BMKG), Jl. Angkasa I No.2, Kemayoran, Jakarta Pusat 10720, Indonesia
5
Center for Environmental Studies, National Central University, Taoyuan City 32001, Taiwan
*
Author to whom correspondence should be addressed.
Water 2020, 12(11), 3097; https://doi.org/10.3390/w12113097
Submission received: 11 October 2020 / Revised: 29 October 2020 / Accepted: 2 November 2020 / Published: 4 November 2020
(This article belongs to the Special Issue Characterizing Groundwater - Surface Water Interaction Using GIS)

Abstract

:
The scarcity of groundwater and precipitation stations has limited accurate assessments of basin-scale groundwater systems. This study proposes a workflow that integrates satellite and on-site observations to improve the spatial and temporal resolution of the groundwater level and enable recharge estimations for the Choushui River groundwater basin (CRGB) in Western Taiwan. The workflow involves multiple data processing steps, including analysis of correlation, evaluation of residuals, and geostatistical interpolation based on kriging methods. The observed groundwater levels and recharge are then the basis to assess spatial-temporal interactions between groundwater and recharge in the CRGB from 2006 to 2015. Results of correlation analyses show the high correlation between the groundwater level and the land surface elevation in the study area. However, the multicollinearity problem exists for the additional precipitation data added in the correlation analyses. The correlation coefficient, root mean square error, and normalized root mean square parameters indicate that the Regression Kriging (RK) performs better the groundwater variations than the Ordinary Kriging (OK) dose. The data-driven approach estimates an annual groundwater recharge of approximately 1.40 billion tons, representing 37% of the yearly precipitation. The correlation between groundwater levels and groundwater recharge exhibits low or negative correlation zones in the groundwater basin. These zones might have resulted from multipurpose pumping activities and the river and drainage networks in the area. The event-based precipitation and groundwater level have shown strong recharge behavior in the low-land area of the basin. Artificial weir operations at the high-land mountain pass might considerably influence the groundwater and surface water interactions.

1. Introduction

Groundwater resources play an important role in water resource management issues because of the highly variable climate conditions that significantly influence water availability. The main advantage of the groundwater resource is the relatively stable amount of water supply that can be taken from aquifer systems [1]. Nowadays, population growth has enhanced the reduction in natural forest and grassland. These activities produce potential contaminant sources, which often impact the hydrological system and change surface runoff and water quality [2]. In a groundwater basin, the management of rivers and their riparian zone has a significant impact on upstream and downstream reaches [3,4]. Managing the groundwater resource in a groundwater basin highly relies on the efficient management of surface water (including the riparian zone) [3,4,5] and regular monitoring and collection of reliable groundwater levels [6]. The water levels integrated with groundwater recharge (GWR) then provide information on flow dynamics in the groundwater basin. These spatial and temporal interactions of groundwater and surface water are useful for making accurate water resource strategies.
The key process to monitor groundwater and the associated recharge requires groundwater levels to be obtained from wells and the precipitation from rain gauge stations. The direct measurements of groundwater levels and precipitation at the stations are accurate and typically have relatively high temporal resolutions. In Taiwan, many stations can have data with a resolution of hours or minutes. However, the high costs in developing monitoring networks for groundwater levels and precipitation have limited the spatial resolution of observations for most practical problems [7]. Satellite-based monitoring has recently become a useful technique for many hydrological implementations [8]. With a large covering of the monitoring areas, the satellite observations can be very efficient for mapping numerous types of spatial variations, including landcover and landscape evolution, changes of river bed landform, drought and flooding, and others on the ground [9,10,11,12]. However, the revisit time of a specific satellite can vary from days to months. The relatively low temporal resolution of satellite-based observations has motivated the use of integrated monitoring data to improve the spatio-temporal resolution for different applications [13].
Motivated by the data scarcity, interpolation methods have been widely used to improve the resolution of available observations. Kriging and the associated improved methods are the geostatistical interpolation methods, which consider geostatistical structures as the interpolation weightings. Numerous investigations have successfully used kriging methods to assess the spatial-temporal behavior of specified observations such as precipitation [14], groundwater levels and quality [15], aquifer properties [16], and others. These investigations have shown the advantages of geostatistical interpolation to improve the spatial resolution of observations of site-specific problems [17]. Specifically, Li and Heap (2014) reviewed over 40 interpolation methods, including deterministic, geostatistic, and combined methods [18]. Their study provided guidelines and suggestions for the application of spatial interpolation methods to environmental data. However, sample size, sampling design, and data properties might significantly influence the performance of the estimations. The selections of interpolation methods are usually data-specific or even variable-specific.
Regression Kriging (RK) is a combined method integrating either a simple or multiple regression model with the Ordinary Kriging (OK) of the prediction residuals. The regression equation in the regression model aims to predict the value of the dependent variable based on the linear function established by the independent or auxiliary variables (such as terrain parameters, remote sensing images, or thematic maps) [19]. For most cases, the r-squared, p-values, and correlations that appear in the output of linear regression analysis must be assessed to examine the suitability at explaining target variables. Kumar et al. (2012) indicated that an appropriate regression model might show residual errors with random and normal distributions [20]. Meng et al. (2013) evaluated different deterministic and geostatistical algorithms and mapped important bathymetric features of the Tucuruí hydroelectric reservoir [21]. The performance of the algorithms was assessed through cross-validation and Monte Carlo Simulation. Their results showed that the Regression Kriging (RK) could significantly improve the accuracy of spatial estimations when a weakly correlated auxiliary variable was used in the regression function. A similar approach applied to soil moisture interpolation in the Plateau of China can be found in Yao et al.’s work (2013) [22].
The satellite-based data have shown the advantages of investigating large-scale hydrological problems. Station-based point data show a high temporal resolution and accuracy of the observations. Sufficient open data supported by different organizations have made the data-driven approaches feasible. The objectives of the study are to develop and assess a data-driven approach that integrates satellite and on-site observations to improve the spatial-temporal resolution of groundwater levels and recharge in the Choushui River groundwater basin (CRGB) in Western Taiwan. The assessment involves multistep data processing, including correlation analyses, operations of data residuals, and OK and RK [23] interpolations. The observed groundwater levels and recharge are then the basis to assess spatial-temporal interactions between groundwater and recharge in the CRGB from 2006 to 2015. Specifically, the study uses the RK method to integrate multiple observations for evaluating interactions between groundwater and surface water. The land surface elevation and precipitation were considered as the auxiliary variables for interpolating groundwater levels in the groundwater basin. The recharge of the groundwater basin relies on the data from precipitation, land use from Climate Hazards Group InfraRed Precipitation with Station (CHIRPS) data and the Moderate Resolution Imaging Spectroradiometer (MODIS) Land Cover Type product (MCD12Q1).

2. Materials and Methods

2.1. Study Area

The study area is located in Western-central Taiwan, covering the CRGB (see Figure 1a). The CRGB is the most extensive alluvial plain in Western Taiwan. The study area is approximately 2300 km2. The groundwater basin boundaries are enclosed by Pakuashan tableland and Douliu hill to the east, Taiwan Strait to the west, Wu River to the north, and Peikang River to the south. Moreover, Choushui River is the longest river in Taiwan, and the river flows cross Central Range, Western Foothill, and Coastal Plain then into Taiwan Strait. The sedimentary thickness was estimated from 750 to 3000 m in the groundwater basin. The grain size varies mostly from the east to the west because of the relatively large land surface slopes. The materials of the sedimentary contain clay, fine sand, coarse sand, and gravel.
In the study area, the wet months are typically from May to October, and dry months are from November to April in the following years [24]. There are special conditions in the proximal fan of the CRGB. There is a mountain pass between the Pakuashan tableland and Douliu hill (Figure 1). Based on the well logs obtained from Taiwan Central Geological Surveys (CGSs), the mountain pass has a relatively shallow aquifer thickness, which varies from 30 to 50 m, and low permeable bedrock bounds the aquifer bottom. Field investigations have shown that groundwater and surface water interactions in the east and the west of the mountain pass are different because of the narrow and shallow mountain pass that allows the limited space for groundwater flow from upstream to the downstream. The specific landform near the mountain pass provides excellent conditions for the storage of surface water. The development of the Jiji weir at the mountain pass was one of the important infrastructures for the conjunction used of water resources in the CRGB.
The land use map in Figure 1b shows that agricultural lands (green color) are distributed uniformly in the study area, especially in the distal fan. The urban area shown with the dark red color is mainly located in the areas close to the Peikang River and along the coastline. Wetlands are mostly located along the coastline. Land use has an essential role in groundwater recharge. Building and other artificial structures from concrete will reduce the infiltration process. Vegetation cover over the area will support the recharge because vegetation presence increases infiltration by reducing the compacted soil and the surface runoff. Vegetation with coarse roots should facilitate groundwater recharge in the study area. The rainfall sensitivity to the rainfall in the distal fan is probably caused by vegetation over the distal fan along the Choushui River. The discharge decreases with less vegetation, indicating the impact of increased evapotranspiration due to more vegetation [4,5].

2.2. Data Used

This study used several types of data, including groundwater levels (GWLs), precipitation, land use, land surface elevation, soil types, and evapotranspiration. The monthly averaged GWLs at 31 groundwater wells were provided by the Water Resource Agency (WRA) of Taiwan from 2006 to 2015. There were two sets of precipitation used in this study. The first set of precipitation data was obtained based on 9 rain gauges of the Taiwan Water Resource Agency (WRA) from 2006 to 2015. The second set of precipitation data relied on satellite processing, which was available from The Climate Hazards Group InfraRed Precipitation with Station data (CHIRPS) [25]. The CHIRPS data were created by the National Aeronautics and Space Administration (NASA) and the National Oceanic and Atmospheric Administration (NOAA). The CHIRPS provides global coverage of daily precipitation with a 5.5 × 5.5-km spatial resolution. CHIRPS combined several satellite data and validated by station data. The land use data we obtained from sensor of the MODIS Land Cover Type product (MCD12Q1) provided by the United States Geological Survey (USGS) have a 450 × 450-m spatial resolution (Figure 1b). The supervised classification was performed to generate 18 different types of land uses and land covers. For the data processing, the land use data were reorganized from 18 classes to seven classes, namely: Water, Forest, Grassland/Savanna, Wetlands, Croplands, Urban area, and Open space or bare soils. The 30-m Shuttle Radar Topographic Mission (SRTM) data were used as the elevation data. Besides land use data, we also have soil type data provided by the Agricultural Committee of Taiwan. There were six classes of soil types, namely: (0) fine alluvial soil of tableland with well drainage, (1) limestone- and shale-type medium-coarse alluvial soil with poor drainage, (2) soaking nonlimestone-type fine alluvial soil, (3) nonlimestone-type medium alluvial soil with well drainage, (4) Poor drainage fine red soil, (5) andesite-type fine alluvial soil with poor drainage. In this study, the monthly evapotranspiration data for the analysis were obtained from the Water Resource Agency (WRA) of Taiwan. This study also analyzed the relationship between GWL, GWR, and precipitation based on an hourly rainfall event to assess the impact of the rainfall event on the groundwater responses. Due to the limited hourly evaporation data, we used the hourly evaporation and surface runoff obtained from ERA-Interim data that were produced by the European Centre for Medium-Range Weather Forecasts (ECMWFs).

2.3. Methods

In this study, assessing the relationship among the GWLs, GWR, and precipitation will focus on two topics: the interpolation of GWL and GWR estimation. In the first topic, we compared OK and RK for analyzing the spatial-temporal distribution of the GWLs over the study area. In principle, OK used only the GWL as the single variable for interpolation. However, the RK used the elevation and “modified” precipitation as the covariates for interpolating the GWLs. The closeness between the predicted and observed values has been presented using selected criteria such as Coefficient Correlation (r), Root Mean Square Error (RMSE), and Normalized Mean Square Error (NMSE). The OK and RK methods produced 240 maps for the monthly averaged GWLs. The discussion of the results will focus on different time-scales that represent the significant variations of GWLs induced by short events and seasonal and annual variations. The second topic will focus on the estimation of basin-scale GWR. This study employed the concept of water balance for calculating GWR. In the analysis, the water cycle components include the regional precipitation (P), evapotranspiration (ET), and surface runoff (R). Results of GWR and the associated GWLs and precipitation play an essential role in characterizing the interactions of surface water and groundwater in the CRGB.

2.3.1. Modified Precipitation (MoP)

The MoP aims to reduce the over- and underestimations of the precipitation obtained from the CHIRPS satellite data. Over the years, research efforts have focused on improving the temporal and spatial accuracy of the CHIRPS data for site-specific problems [25]. It was recognized that the accuracy of the CHIRPS data could vary significantly based on the available rain gauge stations used for conditioning the CHIRPS observations at the rain gauge stations. For this study, it was assumed that the rain gauge stations in the CRGB might also play an essential role in predicting regional precipitation. Therefore, the residual precipitation between rain gauge observations and CHIRPS satellite data were then used to integrate and assess the interactions between the precipitation and groundwater. There are 12 rain gauge stations in the study area (see Figure 1a). Because of the limited rain gauge stations to formulate the geostatistical structure for kriging interpolation, the residual maps for different times were interpolated based on the Inverse Distance Weighting (IDW) method. Adding the residual maps back to the CHIRPS data can yield the MoP at different times in the study area. The temporal scales of the precipitation for our analyses can be a month or a day. Therefore, the daily results of MoP were further accumulated to obtain different time intervals of MoP for our analyses.

2.3.2. Linear Regression Analysis

This study implemented both simple linear regression and multiple linear regression algorithms to assess the relationship between the independent variables (i.e., precipitation and elevation) and the GWL. The simple linear regression aims to determine the relationship between the precipitation and GWL and the land surface elevation versus the GWL. We combined the land surface elevation and precipitation as the independent variables for the analysis for the multiple linear regression. The correlation squared (r2) has a special meaning in linear regressions. It represents the proportion of variation in GWL explained by independent variables [26].
Results of the linear regression analysis provide general insight into the use of the precipitation and land surface elevation for estimating the GWL. In the study, the multiple linear regression model has the general formula [26]:
z ^ = β ^ 0 + β ^ 1 x 1 + β ^ n x n + ε
where z ^ is the dependent variable. The notation x n is the independent variable and β ^ n is the regression coefficient. The ε on the right-hand side represents the error from the regression.

2.3.3. Geostatistical Interpolation Methods

This study explored two geostatistical interpolation methods, including the OK and the RK methods for a data-driven approach [23]. Kriging is a geostatistical interpolation method frequently used for various implementations [18]. The kriged estimates rely on the weighted linear sum of the observations. The unique feature of the kriging is to consider the geostatistical structure of the observations. The weights are derived based on the selected variogram or covariance models to represent the spatial variability of the variables [25]. Kriging methods provide an estimate that is more sophisticated than other deterministic interpolation methods such as the nearest-neighbor or the IDW method.
Ordinary kriging is the most commonly used type of kriging method [26]. OK creates an estimator used in cases where there are unknown global means. The prediction model of OK can be described by:
Z ^ ( s 0 ) = μ + ε ( s 0 )
where Z ^   ( s 0 )   is   the predicted value at an unsampled location s 0 , μ   is the constant stationary function (global mean), and ε ( s 0 )   is the spatially correlated stochastic part of the variation. The first step in OK is to calculate a semivariogram from the scatter point set to be interpolated. A semivariogram is a tool that can be used to quantify the changes of correlation among the measurements by the increasing distance [17]. The variograms can be described by [17]:
γ ( h ) = 1 2 E [ ( Z ( s ) Z ( s + h ) ) 2 ]
where γ ( h ) is the semivariogram with a fixed distance h . The Z ( s ) is the value of the variable at the location s and Z ( s + h ) is the value at a location separated by the distance h . Determining a variogram model requires the information of the properties, the sill, the range, and the nugget, fitted based on the data pairs in the experimental semivariogram [27].
Regression kriging is a modified kriging method that combines a regression of the dependent auxiliary variables (such as terrain parameters, remote sensing imagery, and thematic maps) with the kriging of the regression residuals. It is mathematically equivalent to the interpolation method variously called universal kriging and kriging with external drift, where the auxiliary predictors are used directly to solve the kriging weights [19]. In the study, the concept of the regression algorithm for the RK has the following mathematical expressions [17]:
z ^ ( x 0 ) = m ^ ( s 0 ) + ε ^ ( s 0 )
z ^ ( x 0 ) = β ^ 0 + k = 0 p β ^ k q k ( s 0 ) + i = 1 n λ i ε ( s i )
where m ^ ( s 0 ) is the fitted drift and ε ^ ( s 0 ) is the interpolated residual. Notations β ^ k and β ^ 0 are the estimated drift model coefficients and the estimated intercept, respectively. q k ( s 0 ) represents the values of the auxiliary variables at the target location s 0 and λ i represents the kriging weights determined by the spatial dependence structure of the residual ε ( s i ) at the measurement location s i .

2.3.4. Assessment Criteria

This study assesses the results obtained from the OK and RK. The closeness of the predicted values to the observed values relies on the calculated statistics such as the Coefficient Correlation (r), Root Mean Square Error (RMSE), and Normalized Mean Square Error (NMSE). The Pearson correlation coefficient, r, indicates how far away all these data points are to this line of best fit (i.e., how well the data points fit this new model or the line of the best fit) [26]. The RMSE is a measure of the error size, but it is sensitive to outliers. The RMSE should be as low as possible to produce better modeling. The Normalized Mean Square Error (NMSE) indicates information on the deviations and therefore always giving positive values. The increasing NMSE means the increase in deviation between predicted and observed values. If a model has a very low NMSE, it indicates good predictions in space and time.

2.3.5. Estimations of GWR Based on Land Cover and Land Use

The calculation of the GWR is critical to quantify the input of an aquifer system. In this study, the mass balance concept was employed to estimate the spatial-temporal GWR in the CRGB. Based on the available observations from the Water Resource Agency and Central Weather Bureau of Taiwan, the recharge can be determined by the following water balance equation [28]:
W = P E T R
where W is the recharge, P is atmospheric precipitation, ET is the evapotranspiration, and R is the surface runoff. The MoP is determined for the source of the precipitation data for the analysis. The monthly evapotranspiration used the data from the Central Weather Bureau of Taiwan.
For the calculation of surface runoff, this study used the Soil Conservation Service (or SCS) curve number (CN) model to link the relationships between land cover, hydrologic soil classes, and curve numbers [29]. The SCS-CN model is simple to apply, easy to understand, and relatively stable to the solution. Furthermore, this method includes most runoff characteristics in watersheds, such as hydrologic conditions, land use, moisture condition, and soil type. The SCS-CN model was initially developed for its use on small agricultural watersheds and was extended and applied to rural, forest, and urban watersheds. Since the inception of the model, it has been used in a wide range of environmental implementations. The Natural Resource Conservation Service classifies four Hydrologic Soil Groups (HSGs) based on the runoff potential of different soils. Therefore, the first step to calculate the runoff is to define the soil types based on the hydrologic soil groups. Details of this classification can be found in the study of Abraham et al. [30].
The second step is to classify the land use based on the curve number groups of the U.S. Department of Agriculture (USDA) in 1986 [30]. In this study, we used the land use data derived by the MODIS satellite. Specifically, the land use is derived based on the land cover classifications proposed by Anderson et al. [31] and Congalton and Green [32].

3. Results and Discussion

The presentation of the results will focus on comparing different supporting variables (i.e., MoP and surface elevation) to predict the GWL in the CRGB. There are long-term data collected for this study. The data set in January 2006 was used for illustration purposes. Other data sets follow the same workflow to obtain the detailed temporal variations from February 2006 to December 2015. With the developed workflow applied to the groundwater basin, the details of the GWR were obtained. The variations of GWR enable the assessment of long-term interactions between groundwater and surface water in the CRGB.

3.1. Modified Precipitation (MoP)

Figure 2 shows an example of the results for the calculation of the MoP. The map for January 2006 was selected to illustrate how the MoP was obtained in the study. Figure 2a shows the original CHIRPS data for January 2006, while Figure 2b presents the MoP in the study area. The CHIRPS observations give a maximum and minimum monthly precipitation of 33.52 and 7.37 mm for January 2006. Furthermore, the CHIRPS observations in Figure 2a show that the high precipitation areas were in the east region of the CRGB. With the observations at the rain gauge stations, the MoP shows different patterns and magnitudes of the precipitation in the CRGB. In general, the magnitude of the precipitation is reduced based on the field observations at the rain gauge stations. The variations of the precipitation in local areas are also high as compared with the original CHIRPS data in Figure 2a. The significant differences are obtained in the south and north of the distal fan area. In the proximal fan area, there are considerable variations shown in the MoP map (i.e., Figure 2b).
We found significant differences between the original CHIRPS data and the MoP results. The large difference can be from the resolution of CHIRPS data and the field observations at the stations. Before the use of CHIRPS, we applied a linear regression analysis for CHIRPS data and field observations. The Choushui River located in Southwestern Taiwan, and the correlation coefficient shows a strong correlation (r > 0.50) between CHIRPS data and field observations. However, the performance of the CHIRPS data was not always consistent through the observation times [25]. The difference between CHIRPS data and field observations at the station can also produce results of MoP different from the original CHIRPS data.

3.2. Linear Regression Analysis

A linear regression analysis can provide indications to assess whether the additional variables have a linear correlation with the GWL. In this study, there were three combinations of models tested by the linear regression method. The results are the criteria for selecting secondary variables in the RK interpolation of the GWL. As an example, we showed the results of the linear regression analyses for each model in January 2006.

3.2.1. GWL and Land Surface Elevation (DEM)

The first model evaluates the relationship between GWL and the land surface elevation in the CRGB. According to the regression model result, using land surface elevation as the additional variable produced an r-squared (r2) value of 0.95, and the correlation coefficient of the regression model was 0.97 (Table 1). The correlation coefficient (r) between the GWL and elevation is closer to +1, indicating a positive and strong linear relationship between the rain gauges and elevation. Such results imply that the land surface elevation can describe 95% of GWL variation in the study área. Additionally, the increase in land surface elevation is associated with the increase in the GWL in the study area. The statistics of the two data sets show that the p-value is smaller than 0.05, indicating the significance of the land surface elevation as the additional variable. The significance of the land surface elevation suggests that the elevation data obtained from DEM SRTM can be included as an additional covariate for improving the RK interpolation of the GWL.

3.2.2. GWL and MoP

The second regression model considers the MoP to be the additional variable for predicting the GWL. In the analysis of the GWL and the MoP, the linear regression model shows an R-squared value of 0.49, and the correlation coefficient (r) for the model is 0.70 (Table 1). The result shows a strong and positive linear relationship between the GWL and MoP. Furthermore, as the additional variable, the p-value of MoP for the GWL is also lower than 0.05 (i.e., p-value < 0.05). This p-value implies that adding the MoP into the regression model might improve the GWL prediction. Compared with the result obtained from land surface elevation analysis, the MoP has a relatively low linear relationship with the GWL. In the case of January 2006, MoP can be used as an additional variable for predicting the GWL. However, the highly nonuniform precipitation in CRGB might lead to a low correlation between the GWL and the MoP. Based on the results obtained from the 10-year data, the overall performance of the correlation coefficient was r = 0.23, showing a weak positive correlation. In general, the prediction of the monthly GWL could include the MoP as the additional variable. However, the linear regression analysis for each month is required because of the inconsistency of the linear relationship between the GWL and MoP for each month.

3.2.3. GWL, Land Surface Elevation (DEM), and MoP

The third regression model considers the land surface elevation and MoP simultaneously. The linear regression results show that the correlation coefficient (r) for the third model is 0.97 (Table 1). This correlation coefficient shows a strong linear correlation among GWLs, land surface elevation, and the MoP. However, the p-value of the analysis indicates that adding land surface elevation and MoP in the model leads the MoP from being significant to insignificant. The multicollinearity of data characteristic induces the MoP to become an insignificant variable in the third model. The data multicollinearity represents the independent variables in a multiple regression model that might closely correlate to others. In the third model, we obtained a correlation coefficient values of 0.68 for land surface elevation and MoP. A similar correlation coefficient value (r = 0.70) was obtained for the GWL and MoP in the CRGB (Table 1). The results give a general insight into the use of the additional variable for RK interpolation. A higher correlation coefficient (i.e., closer to 1) between the independent and dependent variables will lead to better results of RK interpolation. Therefore, this study considered the land surface elevation to be the single variable used for improving the interpolation of the GWL in the CRGB.

3.3. Geostatistical Interpolation Methods

The kriging methods require the geostatistical structure of the data for site-specific problems. In most cases, the geospatial analysis can produce the geostatistical structure for specific parameters. Figure 3 shows an example of the experimental variogram and the fitted Gaussian variogram model for January 2006. The results in Figure 3 used the monthly averaged GWLs obtained from the first layer of the groundwater monitoring network in the CRGB. Note that the isotropic Gaussian model and nonlinear least-squares fitting algorithm were used for the analysis. The study of Kumar and Remadevi (2008) found that the Gaussian variogram model is appropriate for the interpolation of the GWL [33]. Three important properties are indicated in the (semi) variograms, including the sill, range, and nugget [27]. In Figure 3, the fitted variogram model yields sill, nugget (c0), and range values of 1094, 85, and 15,295 m, respectively. In this study, the data set for other months follows the same workflow to obtain the geostatistical structure as the kriging interpolation methods. The data processing and the fitting algorithm were developed with the R programming language. In this study, there are 120 sets of monthly averaged data for the OK and RK interpolations. The variogram parameters for different months might show differently because the groundwater variations in the CRGB might be influenced by natural precipitation and human activities.
Figure 4 shows an example of the interpolated GWL for January 2006. Figure 5 further presents the kriging standard deviation (i.e., variance) for the different interpolation methods. The results in Figure 4 and Figure 5 show that the RK associated with the regression of land surface elevation can reproduce more details of the groundwater variations. It is clear that the GWLs near the Choushui River had been resolved, and the GWL varies with the land surface elevation gradually from the east to the west of the CRGB. A considerable difference was obtained near the coastal line in the north portion of the CRGB. With relatively coarse monitoring wells located in the area (see Figure 1a), the OK result shows the smooth and slightly overestimated GWL. The kriging error variance accounts for the uncertainty that might be introduced by the interpolation methods. The greater the error variance, the higher the uncertainty obtained from the interpolation method (Figure 5).
By definition, the values of the error variance depend on the locations of the observations and the nugget effect in the variogram model. The locations with observation data (i.e., the conditioning points) lead to the smallest variances. Away from the observation locations, the kriging interpolation employs the geostatistical structure to derive the kriging weightings for interpolations. In Figure 4b, the RK result has shown that the additional variable from the land surface elevation can significantly improve the interpolation. The improvement is especially evident in the areas near the coastal line, the south boundary, and the north boundary of the CRGB. Note that the result shown here is for the month-averaged GWL in January 2006. The overall quantitative evaluation of the interpolation results will be discussed later in the following sections.

3.4. Assessment Criteria

The comparison of cross-validation results between OK and RK for 10-year data are shown in Figure 6. The study employed the Leave One-Out Cross-Validation (LOOCV) method to quantify the performance of different kriging interpolation methods. The selected criteria for the evaluation are Normalized Mean Square Error (NMSE), Root Mean Square Error (RMSE), and the correlation coefficient (r). Again, the high NMSE value indicates the increase in deviation between predicted and observed values [34]. The parameter NMSE is very sensitive to differences between predicted and measured values. A model with a low NMSE represents well prediction in space and time [34]. The RMSE deals with the measurement error size, but it is sensitive to outliers [35]. The RMSE should be as low as possible to produce better modeling results [36]. The Pearson product-moment correlation coefficient (or Pearson correlation coefficient, for short) is a measure of the strength of a linear association between two variables [35].
By accumulating the results for 10-year observations, the average performance for each month was obtained. In general, the RK shows a relatively better performance when estimating the GWL in the CRGB. However, the differences are not evident if one compares the average performance for each month. Figure 6 shows that the interpolation performance varies with the seasonal fluctuation of the groundwater observations. In the dry seasons from February to June, GWL estimations are generally less accurate than those in the wet seasons from July to October in a year. A particular case in April shows that the RK method obtains an NMSE value higher than that obtained by using the OK method. This result implies that the land surface elevation added in the RK interpolation might not improve the interpolation results. A possible factor that influences the inconsistency of the RK interpolation could be the human activities introduced in the CRGB. In April, the dry season will lead to excessively high groundwater pumping for agriculture activities (especially the paddy fields for the first crop season) in the CRGB. Local groundwater pumping will lead to a local GWL decrease, and the local GWL reduction might not directly correlate to the land surface elevation.

3.5. GWR

The GWR is relevant to numerous factors, including infiltration capacity, the stochastic character of precipitation, and other site-specific climate conditions [37]. Previous investigations have shown that the spatial and temporal distribution of the regional precipitation is the main factor that dominates the natural variations of the GWR [37].
Table 2 summarizes the soil groups identified in the CRGB. In the study, the Composite Curve Number (CCN) number weights were modified based on the studies of USDA (1986) and Hong and Adler (2008). Table 1 also lists the areas for the land use types. It is recognized that crop lands play an important role in regional water interactions. The agriculture activities associated with the water demand might produce a challenging task for water resource management. The results indicate that the type A of hydrogeological soil group occupied the maximum area in the CRGB. There is more than 60% of soil type A obtained for the study area. The total area was then calculated by CN of type A which multiplied by area per land use (CN x area).
Using the CCN values enables the calculation of the S and Ia for the entire groundwater basin. The value of S and Ia are the key parameters to derive the surface runoff (R) for the study area. With the distribution of S and Ia in the study area, the groundwater recharge (W) can be calculated with different temporal scales. Figure 7 shows the annual average GWR from 2006 to 2015 in the CRGB. In general, the GWR pattern varies with the land surface elevation and gradually decreases from the east to the west in the CRGB. This spatial pattern can also be found in the individual year of GWR. The distribution of the GWR is consistent with the distribution of precipitation in a year. However, in local areas, the GWR variations are high, especially in the distal fan area near the coastal zone. In the proximal fan area in the CRGB, we obtained the highest GWR, and the recharge zone is stable throughout the 10-year analysis. The Taiwan Central Geological Survey (Taiwan CGS) had defined the area as the recharge sensitivity zone for the CRGB. Conducting intensive groundwater monitoring is required to protect the groundwater resource in the proximal fan area.
Using the raster calculation can obtain the total volume of the GWR in a year. The data-driven approach yields a total volume of 1.40 × 109 m3 GWR for the CRGB. This value represents approximately 37% of the total precipitation in a year, which has a total volume of 3.77 × 109 m3 on average in a year. The results in this study agree with many previous investigations that used numerical models to evaluate the GWR in the CRGB. In these studies, the annual average recharge in the CRGB was approximately 1.238 × 109 m3 [38]. Our result slightly overestimated the GWR in the CRGB.

3.6. Spatial-Temporal Variations of GWL and GWR

In this study, overlaying the land use map in the study area found negative GWLs obtained in the bare soil and urban regions, especially in the west of the CRGB. The over-extraction of the groundwater might cause the local negative GWLs in these areas. Understanding the interactions between the GWL and MoP is crucial in managing basin-scale water resources. This study then conducted correlation analyses to characterize the interactions between the normalized GWL and the GWR.
Figure 8 shows the time series of the normalized GWL and GWR by integrating the 10-year data. Note that the normalized data had considered the standard deviation of GWL and GWR for presentation purposes. We used three selected groundwater monitoring stations, including 9100211, 1,004011, and 9180211 (; shown in Figure 1a) in the proximal, middle, and distal fans to illustrate the analysis. In this study, the land surface elevations for these monitoring stations in the proximal, middle, and distal fans are 179, 46, and 5 m, respectively.
In Figure 8, the results for July and August show similar trends for the GWL and the GWR in the entire CRGB. Estimated correlation coefficients between GWL and GWR show approximately 0.86, 0.59, and 0.82 in proximal, middle, and distal fan areas in the high precipitation months (i.e., July and August in a year). This finding agrees with the conclusion proposed by Tsai et al. (2015) that the upstream area of the Choushui River has a higher recharge rate than other areas [39]. There are significant decreases in the GWR from September to October because of less precipitation in the study area (see Figure 8a–c). In Figure 8d–f, the results from December to February show clear negative correlations between the GWL and the GWR. The cases in January are the worse scenarios that show high deviations for proximal, middle, or distal fan areas. This period is a new crop season for the CRGB. With the shortage of available surface water, human activities might induce additional decreases in the GWL. The conditions can lead to the complexity of water resource management.
Notice that the selected well 9,100,211 is located in the proximal fan area. We found that the pattern of the correlation between GWL and GWR in the dry months is different from the other two wells in the middle and distal fans. The results in Figure 8d show a negative correlation in dry months. However, the GWL gradually increases from November to February, which is different from the results in Figure 8e,f. The increase in the GWL at the well might be influenced by the controlling lateral recharge from the Choushui River. As we have known in the proximal fan area, there is the Jiji weir built at the mountain pass for conjunction use of water resources. The operations of the weir might influence the natural behavior of the GWL near the upstream of the proximal fan area.
For understanding the impact of precipitation on the GWR and GWL, the Pearson correlation coefficient maps were derived using the 10-year data (i.e., 2006–2015) of the GWL and GWR. Based on the precipitation data, the high precipitation months are generally from May to October, representing the wet months. The low precipitation months are from November to April, which represent the dry months. The Pearson correlation values are generally various in the study area, but most higher correlation values are shown in the proximal fan area. Figure 9 shows the annual average of the estimated correlation between GWL and GWR. The results are the basis to assess the regional interactions between the GWL and GWR.
In the upstream area, there are low correlation zones for the GWL and GWR (Figure 9a). Principally, low correlations occur when disagreement trends exist in the same months in different years at the grid points. The mechanism is complicated because the long-term data set covered the climate and anthropogenic conditions through the collection data. The pattern of the results in Figure 9 can provide the fundamental elements to assess the outcome of the regional water interactions. In this study, the GWR estimation relied on the calculation of surface runoff using the data from precipitation, land use, evaporation, and soil types. The various land use patterns and soil types have shown a high impact on the estimated GWR variations. In Figure 9a, the spots with high correlations typically represent the areas without vegetation, and those areas consist of well drainage soil types. Therefore, these areas have a consistent high GWR and GWL in the same period and produce a relatively higher correlation between the GWR and GWL. The areas with low correlation or negative correlation could be influenced by land use and human activities.
Figure 9a shows the correlation between GWL and GWR by averaging all data sets from 2006 to 2015. The high annual correlation values between GWL and GWR are mainly located in the proximal and distal fan areas. There are negative correlation zones in the central regions of the CRGB. The correlation values in these zones can reach −0.5, indicating the disturbance of GWL has dominated the character of surface water and groundwater interactions. Figure 9b,c show the correlation between GWL and GWR for the wet and dry months. There are considerable differences in the patterns of the correlation distribution. In the wet months, the positive correlation areas dominate the CRGB, indicating a similar GWL and GWR trend. There are also negative correlation areas located along the Choushui River in the center of the CRGB and along the Peikang River in the south of the CRGB. The behavior is not apparent based on the analyses of the correlation between the GWL and GWR. Figure 9c shows the correlation map for dry months. The map shows a comparison with the result in Figure 9b. The negative correlation dominates in the dry months. We found that the positive correlation areas in the dry months are also along the river system. The weak correlation near the river system could be the overall character that is induced by pumping activities and controlled by the river stages.
Figure 10 shows the selected distributions of the correlation map for February and July, representing the dry month and wet month, respectively. Note that the correlation maps considered the data sets obtained from 2006 to 2015. The averaged correlation maps can provide variations of the water interaction dynamics in the CRGB. Comparing the results in Figure 9b,c with the results of February and July in Figure 10, we have observed identical distributions of the correlation, suggesting that the behavior in July and February can represent the interactions between the GWL and GWR in the wet and dry months.
Except for the selected months in Figure 10, the proposed approach can estimate correlation maps for different months. During the monsoon season (May and June), the precipitation in the season varies significantly from one year to another. Therefore, the correlation between GWL and GWR during the transition month (May and June) is not consistent with other wet months (May–October). The unstable variations of the precipitation might directly influence the overall correlation between GWL and GWR. However, in the dry months (i.e., November to April), the correlation of the GWL and GWR is not clear. Characterizing the mechanism of water interactions becomes a challenging task because the variations of the correlation are very high from month to month. Human activities in the CRGB might need to be involved in the analysis to address the issue.

3.7. Event-Based Correlation of GWL and GWR

Figure 11 shows the even-based estimation of the GWR and compares the GWL at wells in the proximal, middle, distal fan areas. This study selected a 3-day rainfall event from 22 May 2015 to 24 May 2015 to assess the interactions of the GWL and GWR. In the comparison case, we employed the hourly precipitation and GWL for the analysis. This period was determined based on a continuous rain event, enabling us to focus on the typical rain event. We calculated the GWR using the mass balance concept shown in Equation (6). Due to limited evaporation data, the GWR was estimated by using hourly evaporation and runoff data from ERA-Interim data produced by the European Centre for Medium-Range Weather Forecasts (ECMWFs). ERA-Interim is a global atmospheric reanalysis that combines observations and a numerical model to create a synthesized data estimate. Evaporation and runoff data of ERA-Interim have often been used to analyze hydrological problems in many cases [40].
Figure 11 shows that the rainfall event in the CRGB produced a similar rainfall pattern but was slightly different in the rainfall intensity. However, the two GWL observations in the proximal fan area (one in the east of the mountain pass and one in the west of the mountain pass) show significantly different patterns. The GWL in the east of the proximal fan area shows a gradual decrease (see Figure 11a). The effect of the rainfall event on the GWL is limited. With the heavy rainfall after the third day (i.e., after 48 h), the GWL decreased dramatically. However, the GWL in the west of the mountain pass in the proximal fan shows a pattern similar to the GWL in the middle and distal fan areas. The GWL in the west increases with the increase in precipitation in the rainfall event. The GWL observations in the east and the west of the proximal fan show significantly different responses to the same rainfall event. This different behavior could be influenced by the high variation of aquifer thickness at the mountain pass between Pakuashan tableland and Douliu hill (see Figure 1a). The mountain pass could constrain the groundwater flow. Additionally, the Jiji weir was built across the mountain pass (Figure 1a). The operations of the weir could also induce the local interactions of groundwater and surface water. With a flood event, the water levels in the weir might be reduced to protect the infrastructure. The GWL in the middle and distal fan areas generally follows the precipitation pattern, directly contributing to the GWR. Our study found that the rainfall in the middle fan area is relatively sensitive to the GWR as compared to the GWL in the distal fan area.

4. Conclusions

This study proposes a data-driven approach that enables the integration of multiple data sources to produce a higher spatial and temporal resolution of the GWL. The results provide useful information to assess the basin-scale water interactions. Numerous evaluation criteria were used to determine the accuracy of the interpolated GWLs and evaluate the performance when additional variables were added in the interpolation. Based on the MoP obtained from the data processing, this study employed the CCN method to estimate the GWR and evaluate the correlation between GWL and GWR. The interactions between GWL and GWR are systematically discussed in the study.
The MoP was obtained based on reducing the residuals between CHIRPS and field observations from the rainfall stations. The results of the precipitation modification showed significant improvement of the accuracy as compared to the original CHIRPS data. The linear regression analysis showed that the precipitation and DEM data could not be included together as the additional variables for the GWL interpolation because of the obtained multicollinearity condition. With the DEM data included in the RK, the interpolated GWL showed considerable improvement in spatial and temporal domains. This study calculated the GWR by using the relationship that involves observations from the precipitation, evapotranspiration, and surface runoff in the study area. By the raster calculation, the volume of GWR in a year is about 1.40 billion m3/year and the value agrees with previous numerical studies. This value represents 37% of the annual precipitation (3.77 billion m3/year).
The high annual correlation values between the GWL and GWR are mainly located in the proximal and distal fan areas. There are negative and positive correlation zones that coexist in the central regions of the CRGB. The correlation values in these zones can reach −0.5, indicating the disturbance of GWL has dominated the character of surface water and groundwater interactions. In the wet months, the positive correlation areas dominate the CRGB, showing a similar GWL and GWR trend. The positive correlation between GWL and GWR are along with the river system during the dry years. The weak correlation near the river system could be the overall character that is induced by pumping activities and controlled by the river stages.
We have observed that the behavior in July and February can represent the GWL and GWR interactions in the wet and dry months. In general, the wet months show relatively high correlations between GWL and GWR. However, in the dry months, the correlation between GWL and GWR is not clear based on the available observation data. The operations of the Jiji weir at the mountain pass can also induce the local interactions of groundwater and surface water. More information relevant to anthropogenic activities such as groundwater use and weir operations is required to quantify the influence of the anthropogenic activities on the regional water interactions.
The approach in this study was developed to improve the spatio-temporal resolution of the observations obtained from the satellite and field observations. The RK is robust to interpolate the groundwater levels supported by surface elevation as the second variable. The available satellite data allow for evaluating basin-scale groundwater recharge by using the modified precipitation and land use. However, the temporal resolution has been restricted by the available satellite images, although the field observations from field stations have relatively high sampling rates. The inconsistency of the temporal resolution will limit the use of the near real-time monitoring of near-surface hydrology and groundwater dynamics.

Author Contributions

L.N. and C.-F.N., conceived the research and made helpful discussions during the conception of the study. Y.D. was included in preparation and processing of remote sensing data. L.N. conducted the study, performed analyses, and wrote the first manuscript draft. C.-F.N., I.-H.L., C.-P.L., and W.-C.L., enhanced and finalized the manuscript for the communication with the journal. All authors have read and agreed to the published version of the manuscript.

Funding

This research was partially supported by the Ministry of Science and Technology, the Republic of China, under grants MOST 107-2116-M-008 -003 -MY2, MOST 109-2621-M-008 -003, MOST 109-2625-M-008 -006, and by the Soil and Groundwater Pollution Remediation Fund in 2017 and 2018.

Acknowledgments

The authors thank the Water Resource Agency (WRA) of Taiwan for providing the groundwater and precipitation data.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Twort, A.C.; Ratnayaka, D.D.; Brandt, M.J. 4-Groundwater Supplies. In Water Supply, 5th ed.; Butterworth-Heinemann: London, UK, 2000; p. 114-II. [Google Scholar]
  2. Keesstra, S.D.; Geissen, V.; Mosse, K.; Piiranen, S.; Scudiero, E.; Leistra, M.; van Schaik, L. Soil as a filter for groundwater quality. Curr. Opin. Environ. Sustain. 2012, 4, 507–516. [Google Scholar] [CrossRef]
  3. Keesstra, S.D.; Kondrlova, E.; Czajka, A.; Seeger, M.; Maroulis, J. Assessing riparian zone impacts on water and sediment movement: A new approach. Neth. J. Geosci. Geol. en Mijnb. 2014, 91, 245–255. [Google Scholar] [CrossRef]
  4. Keesstra, S.D. Impact of natural reforestation on floodplain sedimentation in the Dragonja basin, SW Slovenia. Earth Surf. Process. Landf. 2007, 32, 49–65. [Google Scholar] [CrossRef]
  5. Keesstra, S.D.; van Dam, O.; Verstraeten, G.; van Huissteden, J. Changing sediment dynamics due to natural reforestation in the Dragonja catchment, SW Slovenia. Catena 2009, 78, 60–71. [Google Scholar] [CrossRef]
  6. Şen, Z. Chapter 6-Groundwater Management. In Practical and Applied Hydrogeology; Şen, Z., Ed.; Elsevier: Oxford, UK, 2015; pp. 341–397. [Google Scholar]
  7. Miro, M.; Famiglietti, J. Downscaling GRACE Remote Sensing Datasets to High-Resolution Groundwater Storage Change Maps of California’s Central Valley. Remote Sens. 2018, 10, 143. [Google Scholar] [CrossRef] [Green Version]
  8. Li, J.; Pei, Y.; Zhao, S.; Xiao, R.; Sang, X.; Zhang, C. A Review of Remote Sensing for Environmental Monitoring in China. Remote Sens. 2020, 12, 1130. [Google Scholar] [CrossRef] [Green Version]
  9. Lu, C.-H.; Ni, C.-F.; Chang, C.-P.; Chen, Y.-A.; Yen, J.-Y. Geostatistical Data Fusion of Multiple Type Observations to Improve Land Subsidence Monitoring Resolution in the Choushui River Fluvial Plain, Taiwan. Terr. Atmos. Ocean. Sci. 2016, 27, 505. [Google Scholar] [CrossRef] [Green Version]
  10. Schwenk, J.; Khandelwal, A.; Fratkin, M.; Kumar, V.; Foufoula-Georgiou, E. High spatiotemporal resolution of river planform dynamics from Landsat: The RivMAP toolbox and results from the Ucayali River. Earth Space Sci. 2017, 4, 46–75. [Google Scholar] [CrossRef]
  11. Lu, C.-H.; Ni, C.-F.; Chang, C.-P.; Yen, J.-Y.; Chuang, R.Y. Coherence Difference Analysis of Sentinel-1 SAR Interferogram to Identify Earthquake-Induced Disasters in Urban Areas. Remote Sens. 2018, 10, 1318. [Google Scholar] [CrossRef] [Green Version]
  12. Liou, Y.-A.; Mulualem, G.M. Spatio–temporal Assessment of Drought in Ethiopia and the Impact of Recent Intense Droughts. Remote Sens. 2019, 11, 1828. [Google Scholar] [CrossRef] [Green Version]
  13. Kreklow, J.; Steinhoff-Knopp, B.; Friedrich, K.; Tetzlaff, B. Comparing Rainfall Erosivity Estimation Methods Using Weather Radar Data for the State of Hesse (Germany). Water 2020, 12, 1424. [Google Scholar] [CrossRef]
  14. Adhikary, S.K.; Muttil, N.; Yilmaz, A.G. Cokriging for enhanced spatial interpolation of rainfall in two Australian catchments. Hydrol. Process. 2017, 31, 2143–2161. [Google Scholar] [CrossRef] [Green Version]
  15. Li, L.; Huang, G. Groundwater Level Mapping Using Multiple-Point Geostatistics. Water 2016, 8, 400. [Google Scholar] [CrossRef] [Green Version]
  16. Jeihouni, M.; Delirhasannia, R.; Alavipanah, S.K.; Shahabi, M.; Samadianfard, S. Spatial analysis of groundwater electrical conductivity using ordinary kriging and artificial intelligence methods (Case study: Tabriz plain, Iran). Geofizika 2015, 192–208. [Google Scholar] [CrossRef]
  17. Hengl, T. A Practical Guide to Geostatistical Mapping, 2nd ed.; University of Amsterdam: Amsterdam, The Netherlands, 2009. [Google Scholar]
  18. Li, J.; Heap, A.D. Spatial interpolation methods applied in the environmental sciences: A review. Environ. Model. Softw. 2014, 53, 173–189. [Google Scholar] [CrossRef]
  19. Hengl, T.; Heuvelink, G.B.M.; Rossiter, D.G. About regression-kriging: From equations to case studies. Comput. Geosci. 2007, 33, 1301–1315. [Google Scholar] [CrossRef]
  20. Kumar, S.; Lal, R.; Liu, D. A geographically weighted regression kriging approach for mapping soil organic carbon stock. Geoderma 2012, 189–190, 627–634. [Google Scholar] [CrossRef]
  21. Meng, Q.; Liu, Z.; Borders, B.E. Assessment of regression kriging for spatial interpolation—Comparisons of seven GIS interpolation methods. Cartogr. Geogr. Inf. Sci. 2013, 40, 28–39. [Google Scholar] [CrossRef]
  22. Yao, X.; Fu, B.; Lü, Y.; Sun, F.; Wang, S.; Liu, M. Comparison of Four Spatial Interpolation Methods for Estimating Soil Moisture in a Complex Terrain Catchment. PLoS ONE 2013, 8, e54660. [Google Scholar] [CrossRef]
  23. Glenn, J.; Tonina, D.; Morehead, M.D.; Fiedler, F.; Benjankar, R. Effect of transect location, transect spacing and interpolation methods on river bathymetry accuracy. Earth Surf. Process. Landf. 2015, 41, 1185–1198. [Google Scholar] [CrossRef]
  24. Lu, C.H.; Ni, C.F.; Chang, C.P.; Yen, J.Y.; Hung, W.C. Combination with precise leveling and PSInSAR observations to quantify pumping-induced land subsidence in central Taiwan. Proc. IAHS 2015, 372, 77–82. [Google Scholar] [CrossRef] [Green Version]
  25. Funk, C.; Peterson, P.; Landsfeld, M.; Pedreros, D.; Verdin, J.; Shukla, S.; Husak, G.; Rowland, J.; Harrison, L.; Hoell, A.; et al. The climate hazards infrared precipitation with stations—A new environmental record for monitoring extremes. Sci. Data 2015, 2. [Google Scholar] [CrossRef] [Green Version]
  26. Goovaerts, P. Applied Geostatistic for Natural Resources Evaluation; Oxford University Press, Inc.: Oxford, UK, 1997. [Google Scholar]
  27. Burrough, P.A.; Lloyd, C.D. Principles of Geographical Information Systems, 3rd ed.; Oxford University Press: Oxford, UK, 2015. [Google Scholar]
  28. Bisson, R.A.; Lehr, J.H. Modern Groundwater Exploration: Discovering New Water Resources in Consolidated Rocks Using Innovative Hydrogeological Concepts, Exploration, Drilling, Aquifer Testing and Management Methods; John Wiley & Sons: Hoboken, NJ, USA, 2004; p. 321. [Google Scholar]
  29. Mishra, S.K.; Singh, V. Soil Conservation Service Curve Number (SCS-CN) Methodology; Water Science and Technology Library; Kluwer Academic Publisher: Dordrecht, The Netherlands, 2003. [Google Scholar]
  30. Abraham, S.; Huynh, C.; Vu, H. Classification of Soils into Hydrologic Groups Using Machine Learning. Data 2019, 5, 2. [Google Scholar] [CrossRef] [Green Version]
  31. Anderson, J.R.; Hardy, E.E.; Roach, J.T.; Witmer, R.E. A Land Use and Land Cover Classification System for Use with Remote Sensor Data; Professional Paper 964; The U.S. Geological Survey (USGS) Land Cover Institute (LCI): Sioux Falls, SD, USA, 1976.
  32. Congalton, R.G.; Green, K. Assessing the Accuracy of Remotely Sensed Data; CRC Press: Boca Raton, FL, USA, 2019; p. 346. [Google Scholar]
  33. Kumar, V.; Remadevi, V. Kriging of Groundwater Levels—A Case Study. J. Spat. Hydrol. 2006, 6, 81–92. [Google Scholar]
  34. Mosca, S.; Giovanni, G.; Kluga, W.; Bellasioa, R.; Bianconia, R. A Statistical Methodology for the Evaluation of Long-Range Dispersion Models: An Application to the ETEX Exercise. Atmos. Environ. (ETEX Spec. Issue) 1998, 32, 4307–4324. [Google Scholar] [CrossRef]
  35. Feidas, H.; Kokolatos, G.; Negri, A.; Manyin, M.; Chrysoulakis, N.; Kamarianakis, Y. Validation of an infrared-based satellite algorithm to estimate accumulated rainfall over the Mediterranean basin. Theor. Appl. Climatol. 2008, 95, 91–109. [Google Scholar] [CrossRef]
  36. Li, J.; Heap, A.D. A review of comparative studies of spatial interpolation methods in environmental sciences: Performance and impact factors. Ecol. Inform. 2011, 6, 228–241. [Google Scholar] [CrossRef]
  37. Şen, Z. Chapter Six-Climate Change, Droughts, and Water Resources. In Applied Drought Modeling, Prediction, and Mitigation; Şen, Z., Ed.; Elsevier: Boston, MA, USA, 2015; pp. 321–391. [Google Scholar]
  38. Lin, H.-T.; Ke, K.-Y.; Tan, Y.-C.; Wu, S.-C.; Hsu, G.; Chen, P.-C.; Fang, S.-T. Estimating Pumping Rates and Identifying Potential Recharge Zones for Groundwater Management in Multi-Aquifers System. Water Resour. Manag. 2013, 27, 3293–3306. [Google Scholar] [CrossRef]
  39. Tsai, J.-P.; Chen, Y.-W.; Chang, L.-C.; Kuo, Y.-M.; Tu, Y.-H.; Pan, C.-C. High Recharge Areas in the Choushui River Alluvial Fan (Taiwan) Assessed from Recharge Potential Analysis and Average Storage Variation Indexes. Entropy 2015, 17, 1558–1580. [Google Scholar] [CrossRef] [Green Version]
  40. Sorí, R.; Nieto, R.; Vicente-Serrano, S.M.; Drumond, A.; Gimeno, L. A Lagrangian perspective of the hydrological cycle in the Congo River basin. Earth Syst. Dyn. 2017, 8, 653–675. [Google Scholar] [CrossRef] [Green Version]
Figure 1. (a) The study area in Choushui River groundwater basin (CRGB) and the monitoring well locations for the first-layer groundwater level and precipitation (Central Geological Survey (CGS): The World Geodetic System (WGS) 1984 The Universal Transverse Mercator (UTM) Zone 51N); (b) the land use map for the CRGB based on the MODIS Land Cover Type product (MCD12Q1) provided by the United States Geological Survey (USGS).
Figure 1. (a) The study area in Choushui River groundwater basin (CRGB) and the monitoring well locations for the first-layer groundwater level and precipitation (Central Geological Survey (CGS): The World Geodetic System (WGS) 1984 The Universal Transverse Mercator (UTM) Zone 51N); (b) the land use map for the CRGB based on the MODIS Land Cover Type product (MCD12Q1) provided by the United States Geological Survey (USGS).
Water 12 03097 g001
Figure 2. An example of the calculated Modified Precipitation (MoP) for January 2006: (a) the precipitation data for January 2006 based on original Climate Hazards Group InfraRed Precipitation with Station (CHIRPS) data, and (b) the MoP data calculated by adding the original CHIRPS data with the interpolated residuals obtained from the regression analysis between the rain gauge stations and the CHIRPS data.
Figure 2. An example of the calculated Modified Precipitation (MoP) for January 2006: (a) the precipitation data for January 2006 based on original Climate Hazards Group InfraRed Precipitation with Station (CHIRPS) data, and (b) the MoP data calculated by adding the original CHIRPS data with the interpolated residuals obtained from the regression analysis between the rain gauge stations and the CHIRPS data.
Water 12 03097 g002
Figure 3. The omnidirectional semivariogram obtained by using the Gaussian variogram model for January 2006. The fitted parameters of the sill, nugget (C0), and range are 1094, 85, and 15295 m, respectively.
Figure 3. The omnidirectional semivariogram obtained by using the Gaussian variogram model for January 2006. The fitted parameters of the sill, nugget (C0), and range are 1094, 85, and 15295 m, respectively.
Water 12 03097 g003
Figure 4. The comparison of the GWL based on two different geostatistical interpolation methods: (a) the map of GWL using Ordinary Kriging (OK) for January 2006; (b) the map of GWL using Regression Kriging (RK) for January 2006. Note that the land surface elevation has been considered in the RK for the additional variable.
Figure 4. The comparison of the GWL based on two different geostatistical interpolation methods: (a) the map of GWL using Ordinary Kriging (OK) for January 2006; (b) the map of GWL using Regression Kriging (RK) for January 2006. Note that the land surface elevation has been considered in the RK for the additional variable.
Water 12 03097 g004
Figure 5. An example of comparing the kriging error variance for different methods: (a) kriging standard deviation of GWL using Ordinary Kriging (OK) for January 2006; (b) the kriging standard deviation of GWL using Regression Kriging (RK) for January 2006.
Figure 5. An example of comparing the kriging error variance for different methods: (a) kriging standard deviation of GWL using Ordinary Kriging (OK) for January 2006; (b) the kriging standard deviation of GWL using Regression Kriging (RK) for January 2006.
Water 12 03097 g005
Figure 6. The assessment criteria for the OK and RK estimations based on Leave One-Out Cross-Validation (LOOCV) method, which represents the averaged values for each month: (a) Normalized Mean Square Error (NMSE), (b) Root Mean Square Error (RMSE), and (c) correlation coefficient (r).
Figure 6. The assessment criteria for the OK and RK estimations based on Leave One-Out Cross-Validation (LOOCV) method, which represents the averaged values for each month: (a) Normalized Mean Square Error (NMSE), (b) Root Mean Square Error (RMSE), and (c) correlation coefficient (r).
Water 12 03097 g006
Figure 7. The annual average of groundwater recharge from 2006 to 2015.
Figure 7. The annual average of groundwater recharge from 2006 to 2015.
Water 12 03097 g007
Figure 8. The temporal variations of the normalized GWL and GWR for the selected monitoring wells: (a,d) well 9100211 in the proximal fan area, (b,e) well 1004011 in the middle fan area, and (c,f) well 9180211 in the distal fan area. The land surface elevations for these monitoring stations in the proximal, middle, and distal fans are 179, 46, and 5 m, respectively.
Figure 8. The temporal variations of the normalized GWL and GWR for the selected monitoring wells: (a,d) well 9100211 in the proximal fan area, (b,e) well 1004011 in the middle fan area, and (c,f) well 9180211 in the distal fan area. The land surface elevations for these monitoring stations in the proximal, middle, and distal fans are 179, 46, and 5 m, respectively.
Water 12 03097 g008
Figure 9. The distributions of Pearson Correlation for GWL and GWR: (a) the annual average correlation, (b) the wet months (May–October), and (c) the dry months (November–April).
Figure 9. The distributions of Pearson Correlation for GWL and GWR: (a) the annual average correlation, (b) the wet months (May–October), and (c) the dry months (November–April).
Water 12 03097 g009aWater 12 03097 g009b
Figure 10. Monthly long-term averaged Pearson correlation distributions for GWL and GWR: (a) February and (b) July. The maps are consistent with the averaged dry and wet months. Note that the monthly maps are the accumulated results that account for the data obtained from 2006 to 2015.
Figure 10. Monthly long-term averaged Pearson correlation distributions for GWL and GWR: (a) February and (b) July. The maps are consistent with the averaged dry and wet months. Note that the monthly maps are the accumulated results that account for the data obtained from 2006 to 2015.
Water 12 03097 g010
Figure 11. Time series of hourly GWL versus precipitation for a continuous rain event from 22 May 2015 to 24 May 2015: (a) the well locations in the east (Well 9100211) and in the west (well 9090111) of the proximal fan (see Figure 1 for the details of the well locations), (b) the well location (well 1004011) in the middle fan, and (c) the well location (9180211) in the distal fan. Note that the recharge calculation is based on the Composite Curve Number (CCN) method, and the recharge at the well locations was extracted for presentation purposes.
Figure 11. Time series of hourly GWL versus precipitation for a continuous rain event from 22 May 2015 to 24 May 2015: (a) the well locations in the east (Well 9100211) and in the west (well 9090111) of the proximal fan (see Figure 1 for the details of the well locations), (b) the well location (well 1004011) in the middle fan, and (c) the well location (9180211) in the distal fan. Note that the recharge calculation is based on the Composite Curve Number (CCN) method, and the recharge at the well locations was extracted for presentation purposes.
Water 12 03097 g011
Table 1. Results of linear regression analyses among groundwater level (GWL), surface elevation (DEM), and MoP.
Table 1. Results of linear regression analyses among groundwater level (GWL), surface elevation (DEM), and MoP.
Main VariableSecondary Variablesrr2p-ValueSignificance
GWLSurface Elevation (DEM)0.970.952.2 × 10−6DEM (significant)
GWLMoP0.700.495.4 × 10−6MoP (significant)
GWLSurface Elevation, MoP0.970.952.2 × 10−6DEM (significant)
MoP (insignificant)
DEMMoP0.680.461.55 × 10−5MoP (significant)
Table 2. The list modified Curve Number (CN) for the CRGB.
Table 2. The list modified Curve Number (CN) for the CRGB.
Land IDLand Use Area (km2)CN for HSGCN × Area
ABCDTotal Area (km2)
1Water60.25989898985904.75
2Mix Forest66.90356073802341.43
3Grassland133.98497080856564.82
4Wetlands43.69255570771092.25
5Crop lands1159.524466778351,018.88
6Urban area255.845470808513,815.09
7Bare soils569.173561748019,920.81
Total2289.34 100,658.03
Composite Curve Number (CCN)44
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Nainggolan, L.; Ni, C.-F.; Darmawan, Y.; Lee, I.-H.; Lin, C.-P.; Li, W.-C. Data-Driven Approach to Assess Spatial-Temporal Interactions of Groundwater and Precipitation in Choushui River Groundwater Basin, Taiwan. Water 2020, 12, 3097. https://doi.org/10.3390/w12113097

AMA Style

Nainggolan L, Ni C-F, Darmawan Y, Lee I-H, Lin C-P, Li W-C. Data-Driven Approach to Assess Spatial-Temporal Interactions of Groundwater and Precipitation in Choushui River Groundwater Basin, Taiwan. Water. 2020; 12(11):3097. https://doi.org/10.3390/w12113097

Chicago/Turabian Style

Nainggolan, Lamtupa, Chuen-Fa Ni, Yahya Darmawan, I-Hsien Lee, Chi-Ping Lin, and Wei-Ci Li. 2020. "Data-Driven Approach to Assess Spatial-Temporal Interactions of Groundwater and Precipitation in Choushui River Groundwater Basin, Taiwan" Water 12, no. 11: 3097. https://doi.org/10.3390/w12113097

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop