Next Article in Journal
Spatiotemporal Distribution of Irrigation Water Use Efficiency from the Perspective of Water Footprints in Heilongjiang Province
Previous Article in Journal
A Review on Interpretable and Explainable Artificial Intelligence in Hydroclimatic Applications
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Decision Making Model for Municipal Wastewater Conventional Secondary Treatment with Bayesian Networks

by
Edgardo Medina
1,*,
Carlos Roberto Fonseca
1,
Iván Gallego-Alarcón
1,
Oswaldo Morales-Nápoles
2,*,
Miguel Ángel Gómez-Albores
1,
Mario Esparza-Soto
1,
Carlos Alberto Mastachi-Loza
1 and
Daury García-Pulido
1
1
Inter-American Institute of Water Sciences and Technology, Autonomous University of Mexico State, km 14.5 Toluca-Atlacomulco Road, Toluca 50295, Mexico
2
Faculty of Civil Engineering and Geosciences, Delft University of Technology, P.O. Box 5, 2600 AA Delft, The Netherlands
*
Authors to whom correspondence should be addressed.
Water 2022, 14(8), 1231; https://doi.org/10.3390/w14081231
Submission received: 13 March 2022 / Revised: 4 April 2022 / Accepted: 6 April 2022 / Published: 11 April 2022
(This article belongs to the Section Wastewater Treatment and Reuse)

Abstract

:
Technical, economic, regulatory, environmental, and social and political interests make the process of selecting an appropriate wastewater treatment technology complex. Although this problem has already been addressed from the dimensioning approach, our proposal in this research, a model of decision making for conventional secondary treatment of municipal wastewater through continuous-discrete, non-parametric Bayesian networks was developed. The most suitable network was structured in unit processes, independent of each other. Validation, with data in a mostly Mexican context, provided a positive predictive power of 83.5%, an excellent kappa (0.77 > 0.75), and the criterion line was surpassed with the location of the model in a receiver operating characteristic (ROC) graph, so the model can be implemented in this region. The final configuration of the Bayesian network allows the methodology to be easily extended to other types of treatments, wastewater, and to other regions.

1. Introduction

The current and growing need for water that is available in sufficient quantity and quality for all has resulted in its reuse (or recovery) near the place of consumption [1,2,3,4,5]. Wastewater treatment has become relevant for sustainable development, the environment and human health [1,6]; however, about 32% of the world’s population lacks coverage of wastewater treatment plants (WWTPs). In most developing countries, construction and operation are a challenge [7]; for example, in Mexico only 34% of municipalities treat their wastewater [8]. According to data from [9], worldwide, 54% of the wastewater produced is treated, where developed countries treat above 90%. However, they require new treatment plants and constantly promote stricter regulations [7].
Technical, social, economic, regulatory, environmental and spatial factors make the process of selecting an appropriate wastewater treatment technology complex [2,6,10,11,12]; in addition, social and political interests and conflicts should also be considered [13]. This requires extensive experience, knowledge and reliable studies [6], including the uncertainty related to the origin of wastewater and operational conditions [14]. It is, therefore, of paramount importance to adopt a rational decision-making procedure that selects the appropriate technologies for wastewater treatment [13] and incorporates the uncertainty of the factors involved.
Decision-making methods can be identified as follows [15,16]: Life Cycle Assessment (LCA), Cost-Benefit Analysis (CBA), Intelligent Systems, Multicriteria Decision Making (MCDM) and Mathematical Models (MM). LCA and CBA provide a cost or environmental impact, useful for projects with specific conditions, but their scope is restricted to finite scenarios, so they are considered tools for decision-making processes [15]. Intelligent systems emulate the human decision-making process, using a set of conditional rules or automatic learning processes. Therefore, the time and resources required to develop these systems are high, limiting their application [17].
Multicriteria decisions aim to order a set of alternatives under decision-maker (DM) defined factors [7,18], giving some importance (weighting) to each decision criterion [15], e.g., the analytical hierarchy process [19,20]. These methods are complemented by tools such as fuzzy logic for considering uncertainty [21,22,23]; however, in the selection of wastewater treatment technologies, the hierarchy of alternatives with regard to each criterion may change depending on the design conditions of the plant, making it difficult to implement a comprehensive system that values any case study in a single modelling.
The decision-making models (DMMs) based on MM represent a tool to gain a comprehensive understanding of the problem characteristics, as they do not require high costs for implementation [16]. However, in multicriteria decisions, and with uncertainty, they may require complex algorithms, making their implementation more difficult [24,25].
In this context, Bayesian networks are graphs where variables (nodes) and their dependencies (arcs) are represented. In the nodes, the distributions of probability for each variable are defined, and the dependencies are determined with range correlations or conditional tables. They have been used in multicriteria decision-making (MCDM) to support decisions in different contexts, because they address in a structured way the uncertainty of the criteria and their interrelationships, due to the convenient use of conditional probabilities [26]. In addition, with observed values of some of the variables and with the dependencies given by the arcs, all sources of uncertainty are propagated to obtain the new probability distributions for the other variables [27].
As for the DMMs used in WWTPs, several applications have been developed, for example, for designing, estimation of energy consumption, operational optimization, improvement of effluent quality, environmental impact, and health risks [2,16]. Nevertheless, choosing an appropriate process is one of the most challenging steps [12] since environmental, social and economic factors must be taken into account, as well as the quality and quantity of wastewater [2,6,7,11]. Uncertainty is also part of all the above variables [14]. That is, both the inputs (characteristics of wastewater and its flow) and the outlets (removal efficiencies, sludge production, by-products with value, costs, among others) are variables that cannot be valued with enough certainty in a design process. In this way, Bayesian networks are a suitable alternative to model such complex processes, with the advantage of being able to integrate both quantitative and qualitative variables in the model [28].
The frequency of use of Bayesian networks in water modeling and management has increased rapidly due to their powerful inference capacity, their convenient decision support mechanisms, and their flexibility and applicability to factors that affect wastewater treatment systems [29], in addition to their ability to provide a visual interpretation of the structures of the model [28]. Wastewater engineers and decision makers can apply this method in risk assessment and prediction applications [30].
Yu et al. [31] developed Bayesian networks to pre-evaluate and contrast the results of prediction models applied to the long-term effect of iron on methane yield in an anaerobic membrane bioreactor, obtaining differences of less than 0.5%. Li et al. [29] proposed a method based on Bayesian networks to model and predict the behavior of a wastewater treatment system based on a modified sequencing batch reactor. According to these results, they concluded that Bayesian networks provide an effective approach to predictive analysis in real time of wastewater treatment systems. Xu et al. [28] applied Bayesian networks to conveniently model the complex processes between anthropogenic activities and water quality. They showed that both quantitative (such as water quality and land use data) and qualitative variables (different seasonal scenarios) can be incorporated into a model. Through the design of a Bayesian network, Herrera-Murillo et al. [32] estimated the probabilities of complying with regulations in wastewater discharges under some alternative scenarios of operation.
According to the aforementioned studies, the advantage of Bayesian networks lies in their flexibility and reliability of application in factors of treatment systems, their visual structure and their ability to evaluate different scenarios of wastewater conditions.
Within the entire process of municipal wastewater treatment, secondary treatment consists of the removal of organic compounds. After primary treatment, this treatment significantly reduces suspended solids and virtually all dissolved organic compounds from the influents to meet a given standard [33].
In this work, it is proposed to develop a Bayesian network-based DMM for the selection of secondary municipal wastewater treatment processes for initial implementation in a Mexican context (based on data of 117 wastewater treatment plants) and, with appropriate assessments, subsequently in a global context. Therefore, this kind of model could be considered the first step in an adequate design process of the wastewater treatment type selected. Furthermore, it would allow the depiction of the expert knowledge acquired through empirical experience. Bayesian networks seen as an MM allow us to address uncertainty in the variables involved [26,34]. Once the network is configured, it can be used to explore different scenarios in the variables [35], and therefore different case studies.

2. Materials and Methods

The development of the DMM for the choice of the unitary process of secondary wastewater treatment is proposed to be carried out in six stages (Figure 1). The first three stages are focused on the selection of the adequate variables (stage 1), building an appropriate and consistent Bayesian network in terms of element independence, direction of dependencies (stage 2), and obtaining marginal probability distribution functions (PDF) and range correlations (stage 3). The later stages address the model validation in the Mexican geographical context. This is, stage 4 focuses on the data collection, depuration and storage of information related to variables of WWTPs operation, such as inlet flow rate, total suspended solids, etc. The validation of the model (stage 5) assesses the predictions made by the model according to the database generated of WWTPs. The model is intended to find both a more adequate process based on the initial conditions and also the most suitable order of the different processes in view of some performance indicators.
Taking into account the WWTP design process, three types of elements are defined in the model: input conditions (ICs) or decision constraints; possible secondary unit processes to choose (UPs) or decision objects; performance indicators (PIs) representing the effects provided by these processes under the above-mentioned ICs (stage 1).
As for the UPs, this research is limited to secondary treatments, and those which are also conventional processes most commonly used for municipal wastewater in Mexico (Table 1). Some types of processes, such as maturation lagoons, are excluded, as they are tertiary or final processes. The most commonly used processes are aerobics, so septic tanks and Imhoff tanks (anaerobic processes) can be omitted from the model. In addition, the main objective of conventional municipal plants is the reduction of suspended solids, biodegradable organic matter, and fecal and total coliforms [36,37], so in the selection it is possible to discard processes focused on nitrifier variants (e.g., nitrification biological contactors).
Some criteria in the literature [36,37,38] suggest the representative variables to define the ICs shown in Table 2. In the set of characteristics of wastewater, those that are not associated with aerobic and conventional secondary unit processes, and which do not represent a disjunction in the decision, are ruled out. A variable that does not generate a disjunction must have values that can be treated in the same way by any process, or regulated, so they impact the performance of the treatment in a non-significant way. Medina et al. [39] describe the appropriate operating intervals of the ICs for each defined UP, which can determine the existence of the disjunction between the processes. Intervals allow UPs to be located at an operation level according to the associated variable (Table 2, operation level). As a result, if processes are able to handle the same level of an IC, there is no disjunction in the decision, and they can be discarded from the model.
Other selection criteria applied by Adams et al. [36], Metcalf and Eddy [37] and Rodgers et al. [38] are considered to determine the elements belonging to the PI group (Table 3). These criteria are related to effluent quality, and monetary, social and environmental impacts. However, the dimensioning of the model towards conventional processes allows us to rule out any variable that represents the treatment of nutrients, refractory organic matter, carcinogens, mutagens, teratogens and toxics, and dissolved inorganic solids.
Social impact variables and health effects are covered by the construction of WWTPs at a certain distance from populations, therefore this factor is considered in the CLS variable of ICs. The main environmental impacts that can be attributed to secondary treatment are carbon-emissions, which are estimated implicitly by energy consumption (ENC) [2], and sludge production (SLD) [40]: PIs previously defined.
The design of the Bayesian networks proposed for this research (stage 2) is based on the determination of dependencies between variables, the type of variable (continuous or discrete) for each element, as well as the estimation of nodes PDFs and the quantification of dependency between variables [35]. Since the objective of the model is to choose the most appropriate secondary unit process, its dependence on the choice of another is meaningless, and therefore the UPs are considered independently of each other. A similar case occurs with ICs, where their dependence is strongly linked to the origin of wastewater.
Unlike UPs and ICs, there are dependencies between some IPs that can be ignored, for example, among OMR and CCO indicators, because in general the most efficient unit processes in removal are the most expensive. However, that dependency is only “active” if a value is set in the OMR variable or CCO (based on Díez-Vegas [41]), which implies a preference of one UP over the others.
There are dependencies between some PIs and ICs; for example, TMP and BOM influence OMR [12,36,42,43,44], but their dependencies are implicit in their PDFs, therefore we do not need to assign dependency arcs between ICs and PIs.
The Bayesian network can work without arcs between the ICs, UPs and PIs themselves, as well as arcs from ICs to PIs, allowing only UPs dependent on PIs and PIs dependent on UPs (Figure 2a). In this way, different potential configurations of the Bayesian network can be conceived considering discrete variables and UPs as states (Figure 2b), continuous-discrete variables and separated UPs in continuous variables (Figure 2c), individual PIs for each UP (Figure 2d), ICs displayed independently by UPs (Figure 2e): and inference of ICs probabilities given the UPs (Figure 2f).
Marginal PDFs and range correlations (stage 3) associated with the most appropriate Bayesian network of the model can be estimated from information found in databases; however, structured expert judgements can become an alternative source of data, especially to support uncertainty analysis [45].
For marginal PDFs of UPs to be defined as continuous variables, a score can be assigned to them, useful for decision making, in a range of 0 to 10 (uniform density function).
The PDFs associated with the BOM variable (Table 4, column 4) can be obtained by means of the reported efficiencies (column 3) in relation to some regulations, such as the Mexican one (30 mg/L BOD5) [46]. For example, the average reported efficiency of an RBC is 82.5% and the maximum 92.5%, so it is appropriate to manage BOD5 concentrations in the range of 171.4 to 400 mg/L (or 300 mg/L, considering the concentration limit in municipal wastewater). In the case of TMP, these data are obtained through expert judgement.
As normal distributions, synthetic random samples are generated [25,50] to obtain the parameters of the marginal PDFs (Table 4, column 5) with the set of four samples of each variable.
The above conception of BOM and TMP variables, although correct, is incomplete because it can lead to inconsistencies for the model. That is, the high range correlations (r > 0.8) obtained between the BOM or TMP with each UP can underestimate the other variables. To solve this inconsistency, the BOM variable can be conceived as the “biodegradable organic matter difference with the optimal one for each process” (MDO). The optimal biodegradable O. M. is the median of each process, assuming that moving away from it implies that the process decreases its probability of treating that O. M., and another process increases its probability of treating it. Therefore, the term “optimal” does not refer to the best performance of the process, but to the maximum eligibility of the process. In the case of TMP, the performance of a process does not depend on its operation at a high or low temperature of wastewater, but on how much its average operating temperature moves away from the optimal operating temperature of each process. Therefore, the TMP variable can be adjusted as “difference in wastewater temperature with optimal process operating temperature”, which is hereinafter referred to as “temperature difference with optimal” (TDO).
Examining the meanings of the optimal values, it follows that they are in the center of the PDFs of each UP (Table 4, column 6), providing four transformed marginal PDFs (column 7). The optimal value of each UP (e.g., BOMUPi-optimal) is located in the marginal (original) distribution of MOB or TMP (Figure 3a). Therefore, the probability of having a difference (e.g., biodegradable O. M.), or lower, with the optimal value, are the cumulative probabilities on both the left and right. In this way, a cumulative probability distribution is defined based on different increments of the variables.
Similarly, the transformed PDFs of each UP (Table 4, column 8) are obtained by means of the optimal values (Figure 3b), which correspond to the median of the PDFs. The new distributions of each UP are useful for estimating the range correlations between the MDO and TDO variables with their respective UP.
CONAGUA (National Water Commission) [48] provides the implemented processes and design flows (installed capacity) of each plant registered in Mexico. Assuming a normal distribution of values, with the PDFs of WWF estimated of each UP (Table 4, column 4), a synthetic random sample is constructed per process, and, with the four joint synthetic samples, the marginal PDF is obtained (column 5).
Due to the lack of data that allow us to correlate the ICs with the proposed score for the UP, in this study, it is proposed to obtain the range correlations of the variables MDO, TDO and WWF with the UPs by conditional probabilities [51] but replacing the probabilities given by the experts with probabilities obtained through PDFs of each process and the medians of the marginal PDFs. For example, to determine the correlation between the WWF variable and the TRF variable, it must be obtained from the experts: the probability that the WWF is greater than 9.2 m3/d if the TRF score is greater than 5.0. Assuming that a score greater than 5.0 results in the process being tempted and chances of being chosen, a judgement may now be required of the experts: the probability that the WWF is greater than 9.2 m3/d if TRF is eligible. The fact that TRF is eligible means that, there are flow rates that it can handle. Additionally, these flows, defined by the PDF, can be taken into account to determine the probability. Therefore, with the marginal distribution of WWF (Figure 4a; Table 4, column 5) the median is located, and with the flow rates distribution of TRF (WWFTRF; column4) is calculated the probability of surplus (conditional probability, column 9), to calculate the range correlation (column 10) using the method described by Morales et al. [51].
Finally, similar to the WWF variable, from the median given by the cumulative marginal PDF of MDO (or TDO) associated with an UP (Fmarginal(MDOUPi)) (Figure 4b; Table 4, column 7) and with the cumulative PDF of the MDO of the same process (F(MDOUPi)) (column 8), the probability of surplus (column 9) required to determine the range correlation (column 10) between MDOUPi and UPi is calculated.
Some variables, such as CLS, can be defined as a continuous–discrete variable with two states, in this case including processes that can be close to populations (up to 200 m) and those that must be away from them (at more than 1000 m) [52]. It can be assumed that in a rural town, where the population centers are distant, land is available far enough from the population to opt for any process without undesirable effects on society. In the case of an urban population, it is likely that only land close to the population is available, and processes that have fewer undesirable effects should be opted for. Therefore, it is assumed that the probability of building a plant at less than 1000 m (near, state 1) is equal to the probability of having an urban population, and the probability of building it at more than 1000 m (away, state 2) is equal to the probability of existence of a rural population. For example, in Mexico 77.6% of the population is urban and 22.4% of the population is rural [49], percentages that can define the probability distribution of the CLS variable (Table 4, column 5). Range correlations (column 10) are calculated directly from the conditional probabilities (column 9) provided by experts, for example, the probability of placing the plant at 1000 m away if AEL is chosen.
The marginal distributions of the OMR, SSR and SLD variables dependent on each of the UPs can be determined with the interval values reported in the literature, e.g., the removal efficiency of O. M. in terms of BOD5 from a trickling filter is between 45% and 81% [36,47]. Marginal PDFs resulting from each variable and process are proposed to estimate with synthetic samples based on information provided by Asano et al. [5], César-Valdez & Vázquez-González [33], Adams et al. [36] and Wang et al. [47]
As for STY, three classes can be distinguished depending on the biological process. The lagoons are a very stable process due to the volume of the reactor, as even the same lagoon is considered as the regulation of the treatment plant [47]. Trickling filters can be considered stable because they have no significant variations in O. M. removal efficiencies, even with fluctuations in hydraulic and organic wastewater loads [53,54]. Activated sludge is an unstable process because it requires the control of variables such as feed/biomass ratio, hydraulic retention time, and amount of aeration, and it is susceptible to bulking (elevation of sludge volume in secondary settler) [47]. With these considerations, it can be established that AELs have stability 3; CBRs and TRFs, stability 2; and ASLs, stability 1. Such stability may vary depending on the process variants, but, as at this stage (research) the effects of the variants are not analyzed, these values will be constant for each process. This results in no correlation between STY and UP scoring, so they cannot be established as Bayesian network nodes, but only as variables displayed in decision support.
The CPY values of each UP can also be considered as constant values, so similar to STY, they are displayed only as a support for the decision: ASL with a complexity of 20; RBC, complexity of 11; TRF, 10; and AEL, 5 [55].
Chhipi-Shrestha et al. [2] provide approximations for estimating the construction cost, operation cost and energy consumption of different treatment processes depending on the operating flow rate. From these, unit values, relative to the flow rate, CCO, OCO and ENC of each UP (Table 5) can be estimated (at increments of 4000 m3/d). These values determine the marginal PDFs of the variables for each UP.
To determine the range correlations between PIs and UPs, three statements derived from PI characteristics are taken into account:
  • The higher rated (higher score) a process to treat certain wastewater is, the higher the chances of obtaining high O. M. and TSS removal increase.
  • The lower graded (lower score) a process is, the higher the chances that sludge production and costs (monetary and environmental) increase.
  • Because PIs are measured directly from and characterized by UP, there is a strong correlation between the UP and its PIs.
The first two statements imply the signs of range correlations, positive for removal efficiencies, and negative for sludge production and costs. As for the magnitude, a value of 0.9 is proposed, according to [56,57,58], as a strong correlation.
Although the Bayesian network integrates the main part of the model, it is necessary to define the decision mechanisms. Under this decision model, it is possible to provide two results: the score of the UP and the probability of a favorable event in the PIs determining the ICs. When a user sets values in ICs, useful probability distributions to determine the probabilities of meeting selection criteria are provided. For example, what is the most appropriate process according to ICs? (higher score), which process is most likely to exceed an O. M.? What process is most likely to be below a certain cost? In order to perform these comparisons, the scores and probabilities of getting a favorable event in a PI are displayed per process on a single web chart. This chart will allow the user to easily decide which variables have the highest weight, according to the criterion, and choose the process. In addition, it allows us to observe quantitatively and globally the advantages and disadvantages presented by the processes.
To evaluate the model, it is necessary to collect information about WWTPs (stage 4). In Mexico, CONAGUA [48] has registered 888 wastewater treatment plants (Figure 5), with the processes of this study. The database has information about the flows and the type of process that was implemented, but information about TSS, the biodegradable O. M., wastewater temperature and proximity to homes, are required. This information can be obtained through operation reports, project proposals, technical reports and local climate reports. Therefore, a necessary database for validation is built upon the complete information about 117 treatment plants.
The performance of a statistical prediction model (stage 5) can be evaluated by measuring the correspondence or agreement between predicted and observed values [59], condensed into an array of hits and errors [60,61]. Valuations provide a number of parameters to measure model performance and a ROC chart that allows to visualize its performance [61].
The user can determine the relevance between the PIs and the score of the UPs to make their choice. In the case of validation, rules are proposed to simulate reality and eliminate the triviality of the set of model choices [61]. In this way, only the score of the UP, OMR, SLD, CCO, OCO and ENC are used for validation. Moreover, because in reality only one unit process is chosen for the project, it is determined that the process that on the radial chart had three or more criteria in its favor was chosen. If three processes are tied with two criteria, the chosen one must be the process with the highest score, since this criterion is derived from the ICs.
In addition to the results of the Bayesian network, the model must be supported under three important conditions. If the amount of TSS is less than 25 mg/L and biodegradable O. M. is less than 50 mg/L BOD5 in the secondary treatment influent, it is recommended to increase the efficiency of the primary settler to meet the limit of the regulations (40 mg/L TSS, 30 mg/L BOD5) [46] without a secondary treatment. On the contrary, if the 275 mg/L TSS and 300 mg/L BOD5 are exceeded, a process for high concentrations should be chosen: ASL. The third condition is based on wastewater flow. AEL and RBC typically handle lower flow rates than TRFs, and TRFs in turn handle flows lower than ASL, so above the TRF limit (1600 L/s, maximum found in the CONAGUA [48] database), only ASL can be chosen.

3. Results

According to the observed arguments in stage 2 of the methodology, the configuration of the selected Bayesian network was the one corresponding to the separated UPs with their own variables (i.e., UP exclusionary) with the direction of the influences (arcs) of the UPs towards the ICs (Figure 6). Although the direction of relationships in this configuration could be considered non-causal, from a mathematical or abstract point of view, Bayesian networks do not impose the direction of the causal arc [62]. In this configuration the MDO and TDO variables have a different marginal distribution in the Bayesian network of each process; WWF and CLS have a single duplicate marginal in each process; and each process has its own PIs and its marginals.
For the rest of the possible configurations (Figure 2) it can be mentioned that discrete Bayesian networks (Figure 2b) allow consideration of the exclusionary nature of UPs; however, the other variables (ICs and PIs) are continuous, the accuracy of which would mean excessive network complexity, making its modeling costly [63]. This study applied continuous-discrete non-parametric Bayesian (NPBN) networks, whose configuration is reduced to the quantification of a marginal distribution per variable and to one (conditional) dependency parameter per arc [35].
The discrete nature of UPs could be treated in an NPBN, defining them in separate variables (Figure 2c). Nevertheless, the results and demands (PIs) of each UP are also excluding; hence, they are derived from a single process. Thus, each process has PIs associated with the same parameters, but in different variables (Figure 2d).
This could be considered as the most appropriate configuration, but there can be “indirect” dependency between UPs when connected by an IC, which is not consistent with the selection process. Although this is solved by properly configuring the rank correlation matrix, or, when instantiating ICs, because communication between UP is closed [41], it is preferable to represent the phenomenon with UPs that depend on their own ICs (Figure 2e).
The fourth configuration suggested complete independence between UPs but required obtaining range correlations of up to three conditions (Table 6, column A), which generated inconsistencies such as the overvaluation of some ICs when determined by expert judgement. The Bayesian network, whose arcs are directed from the PIs to the ICs (Figure 2f), is required to calculate only unconditional correlations (Table 6, column B). On one hand, from the literature data on the TSS, MDO, and WWF variables in relation to UPs (Table 6, column C), two rank correlations were obtained (r UPi,MDO, r UPi,WWF). On the other hand, a structured expert judgment was carried out to obtain PDFs and remaining rank correlations from stage 3 (Table 7).
In this expert judgement, the values obtained for the temperature variable in RBC and TRF are equal, as it is expected. From Table 2, these two processes show the same operation level with respect to temperature, i.e., both processes work with the same type of microorganisms, attached biofilm and have natural (not forced) aeration. Therefore, their appropriate operating temperatures must be similar.
A reliable expert judgment requires calibration questions to assess the performance of the experts and give them a weight in the combination of their opinions (decision maker). Therefore, a questionnaire involving all the ICs was elaborated to determine the marginal distributions of the TDO variable and the rank correlations of the CLS variable with the UPs. The variables TSS, BOM and WWF depicted the calibration questions (see Appendix A, Table A1).
From the data of the structured expert judgment treated with the Excalibur v1.0 program [64], the weight of the results was determined: expert 3, 30.69%; and expert 4, 69.31% (Table A2). Such results, where one or two experts get all the weight of the DM are not erroneous or atypical results [65], as in the Colson and Cooke [66] study, where two experts out of nine take virtually all the weight of the information. According to [65], each expert can access different information or can interpret it differently, so there is no logical reason why all experts must have the same state of knowledge.
DM reliability (22.82%; calibration score) resulted above the recommended limit (5%) [67], and the DM information score (0.7786) was only three times less than the greatest score provided by expert 2 (2.309). Due to the proper calibration score and admissible information score obtained, it was not necessary to interview more experts to get the necessary values (Table 8). This means that experts 3 and 4 are highly informative and provide the required data.
With the model developed and the database generated (stage 4), the selection of the treatment was carried out by the plant. For example, Table 9 shows data from four strategic WWTPs: one with successful selection, one with incorrect selection, and two belong to a group of 40 plants in the database whose type of processing was assumed to be the least appropriate. Four MDO and TDO values can be seen because these are obtained in return to the optimal value of each process, and only one for WWF and CLS value.
The values were entered on the ICs nodes (Figure 7a), and as a result updated marginal PDFs were generated in the nodes of the UP scores (Figure 7b). In the example, it is observed that ASL is the most appropriate process, by the average scoring values, which are derived from the ICs. This result already represents a trend in the decision, but it is desirable to assess PIs to strengthen the selection. With marginal PDFs of PIs derived from the score (Figure 7c), it was possible to determine useful probabilities for decision-making, e.g., the probability of exceeding, with one process, the O. M. removal efficiency required to comply with a standard (or a proposed efficiency value) or the probability of having, with the process, a lower cost than the average of the four processes (or a required cost). To assist in comparing the four scores and the four probabilities obtained from each PI, it was proposed to place the values on a web chart (Figure 7d) that shows which processes are most favored according to PIs.
For stage 5, all 117 case studies were rated. However, as a comparison, in the case of Plant 001, it can be observed that the process chosen by score (Figure 8) matches that implemented (ASL) and shows three processes with two favorable indicators. The choice of model and case will be corroborated when analyzing the input values: high values of biodegradable O. M., a temperature close to the optimal of ASL and location close to the population. Perhaps the only inconvenience to choose ASL is a relatively low flow, so RBC is approaching similar values in scoring and could be contemplated.
The input data for Plant 002 (from Table 9) indicate that the process to be chosen was a TRF, as suggested by the model (it has three PIs in its favor; Figure 8). The values of biodegradable O. M. are so low that they support the decision to choose TRF, and the temperature of the WW approaches its optimal value (22.5 °C) and there is a favorable flow for the process. Therefore, because an ASL was implemented, this case, along with similar ones, are considered to have an unsuitable process.
An opposite case is observed at the Plant 010 where a TRF is implemented, and the model chooses ASL. It is observed that the biodegradable O. M. that enters has a very high concentration for a TRF. Although, it is possible to treat these concentrations with the implemented process, it is necessary to raise the costs of construction and operation, which leads to analyzing a balance between cost and efficiency obtained. In this way, the choice is between an expensive TRF with less chance of obtaining efficiencies or an ASL, which is also expensive, with a high chance of efficiencies. With this approach, this case study and similar ones that were found, were considered as unsuitable processes.
In the last example, the model chose AEL, but ASL was implemented. It was considered an incorrect choice because the plant is close to a population center, which can definitely eliminate this option, and also the magnitude of the flow exceeds what it (AEL) can conventionally treat. On the other hand, the tendency of the model to choose AEL is justified by having a process-friendly temperature and biodegradable O. M.: 27.7 °C with an optimal process temperature of 29.3 °C; and 225 mg/L BOD5 in the secondary treatment influent with optimal BOM of the same value.
For validation, the valuation parameters by UP and globally were calculated (Table 10) according to the equations shown by Fielding and Bell [61].
According to its location on the ROC chart (Figure 9), the model meets the established criterion (exceed 90% of the unit area); the PP+ of the model achieves a good percentage of positive predictions, 83.5%; the PP− (93.9%) influences the validation of the model to a lesser extent, since, for each case study, three processes are not observed; and, an excellent Kappa (0.77 > 0.75) is achieved [68]. UP predictions are also well positioned (above criterion) for RBC, TRF, and ASL processes (Figure 9). Only AEL predictions showed very low performance. This may be due to the fact that when choosing this process, the availability of the land for designers is decisive. Nevertheless, the model takes into account three other factors (MOB, TMP, WWF) and decides the availability of the land according to the CLS. This suggests adding a criterion to this variable, discarding the process when the plant is nearby, or giving it greater precision.
The case studies with unsuitable processes found in the database reflect a problem that exists in Mexico and probably in other regions, justifying the implementation of a methodology for choosing the wastewater treatment process. However, about 50% of the information to build the Bayesian network was obtained from American literature, so it is possible that the model can be implemented in the U.S.A. if validation is performed with data from the region. In this way, it can be generalized that the model can be useful for any region by validating it with endemic data and, if required, adjusting it with data from the region under study. One objective that is visualized is to achieve the development of a comprehensive model that can be implemented in any region, or even, achieving greater dissemination of the model, standardizing decision trends globally.
Due to the flexibility of the latest model configuration and because any process can be evaluated based on its inputs (ICs) and outputs (PIs), the model can be adjusted for other types of treatment and/or variables can be added to the Bayesian network. With the structure where the IC and PI distributions depend on the evaluation (distribution) of the UP it is possible to define a set of processes of similar type, and according to the characteristics of the set add the variables that would be involved in the decision of that set. In addition, because of that flexibility, any model could be easily improved by adding (or modifying) the necessary variables. For example, the model of this research can be improved by adding the nitrogen (or phosphorus) removal variable, for which, due to the last direction of arcs between IC and UP, it is only necessary to investigate the data of each process linked to this parameter. An unconventional treatment, such as membranes, can even be added, or process variants can be separated, requiring data according to the ICs and PIs that were defined in this study and structuring the process with their own variables.
Once the model is considered suitable for use in Mexico, or in later regions, it is important to use and broadcast the software, with an interface that takes the input data (ICs) of the design and deployment, in the proposed web graph, and the results (PIs). The Uninet software, with which the Bayesian network of this research was modeled, has a library (UninetEngine) for programming in several languages (C++, Delphi, Matlab, among others). This library requires the acquisition of a license. The Netica API is also available for discrete, free-distribution networks, for use in Java or C++, and has functions to emulate continuous variables, which are possible to use if conditional arrays are a bit complex or have dependencies on a single variable, as in the case of this investigation.

4. Conclusions

A statistical model based on Bayesian networks has been generated that underpins the choice of the optimal process of secondary municipal wastewater treatment, based on statistical data and mathematical justifications. According to the results of the validation, it provides an acceptable level of certainty based on technical, economic, social and environmental performance, so the methods carried out are supported. As this Bayesian Network-based model showed satisfactory results for aerobic wastewater treatment, it could be expanded to the selection of other types of processes, such as anaerobics or membranes, so their inclusion in later versions are suggested.
In the methodology of the model, it was important, in addition to determining the parameters associated with the variables, to define the appropriate conception of the variables and their relationship with the UPs to define the adequate marginal PDFs and range correlations, avoiding inconsistencies in the model. As for the PDFs of each process and range correlations that could not be obtained through databases, expert judgment was successfully used in obtaining this information. Some Bayesian network correlations were successfully estimated by conditional probabilities obtained from the PDFs of each process associated with the parameter of a variable and the medians of marginal PDFs associated with the same variable.
Unlike other tools for WWTPs, this model is a support mechanism prior to the design of the treatment train, which provides results based on data and the experience of experts, with no need of dimensioning the treatment train. Therefore, in addition to the known advantages, such as addressing uncertainty, an easy-to-view structure, and the evaluation of different scenarios with a single model, this model acquires significance because it provides objective and comparable information on four types of secondary treatment for a decision-making process of selection.
The criteria for choosing conventional secondary wastewater treatment were related to model variables, variable type, model type, and output type. The appropriate variable type for unit processes was the discrete variable; input variables and performance indicators could be better visualized and processed as continuous variables. The most suitable type of networks for the variables involved were continuous-discrete, non-parametric Bayesian networks, structured in different Bayesian networks by unit processes. It was useful for the choice to visualize the score value of the UP and the probabilities of meeting design demands.
Because positive predictive power and negative predictive power exceeded the required value, the Kappa parameter indicated a satisfactory valuation, and the location of validity on the ROC chart surpassed the criterion line, the model is considered valid to support the choice of secondary wastewater treatment in Mexico.
The model’s adaptability to obtaining information in an elementary way, i.e., by parameters or variables seen in isolation, allows the development methodology to be easily extended to other types of treatments of wastewater, that is, it can be used for any type of wastewater treatment plant, and also allows the model to be expanded and improved for application in other regions.

Supplementary Materials

The following supporting information can be downloaded at: http://iitca.uaemex.mx/cira/vinculacion/decision-support-wastewater-treatment (accessed on 7 April 2022), softDSSWT.zip with the functions conforming the informatics tool (Medina, E.; Fonseca, C.R.).

Author Contributions

Individual contributions of authors have been acknowledged for the following roles: conceptualization, E.M., C.R.F., I.G.-A., O.M.-N. and D.G.-P.; methodology, E.M., C.R.F. and O.M.-N.; software, E.M. and C.R.F.; validation, E.M., C.R.F., I.G.-A., O.M.-N. and M.Á.G.-A.; formal analysis, E.M., C.R.F. and O.M.-N.; investigation, E.M.; resources, E.M., C.R.F., D.G.-P. and M.E.-S.; data curation, E.M.; writing—original draft preparation, E.M., C.R.F., I.G.-A. and O.M.-N.; writing—review and editing—, visualization and supervision, E.M., C.R.F., I.G.-A., O.M.-N., M.Á.G.-A., M.E.-S., C.A.M.-L. and D.G.-P.; project administration, E.M., C.R.F., I.G.-A. and D.G.-P.; funding acquisition, C.R.F. and O.M.-N. All authors have read and agreed to the published version of the manuscript.

Funding

Authors appreciate the support given by CONACyT (Mexico) through profile 707989, UAEMex through project 6497/2022CIB and TU-Delft for support the publication.

Data Availability Statement

Data is contained within the article or Supplementary Materials.

Acknowledgments

We are grateful to the academics of the Inter-American Institute of Technology and Water Sciences (IITCA) and the industry professional, for their participation in expert judgment. This work has been translated by Edgar R. Diz–CTP 64500 +52-722-771-6058.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

AELAErobic Lagoons
ASLActivated Sludges
BODBiochemical Oxygen Demand
BOMBiodegradable Organic Matter
BNBayesian network
CBACost Benefit Analysis
CCOConstruction Cost
CLSCLoSeness to households
CONAGUANational Water Comission
CPYprocess ComPlexitY
DMDecision Maker
DMMDecision Making Model
ENCENergy Consumption
ICInput Condition
LCALife Cycle Assessment
NPBNcontinuous-discrete No Parametric Bayesian Network
MCDMMultiCriteria Decision Making
MDObiodegradable organic Matter Difference with Optimal
MMMathematical Model
O. M.Organic Matter
OCOOperation COst
OMRbiodegradable Organic Matter Removal efficiency
PDFProbability Distribution Function
PIPerformance Indicator
RBCRotating Biological Contactors
ROCReceiver Operating Characteristic
SLDSLuDge production
TSSTotal Suspended Solids
SSRtotal Suspended Solids Removal efficiency
STYprocess STabilitY
TMPTeMPerature
TDOTotal Suspended Solids
TRFTRickling Filter
TSSTotal Suspended Solids
UPUnitary Process
WWWasteWater
WWFWasteWater Flow
WWTPWasteWater Treatment Plant

Appendix A. Structured Experts Judgement

Table A1. Data obtained from experts for the determination of marginal PDFs of the TDO variable and range correlations of UP and CLS.
Table A1. Data obtained from experts for the determination of marginal PDFs of the TDO variable and range correlations of UP and CLS.
ICUPIDExpertR
1234
Quantile (%)
55095550955509555095
TSSCBR6.1175212.525025032540050250450100225350381
FPR7.120025030020027535050125200100150200160
LAE8.180011501500250325400100300500100300500500
LAS9.17008501000300400500100275450100300500458
BOMCBR6.2100125150300500700100275450100225350286
FPR7.28010012040057575050150250100225350120
LAE8.2500700900500625750100275450100300500375
LAS9.2300500700400550700150325500100300500344
TDOCBR6.31520251015201022.535522.540E
FPR7.31520251012.5151022.535522.540E
LAE8.31220281520251826.5352532.540E
LAS9.31521.52815202552545102540E
WWFCBR6.460,00090,000120,00040506020501010,000502254003812
FPR7.4100,000150,000200,0006070802010,01020,000400120020005523
LAE8.440,00060,00080,000405060201010200020101020002809
LAS9.4100,000150,000200,00040506020,000110,000200,000200051,000100,0009267
CLSCBR6.557.51011.52406080105090E
FPR7.5512.52011.52512.520103050E
LAE8.520355011.521017.525110.520E
LAS9.570809011.52305070103050E
Combining each IC with a UP (IC, UP): IC values that can successfully handle by a PU in the quantiles (5, 50, and 95%). For example, (TSS, TRF): The amount of TSS that TRF can properly treat in those quantiles. R: realization, the actual value of the calibration question; E: Information sought from the experts; ID: The ID of the question.
Table A2. Expert calibration and information.
Table A2. Expert calibration and information.
Expert1234DM
Calibration score0.0000020.0000010.0610.1240.228
Information score1.0262.2310.8220.9090.779
Weigh0.0010.00230.69069.310
DM: Decision maker.

References

  1. Chrispim, M.C.; de Souza, F.D.; Scholz, M.; Nolasco, M.A. A framework for sustainable planning and decision-making on resource recovery from wastewater: Showcase for São Paulo megacity. Water 2020, 12, 3466. [Google Scholar] [CrossRef]
  2. Chhipi-Shrestha, G.; Hewage, K.; Sadiq, R. Fit-for-purpose wastewater treatment: Conceptualization to development of decision support tool (I). Sci. Total Environ. 2017, 607–608, 600–612. [Google Scholar] [CrossRef] [PubMed]
  3. Jiménez-Cisneros, B. Water Recycling and Reuse: An Overview. In Water Reclamation and Sustainability; UNESCO: Paris, France, 2014. [Google Scholar] [CrossRef]
  4. U.S. Environmental Protection Agency (U.S. EPA). Guidelines for Water Reuse; U.S. Agency for International Development: Washington, DC, USA, 2012. [Google Scholar]
  5. Asano, T.; Burton, F.L.; Leverenz, H.L.; Tsuchihashi, R.; Tchobanoglous, G. Water Reuse: Issues, Technologies, and Applications; Metcalf and Eddy: New York, NY, USA, 2007. [Google Scholar]
  6. Ullah, A.; Hussain, S.; Wasim, A.; Jahanzaib, M. Development of a decision support system for the selection of wastewater treatment technologies. Sci. Total Environ. 2020, 731, 139158. [Google Scholar] [CrossRef] [PubMed]
  7. Arroyo, P.; Molinos-Senante, M. Selecting appropriate wastewater treatment technologies using a choosing-by-advantages approach. Sci. Total Environ. 2018, 625, 819–827. [Google Scholar] [CrossRef]
  8. Instituto Nacional de Estadística Geografía e Informática (INEGI). Comunicado de Prensa Núm. 127/17; INEGI: Aguascalientes, Mexico, 2017. [Google Scholar]
  9. Aquastat. FAO’s Global Information System on Water and Agriculture: Wastewater. Available online: www.fao.org/aquastat/en/overview/methodology/wastewater (accessed on 30 May 2021).
  10. Jajac, N.; Marović, I.; Rogulj, K.; Kilić, J. Decision support concept to selection of wastewater treatment plant location-the case study of Town of Kutina, Croatia. Water 2019, 11, 717. [Google Scholar] [CrossRef] [Green Version]
  11. Nasiri, F.; Savage, T.; Wang, R.; Barawid, N.; Zimmerman, J.B. A system dynamics approach for urban water reuse planning: A case study from the Great Lakes region. Stoch. Environ. Res. Risk Assess. 2013, 27, 675–691. [Google Scholar] [CrossRef]
  12. Tchobanoglous, G.; Burton, F.L.; Stensel, H.D. Wastewater Engineering: Treatment and Reuse; McGraw-Hill Publishing: New York, NY, USA, 2003. [Google Scholar]
  13. Kalbar, P.P.; Karmakar, S.; Asolekar, S.R. The influence of expert opinions on the selection of wastewater treatment alternatives: A group decision-making approach. J. Environ. Manag. 2013, 128, 844–851. [Google Scholar] [CrossRef]
  14. Roozbahani, A.; Zahraie, B.; Tabesh, M. Integrated risk assessment of urban water supply systems from source to tap. Stoch. Environ. Res. Risk Assess. 2013, 27, 923–944. [Google Scholar] [CrossRef]
  15. Walling, E.; Vaneeckhaute, C. Developing successful environmental decision support systems: Challenges and best practices. J. Environ. Manag. 2020, 264, 110513. [Google Scholar] [CrossRef]
  16. Mannina, G.; Rebouças, T.F.; Cosenza, A.; Sànchez-Marrè, M.; Gibert, K. Decision support systems (DSS) for wastewater treatment plants—A review of the state of the art. Bioresour. Technol. 2019, 290, 121814. [Google Scholar] [CrossRef]
  17. Pick, R.A.; Weatherholt, N. A Review On Evaluation And Benefits Of Decision Support Systems. Rev. Bus. Inf. Syst. 2013, 17, 7–20. [Google Scholar] [CrossRef] [Green Version]
  18. Ahmadi, A.; Kerachian, R.; Emami-Skardi, J.M.; Abdolhay, A. A stakeholder-based decision support system to manage water resources. J. Hydrol. 2020, 589, 125138. [Google Scholar] [CrossRef]
  19. Paul, M.; Negahban-Azar, M.; Shirmohammadi, A.; Montas, H. Assessment of agricultural land suitability for irrigation with reclaimed water using geospatial multi-criteria decision analysis. Agric. Water Manag. 2020, 231, 105987. [Google Scholar] [CrossRef]
  20. Zavadskas, E.K.; Turskis, Z.; Kildiene, S. State of art surveys of overviews on MCDM/MADM methods. Technol. Econ. Dev. Econ. 2014, 20, 165–179. [Google Scholar] [CrossRef] [Green Version]
  21. Chen, S.M.; Chu, Y.C. Multiattribute decision making based on U-quadratic distribution of intervals and the transformed matrix in interval-valued intuitionistic fuzzy environments. Inf. Sci. 2020, 537, 30–45. [Google Scholar] [CrossRef]
  22. Ramalho, F.D.; Ekel, P.Y.; Pedrycz, W.; Pereira-Júnior, J.G.; Luís-Soares, G. Multicriteria decision making under conditions of uncertainty in application to multiobjective allocation of resources. Inf. Fusion 2019, 49, 249–261. [Google Scholar] [CrossRef]
  23. Ekel, P.; Kokshenev, I.; Parreiras, R.; Pedrycz, W.; Pereira, J. Multiobjective and multiattribute decision making in a fuzzy environment and their power engineering applications. Inf. Sci. 2016, 361–362, 100–119. [Google Scholar] [CrossRef]
  24. Cuevas-Velásquez, V.; Sordo-Ward, A.; García-Palacios, J.H.; Bianucci, P.; Garrote, L. Probabilistic model for real-time flood operation of a dam based on a deterministic optimization model. Water 2020, 12, 3206. [Google Scholar] [CrossRef]
  25. Eppen, G.D.; Gould, F.J.; Schmidt, C.P.; Moore, J.H.; Weatherford, L.R. Introductory Management Science: Decision Modeling with Spreadsheets; Prentice-Hall, Inc.: New Jersey, NJ, USA, 1998. [Google Scholar]
  26. Barton, D.N.; Sundt, H.; Adeva-Bustos, A.; Fjeldstad, H.-P.; Hedger, R.; Forseth, T.; Köhler, B.; Aas, Ø.; Alfredsen, K.; Madsen, A.L. Multi-criteria decision analysis in Bayesian networks—Diagnosing ecosystem service trade-offs in a hydropower regulated river. Environ. Model. Softw. 2020, 124, 104604. [Google Scholar] [CrossRef]
  27. Gehl, P.; Cavalieri, F.; Franchin, P.; Negulescu, C.; Meza, K. Use of Bayesian networks as a decision support system for the rapid loss assessment of infrastructure systems. In Proceedings of the 16th European Conference On Earthquake Engineering, Thessaloniki, Greece, 18–21 June 2018; pp. 1–12. [Google Scholar]
  28. Xu, J.; Jin, G.; Mo, Y.; Tang, H.; Li, L. Assessing anthropogenic impacts on chemical and biochemical oxygen demand in different spatial scales with bayesian networks. Water 2020, 12, 246. [Google Scholar] [CrossRef] [Green Version]
  29. Li, D.; Yang, H.Z.; Liang, X.F. Prediction analysis of a wastewater treatment system using a Bayesian network. Environ. Model. Softw. 2013, 40, 140–150. [Google Scholar] [CrossRef]
  30. Roozbahani, A. Application of Bayesian Networks Modelling in Wastewater Management. In Soft Computing Techniques in Solid Waste and Wastewater Management; Elsevier: Amsterdam, The Netherlands, 2021; pp. 111–130. [Google Scholar] [CrossRef]
  31. Yu, D.; Liang, Y.; Nilusha, R.T.; Ritigala, T.; Wei, Y. Prediction of the long-term effect of iron on methane yield in an anaerobic membrane bioreactor using Bayesian network meta-analysis. Membranes 2021, 11, 100. [Google Scholar] [CrossRef] [PubMed]
  32. Herrera-Murillo, J.; Mora-Campos, D.; Salas-Jimenez, P.; Hidalgo-Gutierrez, M.; Soto-Murillo, T.; Vargas-Calderon, J.; Villalobos-Villalobos, A.; Androvetto-Villalobos, E. Wastewater discharge and reuse regulation in Costa Rica: An opportunity for improvement. Water 2021, 13, 2631. [Google Scholar] [CrossRef]
  33. César-Valdez, E.; Vázquez-González, A.B. Ingeniería de los Sistemas de Tratamiento y Disposición de Aguas Residuales; México, D.F., Ed.; Fundación ICA, A.C.: Mexico City, Mexico, 2003. [Google Scholar]
  34. Landis, W.G.; Markiewicz, A.J.; Ayre, K.K.; Johns, A.F.; Harris, M.J.; Stinson, J.M.; Summers, H.M. A general risk-based adaptive management scheme incorporating the Bayesian Network Relative Risk Model with the South River, Virginia as case study. Integr. Environ. Assess. Manag. 2017, 13, 115–126. [Google Scholar] [CrossRef]
  35. Hanea, A.; Morales-Nápoles, O.; Ababei, D. Non-parametric Bayesian networks: Improving theory and reviewing applications. Reliab. Eng. Syst. Saf. 2015, 144, 265–284. [Google Scholar] [CrossRef]
  36. Adams, C.E.; Aulenbach, D.B.; Bollyky, L.J.; Boyd, J.L.; Buchanan, R.D.; Burns, D.E.; Canter, L.W.; Crits, G.J.; Dahlstrom, D.; Daniels, S.L.; et al. Wastewater Treatment. In Environmental Engineers’ Handbook; CRC Press LLC: Boca Raton, FL, USA, 1999; pp. 507–926. [Google Scholar]
  37. Metcalf & Eddy, Inc. Wastewater Engineering: Treatment, Disposal Reuse; McGraw-Hill, Inc.: New York, NY, USA, 1991. [Google Scholar]
  38. Rodgers, M.; Mulqueen, J.; Carty, G.; O’Leary, G. Wastewater Treatment Manuals: Treatment Systems for Small Communities, Business, Leisure Centres and Hotels; Environmental Protection Agency: Wexford, Ireland, 1999.
  39. Medina-Rivera, E.; Fonseca, C.R.; Gallego-Alarcón, I.; Morales-Nápoles, O.; Gómez-Albores, M.A.; Esparza-Soto, M.; Mastachi-Loza, C.A.; García-Pulido, D. Modelo de decisión para tratamientos secundarios de aguas residuales. Presented at Tercer Coloquio de Investigación en Ingeniería y Doceavo Curso-Taller “Temas Actuales en Ciencia del Agua”, Universidad Autónoma del Estado de México, Toluca, Mexico, 8–11 November 2020. [Google Scholar]
  40. Seifert, C.; Krannich, T.; Guenther, E. Gearing up sustainability thinking and reducing the bystander effect—A case study of wastewater treatment plants. J. Environ. Manag. 2019, 231, 155–165. [Google Scholar] [CrossRef] [PubMed]
  41. Díez, F.J. Local conditioning in Bayesian networks. Artif. Intell. 1996, 87, 1–20. [Google Scholar] [CrossRef] [Green Version]
  42. Huang, Z.; Qie, Y.; Wang, Z.; Zhang, Y.; Zhou, W. Application of deep-sea psychrotolerant bacteria in wastewater treatment by aerobic dynamic membrane bioreactors at low temperature. J. Membr. Sci. 2015, 475, 47–56. [Google Scholar] [CrossRef]
  43. Yuan, Q.; Sparling, R.; Oleszkiewicz, J.A. VFA generation from waste activated sludge: Effect of temperature and mixing. Chemosphere 2011, 82, 603–607. [Google Scholar] [CrossRef]
  44. Rajeshwari, K.V.; Balakrishnan, M.; Kansal, A.; Lata, K.; Kishore, V.V.N. State-of-the-art of anaerobic digestion technology for industrial wastewater treatment. Renew. Sustain. Energy Rev. 2000, 4, 135–156. [Google Scholar] [CrossRef]
  45. Leontaris, G.; Morales-Nápoles, O. ANDURIL—A MATLAB toolbox for ANalysis and Decisions with UnceRtaInty: Learning from expert judgments. SoftwareX 2018, 7, 313–317. [Google Scholar] [CrossRef]
  46. Diario Oficial de la Federación (DOF). Norma Oficial Mexicana NOM-001-SEMARNAT-1996. 2003. Available online: https://www.dof.gob.mx/nota_detalle.php?codigo=5510140&fecha=05/01/2018 (accessed on 4 June 2019).
  47. Wang, L.K.; Pereira, N.C.; Hung, Y.T.; Shammas, N.K. Biological Treatment Processes; Humana Press: New York, NY, USA, 2009. [Google Scholar]
  48. Comisión Nacional del Agua (CONAGUA). Inventario Nacional De Plantas Municipales de Potabilización Y De Tratamiento De Aguas Residuales En Operación; Secretaría de Medio Ambiente y Recursos Naturales: Mexico City, Mexico, 2019. [Google Scholar]
  49. Instituto Nacional de Estadística Geografía e Informática (INEGI). Población Rural y Urbana. 2018. Available online: http://cuentame.inegi.org.mx/poblacion/rur_urb.aspx?tema_P (accessed on 18 July 2019).
  50. Winston, W.L. Operations Research: Applications and Algorithms; Thomson Learning, Brooks/Cole: Belmont, CA, USA, 2004. [Google Scholar]
  51. Morales, O.; Kurowicka, D.; Roelen, A. Eliciting conditional and unconditional rank correlations from conditional probabilities. Reliab. Eng. Syst. Saf. 2008, 93, 699–710. [Google Scholar] [CrossRef]
  52. RAS. Reglamento Técnico Del Sector De Agua Potable Y Saneamiento Basico—Tratamiento De Aguas Residuales; Dirección de Agua Potable y Saneamiento Básico: Bogota, Colombia, 2000. [Google Scholar]
  53. Medina-Rivera, E.A. Evaluación del Medio UAEMex-1 en Filtros Percoladores Ante Distintas Cargas Hidráulicas Y Alturas De Empaque. Master’s Thesis, Universidad Autónoma del Estado de México, Toluca, Mexico, 2017. [Google Scholar]
  54. Habte-Lemji, H.; Eckstädt, H. A pilot scale trickling filter with pebble gravel as media and its performance to remove chemical oxygen demand from synthetic brewery wastewater. J. Zhejiang Univ. Sci. B 2013, 14, 924–933. [Google Scholar] [CrossRef]
  55. Department Of Environmental Conservation. “WWTP Facility Score Sheet.” New York State. 2019. Available online: https://www.dec.ny.gov/docs/water_pdf/wwtpfacscore.pdf (accessed on 16 December 2019).
  56. Joo, H.Y.; Kim, J.W.; Moon, J.H. Use of big data analysis to investigate the relationship between natural radiation dose rates and cancer incidences in Republic of Korea. Nucl. Eng. Technol. 2020, 52, 1798–1806. [Google Scholar] [CrossRef]
  57. Tian, W.; Liu, Z.; Li, L.; Zhang, S.; Li, C. Identification of abnormal conditions in high-dimensional chemical process based on feature selection and deep learning. Chin. J. Chem. Eng. 2020, 28, 1875–1883. [Google Scholar] [CrossRef]
  58. Prion, S.; Haerling, K.A. Making Sense of Methods and Measurement: Spearman-Rho Ranked-Order Correlation Coefficient. Clin. Simul. Nurs. 2014, 10, 535–536. [Google Scholar] [CrossRef]
  59. Ivanescu, A.E.; Li, P.; George, B.; Brown, A.; Keith, S.W.; Raju, D.; Allison, D.B. The importance of prediction model validation and assessment in obesity and nutrition research. Int. J. Obes. 2016, 40, 887–894. [Google Scholar] [CrossRef] [Green Version]
  60. Fawcett, T. An introduction to ROC analysis. Pattern Recognit. Lett. 2006, 27, 861–874. [Google Scholar] [CrossRef]
  61. Fielding, A.H.; Bell, J.F. A review of methods for the assessment of prediction errors in conservation presence/absence models. Environ. Conserv. 1997, 24, 38–49. [Google Scholar] [CrossRef]
  62. Millán, E.; Loboda, T.; Pérez-de-la-Cruz, J.L. Bayesian networks for student model engineering. Comput. Educ. 2010, 55, 1663–1683. [Google Scholar] [CrossRef]
  63. Niazi, M.H.K.; Nápoles, O.M.; van Wesenbeeck, B.K. Probabilistic characterization of the vegetated hydrodynamic system using non-parametric bayesian networks. Water 2021, 13, 398. [Google Scholar] [CrossRef]
  64. TU Delft. Excalibur v1.0: Program for Expert Calibration/Information. 2007. Available online: https://www.tudelft.nl/ewi/over-de-faculteit/afdelingen/applied-mathematics/applied-probability/research/research-themes/risk/software/excalibur/excalibur (accessed on 27 March 2020).
  65. Hartley, D.; French, S. A Bayesian method for calibration and aggregation of expert judgement. Int. J. Approx. Reason. 2021, 130, 192–225. [Google Scholar] [CrossRef]
  66. Colson, A.R.; Cooke, R.M. Expert elicitation: Using the classical model to validate experts’ judgments. Rev. Environ. Econ. Policy 2018, 12, 113–132. [Google Scholar] [CrossRef] [Green Version]
  67. Aspinall, W.P.; Cooke, R.M. Quantifying Scientific Uncertainty from Expert Judgement Elicitation. In Risk and Uncertainty Assessment for Natural Hazards; Cambridge University Press: Cambridge, UK, 2013. [Google Scholar] [CrossRef] [Green Version]
  68. Landis, J.R.; Koch, G.G. The Measurement of Observer Agreement for Categorical Data. Biometrics 1977, 33, 159–174. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Methodology flow diagram for the development of the DMM.
Figure 1. Methodology flow diagram for the development of the DMM.
Water 14 01231 g001
Figure 2. Potential configurations of the Bayesian network for the model.
Figure 2. Potential configurations of the Bayesian network for the model.
Water 14 01231 g002
Figure 3. Determination of (a) marginal PDF transformed and (b) PDF transformed of BOM by UP.
Figure 3. Determination of (a) marginal PDF transformed and (b) PDF transformed of BOM by UP.
Water 14 01231 g003
Figure 4. Conditional probabilities determination by (a) PDF and by (b) cumulative PDF.
Figure 4. Conditional probabilities determination by (a) PDF and by (b) cumulative PDF.
Water 14 01231 g004
Figure 5. Wastewater treatment plants registered in México, own elaboration based on [48].
Figure 5. Wastewater treatment plants registered in México, own elaboration based on [48].
Water 14 01231 g005
Figure 6. Resulting Bayesian network.
Figure 6. Resulting Bayesian network.
Water 14 01231 g006
Figure 7. Construction process of the web graph for selection.
Figure 7. Construction process of the web graph for selection.
Water 14 01231 g007
Figure 8. Web graphs of selected case studies.
Figure 8. Web graphs of selected case studies.
Water 14 01231 g008
Figure 9. ROC graph showing where the model choice validity is located.
Figure 9. ROC graph showing where the model choice validity is located.
Water 14 01231 g009
Table 1. Secondary unit processes (UP) and their variants referred to this study.
Table 1. Secondary unit processes (UP) and their variants referred to this study.
Rotating Biological Contactors (RBC)Aerobic Lagoons (AEL)
a.
Secondary
  • Low rate aerobic
  • High rate aerobic
  • Aerated lagoon
Trickling filter (TRF)Activated sludges (ASL)
a.
Low rate
b.
Intermediate rate
c.
High rate
d.
Super high rate
e.
Roughing
f.
Two stages
  • Conventional
  • Completely mixed
  • Step feed
  • Contact stabilization
  • High-purity oxygen
  • Oxidation ditch
  • Sequencing batch reactor
  • Deep shaft
  • Extended aeration
Own elaboration based on information in [33,36].
Table 2. Identification of input conditions as candidates for model variables and the selected variables.
Table 2. Identification of input conditions as candidates for model variables and the selected variables.
Representative VariablesCTSTOperation LevelDJComments
RBCTRFAELASL
WWCColor and odorCCCCXThey are eliminated in the same treatment processes
Suspended solidsAAAAXIn the influent of secondary treatment, they are usually very low
TemperatureBBCA–CModel variable, temperature (TMP)
Biodegradable organic matterBABCModel variable, biodegradable organic matter (BOM)
Fats and oilsX XThey are removed in preliminary processes
ROMXX XIf a conventional treatment is considered, they should be low, because the treatment does not eliminate them
CMTTXX X
DISXX X
NutrientsAAAAXIf they are very low, they are fitted; if they are very high, conventional treatment is not suitable
pHBBBBXIt must be previously regulated
OxygenAAAAXIt is constantly added in the treatment
Higher organismsX XThey are removed in pretreatment processes
Pathogenic organismsX XThey are eradicated in disinfection
Wastewater flow or design flowThey do not applyABACModel variable; wastewater flow (WWF)
Wastewater flow variationThey do not applyAAAAXRegulators mechanisms, as tanks, are implemented
Closeness to the nearest homeThey do not applyA–BBBA–BModel variable; closeness (CLS)
Construction land availability and costThey do not applyAABAIt is considered in WWF, only the lagoons ought to handle small flows to limit the land costs
WWC: Wastewater characteristics; ROM: refractory organic matter; CMTT: carcinogens, mutagens, teratogens and toxics; DIS: dissolved inorganic solids; CT: Is it associated with conventional treatment? ST: Is it associated with secondary treatment? DJ: Does it generate a disjunction? A: level of parameter low values; B: level of parameter intermediate values; C: level of parameter high values.
Table 3. Performance indicators defined for the model.
Table 3. Performance indicators defined for the model.
CriterionConsidered Variables
Process efficiencyBOM removal efficiency (OMR)
TSS removal efficiency (SSR)
Environmental constraints
Treatment wastesSludge production (SLD)
Sludge treatment
System stabilityProcess stability (STY)
System complexityProcess complexity (CPY)
Construction and operation costConstruction cost (CCO)
Operation cost (OCO)
Energy consumption (ENC)
Environmental impactHealth impacts (considered in the variable CLS, Table 2)
Social impacts (considered in the variable CLS, Table 2)
Environmental impact (it is evaluated in SLD and ENC)
BOM: Biodegradable organic matter; TSS: total suspended solids; CLS: closeness.
Table 4. Procedure for determining the marginal distributions of the IC and their rank correlations with the UP. Normal distributions are assumed except for the transformed ones.
Table 4. Procedure for determining the marginal distributions of the IC and their rank correlations with the UP. Normal distributions are assumed except for the transformed ones.
ICUP Related to ICSource of DataMean and SD of PDFMean and SD of Marginal PDF by UPOptimal Value (Median)Mean and SD of Transformed Marginal PDFMean and SD of Transformed PDFCPR
{1}{2}{3}{4}{5}{6}{7}{8}{9}{10}
BOMRBC (1)[33,36,47]235.70 ± 37.10200.50 ± 53.50235.70MMDOM1 ± SMDOM1MMDO1 ± SMDO1PMDO1RMDO1
TRF (2)119.80 ± 86.50119.80MMDOM2 ± SMDOM2MMDO2 ± SMDO2PMDO2RMDO2
AEL (3)225.00 ± 31.50225.00MMDOM3 ± SMDOM3MMDO3 ± SMDO3PMDO3RMDO3
ASL (4)241.90 ± 41.50241.90MMDOM4 ± SMDOM4MMDO4 ± SMDO4PMDO4RMDO4
TMPRBC (1)Experts JudgementMTMP1 ± STMP1MTMPM ± STMPMOTMP1MTDOM1 ± STDOM1MTDO1 ± STDO1PTDO1RTDO1
TRF (2)MTMP2 ± STMP2OTMP2MTDOM2 ± STDOM2MTDO2 ± STDO2PTDO2RTDO2
AEL (3)MTMP3 ± STMP3OTMP3MTDOM3 ± STDOM3MTDO3 ± STDO3PTDO3RTDO3
ASL (4)MTMP4 ± STMP4OTMP4MTDOM4 ± STDOM4MTDO4 ± STDO4PTDO4RTDO4
WWFRBC (1)[48]3.81 ± 7.239.16 ± 33.14Not appliesNot appliesNot applies0.22−0.74
TRF (2)5.52 ± 8.620.32−0.47
AEL (3)2.81 ± 12.700.31−0.54
ASL (4)24.89 ± 63.860.650.41
CLSRBC (1)[49] and Experts JudgementNot appliesp(state 1) = 0.78Not appliesNot appliesNot appliesPCLS1RCLS1
TRF (2)PCLS2RCLS2
AEL (3)p(state 2) = 0.22PCLS3RCLS3
ASL (4)PCLS4RCLS4
SD: standard deviation; CP: conditional probability; R: rank correlation.
Table 5. Total and unitary CCO, OCO and ENC of the UP, own elaboration based on [2].
Table 5. Total and unitary CCO, OCO and ENC of the UP, own elaboration based on [2].
ProcessRBCTRFAELASLRBCTRFAELASL
Flow (m3/d)CCO (US$)CCO (US₵/m3)
10000.600.570.010.968.237.780.0113.08
40001.211.150.012.624.153.950.018.96
40,0006.385.9926.1817.442.182.058.965.97
400,00048.2144.44296.90155.901.651.5210.165.34
440,00052.6548.51326.78171.761.641.5110.175.34
Mean1.901.769.625.58
Standard deviation2.913.035.171.23
Flow (m3/d)OCO (million US$/year)OCO (US$/m3)
100014.1013.000.0219.8038.7035.600.0454.26
400056.5052.000.0479.2038.6935.580.0354.25
40,000564.70519.400.23791.8038.6835.570.0254.24
400,0005646.505192.901.647917.6038.6735.570.0154.23
440,0006211.105712.201.798709.3038.6735.570.0154.23
Mean38.6835.570.01354.23
Standard deviation6.563.4532.1122.11
Flow (m3/d)ENC (kWh/d)ENC (Wh/m3)
10006.0040.00225.001506.005.9240.29225.001506.20
400012.0080.00425.001990.002.9620.12106.13497.38
40,00083.00563.002819.007789.002.0714.0770.46194.73
400,000792.005387.0026,759.0065,785.001.9813.4766.90164.46
440,000871.005923.0029,419.0072,229.001.9813.4666.86164.16
Mean2.0513.9869.94190.36
Standard deviation67.0355.1715.64178.61
Table 6. Rank correlation sorts of two Bayesian network configurations.
Table 6. Rank correlation sorts of two Bayesian network configurations.
UP Dependent on ICICs Dependent on UPAvailable Data
{A}{B}{C}
Rank correlationCorrelation sortRank correlationCorrelation sortCorrelated variablesObtained objective
UPi, TSSCal. in E. J.
r UPi,MDOUnconditionalr UPi,MDOUnconditionalUPi, MDOCal. in E. J. and r UPi,MDO
r UPi,TDO|MDOConditionalr UPi,TDOUnconditional
r UPi,WWF|MDO,TDOConditionalr UPi,WWFUnconditionalUPi, WWFCal. in E. J. and r UPi,WWF
r UPi,CLS|MDO,TDO,WWFConditionalr UPi,CLSUnconditional
r UPi,MDO: Rank correlation of UPi and MDO; r UPi,TDO|MDO: Rank correlation of UPi and TDO given MDO; r UPi,WWF|MDO,TDO: Rank correlation of UPi and WWF given MDO and TDO; r UPi,CLS|MDO,TDO,WWF: Rank correlation of UPi and CLS given MDO, TDO and WWF; r UPi,TDO: Rank correlation of UPi and TDO; r UPi,WWF: Rank correlation of UPi and WWF; r UPi,CLS: Rank correlation of UPi and CLS; Cal. in E. J.: calibration in experts judgment.
Table 7. Resolution of the marginal distributions of the IC and their rank correlations with the UP.
Table 7. Resolution of the marginal distributions of the IC and their rank correlations with the UP.
ICUP Related to ICSource of DataMean and SD of PDFMean and SD of Marginal PDF by UPOptimal Value (Median)Mean and SD of Transformed Marginal PDFMean and SD of Transformed PDFCPR
{1}{2}{3}{4}{5}{6}{7}{8}{9}{10}
BOMRBC (1)[33,36,47]235.70 ± 37.10200.50 ± 53.50235.749.60 ± 36.6029.90 ± 22.400.26−0.68
TRF (2)119.80 ± 86.50119.888.20 ± 49.0068.50 ± 50.600.32−0.51
AEL (3)225.00 ± 31.50225.045.70 ± 34.2025.40 ± 19.000.22−0.75
ASL (4)241.90 ± 41.50241.952.40 ± 38.2033.40 ± 25.000.28−0.62
TMPRBC (1)Experts Judgement22.50 ± 6.15024.80 ± 6.0522.55.20 ± 3.894.90 ± 3.700.48−0.07
TRF (2)22.50 ± 6.1522.55.20 ± 3.894.93 ± 3.700.48−0.07
AEL (3)29.30 ± 5.6529.36.12 ± 4.414.53 ± 3.400.35−0.44
ASL (4)25.00 ± 6.0025.04.85 ± 3.654.81 ± 3.610.50−0.01
WWFRBC (1)[48]3.81 ± 7.239.16 ± 33.14Not appliesNot appliesNot applies0.22−0.74
TRF (2)5.52 ± 8.620.32−0.47
AEL (3)2.81 ± 12.700.31−0.54
ASL (4)24.89 ± 63.860.650.41
CLSRBC (1)[49] and Experts JudgementNot appliesp (state 1) = 0.776Not appliesNot appliesNot applies0.50−0.01
TRF (2)0.730.64
AEL (3)p (state 2) = 0.2240.870.91
ASL (4)0.610.32
SD: standard deviation; CP: conditional probability; R: rank correlation.
Table 8. Values obtained from expert judgment.
Table 8. Values obtained from expert judgment.
ICUPQuantile (%)Normal Distribution Parameters
55095MeanStandard Deviation
Temperature (°C)RBC5.4622.5039.5422.505.68
TRF5.4622.5039.5422.505.68
AEL18.8330.8139.7729.303.49
ASL6.7925.0043.2125.006.07
ICUPQuantile (%)p (CLS < 1000)p (CLS > 1000)
CLSRBC11.1854.7089.060.500.50
TRF5.6021.1149.180.270.73
AEL1.2913.0224.740.130.87
ASL10.5136.1468.140.390.61
Table 9. Input data examples of some cases for the model.
Table 9. Input data examples of some cases for the model.
Characteristic or ParameterUnitPlant (#)
001002010098
Case of studyImplemented process ASLASLTRFASL
Total suspended solids(mg/L TSS)376.0132.00184.60261.00
Biodegradable O. M.(mg/L BOD5)393.5455.90369.00334.00
WW temperature(°C)23.6419.4523.2027.72
Wastewater flow(L/s)100.00200.00237.50171.00
Closeness to homes closeclosecloseclose
Bayesian networkMDO RBC(mg/L BOD5)29.94 *197.9713.3810.25
MDO TRF145.8482.07129.28105.65
MDO AEL40.64187.2724.080.45
MDO ASL23.74 *204.177.1816.45
TDO RBC(°C)1.143.060.705.22
TDO TRF1.143.060.705.22
TDO AEL5.669.866.101.58
TDO ASL1.365.561.802.72
WWF(m3/d) * 10008.64 *17.2820.5214.77
CLS(state)1111
Process chosen by the model ASLTRFASLAEL
* Values used in the example in Figure 7.
Table 10. Successes and errors accounting of the case studies and values of the useful parameters for the validation of the model.
Table 10. Successes and errors accounting of the case studies and values of the useful parameters for the validation of the model.
ParameterRBCTRFAELASLGlobal
T+1605966
F+075113
F−0101314
V−7663724215
F+R0.0000.1000.0650.2000.057
T+R1.0000.857--0.8190.825
PP+1.0000.4620.0000.9830.835
PP−1.0000.9841.0000.2350.939
Kappa1.0000.5460.0000.2930.771
+CP0.0130.0910.0000.9350.260
Sensitivity1.0000.857--0.8190.825
T+: true positives; F+: false positives; F−: false negatives; T−: true negatives; F+R: false positive rate; T+R: true positives rate; PP+: positive predictive power; PP−: negative predictive power; +CP: positive cases prevalence.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Medina, E.; Fonseca, C.R.; Gallego-Alarcón, I.; Morales-Nápoles, O.; Gómez-Albores, M.Á.; Esparza-Soto, M.; Mastachi-Loza, C.A.; García-Pulido, D. Decision Making Model for Municipal Wastewater Conventional Secondary Treatment with Bayesian Networks. Water 2022, 14, 1231. https://doi.org/10.3390/w14081231

AMA Style

Medina E, Fonseca CR, Gallego-Alarcón I, Morales-Nápoles O, Gómez-Albores MÁ, Esparza-Soto M, Mastachi-Loza CA, García-Pulido D. Decision Making Model for Municipal Wastewater Conventional Secondary Treatment with Bayesian Networks. Water. 2022; 14(8):1231. https://doi.org/10.3390/w14081231

Chicago/Turabian Style

Medina, Edgardo, Carlos Roberto Fonseca, Iván Gallego-Alarcón, Oswaldo Morales-Nápoles, Miguel Ángel Gómez-Albores, Mario Esparza-Soto, Carlos Alberto Mastachi-Loza, and Daury García-Pulido. 2022. "Decision Making Model for Municipal Wastewater Conventional Secondary Treatment with Bayesian Networks" Water 14, no. 8: 1231. https://doi.org/10.3390/w14081231

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop