- Open Access
Calibrating and validating a combined accumulation and mobilisation model for water distribution system discolouration using particle swarm optimisation
Smart Water volume 4, Article number: 3 (2019)
A discolouration concept is proposed describing simultaneous pipe wall material accumulation and mobilisation processes that define discolouration in drinking water distribution systems, one of the biggest causes of customer dissatisfaction. Validation of these mathematical forms is presented. The model formulation was shown to maintain the mobilisation functionality of previous validated shear-stress-dependant modelling tool, but requiring only two empirical parameters. Two distinct operational datasets are then analysed and robust empirical model parameter calibration is obtained utilising a refined particle swarm optimisation technique. The model is shown to make usefully accurate simulations for flow mediated events, providing evidence of predictive capabilities. The combined tracking of accumulation and mobilisation behaviour enables assessment of the current and future discolouration risk posed by any pipe irrespective of age or material, allowing pro-active, risk based planning and prioritisation of maintenance interventions to protect the quality of delivered water.
Discolouration material is acknowledged to accumulate continually and ubiquitously across all pipe surfaces in drinking water distribution systems (DWDS) (van Summeren and Blokker 2017, Vreeburg et al. 2008). As a result, any maintenance intervention for managing discolouration risk, such as disruptive invasive techniques or in-operation flow conditioning as developed in the UK to periodically mobilise accumulated material using controlled increases in system shear stress (Husband and Boxall 2016), can only be considered a temporary strategy. Of operational significance therefore is the rate at which particulate matter re-accumulates on the pipe walls, increasing the discolouration risk as the asset condition deteriorates and the frequency at which interventions are required. Models describing the mobilisation of material from pipe walls due to elevated hydraulic forces exist (Vreeburg and Boxall, 2007) and have been verified in the field for distribution pipes and trunk mains (Husband and Boxall, 2010, Husband and Boxall 2016). However, models of material accumulation do not reflect process knowledge obtained from operational experience and supported by extensive field and laboratory studies.
Discolouration risk is difficult to measure directly due to its origin as material on the inside of buried pipe infrastructure. Turbidity and flow data characterising the discolouration response can be collected, however due to the short term sporadic and spatially variable nature of ‘natural’ discolouration events this can be complex (Furnass et al. 2013). However it is possible to collect discolouration data around planned events that disturb pipe accumulated material, although these have inherent risks to water quality and customer perceptions. Water companies therefore need a means for assessing the current discolouration risk not as a complex, typically noisy flow and turbidity time-series, but as a simple, comparable metric. In addition, strategic planning requires a way of assessing how discolouration risk changes in the short, medium and long term under multiple hydraulic scenarios.
With continuous material accumulation highlighting the restricted short-term value from one-off strategies such as pipe renewal or invasive cleaning, maintenance programmes of regular managed flow increases offer a potential for long-term low cost risk management. However, hydraulics and water quality in DWDS are difficult to characterise as they are topographically complex bio-reactors with large surface areas, predominantly access restricted and with limited monitoring opportunities. Without pro-active discolouration risk management, any increase in flow resulting from planned or unplanned events can mobilise accumulated material. As this material represents long term accumulation of low level background components, the hazards to water quality extend beyond simply aesthetic discolouration and can include elevated concentrations such as for iron, manganese or lead, disinfection by-products and microbial biofilm content including potential pathogens. Operational flexibility is however paramount for the management of distribution systems and in particular trunk mains, where failure consequences can be extensive. Water providers therefore need to be able to conduct scheduled maintenance, such as to treatment works, pumping stations and service reservoirs, and also react promptly when such assets fail. These need to be conducted while minimising any negative impact on water quality. It is therefore vital that discolouration processes are well characterised to aid with their management. Efforts to achieve this have however been historically impeded by the heterogeneity of DWDS and by the practical requirements of pipe-specific discolouration studies.
It is difficult to characterise well the biofilms/layers that accumulate on the walls of DWDS without disturbing them (Douterelo et al. 2016, Husband et al. 2016). A model capable of describing material mobilisation and accumulation processes, plus the contributing factors, would equip water providers with the means to better schedule and design both flow conditioning and general maintenance programmes. This would reduce the risk of customers being supplied with discoloured or poor quality water and by understanding return periods would also facilitate whole life costing of those programmes. This paper presents a model that combines accumulation and mobilisation processes and its application to field experimental data. Implementation in a software Application Program Interface (API) allowed validation by calibration to real world data.
The detection of discoloured water at customers’ taps has been shown to be predominately the result of a change in hydraulic conditions within the upstream DWDS mobilising wall bound particulate material (Husband and Boxall, 2010), as shown in Fig. 1. Proactive, operational strategies monitoring the turbidity response to managed flow events have evidenced this as a cost-effective means for reducing the likelihood of discolouration events (Husband and Boxall, 2016). In addition, discolouration modelling has been validated as capable of predicting the turbidity response, due to material mobilisation, when preceding peak flows are exceeded.
Over time continual particulate material flux in the bulk water causes material accumulation on the internal surfaces of DWDS in cohesive layers (Husband and Boxall 2011). The rate of accumulation on pipe walls is a complex function driven by multiple processes including corrosion, adhesion and microbial growth (Abe et al. 2012, Matthieu et al. 2014, Mounce et al., 2016). With a heterogeneous and complex composition, wall-bound discolouration material does not have a single binding shear strength. Most pipes in operational networks do not pose a discolouration risk on a typical daily basis, indicating that accumulated material has a strength versus quantity profile with the lower bound of the strength range defined by the prevailing shear stress i.e. discolouration material is conditioned by the usual pipe hydraulics. The strength profile of wall-bound material can be studied by increasing the shear stress incrementally above the ‘background’ and measuring how much material (the turbidity response) is mobilised (Husband et al. 2008). One way to extend this investigation to examine how material then accumulates over different strength ranges is to then repeat this exercise after a defined period (Husband and Boxall, 2011).
Modelling provides a way of encoding system state information and transaction rules in an abstract form that allows for extrapolation. For example, the above understanding of material mobilisation may be captured by describing mobilisation of wall-bound material occurring when the applied shear stress, τa, exceeds the shear strength, τ, of bound material. This is the basis of the empirical Prediction of Discolouration in Distribution Systems (PODDS) discolouration model (Boxall et al., 2001). This model does not attempt to model corrosion processes, biofilm development, suspended iron flocs or chunks of biofilm directly; instead, material mobilisation is considered to be the mechanism by which turbidity causing, homogeneous material can move between being bound to the pipe wall and being a wash load suspended in the bulk water. The PODDS model estimates the amount of material (bound to the pipe wall) by assuming an equilibrium state with the normal daily hydraulic conditions and then simulates the turbidity response that would result from an excess shear stress event. PODDS model parameters are empirical, estimated through calibration to measured data. Flow trials (flushing of distribution pipework or raising flows in larger diameter trunk mains) are used to generate the turbidity, flow and headloss data required for such calibration. Headlosses can either be estimated using a pipe roughness or measured directly during trials, facilitating concurrent hydraulic network model calibration. PODDS model parameters have demonstrated transferability between sites with analysis showing them sensitive to pipe diameter, pipe material and source water (Boxall and Saul, 2005).
While the PODDS model has been validated to represent mobilisation behaviour, it does not describe the material accumulation processes observed in both field and laboratory studies. In Furnass et al. (2014) a new model, the Variable Condition Discolouration Model (VCDM), was proposed to overcome this shortcoming. It is postulated that wall-bound material with τ ≤ τa is eroded whilst simultaneously material for which τa < τ < τmax accumulates (where τmax is the maximum possible τ with which material binds to the wall). The model therefore tracks the relative amount of discolouration material bound to the pipe wall over time at each of a number of shear strengths. A shear-strength-variable yet time-invariant mobilisation model (see Fig. 2) is combined with the accumulation model as a) it allows for non-serial layer mobilisation so can account for spatial heterogeneity in material composition; and b) it can ensure that the mobilisation rate is a function of excess shear stress. The PODDS model captures mobilisation as a process progressing in series from strong to weak layer, in series. Whereas observation of wall bound material is shown to be heterogeneous at the micro scale (Fish et al. 2016 and Fish et al. 2017), with different strength material exposed simultaneously. Figure 2 attempts to capture this, showing how the distribution of amount of material (φ) across different strengths (τ) would be simultaneously mobilised, but at decreasing amounts, due to each time step of an imposed excess shear stress.
Modelling the accumulation rate could be achieved using a process model, taking account of many factors such as corrosion, temperature and biofilm growth. This however would require a sufficiently-detailed understanding of the inter-dependencies (a set of physical, chemical and biological reactions per distinct environment, with the stoichiometry and kinetics/equilibria being characterised for each biological and chemical reaction), hence likely to be difficult and impractical. For the purposes of this work, material accumulation is considered to be a simple, empirical rate that is a scalar and time-invariant. As further data and process knowledge becomes available, this rate-based model can be refined through decomposition into empirical and process-aware sub-models.
The output from the mobilisation and accumulation model is a rate of material release from the pipe wall into the bulk flow. Together with an understanding of system hydraulics, this can then be fed into existing water quality models to account for advection and mixing and to calculate turbidity at any downstream location. A series of pipe lengths with differing hydraulic or asset characteristics can collectively be modelled in a ‘daisy-chained’ fashion: the turbidity predictions from modelling the most-upstream pipe are supplied as input when modelling the change of state and turbidity for the next pipe downstream, and the output from that sub-model is used as input when considering the next pipe downstream etc. (see Fig. 3).
The VCDM model has three parameters, a relative mobilisation or erosion rate factor βe, an accumulation or regeneration of material rate βr, and a scaling factor α for converting from relative to absolute quantities of discolouration material. The wall state boundary condition for t = 0 is the relative material quantity per tracked shear strength, which is a non-trivial monotonically-increasing function of τ. The material condition function φ (τ,t) is the relative amount of discolouration material bound to the pipe wall with shear strength τ at time t. τ is a vector property, in that it is used to track the range of material strengths at every position on a pipe, and φ (τ,t) ∈ [0,1] where 0 corresponds to complete depletion (no material) and 1 to maximum accumulation.
VCDM detail and mathematical formulations can be found in Furnass et al. (2014) and Furnass (2015). In these publications formulation verification was conducted with artificial data and limited data from operational networks and using manual calibration. The proposed model however still requires validation, primarily to explore the error in predictions made using calibrated models and continuous flow data. The proposed model is only of greater value than PODDS if both mobilisation and accumulation mechanisms can be shown to be valid. The proposed model of material mobilisation differs from the validated PODDS mobilisation mechanism so it must be independently validated.
The aim of the research reported is to calibrate and validate a formulation that attempts to describe the continual processes of material accumulation and mobilisation (due to hydraulic changes) within DWDS and hence inform asset management for the control of discolouration. Specifically, the paper presents information on:
Confirmation that the proposed formulation, with only two mobilisation parameters, retains functionality to describe material mobilisation as currently described by the PODDS model, requiring three parameters.
Implementation of the proposed formulation within a Python environment, in particular integration with optimisation algorithms to enable efficient and repeatable calibration and automated systems to identify periods of interest from long time series data.
To provide evidence of the ability of the model to describe the long term behaviour of both continual material accumulation and mobilisation within a trunk main through calibration, and to provide initial validation of predictive simulations once calibrated.
Validation of the discolouration mobilisation model can be achieved using field data obtained during the flushing of pipes. Flushing durations are sufficiently short relative to the time required for material accumulation, irrespective of if this is within larger trunk mains or smaller diameter District Meter Area (DMA) pipes (Husband and Boxall, 2016), that the accumulation can be assumed to be negligible over the duration of a flush. If flushing is repeated at suitable intervals the processes of accumulation may also be investigated. The relationships between the quantities explicitly referenced by the proposed VCDM are shown in Fig. 4. Temperature and fluid density are not explicitly included in the model as the impact of variation in those factors in DWDS hydraulics is considered negligible. Temperature / seasonal variations are known to influence material accumulation processes, but are not captured in the datasets available here.
The first stage in the confirmation of the formulation was the selection and implementation of search and optimisation routines to enable automated calibration. Key functionality sought was comprehensive exploration of prescribed parameter space, efficiency and repeatability. Automated functionality was also developed to enable identification and tagging of periods of interest in long time series flow and turbidity data. This helped enable visual inspection to focus on periods of interest, and could potentially be used to focus calibration by biasing fitting to these periods. The second stage was to use the automated calibration routines to confirm the mobilisation functionality of the new formulation. This is presented for four short duration DMA flushing responses as first reported in Boxall and Saul (2005) and Husband et al. (2010). The ability of the new model to simulate long term continual accumulation and mobilisation was then assessed, requiring long time series flow and turbidity data. This is presented here as calibration to two different long term data sets. The second data set was sufficient to enable segregation into training and testing data allowing validation. Sufficient was judged to be a time series that contained a sequence of at least 5 clear mobilisation events with sufficient periods between for measurable (in terms of turbidity response when mobilised) accumulation to have occurred between them for training, and then a further 3 events for testing. The periods and magnitude necessary for this are system and events specific.
Automated event identification
Identifying the times when turbidity events occurred or are most likely to have occurred from continuous turbidity and shear stress datasets covering extended time periods is not straightforward and automation is therefore highly desirable. The motivation for identifying the times of likely events is that the data over these periods provide a response that is critical to facilitate calibration. Model validation can therefore be tailored to better qualify/quantify the ability of the VCDM to characterise the processes of interest by focussing on these disjointed subsets of the time-series dataset. Turbidity events due to mobilisation within a pipe are most likely when the pipe shear stress exceeds recent maximum. The set of likely turbidity events of a given dataset was identified as follows:
For each timestamp t, mark t as being an event time if τa (t) exceeds the maximum of the previous t_win seconds;
(Optional) All identified events that correspond to just a single timestamp should be classified as not being events as they could just be due to measurement error;
For all remaining events, consider all timestamps within n_turnovers×tt after each event to be part of that event, thus allowing for discolouration material propagation following the flow increase.
Particle swarm optimisation
Evolutionary optimisation techniques such as genetic algorithms (Holland, 1992) mimic the competition inherent in natural selection to iteratively move towards an optimal candidate solution. Swarm optimisation methods are also stochastic but use collaboration rather than competition between candidate solutions to converge on the global optima. One of the more commonly-used techniques is particle swarm optimisation (PSO) (Kennedy et al., 2001).
In PSO, each dimension corresponds to an optimisation problem variable (model parameter). For the bird flocking problem, the objective function that needs to be minimised could take a set of coordinates in three-dimensional space as its three parameters and return the reciprocal of the concentration of flies at those coordinates. Initially a number of birds or particles (typically between 20 and 50 (Eberhart and Shi, 2001)) are assigned random positions and velocities within the problem space. The coordinates of each bird represent a candidate solution. The birds then move about this problem space over a number of timesteps. At each timestep each bird calculates a new velocity based on three factors (Floreano and Mattiussi, 2008):
Inertia: its current velocity vector
Nostalgia: the problem space position in the bird’s memory since t = 0 where the fitness value was greatest;
Societal influence: the position of the neighbouring bird with the best fitness at the current timestep (birds may use calls to communicate their current fitness values to each other).
PSO has been used to design distribution networks with optimal topologies and pipe diameters (Suribabu and Neelakantan, 2006), develop optimal reservoir management strategies (Reddy and Nagesh Kumar, 2007), learn the optimal parameters of a lake water quality model (Campbell and Phinn, 2009), parameterise hydrological and hydrogeological flow models (Zambrano-Bigiarini and Rojas, 2013) and learn formula parameters for solute mixing at DWDS junctions (Yu et al., 2014).
Two advantages of PSO over evolutionary optimisation approaches such as genetic algorithms are that PSO is purportedly more efficient than genetic algorithms and the key algorithmic components can be encapsulated in very little code (Kennedy et al., 2001). A possible disadvantage of PSO is that it is not naturally suited to optimising problems involving categorical variables (Floreano and Mattiussi, 2008). However, with no categorical input parameters, as with VCDM, PSO is desirable due to its simplicity and efficiency of design, implementation and execution.
In the present work, model parameters were fitted using a robust PSO metaheuristic, as sensitivity analysis showed that there were many similar ways in which model parameters and measurable/calculable properties, such as flow and pipe diameter, could influence the shape and scale of time-series model outputs (Furnass 2015). The PSO objective function was the error between time-series turbidity observations and predictions, optionally calculated only at times when erosion events were likely. The dissimilarity metric used in the objective function is the integral of squared errors (ISE) and was chosen as it is moderately sensitive to the magnitude of transformations, is very insensitive to noise, is simple to calculate (Eq. 1) and is dimensional.
where tcompare is the set of times at which errors are to be calculated (which do not need to be contiguous).
For the purposes of validating the proposed model for a single pipe, a bespoke software library was developed in Python (Lutz, 2011). Python is suitable for this task as it is interpreted and dynamically typed, facilitating rapid prototyping, data exploration and experimentation, yet can interface with statically typed, compiled extension modules for expediting computationally-intensive tasks (e.g. via Cython (Behnel et al., 2011)). Python was thus considered to offer the best balance of efficiency and ease of development and the greatest flexibility with respect to model fitting and sensitivity analysis.
The freeware pyvcdm Python package implementation provides VCDM coding and development functionality with which discolouration material release models can be instantiated then executed for a given time-series before modelling advection and mixing for that process i.e. the wall state change and material release code is largely decoupled from the advection and mixing code. This decoupling facilitates VCDM coding as an add-on into existing hydraulic software.
An implementation of PSO in Python, pyshoal was developed to provide the features and satisfy the model requirements and provides a simple API. Once instantiated, an optimisation run continues until convergence tolerances are satisfied for a number of iterations or a maximum number of iterations are reached. The outputs following a run are the best objective function parameter values found during the run, the value of the objective function given those parameter values and the number of iterations. The software allows for the optimisation of an arbitrary objective function (which can take any number of parameters) and can execute objective functions in parallel using multiple processes for reducing the execution time of a single optimisation run.
The calibration of model parameters can be formulated as an optimisation problem to be solved in an automated fashion:
The objective function to be minimised is an appropriately sensitive scalar measure of the dissimilarity (the integral of squared errors was used as the metric) between turbidity observations and model instance turbidity predictions over the calibration period;
The parameters to be optimised are VCDM model parameters βe, α and βr (unless one is confident that the duration of the simulation period is negligible compared to the time required for full regeneration in which case only βe and α);
The optimisation constraints are independent parameter bounds for βe, α and βr: all three parameters must be positive and finite.
The approach used for calibrating the model parameters is to fit all three parameters simultaneously given static and time-series observations of bulk water properties, a PSO configuration (including parameter bounds) and known initial wall state boundary conditions (ϕ (τ, t0)). This approach, as illustrated in Fig. 5, is referred to as FIT_SIMULT. Options for fitting/estimating initial wall state boundary conditions are discussed in Furnass (2015).
The PSO configuration options used when fitting the proposed model to data are specified in Table 1 (excluding the specific parameter space bounds and convergence tolerance values as these are more case-specific). As PSO is a non-deterministic metaheuristic that is likely but not guaranteed to find the global optimum, each of this set of fitting tests was attempted up to three times to achieve a satisfactory fit and confirm that an optimal, repeatable solution was found.
Fitting VCDM mobilisation to four DMA flushing events
The mobilisation and accumulation processes operate over dramatically different time frames. Mobilisation and associated change in pipe wall bound material and turbidity response is in terms of minutes, whereas accumulation processes and associated change in pipe bound material layer state are in terms of weeks and months. Hence by considering only short time period mobilisation events the effects of accumulation, and the associated model functionality, can be ignored. Therefore the mobilisation functionality can effectively be validated independently of the accumulation function.
The VCDM was fitted to four independent flushing events to validate the mobilisation functionality across a range of pipe materials, pipe roughness and source waters. The four selected flushes were from DMA’s around the UK and were first presented in Boxall and Saul (2005) and later in Husband et al. (2010) with corresponding PODDS model calibrations. The four flush examples (PODDS-CI1, PODDS-CI2, PODDS_PE1, PODDS-PE2), shown in Fig. 6, were used to explore the accuracy and precision (repeatability) with which the VCDM can be fitted and the quality of model fit (using a relative dissimilarity metric) compared to the corresponding PODDS calibrations.
The VCDM was calibrated using each of the DMA flushing datasets by using the FIT_SIMULT method to fit the mobilisation parameters βe and α. The βr parameter was set to 0 s−1, simplifying the fitting process as for these short duration simulations the accumulation effects would be negligible. Prior to flushing, according to company records, the pipes effected had only experienced a regular daily flow pattern for the preceding years so could be considered as ‘previously undisturbed’. As a result, the initial wall state boundary conditions were fitted using a single cusp approach: the relative amount of material at the start of the simulation was polarised around a single shear strength τc, which, like βe and α, was fitted by FIT_SIMULT. Here τc is conceptually similar to the initial layer strength term in the PODDS model which was also derived from the preceding peak flow from the daily pattern. A fourth quantity was fitted by FIT_SIMULT, this being the maximum shear strength, τmax, with which material can be bound to the walls of pipes. In operational terms τmax is effectively a ‘self-cleaning’ boundary, that is the strongest force with which erodible material can accumulate at the pipe wall; for pipes experiencing daily forces in excess, material accumulation is mitigated. As with the prior PODDS simulations, τmax was only fitted for the PE pipes as no evidence of a maximum shear strength has been found in CI pipes (Husband et al. 2010).
Fitting the VCDM to trunk main (TM-NR)
The most informative method for validating the accumulation mechanism and evidencing its practical value is to assess whether it can represent a series of turbidity responses identified as due to hydraulic mobilisation during an extended time period from a relatively homogeneous pipe length. In this first case study, the model was fitted to a 22-month dataset with a number of shear stress events and where the spacing between events was such that significant accumulation could occur. As part of an integrated network these events where in response to network demands and coordinated from a central control room. Trunk main TM-NR is a 15 km large-diameter (800 mm), lined steel trunk main that is situated in the north-east of England supplied with surface water. The main runs between a break pressure tank (BPT) and a service reservoir (SR) see Fig. 7.
After pre-processing to remove unrealistic flow values such as during flow meter maintenance and standardising to SI units, the dataset from trunk main TM-NR was used for calibrating and validating the VCDM is shown in Fig. 8.
The automated event detection method was applied to the TM-NR dataset and identified increased shear stress using a time window of fourteen days and the turbidity response highlighted for a propagation duration of five pipe turnovers. Figure 9 shows a temporal zoom of Fig. 8 with the identified turbidity responses due to increases in shear stress highlighted in red. Using this technique, a total 39 events were detected in this dataset. The three model parameters (βe, α and like βr) were fitted using the FIT_SIMULT method. Unlike calibration of the mobilisation process for the flushing operations with the initial wall state boundary conditions (ϕ (τ, t0)) polarised around a defined peak daily flow, the more variable trunk main flow history can generate a range of layer states. Fitting options are discussed in Furnass (2015), whilst in practice the simplest method as applied here sets initial wall state material as greater than would be expected with the opening flow data. This is rapidly mobilised (βe> > βr) with the initial week’s simulation results accepted as poor while a realistic wall state description is defined.
Fitting the VCDM to trunk main (TM-WB)
The second trunk main case study, like the study of TM-NR, made use of a multi-year dataset. Unlike TM1 however there is little demand driven variation, as a result planned interventions featuring managed flow trials were instigated to investigate the practical application of the concepts described by the VCDM. Also unlike TM-NR which is a surface water supplied trunk main, TM-WB is supplied with treated ground water and runs between two SRs in the south-west of England. The reach referred to as TM-WB is a series of mains of varying materials (AC and unlined DI), with changing diameters and flows due to there being take-offs along its length (see Fig. 10).
Operators of this network wished to be able to support resilience flows should additional demand be required in the event of a WTW fail, SR fail or major burst, without there being a negative impact on water quality. To achieve this an increase from the initial (pre October 2009) operating maximum of 1100 m3 day− 1 to 2800 m3 day− 1 was identified. A number of previous discolouration events associated to hydraulic triggers in this trunk main had indicated that there was a significant risk of discolouration. The planned and monitored trials reported herein covered a period were therefore used to investigate the behaviour of TM-WB to flow increases and to determine if an in-situ hydraulic strategy could be applied to facilitate this resilience work.
From June 2008 investigations commenced with concurrent flow and turbidity monitoring trials. During flow increase trials that were predicted to mobilise significant quantities of material the downstream reservoir was isolated for later discharge to safeguard the downstream network. The data set with large flow changes due to resilience targeting would help confirm the mobilisation functionality of VCDM whilst the varying periodic returns was ideal to validate the accumulation functionality. Once resilience had been achieved it was planned that the flow in the main would be periodically elevated back up to the target level to maintain resilience by removing any fresh material accumulation. To facilitate this an actuated valve and permanent turbidity meter were later installed on the inlet to the downstream SR for accurate and remote flow control and future flow conditioning. Turbidity observations were logged at 15 min via Supervisory Control And Data Acquisition (SCADA) from the date of commissioning. Some of the key events in the history of TM-WB regarding discolouration risk management are shown in Table 2.
After time series assembly, the dataset was available for calibrating and validating the VCDM using data from October 2009, capturing the flow trials as shown in Fig. 11.
The model was fitted to the dataset using the same settings used when fitting the TM-NR dataset. The FIT_SIMULT method was executed three times with the same inputs to assess the repeatability of the PSO fitting process.
Results of fitting the VCDM to DMA flushing events
The VCDM mobilisation model fitted well to the first three datasets. The best correlations between predictions and observations were similar for the VCDM and PODDS, based on visual comparison of Figs. 6 and 12. Direct numerical goodness of fit comparison is not presented as it is not considered valid due to differences in implementation of PODDS from within EPANET network modelling and VCDM for single pipe lengths. Figure 12 shows the VCDM is capable of simulating mobilisation functionality across different pipe materials and source waters. The ability to fit the fourth dataset, PODDS-PE2, is hampered by data quality. This is due to elevated turbidity during the first step in this trial that the VCDM model was unable to replicate as part of the complete sequence. For the second and third flow increments VCDM simulations are arguably better than the original PODDS results. The difficulties of fitting this data were also observed in Husband and Boxall (2010), who noted this elevated initial response is a likely consequence of disturbing the system when cleaning the hydrant prior to attaching monitoring equipment. The overall results therefore support a good general fit between VCDM and PODDS simulation and measured response to all pipes investigated. The fitting process was repeatable when the data quality was good, confirmed by the small parameter ranges for three out of the four cases. However, when the quality was suspect, repeating FIT_SIMULT multiple times and visually inspecting the bias of the fits allowed some confidence in parameter estimation. Data quality was assessed subjectively by looking at the distinct nature of both the hydraulic disturbance and the turbidity response relative to background flow and turbidity. Signal to noise approaches with metrics were tried, but a more complex process is required to capture what is currently an expertise based assessment.
Results of fitting the VCDM to TM-NR
FIT_SIMULT parameters for the fitting approach are shown in Table 4, as are the ISE and R2 correlation metrics and the reciprocal of the (mean or constant) βr, expressed in years. There was no difference in any of these values for 3 repeated optimisation runs. Figure 13 provides 6 (from 39) randomly chosen event examples, identified using the automated approach.
Although the R2 for the full times series is only 0.311 it should be noted that the data set is dominated by near zero turbidity data with only small errors over this period dominating the metric. Visual assessment of turbidity simulations during the major shear stress events indicates that the model does represent the key discolouration events. There were however periods with less correlated fit and turbidity events that the model could not account for. There are several possible reasons for this, including: the net turbidity response using Tds_obs rather than calculated from both Tus_obs and Tds_obs so material imported from upstream was not accounted for; the flow monitoring not recording short duration events due to fifteen-minute logging, but the response being evident in the turbidity responses. Although the mobilising hydraulic event is too short to be captured at 15 min resolution, material is still mobilised and propagated downstream. Without confirmation of possible short-term hydraulic events, the model is unable to predict this turbidity response. This then has knock on effects into the next event as the amount of accumulated material will be overestimated. The accumulation rate indicates full material accumulation, or maximum discolouration risk, in ~ 1.8 years.
Results of fitting the VCDM to TM-WB
FIT_SIMULT parameters from the fitting approach applied to TM-WB are shown in Table 4. Results for repeated optimisations are identical other than slight variation in number of iterations (104, 99, 109). The parameters identified by the three repeat runs are consistent with a reasonable R2 = 0.7 correlation with slightly transformed observations.
The accuracy/validity of the calibrated model was explored by predicting the turbidity response over the part of the dataset not used for calibration (after trial 6 on 2011-06-22 at 09:12:17 to 2013-10-17 at 13:00:00, approximately 848 days). The values of φ (τ, t) from the final time step of the calibration simulations (see Fig. 14; no observable variation between runs) was used as the wall state boundary condition when simulating from 2011 to 06-22 at 09:12:17 onwards. The relationship between shear strength and relative material quantity at this time is not trivial compared to the PODDS model that is defined by a single value. This supports the VCDM design decision of allowing relative material quantity to vary freely (or at least monotonically increasingly) with shear stress.
The correlation between Tnet_est and Tnet_pred over this period was not as strong (R2 was 0.041 when considering timestamps where Tnet_est was not null) as for the period used for fitting the model. Figure 15 provides a random selection of event examples. As with TM-NR these figures show visually acceptable simulation results around events, while the correlation metrics are dominated by small errors over the majority of the near zero turbidity data.
Fitting the model to six distinct flow trials in TM-WB, the FIT_SIMULT method proved to be highly repeatable, both in terms of identified parameter values, and net turbidity profiles with an accumulation rate of 0.9 years. Although reasonable, model fit could have been improved had upstream turbidity observations been available for determining which turbidity increases were due to imported material.
Details of the best calibration fits for all six DWDS pipe lengths modelled are shown in Table 4. The model was able to represent DMA mobilisation events in unlined CI pipes well. The fits to mobilisation events in plastic DMA pipes were not as good, even if the maximum shear strength is explicitly limited, but the same was also true for the PODDS model (Husband et al. 2010).
An empirical model is preferable to a process derived fully deterministic model given the large number of factors influencing discolouration. The material quantity versus shear strength relationship is considered to be sufficiently complicated for it not to be possible to learn model structure solely from data through empirical machine learning. The most suitable approach is therefore suggested to be a grey-box empirical model consisting of anthropogenic formulations that have been informed by multiple field and laboratory studies.
The VCDM is shown to be able to represent mobilisation events as well as the field-validated PODDS discolouration model. It is therefore proposed that the VCDM succeeds the PODDS discolouration model: it provides the same functionality with fewer parameters and also includes an accumulation mechanism that far more closely resembles observed behaviour. The complexity of the VCDM wall state model is greater than that of the PODDS model, but only as complex as is required to represent the observed characteristics of material mobilisation and accumulation processes.
By understanding factors influencing accumulation, mitigating steps in treatment or chemical dosing can be investigated with the potential to improve future water quality and reduce operating costs. It might be reasonable to expect temperature to impact accumulation rates as a result of effecting network microbial and chemical reaction rates. If model parameters are also shown to be reproducible, the potential ability to transfer between sites could reduce the need for trunk specific studies thereby facilitating widespread application.
While the substantial advantage of the modelling approach verified here is the ability to simulate accumulation with the complexity of the wall bound state suitably described, the processes of accumulation themselves are simply captured in a single pipe specific coefficient. This could be readily advanced as understanding of these processes are better understood, such as including a seasonal temperature dependence, reflecting the understanding that microbial processes are central to material accumulation and that microbial processes are nearly all temperature dependent. Previous work has also shown that accumulation rate is a function of both the bulk water quality in the pipe of interest and the pipe material itself (Husband and Boxall, 2011 and Cook and Boxall, 2011). The dataset for calibration to TM-NR with an upstream turbidity record could be used to indicate if change in background bulk water turbidity over a trunk main pipe length could be a related to accumulation processes. However, dependence of the accumulation rate to background concentrations of turbidity, or any other parameter, was not included in the model form as the evidence for this is currently at best incomplete. Interestingly the bulk water concentration examples discussed here demonstrate that even the understanding of whether, where and when accumulation mechanisms are rate of supply or process limited is currently unknown.
Recent industrial initiatives have stressed the importance of striking a better balance between capital and operational expenditure through consideration of whole life costs. As a result, more water service providers are appraising and developing risk based management strategies to help prolong the life of assets to safeguard water quality. For example, operators may want to understand the impact of a near-term future flow increase, or by how much they could increase the flow without exceeding a turbidity threshold. Using the VCDM this could be posed as a simple optimisation problem. Alternatively, operators could be presented with a real-time management outputs demonstrating the maximum turbidity that will result from any number of instantaneous flow increases.
Data driven methodologies are also being developed that could be used as early warning systems enabling beneficial proactive management strategies to be implemented as an alternative to expensive trunk mains cleaning programs (Meyers et al. 2017, Kazemi et al. 2018). Both data driven methodologies and the VCDM however require large temporal datasets with a range of turbidity events for calibration or facilitate machine learning training. The VCDM is based on conceptual understanding, so if further calibration is shown to offer grouping of VCDM parameter values, and hence transferability between locations, it would potentially be of generalised value. Both approaches have the potential for greater operational application and there is a trade-off to be explored based on amount of data and under/overfitting.
The VCDM could in future be refined using knowledge gained from calibrating model instances for a variety of different systems and also by using machine learning techniques to build accumulation sub-models. The resulting improvements in predictive accuracy could drive the uptake of asset management strategies where shear stresses are periodically, pro-actively increased so that planned or unplanned flow increases do not cause unacceptable turbidity responses. By tracking discolouration potential over an extended period (weeks, months or years), the VCDM can equip water providers with a tool for assessing how different interventions and incidents could impact bulk water turbidity, alert operators when a flow increase is predicted to cause the bulk water turbidity to exceed a threshold, review network changes by analysis of accumulation rates, develop and optimise flow conditioning programmes for non-invasive cleaning and determine associated whole life costs.
Following the quotation, or slight misquotation of George Box, we fully accept that the model we propose is in one sense wrong, in that it does not fully capture or describe all the processes and interactions occurring. However, the majority of the validation presented here shows the ability to simulate the observed turbidity data to an accuracy of less than 1 NTU. This is the turbidity standard applied at the exit of water treatment works, is significantly less than the 4 NTU limit at consumers taps and even lower than the generally accepted limit of human observation which is, subject to lighting conditions and size of sample, generally into the teens. Hence we conclude that the model is useful.
The paper presents calibration and validation of a semi-empirical model that tracks the continual processes of accumulation and mobilisation of material from pipe walls within drinking water distribution systems. Functionality to enable automated calibration was developed and implemented making use of particle swam optimisation. This was found to provide efficient and repeatable solutions from rigorous searching of the problem space. Automated approaches were also developed enabling identification of periods of interest from long and complex turbidity time series data.
The model formulation was shown to maintain the mobilisation functionality of previous validated shear-stress-dependant modelling tool, but requiring only two empirical parameters. Evidence of this was presented for four classic cases across both cast iron and plastic pipe materials and hence covering different material behaviour within these. Calibration to long time series flow and turbidity data was presented for two trunk main systems. The model is shown to make usefully accurate simulations for flow mediated events and with sufficient data, validation of the predictive capability was demonstrated. This supports the observation and model functionality that material with all shear strengths accumulates continuously at a rate invariant to hydraulic forces.
Overall the paper demonstrates the capability of the new formulation combining a sub-model tracking the temporal shear strength profile of accumulated pipe wall material and a hydraulically defined material mobilisation model. With the VCDM, water service providers have for the first time a tool to assess both effective real-time and long-term hydraulic discolouration responses, quantify network changes and plan, with costing implications, maintenance schedules to safeguard drinking water quality.
Application Program Interface (API)
Break Pressure Tank
District Meter Area
Drinking Water Distribution Systems
Nash Sutcliffe (Model) Efficiency Index
Prediction of Discolouration in Distribution Systems
Particle Swarm Optimisation
Supervisory Control And Data Acquisition
Variable Condition Discolouration Model
Water Treatment Works
Abe Y, Skali-Lami S, Block J-C, Francius G (2012) Cohiveness and hydrodynamic properties of young drinking water biofilms. Water Res 46:1155–1166
Behnel S, Bradshaw R, Citro C, Dalcin L, Seljebotn DS, Smith K (2011) (2011). Cython: the best of both worlds. In: Computing in Science & Engineering, vol. 13, no. 2, pp. 31–39, march–April. https://doi.org/10.1109/MCSE.2010.118
Boxall JB, Saul AJ (2005) Modeling discoloration in potable water distribution systems. J Environ Eng 131:716
Boxall, J. B., Skipworth, P., Saul, A. J., 2001. A novel approach to modelling sediment movement in distribution mains based on particle characteristics. In: Water Software Systems: v. 1: Theory and Applications
Campbell, G. and Phinn, S. R. (2009). Accuracy and precisions of water quality parameters retrieved from particle swarm optimisation in a sub-tropical lake. In: SPIE 7473. SPIE
Cook, D.M. and Boxall, J. B, (2011) Discolouration material accumulation in water distribution systems. Journal of Pipeline Systems Engineering and Practice ASCE, Vol. 2 No. 4. pp 113–123
DNV GL, 2010. SynerGEE water. URL https://blogs.dnvgl.com/software/category/so/
Douterelo I, Husband S, Loza V, Boxall JB (2016) Dynamics of biofilm regrowth in drinking water distribution systems. Appl Environ Microbiol 82(14):4155–4168
Eberhart RC, Shi Y (2001) Particle swarm optimization: developments, applications and resources. In: Proceedings of the 2001 congress on evolutionary computation, vol 1, pp 81–86
Fish KE, Osborn AM, Boxall JB (2016) Characterising and understanding the impact of microbial biofilms and the extracellular polymeric substance (EPS) matrix in drinking water distribution systems. Environ. Sci.: water res. Technol. 2(4):614–630
Fish KE, Osborn AM, Boxall JB (2017) Biofilm structures (EPS and bacterial communities) in drinking water distribution systems are conditioned by hydraulics and influence discolouration. Sci Total Environ 593–594:571–580
Floreano, D. and Mattiussi, C. (2008). Bio-inspired Artificial Intelligence theories, methods, and technologies. MIT Press, Cambridge, Mass. ISBN: 9780262062718
Furnass WR (2015) Modelling both the continual accumulation and erosion of discolouration material in drinking water distribution systems, Ph.D. thesis, University of Sheffield. In: Licensed under a creative commons attribution – non commercial - no derivatives 4.0 international license
Furnass WR, Collins RP, Husband PS, Sharpe RL, Mounce SR, Boxall JB (2014) Modelling both the continual erosion and regeneration of discolouration material in drinking water distribution systems. Water Science & Technology: Water Supply 14(1):81–90
Furnass WR, Mounce SR, Boxall JB (2013) Linking distribution system water quality issues to possible causes via hydraulic pathways. Environ Model Softw 40:78–87
Holland JH (1992) Adaptation in natural and artificial systems: an introductory analysis with applications to biology, control, and artificial intelligence. MIT Press, Cambridge, Mass
Husband P, Boxall J (2011) Asset deterioration and discolouration in water distribution systems. Water Res 45:113–124
Husband P, Whitehead J, Boxall J (2010) The role of trunk mains in discolouration. Proceedings of the ICE - Water Management 163(WM8):397–406
Husband PS, Boxall JB (2016) Understanding and managing Discolouration risk in trunk mains. Water Res 107:127–140
Husband PS, Boxall JB, Saul AJ (2008) Laboratory studies investigating the processes leading to Discolouration in water distribution networks. Water Res 42:4309–4318
Husband S, Boxall JB (2010) Field studies of discoloration in water distribution systems: model verification and practical implications. J Environ Eng 136(1):86–94
Husband S, Fish KE, Douterelo I, Boxall JB (2016) Linking discolouration modelling and biofilm behaviour within drinking water distribution systems. Water Sci Technol Water Supply 16(4):942–950
Innovyze (2018). https://www.innovyze.com/en-us/products/infoworks-ws-pro
Kazemi E, Mounce SR, Husband S, and Boxall JB (2018) Predicting turbidity in water distribution trunk mains using nonlinear autoregressive exogenous artificial Neural Netw Proceedings of 13th HydroInformatics conference, Palermo, Sicily, 1st to 6th July, 2018
Kennedy JF, Eberhart RC, Shi Y (2001) Swarm intelligence. Morgan Kaufmann Publishers, San Francisco
Lutz, M., 2011. Programming Python, fourth edition Edition. O’Reilly Media
Matthieu L, Bertrand I, Abe Y, Angel E, Block J-C, Skali-Lami S, Francius G (2014) Drinking water biofilm cohesiveness changes under chlorination or hydrodynamic stress. Water Res 15:175–184
Meyers G, Kapelan Z, Keedwell E (2017) Short-term forecasting of turbidity in trunk main networks. Water Res 124:67–76
Mounce SR, Blokker EJM, Husband PS, Furnass W, Schaap PG, Boxall JB (2016) Multivariate data mining for estimating the rate of discolouration material accumulation in drinking water distribution systems. J Hydroinf 1(18):96–114
Reddy MJ, Nagesh Kumar D (2007) Multi-objective particle swarm optimization for generating optimal trade-offs in reservoir operation. Hydrol Process 21(21):2897–2909
Suribabu CR, Neelakantan TR (2006) Design of water distribution networks using particle swarm optimization. Urban Water J 3(2):111–120
van Summeren J, Blokker M (2017) Modelling particle transport and discoloration risk in drinking water distribution networks. Drink Water Eng Sci 10(2):99–107
Vreeburg J, Boxall JB (2007) Discolouration in potable water distribution systems: a review. Water Res 41:519–529
Vreeburg JHG, Schippers D, Verberk JQJC, van Dijk JC (2008) Impact of particles on sediment accumulation in a drinking water distribution system. Water Res 42(16):4233–4242
Yu TC, Shao Y, Shen C (2014) Mixing at cross joints with different pipe sizes in water distribution systems. J Water Resour Plan Manag 140(5):658–665
Zambrano-Bigiarini M, Rojas R (2013) A model-independent particle swarm optimisation software for model calibration. Environ Model Softw 43:5–25
The authors would like to thank the PODDS consortium of UK water companies for supporting the ongoing programme of discolouration research at the University of Sheffield and for data provision and permission to publish the details included herein.
The research was funded by the UK Engineering and Physical Sciences Research Council (EPSRC) Challenging Engineering grant EP/G029946/1- Pipe Dreams.
Availability of data and materials
The code for the reference implementation of the VCDM (pyvcdm) along with the datasets used in this study are available upon request under permissive licenses to allow others to further validate and refine this modelling framework. At the time of writing the model has been implemented into (commercial) network modelling packages Infoworks WS Pro (Innovyze, 2018) and Synergi (DNV GL, 2010) enabling network (as opposed to a single pipe) analysis. Synergi has implemented the VCDM code as presented here. The Infoworks discolouration functionality is also based on the research described in this paper. The PSO implementation pyshoal is available under a GPL-3.0 license on GitHub.
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.