Parameter identification in the activated sludge process

Similar documents
Introduction To Genetic Algorithms

Controller Tuning Of A Biological Process Using Optimization Techniques

2. Genetic Algorithms - An Overview

Intelligent Techniques Lesson 4 (Examples about Genetic Algorithm)

Evolutionary Algorithms

Optimisation and Operations Research

Machine Learning: Algorithms and Applications

GENETIC ALGORITHMS. Narra Priyanka. K.Naga Sowjanya. Vasavi College of Engineering. Ibrahimbahg,Hyderabad.

BIOAUTOMATION, 2009, 13 (4),

Minimizing Makespan for Machine Scheduling and Worker Assignment Problem in Identical Parallel Machine Models Using GA

Introduction to Artificial Intelligence. Prof. Inkyu Moon Dept. of Robotics Engineering, DGIST

Introduction To Genetic Algorithms

Evolutionary Computation. Lecture 3. Evolutionary Computation. X 2 example: crossover. x 2 example: selection

Part 1: Motivation, Basic Concepts, Algorithms

Genetic Algorithms and Sensitivity Analysis in Production Planning Optimization

initial set of random solutions called population satisfying boundary and/or system

CSE /CSE6602E - Soft Computing Winter Lecture 9. Genetic Algorithms & Evolution Strategies. Guest lecturer: Xiangdong An

Genetic Algorithms. Part 3: The Component of Genetic Algorithms. Spring 2009 Instructor: Dr. Masoud Yaghini

APPLICATION OF COMPUTER FOR ANALYZING WORLD CO2 EMISSION

Modelling of Escherichia coli Cultivations: Acetate Inhibition in a Fed-batch Culture

Keywords Genetic Algorithm (GA), Evolutionary, Representation, Binary, Floating Point, Operator

What is Evolutionary Computation? Genetic Algorithms. Components of Evolutionary Computing. The Argument. When changes occur...

Processor Scheduling Algorithms in Environment of Genetics

Genetic Algorithm for Supply Planning Optimization under Uncertain Demand

Comparative Study of Different Selection Techniques in Genetic Algorithm

Journal of Global Research in Computer Science PREMATURE CONVERGENCE AND GENETIC ALGORITHM UNDER OPERATING SYSTEM PROCESS SCHEDULING PROBLEM

College of information technology Department of software

10. Lecture Stochastic Optimization

Genetic Algorithms and Genetic Programming Lecture 14

Plan for today GENETIC ALGORITHMS. Randomised search. Terminology: The GA cycle. Decoding genotypes

Introduction to Genetic Algorithm (GA) Presented By: Rabiya Khalid Department of Computer Science

Feature Selection for Predictive Modelling - a Needle in a Haystack Problem

Evolutionary Computation

Evolutionary Algorithms - Population management and popular algorithms Kai Olav Ellefsen

Genetic Algorithm: An Optimization Technique Concept

VISHVESHWARAIAH TECHNOLOGICAL UNIVERSITY S.D.M COLLEGE OF ENGINEERING AND TECHNOLOGY. A seminar report on GENETIC ALGORITHMS.

Industrial Microbiology INDM Lecture 10 24/02/04

Genetic Algorithm: A Search of Complex Spaces

CHAPTER 4 LINEAR ANTENNA ARRAY SYNTHESIS USING GENETIC ALGORITHM

A Genetic Algorithm for Order Picking in Automated Storage and Retrieval Systems with Multiple Stock Locations

Keywords Genetic, pseudorandom numbers, cryptosystems, optimal solution.

Assoc. Prof. Rustem Popa, PhD

Population control in evolutionary algorithms

COMPARATIVE STUDY OF SELECTION METHODS IN GENETIC ALGORITHM

The Metaphor. Individuals living in that environment Individual s degree of adaptation to its surrounding environment

Generational and steady state genetic algorithms for generator maintenance scheduling problems

Design and Implementation of Genetic Algorithm as a Stimulus Generator for Memory Verification

Dynamic simulation of a fed-batch enzyme fermentation process

Genetic algorithms. History

Selecting Genetic Algorithm Operators for CEM Problems

Genetic Algorithm for Variable Selection. Genetic Algorithms Step by Step. Genetic Algorithm (Holland) Flowchart of GA

Design of PI Controller for Bioreactors for Maximum Production Rate

Genetic Algorithms and Genetic Programming Lecture 13

Study of Optimization Assigned on Location Selection of an Automated Stereoscopic Warehouse Based on Genetic Algorithm

Parametric Study of the Hydrogen Fuel Cell Electrochemical Model

Challenge of batch process control: highly nonlinear varying with time. In some cases (polymerisation)

Proceeding, Seminar of Intelligent Technology and Its Applications (SITIA 2000) Graha Institut Teknologi Sepuluh Nopember, Surabaya, 19 April 2000

GENETIC ALGORITHM A NOBLE APPROACH FOR ECONOMIC LOAD DISPATCH

Computational Intelligence Lecture 20:Intorcution to Genetic Algorithm

CapSel GA Genetic Algorithms.

Adaptive Online Parameter Tuning Using Genetic Algorithms

Bioreactor System ERT 314. Sidang /2011

A Gene Based Adaptive Mutation Strategy for Genetic Algorithms

Genetic Algorithms. Moreno Marzolla Dip. di Informatica Scienza e Ingegneria (DISI) Università di Bologna.

Genetic Algorithms for Optimizations

CHAPTER 3 RESEARCH METHODOLOGY

USE OF A ROTATING BIOLOGICAL CONTACTOR FOR APPROPRIATE TECHNOLOGY WASTEWATER TREATMENT

Validity Constraints and the TSP GeneRepair of Genetic Algorithms

A Hybrid Genetic Algorithm for Parameter Identification of Bioprocess Models

PARALLEL LINE AND MACHINE JOB SCHEDULING USING GENETIC ALGORITHM

An Evolutionary Approach involving Training of ANFIS with the help of Genetic Algorithm for PID Controller Tuning

Designing High Thermal Conductive Materials Using Artificial Evolution MICHAEL DAVIES, BASKAR GANAPATHYSUBRAMANIAN, GANESH BALASUBRAMANIAN

An introduction to modeling of bioreactors

Reproduction Strategy Based on Self-Organizing Map for Real-coded Genetic Algorithms

Intro. ANN & Fuzzy Systems. Lecture 36 GENETIC ALGORITHM (1)

Machine Learning. Genetic Algorithms

Machine Learning. Genetic Algorithms

Volume 3, Special Issue 3, March International Conference on Innovations in Engineering and Technology (ICIET 14)

Rule Minimization in Predicting the Preterm Birth Classification using Competitive Co Evolution

Genetic approach to solve non-fractional knapsack problem S. M Farooq 1, G. Madhavi 2 and S. Kiran 3

Grey and neural network prediction of suspended solids and chemical oxygen demand in hospital wastewater treatment plant effluent

Evolutionary Algorithms and Simulated Annealing in the Topological Configuration of the Spanning Tree

Modelling Fed-Batch Fermentation Processes: An Approach Based on Artificial Neural Networks

Using Multi-chromosomes to Solve. Hans J. Pierrot and Robert Hinterding. Victoria University of Technology

CEng 713 Evolutionary Computation, Lecture Notes

Keywords COCOMO model, cost estimation, genetic algorithm, ant colony optimization.

A HYBRID ALGORITHM TO MINIMIZE THE NUMBER OF TARDY JOBS IN SINGLE MACHINE SCHEDULING

GENETIC ALGORITHM CHAPTER 2

TIMETABLING EXPERIMENTS USING GENETIC ALGORITHMS. Liviu Lalescu, Costin Badica

An introduction to evolutionary computation

Genetically Evolved Solution to Timetable Scheduling Problem

Evolutionary Algorithms

Optimization of a Primary Metabolite Fermentation Process: Effect of Cost Factor on the Optimal Feed Rate Control

Introduction Evolutionary Algorithm Implementation

OBJECTIVE To design an algorithm that can be used for planning the location of distribution substations in a network.

Genetic Annealing Optimization: Design and Real World Applications

냉각결정화기술의응용전략 -2 고려대화공생명공학과양대륙

A Comparison between Genetic Algorithms and Evolutionary Programming based on Cutting Stock Problem

Simulation of Feedforward-Feedback Control of Dissolved Oxygen (DO) of Microbial Repeated Fed-batch Culture

Implementation of CSP Cross Over in Solving Travelling Salesman Problem Using Genetic Algorithms

Transcription:

Parameter identification in the activated sludge process Päivi Holck, Aki Sorsa and Kauko Leiviskä Control Engineering Laboratory, University of Oulu P.O.Box 4300, 90014 Oulun yliopisto, Finland e-mail: kauko.leiviska@oulu.fi Numerical optimisation methods are common in nonlinear parameter identification, but they may end up in a local optimum value. Genetic algorithms are able to escape local optima and continue the search towards a global optimum value. This paper shows that they are useful in identifying parameters for the nonlinear model of the activated sludge plant. The number of identified parameters in the model is seven: three coefficients in Haldane s equation, two yields, oxygen kinetic constant, and the oxygen transfer coefficient. The results are good, and show a good correlation with the actual values. 1. Introduction There are several models for the biological wastewater treatment in the literature. Common features in these models are nonlinear dynamics, high number of parameters describing the kinetics and stoichiometry of the reactions, several operating points, sensitivity to parameter values, and so on. Dissolved oxygen (DO) control is critical for the good operation of activated sludge processes. Too high DO leads to high energy consumption and bad quality, and too low DO leads to low pollutant removal. Several control methods are in use starting from conventional PID controls to model predictive control and various adaptive or fuzzy control strategies (Ertunc et al., 2009, Oliveira et al., 2005). More advanced control methods need models for the controller design and tuning and face the problems with parameter identification of nonlinear activated sludge models. This paper introduces the use of genetic algorithms for the nonlinear parameter identification of the activated sludge process. The data for modelling was produced by a Chemostat model built in Simulink environment. The model is based on mass balances and Haldane kinetics. The parameters identification uses the values of dissolved oxygen concentration as the system output, because its measurement in practice is easier than other state variables and, as mentioned before, it is also a common control variable. 2. Process model In this paper, data for identification trials are produced using the model of the aeration basin of the activated sludge process. The activated sludge process is modelled as a Chemostat bioreactor using substrate, biomass, and dissolved oxygen mass balances

(Equation 1) and Haldane kinetics (Equation 2) that describe the inhibition of the reaction by high substrate concentration. ds Q = ( S in S ) Y X / S µ ( S ) X dt V dx Q = ( X in X ) + µ ( S ) X dt V do Q = O in O Y X / O S X k o X + k L a O s O dt V ( ) µ ( ) ( ) (1) µ max µ ( S) = 1 2 I S K S + S + K S. (2) In the above equations, S, X, and O are the concentrations of substrate, biomass, and dissolved oxygen, the subscript in refers to their values in the input flow, Q is the flow rate, V is the liquid volume in the reactor, and Y is the yield. Table 1 contains the values for eight parameters shown in the above equations: the maximum growth rate (µ max ), saturation coefficient (K s ), inhibition constant (K I ), yield coefficients (Y X/S, Y X/o ), oxygen kinetic constant (k O ), volumetric oxygen transfer rate (k L a) and the saturation concentration for dissolved oxygen (O s ). With the chosen parameter set, the system shows nonlinear bi-stable dynamics, i.e. it has two stable operation points (Holck, 2008) (Sorsa and Leiviskä, 2006). Table 1. Parameter values for Equations 1 and 2 (Holck, 2008). Parameter µ max K S K I Y X/S Y X/O k O k L a O s Value 1.04 3.75 60 1.40 1.03 0.006 16.8 6 The model simulator is a Simulink program. The solver used in the simulations is ode45 that is capable of solving nonstiff differential equations, and uses the Runge- Kutta method. 3. Data The process has two distinct operation points: low conversion (OP 1) and high conversion area (OP 2). Identification data sets were generated for both points by applying two consecutive step changes of sizes +5 and -10 to the input concentration of the substrate. This approach should also reveal the nonlinearity of the system. Random disturbances were applied to the input concentration of substrate (S in ), and biomass (X in ), and to the liquid volume in the reactor (V). The disturbances were taken from the normal distribution and their means and variances are in Table 2. The disturbance to V is very small since it was assumed that the liquid level in the reactor was in control.

Table 2. The means and the variances of disturbances. Variation in S in X in V Mean 0 10 100 Variance 1 0.4 1 Table 3 shows the total simulation time (t sim ), the steady-state input concentration of the substrate, and the size and time of the step changes in both operation points. The total simulation time for the second operation point is longer than for the first operation point. This is due to working in the low conversion area and with less biomass in the system. Therefore, the reactions occur more slowly. Table 3. Simulation time, substrate input concentrations and sizes and times for the step changes in both operation points. Step change 1 Step change 2 t sim S in Size Size OP1 120 40 5 50-10 100 OP2 275 75 5 100-10 200 The identification bases on the concentration of dissolved oxygen. There are two reasons for this: in practice, it is the easiest output variable to measure and secondly, as mentioned in the introduction, several control strategies use the concentration of dissolved oxygen as the controlled variable. Figures 1 and 2 later show the step responses of dissolved oxygen in both operation points. 4. Identification with Genetic Algorithms The model parameters in this paper are identified by using the real-coded genetic algorithms. Genetic algorithms are an optimization method mimicking evolution. The population consisting of chromosomes evolves towards the global optimum. The possible solutions to the optimization problem are encoded into the chromosomes. Binary or real-valued coding can be used. The link between the chromosomes and the problem is the objective function, the fitness function. The main stages in genetic optimization are parent selection, fitness evaluation, variation, and population update. These stages are repeated until the optimum has been reached. Typical parent selection mechanisms are the tournament and the roulette wheel mechanisms (Davis, 1991). In tournament mechanism, a certain number of chromosomes are selected randomly to participate in a tournament. The most suitable chromosome is the winner and is selected as a parent. In the roulette wheel method, the fitter chromosomes have a larger slot in the roulette, and have therefore better chances to become a parent. The genetic operators crossover and mutation regulate the variation: The crossover operator mates two parents to produce the offspring (1 or 2, depending on the methods). Mutation adds random changes to the population so that the optimization is not trapped into any local optimum. Genetic algorithms (GA) have been used for both the structure and parameter identification (Gray et al., 1998, Nyarko and Scitovski, 2004, Chang, 2007, Khalik et

al., 2007, Wang et al., 2008). The most significant drawback of conventional methods (such as gradient methods) is the lack of ability to overcome local optima. That problem does not exist with GA since the optimal solutions are searched from multiple directions and, basically, the whole search space is covered. Therefore, GA are more likely to find the global optimum. The drawbacks of GA are that the exact solution may not be found and the optimization is time-consuming with complex systems. The parameter identification was performed for both operation points by using two different crossover methods: the linear crossover and the arithmetic crossover with crossover probability of 0.9. The mutation method was the uniform mutation with the mutation probability of 0.05. The initial population was taken randomly from the uniform distribution and each generation consisted of 200 chromosomes. Elitism was also included and from each generation the best chromosome was transferred to the next to ensure that the best result will not disappear from the population. The initial population was subjected to genetic operations for 30 generations after which the best chromosome was returned by the algorithm. To validate the results of the parameter identification, the optimization was repeated for both operation points (and with both crossover methods) 500 times. The fitness function was the sum of the squared prediction error, SSE that compared the predicted value of dissolved oxygen with its actual value. Seven parameters were identified from the step response data, i.e. all parameters in Table 1 except O s. The feasible range allotted for the parameters is very important in genetic optimization, and the ranges for the seven parameters are in Table 4. Table 4. Parameter ranges for genetic optimization. Parameter µ max K S K I Y X/S Y X/O k O k L a Range 0-2 0-10 30-90 0-5 0-20 0-0.1 10-40 5. Results and discussion Table 5 shows the best parameter sets for both operation points and crossover methods. The table shows also differences between the actual parameters and the identified parameters in percentages. Table 6 shows the statistical values of the fit: SSE, Mean error (EM), Standard deviation (SD), and correlation (R2) between the predicted and actual responses. It also shows the number of iterations (NI), where the best parameter sets were achieved. Figures 1 and 2 together with the correlation coefficients in Table 6 show that genetic algorithms perform well in parameter identification. There is also only a little difference between the used crossover methods even though some of the identified parameter values differ a lot from the actual values. There are two reasons for the big differences: the random disturbances applied to the system inputs in data sets generation and also the fact that the parameters are identified for both operation points separately. The parameter that differs the most from the actual parameter values is k O regardless of the crossover method used. Its value is very small and its search area is much larger than the relative search areas for the other parameters.

Table 5. The best set of parameter values and their differences from the actual ones for both crossover methods (CM) and operation points (OP). L means Linear Crossover and A means Arithmetic Crossover. Difference is in percentages. Parameter OP CM µ max K S K I Y X/S Y X/O k O k L a Value 1 L 0.91 7.16 79.2 1.65 1.38 0.016 18.6 Difference -12.5 90.9 32.1 17.9 34.0 167 10.7 Value A 0.97 6.16 72.5 1.55 2.11 0.05 33.1 Difference -6.7 64.3 20.8 10.7 105 733 97.0 Value 2 L 0.74 7.13 33.1 0.007 1.46 0.013 25.2 Difference -28.8 90.1-44.8-99.5-41.7 117 49.9 Value A 0.60 6.73 81.6 1.82 2.43 0.045 32.9 Difference -42.3 79.5 35.9 30.0 136 651 96.1 Table 6. Statistical values for the model validation. OP CM SSE*10-4 EM*10-3 SD*10-3 R2 NI 1 L 2.37 2.45 4.48 0.998 462 A 2.96 7.08 5.96 0.998 397 2 L 2.40 1.94 2.36 0.991 142 A 2.92 2.81 1.77 0.990 105 In the future, the parameter search areas should be studied more to ensure that the parameter ranges do not include impossible values for the real process. This would help to cut out unrealistic parameter sets. It would also be essential to perform sensitivity analysis for the model parameters. SSE is only one of the many possible fitness functions. Other functions should be tested to find out whether different fitness functions have a significant influence to the identification results. During the optimization, the fitness function is evaluated hundreds of times. Cutting down the runtime of the whole genetic algorithm is important to improve the efficiency of the solution. 5.8 5.8 5.78 5.76 5.78 5.76 5.74 5.74 a 35 60 90 120 35 b 60 90 120 Figure 1. The actual and predicted concentration of dissolved oxygen for Operation Point 1. Linear Crossover (a) and Arithmetic Crossover (b). Denotations: --- predicted, actual.

5.902 5.902 5.897 5.897 5.892 5.892 5.887 5.887 a 60 140 220 300 b 60 140 220 300 Figure 2. The actual and predicted concentration of dissolved oxygen for Operation Point 2. Linear Crossover (a) and Arithmetic Crossover (b). Denotations: --- predicted, actual. References Sorsa A. and Leiviskä K., 2006, State detection in the biological water treatment process. Oulu, Control Engineering Laboratory. 53. Report A. 31. ISBN 951-42- 8273-6/ISSN 1238-9390. Holck P., 2008, Parameter identification with genetic algorithms for biological wastewater treatment process. Master's thesis, Control Engineering Laboratory, University of Oulu, Finland. Gray G.J., Murray-Smith D.J., Li Y., Sharman K.C and Weinbrenner T., 1998, Nonlinear model structure identification using genetic programming, Control Engineering Practice 6, 11, 1341-1352. Chang W.-D., 2007, Nonlinear system identification and control using a real-coded genetic algorithm, Applied Mathematical Modelling 31, 3, 541-550. Wang G., Feng E. and Xiu Z., 2008, Modeling and parameter identification of microbial bioconversion in fed-batch cultures, Journal of Process Control 18, 12, 1323-1326. Khalik M.A., Sherif M., Saraya S. and Areed G., 2007, Parameter identification problem: Real-coded GA approach, Applied Mathematics and Computation 187, 2, 1495-1501. Nyarko E.K. and Scitovski R., 2004, Solving the parameter identification problem of mathematical models using genetic algorithms, Applied Mathematics and Computation 153, 3, 651-658. Davis L., 1991, Handbook of Genetic Algorithms. Van Nostrand Reinhold, New York. Ertunc S., Akay B., Boyacioglu H. and Hapoglu H., 2009, Self-tuning control of dissolved oxygen concentration in a batch bioreactor, Food and Bioproducts Processing (in press). Oliveira R., Clemente J.J. Cunha A.E. and Carrondo, M.J.T., 2005, Adaptive dissolved oxygen control through the glycerol feeding in a recombinant Pichia pastoris cultivation in conditions of oxygen transfer limitation, Journal of Biotechnology 116, 35 50.