Metamodelling and optimization of copper flash smelting process

Similar documents
IMPROVING GROUNDWATER FLOW MODEL PREDICTION USING COMPLEMENTARY DATA-DRIVEN MODELS

ANN-Based Sizing of Battery Storage in a StandAlone PV System

Forecasting Electricity Consumption with Neural Networks and Support Vector Regression

Estimation of Boiler s Tubes Life, Artificial Neural Networks Approach

Neural Network Model of the Process Nickel-Smelting of Copper Raw Materials in Furnaces

AGENTS MODELING EXPERIENCE APPLIED TO CONTROL OF SEMI-CONTINUOUS PRODUCTION PROCESS

An Application of Neural Networks in Market Segmentation

REVIEW ON PREDICTION OF CHRONIC KIDNEY DISEASE USING DATA MINING TECHNIQUES

Churn Prediction with Support Vector Machine

MISSING DATA CLASSIFICATION OF CHRONIC KIDNEY DISEASE

CS6716 Pattern Recognition

DATA MINING METHODS FOR PREDICTION OF AIR POLLUTION

A Study of Financial Distress Prediction based on Discernibility Matrix and ANN Xin-Zhong BAO 1,a,*, Xiu-Zhuan MENG 1, Hong-Yu FU 1

Progress Report: Predicting Which Recommended Content Users Click Stanley Jacob, Lingjie Kong

Multilayer Perceptron to Model the Decarburization Process in Stainless Steel Production

Estimation of pesticides usage in the agricultural sector in Turkey using Artificial Neural Network (ANN)

Analysis of Boiler Operational Variables Prior to Tube Leakage Fault by Artificial Intelligent System

Neuro-Genetic Optimization of LDO-fired Rotary Furnace Parameters for the Production of Quality Castings

9/12/2018. Course Objectives MSE 353 PYROMETALLURGY. Prerequisite. Course Outcomes. Forms of Assessment. Course Outline

A Comprehensive Evaluation of Regression Uncertainty and the Effect of Sample Size on the AHRI-540 Method of Compressor Performance Representation

PREDICTION OF CONCRETE MIX COMPRESSIVE STRENGTH USING STATISTICAL LEARNING MODELS

Applied Mathematical Modelling

Corrosion Test Improvement by Climatic Modeling

SEISMIC ATTRIBUTES SELECTION AND POROSITY PREDICTION USING MODIFIED ARTIFICIAL IMMUNE NETWORK ALGORITHM

Conclusions and Future Work

Best Suitable Cogeneration Power for Factories from Multi-Objective Data Analysis

Estimation of the Distance Travelled While Collecting Bales from a Field Using Artificial Neural Networks (ANN)

Customer Relationship Management in marketing programs: A machine learning approach for decision. Fernanda Alcantara

forming processes Microstructure evolution in metal Jianguo Lin, Daniel Balint and Maciej Pietrzyk Edited by PUBLISHING WOODHEAD

When to Book: Predicting Flight Pricing

BIOINFORMATICS THE MACHINE LEARNING APPROACH

Big Data. Methodological issues in using Big Data for Official Statistics

A Data-driven Prognostic Model for

2 Maria Carolina Monard and Gustavo E. A. P. A. Batista

Monte Carlo Simulation of Grain Growth

Top-down Forecasting Using a CRM Database Gino Rooney Tom Bauer

Predictive Analytics Using Support Vector Machine

Prediction of organic matter removal from pulp and paper mill wastewater using artificial neural network

BUSINESS DATA MINING (IDS 572) Please include the names of all team-members in your write up and in the name of the file.

To combine or not to combine: selecting among forecasts and their combinations

Predicting Corporate Influence Cascades In Health Care Communities

Geometric Semantic Genetic Programming Algorithm and Slump Prediction

arxiv: v1 [cs.lg] 26 Oct 2018

UTILIZING A REDUCING GAS INJECTION IN CONVERTER SLAG FOR AVOIDING BOTTOM BUILD-UP IN REVERBERATORY FURNACES

Random forest for gene selection and microarray data classification

Prediction of Success or Failure of Software Projects based on Reusability Metrics using Support Vector Machine

Predicting Reddit Post Popularity Via Initial Commentary by Andrei Terentiev and Alanna Tempest

Addressing class imbalance though multi-objective evolution of neural networks. Dr. Alex Shenfield Sheffield Hallam University

Copyr i g ht 2012, SAS Ins titut e Inc. All rights res er ve d. ENTERPRISE MINER: ANALYTICAL MODEL DEVELOPMENT

Applying 2 k Factorial Design to assess the performance of ANN and SVM Methods for Forecasting Stationary and Non-stationary Time Series

Insights from the Wikipedia Contest

blast furnace stoichiomelry II

Natural Gas Market Prediction via Statistical and Neural Network Methods

Predicting user rating on Amazon Video Game Dataset

Evaluating predictive models for solar energy growth in the US states and identifying the key drivers

Prediction of Axial and Radial Creep in CANDU 6 Pressure Tubes

Day-Ahead Price Forecasting of Electricity Market Using Neural Networks and Wavelet Transform

PREDICTION OF PIPE PERFORMANCE WITH MACHINE LEARNING USING R

Predicting Chinese Mobile Phone Users Based on Combined Exponential Smoothing-Linear Regression Method. Meng-yun JIANAG and Lin BAI *

Application of an Improved Neural Network Algorithm in E- commerce Customer Satisfaction Evaluation

Neural Networks and Applications in Bioinformatics. Yuzhen Ye School of Informatics and Computing, Indiana University

Data Mining Applications with R

OUTOTEC FLASH SMELTING TECHNOLOGY

Predicting Defect Types in Software Projects

Linear model to forecast sales from past data of Rossmann drug Store

Air Pollutants Monitoring Data Recovery of Henhouse Based on QGSA-SVM

Electricity Price Modeling Using Support Vector Machines by Considering Oil and Natural Gas Price Impacts

Neural Networks and Applications in Bioinformatics

Job shop flow time prediction using neural networks

From Fraud Analytics Using Descriptive, Predictive, and Social Network Techniques. Full book available for purchase here.

A Protein Secondary Structure Prediction Method Based on BP Neural Network Ru-xi YIN, Li-zhen LIU*, Wei SONG, Xin-lei ZHAO and Chao DU

Recovery of Copper from Reverberatory Copper Slag and Production of a Leach Residue used as a Portland Cement Additive


Design of Truss-Structures for Minimum Weight using Genetic Algorithms

Let the data speak: Machine learning methods for data editing and imputation

Stock data models analysis based on window mechanism

Structural Damage Detection Using Randomized Trained Neural Networks for Evaluation of Quality of Production Line

Comparison of Different Independent Component Analysis Algorithms for Sales Forecasting

Citrine Informatics. Success stories for AI-assisted materials design. Citrine Informatics. The data analytics platform for the physical world

Price Prediction Evolution: from Economic Model to Machine Learning

A Genetic Algorithm Applying Single Point Crossover and Uniform Mutation to Minimize Uncertainty in Production Cost

THE INFLUENCE OF THE SILICATE SLAG COMPOSITION ON COPPER LOSSES DURING SMELTING OF THE SULFIDE CONCENTRATES

Supplemental Digital Content. A new severity of illness scale using a subset of APACHE data elements shows comparable predictive accuracy

Prediction optimization of mechanical properties of ferrite stainless steels after rolling treatment with use of genetic algorithms

Oxygen activity measurements in simulated converter matte

INDIAN INSTITUTE OF MATERIALS MANAGEMENT Post Graduate Diploma in Materials Management PAPER 18 C OPERATIONS RESEARCH.

Taylor & Francis Taylor & Francis Croup Boca Raton London New York

Predicting Corporate 8-K Content Using Machine Learning Techniques

Determining NDMA Formation During Disinfection Using Treatment Parameters Introduction Water disinfection was one of the biggest turning points for

CONTROL OF PITCH ANGLES TO OPTIMIZE THE AERODYNAMIC USING PARTICLE SWARM OPTIMIZATION

Application of Decision Trees in Mining High-Value Credit Card Customers

FEEMD-DR Model for Forecasting. Water Consumption

Drug Discovery in Chemoinformatics Using Adaptive Neuro-Fuzzy Inference System

Use of Artificial Neural Network for Predicting the Mechanical Property of Low Carbon Steel

EST Accuracy of FEL 2 Estimates in Process Plants

Today. Last time. Lecture 5: Discrimination (cont) Jane Fridlyand. Oct 13, 2005

Index Terms: Customer Loyalty, SVM, Data mining, Classification, Gaussian kernel

Software Next Release Planning Approach through Exact Optimization

3 Ways to Improve Your Targeted Marketing with Analytics

Structural Bioinformatics (C3210) Conformational Analysis Protein Folding Protein Structure Prediction

Transcription:

Metamodelling and optimization of copper flash smelting process Marcin Gulik mgulik21@gmail.com Jan Kusiak kusiak@agh.edu.pl Paweł Morkisz morkiszp@agh.edu.pl Wojciech Pietrucha wojtekpietrucha@gmail.com Piotr Jarosz Faculty of Non-Ferrous Metals pijar@agh.edu.pl Stanisław Małecki Faculty of Non-Ferrous Metals stanmal@agh.edu.pl Piotr Oprocha oprocha@agh.edu.pl Łukasz Sztangret szt@agh.edu.pl Abstract: The aim of the paper is metamodelling of copper flash smelting process and an attempt of its optimization. Flash smelting process is an important step in the metallurgical process of production of copper blister (pure copper). Three different modelling techniques: Artificial Neural Networks (ANN), Support Vector Machines (SVM) and Random Forest (RF) were used in construction of metamodels and the obtained results were compared. The metamodels are used in finding a Pareto-optimal solution (and control parameters) for the considered industrial process. Key Words: Metamodelling, Artificial Neural Networks, Random Forest, Support Vector Machines, optimization, flash copper smelting process 1 Introduction Industrial processes are usually very complex, nonlinear and depend on numerous parameters. Moreover, not all parameters of a real industrial process can be measured, while some of process parameters (signals) depend on many other elements, such as technological condition (wear) of tools, quality of materials or semi-products used in the process, etc. Therefore, we often have to rely only on averaged data, experience of engineers, etc. The models of such industrial processes are usually very complex and their simulations are costly with regard to the computing time. The computation time increases rapidly in case of optimization calculations, requiring numerous iterations (simulation calls). Hence, the motivation for the current research was a replacement of computationally costly model of the analyzed process by a metamodel. Developed metamodels should describe the analyzed process with required accuracy and decrease the computation time of the optimization procedure, as well, can be an important aid in the control of the production. In the present paper we will test three different methods of construction of metamodels: Artificial Neural Networks (see [5, 6]), Support Vector Machines (see [1, 12]) and Random Forests [2, 5]. Our ISBN: 978-1-61804-347-4 52

testing ground will be simplified model of Outotec type copper flash smelting process. We will use data generated by theoretical model of the process (see [3]). 2 Copper flash smelting process We focus on a copper flash smelting process (Outotec type) which is an important step in production of copper blister (99 % Cu). The process takes place in a flash smelting furnace, which schematically is presented in Figure 1. Main feedstock of the process is As a result of smelting in furnace a molten matte is produced. The metamodels have the following four input signals: fuel, O 2, N 2 and x. Constructed metamodels predict (separately) two output parameters: Q 1 - concentration of Cu in molten Cu-F e-s matte (between 45% and 75% of Cu) - a semiproduct which goes to the further production stage - oxidation converting into metallic copper, Q 2 - concentration of F eo in molten F e silicate slag. The aim of the further optimization procedure is obtaining the required values of 1 and 2. Figure 1: Schematic structure of outotec copper flash smelting furnace a dry Cu-F e-s concentrate. Very roughly speaking, dry concentrate is blasted with O 2 in reaction shaft. Furnace is equipped with hydrocarbon fuel burners located concentrically in a reaction shaft. Burners are used to adjust temperature, so that the reaction in the shaft is performed properly, that is a concentrate smelts at specified rate. Main factor of the process is a composition of dry concentrate which is a mixture of Cu 2 S and CuF es 2. Let us assume that the furnace is loaded with 1 000 kg of concentrate. Then the amount (on average) of the Cu, F e and S in the concentrate can be evaluated using the the following relations (see [3]): Cu = 346x + 800(1 x), F e = 304x, S = 350x + 200(1 x), where the parameter x [0, 1] depends on the contents/origin of the concentrate. Parameter x cannot be controlled, since it depends on the composition of the material that is delivered to the plant. There are three parameters of the process that can be controlled, and have an essential impact on the quality of the final product of smelting fuel - fossil fuel burned in the furnace. In the process are used between 0 and 40 kg of fuel. O 2 - mass of O 2 in blast, N 2 - mass of N 2 in blast. 3 Metamodelling of copper flash smelting process One of our main goals was to build and to compare three different approaches to metamodels: ANN, SVM and RFs. Two independent metamodels, separately for each quality parameter, i.e. separate model for Q 1 and Q 2 were constructed using each of these methods. We had 46 741 records (observations) as starting point for building our metamodels. We used 98% of the records (45 707 observations) as training samples, while 2% (934 elements) were used at the testing stage. The first metamodel was built on the basis of Artificial Neural Network 1 (Multi-Layer Perceptron type). The Levenberg-Marquardt backpropagation method was employed in training process of the networks. Since the ANN s accuracy depends on the topology of the network, we trained 50 networks using different topologies for each quality criterion and the best were chosen: 3-layer network of 4-28- 1 structure with tan-sigmoid, tan-sigmoid and linear activation functions for metamodel of Q 1 ; the 4-layer network of 4-22-2-1 structure with log-sigmoid, tansigmoid, tan-sigmoid and linear activation functions for the metamodel describing the Q 2 quality criterion. In the second approach, Support Vector Machines 2 were used as a regression approach in metamodelling of considered process. The learning process was executed using the radial basis kernel and eps-regression type as parameters. The last methodology used for metamodelling was Random Forest. Again, our models were implemented in R, using randomforest package. We constructed two different forests consisting of 500 trees. We used standard entries of the function randomforest in the first case of regression mode 1 Matlab 2011b (Neural Network Toolbox) was used. 2 Package e1071 in R program was used. ISBN: 978-1-61804-347-4 53

and we additionally set parameter 3 nodesize = 5 in the second attempt. To test fitting of our metamodels on verification data we calculated mean error separately for each quality function (here Q i is value in metamodel, and Q i is value in the testing dataset): E = 1 n n Q i Q i. (1) i=1 When constructing our metamodel, we considered 4 modifications of data adding random perturbation, which should reflect problems in evaluation of quality parameters. We considered 4 different levels of perturbation of real (accurate value) of Q i, that is: 0% (accurate measurement), 2%, 5% and 10%. The accuracy of each metamodel was computed using equation (1) for all levels of training data perturbation. The obtained errors are presented in Table 1 and Figure 2. Table 1: Errors of fitting (1) of considered metamodels. Error Pert. RF RF ns=5 SVM ANN 0% Q 1 0,005245 0,005442 0,001432 0,000012 Q 2 0,046774 0,046840 0,010633 0,000123 2% Q 1 0,005374 0,005440 0,000492 0,000077 Q 2 0,046738 0,046967 0,010128 0,000215 5% Q 1 0,005515 0,005515 0,000573 0,000157 Q 2 0,047113 0,046743 0,010628 0,000463 10% Q 1 0,006281 0,006375 0,000763 0,000286 Q 2 0,047022 0,046974 0,011860 0,000701 (a) Error of fitting of Q 1 (b) Error of fitting of Q 2 Figure 2: Graphical illustration of fitting errors (1) of metamodels. 4 Optimization As we mentioned above, two different quality functions were considered. Therefore, it is possible to get only Pareto-optimal solutions. We decided to apply Global Criterion Method (see [8] or [4]), with goals 3 In what follows, we will write RF ns=5 to emphasize that Random Forest with this extra parameter was applied 1 = 0.65 and 2 = 0.54, which are technologically motivated optimal values of quality functions. The multi-objectives problem was transformed into single-objective one, and each of the quality functions was associated with a weight. Recapitulating, the considered objective function had the following form: F = w Q 1 1 1 2 + (1 w) Q 2 2. (2) The assumed feasible decision space of control parameters was technologically limited by the following ranges: fuel: 0 40, O 2 : 122 467, N 2 : 0 1634. All optimization calculations were performed for x = 0.5, which means that the input concentrate was composed of 500 kg of Cu 2 S and 500 kg of CuF es 2. In the first approach we performed singleobjective optimization assuming the equal importance of both quality functions, that is w = 0.5. The optimization process was performed using Monte- Carlo method with 50 000 objective function tests for given metamodel (in this approach we used metamodels trained with no perturbations). In each case we performed optimization 100 times, obtaining 100 different candidates for optimal solution.the best (B), average (A) and worst (W) values of considered optimization criteria (F, Q 1 and Q 2 ) 4 corresponding to the found optimal solutions of the parameters (fuel, O 2 and N 2 ) calculated using each metamodel are presented in Table 2. Table 2: Optimal values of criteria and corresponding parameters. Criteria Parameters F Q 1 Q 2 fuel O2 N2 B 8.78e-4 0.649 0.538 31.5 184 142 ANN A 0,011 0.654 0.537 - - - W 0.0273 0.67 0.517 39 226 430 SVM RF RF ns=5 B 0.0021 0.648 0.539 31.9 187 161 A 0.019 0.653 0.538 - - - W 0.047 0.664 0.482 39 221 421 B 4.29e-4 0.65 0.539 27.8 173 60 A 0.004 0.649 0.537 - - - W 0.0081 0.649 0.537 28.6 167 56 B 1.67e-4 0.65 0.538 27.5 174 61 A 0.005 0.649 0.537 - - - W 0.0096 0.652 0.526 27.9 136 48 We can see, that optimization performed based on RF and RF ns=5, metamodels give much better solutions than in case of optimization based on two other metamodels. However, it must be notice that RF and RF ns=5 metamodels are characterized by the biggest metamodelling errors (see Figure 2). 4 We are looking for the minimum of F, while Q 1, Q 2 should be close to the given, required values 1 = 0.65 and 2 = 0.54 ISBN: 978-1-61804-347-4 54

In second approach, our goal was to obtain Pareto fronts. Therefore, for each metamodel and each level of perturbation we set different values of weight w (in equation (2)) to 0, 0.05,..., 0.95, 1. As previously, in each case we run optimization procedure 100 times. To create Pareto fronts (presented in Figure 3) we used average values of obtained results. (a) Perturbation 0% (b) Perturbation 2% (c) Perturbation 5% (d) Perturbation 10% Figure 3: Pareto front for different metamodels and perturbation ranges Static fitting evaluation (1) comparing metamodels responses on test data is not the only criterion to be taken into account. Applied methodology may have strong influence on the topology of the model and as consequently different metamodels may behave differently when searching for optimal solution of quality function (2). For example, Random Forest always extrapolate its answers in the max-min range provided by data set in learning process, while ANN or SVM may respond with values outside desired range and so penalty functions have to be used. Clearly, there are many other tiny details that have to be taken into account in one model but does not exist in the other approach. 5 Conclusion We presented three different methodologies of metamodeling of copper flash smelting process. Analyzing average errors of fitting models into data (see Table 1 and Figure 2) we see that the best results were obtained in modeling by multi-layer neural network. Especially random forest methodology related to Q 2 was very far from desired. Surprisingly, fitting results of random forest with parameter ns = 5 were only slightly better. One of the reasons can be that some values of control parameters, while in acceptable range, can be unacceptable from technological point of view. For these parameters we probably will not have records in the data set (failure of the process). There is no chance to avoid these regions, since RS methodology is a kind of averaged classification method, hence is extrapolating values between minmax. It seems that in practice, to improve performance of metamodel built using this methodology, some additional records for non-acceptable parameters set should be provided and set with some unacceptable value (e.g. -1 as value of Q 1 which is concentration of Cu in molten Cu-F e-s matte). It suggests further investigation on the structure of forest (and learning methodology) in modeling of the process of flash smelting may be necessary. It is also interesting that performance of SVM increased when data became perturbed (random noise was added). It also behaved much better than ANN in the optimization process when modelling using perturbed data. It suggests that in the case of usually very noisy industrial data, SVM can be good choice. This observation also motivates further investigation in this direction. Acknowledgements: The research was supported by National Science Centre in Poland (NCN), Grant no. DEC-2013/11/B/ST8/00352. References: [1] B. E. Boser, I. M. Guyon, V. N. Vapnik, A Training Algorithm for Optimal Margin Classifiers. Proceedings of the Fifth Annual Workshop on Computational Learning Theory 5, Pittsburgh, 1992, pp. 144-152. [2] L. Breiman, Random Forests, Machine Learning 45, 2001, pp. 5-32 [3] W. G. Davenport, M. D. Jones, M. J. King, E. H. Partelpoeg, Flash Smelting: Analysis, Control and Optimization, The Minerals, Metals & Materials Society, Warrendale 2001. [4] K. Deb, Multi-objective optimization using evolutionary algorithms. John Wiley & Sons, Chichester 2001. [5] T. Hastie, R. Tibshirani, J. Friedman, Jerome. The elements of statistical learning. Data mining, inference, and prediction. Second edition. Springer Series in Statistics. Springer, New York, 2009. [6] J. Kusiak, D. Szeliga, Ł.Sztangret, Modelling techniques for optimizing metal forming processes, in: Microstructure evolution in metal forming processes, Woodhead Publishing Limited, 2012. [7] J. Kusiak, Ł.Sztangret, Ł. Rauch, M. Pietrzyk, Metamodel driven optimization of thermomechanical industrial processes, Computer Methods in Materials Science : quarterly 14, 2014, pp. 2026. ISBN: 978-1-61804-347-4 55

[8] S. S. Rao, Engineering Optimization: Theory and Practice, John Wiley & Sons, New York, 1996. [9] M. E. Schlesinger, M. J. King, K. C. Sole, W. G. Davenport, Extractive Metallurgy of Copper, Fifth Edition. Elsevier Ltd., Amsterdam 2011. [10] Ł Sztangret, D. Szeliga, J. Kusiak, M. Pietrzyk, Application of inverse analysis with metamodelling for identification of metal flow stress, Canadian Metallurgical Quarterly 51, 2012, pp. 440446. [11] Ł. Sztangret, Ł. Rauch, J. Kusiak, P. Jarosz, S. Małecki, Modeling of the oxidizing roasting process of zinc sulphide concentrates using artificial neural networks, Computer Methods in Materials Science : quarterly 11, 2011, pp. 122127 [12] V. N. Vapnik. The Nature of Statistical Learning Theory. 2nd edition, Springer, 1999. ISBN: 978-1-61804-347-4 56