Visual Data Mining: A case study in Thermal Power Plant

Similar documents
Analysis of Boiler Operational Variables Prior to Tube Leakage Fault by Artificial Intelligent System

Session 15 Business Intelligence: Data Mining and Data Warehousing

The Application of Data Mining Technology in Building Energy Consumption Data Analysis

Condition Assessment of Power Transformer Using Dissolved Gas Analysis

IBM SPSS Modeler Personal

International Journal of Computer Engineering and Applications, ICCSTAR-2016, Special Issue, May.16

Building Energy Modeling Using Artificial Neural Networks

Measurement and Control of Coal Pipe Temperature of Coal Mills of PF Boiler

Managing Knowledge in the Digital Firm

DESIGN OF COPD BIGDATA HEALTHCARE SYSTEM THROUGH MEDICAL IMAGE ANALYSIS USING IMAGE PROCESSING TECHNIQUES

PREDICTING EMPLOYEE ATTRITION THROUGH DATA MINING

Artificial Intelligence-Based Modeling and Control of Fluidized Bed Combustion

Data Warehousing. and Data Mining. Gauravkumarsingh Gaharwar

BelVis PRO Enhancement Package (EHP)

Fraudulent Behavior Forecast in Telecom Industry Based on Data Mining Technology

MODULE - 9 LECTURE NOTES 4 DECISION SUPPORT SYSTEMS

Improving the Response Time of an Isolated Service by using GSSN

WKU-MIS-B11 Management Decision Support and Intelligent Systems. Management Information Systems

Development of an intelligent flame monitoring system for gasfired steel reheating furnaces

Multi-Agent Model for Power System Simulation

Business Information Systems. Decision Making and Problem Solving. Figure Chapters 10 & 11

ABB Cement Fingerprint Holistic Approach for Cement Industries

Chapter 8 Analytical Procedures

Combining Back-Propagation and Genetic Algorithms to Train Neural Networks for Start-Up Time Modeling in Combined Cycle Power Plants

An expert system concept for diagnosis and monitoring of gas turbine combustion chambers

Study on Predictive Maintenance Strategy

Optimization of Plastic Injection Molding Process by Combination of Artificial Neural Network and Genetic Algorithm

N.MAFFEI, G.GUIDI, C.VECCHI, G.BALDAZZI Physics Department, University of Bologna, via Irnerio Bologna, Italy

Prediction of permeability from reservoir main properties using neural network

Application of Intelligent Methods for Improving the Performance of COCOMO in Software Projects

Monitoring Wind Farms With Performance Curves

Prediction of Dissolved Oxygen Using Artificial Neural Network

Stock Market Prediction with Multiple Regression, Fuzzy Type-2 Clustering and Neural Networks

Industrial IT : Optimization for the Cement Industry

A Model Based Continuous Improvement Methodology for Sustainable Manufacturing

Association rules model of e-banking services

Dependency Graph and Metrics for Defects Prediction

MANAGING NEXT GENERATION ARTIFICIAL INTELLIGENCE IN BANKING

Is Machine Learning the future of the Business Intelligence?

Bringing the Power of SAS to Hadoop Title

Application of Machine Learning to Financial Trading

The use of optical flame scanners for combustion process analysis in OP-650 power boiler

Copyr i g ht 2012, SAS Ins titut e Inc. All rights res er ve d. ENTERPRISE MINER: ANALYTICAL MODEL DEVELOPMENT

CONNECTING SOCIAL MEDIA TO ECOMMERCE USING MICROBLOGGING AND ARTIFICIAL NEURAL NETWORK

Power-Cost Alternative De-NOx Solutions for Coal-Fired Power Plants

FIS Based Speed Scheduling System of Autonomous Railway Vehicle

Neural Networks and Applications in Bioinformatics. Yuzhen Ye School of Informatics and Computing, Indiana University

The Combined Model of Gray Theory and Neural Network which is based Matlab Software for Forecasting of Oil Product Demand

Experimental design on product development

2016 INFORMS International The Analytics Tool Kit: A Case Study with JMP Pro

BUS Public Transportation System Fuel Efficiency Patterns

Managing Information Systems Seventh Canadian Edition. Laudon, Laudon and Brabston. CHAPTER 11 Managing Knowledge

Feature Selection of Gene Expression Data for Cancer Classification: A Review

Wind turbine vibration study: a data driven methodology

SPM 8.2. Salford Predictive Modeler

Predictive analytics [Page 105]

Best practices from the Italian case: the RES NOVAE project

Genetic Algorithm and Neural Network

FUZZY LOGIC APPROACH TO REACTIVE POWER CONTROL IN UTILITY SYSTEM

International Journal of Scientific & Engineering Research, Volume 6, Issue 10, October-2015 ISSN

Chapter 13 Knowledge Discovery Systems: Systems That Create Knowledge

Anil Kumar*, Ajay Rathore** Ashish Patra ***

Classical Approach to Solve Economic Load Dispatch Problem of Thermal Generating Unit in MATLAB Programming

HEATING LOAD PREDICTIONS USING THE STATIC NEURAL NETWORKS METHOD. S.Sholahudin 1, Hwataik Han 2*

Technical Notes for Biomass and Air Quality Guidance

Information Systems in Organizations. Decision Making. Mintzberg s five 5 organizational parts. Types of Organizations.

BACKPROPAGATION NEURAL NETWORK AND CORRELATION-BASED FEATURE SELECTION FOR EARNING RESPONSE COEFFICIENT PREDICTION

Determining NDMA Formation During Disinfection Using Treatment Parameters Introduction Water disinfection was one of the biggest turning points for

Classification Model for Intent Mining in Personal Website Based on Support Vector Machine

Hyperknowledge in Practice - Users Attitudes to Active DSS

Can Advanced Analytics Improve Manufacturing Quality?

Anomaly Detection in Vessel Tracking Using Support Vector Machines (SVMs)

Stock data models analysis based on window mechanism

Machine Learning Applications in Supply Chain Management

Contributed to AIChE 2006 Annual Meeting, San Francisco, CA. With the rapid growth of biotechnology and the PAT (Process Analytical Technology)

Designing an Effective Scheduling Scheme Considering Multi-level BOM in Hybrid Job Shop

Solution Brief Monitoring and Management For Desktops and Servers

Artificial Neural Networks for predicting cooling load in buildings in tropical countries

Prediction of polymer composite material products using neural networks

A Hybrid Adaptive Rule based System for Smart Home Energy Prediction

Applying RFID Hand-Held Device for Factory Equipment Diagnosis

BullSequana S series. Powering Enterprise Artificial Intelligence

Introduction to Information Systems Fifth Edition

Fault Diagnosis of Rotating Machinery based on Wavelet Packet Transform- A Review

International Journal of Scientific & Engineering Research, Volume 5, Issue 9, September ISSN

A Visualization is Worth a Thousand Tables: How IBM Business Analytics Lets Users See Big Data

WinGD Integrated Digital Expert WiDE System

Reliability Analysis of the Railway Time Synchronization Network Based on Bayesian

Deep Dive into High Performance Machine Learning Procedures. Tuba Islam, Analytics CoE, SAS UK

Data Mining Research on Time Series of E-commerce Transaction. Lanzhou, China.

RELATIONSHIP BETWEEN TRAFFIC FLOW AND SAFETY OF FREEWAYS IN CHINA: A CASE STUDY OF JINGJINTANG FREEWAY

Earthquake damage detection in Tehran s water distribution system

A HYBRID MODERN AND CLASSICAL ALGORITHM FOR INDONESIAN ELECTRICITY DEMAND FORECASTING

e-trans Association Rules for e-banking Transactions

Innovations in Clinical

Prediction of bead reinforcement height and width of Gas Tungsten Arc Welded bead-on plate joints using Artificial Neural Network

Sourcing Optimization

Applying Computational Intelligence in Software Testing

Predicting Corporate Influence Cascades In Health Care Communities

PSS E. High-Performance Transmission Planning Application for the Power Industry. Answers for energy.

Transcription:

Visual Data Mining: A case study in Thermal Power Plant Md Fazullula s 1, Mr. Praveen M P 2, S.S.Mahesh Reddy 3 1 Student, Department of Mechanical Engineering, East Point College of Engineering & Technology, Bangalore, India-560049 2 Associate Professor, Department of Mechanical Engineering, East Point College of Engineering & Technology, Bangalore, India-560049 3 Assistant Professor, Department of Mechanical Engineering, East Point College of Engineering & Technology, Bangalore, India-560049 Abstract Power Generation is a critical and a complicated process, which in turn demands sophisticated systems for control and operation. As the system complexity increases, data overload poses a generic and difficult problem for system operators and operators may find it difficult to cope with a situation that generates an avalanche of electronic data and information. Data Mining is defined as the process of extracting patterns as well as predicting trends from large quantities of data by posing repeated queries. This paper makes an attempt to use Visualization techniques to provide visual understanding of data, patterns and trends from the emission data of a leading thermal Power plant Index Terms Data Mining, Visualization, Thermal power plant, emission. Introduction Rapid progress in technology, together with the scale enlargement, social, financial and political stimuli have changed the scope of society. Systems are becoming more complex and consequently dependent on automation increasingly. Industrial processes including many critical systems such as power generation plants are also becoming more complex with an increasing number of interactions between different domains and plants. Centralized system has become the need of the hour for handling such complex systems efficiently. The technological advances that have happened in the control systems have ensure minimal human intervention which only in critical case like shut down, start up or critical malfunction. As the system complexity increases, data overload poses a generic and difficult problem for system operators and operators may find it difficult to cope with a situation where in they will have to deal with a large of data and information. Although the advances in artificial intelligence, computer graphics, or electronic connectivity etc. promise to help people better understand and manage wider range of activities in a plant more efficiently and effectively. Power producers are looking for ways not only to improve efficiency of power plant assets but also equally concerned about the environmental impacts of power generation without compromising their market competitiveness. Data mining discovers hidden relationships in data, in fact it is part of a wider process called knowledge discovery. Knowledge discovery describes the phases which should be done to ensure reaching meaningful results Data Mining can be used in power plants to predict or forecast system failures, fault diagnosis, predict Emissions and the probable quantum of the same etc. This paper makes an attempt to use advanced visualization, machine learning and domain expertise into an actionable set of tools. I. LITERATURE REVIEW The concepts explained in each paper are considered as a reference throughout the project and also which will enhance quality of the work. The following authors published papers are as follows. Ilamathi P, et al predicted modelling of nitrogen oxides emission from a 210 MW coal Fired thermal power plant with combustion parameter optimization is proposed. The oxygen 110

concentration in flue gas, coal properties, coal flow, boiler load, air distribution scheme, flue gas outlet temperature and nozzle tilt were studied. The parametric field experiment data were used to build artificial neural network (ANN). The coal combustion parameters were used as inputs and nitrogen oxides as output of the model. The predicted values of the ANN model for full load condition were verified with the actual values [8] Tania fonts,et al, showed Artificial Neural Networks (ANN) have been essentially used as regression models to predict the concentration of one or more pollutants usually requiring information collected from air quality stations. In this work we consider a Multilayer Perception (MLP) with one hidden layer as a classifier of the impact of air quality on human health, using only traffic and meteorological data as inputs. Our data was obtained from a specific urban area and constitutes a 2-class problem: above or below the legal limits of specific pollutant concentrations. The results show that an MLP with 40 to 50 hidden neurons and trained with the cross-entropy cost function, is able to achieve a mean error around 11%, meaning that air quality impacts can be predicted with good accuracy using only traffic and meteorological data. [9] Andrew Kusiak, et al., worked on a datamining approach is used to develop a model for optimizing the efficiency of an electric-utility boiler subject to operating constraints. Selection of process variables to optimize combustion efficiency is discussed. The selected variables are critical for control of combustion efficiency of a coal fired boiler in the presence of operating constraints. Two schemes of generating control settings and updating control variables are evaluated. One scheme is based on the controllable and no non controllable variables. The second one incorporates response variables into the clustering process. The process control scheme based on the response variables produces the smallest variance of the target variable due to reduced coupling among the process variables. An industrial case study and its implementation illustrate the control approach developed in this paper [10] Hao Zhou Showed A Modelling Nox emission from coal fired utility boiler is critical to develop predictive emissions monitoring system (PEMS) and to implement combustion optimization software package for low Nox combustion. This paper presents an efficient NOx emissions model based on support vector Regression (SVR), and compares its performance with traditional modelling techniques,i.e., back propagation (BPNN) and generalized regression(grnn) neural networks. A large number of NOx emissions data from an actual power plant, was employed to train and validate the SVR model as well as two neural networks models.moreover,an ant colony optimization(aco)based technique was proposed to select the generalization parameter C and Gaussian kernel parameter g. The focus is on the predictive accuracy and time response characteristics of the SVR model. [11] V Milind S. Mankar et, al describes a systematic approach to predict of nitrogen oxides emission from 270 MW coal fired thermal power plant with the help of artificial neural network. The NO formation mechanism and NOx emission control techniques also describe. The oxygen concentration in flue gas, coal properties coal flow, boiler load, air distribution scheme, flue gas outlet, temperature and nozzle tilt were investigated through field experiment. The predicted values of ANN model for different load condition were verified with the actual values. These parameters help us to ensure to complete combustion and less emission with increased boiler life.[12] Ningling Wang et al showed Large coalfired power generation is a complex process characterized as nonlinear and coupling correlation between the levels of equipment, subsystems and 111

function modules.. With data mining methods such as Support Vector Regression (SVR) and Genetic Algorithm (GA), a huge amount of practical operation data stored in the plant-level Supervisory Information System (SIS) were used to model the energy consumption and optimize the operation parameters for less coal consumption. The results show that the power coal rate reduced significantly under the combination of SVR and GA.[ 13] II. Visual Data Mining The basic idea of Visual Data Mining is to present the data in some visual form, allowing the human to get insight into the data, draw conclusions, and directly interact with the data. III. demand for visual exploration techniques and makes them indispensable in con-junction with automatic exploration techniques. In this research visual data mining using parallel coordinates [2] is used. Visual data mining using parallel coordinates has been used in variety of applications like - quality control [3], process improvement [1] and process modeling [7]. Visual Data Mining Process The Process of visual data mining is indicated below (Fig (a) Process Chart) 1. Plot Process Data from a spreadsheet Visual data mining techniques have proven to be of high value in exploratory data analysis and they also have a high potential for exploring large databases. Visual data exploration is especially useful when little is known about the data and the exploration goals are vague. Since the user is directly involved in the exploration process, shifting and adjusting the exploration goals is automatically done if necessary [5]. The main advantages of Visual Data Mining over automatic data mining techniques from statistics or machine learning are [4]: 1. Visual data exploration can easily deal with highly inhomogeneous and noisy data 2. Visual data exploration is intuitive and requires no under-standing of complex mathematical or statistical algorithms or parameters. As a result, visual data exploration usually allows a faster data exploration and often provides better results, especially in cases where automatic algorithms fail. 3. Visual data exploration techniques provide a much higher degree of confidence in the findings of the exploration. This fact leads to a high 2.Choose the process objective 112

3. Focus on choice The parameters derived from the available data include:- Best Operating Zone - Coal Speed (as an input ) / Load Efficiency Class NO2 Class RESULTS: CASE STUDY The Proposed Visual Data Mining technique has been applied to a Thermal Power Plant data. AUTORULE, a visual analytics software was used for the process of data mining. A total of 744 records was evaluated. The recording was done at different loads. The parameters considered for this research are: 1. Coal Speed 2. Coal Flow (Tons per hour) 3. Primary Air Flow 4. Secondary Air Temperature 5. Burner Tilt 6. Induced Draught 7. Forced Draught 8. Flue Gas Temperature 9. Un burnt Oxygen 10. Sulphur Di Oxide 11. Nitrogen Di Oxide 12. Carbon Di Oxide 13. Date 14. Time 15. Total Air Flow 16. Primary Air Temperature Fig (b) Spreadsheet Of The Records The objective of this study was to identify some of the possibilities where Nox could be high. Any record that had a Nox greater than 250 was considered to be that of higher class and below 250 as lower class. Fig(C) The Full Process Data Visualization. 113

Some of the key points with reference to the Nox emission from the data visualization process include Several similar studies were conducted in the plant and a certain set of rules were from the process of Visual Data mining which includes 1]. IF TAFL > 604 AND CFTPH > 29 THEN [NO2CLASS = HIGH] (12.0) Fig (D).Change Of Nox Wrt Time There was Temporal pattern of NOX change with reference to time. It was observed that Low NOX was during certain days, despite similar operating conditions. However in discussions with the domain experts and the available literature survey the change in NOX can be attributed to coal quality. [2]. IF TAFL > 604 AND BT > 90 AND Load > 203 AND O2E > 4.1 THEN [NO2CLASS = HIGH] (12.0/3.0) [3]. IF TAFL > 604 AND BT > 90 AND Load > 203 AND O2E <= 4.1 THEN [NO2CLASS = LOW] (17.34/3.34) [4]. IF TAFL > 604 AND BT > 90 AND Load <= 203 THEN [NO2CLASS = HIGH] (46.68/4.0) Fig.(e) Relationship Between O2 And Nox It was found from the data as indicated above that when the total airflow is low, NOX is high. This was found to be surprising as the NOX production is possible when O2 and N presence in air is high. i.e. when the total airflow is high. A further investigation in this regard is now being conducted by the domain team at the plant to identify the possible causes for this phenomenon. [5]. IF TAFL > 604 AND BT <= 90 AND BT > 78 THEN [NO2CLASS = LOW] (60.81/12.81) [6]. IF TAFL > 604 AND BT <= 90 AND BT <= 78 AND O2E > 4.9 THEN [NO2CLASS = LOW] 114

These rules derived from the data provide basic guidelines to the management of the power plant for effective control of the NOX emissions. However the rules are being pilot tested before its implementation. Conclusions Visual analytics process accelerates and amplifies the domain knowledge. It has been observed the multivariable analysis done with visual analytics has helped the domain team to formulate the rules for running plant in better operating conditions so as to reduce emissions under any loads. It can be quantified that the observed results will provide base line profit in operating a power plant. References 1. Brooks R W, Wilson G J and Thorpe R J, Geometric Process Control for Improved Alarm Management, AIDIC Italian Association of Chemical Engineers PRES 01 4th Conference Process Integration, Modeling and Optimization for Energy Saving and Pollution Reduction. Florence, Italy May, 2001 2. Inselberg A and Dimsdale B., Parallel Coordinates: A Tool for Visualizing Multidimensional Geometry, in Proc of IEEE Conference. On Visualation. '1990, 361-378, Los Alamitos, CA,1990 3. Inselberg A, Visual Data Mining with Parallel Coordinates, Computational Statistics, 13(1), 47-63, 1998. 4. Inselberg A, Don t Panic... just do it in Parallel!, in Computational Statistics. 14, 53-77,1999. 5. Keim D, Visual exploration of large databases, Communications of the ACM, Vol. 44, no. 8, pp. 38 44, 2001. 6. Keim D A, Information Visualization and Visual Data Mining, IEEE Transactions on Visualization and Computer Graphics, VOL. 7, NO. 1, January-March 2002 7. Wegman E, Hyper dimensional data analysis using parallel coordinates, Journal of American Statistical Association., 85, pp. 664-675, 1990. 8. Ilamathi P, Selladurai V, Balamurugan K., Predictive modelling and optimization of Nox emission from power plant, IAES International Journal of Artificial Intelligence (IJ-AI) Vol. 1, No. 1, March 2012, pp. 11~18 ISSN: 2252-8938. 9. Fonts T, Silva L, Pereira S,Application of Artificial Neural Networks To Predict The Impact Of Traffic Emissions On Human Health, Conference On Artificial Intelligence, EPIA 2013, Angra do Heroísmo, Azores, Portugal, September 9-12, 2013 10. Kusiak A, Member, IEEE, and Zhe Song, Combustion Efficiency Optimization and Virtual Testing: A Data-Mining Approach, Ieee Transactions On Industrial Informatics, Vol. 2, No. 3, August 2006 11. Zhou H, Zhao J, Zheng L, Wanga C, Fa k, Cen F, Modeling Nox Emissions From Coal- Fired Utility Boilers Using Support Vector Regression With Ant Colony Optimization, Engineering Applications of Artificial Intelligence 25 (2012) 147 158 12. Mankar M, Vyawahare M, Jitendra S., Nitrogen Oxides Emission Prediction in Coal Based Thermal Power Plant using Artificial Neural Network, International Journal of Emerging Science and Engineering (IJESE)ISSN: 2319 6378, Volume-2, Issue-7, May 2014 115