Rule Minimization in Predicting the Preterm Birth Classification using Competitive Co Evolution

Similar documents
GENETIC ALGORITHMS. Narra Priyanka. K.Naga Sowjanya. Vasavi College of Engineering. Ibrahimbahg,Hyderabad.

What is Evolutionary Computation? Genetic Algorithms. Components of Evolutionary Computing. The Argument. When changes occur...

Changing Mutation Operator of Genetic Algorithms for optimizing Multiple Sequence Alignment

Evolutionary Algorithms

Recessive Trait Cross Over Approach of GAs Population Inheritance for Evolutionary Optimisation

Genetic Algorithms in Matrix Representation and Its Application in Synthetic Data

VISHVESHWARAIAH TECHNOLOGICAL UNIVERSITY S.D.M COLLEGE OF ENGINEERING AND TECHNOLOGY. A seminar report on GENETIC ALGORITHMS.

Introduction To Genetic Algorithms

Applying Computational Intelligence in Software Testing

Machine Learning. Genetic Algorithms

Machine Learning. Genetic Algorithms

Journal of Global Research in Computer Science PREMATURE CONVERGENCE AND GENETIC ALGORITHM UNDER OPERATING SYSTEM PROCESS SCHEDULING PROBLEM

Genetic Algorithm: A Search of Complex Spaces

A Comparison between Genetic Algorithms and Evolutionary Programming based on Cutting Stock Problem

TIMETABLING EXPERIMENTS USING GENETIC ALGORITHMS. Liviu Lalescu, Costin Badica

Implementation of Genetic Algorithm for Agriculture System

COMBINED-OBJECTIVE OPTIMIZATION IN IDENTICAL PARALLEL MACHINE SCHEDULING PROBLEM USING PSO

Intelligent Techniques Lesson 4 (Examples about Genetic Algorithm)

Intro. ANN & Fuzzy Systems. Lecture 36 GENETIC ALGORITHM (1)

Evolutionary Algorithms - Population management and popular algorithms Kai Olav Ellefsen

Improvement of Control System Responses Using GAs PID Controller

PARALLEL LINE AND MACHINE JOB SCHEDULING USING GENETIC ALGORITHM

The Metaphor. Individuals living in that environment Individual s degree of adaptation to its surrounding environment

Energy management using genetic algorithms

Genetic Algorithms-Based Model for Multi-Project Human Resource Allocation

TRAINING FEED FORWARD NEURAL NETWORK USING GENETIC ALGORITHM TO PREDICT MEAN TEMPERATURE

Optimization of Riser in Casting Using Genetic Algorithm

Evolutionary Algorithms and Simulated Annealing in the Topological Configuration of the Spanning Tree

A Genetic Algorithm on Inventory Routing Problem

Parameter identification in the activated sludge process

Measurement and Control of Coal Pipe Temperature of Coal Mills of PF Boiler

Genetic Algorithm. Presented by Shi Yong Feb. 1, 2007 Music McGill University

Designing High Thermal Conductive Materials Using Artificial Evolution MICHAEL DAVIES, BASKAR GANAPATHYSUBRAMANIAN, GANESH BALASUBRAMANIAN

A HYBRID MODERN AND CLASSICAL ALGORITHM FOR INDONESIAN ELECTRICITY DEMAND FORECASTING

Storage Allocation and Yard Trucks Scheduling in Container Terminals Using a Genetic Algorithm Approach

AUTOMATIC TEST CASE GENERATION BASED ON GENETIC ALGORITHM

Public Key Cryptography Using Genetic Algorithm

Deterministic Crowding, Recombination And Self-Similarity

The Making of the Fittest: Natural Selection in Humans

Genetic Algorithm with Upgrading Operator

The Theory of Evolution

AN IMPROVED ALGORITHM FOR MULTIPLE SEQUENCE ALIGNMENT OF PROTEIN SEQUENCES USING GENETIC ALGORITHM

-Is change in the allele frequencies of a population over generations -This is evolution on its smallest scale

Reproduction Strategy Based on Self-Organizing Map for Real-coded Genetic Algorithms

Genetic Algorithms using Populations based on Multisets

Derivative-based Optimization (chapter 6)

An Efficient and Effective Immune Based Classifier

Construction and Application of Mathematical Model Based on Intelligent Test Paper Generation Algorithm

11.1 Genetic Variation Within Population. KEY CONCEPT A population shares a common gene pool.

Routing Optimization of Fourth Party Logistics with Reliability Constraints based on Messy GA

Genetic Equilibrium: Human Diversity Student Version

Evolutionary Computation. James A. Foster Prof., Biological Sciences Initiative for Bioinformatics & Evol. Studies University of Idaho

Genetically Evolved Solution to Timetable Scheduling Problem

Genetic Algorithms. Lecture Notes in Transportation Systems Engineering. Prof. Tom V. Mathew

Permutation Free Encoding Technique for Evolving Neural Networks

GENETIC ALGORITHM CHAPTER 2

CEng 713 Evolutionary Computation, Lecture Notes

Optimizing Genetic Algorithms for Time Critical Problems

Optimization of Software Testing for Discrete Testsuite using Genetic Algorithm and Sampling Technique

The Weapon Target Assignment Strategy Research on Genetic Algorithm

AP BIOLOGY Population Genetics and Evolution Lab

FacePrints, Maze Solver and Genetic algorithms

Multi-product inventory optimization in a multiechelon supply chain using Genetic Algorithm

Coevolution, Memory and Balance

Keywords Genetic, pseudorandom numbers, cryptosystems, optimal solution.

INFLUENCE OF DATA QUANTITY ON ACCURACY OF PREDICTIONS IN MODELING TOOL LIFE BY THE USE OF GENETIC ALGORITHMS

On Naïve Crossover Biases with Reproduction for Simple Solutions to Classification Problems

Evolutionary Computation

LAB ACTIVITY ONE POPULATION GENETICS AND EVOLUTION 2017

Unraveling the Evolution of Defectors in Online Business Games

A NEW MUTATION OPERATOR IN GENETIC PROGRAMMING

Adaptive Genetic Programming applied to Classification in Data Mining

Using Chaos in Genetic Algorithms

Immune Programming. Payman Samadi. Supervisor: Dr. Majid Ahmadi. March Department of Electrical & Computer Engineering University of Windsor

Proceeding, Seminar of Intelligent Technology and Its Applications (SITIA 2000) Graha Institut Teknologi Sepuluh Nopember, Surabaya, 19 April 2000

GENETIC ALGORITHMS FOR DESIGN OF PIPE NETWORK SYSTEMS

Genetic Algorithm and Neural Network

3. A student performed a gel electrophoresis experiment. The results are represented in the diagram below.

Optimization in Allocating Goods to Shop Shelves. for Cup Noodles

An Improved Grouping Genetic Algorithm

Using evolutionary techniques to improve the multisensor fusion of environmental measurements

Chapter 6: SELECTION. 6.1 Introduction

Using Neural Network and Genetic Algorithm for Business Negotiation with Maximum Joint Gain in E-Commerce

An Evolutionary Algorithm Based On The Aphid Life Cycle

OPTIMIZATION OF A THREE-PHASE INDUCTION MACHINE USING GENETIC ALGORITHM

Symbiotic Combination as an Alternative to Sexual Recombination in Genetic Algorithms

NEUROEVOLUTION AND AN APPLICATION OF AN AGENT BASED MODEL FOR FINANCIAL MARKET

Optimization of Plastic Injection Molding Process by Combination of Artificial Neural Network and Genetic Algorithm

Evolutionary Algorithms - Introduction and representation Jim Tørresen

A GENETIC ALGORITHM FOR POLYTECHNIC TIME TABLING (EEPIS Timetabling Case Study)

The Effects of Mutation and Directed Intervention Crossover when applied to Scheduling Chemotherapy

OPTIMIZATION OF GROUNDWATER RESOURCES MANAGEMENT IN POLLUTED AQUIFERS

Economic Design of Control Chart

A Fast Genetic Algorithm with Novel Chromosome Structure for Solving University Scheduling Problems

AP Biology Laboratory 8 Population Genetics Virtual Student Guide

Journal of Chemical and Pharmaceutical Research, 2014, 6(5): Research Article

An Improved Immune Genetic Algorithm for Capacitated Vehicle Routing Problem

Forecasting Euro United States Dollar Exchange Rate with Gene Expression Programming

Simulation-Based Analysis and Optimisation of Planning Policies over the Product Life Cycle within the Entire Supply Chain

Cognitive Radio Spectrum Management

Transcription:

Indian Journal of Science and Technology, Vol 9(10), DOI: 10.17485/ijst/2016/v9i10/88902, March 2016 ISSN (Print) : 0974-6846 ISSN (Online) : 0974-5645 Rule Minimization in Predicting the Preterm Birth Classification using Competitive Co Evolution Jyothi Thomas 1* and G. Kulanthaivel 2 1 Department of Computer Science and Engineering, Sathyabama University, Chennai - 600119, Tamil Nadu, India; phd.jthomas@gmail.com 2 National Institute of Technical Teachers Training and Research, Chennai - 600113, Tamil Nadu, India; gkveldr@gmail.com Abstract Objective: Accurate prediction of pre term birth probability in the deliveries of babies with an effective classifier tool is a big challenging task. This paper talks about a novel competitive co evolution rule prediction classifier for extracting minimum number of rules for identifying the preterm birth. Methods/Analysis: Competitive Co evolution algorithm is applied to the preterm classifier for deriving the pattern governing the classified dataset. In this approach we have used two classes namely normal birth dataset and preterm birth dataset as two individuals competing with each other. Fitness of each individual is calculated based on the relative fitness of the other population. Findings: The dataset consist of 1052 records of preterm birth and normal dataset of 1314 each of five attributes. The experimental result shows the total no of rules needed for training and testing is drastically reduced compared to the total rules. The accuracy is also improved to a greater extended by applying the proposed algorithm. Applications/Improvement: The proposed algorithm minimizes the number of training rules to 16 and testing rules to 11 out of total 28 possible rules in the rule set. The accuracy of 0.962938881664499 of correct classification of preterm birth dataset is obtained through method. Keywords: Co Evolution, Preterm Birth, Reproduction, Rules Extraction 1. Introduction Pre term birth is the delivery of baby at least three weeks before the delivery date i.e. usually before 37 weeks of pregnancy 1. The pre term birth child has the risk of hearing problems, eyes problem and delay in the development of various organs. There are many causes for preterm birth mainly multiple pregnancies, diabetics, blood pressure, mental stress and smoking tobacco. Assisted reproductive technology for achieving pregnancy is also one of the factors for pre term birth. The probability of early prediction 2 of pre term is very difficult and no exact test to predict since the causes of pre term is unpredictable. Every year millions of preterm birth occurs all over the world causing a serious concern in health related issues to these new born babies. Evolution 8 is the process of evolving things better and better over a period of generations resulting in better future traits. Darwin s theory says survival of the fittest which is the back bone of the evolutionary algorithms. Evolutionary algorithms are applied in optimization 3 problems to derive the best possible solution closest to the reality. Competitive co-evolution 4 is the concept of evolving to traits 5 better and better over the competition exists between these two traits. The two different populations of species evolves over generations in competing against each other which makes them to evolve better and better as the generations increases. Genetic Algorithm is applied on each population for evolving its traits over the generations. Genetic algorithm contains five phases namely initial population of individuals, selection of parents, Reproduction, mutation and selection of individuals for next generation. Initial stage is the representation of individuals in the population in the form of chromosomes. There are two ways of representing chromosomes namely binary chromosome and real valued chromosomes. In binary chromosomes * Author for correspondence

Rule Minimization in Predicting the Preterm Birth Classification using Competitive Co Evolution every gene is represented either using 0 or 1 which are the values for the attribute. In real valued chromosomes the gene value is represented as floating value. The next phase is the selection of parents for reproductions where the individuals are selected from the set of population based on fitness function. Every individual in the population is evaluated using the fitness function. Fitness function 6 is the mathematical formula mapping of individual with respect to the solution of the problem. There are many selection 7 strategies for selecting parents for reproduction. Random selection, roulette wheel selection, tournament selection 8, Boltzmann selection are few selection techniques used for selecting individuals for reproduction. Reproduction 9 is the process of exchanging the genetic material between the parents to evolve new offspring s. Mutation 10 is applied once in n generations to enhance the speed of convergence of reaching the optimized results at a faster rate. Mutation is the sudden biological change in the chromosomes genetic value by slightly triggering unknown changes which modifies the genetic composition of the chromosome. 2. Methodology Competitive Coevolution 11-13 algorithm is applied to the preterm classifier for deriving the pattern governing the classified dataset. The competitive co evolution algorithm improves the quality of both the population of two groups which compete each other. In this approach we have used two classes namely normal birth dataset and preterm birth dataset as two individuals competing with each other. Fitness of each individual is based on the relative fitness of the other population. Figure 1 contains the proposed competitive co-evolution preterm classifier algorithm which generated the pattern for classification. Figure 1. Pseudo code for the competitive co evolution preterm classifier algorithm. 2 Vol 9 (10) March 2016 www.indjst.org Indian Journal of Science and Technology

Jyothi Thomas and G. Kulanthaivel 3. Implementation The prediction of identifying the class of preterm birth dataset is based on extracting the rules. The competitive co evolution technique is applied for the classification of pre term birth datasets. The proposed Competitive co evolution preterm classifier algorithm extracts rules for the prediction of preterm birth class. The various steps implemented in the algorithm as shown in Figure 1 are discussed below. Steps: 1. Initialization: In this phase two populations C 1 and C 2 representing the preterm birth and normal class respectively are randomly generated with n and m individuals. Each individual represents a real valued chromosome of five attributes randomly generated using uniform distribution. 2. Measure of Relative Fitness: Every individual in each population C 1 and C 2, the relative fitness is measured with respect the opponent individuals in the other population. The relative fitness of each individual in C 1 is measured using the opponents of individuals in C 2. For each individual xi belongs to C 1, 50 opponents are randomly selected and tournament is conducted with each selected individual opponents in C 2. The result of the tournament is calculated in terms of score which is 1 if x i wins (i.e. x i fitness is greater than the opponent) and 0 if x i losses (i.e. x i fitness is less than the opponent). The Equation 1 represents the measure of relative fitness of every individual in the population. å n Relative fitness R F(xi)= ( S ) j =1 ij (1) The relative fitness is the summation of the outcome of all the matches conducted in the tournament by summing the results S ij. The result is either win or loss represented by 1 or 0 respectively based on the fitness of the opponent as per the Equation 2. ì 1if f(x i) > f(y j) Where S ij = ï í ï ïî 0 otherwise (2) Where y j is the individual from the opponent population 3. Reproduction: Reproduction is the process of generating new offspring s by exchanging genetic composition with the parent chromosomes. Parents are selected for reproduction based on the relative fitness measured in the above step. Crossing over of genetic materials is simulated as per the given Equation 3. x j (t)= (1-γ) x 1j (t) + γ x 2j (t) (3) Where x 1j (t) and x 2j (t) are the jth component of the two parents respectively and γ is a constant with a value of 0.5. 4. Mutation: Mutation is performed by altering or modifying the offspring once in five generations to speed up the convergence of solution. Next generation of individuals are evolved for the population C1. 5. The steps 2 to 4 are performed on the individuals of the other population C2 with C1 as the opponents until the stopping condition is satisfied. 6. Extraction of Rules: Repeat step 2 to 5 until the stopping condition is satisfied. Select the top 10% of individuals from the population C1 and rules are extracted in the form of IF----THEN statements. Rule 1: IF (range 1 >=a 1 && a 1 ) then class=preterm Rule 2: IF ((range 1 >=a 1 && a 1 <=range2) AND (range 1 >=a 2 && a 2 <=range2)) then class=pre-term. Rule n: IF ((range1 >=a 1 && a1<=range2) AND (range 1 >=a 2 && a 2 ) AND ((range 1 >=a k && a k ) then class=pre-term Where a 1,a 2,.a n are attributes of the preterm birth dataset. Totally 28 rules are generated by combining the attributes. 4. Experiments and Results Experiments are conducted using pre term birth datasets. The dataset consist of 1052 records of preterm birth and normal dataset of 1314 each of five attributes. Out of 1052 records 80% of records selected in random is used for training and the remaining 20% of datasets for testing. The preterm birth datasets represent the population C 1 and the normal dataset represents the population C 2 as per the algorithm. At the end of the algorithm top 10% of the best individuals representing the population C 1 is taken as the solution. Rules are extracted from the solution for Vol 9 (10) March 2016 www.indjst.org Indian Journal of Science and Technology 3

Rule Minimization in Predicting the Preterm Birth Classification using Competitive Co Evolution predicting the preterm classifier in the form of IF THEN. Statement using the attributes in the condition. The general formats of the rules are IF (min 1 <a 1 <max 1 ) AND (min 2 <a 2 <max 2 ) AND (min i <a i <max i ) THEN class = Preterm. Where a i =a 1,a 2,. a n number of attributes. Totally 28 rules are generated from the solution set in the form of IF.THEN statement as defined above. The experimental result shows the total no of rules needed for training and testing is drastically reduced compared to the total rules. The accuracy is also improved to a greater extended by applying the proposed algorithm. measured in terms of accuracy of prediction. Accuracy of prediction is given by the formula Accuracy = (True Positive + True Negative)/ (True Positive + True Negative + False Positive + False Negative) (4) Maximum accuracy of 0.9629 is obtained in the proposed algorithm which is very effective in terms of classification of preterm birth datasets. 5. Conclusion The proposed competitive co evolution preterm classifier extracts the prediction rules for classifying the pre term datasets. The accuracy of 0.9629 which is obtained at iteration=200 is the best performance of this algorithm. The proposed algorithm also tries to minimize the no of prediction rules needed for classifying under training and testing data. The classification of preterm birth can also be implemented by using the splitting of attributes into two groups as opponents and the attributes value can be competitively evolved over the generations using genetic algorithm. 6. References Figure 2. Optimisation of rules interms of training and testing based on number of iterations. Figure 3. Performance interms of accuracy. In Figure 2 the no of rules required for training and testing decreases as the iteration increases. It is found that a minimum of 16 rules for training and 11 rules for testing is required for the classification of preterm birth datasets out of totally 28 existing rules. The rules are pruned and the minimization of rules is obtained when the iteration is 400. In Figure 3 the performance of the algorithm is 1. Sonek JD, Iams JD, Blumenfeld M, Johnson F, Landon M, Gabbe S. Measurement of cervical length in pregnancy: comparison between vaginal ultrasonography and digital examination. Obstetrics Gynecology. 1990; 76(2): 172 75. 2. Back T. Selective Pressure in Evolutionary Algorithms: A Characterization of Selection Mechanisms. Proceedings of the First IEEE World Congress on Computational Intelligence; Conference on Evolutionary Computation; Orlando: FL,.1994; 1: 57 62. 3. Angeline PJ, Pollack JB. Competitive Environments Evolve Better Solutions for Complex Tasks. Proceedings of the Fifth International Conference on Genetic Algorithms; 1993. p. 1 8. 4. Nguyen S, Zhang M, Johnston M, Tan KC, Automatic Design of Scheduling Policies for Dynamic Multi-objective Job Shop Scheduling via Cooperative Coevolution Genetic Programming. IEEE Transactions on Evolutionary Computation. 2014; 18 (2): 193 208. 5. Holland JH. ECHO: Explorations of Evolution in a Miniature World. Ed: J.D. Farmer and J. Doyne. Proceedings of the Second Conference on Artificial Life; 1990. 6. Le Ly D, Lipson H. Optimal Experiment Design for Coevolutionary Active Learning. IEEE Transactions on Evolutionary Computation. 2014; 18 (3):394 404. 7. Catley C, Frize M, Walker CR, Petriu DC. Predicting High- 4 Vol 9 (10) March 2016 www.indjst.org Indian Journal of Science and Technology

Jyothi Thomas and G. Kulanthaivel Risk Preterm Birth Using Artificial Neural Networks. IEEE Transactions on Information Technology in Biomedicine. 2006; 10(3): 540 49. 8. Michalewicz Z. Genetic Algorithms + Data Structures = Evolutionary Programs. Springer: Berlin. 1996. 9. Cohen WW. Fast Effective Rule Induction. Proceedings of twelth conference (ML 95); 1995. p. 115 23. 10. Wei F, Wang Y, Zong T. A Novel Cooperative Coevolution for Large Scale Global Optimization. 2014 IEEE International Conference on Systems, Man, and Cybernetics; San Diego, CA: USA. 2014. p. 738 41. 11. Chandra R, Bali K. Competitive two-island cooperative coevolution for real parameter global optimization. IEEE Congress on Evolutionary computing (CEC); Sendai. May 2015. p. 93 100. 12. Thomas J, Kulanthaivel G. Classification of Preterm Birth with Optimized Rules by Mutating the Cognitive Knowledge of PSO. Indian Journal of Science and Technology. 2015 July; 8(15):1 5. 13. Nguyen S, Zhang M, Johnston M, Tan KC, Automatic Design of Scheduling Policies for Dynamic Multi-objective Job Shop Scheduling via Cooperative Coevolution Genetic Programming. IEEE Transactions on Evolutionary Computation. 2014; 18 (2): 193 208. Vol 9 (10) March 2016 www.indjst.org Indian Journal of Science and Technology 5