Supplemental Digital Content. A new severity of illness scale using a subset of APACHE data elements shows comparable predictive accuracy

Similar documents
EVOLUTIONARY ALGORITHMS AT CHOICE: FROM GA TO GP EVOLŪCIJAS ALGORITMI PĒC IZVĒLES: NO GA UZ GP

IMPLEMENTATION OF AN OPTIMIZATION TECHNIQUE: GENETIC ALGORITHM

Available online at ScienceDirect. Procedia Computer Science 102 (2016 )

What is Evolutionary Computation? Genetic Algorithms. Components of Evolutionary Computing. The Argument. When changes occur...

CHAPTER 3 RESEARCH METHODOLOGY

Performance Analysis of Multi Clustered Parallel Genetic Algorithm with Gray Value

The Impact of Population Size on Knowledge Acquisition in Genetic Algorithms Paradigm: Finding Solutions in the Game of Sudoku

Evolutionary Algorithms

Assoc. Prof. Rustem Popa, PhD

Available online at International Journal of Current Research Vol. 9, Issue, 07, pp , July, 2017

Feature Selection for Predictive Modelling - a Needle in a Haystack Problem

Introduction to Genetic Algorithm (GA) Presented By: Rabiya Khalid Department of Computer Science

Keywords Genetic Algorithm (GA), Evolutionary, Representation, Binary, Floating Point, Operator

APPLICATION OF COMPUTER FOR ANALYZING WORLD CO2 EMISSION

Mrs. Shahana Gajala Qureshi 1, Mrs. Uzma Arshi Ansari 2

Rule Minimization in Predicting the Preterm Birth Classification using Competitive Co Evolution

Intro. ANN & Fuzzy Systems. Lecture 36 GENETIC ALGORITHM (1)

Evolutionary Computation. Lecture 1 January, 2007 Ivan Garibay

2. Genetic Algorithms - An Overview

Introduction To Genetic Algorithms

Fixed vs. Self-Adaptive Crossover-First Differential Evolution

Applying Computational Intelligence in Software Testing

Evolutionary Computation

Gene Selection in Cancer Classification using PSO/SVM and GA/SVM Hybrid Algorithms

10. Lecture Stochastic Optimization

Genetic Algorithm and Neural Network

Implementation of Genetic Algorithm for Agriculture System

Introduction to Artificial Intelligence. Prof. Inkyu Moon Dept. of Robotics Engineering, DGIST

Computational Intelligence Lecture 20:Intorcution to Genetic Algorithm

Improving Differential Evolution Algorithm with Activation Strategy

Evolutionary Computation. Lecture 3. Evolutionary Computation. X 2 example: crossover. x 2 example: selection

A Comparison between Genetic Algorithms and Evolutionary Programming based on Cutting Stock Problem

Genetic Algorithm: A Search of Complex Spaces

An Evolutionary Approach involving Training of ANFIS with the help of Genetic Algorithm for PID Controller Tuning

COMBINED-OBJECTIVE OPTIMIZATION IN IDENTICAL PARALLEL MACHINE SCHEDULING PROBLEM USING PSO

Recessive Trait Cross Over Approach of GAs Population Inheritance for Evolutionary Optimisation

Genetic Algorithm for Variable Selection. Genetic Algorithms Step by Step. Genetic Algorithm (Holland) Flowchart of GA

CEng 713 Evolutionary Computation, Lecture Notes

Multi-Level Neutrality in Optimization

Validity Constraints and the TSP GeneRepair of Genetic Algorithms

[Sharma* et al., 5(6): June, 2016] ISSN: IC Value: 3.00 Impact Factor: 4.116

Evolutionary Algorithms - Population management and popular algorithms Kai Olav Ellefsen

Changing Mutation Operator of Genetic Algorithms for optimizing Multiple Sequence Alignment

What is Genetic Programming(GP)?

PERFORMANCE EVALUATION OF GENETIC ALGORITHMS ON LOADING PATTERN OPTIMIZATION OF PWRS

Breast Cancer Diagnostic Factors Elimination via Evolutionary Neural Network Pruning

Genetic Algorithms in Matrix Representation and Its Application in Synthetic Data

International Journal of Research in Advent Technology Available Online at:

An Improved Immune Genetic Algorithm for Capacitated Vehicle Routing Problem

Processor Scheduling Algorithms in Environment of Genetics

Path-finding in Multi-Agent, unexplored And Dynamic Military Environment Using Genetic Algorithm

Repeated Sequences in Genetic Programming

Comparative Study of Different Selection Techniques in Genetic Algorithm

TOLERANCE ALLOCATION OF MECHANICAL ASSEMBLIES USING PARTICLE SWARM OPTIMIZATION

Genetic Algorithms. Lecture Overview. A Brief History. Evolutionary Computation. Biological Systems: A rough guide. Biological Systems: A rough guide

AUTOMATED INTERPRETABLE COMPUTATIONAL BIOLOGY IN THE CLINIC: A FRAMEWORK TO PREDICT DISEASE SEVERITY AND STRATIFY PATIENTS FROM CLINICAL DATA

Journal of Global Research in Computer Science PREMATURE CONVERGENCE AND GENETIC ALGORITHM UNDER OPERATING SYSTEM PROCESS SCHEDULING PROBLEM

Genetic Algorithm: An Optimization Technique Concept

Iteratively Inferring Gene Regulatory Networks with Virtual Knockout Experiments

Genetic Algorithm with Upgrading Operator

Research of Product Design based on Improved Genetic Algorithm

Tumor Detection Using Genetic Algorithm

Part 1: Motivation, Basic Concepts, Algorithms

SEISMIC ATTRIBUTES SELECTION AND POROSITY PREDICTION USING MODIFIED ARTIFICIAL IMMUNE NETWORK ALGORITHM

Genetic Programming for Symbolic Regression

Genetic Algorithms for Optimizations

Evolutionary Algorithms for Cardiovascular Decision Support

Design and Implementation of Genetic Algorithm as a Stimulus Generator for Memory Verification

GENETIC ALGORITHMS. Narra Priyanka. K.Naga Sowjanya. Vasavi College of Engineering. Ibrahimbahg,Hyderabad.

Genetic approach to solve non-fractional knapsack problem S. M Farooq 1, G. Madhavi 2 and S. Kiran 3

CHAPTER 4 MAINTENANCE OPTIMIZATION USING GENETIC ALGORITHM

Optimal Design of Laminated Composite Plates by Using Advanced Genetic Algorithm

GENETIC ALGORITHM A NOBLE APPROACH FOR ECONOMIC LOAD DISPATCH

COMPARATIVE STUDY OF SELECTION METHODS IN GENETIC ALGORITHM

Geometric Semantic Genetic Programming Algorithm and Slump Prediction

Evolutionary Algorithms

Introduction Evolutionary Algorithm Implementation

AUTOMATIC TEST CASE GENERATION BASED ON GENETIC ALGORITHM

Multi-Plant Multi-Product Aggregate Production Planning Using Genetic Algorithm

Optimisation and Operations Research

Comparison of some fine-grained and coarse-grained parallel genetic algorithms with population reinitialization

Fig Flowchart of bacterial evolutionary algorithm

ADVANCED data analysis algorithms have a gradually

TIMETABLING EXPERIMENTS USING GENETIC ALGORITHMS. Liviu Lalescu, Costin Badica

Genetic algorithms. History

PDGA: the Primal-Dual Genetic Algorithm

Multi-product inventory optimization in a multiechelon supply chain using Genetic Algorithm

OPTIMAL DESIGN METHOD FOR BUILDINGS & URBAN ENERGY SYSTEMS USING GENETIC ALGORITHMS. Tokyo, Japan. Tokyo, Japan

The Metaphor. Individuals living in that environment Individual s degree of adaptation to its surrounding environment

Adaptive Genetic Algorithm for Sensor Coarse Signal Processing

Comparison of Three Evolutionary Algorithms: GA, PSO, and DE

Genetic Algorithms. Moreno Marzolla Dip. di Informatica Scienza e Ingegneria (DISI) Università di Bologna.

CONTROL OF PITCH ANGLES TO OPTIMIZE THE AERODYNAMIC USING PARTICLE SWARM OPTIMIZATION

Farhad A. Kamangar, Ph.D. Department of Computer Science and Engineering The University of Texas at Arlington Arlington, TX (817)

College of information technology Department of software

Optimal Allocation and Contingency Analysis Studies of Embedded Generation in Distribution Systems

Designing a New Particle Swarm Optimization for Make-with-Buy Inventory Model

Re-Labeling Differential Evolution for Combinatorial Optimization and Interactive Evolutionary Computation

LONG-HAUL FREIGHT SELECTION FOR LAST-MILE COST REDUCTION

Selecting an Optimal Compound of a University Research Team by Using Genetic Algorithms

Transcription:

Supplemental Digital Content A new severity of illness scale using a subset of APACHE data elements shows comparable predictive accuracy Alistair E. W. Johnson, BS Centre for Doctoral Training in Healthcare Innovation, Department of Engineering Science, University of Oxford, Oxford, UK and the Institute of Biomedical Engineering, Department of Engineering Science, University of Oxford, Oxford, UK Andrew A. Kramer, PhD Cerner Corporation, Vienna, VA Gari D. Clifford, PhD Department of Engineering Science, University of Oxford, Oxford, United Kingdom 1

Appendix I. Description of the genetic algorithm methodology used in this study In order to select variables yielding the most information for a severity score, we employed a machine learning technique known as genetic algorithms (GA) (1). GAs belong to a class of optimizers known as evolutionary algorithms (2), as they mimic real life processes found in nature. The theoretical groundwork of the technique was originally developed in the late 1950s, and then used to solve a variety of practical engineering problems in the early 1970s (3, 4). The GA implementation in this work is based upon that of Siedlecki and Sklansky (5). A brief description of the GA used in this work follows. Each one of the 37 potential measurements is represented as a binary (0, 1) variable: a so called gene. Initially genes are randomly assigned to be either 0 (off, or not included in the model) or 1 (on / included). The collection (vector) of genes is called a chromosome. The collection of genes is called a chromosome. There are a fixed number of chromosomes, each with 37 genes that make up what is known as the population. The number and location of the genes that are turned on varies across chromosomes. Each chromosome receives a fitness value. Fitness is assessed by utilizing the genes that are turned on in a predictive model, e.g. logistic regression. In our study fitness was determined by a severity score developed using the Particle Swarm Optimization (PSO) technique described below. The GA operates by evaluating all of the chromosomes in a population. The population is then subjected to a natural selection cycle whereby a proportion of the chromosomes with the lowest fitness are removed. Conversely, a fixed proportion of the best performing chromosomes are left unchanged to ensure that the best combinations in the new generation perform at least as well as the previous generation. The remaining members of the population are then constructed via a process that mirrors genetic crossover. Chromosomes are matched via a fitness weighted Monte Carlo method, and fragments from the distinct chromosomes are combined. Figure E1 shows an example of this process. Figure E1. Schematic diagram of crossovers in genetic algorithms Finally, genes are randomly selected to mutate. In the GA used here that meant reversing a gene s value, i.e. switched from on to off or vice versa. The resultant collection of chromosomes then 2

cycles through the evaluation natural selection crossover mutation process. This continues from one iteration to the next until the best chromosome does not improve the AUROC. The genes on the best chromosome which are turned on represent the variables that are selected for creating OASIS. A schematic diagram of the GA process is given in Figure E2. Figure E2. The genetic algorithm process Define genes, select population parameters Randomly generate {0, 1} for each gene across all chromosomes Run support vector machine algorithm to predict ICU mortality on each chromosome in the population. Derive AUROC for each chromosome: fitness value. Rank chromosomes based on fitness. Keep best two chromosomes. Assign probability of being a parent based on fitness. Select parents for crossover. Perform crossover to create new chromosomes. Randomly mutate a set % of genes. Check for convergence of solution. Yes Best chromosome mandates which genes (i.e. variables) will be used for developing the severity of illness score. No 3

Appendix II. Description of the particle swarm optimization methodology used in this study To select the weights for the components making up OASIS we used particle swarm optimization (PSO), a non linear optimization technique widely applied in the fields of bio informatics and power systems (6, 7). PSO is a population based search in that it randomly initializes a set of candidate solutions ( particles ) and iteratively updates them until the best solution found ceases to improve. The implementation of PSO in this work now follows. First, 10 quantiles of the data were calculated for each variable, resulting in 10 ranges over which scores could be optimized. Initially 24 candidate solutions (particle positions ) were initialized by sampling from a uniform random distribution in the range (0,10). For each candidate solution, an additional update vector (particle velocity ) was initialized to 0. Finally, each particle has an associated prior best position which saves the best performing particle s position across all iterations. Thus, for each particle there is: a position (the values of the score), a velocity (used in the iterative update to the score), and a prior best position (also used in the iterative update to the score). The algorithm then begins iteratively optimizing the candidate solutions. First, each particle position is assessed by the fitness function and assigned a score, where the fitness function is the AUROC of a score calculated on the development data set using the current particle position as the individual in the severity score. The ranges for the severity score are kept fixed using ten data derived quantiles. If the current particle position scores higher than its prior best position, then the prior best position is set to the current position. The best performing prior best position is then assigned as the globally best position. The velocity is then updated as a weighted sum of the current velocities, the difference between the current position and the prior best position, and the difference between the current position and the globally best position. These weights are randomly selected from the range (0, 2.05). Thus the solutions are encouraged to return to their previous best position (similar to a local search), and toward the global best position (similar to a global search), providing a flexible trade off between these two paradigms. This process is repeated and the algorithm terminates when the global best position's score no longer improves. This global best position is the severity score output by the algorithm. Figure E3 shows as an example how a weight for a patient was determined, in this case age. If the patient is aged 54, he/she is assigned an age component score of 7; if he/she is aged 23 then the component score is 1, etc These scores are summed over all components to give the final OASIS score for the patient. The scores (1,4,7,10, and 8 in this example) were obtained through the PSO algorithm. 4

Figure E3. Example of Particle Swarm Optimization (PSO) scores developed for the age component. 5

References 1. Mitchell M : An Introduction to Genetic Algorithms, Third Printing, Cambridge, MA, MIT Press, 1997. 2. Sumathi S, Hamsapriya T, Surekha P : Evolutionary Intelligence, Berlin, Springer Verlag, 2008. 3. Barricelli NA: Symbiogenetic evolution processes realized by artificial methods, Methodos 1957, 143 182. 4. Holland JH: Adaptation in natural and artificial systems: an introductory analysis with applications to biology, control, and artificial intelligence, Ann Arbor, MI,. University of Michigan Press, 1975. 5. Siedlecki W, Sklansky J: A note on genetic algorithms for large scale feature selection, Pattern Recognition Letters 1989, 10:335 347. 6. Eberhart R, Kennedy J: Particle Swarm Optimization, IEEE International Conference on Neural Networks 1995; 4:1942 1948. 7. Kramer AA, Vida M: Evolutionary algorithms in the analysis of physiologic measures and their relationship to critical care outcomes, J Crit Care 2006, 21:357. 6