Interactive Exploration of Fuzzy Clusters using Neighborgrams

Similar documents
Solar in Wetlands. Photo credit: a k e.org/blog/2012/08/15mw solar field near philadelphia.html

AN IDEA BASED ON HONEY BEE SWARM FOR NUMERICAL OPTIMIZATION (TECHNICAL REPORT-TR06, OCTOBER, 2005) Dervis KARABOGA

CONICAL PIPE ENVELOPE FORMATION PROCESS

Customer Portfolio Analysis Using the SOM

Social Rewarding in Wiki Systems Motivating the Community

Fuzzy evaluation to parkour social value research based on AHP improved model

SANITARY ENGINEERING ASSISTANT, 7866 SANITARY ENGINEERING ASSOCIATE, 7870 SANITARY ENGINEER, 7872

SCHEDULING FOR YARD CRANES BASED ON TWO-STAGE HYBRID DYNAMIC PROGRAMMING

Quantifying the First-Flush Phenomenon: Effects of First-Flush on Water Yield and Quality

Improving Software Effort Estimation Using Neuro-Fuzzy Model with SEER-SEM

COMPUTER MODELLING AND FINITE ELEMENT ANALYSIS OF TUBE FORMING OPERATIONS Dr.S.Shamasundar, Manu Mathai, Sachin B M

Surface Water Hydrology

OPTIMIZATION OF FILLER METALS CONSUMPTION IN THE PRODUCTION OF WELDED STEEL STRUCTURES

A biomechanical model for the study of plant morphogenesis: Coleocheate orbicularis, a 2D study species.

of the North American Automotive Industry VOLUME 3: MATERIALS June, 1998 Published by

Two-tier Spatial Modeling of Base Stations in Cellular Networks

THE EFFECT OF SHEAR STRENGTH NORMALISATION ON THE RESPONSE OF PILES IN LATERALLY SPREADING SOILS

DESIGN OF OPTIMAL WATER DISTRIBUTION SYSTEMS

Managing Accounting Information Quality: An Australian Study

Theoretical Investigation on Condensing Characteristics of Air and Oil Vapor Mixtures

Investigation of a Dual-Bed Autothermal Reforming of Methane for Hydrogen Production

Common up Regulated and down regulated Genes for Multiple Cancers using Microarray Gene Expression Analysis

Time of Day Tariff Structure

Production Policies of Perishable Product and Raw Materials

MIAMI-DADE COUNTY PRODUCT CONTROL SECTION DEPARTMENT OF REGULATORY AND ECONOMIC RESOURCES (RER)

Analysis of the Internal Pressure in Tube Hydroforming and Its Experimental Investigation

Lecture 3 Activated sludge and lagoons

Detection of allele-specific methylation through a generalized heterogeneous epigenome model

Optimal Spatial Design of Capacity and Quantity of Rainwater Harvesting Systems for Urban Flood Mitigation

Optimum Design of Pipe Bending Based on High- Frequency Induction Heating Using Dynamic Reverse Moment

Progress towards Modeling Red Tides and Algal Blooms

An Approach to Classify the Risk of Operating Nuclear Power Plants Case Study: Neckarwestheim Unit 1 and Unit 2

The impact of velocity on thermal energy storage performance of tube type thermocline tank

TRAINING NEEDS ANALYSIS and NATIONAL TRAINING STRATEGIES

Quantitative Models to Study the Soil Porosity

ABSTRACT INTRODUCTION

DEW POINT OF THE FLUE GAS OF BOILERS CO-FIRING

Adjoint Modeling to Quantify Stream Flow Changes Due to Aquifer Pumping

SURFACE TENSION OF LIQUID MARBLES, AN EXPERIMENTAL APPROACH

Assessing Emission Allocation in Europe: An Interactive Simulation Approach

Evolving Large Scale UAV Communication System

One-to-one Marketing on the Internet

DEFECT ASSESSMENT ON PIPE USED FOR TRANSPORT OF MIXTURE OF HYDROGEN AND NATURAL GAS

Transcriptome-based distance measures for grouping of germplasm and prediction of hybrid performance in maize

Application of Induction Machine in Wind Power Generation System

Arch. Min. Sci., Vol. 61 (2016), No 4, p

PcBn for cast iron Machining

Global Energy Trade Flows and Constraints on Conventional and Renewable Energies A Computable Modeling Approach

Competitive Analytics of Multi-channel Advertising and Consumer Inertia

A New Wiper Insert Line Now Available for Gold Rhino

Demulsification of Water-in-Oil Emulsions by Microwave Heating Technology

Journal of Retail Analytics

Combining ability analysis for yield and quality traits in indigenous aromatic rice

A Misranking/Masquerading-Proof Mechanism for Online Reputation System

Evaluating the Effectiveness of a Balanced Scorecard System Implemented in a Functional Organization

Cross-Roller Ring Series

Learning and Technology Spillover: Productivity Convergence in Norwegian Salmon Aquaculture

The effect of hitch-hiking on genes linked to a balanced polymorphism in a subdivided population

Self-assessment for the SEPA-compliance of infrastructures

HOBAS NC Line. Make things happen.

A two-level discount model for coordinating a decentralized supply chain considering stochastic price-sensitive demand

DISPLACEMENT-BASED DESIGN OF CONCRETE TILT-UP FRAMES ACCOUNTING FOR FLEXIBLE DIAPHRAGMS

Steam Turbine Seminar -17 Lund University

CONE PERMEAMETER IN-SITU PERMEABILITY MEASUREMENTS WITH DIRECT PUSH TECHNIQUES

Numerical Simulation of Transient 3-D Surface Deformation of a Completely Penetrated GTA Weld

Quick Reference: Amplifier Equations

PROGRAMA BIOEN Projeto 2008/ Simulating Land Use and Agriculture Expansion in Brazil: Food, Energy, Agro industrial and Environmental Impacts

E T HIGH PERFORMANCE MULTI-MATERIAL MILLING. The Mastermill VX range: Exceptional performance and reliability. UROPA OOL

Springback Simulation with Complex Hardening Material Models

Gel Filtration columns and media

Theoretical model and experimental investigation of current density boundary condition for welding arc study

On the Degeneracy of the Water/Wastewater Allocation Problem in Process Plants

The Brazilian ethanol industry

Development of Surrogate Reservoir Models (SRM) For Fast Track Analysis of Complex Reservoirs

How To Grow Bionically vs.

Coal ash ponds: Could they contribute to Alzheimer s disease risk in residential populations?

Experimental Evaluation of the Energy Performance of an Air Vortex Tube when the Inlet Parameters are Varied

Original Research Bioavailability of Lead, Cadmium, and Nickel in Tatra Mountain National Park Soil

GEO-SLOPE International Ltd, Calgary, Alberta, Canada Salt Flow Example

FACTORS INFLUENCING ENERGY CONSUMPTION IN FRUIT AND VEGETABLE PROCESSING PLANTS. Janusz Wojdalski, Bogdan DróŜdŜ, Michał Lubach

MSEC_ICM&P ESTIMATION OF TEMPERATURE DISTRIBUTION IN SILICON DURING MICRO LASER ASSISTED MACHINING

SIMULATION OF NATURAL GAS FLUIDIZED BED USING COMPUTATIONAL FLUID DYNAMICS

Nucleation and crystallisation kinetics of a Na-fluorrichterite based glass by differential scanning calorimetry (DSC)

GenomeLab GeXP. Troubleshooting Guide. A53995AC December 2009

MODELING AND SIMULATION OF A FUEL CELL REFORMER FOR CONTROL APPLICATIONS

Lectures on: Introduction to and fundamentals of discrete dislocations and dislocation dynamics. Theoretical concepts and computational methods

Paper Effects of Wear and Service Conditions on Residual Stresses in Commuter Car Wheels Paper 1-12

* 1 Present address: Department of Mechanical Engin.eering, University of Lagos, Akoka, Yaba, Lagos (Nigeria).

Recombinant Enzymes from NEB

Citation Zeitschrift für Metallkunde. 92(11)

Pass-Through and Consumer Search: An Empirical Analysis. by Timothy J. Richards, Miguel I Gómez and Jun Lee

THE CLIMATE FRAMEWORK FOR UNCERTAINTY, NEGOTIATION AND DISTRIBUTION (FUND), TECHNICAL DESCRIPTION, VERSION 3.9

MODELING THE TAPPING OF SILICON MELT FROM THE SUBMERGED ARC FURNACES

Environmental Externalities in the Presence of Network Effects: Adoption of Low Emission Technologies in the Automobile Market

Development projects, migration and malaria in the GMS

A brief history of the Indian iron and steel industry

KINEMATICS OF RIGID BODIES. y Copyright 1997 by The McGraw-Hill Companies, Inc. All rights reserved. KINEMATICS OF RIGID BODIES

A Model for Dissolution of Lime in Steelmaking Slags

A New Wiper Insert Line, Now Available for RHINORUSH

Transcription:

Inteactive Exploation of Fuzzy Clustes using Neighbogams (joint wok with Michael R. Bethold and David E. Patteson) Bend Wiswedel Bend.Wiswedel@uni-konstanz.de 2.9.24 Bend Wiswedel, Univesity of Konstanz #1 Oveview Motivation: Neighbogams Neighbogam Constuction Clusteing Algoithm Results on Benchmak Data Sets Visualization and Inteaction Conclusion 2.9.24 Bend Wiswedel, Univesity of Konstanz #2 1

Motivation: Neighbogams One-dimensional epesentation of the neighbohoods Can be used to geneate potential cluste candidates Geedy algoithm picks best clustes one by one (supevised clusteing) Easy to visualize and theefoe suitable to inject expet knowledge into clusteing pocess 2.9.24 Bend Wiswedel, Univesity of Konstanz #3 Neighbogam Constuction Neighbogam fo Centoid inceasing distance to Centoid 2.9.24 Bend Wiswedel, Univesity of Konstanz #4 2

Neighbogam Constuction Neighbogam fo Centoid inceasing distance to Centoid 2.9.24 Bend Wiswedel, Univesity of Konstanz #5 Neighbogam Constuction Neighbogam fo Centoid inceasing distance to Centoid 2.9.24 Bend Wiswedel, Univesity of Konstanz #6 3

Neighbogam Constuction Neighbogam fo Centoid inceasing distance to Centoid 2.9.24 Bend Wiswedel, Univesity of Konstanz #7 Neighbogam Constuction Neighbogam fo Centoid inceasing distance to Centoid 2.9.24 Bend Wiswedel, Univesity of Konstanz #8 4

Neighbogam Constuction Neighbogam fo Centoid inceasing distance to Centoid 2.9.24 Bend Wiswedel, Univesity of Konstanz #9 Neighbogam Constuction 2.9.24 Bend Wiswedel, Univesity of Konstanz #1 5

Neighbogams ae one-dimensional mappings of the neighbohoods of paticula pattens (the centoids) built fo all pattens of inteest small o medium size data sets: all pattens lage data sets: pattens of a minoity class (example: high-thoughput-sceening in dug discovey) 2.9.24 Bend Wiswedel, Univesity of Konstanz #11 Neighbogam Cluste Puity fo Cluste: p ( ) # same _ class = # all _ classes ( ) ( ) 1 Fo us: given a puity, detemine thee adii to define cluste shape 2 3 1 2 (p=.7) 3 (p=.7) 2.9.24 Bend Wiswedel, Univesity of Konstanz #12 6

Membeship Functions Rectangula membeship Tapezoidal membeship µ µ 1 1 1 2 3 1 2 3 Tiangula membeship Gaussian membeship µ 1 µ 1 1 2 3 θ 1 2 3 2.9.24 Bend Wiswedel, Univesity of Konstanz #13 Neighbogam Cluste Puity fo Cluste: p ( ) # same _ class = # all _ classes ( ) ( ) 1 Fo us: given a puity, detemine thee adii to define cluste shape 2 3 1 2 (p=.7) 3 (p=.7) 2.9.24 Bend Wiswedel, Univesity of Konstanz #14 7

Neighbogam Cluste Puity fo Cluste: p ( ) # same _ class = # all _ classes ( ) ( ) 1 Fo us: given a puity, detemine thee adii to define cluste shape 3 2 1 2 (p=.5) 3 (p=.5) 2.9.24 Bend Wiswedel, Univesity of Konstanz #15 (Cisp) Clusteing Algoithm Each Neighbogam consideed as potential cluste candidate Geedy choice of best cluste The moe pattens ae coveed the bette Once cluste is chosen, its coveed pattens ae discaded, i.e. emoved fom consideation Loop while thee ae too many uncoveed pattens 2.9.24 Bend Wiswedel, Univesity of Konstanz #16 8

(Cisp) Clusteing Algoithm Best Cluste 2.9.24 Bend Wiswedel, Univesity of Konstanz #17 (Cisp) Clusteing Algoithm Best Cluste 2 nd best Cluste 2.9.24 Bend Wiswedel, Univesity of Konstanz #18 9

(Cisp) Clusteing Algoithm Best Cluste 2 nd best Cluste 3 d best Cluste 2.9.24 Bend Wiswedel, Univesity of Konstanz #19 (Cisp) Clusteing Algoithm Best Cluste 2 nd best Cluste 3 d best Cluste 4 th best Cluste 2.9.24 Bend Wiswedel, Univesity of Konstanz #2 1

(Cisp) Clusteing Algoithm Best Cluste 2 nd best Cluste 3 d best Cluste 4 th best Cluste 2.9.24 Bend Wiswedel, Univesity of Konstanz #21 (Fuzzy) Clusteing Algoithm Cluste coves pattens patly (accoding to membeship function) A patten can only be coveed to a maximal degee of 1. Cluste anking: cumulative degee of coveage 2.9.24 Bend Wiswedel, Univesity of Konstanz #22 11

Results on Benchmak Data - SatImage Data Set - 36 attibutes, 6 classes 4,435 taining pattens, 2, test pattens Puity p = 1. Membeship Function #cluste Eo [%] no class Pedicted [%] Eo [%] Majoity class PNN Eo [%] k-nn MLP c4.5 Rectangle 585 16.65 8.95 15.9 Tapezoidal Tiangula 585 1852 15.5 1.5 6.95 2.65 14.4 9.9 9.8 9.4 13.9 15. Gaussian 2695 8.1. 8.1 2.9.24 Bend Wiswedel, Univesity of Konstanz #23 Results on Benchmak Data - Othe Data Sets - Thee data sets fom the StatLog-Poject Best esults with Neighbogam Clusteing Algoithm usually with Gaussian membeship function Data set #dimensions #classes #pattens NG Eo [%] PNN k-nn Eo [%] MLP c4.5 SatImage 36 6 6,435 8.1 9.8 9.4 13.9 15. Diabetes 8 2 768 26.4 24.9 32.4 24.8 27. Segment 11 7 2,31 3.6 3.5 7.7 5.4 4. 2.9.24 Bend Wiswedel, Univesity of Konstanz #24 12

Visualization and Inteaction (NCI Data) 2.9.24 Bend Wiswedel, Univesity of Konstanz #25 Visualization and Inteaction (NCI Data) 2.9.24 Bend Wiswedel, Univesity of Konstanz #26 13

Visualization and Inteaction (NCI Data) 2.9.24 Bend Wiswedel, Univesity of Konstanz #27 Visualization and Inteaction (NCI Data) 2.9.24 Bend Wiswedel, Univesity of Konstanz #28 14

Conclusion Neighbogams as epesentation of high-dimensional data Applicable to model small o medium size data sets, o inteesting subsets of lage data sets Results of automatic clusteing compaable to state-of-the-at techniques Allows to inject domain knowledge though inteactive, visual clusteing 2.9.24 Bend Wiswedel, Univesity of Konstanz #29 15