Customer Portfolio Analysis Using the SOM

Similar documents
The Research of Risk Management in Two Non-Independent IT System

Inventory Strategy of Dual-Channel Supply Chain from Manufacturer's Perspective

Enterprise Systems and Revenue Recognition: The Missing Link FINANCIAL EXECUTIVE BENCHMARKING SURVEY. Enterprise Systems Edition

Production Planning under Hierarchical Workforce Environment

Journal of Chemical and Pharmaceutical Research, 2014, 6(6): Research Article

Managing Accounting Information Quality: An Australian Study

Optimal Pricing under The Consumer Segmentation in Dual Channel Supply Chain

Re-Designing a Customer Satisfaction and Loyalty Program via Linkage

An Energy-Economy Model to Evaluate the Future Energy Demand-Supply System in Indonesia

THE VALUE OF GRID-SUPPORT PHOTOVOLTAICS IN PROVIDING DISTRIBUTION SYSTEM VOLTAGE SUPPORT 2. OBJECTIVE

Analysis of Preschool Linguistic Education Based on Orienting Problem Algorithm

Fuzzy evaluation to parkour social value research based on AHP improved model

Production Policies of Perishable Product and Raw Materials

Available online at ScienceDirect. Procedia Engineering 122 (2015 )

The 23 rd. Corresponding author: Tel ,

RETRACTED ARTICLE. The Fuzzy Mathematical Evaluation of New Energy Power Generation Performance. Open Access. Baoling Fang * , p 2

Solar in Wetlands. Photo credit: a k e.org/blog/2012/08/15mw solar field near philadelphia.html

AN IDEA BASED ON HONEY BEE SWARM FOR NUMERICAL OPTIMIZATION (TECHNICAL REPORT-TR06, OCTOBER, 2005) Dervis KARABOGA

PERFORMANCE EFFICIENCY EVALUATION OF THE TAIWAN S SHIPPING INDUSTRY: AN APPLICATION OF DATA ENVELOPMENT ANALYSIS

( τα ) = Product of transmittance and absorptance

Phd Program in Transportation. Transport Demand Modeling

The relationship between risk management and strategic performance of the organization (case study: selected university of the Semnan province)

machine design, Vol.6(2014) No.3, ISSN pp

Multi-year Expert Meeting on Transport, Trade Logistics and Trade Facilitation:

Journal of Applied Research and Technology ISSN: Centro de Ciencias Aplicadas y Desarrollo Tecnológico.

Oregon and Washington). The second major study used for this report is "Parks and Recreation Information

Estimating Fish Abundance - Mark Recapture Method

An Evaluation of Negative Selection in an Artificial Immune System for Network Intrusion Detection

A Note on Void Ratio of Fibre-Reinforced Soils

Proposal of a Measuring Method of Customer s Attention and Satisfaction on Services

MULTI-OBJECTIVE OPTIMIZATION OF PLANNING ELECTRICITY GENERATION AND CO 2 MITIGATION STRATEGIES INCLUDING ECONOMIC AND FINANCIAL RISK

Fig. 1; Study Area, Source (PPC), Meteorological Stations (Souda and Airport) and Modeling Domain.

OPTIMIZATION OF FILLER METALS CONSUMPTION IN THE PRODUCTION OF WELDED STEEL STRUCTURES

Optimization of the Brass Melting

Predicting future changes in climate and evaporation by a stepwise regression method

A Research on The Basic Theories of Systematic Risk Transmission in Enterprise Value Chain

Out-of-Merit-Order Dispatch

Annual Review APPLICATIONS AND STATISTICAL PROPERTIES OF MINIMUM SIGNIFICANT DIFFERENCE-BASED CRITERION TESTING IN A TOXICITY TESTING PROGRAM

Market Potential and Distance Decay

ScienceDirect. Measuring solid liquid interfacial energy by grain boundary groove profile method (GBG)

SANITARY ENGINEERING ASSISTANT, 7866 SANITARY ENGINEERING ASSOCIATE, 7870 SANITARY ENGINEER, 7872

Theoretical Investigation on Condensing Characteristics of Air and Oil Vapor Mixtures

The impact of technology innovation on green chemistry and lowcarbon

THE EFFECT OF SHEAR STRENGTH NORMALISATION ON THE RESPONSE OF PILES IN LATERALLY SPREADING SOILS

Evaluating the Effectiveness of a Balanced Scorecard System Implemented in a Functional Organization

Interactive Exploration of Fuzzy Clusters using Neighborgrams

Analysis of Electric Stress in High Voltage Cables Containing Voids

SCHEDULING FOR YARD CRANES BASED ON TWO-STAGE HYBRID DYNAMIC PROGRAMMING

The Added-Value Metric - A Complementary Performance Measure for Six Sigma and Lean Production

Blind Multi-Channel Estimation of Arterial Input Function in Dynamic Contrast-Enhanced MRI

Water Management of Heat Pump System for Hot Water Supply in a Medium Size Hospital

Business Law Curriculum Coordination at Berkeley Law (Boalt Hall) For last several (~10) years

Quantifying Relative Autonomy in Multiagent Interaction

MANAGEMENTS' ROLE IN DEVELOPMENT OF INFORMATION SYSTEM

CASE-BASED REASONING MODEL OF THE FISH DISEASE DIAGNOSIS

The impact of velocity on thermal energy storage performance of tube type thermocline tank

Numerical Simulation of Gas Tungsten Arc Welding in Different Gaseous Atmospheres. Tashiro, Shinichi; Tanaka, Manabu; Ushio, Masao

Mobile Application Testing in Pakistan: A Survey

A Shaking Table Test Design on Seismic Strengthened Brick Masonry Structures

FOR IMMEDIATE RELEASE TUESDAY, JULY 3, 2012, AT 10:00 A.M. EDT

Supply Chain Inventory Models with Refurbishment

FDSS Training Manual

Reporting Checklist for Nature Neuroscience

ASSESSMENT OF MATERIAL BEHAVIOUR AND STRUCTURAL INTEGRITY OF ENGINEERING STRUCTURES BASED ON R6 PROCEDURE

Surface Water Hydrology

GHG Emissions Reduction by Improving Efficiency of Utilities Transport and Use and Cross-Sectorial Energy Integration

Design of customer evaluation model in the catering industry supply chain ecosystem

FOR IMMEDIATE RELEASE MONDAY, DECEMBER 5, 2011, AT 10:00 A.M. EST

Precision Management of Nutrients, Water, & Seeds: A Holistic Approach

Research Article Integrated Location-Production-Distribution Planning in a Multiproducts Supply Chain Network Design Model

Data-Driven Simulation-Enhanced Optimization of People-Based Print Production Service

DECOMPOSITION ANALYSIS OF GLOBAL VALUE CHAIN S IMPACT ON THAI ECONOMY

Application of Induction Machine in Wind Power Generation System

Complication Prevention of Posterior Capsular Rupture using Image Processing Techniques

QUANTITATIVE RESEARCH REGARDING PERFORMANCE MEASURES FOR INTERMODAL FREIGHT TRANSPORTATION

Applications of the Vehicle Routing Problem with Trailers and Transshipments

DESIGN OF OPTIMAL WATER DISTRIBUTION SYSTEMS

Distribution and Risk Assessment of Soil Heavy Metals in the Main Grain Producing Area of China

4 STRUCTURAL MODELLING

Concentric Induction Heating for Dismantlable Adhesion Method

Reliability and Availability Analysis Methodology for Power System Protection Schemes

Research Article Operational Efficiency Evaluation of Iron Ore Logistics at the Ports of Bohai Bay in China: Based on the PCA-DEA Model

THE FUTURE IS HERE BUSINESS PLANNING CONFERENCE REGISTRATION FORM 2018 BUSINESS PLANNING CONFERENCE AGENDA

Interindustry Wage Differentials, Technology. Adoption, and Job Polarization

CARPATHIAN JOURNAL OF FOOD SCIENCE AND TECHNOLOGY

TRAINING NEEDS ANALYSIS and NATIONAL TRAINING STRATEGIES

Improving Software Effort Estimation Using Neuro-Fuzzy Model with SEER-SEM

FUZZY LOGIC BASED UAV ALLOCATION AND COORDINATION

SIMILARITY SOLUTION ON UNSTEADY AXI-SYMMETRIC VISCOUS BOUNDARY LAYER FLOW

Numerical Analysis of Current Attachment at Thermionic Cathode for Gas Tungsten Arc at Atmospheric Pressure

On Postponed Production Optimization for Two-stage Supply Chain in Mass Customization

Social Rewarding in Wiki Systems Motivating the Community

Self-assessment for the SEPA-compliance of infrastructures

Citation 京都大学経済学部 Working Paper (1993), 21.

Resource manager for an autonomous coordinated team of UAVs

Catalogue no X. Wholesale Trade. August 2008

Investigation of a Dual-Bed Autothermal Reforming of Methane for Hydrogen Production

SURVEY OF MOVING CAR OBSERVER AND MKJI 1997 APPLICATIONS AGAINST THE TRAFFIC FLOW SECTION IN MERAUKE REGENCY

ADVANCEMENT OF BRIDGE HEALTH MONITORING BASED ON DISTRIBUTED FIBER OPTIC SENSORS

Transcription:

Association fo Infomation Systems AIS Electonic Libay (AISeL) ACIS 2008 Poceedings Austalasian (ACIS) 2008 Custome Potfolio Analysis Using the SOM Annika H. Holmbom Tuku Cente fo Compute Science (TUCS) Åbo Akademi Univesity Tuku, Finland, annika.h.holmbom@abo.fi Tomas Eklund Institute fo Integated and Intelligent Systems (IIIS) Giffith Univesity Bisbane, Austalia, tomas.eklund@abo.fi Babo Back Depatment of Infomation Technologies Åbo Akademi Univesity Tuku, Finland, babo.back@abo.fi Follow this and additional woks at: http://aisel.aisnet.og/acis2008 Recommended Citation Holmbom, Annika H.; Eklund, Tomas; and Back, Babo, "Custome Potfolio Analysis Using the SOM" (2008). ACIS 2008 Poceedings. 5. http://aisel.aisnet.og/acis2008/5 This mateial is bought to you by the Austalasian (ACIS) at AIS Electonic Libay (AISeL). It has been accepted fo inclusion in ACIS 2008 Poceedings by an authoized administato of AIS Electonic Libay (AISeL). Fo moe infomation, please contact elibay@aisnet.og.

19 th Austalasian Confeence on Infomation Systems Custome Potfolio Analysis Using the SOM 3-5 Dec 2008, Chistchuch Holmbom, et al. Custome Potfolio Analysis Using the SOM Annika H. Holmbom Tuku Cente fo Compute Science (TUCS) Åbo Akademi Univesity Tuku, Finland Email: annika.h.holmbom@abo.fi Tomas Eklund Visiting eseache, Institute fo Integated and Intelligent Systems (IIIS) Giffith Univesity Bisbane, Austalia Email: tomas.eklund@abo.fi Babo Back Depatment of Infomation Technologies Åbo Akademi Univesity Tuku, Finland Email: babo.back@abo.fi Abstact In ode to compete fo pofitable customes, companies ae looking to add value using Custome Relationship Management (CRM). One subset of CRM is custome segmentation, which is the pocess of dividing customes into goups based upon common featues o needs. Segmentation methods can be used fo custome potfolio analysis (CPA), the pocess of analyzing the pofitability of customes. This study was made fo a case oganization, who wanted to identify thei pofitable and unpofitable customes, in ode to gain knowledge on how to develop thei maketing stategies. Data about the customes wee gatheed fom the case oganization s own database. The Self-Oganizing Map (SOM) was used to divide the customes into segments, which wee then analyzed in light of poduct sales infomation. Keywods Custome Relationship Management (CRM), Custome Potfolio Analysis (CPA), Data-diven maket segmentation, Self-Oganizing Map (SOM) INTRODUCTION Custome elationship management (CRM) is an impotant topic of management today. The objective of CRM is to integate sales, maketing and custome cae sevice in ode to add value fo both the company and its customes (Chalmeta 2006; Datta 1996; Heinich 2005). CRM fist emeged in 1993 and has developed apidly in ecent yeas thanks to advances in infomation technology ( Buttle 2004; Rygielski et al. 2002), to become the impotant function that it is in today s companies. One of the most impotant tasks within CRM is custome segmentation, the pocess of identifying and gouping customes with simila pofiles o equiements (Lingas et al. 2005). The key element in CRM and custome segmentation is oveall infomation about customes. Today, data about customes ae eadily available though ERPs, copoate data waehouses, and the Intenet. Data can also be puchased fom othe companies, which has lately fomed into a new categoy of business (Buttle 2004; Rygielski et al. 2002). The poblem is that the amount of infomation available fo segmentation is huge and can be vey challenging to deal with because of issues such as missing data, non-unifom distibutions, eos, etc. The extaction of infomation fom lage databases is, theefoe, often pefomed using data mining methods ( Bey and Linoff 2004; Beson et al. 2000; Famili et al. 1997; Rygielski et al. 2002; Shaw et al. 2001). The pupose of the pape is to illustate how the Self-Oganizing Map (SOM) can be used fo one categoy of CRM, custome potfolio analysis (CPA). The wok builds upon the eseach initiated in Holmbom (2007) in which a model fo segmentation of custome data was built. The model was based upon custome data povided by a case company, and the Self-Oganizing Map is used to constuct the model. The model was face validated 412

19 th Austalasian Confeence on Infomation Systems Custome Potfolio Analysis Using the SOM 3-5 Dec 2008, Chistchuch Holmbom, et al. by expets fom the sales depatment of the case oganization. The model could potentially be used to adjust maketing effots to incease the pofitability of customes. METHODOLOGY Custome Segmentation Custome segmentation is an example of analytical CRM, i.e., the use of analytical tools to study custome data (Paas and Kuijlen 2001). Custome segmentation is the pocess of gouping customes into subgoups with simila behavio o needs, in ode to bette seve o taget the customes (Buttle 2004; Lingas et al. 2005). The identified segments can then be moe effectively tageted with suitable maketing stategies (Fank et al. 1972, p. 26; Wedel and Kamakua 1999, p. 5). Custome segmentation is also used to identify unpofitable and pofitable customes in the custome base, as well as custome elationships with development potential. This is efeed to as Custome Potfolio Analysis, o CPA (Buttle 2004). CPA is impotant as seveal studies have found that the 20/80 ule holds fo custome pofitability as well; 20% of customes account fo 80% of pofits, and vice vesa (Kim et al. 2006; Pak and Baik 2006). Although CPA is elated to segmentation and can be seen as a subset of it, the pupose is diffeent and many of the methods used ae unique (Teho and Halinen 2007). CPA can beneficially be applied to analyze segments identified using segmentation methods. Thee ae a lage vaiety of diffeent methods available fo CPA. Howeve, although thee is a geat wealth of theoetical liteatue suounding CPA available, vey little liteatue appeas to show how CPA is actually being used by companies today (Teho and Halinen 2007). Much of the liteatue concens mathematical optimization models, such as potfolio theoy (e.g., Tunbull 1990) and custome lifetime value (e.g., Kim et al. 2006). These methods geneally view the custome base in the same way as a potfolio of investments, to be managed using the same methods. Thee ae two main bases fo segmentation, i.e., demogaphic data, such as socioeconomic and lifestyle measues, and poduct specific measues, such as poduct usage, custome band attitudes, band pefeences, benefits sought and esponse sensitivity to diffeent maketing campaigns. Demogaphic data ae the most commonly used base fo segmentation (Fank et al. 1972; Tsai and Chiu 2004; Wedel and Kamakua 1999). Segmentation can also be divided into two majo goups based upon the appoach used: maket-diven and data-diven segmentation. Maket-diven segmentation uses data to divide customes into segments. These segments ae befoehand set accoding to chaacteistics that descibe a specified custome pofile, e.g., one that has been detemined to be pofitable. Data-diven segmentation is pefomed on actual custome data, e.g., the shopping behavio of a custome (Beson et al. 2000). As fo CPA, thee ae a lage vaiety of methods available fo segmentation. Many commonly used segmentation methods belong to the family of clusteing appoaches. Most of the methods in this aea ae statistical tools, such as k-means clusteing and hieachical clusteing methods. Data mining appoaches, such as sequence analysis, maket basket analysis and neual netwoks ae also employed. Othe commonly used appoaches ae decision tee-elated appoaches (e.g., CHAID) and fuzzy clusteing appoaches (e.g., Fuzzy FCM). In this study, a data-diven exploatoy CPA, based upon demogaphic data and coupled with poduct sales infomation, will be pefomed. The Data The data wee povided by a case company that sells poducts anging fom simple peiodicals to advanced consulting sevices, to othe companies (B2B). The company wanted to pefom a custome potfolio analysis in ode to detemine which of its customes wee pofitable and woth developing its elationship with, and convesely, which customes wee bette let go of. In addition, the company wanted to detemine which goups of customes puchased which poducts. Oveall, the stategic goal was to ceate a tool to be used by the sales depatment in ode to adjust company maketing pactices, i.e., to detemine suitable maketing effot levels fo diffeent, peviously unknown, categoies of customes. The data wee extacted fom the case company s data waehouse and consist of data about customes and thei puchasing behavio. The customes ae companies fom diffeent lines of business, e.g., sevice, constuction, industial, wholesale, and etail. The data oiginate fom the customes annual epots and the case company s own data waehouse. They contain desciptive categoies, descibing the attibutes of the customes, as well as sales infomation concening the majo poducts. Based upon a pilot test of the data and a eview in coopeation with the case company, a numbe of small and vey lage customes (appeaing as outlies in the esults) wee emoved, and some poduct categoies with small and infequent puchases wee meged. The motivation fo doing this was that the lagest customes ae aleady individually seved by an own sales epesentative, and the smallest customes wee usually one-time puchases. The desciptive categoies consisted of: 413

19 th Austalasian Confeence on Infomation Systems Custome Potfolio Analysis Using the SOM 3-5 Dec 2008, Chistchuch Holmbom, et al. Risk facto, which is an intenally calculated measue of potential financial losses Company age Solvency, which was calculated fom the financial statement Tunove Change in tunove (%), compaed to the pevious yea Balance sheet total, which seves as a measue of company size Retun on equity (ROE), which was calculated fom the financial statement. The poduct categoies consisted of 18 diffeent poducts, labeled poducts A-R. The poducts ae geneally speaking infomation sevice poducts, whee poduct I (a significant consulting sevice) is the most expensive poduct and poduct L (a simple filteed data poduct) is the cheapest one. Poduct O stands fo oveall puchases of poducts and poduct R fo othe poducts (one-time analyses and othe poducts difficult to categoize). The data collected wee fo the peiod of 2002 to 2006. The data set contained 1,841 customes, i.e. 9,205 ows of data. 12.8 % of the customes had incomplete desciptive data, i.e., 3.6 % of the data values wee missing. The missing data wee not consideed a poblem, as the SOM is able to deal with small amounts of missing data (Bigus 1996). The SOM Atificial neual netwoks (ANNs) have been widely applied to vaious business poblems ( Smith and Gupta 2002; Vellido et al. 1999b). ANNs ae commonly divided into two main categoies: supevised and unsupevised leaning appoaches (Haykin 1999). Supevised netwoks lean pattens by using taget outcomes, and ae thus most often used fo classification tasks, i.e., whee classes ae pedetemined. Maket-diven segmentation would be pefomed using supevised leaning ANNs. Unsupevised leaning is used fo exploatoy analysis, clusteing, and visualization (Kohonen 1998). Kohonen s Self-Oganizing Map (SOM) is the most commonly used unsupevised ANN. The SOM is a twolaye feedfowad netwok, in which each neuon leans to ecognize a specific input patten (Kohonen 2001). Each neuon is epesented by a pototype vecto, i.e. an n-dimensional weight vecto. The algoithm is basically a two-step pocess; in the fist step, the best matching neuon (BMU, best matching unit) fo an input data ow is located on the map, and secondly, it and its suounding neuons within a cetain neighbohood adius ae tuned to bette match (i.e., lean fom) the input data, based upon a leaning ate facto. The pocess is epeated until a cetain stopping citeion is eached, fo example, the taining length. The esult of the taining pocess is a visual clusteing that shows similaities and dissimilaities in the data (Kohonen 2001). Essentially, the SOM is a nonlinea pojection technique that displays high-dimensional data on a twodimensional gid, by peseving the elationships (o topology) in the data but not the actual distances (Deboeck and Kohonen 1998). Commonly, the SOM is visualized using the U-matix (Unified Distance Matix) of the map, which displays the Euclidean distances between neuons in shades of colo (Ultsch 1993). Fom the pespective of segmentation-based CPA, the SOM has seveal advantages. Compaed to mathematical optimization methods and most statistical appoaches, the main advantage of the SOM is that it is a highly visual method. This makes it simple to pesent and explain esults to business decision makes. Also, judging the esults is moe intuitive fo a non-mathematically inclined audience. The SOM is also vey obust, equiing vey little pepocessing of the data, and unlike most statistical appoaches, is non-paametic. The SOM is an exploative tool, meaning that vey little a pioi knowledge is equied, and it is possible to uncove unexpected pattens in data. Decision tees ae simple to use and highly visual appoaches, but coectly deciding the split lines is impeative (Pyle 1999), and they ae unsuitable to exploatoy analysis whee no pedefined classes exist. Regession appoaches and classification-based neual netwoks ae also unable to deal with data when pedefined classes ae not available. The SOM has been widely applied in finance, economy and maketing (Kaski et al. 1998; Kohonen 1998; Oja et al. 2003). Fo example, the SOM has been used fo financial benchmaking (Back et al. 1998; Eklund et al. 2003), maco-economic analysis (Kaski and Kohonen 1996; Länsiluoto 2007), and bankuptcy pediction (Back et al. 1995; Kiviluoto 1998; Matín-del-Bío and Seano-Cinca 1993). Howeve, egadless of its obvious benefits, the SOM has not been widely applied in custome segmentation tasks. Examples include Rushmeie et al. (1997), who used the SOM to visualize demogaphic custome segments fo maketing puposes, Vellido et al. (1999a), who used the SOM fo demogaphic segmentation of online customes, Lee et al. (2004; 2005), who used the SOM fo demogaphic segmentation of online games, and Lingas et al. (2005), who used the SOM fo tempoal analysis of supemaket customes duing a peiod of 24 hous. This study diffes fom the pevious 414

19 th Austalasian Confeence on Infomation Systems Custome Potfolio Analysis Using the SOM 3-5 Dec 2008, Chistchuch Holmbom, et al. ones in that the SOM is hee used fo CPA, based upon demogaphic infomation as well as poduct sales infomation, and fo multiple yeas of data. To ou knowledge, this has not been done peviously. TRAINING THE MODEL Viscovey SOMine 4.0 (http://www.viscovey.net/) was used to tain the maps in this study. SOMine is based upon the batch-som taining algoithm (Kohonen 2001) and also uses a stepwise inceasing map size duing the taining pocess, which makes it a vey efficient implementation of the SOM algoithm (Deboeck 1998). In addition, SOMine is vey use fiendly and includes a numbe of advanced data pe-pocessing and analysis tools, such as automated clusteing of the map based upon Wad s hieachical clusteing method. The demogaphic data of the companies (isk facto, age, solvency, tunove, change in tunove%, balance sheet total, and ROE) wee fist used to ceate one map. The esults of the demogaphic segmentation wee then matched with the sales infomation fo each poduct. Even though the SOM is faily toleant towads noisy o missing data (Bigus 1996; Smith and Gupta 2002), data pe-pocessing is an impotant pat of the data-mining task. Pe-pocessing efes to the task of dealing with data quality issues such as missing, eoneous, o outlie data ( Beson et al. 2000; Famili et al. 1997; Hand et al. 2001; Pyle 1999). In this application, sigmoid (o logistic) tansfomation (Bishop 1995) was used to deal with outlie data. The sigmoid tansfomation was used because it emphasizes the cente input values while educing the influence of exteme input values (Bishop 1995; Laose 2005). Vaiance scaling was futhe used to make the vaiables compaable. Geneally speaking, the size of the map is dependent upon the pupose of the application. A lage hexagonal map is good fo visualization (moe accuate on the individual ecod level), wheeas a small map is moe suitable fo clusteing (squeezes data into a smalle numbe of goups) (Desmet 2001; Kohonen 2001). In this case, a map size of 700 nodes was selected as a balance between clusteing and visualization since the goups wee not expected to be vey homogeneous and we wanted to be able to accuately judge the inta-cluste diffeences. As the softwae uses the batch SOM algoithm, the leaning ate does not need to be specified (Deboeck 1998), and the only othe paamete equied is the tension. The tension is essentially a value fo the neighbohood adius in the final taining stage, whee a small tension esults in high local detail (accuacy), while a high tension has an aveaging (smoothing) effect on the map. In this case, the default value 0.5 (aveage) was used. The neighbohood function is always Gaussian. Although the U-Matix of the SOM can be manually intepeted to identify the clustes, two-stage clusteing (Vesanto and Alhoniemi 2000) is an easie and moe objective method of identifying the clustes on the map. In two-stage clusteing, the neuons on the map ae clusteed based upon thei Euclidean distances, using a suitable clusteing algoithm. In this case, Wad s hieachical clusteing method, included in the SOMine softwae, was used to identify the clustes on the final map. The final map is displayed in Figue 1. The clusteing of the map esulted in ten clustes of vaious sizes, labeled C1-C10. The colo of the cluste only signifies cluste membeship, and does not imply any value. 415

19 th Austalasian Confeence on Infomation Systems Custome Potfolio Analysis Using the SOM 3-5 Dec 2008, Chistchuch Holmbom, et al. Figue 1: A U-matix of the segmentation esults, using the custome data fom yeas 2002-2006. The map was ceated accoding to the vaiables fom the desciptive categoies In ode to intepet the map, and in paticula the chaacteistics of each cluste, the component planes (displayed in Figue 2) of the map must be used. The component planes show the distibution of values acoss the map, accoding to one vaiable at a time. The values accoding to one vaiable ae displayed by the colo of the neuon, whee wam colos (ed, oange, and yellow) illustate high values and cool colos (blue) illustate low values. The appoximate values ae indicated by the scale unde each component plane. The map is intepeted by eading the component planes fo each cluste. Fo example, Cluste 6 displays medium to high values in solvency and ROE, and low values in age, tunove, and balance sheet total. Cluste 6 also shows vaying isk factos, fom low to high. We can conclude that these ae faily young and small companies, although vey pofitable. We can also see that the isk facto is extemely lage in segments C2 and C5, which means that these segments contain less eliable companies. The oldest companies ae found in segments C4, C5 and C7. Figue 2: The component planes of the map, showing the values accoding to one component at a time 416

19 th Austalasian Confeence on Infomation Systems Custome Potfolio Analysis Using the SOM 3-5 Dec 2008, Chistchuch Holmbom, et al. ANALYSIS OF THE RESULTS The esults of the custome segmentation ae summaized in Table 1, which shows the sizes and distinguishing featues of each of the clustes. Clustes C1, C2 and C3 ae the lagest ones. Table 1. Division of customes into clustes accoding to the segmentation pesented in Figue 1 Clustes Customes % Distinguishing featue(s) Puchased poducts Cluste 1 1,659 20.68 No specific attibute A-R, except M Cluste 2 1,689 21.05 High isk facto A-R, except M Cluste 3 1,699 21.17 Highest solvency A-R, except M Cluste 4 1,139 14.19 Oldest companies, high solvency A-R, except M Cluste 5 640 7.98 Lage companies, high solvency A-R, except M & Q Cluste 6 565 7.04 High isk facto, good solvency, vey high pofitability A-R, except M Cluste 7 398 4.96 Both old and young companies, good tunove A-R, except M Cluste 8 91 1.13 Lagest tunove, lage balance sheet A-R, except M & Q total Cluste 9 80 1.00 Lage balance sheet total A-R, except L & M Cluste 10 64 0.80 Lagest change in tunove (%) A-R, except M & Q The clustes identified ae as follows: Cluste 1: an aveage goup with no specifically identifying chaacteistics. Risk facto, age, tunove, and balance sheet total ae low, and solvency is medium to high. Retun on equity is good on aveage. One of the thee lagest goups in tems of numbe of customes. Cluste 2: exhibits a consideably highe isk facto, lowe solvency, and lowe etun on equity than in Cluste 1. This goup also contains the customes with the lowest etun on equity, as well as the companies with the lowest solvency. One of the thee lagest goups in tems of numbe of customes. Cluste 3: simila to Cluste 1 except fo a consideably highe solvency. The aveage company age is somewhat highe, although isk facto seems to be simila to that of Cluste 1. One of the thee lagest goups in tems of numbe of customes. Cluste 4: contains the oldest companies in the dataset, and geneally exhibits a high solvency and good pofitability. Cluste 5: is a mid-size cluste containing lage than aveage companies. Solvency is good, and the companies ae faily new. Pofitability is aveage. Cluste 6: a mid-size cluste that contains the most pofitable companies in the dataset. In geneal small, gowing companies. Nealy half of the cluste displays a vey high isk facto, but some companies ae also vey solvent. Cluste 7: a mid-size cluste that contains faily lage companies in tems of tunove and total assets. Solvency is good on aveage, and the cluste contains a mix of old and new companies. Cluste 8: is one of the thee small clustes and contains the lagest companies in tems of assets and tunove. The companies ae solvent and faily pofitable, and thei isk facto is vey low. Company age is above aveage. Cluste 9: is anothe small goup of lage companies. This cluste diffes fom Cluste 8 in that the companies ae newe and tunove is lowe. Cluste 10: is the smallest and final cluste identified. It contains apidly gowing companies that ae faily pofitable and solvent, and have a faily low isk facto. Afte the clustes wee identified, the next step was to compae the sales infomation fo each poduct categoy to the ceated segments. The full table can be found in Appendix 1. The colo scale in Appendix 1 visualizes 417

19 th Austalasian Confeence on Infomation Systems Custome Potfolio Analysis Using the SOM 3-5 Dec 2008, Chistchuch Holmbom, et al. how the clustes can be divided into goups consisting of the lagest, aveage, and smallest customes accoding to the sales infomation. If the amount of sales wok expended fo each of the segments is the same, the division of the custome segments can be extended to descibe pofitable, aveage and non-pofitable customes. Lagest customes in tems of sales The lagest customes ae located in clustes C7 and C8. 6.1 % of the companies in the custome base belong to this goup. These companies have the lagest aveage sales figues fo nealy evey poduct. In the SOM map pesented in Figue 1, these clustes ae located in the lowe ight cone of the map. Thee ae both young and old companies that possess a lage tunove in Cluste 7. Some of the younge companies in this segment have a slightly highe isk facto, and some of the customes have a high solvency. The lage balance sheet total fo the customes in Cluste 8 is a sign that these companies ae lage in size. These companies have the lagest tunove and a high solvency. Aveage customes in tems of sales Accoding to the segmentation model, the aveage customes in tems of sales ae located in clustes C1, C4, C6, C9 and C10. They constitute 43.7 % of the total custome base. The companies in Cluste 1 do not have a dominating desciptive component. A small pat of the customes in this cluste has an inceased isk facto, anothe pat is slightly olde, and a thid pat has a good solvency. The oldest customes ae located in cluste C4. These have a high solvency, and some of them have an inceased isk facto. The companies in Cluste 6 have a high isk facto, but they possess a high solvency and the lagest etun on equity. The companies in Cluste 9 ae lage in size, as they have a lage balance sheet total. The customes in Cluste 10 ae gowing companies. Also, the specific company that makes the highest oveall puchases belongs to this cluste. The ode of pioity fo these aveage clustes, accoding to the infomation gained fom the segmentation, would be as follows: clustes C10 and C9 (made most puchases), Cluste 4 (aveage) and clustes C1 and C6 (made least puchases). Accoding to the model, Cluste 9 is vey simila to the clustes with the companies who conducted most puchases, i.e., clustes C7 and C8. This would indicate the possibility that futue good customes could be found in Cluste 9. Similaly, the pooest customes in this goup ae located in Cluste 6, which is vey diffeent fom the best pefoming clustes. Pooest customes in tems of sales The companies in clustes C2, C3 and C5 ae the pooest customes, i.e., they puchase the least amount of poducts. Thei shae of the custome base is 50.3 % and these customes, theefoe, constitute the lagest goup. Accoding to the model, the companies in Cluste 2 have a high isk facto. Howeve, many of the single companies that have puchased the lagest amounts of a specific poduct ae located in this cluste. The customes in cluste C3 have the highest solvency. A small shae of these companies has a slightly inceased isk facto, and anothe shae is slightly olde. The companies in Cluste 5 ae olde with a high isk facto. A common facto amongst the pooest customes is the high isk facto, which means that they ae not eliable customes. CONCLUSIONS AND FUTURE RESEARCH In this pape, the use of the SOM fo custome potfolio analysis has been illustated. A custome segmentation based upon demogaphic data was pefomed using the SOM, identifying ten clustes of customes displaying diffeent demogaphic chaacteistics. The esulting clustes wee then coupled with sales data, and a custome potfolio analysis was pefomed in ode to identify pofitable and unpofitable customes. The esulting model was face validated by expets fom the sales depatment of the case oganization. The sales depatment of the case oganization could potentially develop its maketing stategies based on the esults of this wok. Thee ae seveal inteesting topics of eseach that should be pusued in the futue. Fistly, the model cannot identify a univesal demogaphic featue o set of featues that can pedict custome pofitability, although custome size gives an indication of puchase potential. Futhe eseach should be conducted to see if the addition of othe demogaphic featues could incease the pedictive pefomance of the model. Secondly, pedicting puchase potential is potentially a valuable addition to the model. This could be done using statistical models. Thidly, pedicting the level of effot equied to push a custome to a moe pofitable level of elationship should be eseached, e.g., using Makov chain analysis. Finally, maket basket analysis was peliminaily pefomed in Holmbom (2007), and should be futhe developed. 418

19 th Austalasian Confeence on Infomation Systems Custome Potfolio Analysis Using the SOM 3-5 Dec 2008, Chistchuch Holmbom, et al. REFERENCES Back, B., See, K., and Vanhaanta, H. 1998. "Managing Complexity in Lage Data Bases using Self-Oganizing Maps," Accounting Management and Infomation Technologies (8:4), Octobe, pp 191-210. Back, B., Oosteom, G., See, K., and van Wezel, M. 1995. "Intelligent Infomation Systems within Business: Bankuptcy Pedictions using Neual Netwoks," in The 3d Euopean Confeence on Infomation Systems (ECIS'95), June 1-3, Athens, Geece: pp 99-111. Bey, M. J. A., and Linoff, G. S. 2004. Data Mining Techniques: Fo Maketing, Sales, and Custome Relationship Management. Indianapolis, Indiana: Wiley Publishing Inc. Beson, A., Smith, S., and Thealing, K. 2000. Building Data Mining Applications fo CRM. USA: McGaw- Hill Companies Inc. Bigus, J. P. 1996. Data Mining with Neual Netwoks: Solving Business Poblems fom Application Development to Decision Suppot. New Yok, NY: The McGaw-Hill Companies Inc. Bishop, C. M. 1995. Neual Netwoks fo Patten Recognition. Avon: Oxfod Univesity Pess. Buttle, F. 2004. Custome Relationship Management Concepts and Tools. Oxfod: Buttewoth-Heinemann. Chalmeta, R. 2006. "Methodology fo Custome Relationship Management," The Jounal of Systems and Softwae (79:7), July, pp 1015-1024. Datta, Y. 1996. "Maket Segmentation: An Integated Famewok," Long Range Planning (29:6), Decembe, pp 797-811. Deboeck, G. J. 1998. "Softwae Tools fo Self-Oganizing Maps," In Visual Exploations in Finance using Self- Oganizing Maps, Deboeck, G. J. and Kohonen, T. (eds.), Belin: Spinge-Velag, pp 179-194. Deboeck, G. J., and Kohonen, T. 1998. Visual Exploations in Finance with Self-Oganizing Maps. Belin: Spinge-Velag. Desmet, P. 2001. "Buying Behavio Study with Basket Analysis: Pe-Clusteing with a Kohonen Map," Euopean Jounal of Economic and Social Systems (15:2), pp 17-30. Eklund, T., Back, B., Vanhaanta, H., and Visa, A. 2003. "Using the Self-Oganizing Map as a Visualization Tool in Financial Benchmaking," Infomation Visualization (2:3), Septembe, pp 171-181. Famili, A., Shen, W.-M., Webe, R., and Simoudis, E. 1997. "Data Pepocessing and Intelligent Data Analysis," Intelligent Data Analysis (1:1), pp 3-23. Fank, R. E., Massy, W. F., and Wind, Y. 1972. Maket Segmentation. Englewood Cliffs, New Jesey: Penticehall Inc. Hand, D. J., Mannila, H., and Smyth, P. 2001. Pinciples of Data Mining. Boston, USA: MIT Pess, 2001. Haykin, S. 1999. Neual Netwoks - A Compehensive Foundation. Uppe Saddle Rive, N.J.: Pentice Hall Intenational, Inc. Heinich, B. 2005. "Tansfoming Stategic Goals of CRM into Pocess Goals and Activities," Business Pocess Management Jounal (11:6), pp 709-723. Holmbom, A. H. 2007. Identifying Custome Segments using the Self-Oganizing Map, Unpublished Maste's thesis. Tuku, Finland: Abo Akademi Univesity. Kaski, S., Kangas, J., and Kohonen, T. 1998. "Bibliogaphy of Self-Oganizing Map (SOM) Papes 1981-1997," Neual Computing Suveys (1), pp 102-350. Kaski, S., and Kohonen, T. 1996. "Exploatoy Data Analysis by the Self-Oganizing Map: Stuctues of Welfae and Povety in the Wold," In Neual Netwoks in Financial Engineeing, Apostolos, P. N. R., Abu- Mostafa, Y., Moody, J. and Weigend, A. (eds.), Singapoe: Wold Scientific, pp 498-507. Kim, S., Jung, T., Suh, E., and Hwang, H. 2006. "Custome Segmentation and Stategy Development Based on Custome Lifetime Value: A Case Study," Expet Systems with Applications (31:1), July, pp 101-107. Kiviluoto, K. 1998. "Pedicting Bankuptcies with the Self-Oganizing Map," Neuocomputing (21:1-3), Novembe, pp 191-201. Kohonen, T. 2001. Self-Oganizing Maps. Belin: Spinge-Velag. 419

19 th Austalasian Confeence on Infomation Systems Custome Potfolio Analysis Using the SOM 3-5 Dec 2008, Chistchuch Holmbom, et al. Kohonen, T. 1998. "The SOM Methodology," In Visual Exploations in Finance: With Self-Oganizing Maps, Deboeck, G. J. and Kohonen, T. (eds.), Belin: Spinge Velag, pp 159-167. Länsiluoto, A. 2007. "Suitability of Self-Oganising Maps fo Analysing a Maco-Envionment an Empiical Field Suvey," Intenational Jounal of Business Infomation Systems (2:2), Decembe, pp 149-161. Laose, D. T. 2005. Discoveing Knowledge in Data. an Intoduction to Data Mining. Hoboken, NJ: John Wiley & Sons Inc. Lee, S. C., Suh, Y. H., Kim, J. K., and Lee, K. J. 2004. "A Coss-National Maket Segmentation of Online Game Industy using SOM," Expet Systems with Applications (27:4), Novembe, pp 559-570. Lee, S. C., Xiang, J. Y., and Jing, L. B. 2005. "Who ae the Taget Customes in Chinese Online Game Maket?: Segmentation with a Two-Step Appoach," in Poceedings of the 2005 IEEE Intenational Confeence on e- Business Engineeing, Beijing, China, IEEE, pp 736-743. Lingas, P., Hogo, M., Snoek, M., and West, C. 2005. "Tempoal Analysis of Clustes of Supemaket Customes: Conventional Vesus Inteval Set Appoach," Infomation Sciences (172:1-2), June, pp 215-240. Matín-del-Bío, B., and Seano-Cinca, C. 1993. "Self-Oganizing Neual Netwoks fo the Analysis and Repesentation of Data: Some Financial Cases," Neual Computing and Applications (1:2), Septembe, pp 193-206. Oja, M., Kaski, S., and Kohonen, T. 2003. "Bibliogaphy of Self-Oganizing Map (SOM) Papes: 1998-2001 Addendum," Neual Computing Suveys (3), pp 1-156. Paas, L., and Kuijlen, T. 2001. "Towads a Geneal Definition of Custome Relationship Management," Jounal of Database Maketing (9:1), pp 51-60. Pak, H., and Baik, D. 2006. "A Study fo Contol of Client Value using Cluste Analysis," Jounal of Netwok and Compute Applications (29:4), Novembe, pp 262-276. Pyle, D. 1999. Data Pepaation fo Data Mining. San Diego, CA: Academic Pess. Rushmeie, H., Lawence, R., and Almasi, G. 1997. "Case Study: Visualizing Custome Segmentations Poduced by Self Oganizing Maps," in Eighth IEEE Visualization 1997 (VIS'97), Phoenix, AZ, USA, pp 463-466. Rygielski, C., Wang, J. -., and Yen, D. C. 2002. "Data Mining Techniques fo Custome Relationship Management," Technology in Society (24:4), pp 483-502. Shaw, M. J., Subamaniam, C., Tan, G. W., and Welge, M. E. 2001. "Knowledge Management and Data Mining fo Maketing," Decision Suppot Systems (31:1), pp 127-137. Smith, K., and Gupta, J. 2002. Neual Netwoks in Business. Heshey, PA: IDEA Goup Publishing. Teho, H., and Halinen, A. 2007. "Custome Potfolio Analysis Pactices in Diffeent Exchange Contexts," Jounal of Business Reseach (60:7), July, pp 720-730. Tsai, C.-Y., and Chiu, C.-C. 2004. "A Puchase-Based Maket Segmentation Methodology," Expet Systems with Applications (27:2), August, pp 265-276. Tunbull, P. W. 1990. "A Review of Potfolio Planning Models fo Industial Maketing and Puchasing Management," Euopean Jounal of Maketing (24:3), pp 7-22. Ultsch, A. 1993. "Self Oganized Featue Planes fo Monitoing and Knowledge Acquisition of a Chemical Pocess," in The Intenational Confeence on Atificial Neual Netwoks (ICANN93), Spinge-Velag: London, pp 864-867. Vellido, A., Lisboa, P. J. G., and Meehan, K. 1999a. "Segmentation of the on-line Shopping Maket using Neual Netwoks," Expet Systems with Applications (17:4), pp 303-314. Vellido, A., Lisboa, P. J. G., and Vaughan, J. 1999b. "Neual Netwoks in Business: A Suvey of Applications (1992 1998)," Expet Systems with Applications (17:1), pp 51-70. Vesanto, J., and Alhoniemi, E. 2000. "Clusteing of the Self-Oganizing Map," IEEE Tansactions on Neual Netwoks (11:3), May, pp 586-600. Wedel, M., and Kamakua, W. 1999. Maket Segmentation Conceptual and Methodological Foundations. Massachusetts, USA: Kluwe Academic Publishes. 420

19 th Austalasian Confeence on Infomation Systems Custome Potfolio Analysis Using the SOM 3-5 Dec 2008, Chistchuch Holmbom, et al. ACKNOWLEDGEMENTS The authos would like to thank the case oganization fo its paticipation in the study. The authos also gatefully acknowledge the financial suppot of the National Agency of Technology (Titan, gant no. 40063/08), the Academy of Finland (Visiting eseache gant no. 125588), and the Foundation fo Economic Education (gant no. 27999). APPENDIX 1 Clustes C1 C2 C3 C4 C5 C6 C7 C8 C9 C10 Poduct A 2.min min 2.max Max / Poduct B min 2.min Max 2.max Poduct C Min / 2.min Max 2.max Poduct D min 2.max Max / 2.min Poduct E min 2.min Max 2.max Poduct F min 2.max Max 2.min Poduct G min Max 2.max 2.min Poduct H 2.min Max 2.max min Poduct I min 2.min 2.max/ Max Poduct J 2.min 2.max/ Poduct K min 2.min Max / Min Max 2.max Poduct L min 2.min 2.max 0 Max Poduct M 0 0 0 0 0 0 0 0 0 0 Poduct N min 2.max Max 2.min Oveall puchases 2.min min Max 2.max Poduct P 2.min Min 2.max Max Poduct Q 2.max 0 Max / 0 0 Poduct R min 2.min Max 2.max 421

19 th Austalasian Confeence on Infomation Systems Custome Potfolio Analysis Using the SOM 3-5 Dec 2008, Chistchuch Holmbom, et al. Clustes with the two customes making most of the puchases (MAX, 2.MAX) wee maked with wam colos, i.e. diffeent shades of ed, and espectively, clustes with the two least puchasing customes (min, 2.min) wee maked with cool blue colos. The compaison was made accoding to aveage cluste sales. Also, the cluste whee the company that puchased the most, measued in, is located was maked fo each of the poducts sepaately (). COPYRIGHT Holmbom, Eklund and Back 2008. The authos assign to ACIS and educational and non-pofit institutions a non-exclusive licence to use this document fo pesonal use and in couses of instuction povided that the aticle is used in full and this copyight statement is epoduced. The authos also gant a non-exclusive licence to ACIS to publish this document in full in the Confeence Papes and Poceedings. Those documents may be published on the Wold Wide Web, CD-ROM, in pinted fom, and on mio sites on the Wold Wide Web. Any othe usage is pohibited without the expess pemission of the authos. 422