CASE STUDY: WEB-DOMAIN PRICE PREDICTION ON THE SECONDARY MARKET (4-LETTER CASE)
|
|
- Brian Alvin Perry
- 6 years ago
- Views:
Transcription
1 CASE STUDY: WEB-DOMAIN PRICE PREDICTION ON THE SECONDARY MARKET (4-LETTER CASE) MAY 2016 DATA-TRACER.COM
2 TABLE OF CONTENT SECTION 1 Research background Page 3 SECTION 2 Study design Page 8 SECTION 3 Results Page 11 APPENDIX 1 Benchmark models Page 18 APPENDIX 2 Random forest Page 20 2
3 SECTION 1 Research background 3
4 US DOMAIN INDUSTRY ALONE IS ESTIMATED AT $2B US Domain Industry, 2015 Annual Premium Web Domain Sales, USD M $2B revenue Market is dynamically growing in line with world growing e commerce industry +17% 8250 employees 4539 businesses Domain industry is growing. In CAGR constituted17%. Moreover, it is expected to grow even further due to overall growth of web-based businesses. Explosive development of Chinese e commerce is the latest trend fueling the growth of web-domain secondary market. It is already industry of considerable size since its market value (just in US) achieved 2 billion of US dollars. 4 Sources: Domain Name Prices (Dnpric.es), IBISWorld, Quartz.
5 AVERAGE DOMAIN PRICE ON THE SECONDARY MARKET IS STEADILY GROWING Average price of sold domain (index, base year-2006) Top 5 most expensive deals, USD M (excluding web-sites for adults) % Fund.com We.com $8.0 $10.0 Diamond.com $7.5 Z.com $ / / / / /01 Slots.com $5.5 Average price for domain on the secondary market has been growing steadily since Number of free domains (especially short and attractive ones) is constantly declining, causing growth of the secondary market. Most demanded domains achieved seven-digit price tags. 5 Sources: DNJournal, Sedo.com; 1- National Association of Securities Dealers Automated Quotations; 2- The Domain Name Price Index
6 MACHINE LEARNING IS NECESSARY TO PREDICT DOMAIN PRICES Share of domain sales quantities in different price segments 62% up to $100 26% 11% $100-$1000 $1000-$10000 Domain Price 1% $ Examples of web-sites with prices of less than $100 Domain Price rzwv.com $1 ulpq.com $3 xcoi.com $5 kxoy.com $10 pjov.com $20 vugz.com $40 mosf.com $80 ogev.com $100 Majority of domains have price below $100. However, it is extremely difficult to guess the price without application of machine learning technics. The problem is that lions share of domains priced less than $100 do not contain real words. 6
7 PROJECT FEATURES: Project objective: predict price for an arbitrary 4-letter domain offered on the secondary market Data used: over 120,000 domain sales since 2000 Predictors: 200+ features reflecting linguistic, topic interest and market place information Methods employed: non-parametric regression (Random Forrest) Results: predictive accuracy on the test dataset is 82.9% (measured by goodness of fit R 2 ) Possible next steps: development of general predictive model (to all types of domains) Out-of-the-box-solutions: inclusion of Google search data as well as letter combination popularity of Peter Norvig 7
8 SECTION 2 Study design 8
9 THE GOAL OF THE STUDY IS CREATION OF WEB- DOMAIN PRICE PREDICTING MODEL Linguistic characteristics Market place info Topic interest Advanced data mining tool Random Forest $$$ Web-domain price prediction 9
10 THREE TYPES OF INPUT FEATURES ARE USED Linguistic Market place Topic interest Consonant-vowel pattern Letter repetition pattern Letter place pattern Frequency of letter combination usage Undesirable letter availability Whether real word is contained Seller Date of the deal Price of the previous deal of the same domain Number of Google Searches (bid & competition) of the word contained in the domain Domain extension (.com,.org,.tele, etc.) Total number of variables in the dataset
11 SECTION 3 Results 11
12 THE MODEL OF RANDOM FOREST HAS SUBSTANTIAL PREDICTIVE POWER Price Predicted Price Random forest performed well in domain price forecasting. The goodness of fit is 82.9%, which means that model explains 82.9% of variation in domain prices. Random forest s results were compared to linear regression and decision tree models as benchmarks, and its predictions appeared statistically more powerful (details can be found in the appendix). 12 Note: Scatter plot reflects feet for randomly selected sample of 100 observations for logarithmic prices
13 VARIABLES WITH THE HIGHEST PREDICTIVE POWER Partner (Seller) indicator Previous price Date indicator Consonant-vowel pattern Frequency of 2-letter combinations Google Searches of containing word Domain extensions Indicator of company, which has sold the domain name Price of the domain at the moment of last sale Year and month indicator Pattern describing place of consonant and vowel letters in the word Number of times two-letter combination appeared in the set of texts analyzed by Peter Norvig In case domain contain real word, current indicator reflects number of Google Searches for this word Indicator of domain extension 13
14 EXAMPLE OF PRICE PREDICTION ALGORITHM Thai.co Mams.com Yftm.com Is real word contained? yes yes no What is year of deal? What is the seller? Afternic Sedo GoDaddy Predicted Price, USD True Price, USD? As the final output the client would be given model, which returns predicted prices for domain once its characteristics are entered 14
15 EXAMPLE OF PRICE PREDICTION ALGORITHM Thai.co Mams.com Yftm.com Is real word contained? yes yes no What is year of deal? What is the seller? Afternic Sedo GoDaddy Predicted Price, USD True Price, USD? As the final output the client would be given model, which returns predicted prices for domain once its characteristics are entered 15
16 EXAMPLE OF PRICE PREDICTION ALGORITHM Thai.co Mams.com Yftm.com Is real word contained? yes yes no What is year of deal? What is the seller? Afternic Sedo GoDaddy Predicted Price, USD True Price, USD ? As the final output the client would be given model, which returns predicted prices for domain once its characteristics are entered 16
17 PROJECT SUMMARY Market set-up which explains domain prices is pretty complex and depends on many factors. These factors cannot be easily observed and their effects on prices are not obvious. Low and medium price deals constitute lion s share of the market. However, accurate prediction of the price in this segment is rather challenging but lucrative. In order to take in account numerous factors simultaneously we used advanced machine learning technique Random Forest, which is robust to overfitting. Developed statistical model is flexible and, therefore, can be applied to other similar problems (e.g. prediction of price for domains of any length). The research is based on open source data The introduced analytical model shows good forecasting power (R 2 is 82.9%). 17
18 APPENDIX 1 Benchmark models 18
19 RANDOM FOREST IS BETTER THAN BENCHMARKS Goodness of Fit Cross-Validation* 87.3% 87.0% 82.9% 80.6% 77.3% 74.5% We may underline that decision tree performs almost as well as Random Forest for total sample prediction; But due to higher resistance to overfitting Random Forest produces more accurate estimates on the test dataset. Random Forest Decision Tree Linear regression model 19 Note: Cross-Validation means that goodness of fit is measured on the bases of test dataset (which was not used for model fittin g).
20 APPENDIX 2 Random forest 20
21 DECISION TREE IS BASIC ELEMENT OF THE RANDOM FOREST Illustrative example of the Decision Tree segment built on the training data GENERAL IDEA: Decision tree classifies cases into groups or predicts values of a dependent (target) variable based on values of independent (predictor) variables. Independent variables are chosen in the way that groups are separated the best. EXAMPLE EXPLANATION The model determines how combination of various factors affects price of the domain. In the example only one branch of the tree is displayed fully, and it reflects how average price of domain sold on SEDO platform changes with domain extension, price of previous sale and consonant vowel pattern. Extension: com Partner Sedo (Yes/No) Previous Price <$80 Previous Price >=$80 Extension: org Order of variables and size of the tree is determined statistically Extension: net The tree grows from every node on every level (only some branches are displayed here) Extension: other Pattern: cvcv* Pattern: vcvc Pattern: vccv Pattern: cvvc Pattern: ccvv average price: $252 average price: $212 average price: $150 average price: $140 average price: $90 21 *Note: c stands for consonant, v stands for vowel
22 THE DECISION TREE CAN BECOME QUITE LARGE AND COMPLICATED Illustrative example of the section of full Decision Tree built on the training data set When all predictors are used in the analysis the tree becomes very large. However, the single tree is not sufficiently robust method and Random Forest is preferred. 22
23 RANDOM FOREST IS AGGREGATION OF DECISION TREES Random data subset Random variable subset Random data subset Random variable subset Random data subset Random variable subset Decision tree 1 Decision tree 2 Decision tree N* Results of the individual decision trees (typically trees) are aggregated and average prices are computed. Importance of each variable is calculated. 23 Note:(*) Optimal number of trees is determined during analysis - usually about 500 trees are built
24 RANDOM FOREST IS SUITABLE TOOL FOR DOMAIN PRICE PREDICTION The model does not require data to have specific distribution Both categorical and scale variables can be used Weak predictors are effectively incorporated in the model The model is not prone to overfitting, the model is robust Predictive power of the model does not deteriorate when large number of predictors is used. Final output of the model is price, which can be used as predictor of future sale of the domain 24
25 If you have any questions, please contact us: Skype: michael.dopira 25
Jialu Yan, Tingting Gao, Yilin Wei Advised by Dr. German Creamer, PhD, CFA Dec. 11th, Forecasting Rossmann Store Sales Prediction
Jialu Yan, Tingting Gao, Yilin Wei Advised by Dr. German Creamer, PhD, CFA Dec. 11th, 2015 Forecasting Rossmann Store Sales Prediction Problem Understanding It is very important for retail stores to save
More informationApplying Regression Techniques For Predictive Analytics Paviya George Chemparathy
Applying Regression Techniques For Predictive Analytics Paviya George Chemparathy AGENDA 1. Introduction 2. Use Cases 3. Popular Algorithms 4. Typical Approach 5. Case Study 2016 SAPIENT GLOBAL MARKETS
More informationBig Data. Methodological issues in using Big Data for Official Statistics
Giulio Barcaroli Istat (barcarol@istat.it) Big Data Effective Processing and Analysis of Very Large and Unstructured data for Official Statistics. Methodological issues in using Big Data for Official Statistics
More informationIBM SPSS Decision Trees
IBM SPSS Decision Trees 20 IBM SPSS Decision Trees Easily identify groups and predict outcomes Highlights With SPSS Decision Trees you can: Identify groups, segments, and patterns in a highly visual manner
More informationE-Commerce Sales Prediction Using Listing Keywords
E-Commerce Sales Prediction Using Listing Keywords Stephanie Chen (asksteph@stanford.edu) 1 Introduction Small online retailers usually set themselves apart from brick and mortar stores, traditional brand
More informationPRODUCT DESCRIPTIONS AND METRICS
PRODUCT DESCRIPTIONS AND METRICS Adobe PDM - Adobe Analytics (2015v1) The Products and Services described in this PDM are either On-demand Services or Managed Services (as outlined below) and are governed
More informationEnsemble Modeling. Toronto Data Mining Forum November 2017 Helen Ngo
Ensemble Modeling Toronto Data Mining Forum November 2017 Helen Ngo Agenda Introductions Why Ensemble Models? Simple & Complex ensembles Thoughts: Post-real-life Experimentation Downsides of Ensembles
More informationCSC-272 Exam #1 February 13, 2015
CSC-272 Exam #1 February 13, 2015 Name Questions are weighted as indicated. Show your work and state your assumptions for partial credit consideration. Unless explicitly stated, there are NO intended errors
More informationWho Are My Best Customers?
Technical report Who Are My Best Customers? Using SPSS to get greater value from your customer database Table of contents Introduction..............................................................2 Exploring
More informationBeating the Competition with Cognitive Commerce
Beating the Competition with Cognitive Commerce Tom Robertshaw Founder & CEO of Meanbee @bobbyshaw Meanbee UK ecommerce Agency Specialized in Magento Technology First Client revenues average $2-10 million
More informationPredictive analytics [Page 105]
Week 8, Lecture 17 and Lecture 18 Predictive analytics [Page 105] Predictive analytics is a highly computational data-mining technology that uses information and business intelligence to build a predictive
More informationDATA ANALYTICS WITH R, EXCEL & TABLEAU
Learn. Do. Earn. DATA ANALYTICS WITH R, EXCEL & TABLEAU COURSE DETAILS centers@acadgild.com www.acadgild.com 90360 10796 Brief About this Course Data is the foundation for technology-driven digital age.
More informationChapter 13 Knowledge Discovery Systems: Systems That Create Knowledge
Chapter 13 Knowledge Discovery Systems: Systems That Create Knowledge Becerra-Fernandez, et al. -- Knowledge Management 1/e -- 2007 Prentice Hall Chapter Objectives To explain how knowledge is discovered
More informationGIVING ANALYTICS MEANING AGAIN
GIVING ANALYTICS MEANING AGAIN GIVING ANALYTICS MEANING AGAIN When you hear the word analytics what do you think? If it conjures up a litany of buzzwords and software vendors, this is for good reason.
More information3 Ways to Improve Your Targeted Marketing with Analytics
3 Ways to Improve Your Targeted Marketing with Analytics Introduction Targeted marketing is a simple concept, but a key element in a marketing strategy. The goal is to identify the potential customers
More informationPredictive Modeling Using SAS Visual Statistics: Beyond the Prediction
Paper SAS1774-2015 Predictive Modeling Using SAS Visual Statistics: Beyond the Prediction ABSTRACT Xiangxiang Meng, Wayne Thompson, and Jennifer Ames, SAS Institute Inc. Predictions, including regressions
More informationSPM 8.2. Salford Predictive Modeler
SPM 8.2 Salford Predictive Modeler SPM 8.2 The SPM Salford Predictive Modeler software suite is a highly accurate and ultra-fast platform for developing predictive, descriptive, and analytical models from
More informationPredicting Yelp Restaurant Reviews
Predicting Yelp Restaurant Reviews Wael Farhan UCSD: A53070918 9500 Gilman Drive La Jolla, CA, 92093 wfarhan@eng.ucsd.edu ABSTRACT Starting a restaurant is a tricky business. Restaurant owners have to
More informationGlobal Ceramic Machinery Market: Size, Trends & Forecasts ( ) May 2017
Global Ceramic Machinery Market: Size, Trends & Forecasts (2017-2021) May 2017 Global Ceramic Machinery Market Report Scope of the Report The report entitled Global Ceramic Machinery Market: Size, Trends
More informationData Mining in CRM THE CRM STRATEGY
CHAPTER ONE Data Mining in CRM THE CRM STRATEGY Customers are the most important asset of an organization. There cannot be any business prospects without satisfied customers who remain loyal and develop
More informationMarketing & Big Data
Marketing & Big Data Surat Teerakapibal, Ph.D. Lecturer in Marketing Director, Doctor of Philosophy Program in Business Administration Thammasat Business School What is Marketing? Anti-Marketing Marketing
More informationApplication of Machine Learning to Financial Trading
Application of Machine Learning to Financial Trading January 2, 2015 Some slides borrowed from: Andrew Moore s lectures, Yaser Abu Mustafa s lectures About Us Our Goal : To use advanced mathematical and
More informationCredit Card Marketing Classification Trees
Credit Card Marketing Classification Trees From Building Better Models With JMP Pro, Chapter 6, SAS Press (2015). Grayson, Gardner and Stephens. Used with permission. For additional information, see community.jmp.com/docs/doc-7562.
More informationStay ahead of the game with Adalyser
+44 (0) 333 666 7366 Stay ahead of the game with Adalyser Real-time online platform for the collection, analysis and optimisation of offline and online media spend. About Adalyser Developed for our business
More informationTree Depth in a Forest
Tree Depth in a Forest Mark Segal Center for Bioinformatics & Molecular Biostatistics Division of Bioinformatics Department of Epidemiology and Biostatistics UCSF NUS / IMS Workshop on Classification and
More informationPredicting user rating on Amazon Video Game Dataset
Predicting user rating on Amazon Video Game Dataset CSE190A Assignment2 Hongyu Li UC San Diego A900960 holi@ucsd.edu Wei He UC San Diego A12095047 whe@ucsd.edu ABSTRACT Nowadays, accurate recommendation
More informationComputational Gambling
Introduction Computational Gambling Konstantinos Katsiapis Gambling establishments work with the central dogma of Percentage Payout (PP). They give back only a percentage of what they get. For example
More informationECONOMIC MACHINE LEARNING FOR FRAUD DETECTION
ECONOMIC MACHINE LEARNING FOR FRAUD DETECTION Maytal Saar-Tsechansky 2015 UT CID Report #1511 This UT CID research was supported in part by the following organizations: identity.utexas.edu ECONOMIC MACHINE
More informationAustralian Online Search and Directories Advertising Market
Australian Online Search and Directories Advertising Market 2009-2013 1 About this report 1.1 Introduction This report is a multi-client study produced by Frost & Sullivan s ICT Practice in Australia during
More informationInsights from the Wikipedia Contest
Insights from the Wikipedia Contest Kalpit V Desai, Roopesh Ranjan Abstract The Wikimedia Foundation has recently observed that newly joining editors on Wikipedia are increasingly failing to integrate
More informationFrom Fraud Analytics Using Descriptive, Predictive, and Social Network Techniques. Full book available for purchase here.
From Fraud Analytics Using Descriptive, Predictive, and Social Network Techniques. Full book available for purchase here. Contents List of Figures xv Foreword xxiii Preface xxv Acknowledgments xxix Chapter
More informationINTRODUCTION TO THE BPA WORLDWIDE B2B MEDIA EXCHANGE
Contents: INTRODUCTION TO THE BPA WORLDWIDE B2B MEDIA EXCHANGE 1. Introduction 2. Why Programmatic Advertising 3. Why participate in the BPA B2B Media Exchange 4. The requirements to participate 5. The
More informationAchieve Better Insight and Prediction with Data Mining
Clementine 12.0 Specifications Achieve Better Insight and Prediction with Data Mining Data mining provides organizations with a clearer view of current conditions and deeper insight into future events.
More informationET MedialabsPvt. Ltd. Opp. WHY Select GO City ONLINE Walk?- Mall, New Delhi ; Contact :
ET MedialabsPvt. Ltd. www.etmedialabs.com Opp. WHY Select GO City ONLINE Walk?- Mall, New Delhi -110017 ; Contact : 011-41016331 Managing Large Scale Google PPC Campaigns Running ecommerce campaigns on
More informationKnowledgeENTERPRISE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK. Advanced Analytics on Spark BROCHURE
FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK Are you drowning in Big Data? Do you lack access to your data? Are you having a hard time managing Big Data processing requirements?
More informationPredictive Planning for Supply Chain Management
Predictive Planning for Supply Chain Management David Pardoe and Peter Stone Department of Computer Sciences The University of Texas at Austin {dpardoe, pstone}@cs.utexas.edu Abstract Supply chains are
More informationIBM s Analytics Transformation
IBM s Analytics Transformation Value Capture from Big Data, Analytics and Cognitive Technologies Martin Fleming VP, Chief Analytics Officer, and Chief Economist Chief Analytics Office Analytics Aligned
More informationOlin Business School Master of Science in Customer Analytics (MSCA) Curriculum Academic Year. List of Courses by Semester
Olin Business School Master of Science in Customer Analytics (MSCA) Curriculum 2017-2018 Academic Year List of Courses by Semester Foundations Courses These courses are over and above the 39 required credits.
More informationPredicting Customer Behavior Using Data Churn Analytics in Telecom
Predicting Customer Behavior Using Data Churn Analytics in Telecom Tzvi Aviv, PhD, MBA Introduction In antiquity, alchemists worked tirelessly to turn lead into noble gold, as a by-product the sciences
More informationIBM SPSS Modeler Personal
IBM SPSS Modeler Personal Make better decisions with predictive intelligence from the desktop Highlights Helps you identify hidden patterns and trends in your data to predict and improve outcomes Enables
More informationSAP Predictive Analytics Suite
SAP Predictive Analytics Suite Tania Pérez Asensio Where is the Evolution of Business Analytics Heading? Organizations Are Maturing Their Approaches to Solving Business Problems Reactive Wait until a problem
More informationRandom Forests. Parametrization and Dynamic Induction
Random Forests Parametrization and Dynamic Induction Simon Bernard Document and Learning research team LITIS laboratory University of Rouen, France décembre 2014 Random Forest Classifiers Random Forests
More informationToday. Last time. Lecture 5: Discrimination (cont) Jane Fridlyand. Oct 13, 2005
Biological question Experimental design Microarray experiment Failed Lecture : Discrimination (cont) Quality Measurement Image analysis Preprocessing Jane Fridlyand Pass Normalization Sample/Condition
More informationECONOMIC MODELLING & MACHINE LEARNING
ECONOMIC MODELLING & MACHINE LEARNING A PROOF OF CONCEPT NICOLAS WOLOSZKO, OECD TECHNOLOGY POLICY INSTITUTE FEB 22 2017 Economic forecasting with Adaptive Trees 1 2 Motivation Adaptive Trees 3 Proof of
More informationChurn Prediction for Game Industry Based on Cohort Classification Ensemble
Churn Prediction for Game Industry Based on Cohort Classification Ensemble Evgenii Tsymbalov 1,2 1 National Research University Higher School of Economics, Moscow, Russia 2 Webgames, Moscow, Russia etsymbalov@gmail.com
More informationCopyr i g ht 2012, SAS Ins titut e Inc. All rights res er ve d. ENTERPRISE MINER: ANALYTICAL MODEL DEVELOPMENT
ENTERPRISE MINER: ANALYTICAL MODEL DEVELOPMENT ANALYTICAL MODEL DEVELOPMENT AGENDA Enterprise Miner: Analytical Model Development The session looks at: - Supervised and Unsupervised Modelling - Classification
More informationSoftware Quality Metrics. Analyzing & Measuring Customer Satisfaction (Chapter 14)
Software Quality Metrics Analyzing & Measuring Customer Satisfaction (Chapter 14) By Zareen Abbas Reg# 169/MSSE/F07 Usman Thakur Reg# 181/MSSE/F07 1 Overview-Quality Product quality and customer satisfaction
More informatione7 Capacity Expansion Long-term resource planning for resource planners and portfolio managers
e7 Capacity Expansion Long-term resource planning for resource planners and portfolio managers e7 Capacity Expansion Overview The e7 Capacity Expansion solution gives resource planners and portfolio managers
More informationWelcome your.. virtual colleagues!
Welcome your.. virtual colleagues! Abhijit Tuljapurkar Robotic & Cognitive Automation Lead Deloitte Digital Michael Winther Advanced Analytics Lead AIM AUTOMATION. TRANSFORMING HUMAN WORKFORCE 25% Jobs
More informationForecasting diffusion with prelaunch online search traffic data
Forecasting diffusion with prelaunch online search traffic data Oliver Schaer Nikolaos Kourentzes Robert Fildes Higher School of Economics Saint Petersburg 25th May 2016 Lancaster Centre for Forecasting
More informationChapter 8 Analytical Procedures
Slide 8.1 Principles of Auditing: An Introduction to International Standards on Auditing Chapter 8 Analytical Procedures Rick Hayes, Hans Gortemaker and Philip Wallage Slide 8.2 Analytical procedures Analytical
More informationExperiences in the Use of Big Data for Official Statistics
Think Big - Data innovation in Latin America Santiago, Chile 6 th March 2017 Experiences in the Use of Big Data for Official Statistics Antonino Virgillito Istat Introduction The use of Big Data sources
More informationPredicting Corporate Influence Cascades In Health Care Communities
Predicting Corporate Influence Cascades In Health Care Communities Shouzhong Shi, Chaudary Zeeshan Arif, Sarah Tran December 11, 2015 Part A Introduction The standard model of drug prescription choice
More informationStartup Machine Learning: Bootstrapping a fraud detection system. Michael Manapat
Startup Machine Learning: Bootstrapping a fraud detection system Michael Manapat Stripe @mlmanapat About me: Engineering Manager of the Machine Learning Products Team at Stripe About Stripe: Payments infrastructure
More informationData mining and Renewable energy. Cindi Thompson
Data mining and Renewable energy Cindi Thompson June 2012 Analytics, Big Data, and Data Science 1 What is Analytics? makes extensive use of data, statistical and quantitative analysis, explanatory and
More informationENGG1811: Data Analysis using Excel 1
ENGG1811 Computing for Engineers Data Analysis using Excel (weeks 2 and 3) Data Analysis Histogram Descriptive Statistics Correlation Solving Equations Matrix Calculations Finding Optimum Solutions Financial
More informationA Personalized Company Recommender System for Job Seekers Yixin Cai, Ruixi Lin, Yue Kang
A Personalized Company Recommender System for Job Seekers Yixin Cai, Ruixi Lin, Yue Kang Abstract Our team intends to develop a recommendation system for job seekers based on the information of current
More informationDecision Tree Learning. Richard McAllister. Outline. Overview. Tree Construction. Case Study: Determinants of House Price. February 4, / 31
1 / 31 Decision Decision February 4, 2008 2 / 31 Decision 1 2 3 3 / 31 Decision Decision Widely Used Used for approximating discrete-valued functions Robust to noisy data Capable of learning disjunctive
More informationSmart BW Bank. Gerrit Bungeroth, BW-Bank Stefan Weingärtner, AdvancedAnalytics.Academy
Landesbank Baden-Württembergische Bank Gold. It s a rare metal and one of the earth s most valuable and sought-after raw materials. Smart Data @ BW Bank Gerrit Bungeroth, BW-Bank Stefan Weingärtner, AdvancedAnalytics.Academy
More informationIntroduction AdWords Guide
2018 AdWords Guide Introduction In the perfect scenario, you would be able to advertise to potential customers at the exact moment they show purchase intent. That is exactly what Google AdWords does. AdWords
More informationData Mining Applications with R
Data Mining Applications with R Yanchang Zhao Senior Data Miner, RDataMining.com, Australia Associate Professor, Yonghua Cen Nanjing University of Science and Technology, China AMSTERDAM BOSTON HEIDELBERG
More informationUsing Decision Tree to predict repeat customers
Using Decision Tree to predict repeat customers Jia En Nicholette Li Jing Rong Lim Abstract We focus on using feature engineering and decision trees to perform classification and feature selection on the
More informationTHE CONVERSION CYCLE
Accounting Information Systems, 3rd. Ed. The Conversion Cycle 7 CHAPTER 7 THE CONVERSION CYCLE This is perhaps the most complex chapter so far. The first section presents a discussion of a traditional
More informationRetail Sales Benchmarks, KPI Definitions & Measurement Details
The OpsDog Retail Sales Benchmarking Report Retail Sales Benchmarks, KPI Definitions & Measurement Details ABRIDGED CONTENT Purchase to View Full Benchmarking Report! 2017 Edition www.opsdog.com info@opsdog.com
More informationA better marketplace for almonds
A better marketplace for almonds A properly designed online marketplace for almonds will establish fair, competitive prices with reduced volatility. Our online marketplaces have achieved the following
More informationscience and applications
Introduction to data science and applications 1 What s possible with data and analytics? 2 Facebook can predict break-ups? http://www.huffingtonpost.com/2014/02/14/facebook-relationship-study_n_4784291.html
More informationDON T FORGET ABOUT MEASUREMENT. Written by: Miko Kershberg, WSI Digital Marketing Expert
Don t Forget About Measurement // 1 2 12.ch DON T FORGET ABOUT MEASUREMENT Written by: Miko Kershberg, WSI Digital Marketing Expert Don t Forget About Measurement // 2 Table of Contents Introduction...
More informationIBM Digital Recommendations
Service Description IBM Digital Recommendations This Service Description describes the Cloud Service IBM provides to Client. Client means the company and its authorized users and recipients of the Cloud
More information1.0 Chapter Introduction
1.0 Chapter Introduction In this chapter, you will learn to use price index numbers to make the price adjustments necessary to analyze price and cost information collected over time. Price Index Numbers.
More informationFORTUNE FAVORS THE BRAVE EMPOWERING THE BACK OFFICE INSIGHT REPORT
FORTUNE FAVORS THE BRAVE EMPOWERING THE BACK OFFICE INSIGHT REPORT Contents Technology in the back office Regulation Tech trends The future of the back office Conclusions Technology in the back office
More informationChurn Prevention in Telecom Services Industry- A systematic approach to prevent B2B churn using SAS
Paper 1414-2017 Churn Prevention in Telecom Services Industry- A systematic approach to prevent B2B churn using SAS ABSTRACT Krutharth Peravalli, Dr. Dmitriy Khots West Corporation It takes months to find
More informationAutomated Embedded AI Asset Intelligence. Jean-Michel Cambot Founder & Chief Evangelist
Automated Embedded AI Asset Intelligence Jean-Michel Cambot Founder & Chief Evangelist Intelligent Machines must be able to explain every decision they make 2 Then comes the real Magic of Artificial Intelligence
More informationEST Accuracy of FEL 2 Estimates in Process Plants
EST.2215 Accuracy of FEL 2 Estimates in Process Plants Melissa C. Matthews Abstract Estimators use a variety of practices to determine the cost of capital projects at the end of the select stage when only
More informationBIA/Kelsey Local Commerce Monitor: SMB Adoption of Mobile, Social, E-Commerce, Loyalty Programs and Promotions
BIA/Kelsey Local Commerce Monitor: SMB Adoption of Mobile, Social, E-Commerce, Loyalty Programs and Promotions August 21, 2013 For audio: Listen through your speakers or dial in. Once you choose, please
More informationOPTIMIZING GOOGLE SHOPPING: STRUCTURE. Taking a closer look at optimizing Google Shopping and how it is structured
OPTIMIZING GOOGLE SHOPPING: STRUCTURE Taking a closer look at optimizing Google Shopping and how it is structured ABOUT THE AUTHORS PART OF THE QUANTADS PPC TEAM THOMAS BYSKOV MADSEN DIGITAL MARKETING
More informationNIELSEN P$YCLE METHODOLOGY
NIELSEN P$YCLE METHODOLOGY May 2014 PRIZM and P$ycle are registered trademarks of The Nielsen Company (US), LLC Nielsen and the Nielsen logo are trademarks or registered trademarks of CZT/ACN Trademarks,
More informationGlobal Gas and Steam Turbine Markets Conventional Thermal Power Expansion Driven by Emerging Markets and Rising Natural Gas Availability
Global Gas and Steam Turbine Markets Conventional Thermal Power Expansion Driven by Emerging Markets and Rising Natural Gas Availability June 2014 Executive Summary Return to contents 4 Key Findings Global
More informationIntelligent continuous improvement, when BPM meets AI. Miguel Valdés Faura CEO and co-founder
Intelligent continuous improvement, when BPM meets AI Miguel Valdés Faura CEO and co-founder BPM IS NOT DEAD. But we should admit though, BPM has been a bit unsexy lately. And exactly, what is your job
More informationLet the data speak: Machine learning methods for data editing and imputation
Working Paper 31 UNITED NATIONS ECONOMIC COMMISSION FOR EUROPE CONFERENCE OF EUROPEAN STATISTICIANS Work Session on Statistical Data Editing (Budapest, Hungary, 14-16 September 2015) Topic (v): Emerging
More informationLeveraging Smart Meter Data & Expanding Services BY ELLEN FRANCONI, PH.D., BEMP, MEMBER ASHRAE; DAVID JUMP, PH.D., P.E.
ASHRAE www.ashrae.org. Used with permission from ASHRAE Journal. This article may not be copied nor distributed in either paper or digital form without ASHRAE s permission. For more information about ASHRAE,
More informationChannelAdvisor 2017 Analyst Meeting. March 8, 2017
ChannelAdvisor 2017 Analyst Meeting March 8, 2017 Mark Cook - CFO ChannelAdvisor 2017 Analyst Meeting March 8, 2017 Safe Harbor Statement This presentation contains "forward-looking" statements that are
More informationBivariate Data Notes
Bivariate Data Notes Like all investigations, a Bivariate Data investigation should follow the statistical enquiry cycle or PPDAC. Each part of the PPDAC cycle plays an important part in the investigation
More informationUnderstanding Ad Exchanges
Understanding Ad Exchanges Ad exchanges are the platforms that power programmatic advertising, the most popular method for buying and selling digital advertising. Here s a look at how ad exchanges work,
More informationECLT 5810 E-Commerce Data Mining Techniques - Introduction. Prof. Wai Lam
ECLT 5810 E-Commerce Data Mining Techniques - Introduction Prof. Wai Lam Data Opportunities Business infrastructure have improved the ability to collect data Virtually every aspect of business is now open
More informationEvalueserve IP and R&D Solutions
Automated Patent Classification What we have learned from client projects Evalueserve IP and R&D Solutions Agenda Introduction Automated Patent Classification @ Evalueserve IP and R&D Projects Use Cases
More informationNew Technologies in Banking
New Technologies in Banking Frankfurt School of Finance & Management Sonnemannstraße 9-11 60314 Frankfurt, Germany p.rossbach@fs.de Machine Learning Success Stories Customer Profiling Predicting Customer
More informationBusiness Intelligence, 4e (Sharda/Delen/Turban) Chapter 2 Descriptive Analytics I: Nature of Data, Statistical Modeling, and Visualization
Business Intelligence, 4e (Sharda/Delen/Turban) Chapter 2 Descriptive Analytics I: Nature of Data, Statistical Modeling, and Visualization 1) One of SiriusXM's challenges was tracking potential customers
More informationUltiPro Perception Collect and understand employee feedback with surveys and sentiment analysis
UltiPro Perception Collect and understand employee feedback with surveys and sentiment analysis UltiPro Perception offers a modern way for collecting and understanding employee feedback providing a rich,
More informationMake the Jump from Business User to Data Analyst in SAS Visual Analytics
SESUG 2016 Paper 200-2016 Make the Jump from Business User to Data Analyst in SAS Visual Analytics Ryan Kumpfmilller, Zencos Consulting ABSTRACT SAS Visual Analytics is effective in empowering the business
More informationHotel Industry Demand Curves
Journal of Hospitality Financial Management The Professional Refereed Journal of the Association of Hospitality Financial Management Educators Volume 20 Issue 1 Article 6 Summer 2012 Hotel Industry Demand
More informationTo 3PL or Not to 3PL:
To 3PL or Not to 3PL: An Overview of Third Party Logistics Outsourcing 866.672.2862 m33integrated.com 511 Rhett Street, Suite 3 Greenville, South Carolina 29601 Third party logistics (3PL) is the business
More informationBuilding the In-Demand Skills for Analytics and Data Science Course Outline
Day 1 Module 1 - Predictive Analytics Concepts What and Why of Predictive Analytics o Predictive Analytics Defined o Business Value of Predictive Analytics The Foundation for Predictive Analytics o Statistical
More informationUniversal Office Copiers & Printers: Worldwide Market Opportunities and Product Requirements
Universal Office Copiers & Printers: Worldwide Market Opportunities and Product Requirements Each new generation of office output technology has the potential to change the structure of the industry. Advances
More informationThe Analytical Revolution
The Analytical Revolution Colin Shearer Global Executive, Advanced Analytic Solutions IBM Our world is becoming smarter Instrumented Interconnected Intelligent enabling organizations to make faster, better-informed
More informationFinancial Management: Sales and Marketing
Contact Us Financial Management: Sales and Marketing There is a fee associated with participation in APQC's Open Standards Research. If you have any questions about the fee, please contact the APQC helpdesk
More informationBot Insight is here. Improve your company s top-and-bottom-line with powerful, real-time RPA Analytics Go be great.
Bot Insight is here. Improve your company s top-and-bottom-line with powerful, real-time RPA Analytics Go be great. There s so much to be gained. Successful deployment of Robotic Process Automation (RPA)
More informationDIGITAL MARKETING DATA SHEET CHANNELADVISOR DIGITAL MARKETING ENABLES RETAILERS TO: And many more
DIGITAL MARKETING DATA SHEET With more and more digital marketing channels introduced each year, it can be hard to keep track of every place you re advertising let alone how each of your products is performing
More informationPREDICTING EMPLOYEE ATTRITION THROUGH DATA MINING
PREDICTING EMPLOYEE ATTRITION THROUGH DATA MINING Abbas Heiat, College of Business, Montana State University, Billings, MT 59102, aheiat@msubillings.edu ABSTRACT The purpose of this study is to investigate
More informationNew restaurants fail at a surprisingly
Predicting New Restaurant Success and Rating with Yelp Aileen Wang, William Zeng, Jessica Zhang Stanford University aileen15@stanford.edu, wizeng@stanford.edu, jzhang4@stanford.edu December 16, 2016 Abstract
More informationLumière. A Smart Review Analysis Engine. Ruchi Asthana Nathaniel Brennan Zhe Wang
Lumière A Smart Review Analysis Engine Ruchi Asthana Nathaniel Brennan Zhe Wang Purpose A rapid increase in Internet users along with the growing power of online reviews has given birth to fields like
More information