Consumer Segmentation and Knowledge Extraction from Smart Meter and Survey Data

Similar documents
Consumer Segmentation and Knowledge Extraction from Smart Meter and Survey Data

Peaking Interest: How awareness drives the effectiveness of time-of-use electricity pricing

idr: Consumer and Grid Friendly Demand Response System

Is it time for daily time-of-use electricity pricing? Household response to a pricing experiment

Leveraging Smart Meter Data & Expanding Services BY ELLEN FRANCONI, PH.D., BEMP, MEMBER ASHRAE; DAVID JUMP, PH.D., P.E.

Demand Estimation Sub Committee. EUC Modelling Approach Spring th December 2018

2017 Seasonal Savings Evaluation

Estimation of network leakage with smart meters

Big Data, Smart Energy, and Predictive Analytics. Dr. Rosaria Silipo Phil Winters

CO-LOCATION FIELD STUDY Full Report. Pamplona (SPAIN) August November 2017 (4 Months) Season: Summer-Autumn

Statewide Pricing Pilot (SPP)

An Aggregator Framework for European Demand Response Programs. Rune Hylsberg Jacobsen Department of Engineering Aarhus University, Denmark

Smarter Business processes resulting from Smart Data

What lies below the curve

Advanced Metering Infrastructure Analytics -A Case Study

Residential Demand Response using a Single Variable Information System-

Smart energy outlook. February 2017

Demand Estimation Technical Work Group Data Validation and Aggregations - Spring th April 2018

SYST/OR 699 Fall 2016-Final Presentation

Lights out: Why Earth Hour is dimming in B.C.

Extracting business value from Big Data. Dr. Rosaria Silipo Phil Winters

Adaptive Time Series Forecasting of Energy Consumption using Optimized Cluster Analysis

The Potential for Smart Meters in a National Household Energy Survey

ACHIEVING ENERGY CONSERVATION GOALS WITHOUT THE BROAD APPLICATION OF SMART METERS ONTARIO FEDERATION OF AGRICULTURE

Whitepaper: End uses of Load disaggregation

Smart Metering. Big Data and the Value of Analytics WHITE PAPER

Dynamic Pricing Works in a Hot, Humid Climate

Statewide Pricing Pilot (SPP)

Business Analytics & Data Mining Modeling Using R Dr. Gaurav Dixit Department of Management Studies Indian Institute of Technology, Roorkee

CitiPower Amended Revised Proposed Tariff Structure Statement

Key Drivers of Households Water Use in South Australia: Practical Implications

Novel model for defining electricity tariffs using residential load profile characterisation

Customer: City of Dubuque

New Clustering-based Forecasting Method for Disaggregated End-consumer Electricity Load Using Smart Grid Data

Dimitris Tselentis. Civil - Transportation Engineer Ph.D. Candidate Researcher.

NSPM Rate Design Pilot

Smart Irrigation Making Every Drop Count Master Gardner State Conference Oct , 2013

Spatial and Temporal Disaggregation of Water Demand and Leakage of the Water Distribution Network in Skiathos, Greece

Renewable Energy in The Netherlands March 2016

Scott Valley Siskiyou County, California: Voluntary Private Well Water Level Monitoring Program Spring 2006 January 2016

A sustainable future with the Bosch inverter The heart of your photovoltaic system

Level II - Whole Building Interval Electric Data with Tenant Plug Load System

Developing Big-Data Infrastructure for Analyzing AIS Vessel Tracking Data on a Global Scale

Automatic Socio-Economic Classification of Households Using Electricity Consumption Data

Residential Energy Consumption: Longer Term Response to Climate Change

Decarbonization of Campus Energy Systems

Applying Regression Techniques For Predictive Analytics Paviya George Chemparathy

Air pollution and children s respiratory health: Do English parents respond to air quality information?

GENERAL SERVICE SCHEDULE GS-TOU OPTIONAL GENERAL SERVICE TIME OF USE METERED RATE

Conserve Energy SoCal Survey: Full Results. November 7, 2016

Machine Learning. Genetic Algorithms

Machine Learning. Genetic Algorithms

Impact of Thermal Storage on Pellet Boiler Performance

Combinational Collaborative Filtering: An Approach For Personalised, Contextually Relevant Product Recommendation Baskets

Is More Always Better? A Comparison of Billing Regression Results Using Monthly, Daily and Hourly AMI Data

AMI Billing Regression Study Final Report. February 23, 2016

Voltage Reduction Field Trials on Distributions Circuits

Michael Cahn and Barry Farrara, UC Cooperative Extension, Monterey Tom Bottoms and Tim Hartz, UC Davis

The energy benefits of View Dynamic Glass

Better Understanding Customers: Developing SMB DNA to Improve Customer Interactions and Catalyze Positive Behavior Changes

Non-Intrusive Load Monitoring (Nilms) Research Activity End-Use Energy Efficiency & Demand Response

Cluster-based Forecasting for Laboratory samples

Customer Targeting for Residential Energy Efficiency Adam Scheer and Sam Borgeson

Analysis of Energy Consumption for Building Operations in Los Angeles County

Smart Monitoring System For Automatic Anomaly Detection and Problem Diagnosis. Xianping Qu March 2015

equest Home Energy Model ME 514 ~ HVAC Sean Rosin University of Idaho/Boise State University Spring 2016

Using Low Voltage Smart System (LVSS) Data for Intelligent Operations and Customer Support

EFFECTS OF OCCUPANT AND DESIGNER BEHAVIORS ON RESIDENTIAL DEEP ENERGY RETROFIT PERFORMANCE

Fixed Price Energy A Guaranteed Deal

Towards Data-Driven Energy Consumption Forecasting of Multi-Family Residential Buildings: Feature Selection via The Lasso

PV Generation Potential 17.1 kbtu/ft2/yr 15.4

Allstream Centre Energy Performance Report

Fixed Price Energy A Guaranteed Deal

Segmentation Successes. February 12, 2014

PROGRAM YEAR 1 ( ) EM&V REPORT FOR THE RESIDENTIAL ENERGY EFFICIENCY BENCHMARKING PROGRAM

Comparison of Feedback- Induced Behaviors Across 3 Types of Feedback. Does the type of feedback matter?

Transform, Modernize And Scale Energy Efficiency Rebate Programs. Utilities

NYSERDA High Performance Development Challenge Energy Monitoring Report Hudson Passive Project

Energy Trust of Oregon New Homes Billing Analysis: Comparison of Modeled vs. Actual Energy Usage

Alberta Energy - Capacity Market Framework Engagement December 2017

Integrated Hourly Electricity Demand Forecast For Ontario

Chuck Sarnoski Delaware DNREC / DAQ

Data-Driven Management for a Sustainable Energy Grid

Insight Report: Domestic Heat Pumps

Efficiency assessment of the household water use

PRODUCT MARKETING - MAILING. Mark Evanoff

Professor Dr. Gholamreza Nakhaeizadeh. Professor Dr. Gholamreza Nakhaeizadeh

Integrated Production & Promotion. Leisure IBE

Demand Estimation Sub Committee. Seasonal Normal Review 2020: Review of CWV Formula

Renewable Energy in The Netherlands September 2015

Use of Historical Weather Data in Planning a Diverse Water Supply System in an Uncertain Climate Future

City of Ames Electric. City of Ames Electric. Smart Energy. Services. Donald Kom Director

Working with landlords and tenants to tackle fuel poverty: aka improving energy efficiency

Energy Efficiency Impact Study

Driving Efficiency with Interval Meter/Smart Meter Data. Mukesh Khattar, Energy Director, Oracle

Smart water in urban distribution networks: limited financial capacity and Big Data analytics

Aalborg Universitet. CLIMA proceedings of the 12th REHVA World Congress Heiselberg, Per Kvols. Publication date: 2016

Mass Market Demand Response and Variable Generation Integration Issues: A Scoping Study

Matching Demand with Supply in the Smart Grid using Agent-Based Multiunit Auction

Transcription:

Consumer Segmentation and Knowledge Extraction from Smart Meter and Survey Data Tri Kurniawan Wijaya 1*, Tanuja Ganu 2, Dipanjan Chakraborty 2, Karl Aberer 1, Deva P. Seetharam 2 1) EPFL, Switzerland & 2) IBM Research, India *) The work is done during the author s internship at IBM Research, India supported by EU FP7 WATTALYST

Smart meters measure energy consumption at homes communicate the measurements to utility companies 0 6 12 18 Hour 2

Smart meters (2) angels demons demand response targeted marketing privacy breach theft detection 3

Challenges 1 Versatile consumer segmentation framework 2 Determine behavioral change over time 3 Identify clusters characteristics 4

1 Consumer segmentation past specific challenges specific applications adhoc near future general versatile framework quick analysis context-aware decision support 5

Our Framework Unsupervised Learning Target Sensors: smart meters, holidays, seasons, special events, Data Selection Configuration Selection Features Generator statistical functions: mean, median, standard deviation, IQR, Context Filtering Temporal Aggregation hourly, daily, weekly, monthly, Context Sensors: weather, temperature, 5 Customized algorithm choose algorithm, #clusters (or auto) 4 Customized features mean, std dev, IQR, median 3 Customized context summer, winter, weekend, January, February, temp > 2 Customized temporal aggregation hourly, every 3 hours, daily, weekly, or monthly 1 Customized data selection period of time, subset of customers, time of day See the complete formalization in our paper

2 Cluster consistency Given all of these clusters, what do we want to know? Does this consumer change her cluster? note that: clusters are label-invariant Individual to cluster consistency: i2c x, C 1, C 2 = C 1(x) C 2 (x) + X\C 1 x (X\C 2 (x)) 1 X 1 consistent = #(friends friends) + #(non friends non friends) customers inconsistent the lower the value, the more likely x changes her cluster 7

Clustering consistency index How far does this consumer change? distance rank dr x, C x = x dist x, C(x) < dist x, C(x), x εc(x) C(x) low high = proportion of fellow cluster members who are farther from the centroid the higher the value, the more confidence we are that x belong to C(x) allows us to be (cluster) size-invariant 8

Clustering consistency index Does the cluster configuration changes? For example, over time? Cluster to cluster consistency: ccc C 1, C 2, X = 1 X i2c(x, C 1, C 2 ) x εx consistent the lower the value, the higher the difference between C 1 and C 2 inconsistent 9

Cluster to cluster consistency the higher, the more similar See comparison with 6 months ago: There are not so much difference between autumn and spring. But, there are a lot of difference between summer and winter. Next slide, more on Jan vs Jul 10

Individual to cluster consistency A B A B Jan Jul In Jan, A and B are in the low consumption cluster i2c( A, Jan, Jul ) = low (changes her cluster) dr(a, Jul) = high i2c( B, Jan, Jul ) = high (stays in the low consumption cluster) Devise a personalized energy (saving) feedback for A! While her friends reduce their consumption in Jul (summer), A did not! 11

3 Knowledge extraction What are the characteristics that define a cluster? get insight from the survey data (consumer characteristics) How discriminative is (q,a) to cluster c? DI c q, a = # c q, a # c (q, a) max {# c q, a, # c (q, a)} DI > 0, discriminative positive DI < 0, discriminative negative 12

Clusters characteristics Clusters based on absolute consumption Cluster Question Answer DI family type single 0.86 low floor area (sq ft.) 805-1073 0.86 #bedrooms 2 0.85 electric shower (-) 20 mins -0.76 medium family type (-) single -0.61 floor area (sq ft.) 2300-2750 0.56 #children 4 0.93 high family type (-) single -0.90 floor area (sq ft.) (-) 1200-0.87 Clusters based on consumption variability Cluster Question Answer DI water pump (-) 1-2hrs -0.88 low family type single 0.80 washing machine (-) 2-3 loads -0.76 electric shower 10-20 mins 0.59 medium family type (-) single -0.55 #children (-) 3-0.54 tumble dryer 2 to 3 loads 0.90 high #children 4 0.88 floor area (sq ft.) 2800 0.79 13

Floor area vs year built Floor area CDFs are clearly distinguishable The year the houses were built CDFs are coincides to each other 14

Appliance ownership for DI 0.60 Clusters based on absolute consumption # Appliance Cluster Ownership DI 1 dishwasher high (-) no -0.76 2 games consoles low (-) yes -0.70 3 tumble dryer low no 0.68 4 dishwasher low no 0.67 5 games consoles high yes 0.61 Clusters based on consumption variability # Appliance Cluster Ownership DI 1 dishwasher high (-) no -0.72 2 tumble dryer high (-) no -0.72 3 tumble dryer low no 0.71 4 games consoles low (-) yes -0.69 5 dishwasher low no 0.67 6 games consoles high yes 0.60 15

Games consoles Fraction of households which own games consoles Since family type is highly discriminative for consumer energy consumption behavior, this correlation might explain why games consoles ownership is also highly discriminative. 16

Conclusion Versatile, context-aware consumer segmentation framework temporal aggregation, context filtering, feature generation Cluster consistency index Which consumers change their clusters? How far? track clusters changes over time Discriminative index Clusters : unsupervised learning; It is imperative to understand what they are made of, extract the main characteristics which define the clusters. 17

end of presentation 18

Cluster to cluster consistency can be used to find out the effect of temporal aggregations on the consumer segmentation results Consumer segmentation based on Absolute consumption Consumption variability Consumption trends (or shape ) Do temporal aggregation granularities matter? the higher, the more similar 19

20

Consumer Segmentation and Knowledge Extraction from Smart Meter and Survey Data Tri Kurniawan Wijaya 1, Tanuja Ganu 2, Dipanjan Chakraborty 2, Karl Aberer 1, Deva P. Seetharam 2 1 EPFL, Switzerland & 2 IBM Research, India supported by EU FP7 WATTALYST

Worldwide deployment 22

Automatic cluster configuration selection From a set of cluster configuration Rank all configurations using the Silhouette, Dunn, and Davies-Bouldin indices Majority voting using the three (ranked) lists using the 1 st rank from each list if the majority is not found, continue to the 2 nd (3 rd, 4 th ) until the majority is found or the lists are exhausted 23

Jan Jul All year long trends absolute variability 24

25

Numerical/ordinal questions special treatment introducing splitting points: how many? where to put? solution: sort answers ascending combinatorial problem create ranges from n-gram of answers - how many? - approximate floor area? 26