A scalable Approach to Sizeindependent
|
|
- Elfreda Turner
- 6 years ago
- Views:
Transcription
1 Research Dublin Dept. of Computer Science Rutgers A scalable Approach to Sizeindependent Network Similarity Michele Berlingerio Tina Eliassi- Rad Danai Koutra Christos Faloutsos WIN, September 28 th - 29 th 2012, NYU Stern School of Business
2 Why network similarity? (1) 2 x- fer learning 1 Graph Database: clustering Danai Koutra (CMU) - danai@cs.cmu.edu 2
3 Why network similarity? (2) 3 Anomaly detec:on: different models? 4 Discon:nuity Detec:on Day 1 Day 2 Day 3 Day 4 Day 5 Danai Koutra (CMU) - danai@cs.cmu.edu 3
4 RoadMap Problem Defini:on NetSimile Experiments Applica<ons Conclusions Danai Koutra (CMU) - danai@cs.cmu.edu 4
5 Network Similarity: DeHinition INPUT: 2 anonymized networks GIVEN: node IDs NOT GIVEN: side- info class labels 1 2 OUTPUT: structural similarity score Danai Koutra (CMU) - danai@cs.cmu.edu 5
6 Network Similarity: Extension INPUT: set of anonymized networks 2 1 k OUTPUT: pairwise structural similarity scores and.8 s 12 = 0.1 s 13 = 0 s 5k = 1 Danai Koutra (CMU) - danai@cs.cmu.edu 6
7 Required Properties P1. effec:veness size- independence intui<veness interpretability P2. scalability Danai Koutra (CMU) - danai@cs.cmu.edu 7
8 RoadMap Problem Defini<on NetSimile Experiments Applica<ons Conclusions Danai Koutra (CMU) - danai@cs.cmu.edu 8
9 NetSimile: overview E ego, N ego 1 Feature Extrac:on 2 Feature Aggrega:on 1 n Ε[X 2 ] Ε[X] Comparison similarity metric Danai Koutra (CMU) - danai@cs.cmu.edu 9
10 Step 1: Feature Extraction egonet Local and egonet features: # of neighbors Why these features? clustering coefficient They satisfy all the constraints! avg. # of neighbors neighbors avg. clustering coeff. of neighbors edges in egonet outgoing edges from egonet # of neighbors of egonet F G = Danai Koutra (CMU) - danai@cs.cmu.edu 10 n o d e s n features
11 Step 2: Feature Aggregation 5 aggregators median n mean Why these aggregators? o d. standard devia<on Moments e They satisfy the s. skewness of feature effectiveness + scalability n distribu<ons constraints! kurtosis median mean 1 2. s.d. features skewness 1 n, median, kurtosis single signature vector per network Danai Koutra (CMU) - danai@cs.cmu.edu 11
12 Step 3: Comparison Networks 1 3 k... Signature Vectors (aggr. features) Similarity Scores s12 s13 G1 Why Canberrasdistance? 2 sg2 d Pi Qi s1k canberra( P, Q ) = i=1 s23 Canberra Pi + Qi distance tried: 0 sg3 Also near s2k ① sensitive to small changes - cosine similarity. ② normalizes the absolute- difference euclidean distance... - hypothesis tesdng:. of the individual comparisons. Mann- Whitney,. Kolmogorov- Smirnov sgk sk-1,k Danai Koutra (CMU) - danai@cs.cmu.edu 12
13 Required Properties: NetSimile P1. effec:veness size- independence intui<veness interpretability avoids the node- correspondence problem P2. scalability LEMMA The run<me complexity for genera<ng NetSimile s signature vectors is linear on the number of edges in the input networks: k # nodes O( f n j + f n j log(n j )) j=1 Danai Koutra (CMU) - danai@cs.cmu.edu 13
14 RoadMap Problem Defini<on NetSimile Experiments Applica<ons Conclusions Danai Koutra (CMU) - danai@cs.cmu.edu 14
15 Experiments: Data 30 real- world networks dblp mul<ple synthe<c networks Barabási- Albert Forest Fire Erdös- Rényi Wa[s- Strogatz querylog Oregon AS Danai Koutra (CMU) - danai@cs.cmu.edu 15
16 Experiments: intuitiveness + interpretability of NetSimile homogeneity in colors Observa:on: NetSimile gives be[er and more intui<ve graph clusters than the EIG method (eval- based compe<tor method). Danai Koutra (CMU) - danai@cs.cmu.edu 16
17 Experiments: NetSimile and node-overlap bigger overlap smaller distance Hypothesis: bigger node overlap => greater similarity Implicit Assump<on: networks are from the same domain Observa:on: The lower the NetSimile score (greater similarity), the higher the normalized node intersec<on of the input networks. Danai Koutra (CMU) - danai@cs.cmu.edu 17
18 Application: Discontinuity Detection in Yahoo! IM 1. Microso^ offers to buy Yahoo!. 2. New features for flickr were announced. nodes: IM users edges: communica<on events Danai Koutra (CMU) - danai@cs.cmu.edu 18
19 RoadMap Problem Defini<on NetSimile Experiments Applica<ons Conclusions Danai Koutra (CMU) - danai@cs.cmu.edu 19
20 Conclusions Novel approach: signature vector for each graph (summariza<on) NetSimile: effec<ve size- independent, intui<ve, interpretable scalable Applicability to a variety of problems Danai Koutra (CMU) - danai@cs.cmu.edu 20
21 Thank you! hbp:// Danai Koutra (CMU) - danai@cs.cmu.edu 21
22 Experiments (2): Are we measuring size? Observa:on: NetSimile is not measuring size there is no correla<on between extracted features and network size. Danai Koutra (CMU) - danai@cs.cmu.edu 22
23 Advantages of NetSimile size- invariant scalable LEMMA The run<me complexity for genera<ng NetSimile s signature vectors is linear on the number of edges in the input networks: k # nodes O( f n j + f n j log(n j )) j=1 avoids the node- correspondence problem Danai Koutra (CMU) - danai@cs.cmu.edu 23
Final Project - Social and Information Network Analysis
Final Project - Social and Information Network Analysis Factors and Variables Affecting Social Media Reviews I. Introduction Humberto Moreira Rajesh Balwani Subramanyan V Dronamraju Dec 11, 2011 Problem
More informationSegmenting Customer Bases in Personalization Applications Using Direct Grouping and Micro-Targeting Approaches
Segmenting Customer Bases in Personalization Applications Using Direct Grouping and Micro-Targeting Approaches Alexander Tuzhilin Stern School of Business New York University (joint work with Tianyi Jiang)
More informationFinal Project Report CS224W Fall 2015 Afshin Babveyh Sadegh Ebrahimi
Final Project Report CS224W Fall 2015 Afshin Babveyh Sadegh Ebrahimi Introduction Bitcoin is a form of crypto currency introduced by Satoshi Nakamoto in 2009. Even though it only received interest from
More informationSecond Gonality of Erdős-Rényi Random Graphs
Second Gonality of Erdős-Rényi Random Graphs Andy Xu and Wendy Wu Mentor: Guangyi Yue May 2, 27 PRIMES Conference Andy Xu and Wendy Wu Mentor: Guangyi Yue Second Gonality of Erdős-Rényi Random Graphs /23
More informationSegmentation and Targeting
Segmentation and Targeting Outline The segmentation-targeting-positioning (STP) framework Segmentation The concept of market segmentation Managing the segmentation process Deriving market segments and
More informationBeer Hipsters: Exploring User Mindsets in Online Beer Reviews
Beer Hipsters: Exploring User Mindsets in Online Beer Reviews Group 27: Yuze Dan Huang, Aaron Lewis, Garrett Schlesinger December 10, 2012 1 Introduction 1.1 Motivation Online product rating systems are
More informationPredicting ratings of peer-generated content with personalized metrics
Predicting ratings of peer-generated content with personalized metrics Project report Tyler Casey tyler.casey09@gmail.com Marius Lazer mlazer@stanford.edu [Group #40] Ashish Mathew amathew9@stanford.edu
More informationGene Expression Data Analysis
Gene Expression Data Analysis Bing Zhang Department of Biomedical Informatics Vanderbilt University bing.zhang@vanderbilt.edu BMIF 310, Fall 2009 Gene expression technologies (summary) Hybridization-based
More informationThe Impact of Rumor Transmission on Product Pricing in BBV Weighted Networks
Management Science and Engineering Vol. 11, No. 3, 2017, pp. 55-62 DOI:10.3968/9952 ISSN 1913-0341 [Print] ISSN 1913-035X [Online] www.cscanada.net www.cscanada.org The Impact of Rumor Transmission on
More informationCMU SCS Mining Large Social Networks: Patterns and Anomalies
Mining Large Social Networks: Patterns and Anomalies Christos Faloutsos CMU Thank you The Department of Informatics Happy 20-th! Prof. Yannis Manolopoulos Prof. Kostas Tsichlas Mrs. Nina Daltsidou C. Faloutsos
More informationAir Transportation System Architecture Analysis
Air Transportation System Architecture Analysis Project Final Presentation Advanced System Architecture Spring 2006 May 9 th, 2006 Presentation by: Philippe Bonnefoy Roland Weibel Instructors: Chris Magee,
More informationCREDIT RISK MODELLING Using SAS
Basic Modelling Concepts Advance Credit Risk Model Development Scorecard Model Development Credit Risk Regulatory Guidelines 70 HOURS Practical Learning Live Online Classroom Weekends DexLab Certified
More informationSegmentation and Targeting
Segmentation and Targeting Outline The segmentation-targeting-positioning (STP) framework Segmentation The concept of market segmentation Managing the segmentation process Deriving market segments and
More informationCase studies in Data Mining & Knowledge Discovery
Case studies in Data Mining & Knowledge Discovery Knowledge Discovery is a process Data Mining is just a step of a (potentially) complex sequence of tasks KDD Process Data Mining & Knowledge Discovery
More informationLa convergence du stockage et d'hadoop pour l'entreprise Xavier Guerin VP Southern Europe & Benelux, MapR Technologies
La convergence du stockage et d'hadoop pour l'entreprise Xavier Guerin VP Southern Europe & Benelux, MapR Technologies MapR Technologies - Confiden6al 1 The use of big data will become a key basis of compe77on
More informationCMU SCS Mining Billion-node Graphs: Patterns and Tools
Mining Billion-node Graphs: Patterns and Tools Christos Faloutsos CMU Thank you! Monica Rogati C. Faloutsos (CMU) 2 Our goal: Open source system for mining huge graphs: PEGASUS project (PEta GrAph mining
More informationIBM Research Report. Toward Finding Valuable Topics
RC24975 (W4-42) April 3, 2 Computer Science IBM Research Report Toward Finding Valuable Topics Zhen Wen, Ching-Yung Lin IBM Research Division Thomas J. Watson Research Center P.O. Box 74 Yorktown Heights,
More informationCMU SCS Mining Billion-Node Graphs: Patterns and influence propagation
Mining Billion-Node Graphs: Patterns and influence propagation Christos Faloutsos CMU Thank you! Sinan Aral Foster Provost Arun Sundararajan Shirley Lau C. Faloutsos (CMU) 2 Our goal: Open source system
More informationCMU SCS Large Graph Mining
Large Graph Mining Christos Faloutsos CMU Thank you! Ed Kao Lori Tsoulas Joan Meehan-Dion C. Faloutsos (CMU) 2 Our goal: Open source system for mining huge graphs: PEGASUS project (PEta GrAph mining System)
More informationProtec'ng Privacy in the Archives: Preliminary Explora'ons of Topic Modeling for Born- Digital Collec'ons
Protec'ng Privacy in the Archives: Preliminary Explora'ons of Topic Modeling for Born- Digital Collec'ons Tim Hutchinson 13 December 2017 IEEE 2017: 2 nd CAS Workshop Guiding ques'ons/context! High volume
More information4/30/15. Agenda. Product Development: A Strategic Growth Plan. DATIA June 4, 2015
DATIA June 4, 2015 Product Development: A Strategic Growth Plan Agenda Defini1on of Product Development Steps of Product Design: Design Build Market Review Product Bundling & Ver1cal Strategy Applica1on
More informationUncovering the Small Community Structure in Large Networks: A Local Spectral Approach
Uncovering the Small Community Structure in Large Networks: A Local Spectral Approach Yixuan Li 1, Kun He 2, David Bindel 1 and John E. Hopcroft 1 1 Cornell University, USA 2 Huazhong University of Science
More informationSTATISTICS PART Instructor: Dr. Samir Safi Name:
STATISTICS PART Instructor: Dr. Samir Safi Name: ID Number: Question #1: (20 Points) For each of the situations described below, state the sample(s) type the statistical technique that you believe is the
More informationAn Architecture for Collec/ng Longitudinal Social Data. Jeremy Blackburn, Adriana Iamnitchi University of South Florida
An Architecture for Collec/ng Longitudinal Social Data Jeremy Blackburn, Adriana Iamnitchi University of South Florida Our Experience: Mining the Steam Steam Community: Community OSN Large online social
More informationLinked Big Data Graph Database and Analytics
E6893 Big Data Analytics Lecture 6: Linked Big Data Graph Database and Analytics Ching-Yung Lin, Ph.D. Adjunct Professor, Dept. of Electrical Engineering and Computer Science October 11, 2018 E6893 Big
More informationCo-evolution of Social and Affiliation Networks
Co-evolution of Social and Affiliation Networks Elena Zheleva Dept. of Computer Science University of Maryland College Park, MD 20742, USA elena@cs.umd.edu Hossam Sharara Dept. of Computer Science University
More informationEyal Carmi. Google, 76 Ninth Avenue, New York, NY U.S.A. Gal Oestreicher-Singer and Uriel Stettner
RESEARCH NOTE IS OPRAH CONTAGIOUS? THE DEPTH OF DIFFUSION OF DEMAND SHOCKS IN A PRODUCT NETWORK Eyal Carmi Google, 76 Ninth Avenue, New York, NY 10011 U.S.A. {eyal.carmi@gmail.com} Gal Oestreicher-Singer
More informationW2- Lecture 1: Tes/ng whether two mean values are significantly different:
W- Lecture 1: Tes/ng whether two mean values are significantly different: If the variable is normally distributed about its mean, we can use the formula below to test whether the difference between the
More informationExperience with Adap0ng a WS BPEL Run0me for escience Workflows
Experience with Adap0ng a WS BPEL Run0me for escience Workflows Thilina Gunarathne, Chathura Herath, Eran Chinthaka, Suresh Marru Pervasive Technology Ins0tute Indiana University Introduc0on Scien0st communi0es
More informationSOCIAL MEDIA MINING. Behavior Analytics
SOCIAL MEDIA MINING Behavior Analytics Dear instructors/users of these slides: Please feel free to include these slides in your own material, or modify them as you see fit. If you decide to incorporate
More informationDELI: An Interactive New Product Development and Targeting Tool
DELI: An Interactive New Product Development and Targeting Tool Martin Natter and Andreas Mild Wirtschaftsuniversität Wien Pappenheimgasse 35/3/5 A-1200 Vienna, Austria {martin.natter, andreas.mild}@wu-wien.ac.at
More informationStrength in numbers? Modelling the impact of businesses on each other
Strength in numbers? Modelling the impact of businesses on each other Amir Abbas Sadeghian amirabs@stanford.edu Hakan Inan inanh@stanford.edu Andres Nötzli noetzli@stanford.edu. INTRODUCTION In many cities,
More informationSelecting the best statistical distribution using multiple criteria
Selecting the best statistical distribution using multiple criteria (Final version was accepted for publication by Computers and Industrial Engineering, 2007). ABSTRACT When selecting a statistical distribution
More informationUser Guide. MAGNET : MicroArray & RNAseq Gene expression Network Evalua=on Toolkit. Page 1
User Guide MAGNET : MicroArray & RNAseq Gene expression Network Evalua=on Toolkit Page 1 Case Western Reserve University February 2012 Page 2 Page 3 1 - Introduction This sec=on will introduce MAGNET:
More informationMining the Social Web. Eric Wete June 13, 2017
Mining the Social Web Eric Wete ericwete@gmail.com June 13, 2017 Outline The big picture Features and methods (Political Polarization on Twitter) Summary (Political Polarization on Twitter) Features on
More informationMining Heterogeneous Urban Data at Multiple Granularity Layers
Mining Heterogeneous Urban Data at Multiple Granularity Layers Antonio Attanasio Supervisor: Prof. Silvia Chiusano Co-supervisor: Prof. Tania Cerquitelli collection Urban data analytics Added value urban
More informationJMP Clinical 4.0 adds Ways to Explore Clinical Trials Data Visually
JMP Clinical 4.0 adds Ways to Explore Clinical Trials Data Visually Dr. Valerie Nedbal JMP Pharmaceu4cal Technical Manager SAS Ins4tute Copyright 2010 SAS Institute Inc. All rights reserved. JMP Clinical
More informationPredicting user rating for Yelp businesses leveraging user similarity
Predicting user rating for Yelp businesses leveraging user similarity Kritika Singh kritika@eng.ucsd.edu Abstract Users visit a Yelp business, such as a restaurant, based on its overall rating and often
More informationChallenges for making scalable security management for informa5on and communica5on infrastructure
Challenges for making scalable security management for informa5on and communica5on infrastructure Prof. Dr. Suguru Yamaguchi Graduate School of Informa5on Science, Nara Ins5tute of Science and Technology,
More informationAdd Sophisticated Analytics to Your Repertoire with Data Mining, Advanced Analytics and R
Add Sophisticated Analytics to Your Repertoire with Data Mining, Advanced Analytics and R Why Advanced Analytics Companies that inject big data and analytics into their operations show productivity rates
More informationModel- Based System Development for Managing the Evolu;on of a Common Submarine Combat System
Model- Based System Development for Managing the Evolu;on of a Common Submarine Combat System 18 May 2010 Steven W. Mitchell Lockheed Martin Fellow LM Certified Systems Architect Copyright Lockheed Mar3n,
More informationCase studies in Data Mining & Knowledge Discovery
Case studies in Data Mining & Knowledge Discovery Knowledge Discovery is a process Data Mining is just a step of a (potentially) complex sequence of tasks KDD Process Data Mining & Knowledge Discovery
More informationADAPTIVITY OF COMPLEX NETWORK TOPOLOGIES FOR DESIGNING RESILIENT SUPPLY CHAIN NETWORKS
International Journal of Industrial Engineering, 22(1), 102-116, 2015 ADAPTIVITY OF COMPLEX NETWORK TOPOLOGIES FOR DESIGNING RESILIENT SUPPLY CHAIN NETWORKS Sonia Irshad Mari 1, Young Hae Lee 1,*, Muhammad
More informationSmart Analytics for Big Time-series Data
Smart Analytics for Big Time-series Data Yasushi Sakurai (niversity) Yasuko Matsubara (niversity) Christos Faloutsos (Carnegie Mellon University) 2017 Sakurai, Matsubara & Faloutsos 1 Roadmap - Motivation
More informationMeasuring Coverage and Divergence of Reading Behaviors Among Friends
Measuring Coverage and Divergence of Reading Behaviors Among Friends Long T. Le Rutgers University longtle@cs.rutgers.edu Tina Eliassi-Rad Rutgers University eliassi@cs.rutgers.edu ABSTRACT Given data
More informationAnnexe 1 : statistiques descriptives
Annexe 1 : statistiques descriptives The MEANS Procedure age 532 36.8533835 11.7263657 18.0000000 64.0000000 lnwage 532 2.0597883 0.5156435 0.5596000 3.2692000 ED 532 13.0187970 2.6195743 2.0000000 18.0000000
More informationMicrofinance network analysis and optimal seeding prediction Group 53 Alan Newman (awnewman), Sam McLaughlin (thesam), V. Govind Manian (vgmanian)
1 Microfinance network analysis and optimal seeding prediction Group 53 Alan Newman (awnewman), Sam McLaughlin (thesam), V. Govind Manian (vgmanian) Acknowledgements: We thank the TAs for their advice
More informationDissecting the Human Protein-Protein Interaction Network via Phylogenetic Decomposition
Dissecting the Human Protein-Protein Interaction Network via Phylogenetic Decomposition Cho-Yi Chen 1, Andy Ho 2, Hsin-Yuan Huang 3, Hsueh-Fen Juan 1,2,*, Hsuan-Cheng Huang 4,* 1 Genome and Systems Biology
More informationCo-evolution of strategies and update rules in the prisoner s dilemma game on complex networks
Co-evolution of strategies and update rules in the prisoner s dilemma game on complex networks Alessio Cardillo Department of Physics of Condensed Matter University of Zaragoza and Institute for Biocomputation
More informationDownstream analysis of transcriptomic data
Downstream analysis of transcriptomic data Shamith Samarajiwa CRUK Bioinforma3cs Summer School July 2015 General Methods Dimensionality reduc3on methods (clustering, PCA, MDS) Visualizing PaKerns (heatmaps,
More informationHomophily and Influence in Social Networks
Homophily and Influence in Social Networks Nicola Barbieri nicolabarbieri1@gmail.com References: Maximizing the Spread of Influence through a Social Network, Kempe et Al 2003 Influence and Correlation
More informationSupplementary Figures
Supplementary Figures Supplementary Fig. S1 - Nationwide contributions of the most abundant genera. The figure shows log 10 of the relative percentage of genera, forming 80% of total abundance. (Russian
More informationMYGO PRO vs LIGHTCYCLER NANO
PERFORMANCE MADE EASY MYGO PRO vs LIGHTCYCLER NANO YOUR COMPLETE GUIDE TO UPGRADING CONTENTS This guide describes benefits provided by upgrading your LightCycler Nano to a MyGo Pro. 1. BENEFITS MULTIPLEXING
More informationANALYSING QUANTITATIVE DATA
9 ANALYSING QUANTITATIVE DATA Although, of course, there are other software packages that can be used for quantitative data analysis, including Microsoft Excel, SPSS is perhaps the one most commonly subscribed
More informationBest Practices for Delivering BPM Projects. Jörg Grote BPM Solution Architect. Copyright TIBCO Software Inc.
Best Practices for Delivering BPM Projects Jörg Grote BPM Solution Architect Copyright 2000-2014 TIBCO Software Inc. Agenda Topics covered BPM what is it really? Rules for success? Process Centric Development
More informationStandard Formula, Internal Models and Scope of Validation. Example IM Calibrations Interest Rates Equities. Self-sufficiency
How to Calibrate Risk Distributions Andrew D Smith (andrewdsmith8@deloitte.co.uk) 2 December 2011 2010 The Actuarial Profession www.actuaries.org.uk Agenda Standard Formula, Internal Models and Scope of
More informationAnalytics for Banks. September 19, 2017
Analytics for Banks September 19, 2017 Outline About AlgoAnalytics Problems we can solve for banks Our experience Technology Page 2 About AlgoAnalytics Analytics Consultancy Work at the intersection of
More informationWE consider the general ranking problem, where a computer
5140 IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 54, NO. 11, NOVEMBER 2008 Statistical Analysis of Bayes Optimal Subset Ranking David Cossock and Tong Zhang Abstract The ranking problem has become increasingly
More informationPropagation of Innovations in Networked Groups. Winter Mason. Indiana University. Andy Jones. University of North Carolina. Robert L.
Propagation of Innovations 1 Running head: PROPAGATION OF INNOVATIONS IN NETWORKS Propagation of Innovations in Networked Groups Winter Mason Indiana University Andy Jones University of North Carolina
More informationAutomating Operational Decisions in Real-time. Chris Sanden Senior Analytics Engineer
Automating Operational Decisions in Real-time Chris Sanden Senior Analytics Engineer About Me. Senior Analytics Engineer at Netflix Working on Real-time Analytics Part of the Insight Engineering Team Tenure
More informationAn Executive s Guide to the Evolution of Fraud Detection
White Paper: Evolution of Fraud Detection An Executive s Guide to the Evolution of Fraud Detection With attacks coming from every angle, fraud detection is critical in today s world. Companies employ an
More informationMulti-Resource Packing for Cluster Schedulers. CS6453: Johan Björck
Multi-Resource Packing for Cluster Schedulers CS6453: Johan Björck The problem Tasks in modern cluster environments have a diverse set of resource requirements CPU, memory, disk, network... The problem
More informationNetworked Life (CSE 112)
Networked Life (CSE 112) Prof. Michael Kearns Final Examination May 3, 2006 The final exam is closed-book; you should have no materials present other than the exam and a pen or pencil. NAME: PENN ID: Exam
More informationDecision Support Systems
Decision Support Systems 2011/2012 Week 7. Lecture 13 Previous Class FP- tree structure: Complete and compact DB summariza8on Built on decreasing frequency: more frequent items in common branches FP- growth:
More informationSource-code quality. Part 2. Software Visualizations. Andy Kellens. Friday 3 May 13
Source-code quality Part 2. Software Visualizations Andy Kellens If I could say it in words, there would be no reason to paint -- Edward Hopper 2 In this lecture... What is (software) visualization? What
More informationTowards an understanding of Decision Complexity in IT Configuration
Towards an understanding of Decision Complexity in IT Configuration Bin Lin EECS, Northwestern University binlin365@gmail.com Aaron Brown and Joseph L. Hellerstein IBM T. J. Watson Research Center Take-away
More informationCS 6604: Data Mining Large Networks and Time- series. B. Aditya Prakash Lecture #1: Introduc/on
CS 6604: Data Mining Large Networks and Time- series B. Aditya Prakash Lecture #1: Introduc/on Networks are everywhere! Facebook Network [2010] Gene Regulatory Network [Decourty 2008] Human Disease Network
More informationNetflix Optimization: A Confluence of Metrics, Algorithms, and Experimentation. CIKM 2013, UEO Workshop Caitlin Smallwood
Netflix Optimization: A Confluence of Metrics, Algorithms, and Experimentation CIKM 2013, UEO Workshop Caitlin Smallwood 1 Allegheny Monongahela Ohio River 2 TV & Movie Enjoyment Made Easy Stream any video
More informationPREDICTION OF SOCIAL CAPITAL BASED ON THE EMOTIONAL INTELLIGENCE IN EDUCATION MINISTRY
PREDICTION OF SOCIAL CAPITAL BASED ON THE EMOTIONAL INTELLIGENCE IN EDUCATION MINISTRY *Fattah Nazem andsusan Gheytasi Department of Education, College of Education and Counselling, Roudehen Branch, Islamic
More informationFailure Predic,on of Jobs in Compute Clouds: A Google Cluster Case Study
Failure Predic,on of Jobs in Compute Clouds: A Google Cluster Case Study Xin Chen, and Karthik Pa0abiraman University of Bri:sh Columbia (UBC) Charng- da Lu, Unaffiliated Compute Clouds } Infrastructure
More informationLogistics. Final exam date. Project Presentation. Plan for this week. Evolutionary Algorithms. Crossover and Mutation
Logistics Crossover and Mutation Assignments Checkpoint -- Problem Graded -- comments on mycourses Checkpoint --Framework Mostly all graded -- comments on mycourses Checkpoint -- Genotype / Phenotype Due
More informationBuilding the In-Demand Skills for Analytics and Data Science Course Outline
Day 1 Module 1 - Predictive Analytics Concepts What and Why of Predictive Analytics o Predictive Analytics Defined o Business Value of Predictive Analytics The Foundation for Predictive Analytics o Statistical
More informationSPM 8.2. Salford Predictive Modeler
SPM 8.2 Salford Predictive Modeler SPM 8.2 The SPM Salford Predictive Modeler software suite is a highly accurate and ultra-fast platform for developing predictive, descriptive, and analytical models from
More informationDynamic Learning, Herding and Guru Effects in Networks
1 Dynamic Learning, Herding and Guru Effects in Networks Sheri Markose* #** Amadeo Alentorn # Andreas Krause # * Economics Department, University of Essex, Wivenhoe Park, Colchester CO4 3SQ, Great Britain
More informationIntroduction C H A P T E R 1
C H A P T E R 1 Introduction 1 A social network is a social structure made up of a set of actors (such as individuals or organizations) and a complex set of the dyadic ties between these actors. Social
More informationTable. XTMIXED Procedure in STATA with Output Systolic Blood Pressure, use "k:mydirectory,
Table XTMIXED Procedure in STATA with Output Systolic Blood Pressure, 2001. use "k:mydirectory,. xtmixed sbp nage20 nage30 nage40 nage50 nage70 nage80 nage90 winter male dept2 edu_bachelor median_household_income
More informationInfluencer Communities. Influencer Communities. Influencers are having many different conversations
Influencer Communities Influencers are having many different conversations 1 1.0 Background A unique feature of social networks is that people with common interests are following (or friend-ing) similar
More informationPrincipal Survey Results. April 2009 Survey Administra8on Performance Management Ini8a8ve May 18, 2009
Principal Survey Results April 2009 Survey Administra8on Performance Management Ini8a8ve May 18, 2009 Survey Logis8cs Twice annual survey of DPS principals to assess performance of opera8onal departments
More informationInfluence of First Steps in a Community on Ego-Network: Growth, Diversity, and Engagement
Influence of First Steps in a Community on Ego-Network: Growth, Diversity, and Engagement Atef Chaudhury University of Waterloo Waterloo, ON N2L 3G1, Canada a29chaud@uwaterloo.ca Myunghwan Kim LinkedIn
More informationKnowledge-Guided Analysis with KnowEnG Lab
Han Sinha Song Weinshilboum Knowledge-Guided Analysis with KnowEnG Lab KnowEnG Center Powerpoint by Charles Blatti Knowledge-Guided Analysis KnowEnG Center 2017 1 Exercise In this exercise we will be doing
More informationFRAUD ANALYTICS AND DATA WAREHOUSING
FRAUD ANALYTICS AND DATA WAREHOUSING Office of the Comptroller State of Maryland Andrew Schaufele; Director, Bureau of Revenue Estimates Agenda Our Team Background Case with TBD Success Successful Case
More informationRevealing Hidden Connections in Recommendation Networks
Revealing Hidden Connections in Recommendation Networks Rogério Minhano Universidade Federal do ABC - UFABC, Rua Santa Adélia., 66, Santo André, Brazil discover.rogerio@tokiomarine.com.br Stenio Fernandes
More information'.^s i. f-^ >'f' ' ^ J <X. "> ' ',rol-v^ »J?<)J:> 'V* n*^
' '.^s i f-^ >'f' ' ^ ^ V, ^> J
More informationCollusive Behavior in a Network Constrained Market
Collusive Behavior in a Network Constrained Market Karla Atkins Jiangzhuo Chen V.S. Anil Kumar Matthew Macauley Achla Marathe 1 Introduction Market power is the ability of a firm to raise the of a product
More informationSoftware Data Analytics. Nevena Lazarević
Software Data Analytics Nevena Lazarević 1 Selected Literature Perspectives on Data Science for Software Engineering, 1st Edition, Tim Menzies, Laurie Williams, Thomas Zimmermann The Art and Science of
More informationA stochastic approach to multi channel attribution in online advertising
A stochastic approach to multi channel attribution in online advertising Authors Venkatesh Sreedharan (venkateshw@freshworks.com Swaminathan Padmanabhan (swami@freshworks.com Machine Learning @Freshworks
More informationadvanced analysis of gene expression microarray data aidong zhang World Scientific State University of New York at Buffalo, USA
advanced analysis of gene expression microarray data aidong zhang State University of New York at Buffalo, USA World Scientific NEW JERSEY LONDON SINGAPORE BEIJING SHANGHAI HONG KONG TAIPEI CHENNAI Contents
More informationValue over Velocity From Feature Building to Value Delivery
Value over Velocity From Feature Building to Value Delivery Ryan Shriver rshriver@ddig.com @ryanshriver 1 theagileengineer.com Leader of IT Performance Improvement Solu>ons for Dominion Digital in Richmond,
More informationJure Leskovec, Includes joint work with Jaewon Yang, Manuel Gomez Rodriguez, Andreas Krause, Lars Backstrom and Jon Kleinberg.
Jure Leskovec, Stanford University Includes joint work with Jaewon Yang, Manuel Gomez Rodriguez, Andreas Krause, Lars Backstrom and Jon Kleinberg. Information reaches us by personal influence in our social
More informationManagement Science Letters
Management Science Letters 4 (2014) 1003 1008 Contents lists available at GrowingScience Management Science Letters homepage: www.growingscience.com/msl Examining the effect of relationship marketing on
More informationLegislation as a complex network: Modelling and analysis of European Union legal sources
Legislation as a complex networ: Modelling and analysis of European Union legal sources Marios KONIARIS a,1, Ioannis ANAGNOSTOPOULOS b and Yannis VASSILIOU a a KDBSL Lab, School of ECE, Nat. Tech. Univ.
More informationModelling yield risk measures of major crop plants in Poland
Paper prepared for the 13 rd EAAE Seminar PRICE VOLATILITY AND FARM INCOME STABILISATION Modelling Outcomes and Assessing Market and Policy Based Responses Dublin, February 3-4, 01 Modelling yield risk
More informationDisentangling the Influence of Temperature and Antecedent Soil Moisture on Colorado River Water Resources
Disentangling the Influence of Temperature and Antecedent Soil Moisture on Colorado River Water Resources Project start and duraon: September 1, 2014, 24 months Funder: U.S. Department of the Interior,
More informationSPSS 14: quick guide
SPSS 14: quick guide Edition 2, November 2007 If you would like this document in an alternative format please ask staff for help. On request we can provide documents with a different size and style of
More informationList of Tables. 2.1 Fast Moving Consumer Goods (FMCG) Sector of India. 2.2 Fast Moving Consumer Goods (FMCG) Sector of India:
List of Tables Chapter 1 1.1 The Top 10 companies in FMCG sector 1.2 Pre and post liberalization scenario 1.3 FMCG Category and products Chapter 2 2.1 Fast Moving Consumer Goods (FMCG) Sector of India
More informationSupplementary materials
Supplementary materials Calculation of the growth rate for each gene In the growth rate dataset, each gene has many different growth rates under different conditions. The average growth rate for gene i
More informationCMU SCS Large Graph Mining Patterns, Tools and Cascade analysis
Large Graph Mining Patterns, Tools and Cascade analysis Christos Faloutsos CMU Thank you! Brian Gallagher Jan Winfield C. Faloutsos (CMU) 2 Roadmap Introduction Motivation Why big data Why (big) graphs?
More informationQute: Quality-of-Monitoring Aware Sensing and Routing Strategy in Wireless Sensor Networks S H A O J I E T A N G T E M P L E U N I V E R S I T Y
Qute: Quality-of-Monitoring Aware Sensing and Routing Strategy in Wireless Sensor Networks S H A O J I E T A N G J I E WU T E M P L E U N I V E R S I T Y Examples of WSNs Wireless Sensor Networks (WSNs)
More informationDesign Metrics and Visualization Techniques for Analyzing the Performance of MOEAs in DSE
Design Metrics and Visualization Techniques for Analyzing the Performance of MOEAs in DSE Toktam Taghavi, Andy D. Pimentel Computer Systems Architecture Group, Informatics Institute University of Amsterdam
More informationSales Forecast and Business Management Strategy - ADDM (Advanced Data Decomposition Model)
Delivering intelligent business solutions to World Class Retailers www.ibs-softsolutions.com Soft Solutions Ibs AMS Sales Forecast and Business Management Strategy - ADDM (Advanced Data Decomposition Model)
More information