Methodologies for Improved Tag Cloud Generation with Clusterin

Size: px
Start display at page:

Download "Methodologies for Improved Tag Cloud Generation with Clusterin"

Transcription

1 Methodologies for Improved Tag Cloud Generation with Clustering. Martin Leginus, Peter Dolog, Ricardo Lage, and Frederico Durao Department of Computer Science, Aalborg University July, 2012

2 Agenda Introduction Syntactical pre-clustering of tags Improving coverage and diversity of tag clouds with clustering Experiments

3 Introduction Tag clouds Problems Methodologies Experiments Conclusion Social collaborative tagging Tagging is a process when a user assigns a tag to an item Users collaboratively annotate items with tags It improves searching, discovering and categorizing of content Tagging information of users (implicit ratings) is utilized by tag-based recommenders

4 Tag clouds A tag cloud is a visual depiction of user-generated tags typically used to describe the content of web sites [Wikipedia, ] Often used as visual information retrieval interface.

5 Common use cases of tag clouds Content Browsing Navigation along the site content Serendipitous discovery

6 Problems The majority of tag clouds are constructed according to the tag popularity/frequency. These clouds suffer from: A cloud does not depict whole spectrum of tag space (missing rarely used tags which can be interesting for users) as only most frequent tags are considered. New tags (new content areas) in a system are hardly depicted in a tag cloud because of they low tag frequency. The most popular tags can be often useless for content discrimination or discovery. (see tags: night, autumn, cool..)

7 Problems Syntactically similar tags cause redudancy

8 Tag cloud s metrics Synthetic metrics express a quality of tags selection process. A coverage for a particular tag t expresses how many of considered documents were annotated with a tag t Coverage(t) = Dt D a, (1) Overlap of T c: Different tags in T c may be assigned with the same item in D Tc. The overlap metric captures the extent of such redundancy. Overlap(T c) = avg ti t j D ti D tj min{ D ti, D tj }, (2) We introduce a new metric chained coverage that captures how many documents are covered by a considered tag given that documents covered by previously selected tags are not considered. This metric combines coverage and overlap altogether Chained coverage(t T s) = Dt \ D T s, (3) D a

9 Syntactical Pre-clustering of Tags Tags in these systems can have the same semantical meaning however they are syntactically different i.e., typos, singular and plural forms and compounded tags. Levenhstein distance (measures the number of required changes: substitution, insertion and deletion of a character are allowed operations to transform one tag into another) is computed for each tag pair from the tag space. Tag space is divided into clusters The most frequent tag from each cluster is used for further computations

10 Improving coverage and diversity of clouds with clustering We explore 3 different clustering techniques, each obtained cluster expresses a laten topic in the tag space. The goal is to cover as many clusters as possible. 1 A tag space is clustered and divided into disjoint clusters. 2 Tags are selected proportionally from each cluster according to their coverage. 3 The tags selection algorithm maximizes a chained coverage metric. The maximization of the chained coverage promotes (specific) tags with the high coverage of not yet covered documents by previously selected tags. On the other hand frequent tags with low chained coverage (general meaning) are omitted.

11 Experiments The improvements of the methodologies are evaluated in terms of coverage and overlap of generated tag clouds. The evaluation is conducted on the following datasets: Bibsonomy contains 5794 distinct users, items and tags. The total number of tagging posts is Delicious dataset contains users, unique tags and bookmarks. The total number of tagging posts is

12 Syntactical pre-clustering of tags Table: Mean values of coverage and overlap for the baseline and syntactical pre-clustering methods on BibSonomy and Delicious datasets. Coverage Overlap Dataset Baseline Pre-Clustering Baseline Pre-Clustering BibSonomy Delicious Coverage had a 5% (5079 documents) increase on BiSonomy dataset and 3.5% (3072 documents) increase on Delicious. Overlap, on the other hand, had similar results.

13 Syntactical pre-clustering of tags As number of tags increases, coverage and overlap improves in a logarithmic fashion. Baseline Syntactical clustering Coverage Overlap Delicious Number of tags in the tag cloud Delicious Number of tags in the tag cloud Coverage Overlap Bibsonomy Number of tags in the tag cloud Bibsonomy Number of tags in the tag cloud Figure: Coverage and overlap results for baseline (red) and pre-clustering (black) methods and their corresponding logarithmic fit.

14 Improving coverage and diversity of clouds with clustering The average improvements are presented in the following tables. Coverage Dataset Baseline K-means Hierarchical Feature hashing BibSonomy Delicious Table: Mean values of coverage for the baseline and different clustering methods on BibSonomy and Delicious datasets. Overlap Dataset Baseline K-means Hierarchical Feature hashing BibSonomy Delicious Table: Mean values of overlap for the baseline and different clustering methods on BibSonomy and Delicious datasets.

15 Improving coverage and diversity of clouds with clustering The proposed methodology improves the coverage on both datasets. Similarly, the overlap of generated tag clouds is decreased. The best performing clustering technique is hierarchical clustering which computes a tag pairs co-occurrences. Coverage Overlap Delicious Number of tags in the tag cloud Delicious 0.1 Baseline K-means Hierarchical Feature hashing Number of tags in the tag cloud Coverage Overlap Bibsonomy Number of tags in the tag cloud Bibsonomy Number of tags in the tag cloud Figure: Improvements of coverage and overlap on Bibsonomy and Delicious datasets with different clustering techniques and their corresponding logarithmic fit.

16 Conclusion and future work Syntactical pre-clustering of tags improves coverage of tag clouds it prohibits a depiction of the syntactically similar tags = more diverse tag clouds Second methodology improves coverage and decreases overlap of tag clouds introduced metric chained coverage simplifies a selection process As a future work we intend to explore possible new metrics that would incorporate well-known metrics altogether and in a such way simplify a selection process of tags.

Tag cloud generation for results of multiple keywords queries

Tag cloud generation for results of multiple keywords queries Tag cloud generation for results of multiple keywords queries Martin Leginus, Peter Dolog and Ricardo Gomes Lage IWIS, Department of Computer Science, Aalborg University What tag clouds are? Tag cloud

More information

M-Eco enhanced Adaptation Service (D5.2) Dolog, Peter; Durao, Frederico Araujo; Lage, Ricardo Gomes; Leginus, Martin; Pan, Rong

M-Eco enhanced Adaptation Service (D5.2) Dolog, Peter; Durao, Frederico Araujo; Lage, Ricardo Gomes; Leginus, Martin; Pan, Rong Aalborg Universitet M-Eco enhanced Adaptation Service (D5.2) Dolog, Peter; Durao, Frederico Araujo; Lage, Ricardo Gomes; Leginus, Martin; Pan, Rong Publication date: 2012 Document Version Accepted author

More information

Entity Grouping for Accessing Social Streams via Word Clouds

Entity Grouping for Accessing Social Streams via Word Clouds Entity Grouping for Accessing Social Streams via Word Clouds Martin Leginus 1, Leon Derczynski 2, and Peter Dolog 1 1 Department of Computer Science, Aalborg University, Selma Lagerlofs Vej 300, 9200 Aalborg,

More information

Graz University of Technology Knowledge Management Institute EVALUATING TAG-BASED INFORMATION ACCESS IN IMAGE COLLECTIONS.

Graz University of Technology Knowledge Management Institute EVALUATING TAG-BASED INFORMATION ACCESS IN IMAGE COLLECTIONS. Graz University of Technology Knowledge Management Institute EVALUATING TAG-BASED INFORMATION ACCESS IN IMAGE COLLECTIONS Technical Report Christoph Trattner with Yi-ling Lin Denis Parra Zhen Yue Peter

More information

Individual and Social Behavior in Tagging Systems

Individual and Social Behavior in Tagging Systems Individual and Social Behavior in Tagging Systems Elizeu Santos-Neto,David Condon,Nazareno Andrade +,Adriana Iamnitchi,Matei Ripeanu Electrical & Computer Engineer University of British Columbia 2332 Mail

More information

On utility of temporal embeddings for skill matching. Manisha Verma, PhD student, UCL Nathan Francis, NJFSearch

On utility of temporal embeddings for skill matching. Manisha Verma, PhD student, UCL Nathan Francis, NJFSearch On utility of temporal embeddings for skill matching Manisha Verma, PhD student, UCL Nathan Francis, NJFSearch Skill Trend Importance 1. Constant evolution of labor market yields differences in importance

More information

OntoNaviERP: Ontology-supported Navigation in ERP Software Documentation

OntoNaviERP: Ontology-supported Navigation in ERP Software Documentation OntoNaviERP: Ontology-supported Navigation in ERP Software Documentation 1,2 and Andreas Wechselberger 1 1 E-Business and Web Science Research Group, Bundeswehr University Munich, Germany 2 STI Innsbruck,

More information

Conclusions and Future Work

Conclusions and Future Work Chapter 9 Conclusions and Future Work Having done the exhaustive study of recommender systems belonging to various domains, stock market prediction systems, social resource recommender, tag recommender

More information

Evaluating Tag-Based Information Access in Image Collections

Evaluating Tag-Based Information Access in Image Collections Evaluating Tag-Based Information Access in Image Collections Denis Parra dap89@pitt.edu Christoph Trattner ctrattner@iicm.edu Zhen Yue zhy18@pitt.edu Yi-ling Lin yil54@pitt.edu Peter Brusilovsky peterb@pitt.edu

More information

BSC 4934: Q BIC Capstone Workshop. Giri Narasimhan. ECS 254A; Phone: x3748

BSC 4934: Q BIC Capstone Workshop. Giri Narasimhan. ECS 254A; Phone: x3748 BSC 4934: Q BIC Capstone Workshop Giri Narasimhan ECS 254A; Phone: x3748 giri@cis.fiu.edu http://www.cis.fiu.edu/~giri/teach/bsc4934_su09.html 24 June through 7 July, 2009 06/30/09 Q'BIC Bioinformatics

More information

Finding Compensatory Pathways in Yeast Genome

Finding Compensatory Pathways in Yeast Genome Finding Compensatory Pathways in Yeast Genome Olga Ohrimenko Abstract Pathways of genes found in protein interaction networks are used to establish a functional linkage between genes. A challenging problem

More information

UNED: Evaluating Text Similarity Measures without Human Assessments

UNED: Evaluating Text Similarity Measures without Human Assessments UNED: Evaluating Text Similarity Measures without Human Assessments Enrique Amigó Julio Gonzalo Jesús Giménez Felisa Verdejo UNED, Madrid {enrique,julio,felisa}@lsi.uned.es Google, Dublin jesgim@gmail.com

More information

From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow

From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow Technical Overview Import VCF Introduction Next-generation sequencing (NGS) studies have created unanticipated challenges with

More information

IMPLEMENTATION OF FOLKSONOMY BASED TAG CLOUD MODEL FOR INFORMATION RETRIEVAL FROM DOCUMENT REPOSITORY IN AN INDIAN UNIVERSITY

IMPLEMENTATION OF FOLKSONOMY BASED TAG CLOUD MODEL FOR INFORMATION RETRIEVAL FROM DOCUMENT REPOSITORY IN AN INDIAN UNIVERSITY IMPLEMENTATION OF FOLKSONOMY BASED TAG CLOUD MODEL FOR INFORMATION RETRIEVAL FROM DOCUMENT REPOSITORY IN AN INDIAN UNIVERSITY Sohil D. Pandya 1 Paresh V. Virparia 2 and Rinku Chavda 3 1 Sardar Vallabhbhai

More information

Speech Analytics Transcription Accuracy

Speech Analytics Transcription Accuracy Speech Analytics Transcription Accuracy Understanding Verint s speech analytics transcription and categorization accuracy Verint.com Twitter.com/verint Facebook.com/verint Blog.verint.com Table of Contents

More information

Introduction to EMBASE on Ovid

Introduction to EMBASE on Ovid Introduction to EMBASE on Ovid EMBASE is a key resource for generating systematic reviews and supporting effective evidence-based medicine and drug and medical device searching EMBASE Facts Extensive EMTREE

More information

Automatic Tagging and Categorisation: Improving knowledge management and retrieval

Automatic Tagging and Categorisation: Improving knowledge management and retrieval Automatic Tagging and Categorisation: Improving knowledge management and retrieval 1. Introduction Unlike past business practices, the modern enterprise is increasingly reliant on the efficient processing

More information

Brochure. Information Management & Governance. Find and Control Enterprise Content. Micro Focus ControlPoint

Brochure. Information Management & Governance. Find and Control Enterprise Content. Micro Focus ControlPoint Brochure Information Management & Governance Find and Control Enterprise Content Micro Focus ControlPoint Brochure Find and Control Enterprise Content Micro Focus ControlPoint: A Better Way to Manage Data

More information

Evaluating Tagging Behavior in Social Bookmarking Systems: Metrics and design heuristics

Evaluating Tagging Behavior in Social Bookmarking Systems: Metrics and design heuristics Evaluating Tagging Behavior in Social Bookmarking Systems: Metrics and design heuristics Umer Farooq 1, Thomas G. Kannampallil 1, Yang Song 2, Craig H. Ganoe 1, John M. Carroll 1, and C. Lee Giles 2 1

More information

Novel Research Impact Indicators

Novel Research Impact Indicators Vol. 23, no. 4 (2014) 300 309 ISSN: 1435-5205 e-issn: 2213-056X Novel Research Impact Indicators Martin Fenner Hannover Medical School, Hannover, Germany and Public Library of Science, San Francisco, CA,

More information

CLASS/YEAR: II MCA SUB.CODE&NAME: MC7303, SOFTWARE ENGINEERING. 1. Define Software Engineering. Software Engineering: 2. What is a process Framework? Process Framework: UNIT-I 2MARKS QUESTIONS AND ANSWERS

More information

Mining Social Topologies from Email for Online Data Sharing Diana MacLean, Sudheendra Hangal, Seng Keat Teh, Monica Lam and Jeffrey Heer Stanford University A Social Topology!)*#+,(!))*+,%-*( 0,12,"3'(.%/*"-'(

More information

A Weighted Tag Similarity Measure Based on a Collaborative Weight Model

A Weighted Tag Similarity Measure Based on a Collaborative Weight Model A Weighted Tag Similarity Measure Based on a Collaborative Weight Model G.R.J.Srinivas Niket Tandon Search and Information Max Planck Institute, Extraction Lab, IIIT Hyderabad, Germany India ntandon@mpi-inf.mpg.de

More information

Creation of a PAM matrix

Creation of a PAM matrix Rationale for substitution matrices Substitution matrices are a way of keeping track of the structural, physical and chemical properties of the amino acids in proteins, in such a fashion that less detrimental

More information

Data Analytics with MATLAB Adam Filion Application Engineer MathWorks

Data Analytics with MATLAB Adam Filion Application Engineer MathWorks Data Analytics with Adam Filion Application Engineer MathWorks 2015 The MathWorks, Inc. 1 Case Study: Day-Ahead Load Forecasting Goal: Implement a tool for easy and accurate computation of dayahead system

More information

Controlled Unclassified Information Guide

Controlled Unclassified Information Guide Controlled Unclassified Information Guide Program: (CHESS) Program Manager: Mr. Dustin Fraze Program Security Officer: Ms. Denice Holden Date: March 29, 2018 Version: 1.0 1 Background The CHESS program

More information

Mining Tweets for Tag Recommendation on Social Media

Mining Tweets for Tag Recommendation on Social Media Mining Tweets for Tag Recommendation on Social Media Denzil Correa and Ashish Sureka Indraprastha Institute of Information Technology (IIIT-Delhi), India {denzilc, ashish} @iiitd.ac.in http://www.iiitd.ac.in/

More information

Document and Media Exploitation (DOMEX)

Document and Media Exploitation (DOMEX) SOLUTION BRIEF: DOCUMENT AND MEDIA EXPLOITATION (DOMEX)........................................ Document and Media Exploitation (DOMEX) Who should read this paper DOMEX analysts looking to quickly prioritize,

More information

Genome Assembly Using de Bruijn Graphs. Biostatistics 666

Genome Assembly Using de Bruijn Graphs. Biostatistics 666 Genome Assembly Using de Bruijn Graphs Biostatistics 666 Previously: Reference Based Analyses Individual short reads are aligned to reference Genotypes generated by examining reads overlapping each position

More information

The Influence of Frequency, Recency and Semantic Context on the Reuse of Tags in Social Tagging Systems

The Influence of Frequency, Recency and Semantic Context on the Reuse of Tags in Social Tagging Systems The Influence of Frequency, Recency and Semantic Context on the Reuse of Tags in Social Tagging Systems ABSTRACT Dominik Kowald Know-Center Graz University of Technology Graz, Austria dkowald@know-center.at

More information

CHAPTER 4 A FRAMEWORK FOR CUSTOMER LIFETIME VALUE USING DATA MINING TECHNIQUES

CHAPTER 4 A FRAMEWORK FOR CUSTOMER LIFETIME VALUE USING DATA MINING TECHNIQUES 49 CHAPTER 4 A FRAMEWORK FOR CUSTOMER LIFETIME VALUE USING DATA MINING TECHNIQUES 4.1 INTRODUCTION Different groups of customers prefer some special products. Customers type recognition is one of the main

More information

Taxonomy Development for Knowledge Management

Taxonomy Development for Knowledge Management Taxonomy Development for Knowledge Management Date : 24/07/2008 Mary Whittaker Librarian Boeing Library Services The Boeing Company PO Box 3707 M/C 62-LC Seattle WA 98124 +1-425-306-2086 +1-425-965-0119

More information

SOLUTION BRIEF CA AGILE REQUIREMENTS DESIGNER FOR CA AGILE CENTRAL. CA Agile Requirements Designer for CA Agile Central

SOLUTION BRIEF CA AGILE REQUIREMENTS DESIGNER FOR CA AGILE CENTRAL. CA Agile Requirements Designer for CA Agile Central SOLUTION BRIEF CA AGILE REQUIREMENTS DESIGNER FOR CA AGILE CENTRAL CA Agile Requirements Designer for CA Agile Central Automatically convert user stories into the smallest set of test cases needed to fully

More information

Data Analytics for Engineers

Data Analytics for Engineers Data Analytics for Engineers Will Wilson Application Engineer MathWorks 2016 The MathWorks, Inc. 1 Agenda Definition Common Challenges Case Study Wrap Up 2 What is Data Analytics? Data Analytics is the

More information

GenScale Scalable, Optimized and Parallel Algorithms for Genomics. Dominique LAVENIER

GenScale Scalable, Optimized and Parallel Algorithms for Genomics. Dominique LAVENIER GenScale Scalable, Optimized and Parallel Algorithms for Genomics Dominique LAVENIER Context New Sequencing Technologies - NGS Exponential growth of genomic data Drastic decreasing of costs Emergence of

More information

De novo meta-assembly of ultra-deep sequencing data

De novo meta-assembly of ultra-deep sequencing data De novo meta-assembly of ultra-deep sequencing data Hamid Mirebrahim 1, Timothy J. Close 2 and Stefano Lonardi 1 1 Department of Computer Science and Engineering 2 Department of Botany and Plant Sciences

More information

Following text taken from Suresh Kumar. Bioinformatics Web - Comprehensive educational resource on Bioinformatics. 6th May.2005

Following text taken from Suresh Kumar. Bioinformatics Web - Comprehensive educational resource on Bioinformatics. 6th May.2005 Bioinformatics is the recording, annotation, storage, analysis, and searching/retrieval of nucleic acid sequence (genes and RNAs), protein sequence and structural information. This includes databases of

More information

Profile HMMs. 2/10/05 CAP5510/CGS5166 (Lec 10) 1 START STATE 1 STATE 2 STATE 3 STATE 4 STATE 5 STATE 6 END

Profile HMMs. 2/10/05 CAP5510/CGS5166 (Lec 10) 1 START STATE 1 STATE 2 STATE 3 STATE 4 STATE 5 STATE 6 END Profile HMMs START STATE 1 STATE 2 STATE 3 STATE 4 STATE 5 STATE 6 END 2/10/05 CAP5510/CGS5166 (Lec 10) 1 Profile HMMs with InDels Insertions Deletions Insertions & Deletions DELETE 1 DELETE 2 DELETE 3

More information

Bionano Solve Theory of Operation: Variant Annotation Pipeline

Bionano Solve Theory of Operation: Variant Annotation Pipeline Bionano Solve Theory of Operation: Variant Annotation Pipeline Document Number: 30190 Document Revision: B For Research Use Only. Not for use in diagnostic procedures. Copyright 2018 Bionano Genomics,

More information

Process Mining Applied to the BPI Challenge 2012: Divide and Conquer While Discerning Resources

Process Mining Applied to the BPI Challenge 2012: Divide and Conquer While Discerning Resources Process Mining Applied to the BPI Challenge 2012: Divide and Conquer While Discerning Resources R.P. Jagadeesh Chandra Bose and Wil M.P. van der Aalst Department of Mathematics and Computer Science Eindhoven

More information

Text Mining. Theory and Applications Anurag Nagar

Text Mining. Theory and Applications Anurag Nagar Text Mining Theory and Applications Anurag Nagar Topics Introduction What is Text Mining Features of Text Document Representation Vector Space Model Document Similarities Document Classification and Clustering

More information

!!!!! ifolio Portfolio Summary. for more information August, 2014

!!!!! ifolio Portfolio Summary. for more information   August, 2014 ifolio Portfolio Summary August, 2014 for more information www.concerttechnology.com bizdev@concerttechnology.com C o n c e r t T e c h n o l o g y Overview The Amazon Kindle, introduced in 2007, help

More information

CJM-ex: Goal-oriented Exploration of Customer Journey Maps using Event Logs and Data Analytics

CJM-ex: Goal-oriented Exploration of Customer Journey Maps using Event Logs and Data Analytics CJM-ex: Goal-oriented Exploration of Customer Journey Maps using Event Logs and Data Analytics Gaël Bernard 1 and Periklis Andritsos 2 1 University of Lausanne, Faculty of Business and Economics (HEC),

More information

Research on Customer Knowledge Acquisition Model based on Data Mining

Research on Customer Knowledge Acquisition Model based on Data Mining JOURNAL OF SIMULATION, VOL. 5, NO. 2, May 2017 147 Research on Knowledge Acquisition Model based on Data Mining Shen Nali Southwest University of Political Science and Law, School of Management, Chongqing,China

More information

ChIP-seq and RNA-seq. Farhat Habib

ChIP-seq and RNA-seq. Farhat Habib ChIP-seq and RNA-seq Farhat Habib fhabib@iiserpune.ac.in Biological Goals Learn how genomes encode the diverse patterns of gene expression that define each cell type and state. Protein-DNA interactions

More information

ROAD TO STATISTICAL BIOINFORMATICS CHALLENGE 1: MULTIPLE-COMPARISONS ISSUE

ROAD TO STATISTICAL BIOINFORMATICS CHALLENGE 1: MULTIPLE-COMPARISONS ISSUE CHAPTER1 ROAD TO STATISTICAL BIOINFORMATICS Jae K. Lee Department of Public Health Science, University of Virginia, Charlottesville, Virginia, USA There has been a great explosion of biological data and

More information

Tagging That Works. Thomas Vander Wal Presented to: Web 2.0 Expo San Francisco, California :: 16 April 2007

Tagging That Works. Thomas Vander Wal Presented to: Web 2.0 Expo San Francisco, California :: 16 April 2007 Tagging That Works Thomas Vander Wal Presented to: Web 2.0 Expo San Francisco, California :: 16 April 2007 What Is A Tag? What s A Tag? Wutzatag? Tagging: Definition Simple data/metadata externally applied

More information

MaSiMe: A Customized Similarity Measure and Its Application for Tag Cloud Refactoring

MaSiMe: A Customized Similarity Measure and Its Application for Tag Cloud Refactoring MaSiMe: A Customized Similarity Measure and Its Application for Tag Cloud Refactoring David Urdiales-Nieto, Jorge Martinez-Gil, and José F. Aldana-Montes University of Málaga, Department of Computer Languages

More information

Introduction to Bioinformatics

Introduction to Bioinformatics Introduction to Bioinformatics Dr. Taysir Hassan Abdel Hamid Lecturer, Information Systems Department Faculty of Computer and Information Assiut University taysirhs@aun.edu.eg taysir_soliman@hotmail.com

More information

Geneious Biologics. Jannick Bendtsen, PhD. Vice President Technology Services Biomatters 13 December 2017

Geneious Biologics. Jannick Bendtsen, PhD. Vice President Technology Services Biomatters 13 December 2017 Geneious Biologics Jannick Bendtsen, PhD. Vice President Technology Services Biomatters 13 December 2017 // About Biomatters Founded in 2003 in Auckland, New Zealand Flagship Geneious software first released

More information

WHITE PAPER RETHINKING THE PROCUREMENT VALUE EQUATION

WHITE PAPER RETHINKING THE PROCUREMENT VALUE EQUATION RETHINKING THE PROCUREMENT : : 2 EACH FUNCTIONAL AREA WITHIN PROCUREMENT PROVIDES BENEFITS, BE IT HARD DOLLAR SAVINGS, REDUCED CYCLE TIME, OR INCREASED CASH FLOW, AND HAS A COST. IF WE THINK OF THE STANDARD

More information

Five Key Features Required for a Perfect Fit Distributed Control System

Five Key Features Required for a Perfect Fit Distributed Control System Five Key Features Required for a Perfect Fit Distributed Control System Identify the Right DCS Solution for Your Industrial Operation Introduction For industrial organizations, it is imperative to increase

More information

Process Mining techniques in complex Administrative Processes

Process Mining techniques in complex Administrative Processes Process Mining techniques in complex Administrative Processes Jan Suchy, Milan Suchy GRADIENT ECM, Kosicka 56, 82108 Bratislava, Slovakia {Jan.Suchy, Milan.Suchy}@gradientecm.com Abstract. This research

More information

Content Reuse and Interest Sharing in Tagging Communities

Content Reuse and Interest Sharing in Tagging Communities Content Reuse and Interest Sharing in Tagging Communities Elizeu Santos-Neto *, Matei Ripeanu *, Adriana Iamnitchi + University of British Columbia, Electrical and Computer Engineering Department * 2332

More information

Application of Data Mining In Agriculture

Application of Data Mining In Agriculture Application of Data Mining In Agriculture P. Grace Sharon Student, Department of Information Technology, Saveetha School of Engineering, Saveetha University, Chennai 602 105, India Abstract: Data mining

More information

RETHINKING THE PROCUREMENT VALUE EQUATION

RETHINKING THE PROCUREMENT VALUE EQUATION RETHINKING THE PROCUREMENT VALUE EQUATION CONTENTS CONTENTS Introduction... 3 Value Contributors... 4 Why Choose A Full Source Source-to-Pay Solution...5 Considerations in Choosing a Source-to-Pay Solution...

More information

Measuring the Price of Discrimination with Data on Poker Games

Measuring the Price of Discrimination with Data on Poker Games Measuring the Price of Discrimination with Data on Poker Games May 29, 2013 Dr. Ingo Fiedler 1. Introduction Economic theory suggests that discrimination is price sensitive and money an equalizer This

More information

Gene List Enrichment Analysis

Gene List Enrichment Analysis Outline Gene List Enrichment Analysis George Bell, Ph.D. BaRC Hot Topics March 16, 2010 Why do enrichment analysis? Main types Selecting or ranking genes Annotation sources Statistics Remaining issues

More information

A logistic regression model for Semantic Web service matchmaking

A logistic regression model for Semantic Web service matchmaking . BRIEF REPORT. SCIENCE CHINA Information Sciences July 2012 Vol. 55 No. 7: 1715 1720 doi: 10.1007/s11432-012-4591-x A logistic regression model for Semantic Web service matchmaking WEI DengPing 1*, WANG

More information

Jan Schmidt (Bamberg), Thomas N. Burg (Vienna) ICA Conference, Dresden,

Jan Schmidt (Bamberg), Thomas N. Burg (Vienna) ICA Conference, Dresden, Titel Jan Schmidt (Bamberg), Thomas N. Burg (Vienna) Bottom-up Classification of Content in Networked Organizational Communication ICA Conference, Dresden, 22.06.2006 #2 of 18 Agenda 1. an overview 2.

More information

Keyword Extraction using Word Co-occurrence TIR 2010, Bilbao 31 August 2010

Keyword Extraction using Word Co-occurrence TIR 2010, Bilbao 31 August 2010 Keyword Extraction using Word Co-occurrence TIR 2010, Bilbao 31 August 2010 Christian Wartena (Novay Rogier Brussee (Univ. of Applied Sciences Utrecht, presenter Wout Slakhorst (Novay Problem description

More information

EMC M&R (WATCH4NET) Cross-Domain Performance, Capacity and SLA Management. Ensure high service quality to users ESSENTIALS

EMC M&R (WATCH4NET) Cross-Domain Performance, Capacity and SLA Management. Ensure high service quality to users ESSENTIALS EMC M&R (WATCH4NET) Cross-Domain Performance, Capacity and SLA Management Ensure high service quality to users The data center infrastructure is a rapidly-evolving environment containing hundreds or thousands

More information

HIERARCHICAL LOCATION CLASSIFICATION OF TWITTER USERS WITH A CONTENT BASED PROBABILITY MODEL. Mounika Nukala

HIERARCHICAL LOCATION CLASSIFICATION OF TWITTER USERS WITH A CONTENT BASED PROBABILITY MODEL. Mounika Nukala HIERARCHICAL LOCATION CLASSIFICATION OF TWITTER USERS WITH A CONTENT BASED PROBABILITY MODEL by Mounika Nukala Submitted in partial fulfilment of the requirements for the degree of Master of Computer Science

More information

Report Consolidator. Author: Ravisankar Manickam

Report Consolidator. Author: Ravisankar Manickam Report Consolidator Author: Ravisankar Manickam Agenda Abstract Types of Metrics Report Consolidator Tool Conclusion Logica 2011. All rights reserved No. 2 Abstract» Every company s ultimate endeavor is

More information

Ch. 6: Understanding and Characterizing the Workload

Ch. 6: Understanding and Characterizing the Workload Ch. 6: Understanding and Characterizing the Workload Kenneth Mitchell School of Computing & Engineering, University of Missouri-Kansas City, Kansas City, MO 64110 Kenneth Mitchell, CS & EE dept., SCE,

More information

Software Reliability and Testing: Know When To Say When. SSTC June 2007 Dale Brenneman McCabe Software

Software Reliability and Testing: Know When To Say When. SSTC June 2007 Dale Brenneman McCabe Software Software Reliability and Testing: Know When To Say When SSTC June 2007 Dale Brenneman McCabe Software 1 SW Components with Higher Reliability Risk, in terms of: Change Status (new or modified in this build/release)

More information

Trust-Networks in Recommender Systems

Trust-Networks in Recommender Systems San Jose State University SJSU ScholarWorks Master's Projects Master's Theses and Graduate Research 2008 Trust-Networks in Recommender Systems Kristen Mori San Jose State University Follow this and additional

More information

In 1996, the genome of Saccharomyces cerevisiae was completed due to the work of

In 1996, the genome of Saccharomyces cerevisiae was completed due to the work of Summary: Kellis, M. et al. Nature 423,241-253. Background In 1996, the genome of Saccharomyces cerevisiae was completed due to the work of approximately 600 scientists world-wide. This group of researchers

More information

Context-aware recommendation

Context-aware recommendation Context-aware recommendation Eirini Kolomvrezou, Hendrik Heuer Special Course in Computer and Information Science User Modelling & Recommender Systems Aalto University Context-aware recommendation 2 Recommendation

More information

Testing Calculation Engines using Input Space Partitioning & Automation Thesis for the MS in Software Engineering Jeff Offutt and Chandra Alluri

Testing Calculation Engines using Input Space Partitioning & Automation Thesis for the MS in Software Engineering Jeff Offutt and Chandra Alluri Testing Calculation Engines using Input Space Partitioning & An Industrial Study of Applying Input Space Partitioning to Test Financial Calculation Engines Automation Thesis for the MS in Software Engineering

More information

Enhancing social tagging with a knowledge organization system. Brian Matthews, K. Golub, C. Jones, J. Moon, M. L. Nielsen, B. Puzoń, D.

Enhancing social tagging with a knowledge organization system. Brian Matthews, K. Golub, C. Jones, J. Moon, M. L. Nielsen, B. Puzoń, D. Enhancing social tagging with a knowledge organization system Brian Matthews, K. Golub, C. Jones, J. Moon, M. L. Nielsen, B. Puzoń, D. Tudhope EnTag: Enhancing Social Tagging for Discovery K. Golub, C.

More information

A Quick Chat About SOMF Structural Modeling

A Quick Chat About SOMF Structural Modeling www.modelingconcepts.com Do not be afraid to ask! A Quick Chat About SOMF Structural Modeling For architects, business analysts, system analysts, software developers, modelers, team leaders, and managers

More information

SimilarWeb vs. Direct Measurement What is the Difference?

SimilarWeb vs. Direct Measurement What is the Difference? SimilarWeb vs. Direct Measurement What is the Difference? Understand the differences between SimilarWeb and direct measurement tools to gain the most value out of your digital market intelligence. What

More information

Introduction to the MiSeq

Introduction to the MiSeq Introduction to the MiSeq 2011 Illumina, Inc. All rights reserved. Illumina, illuminadx, BeadArray, BeadXpress, cbot, CSPro, DASL, Eco, Genetic Energy, GAIIx, Genome Analyzer, GenomeStudio, GoldenGate,

More information

Explore the Structure of Social Tags by Subsumption Relations

Explore the Structure of Social Tags by Subsumption Relations Explore the Structure of Social Tags by Subsumption Relations Xiance Si, Zhiyuan Liu, Maosong Sun Department of Computer Science and Technology State Key Lab on Intelligent Technology and Systems National

More information

The 4-Dimensional Swimlane: An expanded view of process modeling July 3, 2018 Dr. Joseph Drasin

The 4-Dimensional Swimlane: An expanded view of process modeling July 3, 2018 Dr. Joseph Drasin : An expanded view of process modeling July 3, 2018 Dr. Joseph Drasin Representing processes in a manner that can be easily consumed by decisionmakers remains one of the most challenging aspects of being

More information

RNA standards v May

RNA standards v May Standards, Guidelines and Best Practices for RNA-Seq: 2010/2011 I. Introduction: Sequence based assays of transcriptomes (RNA-seq) are in wide use because of their favorable properties for quantification,

More information

A Systematic Approach to Performance Evaluation

A Systematic Approach to Performance Evaluation A Systematic Approach to Performance evaluation is the process of determining how well an existing or future computer system meets a set of alternative performance objectives. Arbitrarily selecting performance

More information

ANALYTIC SOLUTIONS WITH DISPARATE DATA

ANALYTIC SOLUTIONS WITH DISPARATE DATA ANALYTIC SOLUTIONS WITH DISPARATE DATA CQSDI 2018, Cape Canaveral, FL John Schroeder and Chad Hall Lockheed Martin Aeronautics Enterprise Integration Advanced Analytics We see disparate data sources as

More information

Agenda Endorsement System

Agenda Endorsement System Agenda Endorsement System Ankitha Premkumar apremkum@usc.edu Prathibha Muralidharan prathibm@usc.edu Sharath Ravishankar ravishas@usc.edu Reshma Malla reshmama@usc.edu ABSTRACT Planning is a pivotal activity.

More information

Figure S4 A-H : Initiation site properties and evolutionary changes

Figure S4 A-H : Initiation site properties and evolutionary changes A 0.3 Figure S4 A-H : Initiation site properties and evolutionary changes G-correction not used 0.25 Fraction of total counts 0.2 0.5 0. tag 2 tags 3 tags 4 tags 5 tags 6 tags 7tags 8tags 9 tags >9 tags

More information

The development of hardware, software and scientific advancements. made the computerization of business easier. Scientific advancements

The development of hardware, software and scientific advancements. made the computerization of business easier. Scientific advancements Chapter 5 A CASE STUDY ON A SUPERMARKET 5.1 Introduction The development of hardware, software and scientific advancements made the computerization of business easier. Scientific advancements have made

More information

Fatigue Monitoring for Demonstrating Fatigue Design Basis Compliance

Fatigue Monitoring for Demonstrating Fatigue Design Basis Compliance Fatigue Monitoring for Demonstrating Fatigue Design Basis Compliance Gary L. Stevens, Arthur F. Deardorff, David A. Gerber Structural Integrity Associates 3315 Almaden Expressway, Suite 24 San Jose, CA

More information

Training Data Tools to Enhance Safety Intelligence

Training Data Tools to Enhance Safety Intelligence Training Data Tools to Enhance Safety Intelligence Marco Merens Chief, Integrated Aviation Analysis Air Navigation Bureau Addis Ababa, Ethiopia 12 April 2017 Data-driven Decision Making for Training Using

More information

De Novo Assembly of High-throughput Short Read Sequences

De Novo Assembly of High-throughput Short Read Sequences De Novo Assembly of High-throughput Short Read Sequences Chuming Chen Center for Bioinformatics and Computational Biology (CBCB) University of Delaware NECC Third Skate Genome Annotation Workshop May 23,

More information

Text Analysis of American Airlines Customer Reviews

Text Analysis of American Airlines Customer Reviews SESUG 2016 Paper EPO-281 Text Analysis of American Airlines Customer Reviews Rajesh Tolety, Oklahoma State University Saurabh Kumar Choudhary, Oklahoma State University ABSTRACT Which airline should I

More information

Resolution of Chemical Disease Relations with Diverse Features and Rules

Resolution of Chemical Disease Relations with Diverse Features and Rules Resolution of Chemical Disease Relations with Diverse Features and Rules Dingcheng Li*, Naveed Afzal*, Majid Rastegar Mojarad, Ravikumar Komandur Elayavilli, Sijia Liu, Yanshan Wang, Feichen Shen, Hongfang

More information

A Guide to the Business Analysis Body of Knowledge (BABOK Guide), Version 2.0 Skillport

A Guide to the Business Analysis Body of Knowledge (BABOK Guide), Version 2.0 Skillport A Guide to the Business Analysis Body of Knowledge (BABOK Guide), Version 2.0 by The International Institute of Business Analysis (IIBA) International Institute of Business Analysis. (c) 2009. Copying

More information

CA Project & Portfolio Management

CA Project & Portfolio Management CA FOUNDATION PAPER JULY 2018 CA Project & Portfolio Management CA PPM Reconstructing resource management tools to simplify tasks, drive collaboration and facilitate action Executive Summary Challenge

More information

InCites Benchmarking & Analytics

InCites Benchmarking & Analytics InCites Benchmarking & Analytics Massimiliano Carloni Solution Specialist massimiliano.carloni@clarivate.com June 2018 Agenda 1. Clarivate Analytics: news 2. Publons & Kopernio plug-in 3. Web of Science

More information

SoC Planning, Management, Reporting, Auditing, and Signoff

SoC Planning, Management, Reporting, Auditing, and Signoff SoC Planning, Management, Reporting, Auditing, and Signoff By Cadence Accurately monitoring progress on complex integrated circuit designs (or SoCs) has become more difficult as the designs have increased

More information

Collaborative Print Archives Framework. Planning Meeting #2 March 26, 2010

Collaborative Print Archives Framework. Planning Meeting #2 March 26, 2010 Collaborative Print Archives Framework Planning Meeting #2 March 26, 2010 Agenda I. Review of February meeting II. Followup on plans for a print archives metadata system III.Options for a CRL print archiving

More information

Gain control over all enterprise content

Gain control over all enterprise content Brochure Gain control over all enterprise content HP ControlPoint A better way to manage big data Most organizations today store data in a number of business systems and information repositories. This

More information

Data Mining for Biological Data Analysis

Data Mining for Biological Data Analysis Data Mining for Biological Data Analysis Data Mining and Text Mining (UIC 583 @ Politecnico di Milano) References Data Mining Course by Gregory-Platesky Shapiro available at www.kdnuggets.com Jiawei Han

More information

Generative Models for Networks and Applications to E-Commerce

Generative Models for Networks and Applications to E-Commerce Generative Models for Networks and Applications to E-Commerce Patrick J. Wolfe (with David C. Parkes and R. Kang-Xing Jin) Division of Engineering and Applied Sciences Department of Statistics Harvard

More information

Preface to the third edition Preface to the first edition Acknowledgments

Preface to the third edition Preface to the first edition Acknowledgments Contents Foreword Preface to the third edition Preface to the first edition Acknowledgments Part I PRELIMINARIES XXI XXIII XXVII XXIX CHAPTER 1 Introduction 3 1.1 What Is Business Analytics?................

More information

TWEETDICT: Identification of Topically Related Twitter Hashtags

TWEETDICT: Identification of Topically Related Twitter Hashtags TWEETDICT: Identification of Topically Related Twitter Hashtags Fabian Dreer dreer@cip.ifi.lmu.de Patrick Elsässer elsaesser@cip.ifi.lmu.de Eduard Saller sallere@cip.ifi.lmu.de Desislava Zhekova zhekova@cis.uni-muenchen.de

More information