Methodologies for Improved Tag Cloud Generation with Clusterin
|
|
- Vanessa Blake
- 5 years ago
- Views:
Transcription
1 Methodologies for Improved Tag Cloud Generation with Clustering. Martin Leginus, Peter Dolog, Ricardo Lage, and Frederico Durao Department of Computer Science, Aalborg University July, 2012
2 Agenda Introduction Syntactical pre-clustering of tags Improving coverage and diversity of tag clouds with clustering Experiments
3 Introduction Tag clouds Problems Methodologies Experiments Conclusion Social collaborative tagging Tagging is a process when a user assigns a tag to an item Users collaboratively annotate items with tags It improves searching, discovering and categorizing of content Tagging information of users (implicit ratings) is utilized by tag-based recommenders
4 Tag clouds A tag cloud is a visual depiction of user-generated tags typically used to describe the content of web sites [Wikipedia, ] Often used as visual information retrieval interface.
5 Common use cases of tag clouds Content Browsing Navigation along the site content Serendipitous discovery
6 Problems The majority of tag clouds are constructed according to the tag popularity/frequency. These clouds suffer from: A cloud does not depict whole spectrum of tag space (missing rarely used tags which can be interesting for users) as only most frequent tags are considered. New tags (new content areas) in a system are hardly depicted in a tag cloud because of they low tag frequency. The most popular tags can be often useless for content discrimination or discovery. (see tags: night, autumn, cool..)
7 Problems Syntactically similar tags cause redudancy
8 Tag cloud s metrics Synthetic metrics express a quality of tags selection process. A coverage for a particular tag t expresses how many of considered documents were annotated with a tag t Coverage(t) = Dt D a, (1) Overlap of T c: Different tags in T c may be assigned with the same item in D Tc. The overlap metric captures the extent of such redundancy. Overlap(T c) = avg ti t j D ti D tj min{ D ti, D tj }, (2) We introduce a new metric chained coverage that captures how many documents are covered by a considered tag given that documents covered by previously selected tags are not considered. This metric combines coverage and overlap altogether Chained coverage(t T s) = Dt \ D T s, (3) D a
9 Syntactical Pre-clustering of Tags Tags in these systems can have the same semantical meaning however they are syntactically different i.e., typos, singular and plural forms and compounded tags. Levenhstein distance (measures the number of required changes: substitution, insertion and deletion of a character are allowed operations to transform one tag into another) is computed for each tag pair from the tag space. Tag space is divided into clusters The most frequent tag from each cluster is used for further computations
10 Improving coverage and diversity of clouds with clustering We explore 3 different clustering techniques, each obtained cluster expresses a laten topic in the tag space. The goal is to cover as many clusters as possible. 1 A tag space is clustered and divided into disjoint clusters. 2 Tags are selected proportionally from each cluster according to their coverage. 3 The tags selection algorithm maximizes a chained coverage metric. The maximization of the chained coverage promotes (specific) tags with the high coverage of not yet covered documents by previously selected tags. On the other hand frequent tags with low chained coverage (general meaning) are omitted.
11 Experiments The improvements of the methodologies are evaluated in terms of coverage and overlap of generated tag clouds. The evaluation is conducted on the following datasets: Bibsonomy contains 5794 distinct users, items and tags. The total number of tagging posts is Delicious dataset contains users, unique tags and bookmarks. The total number of tagging posts is
12 Syntactical pre-clustering of tags Table: Mean values of coverage and overlap for the baseline and syntactical pre-clustering methods on BibSonomy and Delicious datasets. Coverage Overlap Dataset Baseline Pre-Clustering Baseline Pre-Clustering BibSonomy Delicious Coverage had a 5% (5079 documents) increase on BiSonomy dataset and 3.5% (3072 documents) increase on Delicious. Overlap, on the other hand, had similar results.
13 Syntactical pre-clustering of tags As number of tags increases, coverage and overlap improves in a logarithmic fashion. Baseline Syntactical clustering Coverage Overlap Delicious Number of tags in the tag cloud Delicious Number of tags in the tag cloud Coverage Overlap Bibsonomy Number of tags in the tag cloud Bibsonomy Number of tags in the tag cloud Figure: Coverage and overlap results for baseline (red) and pre-clustering (black) methods and their corresponding logarithmic fit.
14 Improving coverage and diversity of clouds with clustering The average improvements are presented in the following tables. Coverage Dataset Baseline K-means Hierarchical Feature hashing BibSonomy Delicious Table: Mean values of coverage for the baseline and different clustering methods on BibSonomy and Delicious datasets. Overlap Dataset Baseline K-means Hierarchical Feature hashing BibSonomy Delicious Table: Mean values of overlap for the baseline and different clustering methods on BibSonomy and Delicious datasets.
15 Improving coverage and diversity of clouds with clustering The proposed methodology improves the coverage on both datasets. Similarly, the overlap of generated tag clouds is decreased. The best performing clustering technique is hierarchical clustering which computes a tag pairs co-occurrences. Coverage Overlap Delicious Number of tags in the tag cloud Delicious 0.1 Baseline K-means Hierarchical Feature hashing Number of tags in the tag cloud Coverage Overlap Bibsonomy Number of tags in the tag cloud Bibsonomy Number of tags in the tag cloud Figure: Improvements of coverage and overlap on Bibsonomy and Delicious datasets with different clustering techniques and their corresponding logarithmic fit.
16 Conclusion and future work Syntactical pre-clustering of tags improves coverage of tag clouds it prohibits a depiction of the syntactically similar tags = more diverse tag clouds Second methodology improves coverage and decreases overlap of tag clouds introduced metric chained coverage simplifies a selection process As a future work we intend to explore possible new metrics that would incorporate well-known metrics altogether and in a such way simplify a selection process of tags.
Tag cloud generation for results of multiple keywords queries
Tag cloud generation for results of multiple keywords queries Martin Leginus, Peter Dolog and Ricardo Gomes Lage IWIS, Department of Computer Science, Aalborg University What tag clouds are? Tag cloud
More informationM-Eco enhanced Adaptation Service (D5.2) Dolog, Peter; Durao, Frederico Araujo; Lage, Ricardo Gomes; Leginus, Martin; Pan, Rong
Aalborg Universitet M-Eco enhanced Adaptation Service (D5.2) Dolog, Peter; Durao, Frederico Araujo; Lage, Ricardo Gomes; Leginus, Martin; Pan, Rong Publication date: 2012 Document Version Accepted author
More informationEntity Grouping for Accessing Social Streams via Word Clouds
Entity Grouping for Accessing Social Streams via Word Clouds Martin Leginus 1, Leon Derczynski 2, and Peter Dolog 1 1 Department of Computer Science, Aalborg University, Selma Lagerlofs Vej 300, 9200 Aalborg,
More informationGraz University of Technology Knowledge Management Institute EVALUATING TAG-BASED INFORMATION ACCESS IN IMAGE COLLECTIONS.
Graz University of Technology Knowledge Management Institute EVALUATING TAG-BASED INFORMATION ACCESS IN IMAGE COLLECTIONS Technical Report Christoph Trattner with Yi-ling Lin Denis Parra Zhen Yue Peter
More informationIndividual and Social Behavior in Tagging Systems
Individual and Social Behavior in Tagging Systems Elizeu Santos-Neto,David Condon,Nazareno Andrade +,Adriana Iamnitchi,Matei Ripeanu Electrical & Computer Engineer University of British Columbia 2332 Mail
More informationOn utility of temporal embeddings for skill matching. Manisha Verma, PhD student, UCL Nathan Francis, NJFSearch
On utility of temporal embeddings for skill matching Manisha Verma, PhD student, UCL Nathan Francis, NJFSearch Skill Trend Importance 1. Constant evolution of labor market yields differences in importance
More informationOntoNaviERP: Ontology-supported Navigation in ERP Software Documentation
OntoNaviERP: Ontology-supported Navigation in ERP Software Documentation 1,2 and Andreas Wechselberger 1 1 E-Business and Web Science Research Group, Bundeswehr University Munich, Germany 2 STI Innsbruck,
More informationConclusions and Future Work
Chapter 9 Conclusions and Future Work Having done the exhaustive study of recommender systems belonging to various domains, stock market prediction systems, social resource recommender, tag recommender
More informationEvaluating Tag-Based Information Access in Image Collections
Evaluating Tag-Based Information Access in Image Collections Denis Parra dap89@pitt.edu Christoph Trattner ctrattner@iicm.edu Zhen Yue zhy18@pitt.edu Yi-ling Lin yil54@pitt.edu Peter Brusilovsky peterb@pitt.edu
More informationBSC 4934: Q BIC Capstone Workshop. Giri Narasimhan. ECS 254A; Phone: x3748
BSC 4934: Q BIC Capstone Workshop Giri Narasimhan ECS 254A; Phone: x3748 giri@cis.fiu.edu http://www.cis.fiu.edu/~giri/teach/bsc4934_su09.html 24 June through 7 July, 2009 06/30/09 Q'BIC Bioinformatics
More informationFinding Compensatory Pathways in Yeast Genome
Finding Compensatory Pathways in Yeast Genome Olga Ohrimenko Abstract Pathways of genes found in protein interaction networks are used to establish a functional linkage between genes. A challenging problem
More informationUNED: Evaluating Text Similarity Measures without Human Assessments
UNED: Evaluating Text Similarity Measures without Human Assessments Enrique Amigó Julio Gonzalo Jesús Giménez Felisa Verdejo UNED, Madrid {enrique,julio,felisa}@lsi.uned.es Google, Dublin jesgim@gmail.com
More informationFrom Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow
From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow Technical Overview Import VCF Introduction Next-generation sequencing (NGS) studies have created unanticipated challenges with
More informationIMPLEMENTATION OF FOLKSONOMY BASED TAG CLOUD MODEL FOR INFORMATION RETRIEVAL FROM DOCUMENT REPOSITORY IN AN INDIAN UNIVERSITY
IMPLEMENTATION OF FOLKSONOMY BASED TAG CLOUD MODEL FOR INFORMATION RETRIEVAL FROM DOCUMENT REPOSITORY IN AN INDIAN UNIVERSITY Sohil D. Pandya 1 Paresh V. Virparia 2 and Rinku Chavda 3 1 Sardar Vallabhbhai
More informationSpeech Analytics Transcription Accuracy
Speech Analytics Transcription Accuracy Understanding Verint s speech analytics transcription and categorization accuracy Verint.com Twitter.com/verint Facebook.com/verint Blog.verint.com Table of Contents
More informationIntroduction to EMBASE on Ovid
Introduction to EMBASE on Ovid EMBASE is a key resource for generating systematic reviews and supporting effective evidence-based medicine and drug and medical device searching EMBASE Facts Extensive EMTREE
More informationAutomatic Tagging and Categorisation: Improving knowledge management and retrieval
Automatic Tagging and Categorisation: Improving knowledge management and retrieval 1. Introduction Unlike past business practices, the modern enterprise is increasingly reliant on the efficient processing
More informationBrochure. Information Management & Governance. Find and Control Enterprise Content. Micro Focus ControlPoint
Brochure Information Management & Governance Find and Control Enterprise Content Micro Focus ControlPoint Brochure Find and Control Enterprise Content Micro Focus ControlPoint: A Better Way to Manage Data
More informationEvaluating Tagging Behavior in Social Bookmarking Systems: Metrics and design heuristics
Evaluating Tagging Behavior in Social Bookmarking Systems: Metrics and design heuristics Umer Farooq 1, Thomas G. Kannampallil 1, Yang Song 2, Craig H. Ganoe 1, John M. Carroll 1, and C. Lee Giles 2 1
More informationNovel Research Impact Indicators
Vol. 23, no. 4 (2014) 300 309 ISSN: 1435-5205 e-issn: 2213-056X Novel Research Impact Indicators Martin Fenner Hannover Medical School, Hannover, Germany and Public Library of Science, San Francisco, CA,
More informationCLASS/YEAR: II MCA SUB.CODE&NAME: MC7303, SOFTWARE ENGINEERING. 1. Define Software Engineering. Software Engineering: 2. What is a process Framework? Process Framework: UNIT-I 2MARKS QUESTIONS AND ANSWERS
More informationMining Social Topologies from Email for Online Data Sharing Diana MacLean, Sudheendra Hangal, Seng Keat Teh, Monica Lam and Jeffrey Heer Stanford University A Social Topology!)*#+,(!))*+,%-*( 0,12,"3'(.%/*"-'(
More informationA Weighted Tag Similarity Measure Based on a Collaborative Weight Model
A Weighted Tag Similarity Measure Based on a Collaborative Weight Model G.R.J.Srinivas Niket Tandon Search and Information Max Planck Institute, Extraction Lab, IIIT Hyderabad, Germany India ntandon@mpi-inf.mpg.de
More informationCreation of a PAM matrix
Rationale for substitution matrices Substitution matrices are a way of keeping track of the structural, physical and chemical properties of the amino acids in proteins, in such a fashion that less detrimental
More informationData Analytics with MATLAB Adam Filion Application Engineer MathWorks
Data Analytics with Adam Filion Application Engineer MathWorks 2015 The MathWorks, Inc. 1 Case Study: Day-Ahead Load Forecasting Goal: Implement a tool for easy and accurate computation of dayahead system
More informationControlled Unclassified Information Guide
Controlled Unclassified Information Guide Program: (CHESS) Program Manager: Mr. Dustin Fraze Program Security Officer: Ms. Denice Holden Date: March 29, 2018 Version: 1.0 1 Background The CHESS program
More informationMining Tweets for Tag Recommendation on Social Media
Mining Tweets for Tag Recommendation on Social Media Denzil Correa and Ashish Sureka Indraprastha Institute of Information Technology (IIIT-Delhi), India {denzilc, ashish} @iiitd.ac.in http://www.iiitd.ac.in/
More informationDocument and Media Exploitation (DOMEX)
SOLUTION BRIEF: DOCUMENT AND MEDIA EXPLOITATION (DOMEX)........................................ Document and Media Exploitation (DOMEX) Who should read this paper DOMEX analysts looking to quickly prioritize,
More informationGenome Assembly Using de Bruijn Graphs. Biostatistics 666
Genome Assembly Using de Bruijn Graphs Biostatistics 666 Previously: Reference Based Analyses Individual short reads are aligned to reference Genotypes generated by examining reads overlapping each position
More informationThe Influence of Frequency, Recency and Semantic Context on the Reuse of Tags in Social Tagging Systems
The Influence of Frequency, Recency and Semantic Context on the Reuse of Tags in Social Tagging Systems ABSTRACT Dominik Kowald Know-Center Graz University of Technology Graz, Austria dkowald@know-center.at
More informationCHAPTER 4 A FRAMEWORK FOR CUSTOMER LIFETIME VALUE USING DATA MINING TECHNIQUES
49 CHAPTER 4 A FRAMEWORK FOR CUSTOMER LIFETIME VALUE USING DATA MINING TECHNIQUES 4.1 INTRODUCTION Different groups of customers prefer some special products. Customers type recognition is one of the main
More informationTaxonomy Development for Knowledge Management
Taxonomy Development for Knowledge Management Date : 24/07/2008 Mary Whittaker Librarian Boeing Library Services The Boeing Company PO Box 3707 M/C 62-LC Seattle WA 98124 +1-425-306-2086 +1-425-965-0119
More informationSOLUTION BRIEF CA AGILE REQUIREMENTS DESIGNER FOR CA AGILE CENTRAL. CA Agile Requirements Designer for CA Agile Central
SOLUTION BRIEF CA AGILE REQUIREMENTS DESIGNER FOR CA AGILE CENTRAL CA Agile Requirements Designer for CA Agile Central Automatically convert user stories into the smallest set of test cases needed to fully
More informationData Analytics for Engineers
Data Analytics for Engineers Will Wilson Application Engineer MathWorks 2016 The MathWorks, Inc. 1 Agenda Definition Common Challenges Case Study Wrap Up 2 What is Data Analytics? Data Analytics is the
More informationGenScale Scalable, Optimized and Parallel Algorithms for Genomics. Dominique LAVENIER
GenScale Scalable, Optimized and Parallel Algorithms for Genomics Dominique LAVENIER Context New Sequencing Technologies - NGS Exponential growth of genomic data Drastic decreasing of costs Emergence of
More informationDe novo meta-assembly of ultra-deep sequencing data
De novo meta-assembly of ultra-deep sequencing data Hamid Mirebrahim 1, Timothy J. Close 2 and Stefano Lonardi 1 1 Department of Computer Science and Engineering 2 Department of Botany and Plant Sciences
More informationFollowing text taken from Suresh Kumar. Bioinformatics Web - Comprehensive educational resource on Bioinformatics. 6th May.2005
Bioinformatics is the recording, annotation, storage, analysis, and searching/retrieval of nucleic acid sequence (genes and RNAs), protein sequence and structural information. This includes databases of
More informationProfile HMMs. 2/10/05 CAP5510/CGS5166 (Lec 10) 1 START STATE 1 STATE 2 STATE 3 STATE 4 STATE 5 STATE 6 END
Profile HMMs START STATE 1 STATE 2 STATE 3 STATE 4 STATE 5 STATE 6 END 2/10/05 CAP5510/CGS5166 (Lec 10) 1 Profile HMMs with InDels Insertions Deletions Insertions & Deletions DELETE 1 DELETE 2 DELETE 3
More informationBionano Solve Theory of Operation: Variant Annotation Pipeline
Bionano Solve Theory of Operation: Variant Annotation Pipeline Document Number: 30190 Document Revision: B For Research Use Only. Not for use in diagnostic procedures. Copyright 2018 Bionano Genomics,
More informationProcess Mining Applied to the BPI Challenge 2012: Divide and Conquer While Discerning Resources
Process Mining Applied to the BPI Challenge 2012: Divide and Conquer While Discerning Resources R.P. Jagadeesh Chandra Bose and Wil M.P. van der Aalst Department of Mathematics and Computer Science Eindhoven
More informationText Mining. Theory and Applications Anurag Nagar
Text Mining Theory and Applications Anurag Nagar Topics Introduction What is Text Mining Features of Text Document Representation Vector Space Model Document Similarities Document Classification and Clustering
More information!!!!! ifolio Portfolio Summary. for more information August, 2014
ifolio Portfolio Summary August, 2014 for more information www.concerttechnology.com bizdev@concerttechnology.com C o n c e r t T e c h n o l o g y Overview The Amazon Kindle, introduced in 2007, help
More informationCJM-ex: Goal-oriented Exploration of Customer Journey Maps using Event Logs and Data Analytics
CJM-ex: Goal-oriented Exploration of Customer Journey Maps using Event Logs and Data Analytics Gaël Bernard 1 and Periklis Andritsos 2 1 University of Lausanne, Faculty of Business and Economics (HEC),
More informationResearch on Customer Knowledge Acquisition Model based on Data Mining
JOURNAL OF SIMULATION, VOL. 5, NO. 2, May 2017 147 Research on Knowledge Acquisition Model based on Data Mining Shen Nali Southwest University of Political Science and Law, School of Management, Chongqing,China
More informationChIP-seq and RNA-seq. Farhat Habib
ChIP-seq and RNA-seq Farhat Habib fhabib@iiserpune.ac.in Biological Goals Learn how genomes encode the diverse patterns of gene expression that define each cell type and state. Protein-DNA interactions
More informationROAD TO STATISTICAL BIOINFORMATICS CHALLENGE 1: MULTIPLE-COMPARISONS ISSUE
CHAPTER1 ROAD TO STATISTICAL BIOINFORMATICS Jae K. Lee Department of Public Health Science, University of Virginia, Charlottesville, Virginia, USA There has been a great explosion of biological data and
More informationTagging That Works. Thomas Vander Wal Presented to: Web 2.0 Expo San Francisco, California :: 16 April 2007
Tagging That Works Thomas Vander Wal Presented to: Web 2.0 Expo San Francisco, California :: 16 April 2007 What Is A Tag? What s A Tag? Wutzatag? Tagging: Definition Simple data/metadata externally applied
More informationMaSiMe: A Customized Similarity Measure and Its Application for Tag Cloud Refactoring
MaSiMe: A Customized Similarity Measure and Its Application for Tag Cloud Refactoring David Urdiales-Nieto, Jorge Martinez-Gil, and José F. Aldana-Montes University of Málaga, Department of Computer Languages
More informationIntroduction to Bioinformatics
Introduction to Bioinformatics Dr. Taysir Hassan Abdel Hamid Lecturer, Information Systems Department Faculty of Computer and Information Assiut University taysirhs@aun.edu.eg taysir_soliman@hotmail.com
More informationGeneious Biologics. Jannick Bendtsen, PhD. Vice President Technology Services Biomatters 13 December 2017
Geneious Biologics Jannick Bendtsen, PhD. Vice President Technology Services Biomatters 13 December 2017 // About Biomatters Founded in 2003 in Auckland, New Zealand Flagship Geneious software first released
More informationWHITE PAPER RETHINKING THE PROCUREMENT VALUE EQUATION
RETHINKING THE PROCUREMENT : : 2 EACH FUNCTIONAL AREA WITHIN PROCUREMENT PROVIDES BENEFITS, BE IT HARD DOLLAR SAVINGS, REDUCED CYCLE TIME, OR INCREASED CASH FLOW, AND HAS A COST. IF WE THINK OF THE STANDARD
More informationFive Key Features Required for a Perfect Fit Distributed Control System
Five Key Features Required for a Perfect Fit Distributed Control System Identify the Right DCS Solution for Your Industrial Operation Introduction For industrial organizations, it is imperative to increase
More informationProcess Mining techniques in complex Administrative Processes
Process Mining techniques in complex Administrative Processes Jan Suchy, Milan Suchy GRADIENT ECM, Kosicka 56, 82108 Bratislava, Slovakia {Jan.Suchy, Milan.Suchy}@gradientecm.com Abstract. This research
More informationContent Reuse and Interest Sharing in Tagging Communities
Content Reuse and Interest Sharing in Tagging Communities Elizeu Santos-Neto *, Matei Ripeanu *, Adriana Iamnitchi + University of British Columbia, Electrical and Computer Engineering Department * 2332
More informationApplication of Data Mining In Agriculture
Application of Data Mining In Agriculture P. Grace Sharon Student, Department of Information Technology, Saveetha School of Engineering, Saveetha University, Chennai 602 105, India Abstract: Data mining
More informationRETHINKING THE PROCUREMENT VALUE EQUATION
RETHINKING THE PROCUREMENT VALUE EQUATION CONTENTS CONTENTS Introduction... 3 Value Contributors... 4 Why Choose A Full Source Source-to-Pay Solution...5 Considerations in Choosing a Source-to-Pay Solution...
More informationMeasuring the Price of Discrimination with Data on Poker Games
Measuring the Price of Discrimination with Data on Poker Games May 29, 2013 Dr. Ingo Fiedler 1. Introduction Economic theory suggests that discrimination is price sensitive and money an equalizer This
More informationGene List Enrichment Analysis
Outline Gene List Enrichment Analysis George Bell, Ph.D. BaRC Hot Topics March 16, 2010 Why do enrichment analysis? Main types Selecting or ranking genes Annotation sources Statistics Remaining issues
More informationA logistic regression model for Semantic Web service matchmaking
. BRIEF REPORT. SCIENCE CHINA Information Sciences July 2012 Vol. 55 No. 7: 1715 1720 doi: 10.1007/s11432-012-4591-x A logistic regression model for Semantic Web service matchmaking WEI DengPing 1*, WANG
More informationJan Schmidt (Bamberg), Thomas N. Burg (Vienna) ICA Conference, Dresden,
Titel Jan Schmidt (Bamberg), Thomas N. Burg (Vienna) Bottom-up Classification of Content in Networked Organizational Communication ICA Conference, Dresden, 22.06.2006 #2 of 18 Agenda 1. an overview 2.
More informationKeyword Extraction using Word Co-occurrence TIR 2010, Bilbao 31 August 2010
Keyword Extraction using Word Co-occurrence TIR 2010, Bilbao 31 August 2010 Christian Wartena (Novay Rogier Brussee (Univ. of Applied Sciences Utrecht, presenter Wout Slakhorst (Novay Problem description
More informationEMC M&R (WATCH4NET) Cross-Domain Performance, Capacity and SLA Management. Ensure high service quality to users ESSENTIALS
EMC M&R (WATCH4NET) Cross-Domain Performance, Capacity and SLA Management Ensure high service quality to users The data center infrastructure is a rapidly-evolving environment containing hundreds or thousands
More informationHIERARCHICAL LOCATION CLASSIFICATION OF TWITTER USERS WITH A CONTENT BASED PROBABILITY MODEL. Mounika Nukala
HIERARCHICAL LOCATION CLASSIFICATION OF TWITTER USERS WITH A CONTENT BASED PROBABILITY MODEL by Mounika Nukala Submitted in partial fulfilment of the requirements for the degree of Master of Computer Science
More informationReport Consolidator. Author: Ravisankar Manickam
Report Consolidator Author: Ravisankar Manickam Agenda Abstract Types of Metrics Report Consolidator Tool Conclusion Logica 2011. All rights reserved No. 2 Abstract» Every company s ultimate endeavor is
More informationCh. 6: Understanding and Characterizing the Workload
Ch. 6: Understanding and Characterizing the Workload Kenneth Mitchell School of Computing & Engineering, University of Missouri-Kansas City, Kansas City, MO 64110 Kenneth Mitchell, CS & EE dept., SCE,
More informationSoftware Reliability and Testing: Know When To Say When. SSTC June 2007 Dale Brenneman McCabe Software
Software Reliability and Testing: Know When To Say When SSTC June 2007 Dale Brenneman McCabe Software 1 SW Components with Higher Reliability Risk, in terms of: Change Status (new or modified in this build/release)
More informationTrust-Networks in Recommender Systems
San Jose State University SJSU ScholarWorks Master's Projects Master's Theses and Graduate Research 2008 Trust-Networks in Recommender Systems Kristen Mori San Jose State University Follow this and additional
More informationIn 1996, the genome of Saccharomyces cerevisiae was completed due to the work of
Summary: Kellis, M. et al. Nature 423,241-253. Background In 1996, the genome of Saccharomyces cerevisiae was completed due to the work of approximately 600 scientists world-wide. This group of researchers
More informationContext-aware recommendation
Context-aware recommendation Eirini Kolomvrezou, Hendrik Heuer Special Course in Computer and Information Science User Modelling & Recommender Systems Aalto University Context-aware recommendation 2 Recommendation
More informationTesting Calculation Engines using Input Space Partitioning & Automation Thesis for the MS in Software Engineering Jeff Offutt and Chandra Alluri
Testing Calculation Engines using Input Space Partitioning & An Industrial Study of Applying Input Space Partitioning to Test Financial Calculation Engines Automation Thesis for the MS in Software Engineering
More informationEnhancing social tagging with a knowledge organization system. Brian Matthews, K. Golub, C. Jones, J. Moon, M. L. Nielsen, B. Puzoń, D.
Enhancing social tagging with a knowledge organization system Brian Matthews, K. Golub, C. Jones, J. Moon, M. L. Nielsen, B. Puzoń, D. Tudhope EnTag: Enhancing Social Tagging for Discovery K. Golub, C.
More informationA Quick Chat About SOMF Structural Modeling
www.modelingconcepts.com Do not be afraid to ask! A Quick Chat About SOMF Structural Modeling For architects, business analysts, system analysts, software developers, modelers, team leaders, and managers
More informationSimilarWeb vs. Direct Measurement What is the Difference?
SimilarWeb vs. Direct Measurement What is the Difference? Understand the differences between SimilarWeb and direct measurement tools to gain the most value out of your digital market intelligence. What
More informationIntroduction to the MiSeq
Introduction to the MiSeq 2011 Illumina, Inc. All rights reserved. Illumina, illuminadx, BeadArray, BeadXpress, cbot, CSPro, DASL, Eco, Genetic Energy, GAIIx, Genome Analyzer, GenomeStudio, GoldenGate,
More informationExplore the Structure of Social Tags by Subsumption Relations
Explore the Structure of Social Tags by Subsumption Relations Xiance Si, Zhiyuan Liu, Maosong Sun Department of Computer Science and Technology State Key Lab on Intelligent Technology and Systems National
More informationThe 4-Dimensional Swimlane: An expanded view of process modeling July 3, 2018 Dr. Joseph Drasin
: An expanded view of process modeling July 3, 2018 Dr. Joseph Drasin Representing processes in a manner that can be easily consumed by decisionmakers remains one of the most challenging aspects of being
More informationRNA standards v May
Standards, Guidelines and Best Practices for RNA-Seq: 2010/2011 I. Introduction: Sequence based assays of transcriptomes (RNA-seq) are in wide use because of their favorable properties for quantification,
More informationA Systematic Approach to Performance Evaluation
A Systematic Approach to Performance evaluation is the process of determining how well an existing or future computer system meets a set of alternative performance objectives. Arbitrarily selecting performance
More informationANALYTIC SOLUTIONS WITH DISPARATE DATA
ANALYTIC SOLUTIONS WITH DISPARATE DATA CQSDI 2018, Cape Canaveral, FL John Schroeder and Chad Hall Lockheed Martin Aeronautics Enterprise Integration Advanced Analytics We see disparate data sources as
More informationAgenda Endorsement System
Agenda Endorsement System Ankitha Premkumar apremkum@usc.edu Prathibha Muralidharan prathibm@usc.edu Sharath Ravishankar ravishas@usc.edu Reshma Malla reshmama@usc.edu ABSTRACT Planning is a pivotal activity.
More informationFigure S4 A-H : Initiation site properties and evolutionary changes
A 0.3 Figure S4 A-H : Initiation site properties and evolutionary changes G-correction not used 0.25 Fraction of total counts 0.2 0.5 0. tag 2 tags 3 tags 4 tags 5 tags 6 tags 7tags 8tags 9 tags >9 tags
More informationThe development of hardware, software and scientific advancements. made the computerization of business easier. Scientific advancements
Chapter 5 A CASE STUDY ON A SUPERMARKET 5.1 Introduction The development of hardware, software and scientific advancements made the computerization of business easier. Scientific advancements have made
More informationFatigue Monitoring for Demonstrating Fatigue Design Basis Compliance
Fatigue Monitoring for Demonstrating Fatigue Design Basis Compliance Gary L. Stevens, Arthur F. Deardorff, David A. Gerber Structural Integrity Associates 3315 Almaden Expressway, Suite 24 San Jose, CA
More informationTraining Data Tools to Enhance Safety Intelligence
Training Data Tools to Enhance Safety Intelligence Marco Merens Chief, Integrated Aviation Analysis Air Navigation Bureau Addis Ababa, Ethiopia 12 April 2017 Data-driven Decision Making for Training Using
More informationDe Novo Assembly of High-throughput Short Read Sequences
De Novo Assembly of High-throughput Short Read Sequences Chuming Chen Center for Bioinformatics and Computational Biology (CBCB) University of Delaware NECC Third Skate Genome Annotation Workshop May 23,
More informationText Analysis of American Airlines Customer Reviews
SESUG 2016 Paper EPO-281 Text Analysis of American Airlines Customer Reviews Rajesh Tolety, Oklahoma State University Saurabh Kumar Choudhary, Oklahoma State University ABSTRACT Which airline should I
More informationResolution of Chemical Disease Relations with Diverse Features and Rules
Resolution of Chemical Disease Relations with Diverse Features and Rules Dingcheng Li*, Naveed Afzal*, Majid Rastegar Mojarad, Ravikumar Komandur Elayavilli, Sijia Liu, Yanshan Wang, Feichen Shen, Hongfang
More informationA Guide to the Business Analysis Body of Knowledge (BABOK Guide), Version 2.0 Skillport
A Guide to the Business Analysis Body of Knowledge (BABOK Guide), Version 2.0 by The International Institute of Business Analysis (IIBA) International Institute of Business Analysis. (c) 2009. Copying
More informationCA Project & Portfolio Management
CA FOUNDATION PAPER JULY 2018 CA Project & Portfolio Management CA PPM Reconstructing resource management tools to simplify tasks, drive collaboration and facilitate action Executive Summary Challenge
More informationInCites Benchmarking & Analytics
InCites Benchmarking & Analytics Massimiliano Carloni Solution Specialist massimiliano.carloni@clarivate.com June 2018 Agenda 1. Clarivate Analytics: news 2. Publons & Kopernio plug-in 3. Web of Science
More informationSoC Planning, Management, Reporting, Auditing, and Signoff
SoC Planning, Management, Reporting, Auditing, and Signoff By Cadence Accurately monitoring progress on complex integrated circuit designs (or SoCs) has become more difficult as the designs have increased
More informationCollaborative Print Archives Framework. Planning Meeting #2 March 26, 2010
Collaborative Print Archives Framework Planning Meeting #2 March 26, 2010 Agenda I. Review of February meeting II. Followup on plans for a print archives metadata system III.Options for a CRL print archiving
More informationGain control over all enterprise content
Brochure Gain control over all enterprise content HP ControlPoint A better way to manage big data Most organizations today store data in a number of business systems and information repositories. This
More informationData Mining for Biological Data Analysis
Data Mining for Biological Data Analysis Data Mining and Text Mining (UIC 583 @ Politecnico di Milano) References Data Mining Course by Gregory-Platesky Shapiro available at www.kdnuggets.com Jiawei Han
More informationGenerative Models for Networks and Applications to E-Commerce
Generative Models for Networks and Applications to E-Commerce Patrick J. Wolfe (with David C. Parkes and R. Kang-Xing Jin) Division of Engineering and Applied Sciences Department of Statistics Harvard
More informationPreface to the third edition Preface to the first edition Acknowledgments
Contents Foreword Preface to the third edition Preface to the first edition Acknowledgments Part I PRELIMINARIES XXI XXIII XXVII XXIX CHAPTER 1 Introduction 3 1.1 What Is Business Analytics?................
More informationTWEETDICT: Identification of Topically Related Twitter Hashtags
TWEETDICT: Identification of Topically Related Twitter Hashtags Fabian Dreer dreer@cip.ifi.lmu.de Patrick Elsässer elsaesser@cip.ifi.lmu.de Eduard Saller sallere@cip.ifi.lmu.de Desislava Zhekova zhekova@cis.uni-muenchen.de
More information