A logistic regression model for Semantic Web service matchmaking

Size: px
Start display at page:

Download "A logistic regression model for Semantic Web service matchmaking"

Transcription

1 . BRIEF REPORT. SCIENCE CHINA Information Sciences July 2012 Vol. 55 No. 7: doi: /s x A logistic regression model for Semantic Web service matchmaking WEI DengPing 1*, WANG Ting 1 & WANG Ji 2 1 School of Computer, National University of Defense Technology, Changsha , China; 2 National Laboratory for Parallel and Distributed Processing, Changsha , China Received March 24, 2011; accepted February 21, 2012; published online May 17, 2012 Abstract Semantic Web service matchmaking, as one of the most challenging problems in Semantic Web services (SWS), aims to filter and rank a set of services with respect to a service query by using a certain matching strategy. In this paper, we propose a logistic regression based method to aggregate several matching strategies instead of a fixed integration (e.g., the weighted sum) for SWS matchmaking. The logistic regression model is trained on training data derived from binary relevance assessments of existing test collections, and then used to predict the probability of relevance between a new pair of query and service according to their matching values obtained from various matching strategies. Services are then ranked according to the probabilities of relevance with respect to each query. Our method is evaluated on two main test collections, SAWSDL-TC2 and Jena Geography Dataset(JGD). Experimental results show that the logistic regression model can effectively predict the relevance between a query and a service, and hence can improve the effectiveness of service matchmaking. Keywords Semantic Web service, matchmaking, logistic regression Citation Wei D P, Wang T, Wang J. A logistic regression model for Semantic Web service matchmaking. Sci China Inf Sci, 2012, 55: , doi: /s x 1 Introduction Semantic Web services (SWS), as an application of the ideas of the Semantic Web to the service oriented computing, has attracted much attention recently [1]. SWS matchmaking is one of the most challenging problems in SWS [2], which aims to filter and rank a set of services with respect to a query by using a certain matching strategy that measures the similarity between a query and a service. A variety of competing matching strategies have been proposed recently [3,4], among which integrated matching strategies that combine the matching results obtained from different matching strategies have been shown to be promising according to the intensive comparisons from various service matchmaking contests 1). Integration provides a comprehensive and complementary way to measure the similarity between a query and a service by considering different descriptions of Web services. Thus, how to effectively integrate individual similarity values obtained from useful matching strategies into an overall score becomes an important issue. *Corresponding author ( dpwei@nudt.edu.cn) 1) klusch/s3/index.html c Science China Press and Springer-Verlag Berlin Heidelberg 2012 info.scichina.com

2 1716 Wei D P, et al. Sci China Inf Sci July 2012 Vol. 55 No. 7 An intuitive integration way is to use empirical values as the weights of different matching strategies. For example, URBE [5] uses weighted sum to integrate several similarity values into an overall score. However, these empirical weights are difficult to be predicted correctly in practice, due to the various characteristics of applications. To alleviate this problem, several machine learning based methods have been used to learn these weights for service discovery. Christoph et al. [6] proposed the SWS matchmaker imatcher which integrates various text similarity measures using different machine learning algorithms. Klusch et al. [7] also proposed the SAWSDL service matchmaker SAWSDL-MX2 that integrates three matching variants using support vector machine (SVM), including logic-based, text similarity based matching of semantic annotations, and structural matching. The logistic regression model is a popular model for binary data prediction, regression and classification [8], and it has been successfully applied in several applications such as text retrieval [9]. Essentially, the service matchmaking problem can be viewed as a binary data prediction problem of judging whether a service is relevant to a query or not. In addition, the logistic regression provides a normal way to analyze the contribution of each matching strategy to service matchmaking in a specific domain according to the estimates of the coefficients, which is of practical help for domain experts to select appropriate matching strategies for their specific applications. Based on this insight, in this paper, we propose a method that exploits the logistic regression model to integrate various matching strategies and to predict the probability of relevance between a query and a service based on their individual matching scores. Following our previous work [10,11], we adopt several matching strategies to compute the individual similarity values, and then integrate them into an overall similarity using the trained logistic regression model. Experimental results show the logistic regression model outperforms all basic matching strategies in terms of recall and precision, and also outperforms the well-known integrated matchmakers. 2 The approach Let x = {x 1,x 2,...,x k } denote k matching strategies used to calculate the similarity values between a pair of query and service (q j,s i ). A set of similarity values {x 1 (q j,s i ),x 2 (q j,s i ),...,x k (q j,s i )} between each pair of query and service (q j,s i ) are then obtained by using these k matching strategies respectively, where x l (q j,s i ) is the similarity value between q j and s i, which is calculated by using the matching strategy x l. Our aim is to establish a function to integrate these individual similarity scores into an overall score which is used to rank services. Logistic regression is a variation of ordinary regression which is used when the response variable is a binary variable (occurrence or non-occurrence of the outcome event) and the input variables are continuous, categorical, or both. Service matchmaking problem can be essentially viewed as a binary prediction problem that judges whether a service is relevant to a query according to the similarity values obtained from various matching strategies. Therefore, we are interested in predicting the probability of relevance between a pair of query q j and service s i. Let R denote the relevance between a pair of query and service, where R = 1 indicates that the service is relevant to the query, and R = 0 indicates that it is irrelevant to the query. We model the conditional mean of R given a pair of service and query (q j,s i ) and the set of matching strategies x = {x 1,x 2,...,x k }, i.e., E(R =1 x, q j,s i ), via the following logistic regression function: eβ0+β1x1+ +β kx k E(R =1 x, q j,s i )= 1+e β0+β1x1+ +β. (1) kx k This function produces E(R =1 x, q j,s i ) between 0 and 1, and the terms β = {β 0,...,β k } are unknown parameters (called regression coefficients), to be estimated based on the available observations. A logit transformation of E(R =1 x, q j,s i ) is defined as [ ] E(R =1 x, qj,s i ) g(x 1,x 2,...,x k )= ln 1 E(R =1 x, q j,s i ) = β 0 + β 1 x 1 + β 2 x β k x k. (2)

3 Wei D P, et al. Sci China Inf Sci July 2012 Vol. 55 No Table 1 The basic matching strategies Matching strategy Similarity measure Function Name Description text Dice s coefficient Cosine similarity 2 B(nr ) B(ns) Sim(n r,n s)= B(n, r) + B(n s) B(x): set of bigrams in string x; n x:nameofx Sim(v r,v s)= vr i=1 (vr i vs i )2 vr vs, i=1 v2 r i i=1 v2 s i v x: classic vector space model of description text in x 1 Sim(v r,v s)= vr 1+ i=1 (vr i vs i )2 Syntactic IO Euclidean distance v x: boolean vector for the unfolded concept expressions of I/O concepts of x Semantic IO Logic based matching The algorithm described in [10] To obtain the estimates of the unknown parameters {β i }, the common used maximum likelihood is employed, which maximizes the probability of obtaining the observed service test collections. Let the matrix S denote matching values of the observed pairs of query and service according to the k matching strategies, whose relevance is known in advance. Let s i. =[s i,1,s i,2,...,s i,k ]denotetheith row of the matrix S, ands i,j denote the similarity value between the ith pair of query and service according to the jth matching strategy. In this paper, we select the basic matching strategies listed in Table 1 as x = {x 1,x 2,x 3,x 4 } in Eq. (1), i.e., name based matching strategy (Name), description text based matching strategy (Description text), semantic annotation based syntactic matching strategy (Syntactic IO) and semantic matching strategy (Semantic IO). The selected matching strategies are based on the most commonly used description components among several Semantic Web service ontologies/specifications that may be complementary to describing the functional properties of Web services. S = i=1 x 1 x 2 x k s 1,1 s 1,2 s 1,k s 2,1 s 2,2 s 2,k s m,1 s m,2 s m,k In order to apply maximum likelihood, the likelihood function is constructed as follows. m [ ] Ri [ ] 1 Ri 1 1 L(β) = 1+e si.β 1+e si.β. (4) The maximum likelihood estimators (MLE) of the parameter {β i } are calculated by using the well-known Quasi Newton method based on the samples in Matrix S. Finally, the probability of relevance of a new pair of query and service can be calculated by Eq. (1). Generally, the relevant services of a query are much less than the advertised services; thus the number of irrelevant pairs of query and service is much larger than the number of relevant pairs of query and service. This unbalanced training data set will lead to the effect that conventional machine learning methods are biased toward a larger class. To overcome this problem, cost sensitive model is developed by defining the penalty of each kind of samples. Our goal is to use the learned model to predict the probabilities that a service is relevant to the query, in order that the matchmaker ranks services according to these probabilities. Normally, users want to find their desired services at the top of the ranking list, without caring whether all the relevant services are returned. From this point of view, a false negative prediction is, therefore, considered to have more serious consequences than a false positive prediction in this work. Thus, the misclassifying an irrelevant pair of query and service is set to 40 times as expensive as misclassifying a relevant pair of query and service in this cost sensitive model, since relevant services are about 1/40 fewer than non-relevant services for each query in general.. (3)

4 1718 Wei D P, et al. Sci China Inf Sci July 2012 Vol. 55 No. 7 Figure 1 Performance comparison of statistical model based strategies. (a) SAWSDL-TC2; (b) JGD. 3 Experimental results In this evaluation, we use two test collections from Semantic Service Selection (S3) contest ) : SAWSDL-TC2 and Jena Geography Dataset (JGD). Each test collection is represented by a set of vectors with cardinality Q P in matrix S, inwhichq and P represent the sets of queries and services respectively in the test collection. Each row in matrix S corresponds to a sample, which represents the similarity values of a pair of query and service vs. the matching strategies respectively. The set of samples is divided into Q folds, and each fold consists of all the samples related to a query. Each time, we take one fold as test set (related to one query) and learn the logistic regression model on the remaining Q 1 folds, and then measure the effectiveness on the test query. Finally, the macro-average of the results of the Q runs is considered as the performance of the statistical model based matching strategies on the whole test collection. This approach follows the standard N-fold cross validation in machine learning. To show the performance of our method, in this paper, we also implement other machine learning based matchmaking methods based on the same matching strategies by using WEKA [12], such as ɛ-svr, linear regression, J48 decision tree, Adaboosting based J48, etc. Figure 1 shows a performance comparison between logistic regression model and other statistical model based matchmakers. On SAWSDL-TC2 (Figure 1(a)), the logistic regression based matchmaker (logisitic) outperforms other statistical model based matchmakers with a mean average precision (MAP) of 0.749, although it is slightly outperformed at the beginning by ɛ-svr based matchmaker (MAP=0.723). On JGD, logistic regression based matchmaker (MAP=0.67) outperforms other matchmakers before half of the relevant services are returned. On the whole, ɛ-svr based matchmaker performs best with MAP of The logistic regression based matchmaker outperforms J48, Adaboosting J48 and linear regression based matchmakers. Figure 2 shows a performance comparison between logistic regression model and the basic matchmaking strategies which are used to learn the model in this paper. Figure 2(a) indicates that, on SAWSDL- TC2, logistic based matchmaker outperforms each basic matchmaking strategy. On JGD (Figure 2(b)), the same conclusion can also be drawn, although it is slightly outperformed at the very beginning by single matching strategies such as semantic annotations based matchmaking strategy (semantic IO) and description text based matchmaking strategy (text). In addition, we also compare our method with the well-known SVM based matchmaker SAWSDL-MX2 that integrates different matching strategies from those used in this paper. The mean average precision of our method is on SAWSDL-TC2 and 0.67 on JGD, while the MAP of SAWSDL-TC2 is on SAWSDL-TC2 and 0.45 on JGD. In summary, our logistic regression model can effectively integrate the commonly used matching strategies shown in Table 1, and also improve the effectiveness of service matchmaking by learning from other s 2) klusch/s3/html/2009.html

5 Wei D P, et al. Sci China Inf Sci July 2012 Vol. 55 No Figure 2 The performance comparison between logistic regression model and basic matching strategies. (a) SAWSDL- TC2; (b) JGD. strong points to offset one s weaknesses. It also indicates that selecting proper basic matching strategies is very important to integrated service matchmaking, since each matching strategy may contribute differently in service matchmaking. This is another advantage of our method, since logistic regression can help us to select proper matching strategies according to the estimates of the coefficients. 4 Conclusions This paper proposes a novel method for Semantic Web service matchmaking, which employs logistic regression to aggregate multi-results obtained from several basic matching strategies into an overall similarity value. Experiments show that the logistic regression model is able to provide an overall and almost overwhelming performance. We can, therefore, conclude that the logistic regression model used in this paper is effective and appropriate for integrating individual similarity values obtained from various matching strategies on different description components. Acknowledgements The research was supported by National Grand Fundamental Research Program of China (Grant No. 2011CB ) and National Natural Science Foundation of China (Grant No ). References 1 Wang H B, Huang J Z X, Qu Y Z, et al. Web services: problems and future directions. Web Semant Sci Serv Agent World Wide Web, 2004, 1: Klusch M. Semantic service coordination. In: Schumacher M, Helin H, Schuldt H, eds. CASCOM: Intelligent Service Coordination in the Semantic Web. Berlin: Springer, Deng S G, Wu Z H, Wu J, et al. An efficient service discovery method and its application. Int J Web Serv Res, 2009, 6: Cai S B, Zou Y Z, Xie B, et al. Mining the Web of trust for Web services selection. In: Proceedings of 2008 IEEE International Conference on Web Services (ICWS 2008). Washington: IEEE Computer Society, Plebani P, Pernici B. URBE: web service retrieval based on similarity evaluation. IEEE Trans Knowl Data Eng, 2009, 21: Christoph K, Abraham B. The creation and evaluation of isparql strategies for matchmaking. In: Proceedings of the 5th European Semantic Web Conference (ESWC). Berlin: Springer, Klusch M, Kapahnke P, Zinnikus I. Adaptive hybrid semantic selection of SAWSDL services with SAWSDL-MX2. Int J Semant Web Inf Syst, 2010, 6: 1 26

6 1720 Wei D P, et al. Sci China Inf Sci July 2012 Vol. 55 No. 7 8 Hosmer D W, Lemesbow S. Applied logistic regression. 2nd ed. New York: Wiley Inc, Gey F C. Inferring probability of relevance using the method of logistic regression. In: Proceedings of the 7th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. New York: Springer- Verlag, Wei D P, Wang T, Tang J T, et al. SAWSDL-iMatcher: A customizable and effective Semantic Web service matchmaker. Web Semant Sci Serv Agent World Wide Web, 2011, 9: Wei D P, Wang T, Wang J, et al. Extracting semantic constraint from description text for Semantic Web service discovery. In: Proceedings of the 7th International Semantic Web Conference. Berlin: Springer, Hall M, Frank E, Holmes G, et al. The WEKA data mining software: An update. SIGKDD Explor, 2009, 11: 10 18

Improving Ranking-based Recommendation by Social Information and Negative Similarity Ying Liu a,b, * Jiajun Yang a

Improving Ranking-based Recommendation by Social Information and Negative Similarity Ying Liu a,b, * Jiajun Yang a Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 55 (2015 ) 732 740 Information Technology and Quantitative Management (ITQM 2015) Improving Ranking-based Recommendation

More information

Predicting user rating for Yelp businesses leveraging user similarity

Predicting user rating for Yelp businesses leveraging user similarity Predicting user rating for Yelp businesses leveraging user similarity Kritika Singh kritika@eng.ucsd.edu Abstract Users visit a Yelp business, such as a restaurant, based on its overall rating and often

More information

An Implementation of genetic algorithm based feature selection approach over medical datasets

An Implementation of genetic algorithm based feature selection approach over medical datasets An Implementation of genetic algorithm based feature selection approach over medical s Dr. A. Shaik Abdul Khadir #1, K. Mohamed Amanullah #2 #1 Research Department of Computer Science, KhadirMohideen College,

More information

Do Ads Compete or Collaborate? Designing Click Models with Full Relationship Incorporated

Do Ads Compete or Collaborate? Designing Click Models with Full Relationship Incorporated Do s Compete or Collaborate? Designing Click Models with Full Relationship Incorporated Xin Xin School of Computer Science Beijing Institute of Technology xxin@bit.edu.cn Michael R. Lyu The Chinese University

More information

Cold-start Solution to Location-based Entity Shop. Recommender Systems Using Online Sales Records

Cold-start Solution to Location-based Entity Shop. Recommender Systems Using Online Sales Records Cold-start Solution to Location-based Entity Shop Recommender Systems Using Online Sales Records Yichen Yao 1, Zhongjie Li 2 1 Department of Engineering Mechanics, Tsinghua University, Beijing, China yaoyichen@aliyun.com

More information

Predicting Yelp Ratings From Business and User Characteristics

Predicting Yelp Ratings From Business and User Characteristics Predicting Yelp Ratings From Business and User Characteristics Jeff Han Justin Kuang Derek Lim Stanford University jeffhan@stanford.edu kuangj@stanford.edu limderek@stanford.edu I. Abstract With online

More information

Silvia Calegari, Marco Comerio, Andrea Maurino,

Silvia Calegari, Marco Comerio, Andrea Maurino, A Semantic and Information Retrieval based Approach to Service Contract Selection Silvia Calegari, Marco Comerio, Andrea Maurino, Emanuele Panzeri, and Gabriella Pasi Department of Informatics, Systems

More information

Management Science Letters

Management Science Letters Management Science Letters 1 (2011) 449 456 Contents lists available at GrowingScience Management Science Letters homepage: www.growingscience.com/msl Improving electronic customers' profile in recommender

More information

A Comparative Study of Filter-based Feature Ranking Techniques

A Comparative Study of Filter-based Feature Ranking Techniques Western Kentucky University From the SelectedWorks of Dr. Huanjing Wang August, 2010 A Comparative Study of Filter-based Feature Ranking Techniques Huanjing Wang, Western Kentucky University Taghi M. Khoshgoftaar,

More information

Classification Model for Intent Mining in Personal Website Based on Support Vector Machine

Classification Model for Intent Mining in Personal Website Based on Support Vector Machine , pp.145-152 http://dx.doi.org/10.14257/ijdta.2016.9.2.16 Classification Model for Intent Mining in Personal Website Based on Support Vector Machine Shuang Zhang, Nianbin Wang School of Computer Science

More information

A Propagation-based Algorithm for Inferring Gene-Disease Associations

A Propagation-based Algorithm for Inferring Gene-Disease Associations A Propagation-based Algorithm for Inferring Gene-Disease Associations Oron Vanunu Roded Sharan Abstract: A fundamental challenge in human health is the identification of diseasecausing genes. Recently,

More information

An Analysis Framework for Content-based Job Recommendation. Author(s) Guo, Xingsheng; Jerbi, Houssem; O'Mahony, Michael P.

An Analysis Framework for Content-based Job Recommendation. Author(s) Guo, Xingsheng; Jerbi, Houssem; O'Mahony, Michael P. Provided by the author(s) and University College Dublin Library in accordance with publisher policies. Please cite the published version when available. Title An Analysis Framework for Content-based Job

More information

WaterlooClarke: TREC 2015 Total Recall Track

WaterlooClarke: TREC 2015 Total Recall Track WaterlooClarke: TREC 2015 Total Recall Track Haotian Zhang, Wu Lin, Yipeng Wang, Charles L. A. Clarke and Mark D. Smucker Data System Group University of Waterloo TREC, 2015 Haotian Zhang, Wu Lin, Yipeng

More information

SOCIAL MEDIA MINING. Behavior Analytics

SOCIAL MEDIA MINING. Behavior Analytics SOCIAL MEDIA MINING Behavior Analytics Dear instructors/users of these slides: Please feel free to include these slides in your own material, or modify them as you see fit. If you decide to incorporate

More information

Rank hotels on Expedia.com to maximize purchases

Rank hotels on Expedia.com to maximize purchases Rank hotels on Expedia.com to maximize purchases Nishith Khantal, Valentina Kroshilina, Deepak Maini December 14, 2013 1 Introduction For an online travel agency (OTA), matching users to hotel inventory

More information

Inferring Social Ties across Heterogeneous Networks

Inferring Social Ties across Heterogeneous Networks Inferring Social Ties across Heterogeneous Networks CS 6001 Complex Network Structures HARISH ANANDAN Introduction Social Ties Information carrying connections between people It can be: Strong, weak or

More information

When to Book: Predicting Flight Pricing

When to Book: Predicting Flight Pricing When to Book: Predicting Flight Pricing Qiqi Ren Stanford University qiqiren@stanford.edu Abstract When is the best time to purchase a flight? Flight prices fluctuate constantly, so purchasing at different

More information

Enhanced Cost Sensitive Boosting Network for Software Defect Prediction

Enhanced Cost Sensitive Boosting Network for Software Defect Prediction Enhanced Cost Sensitive Boosting Network for Software Defect Prediction Sreelekshmy. P M.Tech, Department of Computer Science and Engineering, Lourdes Matha College of Science & Technology, Kerala,India

More information

A Preliminary Evaluation of China s Implementation Progress in Energy Intensity Targets

A Preliminary Evaluation of China s Implementation Progress in Energy Intensity Targets A Preliminary Evaluation of China s Implementation Progress in Energy Intensity Targets Yahua Wang and Jiaochen Liang Abstract China proposed an ambitious goal of reducing energy consumption per unit of

More information

Preprocessing Technique for Discrimination Prevention in Data Mining

Preprocessing Technique for Discrimination Prevention in Data Mining The International Journal Of Engineering And Science (IJES) Volume 3 Issue 6 Pages 12-16 2014 ISSN (e): 2319 1813 ISSN (p): 2319 1805 Preprocessing Technique for Discrimination Prevention in Data Mining

More information

Application of Decision Trees in Mining High-Value Credit Card Customers

Application of Decision Trees in Mining High-Value Credit Card Customers Application of Decision Trees in Mining High-Value Credit Card Customers Jian Wang Bo Yuan Wenhuang Liu Graduate School at Shenzhen, Tsinghua University, Shenzhen 8, P.R. China E-mail: gregret24@gmail.com,

More information

Content Based Filtering

Content Based Filtering CS-590I Information Retrieval Content Based Filtering Luo Si Department of Computer Science Purdue University Outline Introduction to content based filtering Applications main research problems Main framework

More information

Standard Deviation as a Query Hardness Estimator

Standard Deviation as a Query Hardness Estimator Standard Deviation as a Query Hardness Estimator Joaquín Pérez-Iglesias and Lourdes Araujo Universidad Nacional de Educación a Distancia Madrid 28040, Spain joaquin.perez@lsi.uned.es, lurdes@lsi.uned.es

More information

PubRec: Recommending Publications Based On Publicly Available Meta-Data

PubRec: Recommending Publications Based On Publicly Available Meta-Data PubRec: Recommending Publications Based On Publicly Available Meta-Data Anas Alzoghbi, Victor Anthony Arrascue Ayala, Peter M. Fischer, and Georg Lausen Department of Computer Science, University of Freiburg

More information

Predicting Purchase Behavior of E-commerce Customer, One-stage or Two-stage?

Predicting Purchase Behavior of E-commerce Customer, One-stage or Two-stage? 2016 International Conference on Artificial Intelligence and Computer Science (AICS 2016) ISBN: 978-1-60595-411-0 Predicting Purchase Behavior of E-commerce Customer, One-stage or Two-stage? Chen CHEN

More information

Agreement of Relevance Assessment between Human Assessors and Crowdsourced Workers in Information Retrieval Systems Evaluation Experimentation

Agreement of Relevance Assessment between Human Assessors and Crowdsourced Workers in Information Retrieval Systems Evaluation Experimentation Agreement of Relevance Assessment between Human Assessors and Crowdsourced Workers in Information Retrieval Systems Evaluation Experimentation Parnia Samimi and Sri Devi Ravana Abstract Relevance judgment

More information

ML Methods for Solving Complex Sorting and Ranking Problems in Human Hiring

ML Methods for Solving Complex Sorting and Ranking Problems in Human Hiring ML Methods for Solving Complex Sorting and Ranking Problems in Human Hiring 1 Kavyashree M Bandekar, 2 Maddala Tejasree, 3 Misba Sultana S N, 4 Nayana G K, 5 Harshavardhana Doddamani 1, 2, 3, 4 Engineering

More information

On of the major merits of the Flag Model is its potential for representation. There are three approaches to such a task: a qualitative, a

On of the major merits of the Flag Model is its potential for representation. There are three approaches to such a task: a qualitative, a Regime Analysis Regime Analysis is a discrete multi-assessment method suitable to assess projects as well as policies. The strength of the Regime Analysis is that it is able to cope with binary, ordinal,

More information

Big Data. Methodological issues in using Big Data for Official Statistics

Big Data. Methodological issues in using Big Data for Official Statistics Giulio Barcaroli Istat (barcarol@istat.it) Big Data Effective Processing and Analysis of Very Large and Unstructured data for Official Statistics. Methodological issues in using Big Data for Official Statistics

More information

Accurate Campaign Targeting Using Classification Algorithms

Accurate Campaign Targeting Using Classification Algorithms Accurate Campaign Targeting Using Classification Algorithms Jieming Wei Sharon Zhang Introduction Many organizations prospect for loyal supporters and donors by sending direct mail appeals. This is an

More information

Logistic Regression with Expert Intervention

Logistic Regression with Expert Intervention Smart Cities Symposium Prague 2016 1 Logistic Regression with Expert Intervention Pavla Pecherková and Ivan Nagy Abstract This paper deals with problem of analysis of traffic data. A traffic network has

More information

A Survey on Recommendation Techniques in E-Commerce

A Survey on Recommendation Techniques in E-Commerce A Survey on Recommendation Techniques in E-Commerce Namitha Ann Regi Post-Graduate Student Department of Computer Science and Engineering Karunya University, India P. Rebecca Sandra Assistant Professor

More information

Predicting Restaurants Rating And Popularity Based On Yelp Dataset

Predicting Restaurants Rating And Popularity Based On Yelp Dataset CS 229 MACHINE LEARNING FINAL PROJECT 1 Predicting Restaurants Rating And Popularity Based On Yelp Dataset Yiwen Guo, ICME, Anran Lu, ICME, and Zeyu Wang, Department of Economics, Stanford University Abstract

More information

Predictive Analytics Cheat Sheet

Predictive Analytics Cheat Sheet Predictive Analytics The use of advanced technology to help legal teams separate datasets by relevancy or issue in order to prioritize documents for expedited review. Often referred to as Technology Assisted

More information

Near-Balanced Incomplete Block Designs with An Application to Poster Competitions

Near-Balanced Incomplete Block Designs with An Application to Poster Competitions Near-Balanced Incomplete Block Designs with An Application to Poster Competitions arxiv:1806.00034v1 [stat.ap] 31 May 2018 Xiaoyue Niu and James L. Rosenberger Department of Statistics, The Pennsylvania

More information

A Personalized Company Recommender System for Job Seekers Yixin Cai, Ruixi Lin, Yue Kang

A Personalized Company Recommender System for Job Seekers Yixin Cai, Ruixi Lin, Yue Kang A Personalized Company Recommender System for Job Seekers Yixin Cai, Ruixi Lin, Yue Kang Abstract Our team intends to develop a recommendation system for job seekers based on the information of current

More information

Resolution of Chemical Disease Relations with Diverse Features and Rules

Resolution of Chemical Disease Relations with Diverse Features and Rules Resolution of Chemical Disease Relations with Diverse Features and Rules Dingcheng Li*, Naveed Afzal*, Majid Rastegar Mojarad, Ravikumar Komandur Elayavilli, Sijia Liu, Yanshan Wang, Feichen Shen, Hongfang

More information

Supply Chain Network Design under Uncertainty

Supply Chain Network Design under Uncertainty Proceedings of the 11th Annual Conference of Asia Pacific Decision Sciences Institute Hong Kong, June 14-18, 2006, pp. 526-529. Supply Chain Network Design under Uncertainty Xiaoyu Ji 1 Xiande Zhao 2 1

More information

OntoNaviERP: Ontology-supported Navigation in ERP Software Documentation

OntoNaviERP: Ontology-supported Navigation in ERP Software Documentation OntoNaviERP: Ontology-supported Navigation in ERP Software Documentation 1,2 and Andreas Wechselberger 1 1 E-Business and Web Science Research Group, Bundeswehr University Munich, Germany 2 STI Innsbruck,

More information

Quantitative Analysis

Quantitative Analysis Chapter 16 Quantitative Analysis Danilo Bonardi, Luís Botelho, Matthias Klusch, António L. Lopes, Thorsten Möller, Alexandre de Oliveira e Sousa, and Matteo Vasirani 16.1 Introduction The different software

More information

Trend Extraction Method using Co-occurrence Patterns from Tweets

Trend Extraction Method using Co-occurrence Patterns from Tweets Information Engineering Express International Institute of Applied Informatics 2016, Vol.2, No.4, 1 10 Trend Extraction Method using Co-occurrence Patterns from Tweets Shotaro Noda and Katsuhide Fujita

More information

Product improvement based on online reviews from product designer s perspective

Product improvement based on online reviews from product designer s perspective IOP Conference Series: Materials Science and Engineering PAPER OPEN ACCESS Product improvement based on online reviews from product designer s perspective To cite this article: Shugang Li and Jiali Kong

More information

The Wisdom Of the Crowds: Enhanced Reputation-Based Filtering

The Wisdom Of the Crowds: Enhanced Reputation-Based Filtering The Wisdom Of the Crowds: Enhanced Reputation-Based Filtering Jason Feriante feriante@cs.wisc.edu CS 761 Spring 2015 Department of Computer Science University of Wisconsin-Madison Abstract Crowdsourcing

More information

SOCIALQ&A: AN ONLINE SOCIAL NETWORK BASED QUESTION AND ANSWER SYSTEM

SOCIALQ&A: AN ONLINE SOCIAL NETWORK BASED QUESTION AND ANSWER SYSTEM SOCIALQ&A: AN ONLINE SOCIAL NETWORK BASED QUESTION AND ANSWER SYSTEM AUTHORS: H. SHEN, G. LIU, H. WANG, AND N. VITHLANI PRESENTED BY: NICOLE MCNABB OVERVIEW Introduction Design of SocialQ&A Security and

More information

HeteroSales: Utilizing Heterogeneous Social Networks to Identify the Next Enterprise Customer

HeteroSales: Utilizing Heterogeneous Social Networks to Identify the Next Enterprise Customer HeteroSales: Utilizing Heterogeneous Social Networks to Identify the Next Enterprise Customer Qingbo Hu qhu5@uic.edu Qiang Zhu qzhu@linkedin.com Sihong Xie sxie6@uic.edu Songtao Guo soguo@linkedin.com

More information

Proposal for ISyE6416 Project

Proposal for ISyE6416 Project Profit-based classification in customer churn prediction: a case study in banking industry 1 Proposal for ISyE6416 Project Profit-based classification in customer churn prediction: a case study in banking

More information

REAL-TIME CRASH RISK ANALYSIS OF URBAN ARTERIALS INCORPORATING BLUETOOTH, WEATHER, AND ADAPTIVE SIGNAL CONTROL DATA

REAL-TIME CRASH RISK ANALYSIS OF URBAN ARTERIALS INCORPORATING BLUETOOTH, WEATHER, AND ADAPTIVE SIGNAL CONTROL DATA Yuan et al. 1 18-00590 REAL-TIME CRASH RISK ANALYSIS OF URBAN ARTERIALS INCORPORATING BLUETOOTH, WEATHER, AND ADAPTIVE SIGNAL CONTROL DATA Jinghui Yuan (Corresponding Author), M.S., PhD student Department

More information

An Efficient and Effective Immune Based Classifier

An Efficient and Effective Immune Based Classifier Journal of Computer Science 7 (2): 148-153, 2011 ISSN 1549-3636 2011 Science Publications An Efficient and Effective Immune Based Classifier Shahram Golzari, Shyamala Doraisamy, Md Nasir Sulaiman and Nur

More information

Using Decision Tree to predict repeat customers

Using Decision Tree to predict repeat customers Using Decision Tree to predict repeat customers Jia En Nicholette Li Jing Rong Lim Abstract We focus on using feature engineering and decision trees to perform classification and feature selection on the

More information

Prediction of Success or Failure of Software Projects based on Reusability Metrics using Support Vector Machine

Prediction of Success or Failure of Software Projects based on Reusability Metrics using Support Vector Machine Prediction of Success or Failure of Software Projects based on Reusability Metrics using Support Vector Machine R. Sathya Assistant professor, Department of Computer Science & Engineering Annamalai University

More information

Multi-factor Stock Selection Model Based on Adaboost

Multi-factor Stock Selection Model Based on Adaboost Multi-factor Stock Selection Model Based on Adaboost Ru Zhang Finance Department of International Business School, Jinan University Zhuhai, 519070, China E-mail: zhangru@stu2016.nu.edu.cn Tong Cao (Corresponding

More information

Comparative study on demand forecasting by using Autoregressive Integrated Moving Average (ARIMA) and Response Surface Methodology (RSM)

Comparative study on demand forecasting by using Autoregressive Integrated Moving Average (ARIMA) and Response Surface Methodology (RSM) Comparative study on demand forecasting by using Autoregressive Integrated Moving Average (ARIMA) and Response Surface Methodology (RSM) Nummon Chimkeaw, Yonghee Lee, Hyunjeong Lee and Sangmun Shin Department

More information

Quantitative evaluation of non-verbal communication for competence analysis

Quantitative evaluation of non-verbal communication for competence analysis /31 Quantitative evaluation of non-verbal communication for competence analysis Alvaro CEPERO, Albert CLAPÉS, Sergio ESCALERA CCIA 2013 /31 Outline 1 Introduction 2 System Low-level features High-level

More information

Neighbor selection for cold users in collaborative filtering with positive-only feedback

Neighbor selection for cold users in collaborative filtering with positive-only feedback Neighbor selection for cold users in collaborative filtering with positive-only feedback Alejandro Bellogín 1, Ignacio Fernández-Tobías 2, Iván Cantador 1, and Paolo Tomeo 3 1 Universidad Autónoma de Madrid,

More information

A STUDY ON STATISTICAL BASED FEATURE SELECTION METHODS FOR CLASSIFICATION OF GENE MICROARRAY DATASET

A STUDY ON STATISTICAL BASED FEATURE SELECTION METHODS FOR CLASSIFICATION OF GENE MICROARRAY DATASET A STUDY ON STATISTICAL BASED FEATURE SELECTION METHODS FOR CLASSIFICATION OF GENE MICROARRAY DATASET 1 J.JEYACHIDRA, M.PUNITHAVALLI, 1 Research Scholar, Department of Computer Science and Applications,

More information

Comparison of Different Independent Component Analysis Algorithms for Sales Forecasting

Comparison of Different Independent Component Analysis Algorithms for Sales Forecasting International Journal of Humanities Management Sciences IJHMS Volume 2, Issue 1 2014 ISSN 2320 4044 Online Comparison of Different Independent Component Analysis Algorithms for Sales Forecasting Wensheng

More information

Proactive Data Mining Using Decision Trees

Proactive Data Mining Using Decision Trees 2012 IEEE 27-th Convention of Electrical and Electronics Engineers in Israel Proactive Data Mining Using Decision Trees Haim Dahan and Oded Maimon Dept. of Industrial Engineering Tel-Aviv University Tel

More information

Machine learning-based approaches for BioCreative III tasks

Machine learning-based approaches for BioCreative III tasks Machine learning-based approaches for BioCreative III tasks Shashank Agarwal 1, Feifan Liu 2, Zuofeng Li 2 and Hong Yu 1,2,3 1 Medical Informatics, College of Engineering and Applied Sciences, University

More information

Predicting the Odds of Getting Retweeted

Predicting the Odds of Getting Retweeted Predicting the Odds of Getting Retweeted Arun Mahendra Stanford University arunmahe@stanford.edu 1. Introduction Millions of people tweet every day about almost any topic imaginable, but only a small percent

More information

Predicting Corporate Influence Cascades In Health Care Communities

Predicting Corporate Influence Cascades In Health Care Communities Predicting Corporate Influence Cascades In Health Care Communities Shouzhong Shi, Chaudary Zeeshan Arif, Sarah Tran December 11, 2015 Part A Introduction The standard model of drug prescription choice

More information

Automatic Detection of Rumor on Social Network

Automatic Detection of Rumor on Social Network Automatic Detection of Rumor on Social Network Qiao Zhang 1,2, Shuiyuan Zhang 1,2, Jian Dong 3, Jinhua Xiong 2(B), and Xueqi Cheng 2 1 University of Chinese Academy of Sciences, Beijing, China 2 Institute

More information

A Brief History. Bootstrapping. Bagging. Boosting (Schapire 1989) Adaboost (Schapire 1995)

A Brief History. Bootstrapping. Bagging. Boosting (Schapire 1989) Adaboost (Schapire 1995) A Brief History Bootstrapping Bagging Boosting (Schapire 1989) Adaboost (Schapire 1995) What s So Good About Adaboost Improves classification accuracy Can be used with many different classifiers Commonly

More information

CONNECTING CORPORATE GOVERNANCE TO COMPANIES PERFORMANCE BY ARTIFICIAL NEURAL NETWORKS

CONNECTING CORPORATE GOVERNANCE TO COMPANIES PERFORMANCE BY ARTIFICIAL NEURAL NETWORKS CONNECTING CORPORATE GOVERNANCE TO COMPANIES PERFORMANCE BY ARTIFICIAL NEURAL NETWORKS Darie MOLDOVAN, PhD * Mircea RUSU, PhD student ** Abstract The objective of this paper is to demonstrate the utility

More information

Chemical compound and drug name recognition using CRFs and semantic similarity based on ChEBI

Chemical compound and drug name recognition using CRFs and semantic similarity based on ChEBI Chemical compound and drug name recognition using CRFs and semantic similarity based on ChEBI Andre Lamurias, Tiago Grego, and Francisco M. Couto Dep. de Informática, Faculdade de Ciências, Universidade

More information

Evaluating Workflow Trust using Hidden Markov Modeling and Provenance Data

Evaluating Workflow Trust using Hidden Markov Modeling and Provenance Data Evaluating Workflow Trust using Hidden Markov Modeling and Provenance Data Mahsa Naseri and Simone A. Ludwig Abstract In service-oriented environments, services with different functionalities are combined

More information

Applications of Machine Learning to Predict Yelp Ratings

Applications of Machine Learning to Predict Yelp Ratings Applications of Machine Learning to Predict Yelp Ratings Kyle Carbon Aeronautics and Astronautics kcarbon@stanford.edu Kacyn Fujii Electrical Engineering khfujii@stanford.edu Prasanth Veerina Computer

More information

Title: Genome-Wide Predictions of Transcription Factor Binding Events using Multi- Dimensional Genomic and Epigenomic Features Background

Title: Genome-Wide Predictions of Transcription Factor Binding Events using Multi- Dimensional Genomic and Epigenomic Features Background Title: Genome-Wide Predictions of Transcription Factor Binding Events using Multi- Dimensional Genomic and Epigenomic Features Team members: David Moskowitz and Emily Tsang Background Transcription factors

More information

A Field Relevance Model for Structured Document Retrieval

A Field Relevance Model for Structured Document Retrieval A Field Relevance Model for Structured Document Retrieval Jin Young Kim and W. Bruce Croft Center for Intelligent Information Retrieval Department of Computer Science University of Massachusetts Amherst

More information

Does Ad-Context Matter on the Effectiveness of Online Advertising?

Does Ad-Context Matter on the Effectiveness of Online Advertising? Does Ad-Context Matter on the Effectiveness of Online Advertising? Caihong Sun, Meina Zhang, Meiyun Zuo* Key Laboratory of Data Engineering and Knowledge Engineering, School of Information, Renmin University

More information

Fraud Detection for MCC Manipulation

Fraud Detection for MCC Manipulation 2016 International Conference on Informatics, Management Engineering and Industrial Application (IMEIA 2016) ISBN: 978-1-60595-345-8 Fraud Detection for MCC Manipulation Hong-feng CHAI 1, Xin LIU 2, Yan-jun

More information

Personalized Categorization of Financial Transactions

Personalized Categorization of Financial Transactions Personalized Categorization of Financial Transactions Alex Ran Intuit, Inc. Chris Lesner Intuit, Inc. Wei Wang Intuit, Inc Marko Rukonic Intuit, Inc. ABSTRACT An important aspect of =inancial accounting

More information

Study on Talent Introduction Strategies in Zhejiang University of Finance and Economics Based on Data Mining

Study on Talent Introduction Strategies in Zhejiang University of Finance and Economics Based on Data Mining International Journal of Statistical Distributions and Applications 2018; 4(1): 22-28 http://www.sciencepublishinggroup.com/j/ijsda doi: 10.11648/j.ijsd.20180401.13 ISSN: 2472-3487 (Print); ISSN: 2472-3509

More information

Reveal Motif Patterns from Financial Stock Market

Reveal Motif Patterns from Financial Stock Market Reveal Motif Patterns from Financial Stock Market Prakash Kumar Sarangi Department of Information Technology NM Institute of Engineering and Technology, Bhubaneswar, India. Prakashsarangi89@gmail.com.

More information

Determining NDMA Formation During Disinfection Using Treatment Parameters Introduction Water disinfection was one of the biggest turning points for

Determining NDMA Formation During Disinfection Using Treatment Parameters Introduction Water disinfection was one of the biggest turning points for Determining NDMA Formation During Disinfection Using Treatment Parameters Introduction Water disinfection was one of the biggest turning points for human health in the past two centuries. Adding chlorine

More information

Sentiment Analysis and Political Party Classification in 2016 U.S. President Debates in Twitter

Sentiment Analysis and Political Party Classification in 2016 U.S. President Debates in Twitter Sentiment Analysis and Political Party Classification in 2016 U.S. President Debates in Twitter Tianyu Ding 1 and Junyi Deng 1 and Jingting Li 1 and Yu-Ru Lin 1 1 University of Pittsburgh, Pittsburgh PA

More information

Predicting Popularity of Messages in Twitter using a Feature-weighted Model

Predicting Popularity of Messages in Twitter using a Feature-weighted Model International Journal of Advanced Intelligence Volume 0, Number 0, pp.xxx-yyy, November, 20XX. c AIA International Advanced Information Institute Predicting Popularity of Messages in Twitter using a Feature-weighted

More information

HybridRank: Ranking in the Twitter Hybrid Networks

HybridRank: Ranking in the Twitter Hybrid Networks HybridRank: Ranking in the Twitter Hybrid Networks Jianyu Li Department of Computer Science University of Maryland, College Park jli@cs.umd.edu ABSTRACT User influence in social media may depend on multiple

More information

Financial Time Series Segmentation Based On Turning Points

Financial Time Series Segmentation Based On Turning Points Proceedings of 2011 International Conference on System Science and Engineering, Macau, China - June 2011 Financial Time Series Segmentation Based On Turning Points Jiangling Yin, Yain-Whar Si, Zhiguo Gong

More information

Mining a Marketing Campaigns Data of Bank

Mining a Marketing Campaigns Data of Bank Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology ISSN 2320 088X IMPACT FACTOR: 6.017 IJCSMC,

More information

PREDICTING PREVENTABLE ADVERSE EVENTS USING INTEGRATED SYSTEMS PHARMACOLOGY

PREDICTING PREVENTABLE ADVERSE EVENTS USING INTEGRATED SYSTEMS PHARMACOLOGY PREDICTING PREVENTABLE ADVERSE EVENTS USING INTEGRATED SYSTEMS PHARMACOLOGY GUY HASKIN FERNALD 1, DORNA KASHEF 2, NICHOLAS P. TATONETTI 1 Center for Biomedical Informatics Research 1, Department of Computer

More information

Exploring Similarities of Conserved Domains/Motifs

Exploring Similarities of Conserved Domains/Motifs Exploring Similarities of Conserved Domains/Motifs Sotiria Palioura Abstract Traditionally, proteins are represented as amino acid sequences. There are, though, other (potentially more exciting) representations;

More information

Business Capability-centric Management of Services and Process Models (Extended Abstract)

Business Capability-centric Management of Services and Process Models (Extended Abstract) Business Capability-centric Management of Services and Process Models (Extended Abstract) Wassim Derguech Insight Centre for Data Analytics - National University of Ireland, Galway wassim.derguech@insight-centre.org

More information

Machine Learning Models for Sales Time Series Forecasting

Machine Learning Models for Sales Time Series Forecasting Article Machine Learning Models for Sales Time Series Forecasting Bohdan M. Pavlyshenko SoftServe, Inc., Ivan Franko National University of Lviv * Correspondence: bpavl@softserveinc.com, b.pavlyshenko@gmail.com

More information

E-Commerce Sales Prediction Using Listing Keywords

E-Commerce Sales Prediction Using Listing Keywords E-Commerce Sales Prediction Using Listing Keywords Stephanie Chen (asksteph@stanford.edu) 1 Introduction Small online retailers usually set themselves apart from brick and mortar stores, traditional brand

More information

An Improved Contract Net Protocol with Multi-Agent for Reservoir Flood Control Dispatch

An Improved Contract Net Protocol with Multi-Agent for Reservoir Flood Control Dispatch Journal of Water Resource and Protection, 2011, 3, 735-746 doi:10.4236/jwarp.2011.310084 Published Online October 2011 (http://www.scirp.org/journal/jwarp) An Improved Contract Net Protocol with Multi-Agent

More information

THREE LEVEL HIERARCHICAL BAYESIAN ESTIMATION IN CONJOINT PROCESS

THREE LEVEL HIERARCHICAL BAYESIAN ESTIMATION IN CONJOINT PROCESS Please cite this article as: Paweł Kopciuszewski, Three level hierarchical Bayesian estimation in conjoint process, Scientific Research of the Institute of Mathematics and Computer Science, 2006, Volume

More information

A Protein Secondary Structure Prediction Method Based on BP Neural Network Ru-xi YIN, Li-zhen LIU*, Wei SONG, Xin-lei ZHAO and Chao DU

A Protein Secondary Structure Prediction Method Based on BP Neural Network Ru-xi YIN, Li-zhen LIU*, Wei SONG, Xin-lei ZHAO and Chao DU 2017 2nd International Conference on Artificial Intelligence: Techniques and Applications (AITA 2017 ISBN: 978-1-60595-491-2 A Protein Secondary Structure Prediction Method Based on BP Neural Network Ru-xi

More information

Evaluation of Logistic Regression Model with Feature Selection Methods on Medical Dataset

Evaluation of Logistic Regression Model with Feature Selection Methods on Medical Dataset Evaluation of Logistic Regression Model with Feature Selection Methods on Medical Dataset Abstract Raghavendra B. K., Dr. M.G.R. Educational and Research Institute, Chennai600 095. Email: raghavendra_bk@rediffmail.com,

More information

A Weighted Tag Similarity Measure Based on a Collaborative Weight Model

A Weighted Tag Similarity Measure Based on a Collaborative Weight Model A Weighted Tag Similarity Measure Based on a Collaborative Weight Model G.R.J.Srinivas Niket Tandon Search and Information Max Planck Institute, Extraction Lab, IIIT Hyderabad, Germany India ntandon@mpi-inf.mpg.de

More information

Churn Prediction for Game Industry Based on Cohort Classification Ensemble

Churn Prediction for Game Industry Based on Cohort Classification Ensemble Churn Prediction for Game Industry Based on Cohort Classification Ensemble Evgenii Tsymbalov 1,2 1 National Research University Higher School of Economics, Moscow, Russia 2 Webgames, Moscow, Russia etsymbalov@gmail.com

More information

Predicting Replacement of Smartphones with Mobile App Usage

Predicting Replacement of Smartphones with Mobile App Usage Predicting Replacement of Smartphones with Mobile App Usage Dun Yang 1, Zhiang Wu 1(B), Xiaopeng Wang 2, Jie Cao 1, and Guandong Xu 3 1 School of Info. Engineering, Nanjing University of Finance and Economics,

More information

Web Customer Modeling for Automated Session Prioritization on High Traffic Sites

Web Customer Modeling for Automated Session Prioritization on High Traffic Sites Web Customer Modeling for Automated Session Prioritization on High Traffic Sites Nicolas Poggi 1, Toni Moreno 2,3, Josep Lluis Berral 1, Ricard Gavaldà 4, and Jordi Torres 1,2 1 Computer Architecture Department,

More information

Integrating New Cost Model into HMA-Based Grid Resource Scheduling

Integrating New Cost Model into HMA-Based Grid Resource Scheduling Integrating New Cost Model into HMA-Based Grid Resource Scheduling Jun-yan Zhang, Fan Min, and Guo-wei Yang College of Computer Science and Engineering, University of Electronic Science and Technology

More information

Predicting Corporate 8-K Content Using Machine Learning Techniques

Predicting Corporate 8-K Content Using Machine Learning Techniques Predicting Corporate 8-K Content Using Machine Learning Techniques Min Ji Lee Graduate School of Business Stanford University Stanford, California 94305 E-mail: minjilee@stanford.edu Hyungjun Lee Department

More information

CS246: Mining Massive Datasets Jure Leskovec, Stanford University

CS246: Mining Massive Datasets Jure Leskovec, Stanford University CS246: Mining Massive Datasets Jure Leskovec, Stanford University http://cs246.stanford.edu 3/5/18 Jure Leskovec, Stanford C246: Mining Massive Datasets 2 High dim. data Graph data Infinite data Machine

More information

From Ordinal Ranking to Binary Classification

From Ordinal Ranking to Binary Classification From Ordinal Ranking to Binary Classification Hsuan-Tien Lin Learning Systems Group, California Institute of Technology Talk at Caltech CS/IST Lunch Bunch March 4, 2008 Benefited from joint work with Dr.

More information

MISSING DATA CLASSIFICATION OF CHRONIC KIDNEY DISEASE

MISSING DATA CLASSIFICATION OF CHRONIC KIDNEY DISEASE MISSING DATA CLASSIFICATION OF CHRONIC KIDNEY DISEASE Wala Abedalkhader and Noora Abdulrahman Department of Engineering Systems and Management, Masdar Institute of Science and Technology, Abu Dhabi, United

More information

Statistical Methods for Network Analysis of Biological Data

Statistical Methods for Network Analysis of Biological Data The Protein Interaction Workshop, 8 12 June 2015, IMS Statistical Methods for Network Analysis of Biological Data Minghua Deng, dengmh@pku.edu.cn School of Mathematical Sciences Center for Quantitative

More information

A Bayesian Predictor of Airline Class Seats Based on Multinomial Event Model

A Bayesian Predictor of Airline Class Seats Based on Multinomial Event Model 2016 IEEE International Conference on Big Data (Big Data) A Bayesian Predictor of Airline Class Seats Based on Multinomial Event Model Bingchuan Liu Ctrip.com Shanghai, China bcliu@ctrip.com Yudong Tan

More information

TEST CASE PRIORITIZATION USING FUZZY LOGIC BASED ON REQUIREMENT PRIORITIZING

TEST CASE PRIORITIZATION USING FUZZY LOGIC BASED ON REQUIREMENT PRIORITIZING TEST CASE PRIORITIZATION USING FUZZY LOGIC BASED ON REQUIREMENT PRIORITIZING Usha Badhera 1 and Debarupa Biswas 2 Computer Science Department, Banasthali University, India 1 ushas133@yahoo.com, 2 biswas.debarupa@gmail.com

More information