Identifying Implicit Relationships between Social Media Users to Support Social Commerce

Size: px
Start display at page:

Download "Identifying Implicit Relationships between Social Media Users to Support Social Commerce"

Transcription

1 Identifying Implicit Relationships between Social Media Users to Support Social Commerce Christopher C. Yang Haodong Yang ABSTRACT The Internet is an ideal platform for business-to-consumer (B2C) and business-to-business (B2B) electronic commerce where businesses and consumers conduct commerce activities such as searching for consumer products, promoting business, managing supply chain and making electronic transactions. With the advance of Web 2.0 technologies and the popularity of social media sites, social commerce offers new opportunities of social interaction between electronic commerce consumers as well as social interaction between consumers and e-retailers. The user contributed content provides a tremendous amount of information that may assist in electronic commerce services. Social network analysis and mining has been a powerful tool for electronic commerce vendors and marketing companies to understand the user behavior which is useful for identifying potential customers of their products. However, the capability of social network analysis and mining diminishes when the social network data is incomplete, especially when there are only limited ties available. The social networks extracted from explicit relationships in social media are usually sparse. Many social media users who have similar interest may not have direct interactions with one another or purchase the same products. Therefore, the explicit relationships between electronic commerce users are not sufficient to construct social networks for effective social network analysis and mining. In this work, we propose the temporal analysis techniques to identify implicit relationships for enriching the social network structure. We have conducted an experiment on Digg.com, which is a social media site for users to discover and share content from anywhere of the Web. The experiment shows that the temporal analysis techniques outperform the baseline techniques that only rely on explicit relationships. Categories and Subject Descriptors H.2.8 [Database Management]: Database applications---data mining; H.3.3 [Information Storage and Retrieval]: Information Search and Retrieval; H.5.4 [Information Interfaces and Presentation]: Hypertext/Hypermedia; I.5.4 [Pattern Recognition]: Applications ICEC '12, August , Singapore, Singapore Copyright 2012 ACM /12/08 $ Xuning Tang xt24@drexel.edu Ling Jiang General Terms ling.jiang@drexel.edu Algorithms, Measurement, Experimentation, Human Factors Keywords User Activeness, Explicit Relationship, Implicit Relationship, Social Media Analytics, Web 2.0, Temporal Analysis, Spectral Analysis, Spectral Coherence, Social Commerce, Social Network Analysis and Mining 1. INTRODUCTION The Internet is an ideal platform for business-to-consumer (B2C) and business-to-business (B2B) electronic commerce where businesses and consumers conduct commerce activities such as searching for consumer products, promoting business, managing supply chain and making electronic transactions. The success of many electronic commerce retailers such as Amazon and E-Bay has proven that the Internet is an ideal platform for electronic commerce. In recent years, following the success of MySpace, Facebook and Twitter, social media has drawn significant attention in electronic commerce. It is important to identify how we may use social media to facilitate the electronic commerce services and stimulate the electronic commerce transactions [21][22][24]. Social commerce, which is considered as part of electronic commerce, is drawing increasing interest from the academic and industry in developing new theories and technologies to understand the user behavior of social media and extracting knowledge from the user-contributed content and social network structure. Social network analysis and mining is powerful in identifying influential users, discovering special interest subgroups, determining user roles, and understanding community evolution [20][23][25][26]. These techniques help us to understand the development of electronic commerce user interest for matching products with potential consumers. However, social network analysis and mining is useful in achieving these facilitations only if we have a social network that captures the ties between actors as input [20][23]. In conducting social network analysis and mining on social media, we always face a challenge of capturing the user relationships of common interest. Social media is a large space that involves a tremendous number of users and interactions. The explicit interactions between users may only

2 represent part of the user relationships but many users who have common interests are not necessarily interacting with each other explicitly. The user relationship can be classified as two types of relationships, namely explicit relationship and implicit relationship [19]. On one hand, the explicit relationship is defined as the relationship that can be captured by the user direct interaction through the functions in social media sites. For example, a Web forum user A makes a comment on a post made by another user B; a Twitter user A retweets a tweet posted by another Twitter user B; a Facebook user A accept a friend request from another Facebook user B. These explicit interactions between two social media users indicate a relationship between them because of their common interest in a particular content/action or an intention of building a close relationship. On the other hand, the implicit relationship corresponds to the relationship drawn from the user coherent interest or activity that cannot be traced by any public records of interactions between two users in a social media site. Extracting implicit relationships between users is relatively more challenging. Two users who have common interest on certain content are behaving similarly to follow the related events; however, these two users may not have any direct interaction with each other. For example, two users who are interested in a particular movie may post comments or tweets about their opinions on the movie when they see the trailers or news of this movie on the Web. However, these two users may not have any direct interactions to comment on each other s opinions. From the electronic commerce perspective, identifying the implicit relationship between users provide valuable information in marketing and recommending consumer products to potential consumers that overcome the sparseness of ties in social network constructed by explicit relationships. In this paper, we propose the temporal analysis approach to address the problem of identifying implicit relationships between online social media users. An experiment is conducted to compare the performance of the proposed implicit relationship identification techniques with the explicit relationship extractions. It shows that the proposed implicit relationship identification techniques obtain a higher F-1 measure value. 2. RELATED WORK In business applications, social network is widely used to represent customer relationships, buyer-seller relationships or buyer-supplier relationships. For instance, in a customer relationship management system, customers and their relationships can be represented by a social network where each node denotes a customer and each edge corresponds to a relationship between two customers. In most of the situations, the network structure, especially for the links of a network, are declared by the users. For example, in online social media sites, such as Facebook and Twitter, users can explicitly follow each other or accept a friendship request from another user. However, these explicit relationships are not always publically available. As a result, there is a research need to acquire implicit relationships between users. Link prediction is a related research topic of inferring implicit relationships among users. Liben-Nowell and Kleinberg defined the link prediction problem as inferring missing interactions among users which are highly probable to occur in the future [11]. They further summarized the existing link prediction approaches into neighborhood-based link prediction approach and pathensemble based link prediction approach [11]. For neighborhoodbased link prediction approach, Newman computed the probability of collaboration between two users by counting the common neighbors of two nodes in a collaborative network [16]; Adamic and Adar employed a score function which refines the simple counting of common neighbors by weighting rarer neighbors more heavily [1]. Different from the neighborhoodbased link prediction approaches, the path-ensemble based approach take into account the distance between two nodes. Katz proposed a formula to measure the probability of future link which sums over the paths between two nodes by weighting shorter paths more heavily [10]. Liben-Nowell and Kleinberg proposed a normalized and symmetric version of hitting time to tackle the link prediction problem [11], where hitting time was defined as the expected time in a random walk to reach a node starting from another node. A shorter hitting time corresponds to a higher proximity between two users. Besides the neighborhood and path-ensemble based approaches, some recent works focused on training a classifier which takes proximity, aggregation and topological features into account to conduct link prediction [2, 12]. However, most of the existing link prediction techniques need a social network as an initial input and their objective is predicting future interactions given the current network snapshot. In our problem, we assume such an initial social network is unavailable because explicit connections between users are missing or incomplete. Instead, we utilize temporal analysis to identify the implicit relationship between social media users. Some previous techniques discover implicit relationships based on user similarity. McPherson et al. introduced the notion of homophily and considered that social relationships are likely to form between people of similar characteristics [13]. Provost et al. introduced methods for extracting quasi-social network from data on visitations to social networking pages [17]. A link is placed between two users in this quasi-social network depending on their visits to the common web pages. Similarly, proximity can also be interpreted as interaction in physical spaces [4]. However, this explicit quasi-relationship is not always available in all social media sites. In this work, we propose to use only temporal data to extract implicit relationship regardless of their specific activities such as visiting the same web pages or commenting on the same web objects. In this work, we propose to represent user activeness by a time series, namely user feature vector, and investigate user behavior by spectral analysis techniques. Spectral coherence score between two user feature vectors is used to represent their potential relationships. Signal processing techniques have been investigated in a wide variety of fields including genetics [15], economics [7], and neuroscience [6, 9]. Signal processing techniques were also applied to extract semantically related search engine queries [5] and cluster words in news streams to detect popular events [8]. In this work, we employ the signal processing techniques to analyze the user frequency of social medial activities and determine the implicit relationship between any two users even if these two users do not have any explicit interaction or relationship. 3. METHOD 3.1 Temporal Analysis Direct user interactions may only be useful in extracting the explicit relationships but are not adequate to identify the implicit relationships. Indeed, the implicit relationship cannot be determined by simply tracing the one-to-one interaction between users. By identifying and integrating implicit relationships, we

3 are able to obtain a more comprehensive view and achieve better social media analytic performance. In this section, we propose the temporal analysis research framework to identify implicit relationship from online social media Temporal Coherence Analysis General speaking, web contents nowadays are generated mostly by web users. External events, such as the release of new iphone 4s or the anniversary sale of a famous brand, will trigger a mass of web contents contributed by web users with common interest. As a result, even though two users do not explicit interact with each other, as long as they react similarly to common external events, it is possible that these two users might share common interest or have an implicit relationship. In other words, we assume that an implicit relationship exists between two users when their daily activeness has strong temporal coherence. The objective of temporal coherence analysis in this section is calculating the temporal similarities between any pair of users, which will result in a similarity matrix to represent the strength of implicit relationships between users. To quantify the similarity between any pair of users, say i and j, we first represent them by user feature vectors, then we compute auto-spectrum for each individual vector and cross-spectrum of i and j, finally the spectral coherence of i and j is employed to quantify their similarity. User Feature Vector. In our work, users are represented by vectors. Given any online social media, let T be the period (in days) during which we investigate user behavior and interaction. For each user, we represent his/her activeness by a vector defined as below: (1) where each element represents the activeness of user on the i th day. can be defined in a very flexible manner according to the context. Several attributes can be used to quantify user activeness depending on which online social media we are studying, for example, the number of messages a user post daily, the number of URL a user tag daily, the number of tweets a user post or retweet daily, or the number of video a user click and comment daily. In a simple manner, can be defined as: where is the number of messages posted by user u on day, is the total number of messages posted by all users including u on day. It is worth mentioning that can be defined in more delicate ways when more user information is given. In this paper, we focus our research on studying the implicit relationships among users in online social media (data from Digg.com will be used as our test bed in the experiment). Given all the user contributed content or user actions, we define as: (2) (3) where is the number of messages (stories or comments) contributed by user u on day, is the total number of messages contributed by all users including u on day, is the total number of unique threads that user participated (submitted or made comments to) over time, and S is the total number of unique threads over T. Equation (3) consists of two components: and. measures the number of messages that user u contributes on day normalized by the total number of messages of day. The higher is, the more messages that u contributes, leading to a larger activeness of u at day. penalizes a user if s/he participated in too many different stories which may imply that this user does not have a specific focus. In this study, we consider a group of m users form an m- dimensional multivariate process. By employing the user feature vector defined above, the m-dimensional multivariate process can be denoted as A, where each row denotes a user and each column indicates the activeness of these users on a specific day: ( ) (4) Spectral Analysis User behaviors in terms of multivariate time series are often rich in oscillatory content, leading them naturally to spectral analysis. By computing the spectrum of user, we can quantify the overall activeness of a user in online social media. To calculate spectral estimates of users (auto- or cross-spectrum and coherence), we first perform a Fourier transform on each. Given a finite length user feature vector of discrete time process, t=1, 2,, T, the Fourier transform of the data sequence is defined as follow: (5) A simple estimate of the spectrum is taking the square of the Fourier transform of the data sequence, i.e.,. However, this suffers from the difficulties of bias and leakage. To resolve these issues, we applied multitaper technique to obtain smoother Fourier-based spectral density with reduced estimation bias [14]. In the multitaper technique, we apply K tapers successively to the i th user feature vector and take the Fourier transform: (6) where (k = 1, 2,, K) represent K orthogonal taper functions with appropriate properties. A particular choice of these taper functions, with optimal leakage properties, is given by the discrete prolate spheroidal sequences (DPSS) [18]. The multitaper estimates for the spectrum is defined as: (7) Since the spectrum score of each user varies with different frequency, which implies that a user performs differently at different period, in this work, we define the dominant power spectrum of user as its maximum spectrum value across all potential frequencies: ( ) (8) which can be used to represent the overall activeness of user in social media. Similarly, the cross-spectrum between user behavior processes and is defined as follows: (9)

4 where denotes the complex-conjugate transpose of. We then have the spectral density matrix for the multivariate processes as: ( ) (10) with the off-diagonal elements representing cross-spectrum and diagonal elements representing auto-spectrum Measure Users Similarity by Spectral Coherence In this work, we quantify the similarity of two user feature vectors by using spectral coherence. Spectral coherency for any pair of user feature vectors and is calculated as: (11) Spectral coherency is a complex quantity which provides estimation of the strength of coupling between two processes. Its absolute value is called spectral coherence with ranges from 0 to 1. A spectral coherence value of 1 indicates that the two signals have a constant phase relationship, and a value of 0 indicates the absence of any phase relationship. Although correlation can also indicate the coupling between two user feature vectors, we choose spectral coherence here since it does not only tell us how similar two user feature vectors are, but also informs us that at which frequency these two user feature vectors are similar. In this study, we obtain an overall spectral coherence score for each pair of users by summing over their spectral coherence value in different frequency, so that we have (12) It is important to note that this spectral coherence score between two users is employed to represent the similarity of these two users across all frequencies. It can be used in many practical applications Filtering Algorithm Given a similarity matrix of users, in this work, we use two kinds of filtering algorithms to construct networks. One is Mutual Top- K Filtering and the other one is simple Thresholding. Mutual Top-K Filtering Given a similarity matrix, for each user, we sort the similarity scores between i and all other users in the descending order and retrieve users that have the top similarity score with, denoted as. Parameter k is provided as an input, which is a percentage, indicating the proportion of users retrieved. As a result, has relatively higher similarity with users in. Secondly, for each users in, we check whether also belongs to to ensures that has relatively high similarity with too. If it is true, we retain j in, otherwise we remove j from. We ensure that each user will be associated to at least one other user. As a result, if is an empty set, we correlate with the user p that has the highest. At last, except for the relationships in Candidate(i) where i from 1 to N, we set all other elements in the similarity matrix equal to zero. By considering the original similarity matrix as a fully connected network, this filtering step removes edges with relatively lower weights (similarity scores) and constructs a network. Below is the pseudo code of the Mutual Top-K Filtering Algorithm, where N denotes the total number of users: Mutual Top-K Filtering Algorithm INPUT: Spectral Coherence scores of any pair of N users, ranking threshold OUTPUT: clusters of users /*Filtering*/ 1: for each users 2: Sort spectral coherence score in descending order, where 3: Assign users with top spectral coherence score to // stores users //having high coherence with ; 4: end for 5: for each users 6: for each user in 7: if ( ) then 8: retain in ; 9: else remove from 10: end for 11: if 12: Assign user p who has the highest coherence with to 13: end if 14: end for Thresholding As Mutual Top-K Filtering, for each user, we also sort the similarity scores between i and all other users in the descending order. However, in this method, after calculating the similarities for each pair of users, we only retain the pairs whose similarity are larger than a predetermined threshold and remove the rest. 4. EXPERIMENT 4.1 Dataset In this experiment, we use the social media site, Digg.com, as testbed. Digg.com provides a platform for Web users to submit stories for discussions. Stories (or news) are submitted by Digg.com users. Users who find interest in a story can make comments or dig this story. For each story in Digg.com, we collected its story ID, submitter user ID, IDs of all the comments of this story, user IDs of all commenter, and the corresponding timestamps and saved in our dataset. We utilized the Digg API to collect the dataset from Digg.com. We arbitrary selected five popular newswires and all stories of these newswires submitted by Digg users to Digg.com for discussions were collected. These five news sources were: CNN, BBC, NPR, The Washington Post, and Yahoo! News. The whole dataset spanned 3 months from March 1st 2011 to May 31st During these three months, we collected all the top news (which are defined and recommended by Digg.com), comments of all these stories, all user information and timestamp. Specifically, for each story that we collected, we recorded its story ID, submitter ID, brief abstract of the story, all comments and the commenter s IDs, and the corresponding timestamps for both the story and the comments. In total, there were 12,742 stories, 13,531 comments (only 590 stories have comments) and 286 active users (we

5 defined the users who were active in more than 10 days during this 3 months period to be active users and filtered the inactive ones). 4.2 Gold Standard To evaluate the performance of the proposed techniques, we recruited two human annotators to analyze the dataset and generate a Gold Standard. The generation of gold standard consists of the following steps. 1) 50 pairs of Digg users in the dataset were selected randomly, 2) the human annotators independently examined all the stories and comments submitted by each pair of users, identified the subjects of each story or comment, and determined the areas that each user was most interested in, 3) the annotators independently identified if a relationship of common interest existed between each pair of users. To determine the reliability of the Gold Standard produced by the two human annotators, we used the weighted Kappa measure to compute the inter-rater agreement. Weighted Kappa measure is a statistical measure extended from Kappa measure for computing the agreement between two ordered lists. It has a maximum value of 1 when there is perfect agreement between two raters and a value of 0 when the agreement is not better than by chance. In general, a weighted kappa measure value larger than 0.8 is considered a very good agreement between the two raters [3]. In our experiment, the weighted Kappa measure was 0.84, which means the two annotators had a very good agreement and the Gold Standard was reliable for the experiment. 4.3 Baseline Method To test the effectiveness of our proposed techniques, we compared our proposed technique with two baseline methods, 1 N and N N, which only consider explicit interactions. For each story in the dataset, we had user ID of the submitter and the Digg users who made comments on this story. The 1 N method constructs a 1 to N network where each link corresponds to an interaction between the submitter and a commenter of the story. In 1 N method, we only considered the explicit relationship between two users when the submitter of a story received a comment from another Digg user. The N N method constructed a fully connected network where any two users within a same story, including story submitter and all commenters, were connected by links. In N N method, in addition to the submit and comment explicit interaction, we consider an explicit relationship occurring between two users when they are commenting on the same story that both of them are interested in. 4.4 Measurement In our experiment, precision, recall, and F-1 measures are used as the metrics to evaluate the performance of the proposed techniques. These metrics are measured in terms of four parameters true positive (TP), false positive (FP), true negative (TN) or false negative (FN) as illustrated in Table 1. Table 1 TP, FP, TN and FN Temporal Analysis Algorithms Human Annotators Yes No Yes TP FN Yes denotes that the relationship is determined by an algorithm or the human annotators; No denotes that there is no relationship between the two users determined by an algorithm or the human annotators. If a relationship is determined by both an algorithm and the human annotators, it is a TP. The formulations of Precision, Recall and F1-Measure are presented below. Precision: It is defined as the number of TP divided by the number of TP plus FP. Precision = TP / (TP+FP) Recall: It is defined as the number of TP divided by the number of TP plus FN. Recall = TP / (TP+FN) F1-Measure: It is a harmonic mean of precision and recall which is defined as: F1-Measure = 2 Precision Recall / (Precision + Recall) 4.5 Results In this experiment, we used the results of Digg users relationships generated by human annotators as the Gold Standard. We applied the baseline approach and our techniques on the same dataset separately to generate the results of the relationships among Digg users. Both the 1 N and N N baseline methods were employed to identify Explicit Relationship (ER), whereas both of the proposed Thresholding and Mutual Top-K Filtering approaches were used to identify the Implicit Relationship (IR). The results produced by each approach were compared with the Gold Standard respectively. Table 2 demonstrates the Recall and Precision trend, and Figure 1 presents the F1 measures of each approach. Table 2 Recall-Precision IR (Thresholding) IR (Mutual Top-K Filtering) Threshold Recall Precision Recall Precision ER(1 N) ER (N N) Recall Precision Recall Precision No FP TN

6 F1 Measure Thresholding Baseline 1XN Threshold Figure 1. F1 Measure Mutual Top-K Filtering Baseline NXN As shown in Table 2, both Thresholding and Mutual Top-K Filtering approaches outperform the two baseline methods as the threshold increased, but the difference between the Thresholding and Mutual Top-K Filtering is not substantial. The baseline methods, either 1 N or N N, achieve very high precision (i.e Precision = 1.00 for 1 N and Precision = 0.83 for N N). This is because the number of explicit relationships that can be identified by 1 N and N N is small but the extracted explicit relationships are true relationships. However, the recall is extremely poor (i.e. Recall = 0.04 for 1 N and Recall = 0.22 for N N). This is because many true relationships cannot be extracted by the explicit relationships. On the other hand, the Thresholding and Mutual Top-K Filtering suffer in lower precision but they achieve substantially higher recall. That means the implicit relationships identification can extract substantially more true relationships; however, it also extract more false relationships. Figure 1 shows that both Thresholding and Mutual Top-K Filtering obtain better results than 1 N baseline method in F1 measure. Thresholding is superior to N N baseline method in F1 measure with the threshold larger than 0.2, while Mutual Top-K Filtering with threshold larger than And the difference increases as the threshold increases. By identifying the implicit relationships among Digg users, our proposed techniques can identify user relationship better than relying on explicit user relationships. The two baseline approaches based on explicit interactions achieved relatively higher precision but lower recall, which means that an explicit interaction between web users is an effective indicator to determine if two users have common interest. Unfortunately, these explicit interactions can only capture a small percentage of potential relationships between web users. Indeed, using the explicit interactions can only recall less than 22% of user relationship of common interest. In this work, we propose the temporal analysis technique to identify the implicit relationships. As shown in Fig. 1, using F-1 measure, our proposed technique consistently outperformed the 1 N baseline. In addition, as the threshold increases, these two proposed approaches also substantially outperformed the N N baseline method. However, there is a tradeoff of the proposed technique. It recalls substantially more relationships but it achieves lower precisions. That means it extracts more false positives while it can extract substantially more true positives. The true positives are considered important when we use the extracted relationships to recommend social media users to promote interactions. The social media sites have a large volume of user contributed content as well as a large number of users. It is impossible for any social media users to follow the relevant information from such a huge content space and user space. Each social media user is only aware of the existence of a relatively small number of users who have the common interest. It explains why there are so many isolated communities in social media. Unless there is a good recommendation system to feed the social media users about other users of common interests, the interactions in social media will still be limited. Identifying the true positives is important to connect the users and communities together even with the sacrifice of offering false positives. It takes the user effort to filter the false positives but the users can identify substantially more potential relationship that they are not aware of otherwise. On the other hand, if a recommendation system has a high precision (few false positives) but it only extracts very few true positives, the utility of the recommendations remains limited. 5. CONCLUSION Due to the popularity of social media, social commerce has drawn substantial attention in both academic and industry. Social commerce offers new opportunities for electronic commerce companies to collect user data for understanding their behavior as well as the market. A social network is composed of actors (social media users) and ties (user relationships). Social network analysis and mining techniques support social commerce by identifying potential consumers based on the user relationships. The user relationships in social media include the explicit relationships and the implicit relationships. The explicit relationships can be easily extracted by the direct interactions between users. However, the explicit relationships do not capture many user relationships. In this paper, we employ temporal analysis to extract the implicit relationships between social media users. We utilize spectral analysis and spectral coherence to measure the similarity between two user feature vectors. A simple thresholding method and a Top-K Filtering method are developed to extract the implicit relationships. An experiment is conducted using the Digg.com dataset. Two human annotators are recruited to generate the gold standard. The experiment result shows that the proposed techniques achieve substantially higher recall but suffer in precision. The results on the harmonic mean of precision and recall, F1 measures, show that the proposed techniques (implicit relationships) outperformed the baseline methods (explicit relationships). In the future, we plan to further enhance the filtering techniques as well as extending the current techniques for user clustering and community detection applications. 6. REFERENCES [1] Adamic, L. A. and Adar, E Friends and neighbors on the web. Social networks, 25, 3 (2003), [2] Al Hasan, M., Chaoji, V., Salem, S. and Zaki, M Link prediction using supervised learning. Citeseer, [3] Bakeman, R. and Gottman, J. M Observing interaction: An introduction to sequential analysis. Cambridge Univ Pr. [4] Bonchi, F., Castillo, C., Gionis, A. and Jaimes, A Social network analysis and mining for business applications. ACM Transactions on Intelligent Systems and Technology (TIST), 2, 3 (2011), 22.

7 [5] Chien, S. and Immorlica, N Semantic similarity between search engine queries using temporal correlation. ACM, [6] Goebel, R., Roebroeck, A., Kim, D. and Formisano, E Investigating directed cortical interactions in time-resolved fmri data using vector autoregressive modeling and Granger causality mapping. Magnetic Resonance Imaging, 21, 10 (2003), [7] Granger, C Investigating causal relations by econometric models and cross-spectral methods. Econometrica: Journal of the Econometric Society, 37, 3 (1969), [8] He, Q., Chang, K. and Lim, E Analyzing feature trajectories for event detection. ACM, [9] Hirabayashi, T., Takeuchi, D., Tamura, K. and Miyashita, Y Triphasic Dynamics of Stimulus-Dependent Information Flow between Single Neurons in Macaque Inferior Temporal Cortex. Journal of Neuroscience, 30, 31 (2010), [10] Katz, L A new status index derived from sociometric analysis. Psychometrika, 18, 1 (1953), [11] Liben-Nowell, D. and Kleinberg, J The link-prediction problem for social networks. Journal of the American Society for Information Science and Technology, 58, 7 (2007), [12] Lichtenwalter, R. N., Lussier, J. T. and Chawla, N. V New perspectives and methods in link prediction. ACM, [13] McPherson, M., Smith-Lovin, L. and Cook, J. M Birds of a feather: Homophily in social networks. Annual review of sociology, (2001), [14] Mitra, P. and Pesaran, B Analysis of dynamic brain imaging data. Biophysical journal, 76, 2 (1999), [15] Mukhopadhyay, N. and Chatterjee, S Causality and pathway search in microarray time series experiment. Bioinformatics, 23, 4 (2007), 442. [16] Newman, M. E. J Clustering and preferential attachment in growing networks. Physical Review E, 64, 2 (2001), [17] Provost, F., Dalessandro, B., Hook, R., Zhang, X. and Murray, A Audience selection for on-line brand advertising: privacy-friendly social network targeting. ACM, [18] Slepian, D First passage time for a particular Gaussian process. The Annals of Mathematical Statistics, 32, 2 (1961), [19] X. Tang, C. C. Yang, and X. Gong, A Spectral Analysis Approach for Social Media Community Detection, Proceedings of International Conference on Social Informatics 2011, Singapore, October 6 8, [20] C. C. Yang and X. Tang, Social Networks Integration and Privacy Preservation using Subgraph Generalization, Proceedings of AMC SIGKDD Workshop on CyberSecurity and Intelligence Informatics, Paris, France, June 28, [21] C. C. Yang, Y. C. Wong, and C. Wei, Classifying Web Opinions for Consumer Product Analysis, Proceedings of the 11th International Conference on Electronic Commerce, Taipei, Taiwan, August 12-15, [22] C. Yang, C. Wei, and C. C. Yang, Extracting Customer Knowledge from Online Consumer Reviews: A Collaborative-Filtering-based Opinion Sentence Identification Approach, Proceedings of the 11th International Conference on Electronic Commerce, Taipei, Taiwan, August 12-15, [23] C. C. Yang and B. Thuraisingham, Privacy-Preserved Social Network Integration and Analysis for Security Informatics, IEEE Intelligent Systems, vol.25, no.3, 2010, pp [24] C. C. Yang, X. Tang, Y. C. Wong, and C. Wei, Understanding Online Consumer Review Opinions with Sentiment Analysis using Machine Learning, Pacific Asia Journal of the Association for Information Systems, vol.2, no.3, [25] C. C. Yang and T. Ng, Analyzing and Visualizing Web Opinion Development and Social Interactions with Density Based Clustering, IEEE Transactions on Systems, Man, and Cybernetics, Part A: Systems and Humans, vol.41, no.6, 2011, pp [26] C. C. Yang, X. Tang, J. Huang, and J. Unger, A Study of Social Interactions in Online Health Communities, Proceedings of IEEE International Conference on Social Computing, Boston, MA, October 9 11, 2011.

SOCIAL MEDIA MINING. Behavior Analytics

SOCIAL MEDIA MINING. Behavior Analytics SOCIAL MEDIA MINING Behavior Analytics Dear instructors/users of these slides: Please feel free to include these slides in your own material, or modify them as you see fit. If you decide to incorporate

More information

Social Network Collaborative Filtering

Social Network Collaborative Filtering Social Network Collaborative Filtering Rong Zheng, Foster Provost, Anindya Ghose Abstract This paper reports on a preliminary empirical study comparing methods for collaborative filtering (CF) using explicit

More information

The Science of Social Media. Kristina Lerman USC Information Sciences Institute

The Science of Social Media. Kristina Lerman USC Information Sciences Institute The Science of Social Media Kristina Lerman USC Information Sciences Institute ML meetup, July 2011 What is a science? Explain observed phenomena Make verifiable predictions Help engineer systems with

More information

Exploring Similarities of Conserved Domains/Motifs

Exploring Similarities of Conserved Domains/Motifs Exploring Similarities of Conserved Domains/Motifs Sotiria Palioura Abstract Traditionally, proteins are represented as amino acid sequences. There are, though, other (potentially more exciting) representations;

More information

Final Report Evaluating Social Networks as a Medium of Propagation for Real-Time/Location-Based News

Final Report Evaluating Social Networks as a Medium of Propagation for Real-Time/Location-Based News Final Report Evaluating Social Networks as a Medium of Propagation for Real-Time/Location-Based News Mehmet Ozan Kabak, Group Number: 45 December 10, 2012 Abstract This work is concerned with modeling

More information

Who is Retweeting the Tweeters? Modeling Originating and Promoting Behaviors in Twitter Network

Who is Retweeting the Tweeters? Modeling Originating and Promoting Behaviors in Twitter Network Who is Retweeting the Tweeters? Modeling Originating and Promoting Behaviors in Twitter Network Aek Palakorn Achananuparp, Ee-Peng Lim, Jing Jiang, and Tuan-Anh Hoang Living Analytics Research Centre,

More information

A Destination Prediction Method Based on Behavioral Pattern Analysis of Nonperiodic Position Logs

A Destination Prediction Method Based on Behavioral Pattern Analysis of Nonperiodic Position Logs A Destination Prediction Method Based on Behavioral Pattern Analysis of Nonperiodic Position Logs Fumitaka Nakahara * and Takahiro Murakami ** * Service Platforms Research Laboratories, NEC Corporation

More information

Uncovering the Small Community Structure in Large Networks: A Local Spectral Approach

Uncovering the Small Community Structure in Large Networks: A Local Spectral Approach Uncovering the Small Community Structure in Large Networks: A Local Spectral Approach Yixuan Li 1, Kun He 2, David Bindel 1 and John E. Hopcroft 1 1 Cornell University, USA 2 Huazhong University of Science

More information

The effect of Product Ratings on Viral Marketing CS224W Project proposal

The effect of Product Ratings on Viral Marketing CS224W Project proposal The effect of Product Ratings on Viral Marketing CS224W Project proposal Stefan P. Hau-Riege, stefanhr@stanford.edu In network-based marketing, social influence is considered in order to optimize marketing

More information

Predicting user rating for Yelp businesses leveraging user similarity

Predicting user rating for Yelp businesses leveraging user similarity Predicting user rating for Yelp businesses leveraging user similarity Kritika Singh kritika@eng.ucsd.edu Abstract Users visit a Yelp business, such as a restaurant, based on its overall rating and often

More information

Business Network Analytics

Business Network Analytics Business Network Analytics Sep, 2017 Daning Hu Department of Informatics University of Zurich Business Intelligence Research Group F Schweitzer et al. Science 2009 Research Methods and Goals What Why How

More information

Machine Learning for Display Advertising

Machine Learning for Display Advertising Machine Learning for Display Advertising The organizers asked me to talk a little about Media6Degrees The Idea: Social Targeting for Online Advertising Performance Current ad spending seems disproportionate

More information

2016 U.S. PRESIDENTIAL ELECTION FAKE NEWS

2016 U.S. PRESIDENTIAL ELECTION FAKE NEWS 2016 U.S. PRESIDENTIAL ELECTION FAKE NEWS OVERVIEW Introduction Main Paper Related Work and Limitation Proposed Solution Preliminary Result Conclusion and Future Work TWITTER: A SOCIAL NETWORK AND A NEWS

More information

Data Mining in Social Network. Presenter: Keren Ye

Data Mining in Social Network. Presenter: Keren Ye Data Mining in Social Network Presenter: Keren Ye References Kwak, Haewoon, et al. "What is Twitter, a social network or a news media?." Proceedings of the 19th international conference on World wide web.

More information

Predicting Popularity of Messages in Twitter using a Feature-weighted Model

Predicting Popularity of Messages in Twitter using a Feature-weighted Model International Journal of Advanced Intelligence Volume 0, Number 0, pp.xxx-yyy, November, 20XX. c AIA International Advanced Information Institute Predicting Popularity of Messages in Twitter using a Feature-weighted

More information

Predicting Purchase Behavior of E-commerce Customer, One-stage or Two-stage?

Predicting Purchase Behavior of E-commerce Customer, One-stage or Two-stage? 2016 International Conference on Artificial Intelligence and Computer Science (AICS 2016) ISBN: 978-1-60595-411-0 Predicting Purchase Behavior of E-commerce Customer, One-stage or Two-stage? Chen CHEN

More information

Classification Model for Intent Mining in Personal Website Based on Support Vector Machine

Classification Model for Intent Mining in Personal Website Based on Support Vector Machine , pp.145-152 http://dx.doi.org/10.14257/ijdta.2016.9.2.16 Classification Model for Intent Mining in Personal Website Based on Support Vector Machine Shuang Zhang, Nianbin Wang School of Computer Science

More information

Final Report: Local Structure and Evolution for Cascade Prediction

Final Report: Local Structure and Evolution for Cascade Prediction Final Report: Local Structure and Evolution for Cascade Prediction Jake Lussier (lussier1@stanford.edu), Jacob Bank (jbank@stanford.edu) ABSTRACT Information cascades in large social networks are complex

More information

Group #2 Project Final Report: Information Flows on Twitter

Group #2 Project Final Report: Information Flows on Twitter Group #2 Project Final Report: Information Flows on Twitter Huang-Wei Chang, Te-Yuan Huang 1 Introduction Twitter has become a very popular microblog website and had attracted millions of users up to 2009.

More information

Predicting the Odds of Getting Retweeted

Predicting the Odds of Getting Retweeted Predicting the Odds of Getting Retweeted Arun Mahendra Stanford University arunmahe@stanford.edu 1. Introduction Millions of people tweet every day about almost any topic imaginable, but only a small percent

More information

Empower Social Influence by Linking the Social Opinion to Social Network

Empower Social Influence by Linking the Social Opinion to Social Network Association for Information Systems AIS Electronic Library (AISeL) PACIS 2017 Proceedings Pacific Asia Conference on Information Systems (PACIS) Summer 7-19-2017 Empower Social Influence by Linking the

More information

Can Cascades be Predicted?

Can Cascades be Predicted? Can Cascades be Predicted? Rediet Abebe and Thibaut Horel September 22, 2014 1 Introduction In this presentation, we discuss the paper Can Cascades be Predicted? by Cheng, Adamic, Dow, Kleinberg, and Leskovec,

More information

TwitterRank: Finding Topicsensitive Influential Twitterers

TwitterRank: Finding Topicsensitive Influential Twitterers TwitterRank: Finding Topicsensitive Influential Twitterers Jianshu Weng, Ee-Peng Lim, Jing Jiang Singapore Management University Qi He Pennsylvania State University WSDM 2010 Feb 5, 2010 Outline Introduction

More information

SENTIMENT-BASED MODEL FOR REPUTATION SYSTEMS IN AMAZON

SENTIMENT-BASED MODEL FOR REPUTATION SYSTEMS IN AMAZON SENTIMENT-BASED MODEL FOR REPUTATION SYSTEMS IN AMAZON Milad Sharif msharif@stanford.edu Soheil Norouzi snorouzi@stanford.edu 1. INTRODUCTION When buyers purchase products from an online retailer such

More information

TwiTTer Module 5 SeSSion 2: TwiTTer MoniToring And MeASuring ToolS

TwiTTer Module 5 SeSSion 2: TwiTTer MoniToring And MeASuring ToolS Twitter Module 5 Session 2: Twitter Monitoring And Measuring Tools Table of Contents Tools 1 Klout 1 Twitalyzer 2 Simply Measured 3 TweetReach 4 Social Mention 4 Welcome to session number 2: Twitter monitoring

More information

Prediction of Personalized Rating by Combining Bandwagon Effect and Social Group Opinion: using Hadoop-Spark Framework

Prediction of Personalized Rating by Combining Bandwagon Effect and Social Group Opinion: using Hadoop-Spark Framework Prediction of Personalized Rating by Combining Bandwagon Effect and Social Group Opinion: using Hadoop-Spark Framework Lu Sun 1, Kiejin Park 2 and Limei Peng 1 1 Department of Industrial Engineering, Ajou

More information

HybridRank: Ranking in the Twitter Hybrid Networks

HybridRank: Ranking in the Twitter Hybrid Networks HybridRank: Ranking in the Twitter Hybrid Networks Jianyu Li Department of Computer Science University of Maryland, College Park jli@cs.umd.edu ABSTRACT User influence in social media may depend on multiple

More information

Cold-start Solution to Location-based Entity Shop. Recommender Systems Using Online Sales Records

Cold-start Solution to Location-based Entity Shop. Recommender Systems Using Online Sales Records Cold-start Solution to Location-based Entity Shop Recommender Systems Using Online Sales Records Yichen Yao 1, Zhongjie Li 2 1 Department of Engineering Mechanics, Tsinghua University, Beijing, China yaoyichen@aliyun.com

More information

Context-aware recommendation

Context-aware recommendation Context-aware recommendation Eirini Kolomvrezou, Hendrik Heuer Special Course in Computer and Information Science User Modelling & Recommender Systems Aalto University Context-aware recommendation 2 Recommendation

More information

A logistic regression model for Semantic Web service matchmaking

A logistic regression model for Semantic Web service matchmaking . BRIEF REPORT. SCIENCE CHINA Information Sciences July 2012 Vol. 55 No. 7: 1715 1720 doi: 10.1007/s11432-012-4591-x A logistic regression model for Semantic Web service matchmaking WEI DengPing 1*, WANG

More information

A STUDY ON SNA: MEASURE AVERAGE DEGREE AND AVERAGE WEIGHTED DEGREE OF KNOWLEDGE DIFFUSION IN GEPHI

A STUDY ON SNA: MEASURE AVERAGE DEGREE AND AVERAGE WEIGHTED DEGREE OF KNOWLEDGE DIFFUSION IN GEPHI A STUDY ON SNA: MEASURE AVERAGE DEGREE AND AVERAGE WEIGHTED DEGREE OF KNOWLEDGE DIFFUSION IN GEPHI Ayyappan.G 1, Dr.C.Nalini 2, Dr.A.Kumaravel 3 Research Scholar, Department of CSE, Bharath University,Chennai

More information

A Survey on Recommendation Techniques in E-Commerce

A Survey on Recommendation Techniques in E-Commerce A Survey on Recommendation Techniques in E-Commerce Namitha Ann Regi Post-Graduate Student Department of Computer Science and Engineering Karunya University, India P. Rebecca Sandra Assistant Professor

More information

Link Prediction in Bipartite Venture Capital Investment Networks

Link Prediction in Bipartite Venture Capital Investment Networks Link Prediction in Bipartite Venture Capital Investment Networks Charles Zhang Statistics Stanford University cyzhang@stanford.edu Ethan Chan Computer Science Stanford University ethancys@cs.stanford.edu

More information

Lecture 18: Toy models of human interaction: use and abuse

Lecture 18: Toy models of human interaction: use and abuse Lecture 18: Toy models of human interaction: use and abuse David Aldous November 2, 2017 Network can refer to many different things. In ordinary language social network refers to Facebook-like activities,

More information

Assortativity Patterns in Multi-dimensional Inter-organizational Networks: A Case Study of the Humanitarian Relief Sector

Assortativity Patterns in Multi-dimensional Inter-organizational Networks: A Case Study of the Humanitarian Relief Sector Assortativity Patterns in Multi-dimensional Inter-organizational Networks: A Case Study of the Humanitarian Relief Sector Kang Zhao, Louis-Marie Ngamassi, John Yen, Carleen Maitland, and Andrea Tapia College

More information

Financial Time Series Segmentation Based On Turning Points

Financial Time Series Segmentation Based On Turning Points Proceedings of 2011 International Conference on System Science and Engineering, Macau, China - June 2011 Financial Time Series Segmentation Based On Turning Points Jiangling Yin, Yain-Whar Si, Zhiguo Gong

More information

Prediction of Success or Failure of Software Projects based on Reusability Metrics using Support Vector Machine

Prediction of Success or Failure of Software Projects based on Reusability Metrics using Support Vector Machine Prediction of Success or Failure of Software Projects based on Reusability Metrics using Support Vector Machine R. Sathya Assistant professor, Department of Computer Science & Engineering Annamalai University

More information

Distance and Friendship: A Distance-based Model for Link Prediction in Social Networks

Distance and Friendship: A Distance-based Model for Link Prediction in Social Networks Distance and Friendship: A Distance-based Model for Link Prediction in Social Networks Yang Zhang 1 and Jun Pang 1,2 1 Faculty of Science, Technology and Communication 2 Interdisciplinary Centre for Security,

More information

Data Science Challenges for Online Advertising A Survey on Methods and Applications from a Machine Learning Perspective

Data Science Challenges for Online Advertising A Survey on Methods and Applications from a Machine Learning Perspective Data Science Challenges for Online Advertising A Survey on Methods and Applications from a Machine Learning Perspective IWD2016 Dublin, March 2016 Online Advertising Landscape [Introduction to Computational

More information

Worker Skill Estimation from Crowdsourced Mutual Assessments

Worker Skill Estimation from Crowdsourced Mutual Assessments Worker Skill Estimation from Crowdsourced Mutual Assessments Shuwei Qiang The George Washington University Amrinder Arora BizMerlin Current approaches for estimating skill levels of workforce either do

More information

Final Report: Local Structure and Evolution for Cascade Prediction

Final Report: Local Structure and Evolution for Cascade Prediction Final Report: Local Structure and Evolution for Cascade Prediction Jake Lussier (lussier1@stanford.edu), Jacob Bank (jbank@stanford.edu) December 10, 2011 Abstract Information cascades in large social

More information

Cross-Pollination of Information in Online Social Media: A Case Study on Popular Social Networks

Cross-Pollination of Information in Online Social Media: A Case Study on Popular Social Networks Cross-Pollination of Information in Online Social Media: A Case Study on Popular Social Networks Paridhi Jain, Tiago Rodrigues, Gabriel Magno, Ponnurangam Kumaraguru, Virgílio Almeida Indraprastha Institute

More information

Predict Commercial Promoted Contents Will Be Clicked By User

Predict Commercial Promoted Contents Will Be Clicked By User Predict Commercial Promoted Contents Will Be Clicked By User Gary(Xinran) Guo garyguo@stanford.edu SUNetID: garyguo Stanford University 1. Introduction As e-commerce, social media grows rapidly, advertisements

More information

Markov and Non-Markov Models for Brand Switching and Market Share Forecasting

Markov and Non-Markov Models for Brand Switching and Market Share Forecasting Markov and Non-Markov s for Brand Switching and Market Share Forecasting Ka Ching Chan Member IEEE Abstract Markov chain has been a popular approach for the brand switching problem and market share forecasting

More information

The Future of Marketing, Channels & Optimisation.

The Future of Marketing, Channels & Optimisation. The Future of Marketing, Channels & Optimisation. 1 CONVERT &MINE The mission of &Mine s digital advertising is to provide ideal and scalable results for brand awareness, marketing, lead generation and

More information

PRODUCT REVIEW SENTIMENT ANALYSIS WITH TRENDY SYNTHETIC DICTIONARY AND AMBIGUITY MITIGATION

PRODUCT REVIEW SENTIMENT ANALYSIS WITH TRENDY SYNTHETIC DICTIONARY AND AMBIGUITY MITIGATION PRODUCT REVIEW SENTIMENT ANALYSIS WITH TRENDY SYNTHETIC DICTIONARY AND AMBIGUITY MITIGATION Prof. A. N. Nawathe anunawathe@gmail.com Godge Isha Sudhir ishagodge37@gmail.com Dang Poornima Mahesh dang.poornima@gmail.com

More information

Application of Decision Trees in Mining High-Value Credit Card Customers

Application of Decision Trees in Mining High-Value Credit Card Customers Application of Decision Trees in Mining High-Value Credit Card Customers Jian Wang Bo Yuan Wenhuang Liu Graduate School at Shenzhen, Tsinghua University, Shenzhen 8, P.R. China E-mail: gregret24@gmail.com,

More information

Weka Evaluation: Assessing the performance

Weka Evaluation: Assessing the performance Weka Evaluation: Assessing the performance Lab3 (in- class): 21 NOV 2016, 13:00-15:00, CHOMSKY ACKNOWLEDGEMENTS: INFORMATION, EXAMPLES AND TASKS IN THIS LAB COME FROM SEVERAL WEB SOURCES. Learning objectives

More information

Friendship Prediction on Social Network Users

Friendship Prediction on Social Network Users SocialCom/PASSAT/BigData/EconCom/BioMedCom 2013 Friendship Prediction on Social Network Users Kuan-Hsi Chen and Tyne Liang Department of Computer Science National Chiao Tung University Hsinchu, Taiwan

More information

User Profiling in an Ego Network: Co-profiling Attributes and Relationships

User Profiling in an Ego Network: Co-profiling Attributes and Relationships User Profiling in an Ego Network: Co-profiling Attributes and Relationships Rui Li, Chi Wang, Kevin Chen-Chuan Chang, {ruili1, chiwang1, kcchang}@illinois.edu Department of Computer Science, University

More information

Designing Of Effective Shopper Purchase Analysis Model Based On Chrip Likes

Designing Of Effective Shopper Purchase Analysis Model Based On Chrip Likes Designing Of Effective Shopper Purchase Analysis Model Based On Chrip Likes Palla Jyothsna #1, S.Igni Sabasti Prabu *2 # Department of Information Technology, Sathyabama University Chennai, Tamil Nadu,

More information

A Protein Secondary Structure Prediction Method Based on BP Neural Network Ru-xi YIN, Li-zhen LIU*, Wei SONG, Xin-lei ZHAO and Chao DU

A Protein Secondary Structure Prediction Method Based on BP Neural Network Ru-xi YIN, Li-zhen LIU*, Wei SONG, Xin-lei ZHAO and Chao DU 2017 2nd International Conference on Artificial Intelligence: Techniques and Applications (AITA 2017 ISBN: 978-1-60595-491-2 A Protein Secondary Structure Prediction Method Based on BP Neural Network Ru-xi

More information

Final Project - Social and Information Network Analysis

Final Project - Social and Information Network Analysis Final Project - Social and Information Network Analysis Factors and Variables Affecting Social Media Reviews I. Introduction Humberto Moreira Rajesh Balwani Subramanyan V Dronamraju Dec 11, 2011 Problem

More information

ABM PLAYBOOK TESTING WITH ABM ANALYTICS: 4 STEPS TO SEE FUNNEL PERFORMANCE FOR ANYTHING

ABM PLAYBOOK TESTING WITH ABM ANALYTICS: 4 STEPS TO SEE FUNNEL PERFORMANCE FOR ANYTHING ABM PLAYBOOK TESTING WITH ABM ANALYTICS: 4 STEPS TO SEE FUNNEL PERFORMANCE FOR ANYTHING ABM Analytics by Demandbase is a great tool for evaluating how your target account list is performing across the

More information

Cyber-Social-Physical Features for Mood Prediction over Online Social Networks

Cyber-Social-Physical Features for Mood Prediction over Online Social Networks DEIM Forum 2017 B1-3 Cyber-Social-Physical Features for Mood Prediction over Online Social Networks Chaima DHAHRI Kazunori MATSUMOTO and Keiichiro HOASHI KDDI Research, Inc 2-1-15 Ohara, Fujimino-shi,

More information

Influence of First Steps in a Community on Ego-Network: Growth, Diversity, and Engagement

Influence of First Steps in a Community on Ego-Network: Growth, Diversity, and Engagement Influence of First Steps in a Community on Ego-Network: Growth, Diversity, and Engagement Atef Chaudhury University of Waterloo Waterloo, ON N2L 3G1, Canada a29chaud@uwaterloo.ca Myunghwan Kim LinkedIn

More information

An Analysis Framework for Content-based Job Recommendation. Author(s) Guo, Xingsheng; Jerbi, Houssem; O'Mahony, Michael P.

An Analysis Framework for Content-based Job Recommendation. Author(s) Guo, Xingsheng; Jerbi, Houssem; O'Mahony, Michael P. Provided by the author(s) and University College Dublin Library in accordance with publisher policies. Please cite the published version when available. Title An Analysis Framework for Content-based Job

More information

Machine learning applications in genomics: practical issues & challenges. Yuzhen Ye School of Informatics and Computing, Indiana University

Machine learning applications in genomics: practical issues & challenges. Yuzhen Ye School of Informatics and Computing, Indiana University Machine learning applications in genomics: practical issues & challenges Yuzhen Ye School of Informatics and Computing, Indiana University Reference Machine learning applications in genetics and genomics

More information

A Comparative Study of Recommendation Methods for Mobile OSN Users

A Comparative Study of Recommendation Methods for Mobile OSN Users A Comparative Study of Recommendation Methods for Mobile OSN Users Shyam Krishna K 1, Dr. Vince Paul 2 M.Tech Student, Department of Computer Science &Engineering, Sahrdaya College of Engineering & Technology,

More information

Diffusive Logistic Model Towards Predicting Information Diffusion in Online Social Networks

Diffusive Logistic Model Towards Predicting Information Diffusion in Online Social Networks Diffusive Logistic Model Towards Predicting Information Diffusion in Online Social Networks Feng Wang, Haiyan Wang, Kuai Xu Arizona State University Email: {fwang5, haiyan.wang, kuai.xu}@asu.edu Abstract

More information

Indian Election Trend Prediction Using Improved Competitive Vector Regression Model

Indian Election Trend Prediction Using Improved Competitive Vector Regression Model Indian Election Trend Prediction Using Improved Competitive Vector Regression Model Navya S 1 1 Department of Computer Science and Engineering, University, India Abstract Election result forecasting has

More information

Product Aspect Ranking Based on the Customers Reviews.

Product Aspect Ranking Based on the Customers Reviews. Product Aspect Ranking Based on the Customers Reviews. Kadam Snehal, Dhomse Kalyani, Bhumkar Smita, Janrao Swati, Ambhore Shakuntala BE Scholar, Department of Computer Engineering S.R.E.S. s College of

More information

A Comparative Study of Filter-based Feature Ranking Techniques

A Comparative Study of Filter-based Feature Ranking Techniques Western Kentucky University From the SelectedWorks of Dr. Huanjing Wang August, 2010 A Comparative Study of Filter-based Feature Ranking Techniques Huanjing Wang, Western Kentucky University Taghi M. Khoshgoftaar,

More information

Text Mining. Theory and Applications Anurag Nagar

Text Mining. Theory and Applications Anurag Nagar Text Mining Theory and Applications Anurag Nagar Topics Introduction What is Text Mining Features of Text Document Representation Vector Space Model Document Similarities Document Classification and Clustering

More information

MFMS: Maximal Frequent Module Set mining from multiple human gene expression datasets

MFMS: Maximal Frequent Module Set mining from multiple human gene expression datasets MFMS: Maximal Frequent Module Set mining from multiple human gene expression datasets Saeed Salem North Dakota State University Cagri Ozcaglar Amazon 8/11/2013 Introduction Gene expression analysis Use

More information

Trend Extraction Method using Co-occurrence Patterns from Tweets

Trend Extraction Method using Co-occurrence Patterns from Tweets Information Engineering Express International Institute of Applied Informatics 2016, Vol.2, No.4, 1 10 Trend Extraction Method using Co-occurrence Patterns from Tweets Shotaro Noda and Katsuhide Fujita

More information

Survival Outcome Prediction for Cancer Patients based on Gene Interaction Network Analysis and Expression Profile Classification

Survival Outcome Prediction for Cancer Patients based on Gene Interaction Network Analysis and Expression Profile Classification Survival Outcome Prediction for Cancer Patients based on Gene Interaction Network Analysis and Expression Profile Classification Final Project Report Alexander Herrmann Advised by Dr. Andrew Gentles December

More information

Me Too 2.0: An Analysis of Viral Retweets on the Twittersphere

Me Too 2.0: An Analysis of Viral Retweets on the Twittersphere Me Too 2.0: An Analysis of Viral Retweets on the Twittersphere Rio Akasaka Department of Computer Science rio@cs.stanford.edu Patrick Grafe Department of Computer Science pgrafe@stanford.edu Makoto Kondo

More information

A Survey on Influence Analysis in Social Networks

A Survey on Influence Analysis in Social Networks A Survey on Influence Analysis in Social Networks Jessie Yin 1 Introduction With growing popularity of Web 2.0, recent years have witnessed wide-spread adoption of rich social media applications such as

More information

Inferring Social Ties across Heterogeneous Networks

Inferring Social Ties across Heterogeneous Networks Inferring Social Ties across Heterogeneous Networks CS 6001 Complex Network Structures HARISH ANANDAN Introduction Social Ties Information carrying connections between people It can be: Strong, weak or

More information

Executive Summary: Quantifying the Role of Social Voice in Marketing Effectiveness. 10/25/2012 MarketShare & KellerFay Group

Executive Summary: Quantifying the Role of Social Voice in Marketing Effectiveness. 10/25/2012 MarketShare & KellerFay Group Executive Summary: Quantifying the Role of Social Voice in Marketing Effectiveness 10/25/2012 MarketShare & KellerFay Group. Introduction As Social Voice (online and offline word of mouth) becomes increasingly

More information

Influencer Communities. Influencer Communities. Influencers are having many different conversations

Influencer Communities. Influencer Communities. Influencers are having many different conversations Influencer Communities Influencers are having many different conversations 1 1.0 Background A unique feature of social networks is that people with common interests are following (or friend-ing) similar

More information

DIGITAL MARKETING DATA SHEET CHANNELADVISOR DIGITAL MARKETING ENABLES RETAILERS TO: And many more

DIGITAL MARKETING DATA SHEET CHANNELADVISOR DIGITAL MARKETING ENABLES RETAILERS TO: And many more DIGITAL MARKETING DATA SHEET With more and more digital marketing channels introduced each year, it can be hard to keep track of every place you re advertising let alone how each of your products is performing

More information

Keywords acrm, RFM, Clustering, Classification, SMOTE, Metastacking

Keywords acrm, RFM, Clustering, Classification, SMOTE, Metastacking Volume 5, Issue 9, September 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Comparative

More information

2 Maria Carolina Monard and Gustavo E. A. P. A. Batista

2 Maria Carolina Monard and Gustavo E. A. P. A. Batista Graphical Methods for Classifier Performance Evaluation Maria Carolina Monard and Gustavo E. A. P. A. Batista University of São Paulo USP Institute of Mathematics and Computer Science ICMC Department of

More information

Introduction to Recommendation Engines

Introduction to Recommendation Engines Introduction to Recommendation Engines A guide to algorithmically predicting what your customers want and when. By Tuck Ngun, PhD Introduction Recommendation engines have become a popular solution for

More information

Improving Urban Mobility Through Urban Analytics Using Electronic Smart Card Data

Improving Urban Mobility Through Urban Analytics Using Electronic Smart Card Data Improving Urban Mobility Through Urban Analytics Using Electronic Smart Card Data Mayuree Binjolkar, Daniel Dylewsky, Andrew Ju, Wenonah Zhang, Mark Hallenbeck Data Science for Social Good-2017,University

More information

A SURVEY ON PRODUCT REVIEW SENTIMENT ANALYSIS

A SURVEY ON PRODUCT REVIEW SENTIMENT ANALYSIS A SURVEY ON PRODUCT REVIEW SENTIMENT ANALYSIS Godge Isha Sudhir ishagodge37@gmail.com Arvikar Shruti Sanjay shrutiarvikar89@gmail.com Dang Poornima Mahesh dang.poornima@gmail.com Maske Rushikesh Kantrao

More information

Evaluating Workflow Trust using Hidden Markov Modeling and Provenance Data

Evaluating Workflow Trust using Hidden Markov Modeling and Provenance Data Evaluating Workflow Trust using Hidden Markov Modeling and Provenance Data Mahsa Naseri and Simone A. Ludwig Abstract In service-oriented environments, services with different functionalities are combined

More information

Predicting Loyal Customers for Sellers on Tmall to Increase Return on Promoting Cost

Predicting Loyal Customers for Sellers on Tmall to Increase Return on Promoting Cost Predicting Loyal Customers for Sellers on Tmall to Increase Return on Promoting Cost Wendy Huang Yu-Chih, Shih Jessy Yang Zoe Cheng BADM Team 9 Summary Sellers on E-commerce platform sometimes run big

More information

Stream Clustering of Tweets

Stream Clustering of Tweets Stream Clustering of Tweets Sophie Baillargeon Département de mathématiques et de statistique Université Laval Québec (Québec), Canada G1V 0A6 Email: sophie.baillargeon@mat.ulaval.ca Simon Hallé Thales

More information

E-Commerce Sales Prediction Using Listing Keywords

E-Commerce Sales Prediction Using Listing Keywords E-Commerce Sales Prediction Using Listing Keywords Stephanie Chen (asksteph@stanford.edu) 1 Introduction Small online retailers usually set themselves apart from brick and mortar stores, traditional brand

More information

Automatic Detection of Rumor on Social Network

Automatic Detection of Rumor on Social Network Automatic Detection of Rumor on Social Network Qiao Zhang 1,2, Shuiyuan Zhang 1,2, Jian Dong 3, Jinhua Xiong 2(B), and Xueqi Cheng 2 1 University of Chinese Academy of Sciences, Beijing, China 2 Institute

More information

Expert Based Prediction of User Preferences

Expert Based Prediction of User Preferences Expert Based Prediction of User Preferences M. Dima Department of Informatics National and Kapodistrian University of Athens, Greece D. Vogiatzis Institute of Informatics and Telecommunications, National

More information

TOWARD MORE DIVERSE RECOMMENDATIONS: ITEM RE-RANKING METHODS FOR RECOMMENDER SYSTEMS

TOWARD MORE DIVERSE RECOMMENDATIONS: ITEM RE-RANKING METHODS FOR RECOMMENDER SYSTEMS TOWARD MORE DIVERSE RECOMMENDATIONS: ITEM RE-RANKING METHODS FOR RECOMMENDER SYSTEMS Gediminas Adomavicius YoungOk Kwon Department of Information and Decision Sciences Carlson School of Management, University

More information

Predicting ratings of peer-generated content with personalized metrics

Predicting ratings of peer-generated content with personalized metrics Predicting ratings of peer-generated content with personalized metrics Project report Tyler Casey tyler.casey09@gmail.com Marius Lazer mlazer@stanford.edu [Group #40] Ashish Mathew amathew9@stanford.edu

More information

Jure Leskovec, Includes joint work with Jaewon Yang, Manuel Gomez Rodriguez, Andreas Krause, Lars Backstrom and Jon Kleinberg.

Jure Leskovec, Includes joint work with Jaewon Yang, Manuel Gomez Rodriguez, Andreas Krause, Lars Backstrom and Jon Kleinberg. Jure Leskovec, Stanford University Includes joint work with Jaewon Yang, Manuel Gomez Rodriguez, Andreas Krause, Lars Backstrom and Jon Kleinberg. Information reaches us by personal influence in our social

More information

(32) where for are sub-matrices and, for are sub-vectors. Such a system of equations can be solved by solving the following sub-problems

(32) where for are sub-matrices and, for are sub-vectors. Such a system of equations can be solved by solving the following sub-problems Module 4 : Solving Linear Algebraic Equations Section 4 : Direct Methods for Solving Sparse Linear Systems 4 Direct Methods for Solving Sparse Linear Systems A system of Linear equations --------(32) is

More information

Tutorial at the ACM SIGKDD conference, Jure Leskovec Stanford University

Tutorial at the ACM SIGKDD conference, Jure Leskovec Stanford University Tutorial at the ACM SIGKDD conference, 2011 http://snap.stanford.edu/proj/socmedia-kdd Jure Leskovec Stanford University 8/21/2011 Jure Leskovec: Social Media Analytics (KDD '11 tutorial) 2 In Social Media

More information

The Social Engagement Index. A Benchmark Report by Spredfast on the State of Enterprise Social Programs

The Social Engagement Index. A Benchmark Report by Spredfast on the State of Enterprise Social Programs The Social Engagement Index A Benchmark Report by Spredfast on the State of Enterprise Social Programs Published: August 2012 Table of Contents I. Introduction II. III. IV. Summary of Findings Social Engagement

More information

Machine learning-based approaches for BioCreative III tasks

Machine learning-based approaches for BioCreative III tasks Machine learning-based approaches for BioCreative III tasks Shashank Agarwal 1, Feifan Liu 2, Zuofeng Li 2 and Hong Yu 1,2,3 1 Medical Informatics, College of Engineering and Applied Sciences, University

More information

Will My Followers Tweet? Predicting Twitter Engagement using Machine Learning. David Darmon, Jared Sylvester, Michelle Girvan and William Rand

Will My Followers Tweet? Predicting Twitter Engagement using Machine Learning. David Darmon, Jared Sylvester, Michelle Girvan and William Rand Will My Followers Tweet? Predicting Twitter Engagement using Machine Learning David Darmon, Jared Sylvester, Michelle Girvan and William Rand Abstract: Managers who are able to understand how social media

More information

Tweeting Questions in Academic Conferences: Seeking or Promoting Information?

Tweeting Questions in Academic Conferences: Seeking or Promoting Information? Tweeting Questions in Academic Conferences: Seeking or Promoting Information? Xidao Wen, University of Pittsburgh Yu-Ru Lin, University of Pittsburgh Abstract The fast growth of social media has reshaped

More information

Facebook Open Graph and the Future of Personalization

Facebook Open Graph and the Future of Personalization view POINT OF OF view POINT Facebook Open Graph and the Future of Personalization Hilding Anderson and Rob Gonda, Marketing Strategy & Analysis, SapientNitro Introduction Facebook introduced Open Graph

More information

Statistical Analysis of Gene Expression Data Using Biclustering Coherent Column

Statistical Analysis of Gene Expression Data Using Biclustering Coherent Column Volume 114 No. 9 2017, 447-454 ISSN: 1311-8080 (printed version); ISSN: 1314-3395 (on-line version) url: http://www.ijpam.eu 1 ijpam.eu Statistical Analysis of Gene Expression Data Using Biclustering Coherent

More information

Understanding the Voice of the Customer

Understanding the Voice of the Customer Business white paper Understanding the Voice of the Customer How to effectively leverage customer insight with business discovery analytics Table of contents 3 Today s diverse brand interactions 3 A new

More information

TEXT MINING APPROACH TO EXTRACT KNOWLEDGE FROM SOCIAL MEDIA DATA TO ENHANCE BUSINESS INTELLIGENCE

TEXT MINING APPROACH TO EXTRACT KNOWLEDGE FROM SOCIAL MEDIA DATA TO ENHANCE BUSINESS INTELLIGENCE International Journal of Advance Research In Science And Engineering http://www.ijarse.com TEXT MINING APPROACH TO EXTRACT KNOWLEDGE FROM SOCIAL MEDIA DATA TO ENHANCE BUSINESS INTELLIGENCE R. Jayanthi

More information

Social Interaction in the Flickr Social Network

Social Interaction in the Flickr Social Network Social Interaction in the Flickr Social Network Karthik Gopalakrishnan, Arun Pandey and Joydeep Chandra Department of Computer Science and Engineering Indian Institute of Technology Patna Patna, India

More information