Mining Social Text Data. ZHAO Xin Renmin University of China

Size: px
Start display at page:

Download "Mining Social Text Data. ZHAO Xin Renmin University of China"

Transcription

1 Mining Social Text Data ZHAO Xin Renmin University of China

2 Overview Introduction to social text data Basic techniques to manipulate text data Typical text mining applications for social science Hot trend and future direction Conclusion

3 Overview Introduction to social text data Basic techniques to manipulate text data Typical text mining applications for social science Hot trend and future direction Conclusion

4 Social Text Data Various kinds of social media Text data is one of the most important data types Tweets Blogs Comments Profiles Messages

5 Characteristics of social text data User-oriented Social text data is created by users themselves A social networking account corresponds to an offline user in real life Users can generate text contents as they like It provides a valuable data resource to understand social users

6 Characteristics of social text data Opinionated text Not only including facts

7 Characteristics of social text data Informal and noisy text New terms were created by users A traditional vocabulary cannot work

8 Characteristics of social text data Rich-link text On Twitter, we can simply connect ourselves with any celebrity account by using two typical ways: mention and retweet (RT)

9 Characteristics of social text data Rich-type text Multiple types of information are often embedded in the text E.g., Hashtag Check-ins (Location and time) Phone type

10 Overview Introduction to social text data Basic techniques to manipulate text data Typical text mining applications for social science Hot trend and future direction Conclusion

11 Fundamental natural language processing techniques Turn text into semantic units Provide semantic units with meaningful annotations

12 Fundamental natural language processing techniques Turn text into semantic units Provide semantic units with meaningful annotations

13 For English Tokenization Delimiter-based tokenization method Is it a simple task?

14 For Chinese Chinese Segmentation No space characters help 他明天去香港 -> 他 -- 明天 -- 去 -- 香港 (He is going to HK tomorrow.) Nearly a well-done task On standard text collections, the accuracy is very high For social text data, it needs more improving Challenge: informal writing styles and out-ofvocabulary terms E.g., new terms created by social users

15 Fundamental natural language processing techniques Turn text into semantic units Provide semantic units with meaningful annotations

16 Part-of-speech tagging Word-category tagging/labeling/classification

17 Part-of-speech tagging (POS tagging) Word-category tagging

18 Named Entity Recognition (NER) A very important task: find and classify entity mentions in text, for example: The decision by the independent MP Andrew Wilkie to withdraw his support for the minority Labor government sounded dramatic but it should not further threaten its stability. When, after the 2010 election, Wilkie, Rob Oakeshott, Tony Windsor and the Greens agreed to support Labor, they gave just two guarantees: confidence and supply.

19 Named Entity Recognition (NER) A very important task: find and classify entity mentions in text, for example: The decision by the independent MP Andrew Wilkie to withdraw his support for the minority Labor government sounded dramatic but it should not further threaten its stability. When, after the 2010 election, Wilkie, Rob Oakeshott, Tony Windsor and the Greens agreed to support Labor, they gave just two guarantees: confidence and supply.

20 Named Entity Recognition (NER) A very important task: find and classify entity mentions in text, for example: The decision by the independent MP Andrew Wilkie to withdraw his support for the minority Labor government sounded dramatic but it should not further threaten its stability. When, after the 2010 election, Wilkie, Rob Oakeshott, Tony Windsor and the Greens agreed to support Labor, they gave just two guarantees: confidence and supply. Person Date Location Organization

21 Entity Linking President Obama won the 2009 Nobel Peace Prize

22 Semantic Analysis Model Statistical topic models Word embedding models

23 Semantic Analysis Model Statistical topic models Word embedding models

24 Question: Statistical topic models Given a large text dataset, how do we know the major topics in it? We need a way to summarize and compress information

25 Output I: Statistical topic models Topics Can be understood as clusters of words

26 Output II: Statistical topic models Per-document topic distribution Can be understood as document representation

27 Put together Statistical topic models

28 Key ideas in topic models Capturing word co-occurrence If two words frequently co-occur in documents, these two words tend to be captured in top positions of the same topics The number of training documents are supposed to be large A few hundred documents will not work well

29 Semantic Analysis Model Statistical topic models Word embedding model

30 Word embedding Distributional semantics

31 Word embedding Distributional semantics

32 Word2Vec Input: a sequence of word tokens from a vocabulary Requiring a large amount of documents Output: a fixed-length embedding vector for each term in the vocabulary The embedding vector usually has several hundred dimensions with dense values

33 Word embedding Usage: finding similar words E.g., query word= Sweden

34 Interesting Observations China-Beijing = Japan - Tokyo

35 Overview Introduction to social text data Basic techniques to manipulate text data Typical text mining applications for social science Hot trend and future direction Conclusion

36 Typical text mining applications for Data cleaning social science Sentiment analysis User profiling User interest modeling

37 Typical text mining applications for social science Data cleaning Sentiment analysis User profiling User interest modeling

38 Data cleaning Cleaning social text data Very challenging to be perfect in practice E.g., word normalization SG => Singapore HK => Hong Kong But fortunately standard techniques usually work moderately well (with slight modifications) in many cases E.g., word2vec and topic models can capture the above abbreviation patterns to some extent

39 Spam detection Data cleaning Machine learning or rule-based approach The key is to select effective features based on user behaviors Multiple postings in a short period Redundant or nearly redundant postings Irrelevant postings

40 Typical text mining applications for Data cleaning social science Sentiment analysis User profiling User interest modeling

41 Key technical components Opinion extraction I bought an iphone a few days ago. It is such a nice phone. The touch screen is really cool. The voice quality is clear too. What do we see? Opinion holders: persons who make the comments Opinion targets: entities and their features/aspects Sentiments: opinions towards opinion targets

42 Key technical components Polarity identification Simplest form: Is the attitude of this text positive or negative? A classification-based approach Feature selection Opinion lexicon More complex forms: Five-star rating prediction A regression-based approach

43 For ecommerce: Feature-based opinion summary For each extracted aspect, we can derive the corresponding opinion score. 43

44 CALM Dow Jones Using Twitter sentiment for stock price prediction Bollen et al. (2011) The CALM mood predicts DJIA 3 days later 44

45 Twitter sentiment versus Gallup Poll of Consumer Confidence Brendan O'Connor, Ramnath Balasubramanyan, Bryan R. Routledge, and Noah A. Smith From Tweets to Polls: Linking Text Sentiment to Public Opinion Time Series. In ICWSM-2010

46 Typical text mining applications for social science Data cleaning and preprocessing Sentiment analysis User profiling User interest modeling

47 User profiling Direct extraction of demographics from social profiles Users fill in the profiles Sex Age Job Interests.

48 User profiling Implicit inference from social text data E.g., Amazon online reviews I bought my wife a new phone => SEX = MALE I worked in an IT company... => JOB = IT

49 Typical text mining applications for social science Data cleaning and preprocessing Sentiment analysis User profiling User interest modeling

50 User interest modeling Topical expert ranking Given a topic, we need to identify influential experts for that topic Two major components Interest identification and Influence ranking (Individual) Interest identification Find out what topics is a celebrity interested in (Overall) Influence ranking Decide how influential a celebrity is in a given topic

51 User interest modeling Topical expert ranking Topical word cloud Topical celebrity ranking

52 User interest modeling Collective interest or attention modeling Event detection

53 Overview Introduction to social text data Basic techniques to manipulate text data Typical text mining applications for social science Hot trend and future direction Conclusion

54 Deep learning for text mining Deep neural models are promising But relying on tedious parameter tuning and large training datasets Shallow neural models are more practical and easier to use by non-experts E.g., word2vec

55 Future directions for text mining Designing more effective information extraction models Joint utilizing multi-source information Developing large-scale and efficient algorithms

56 Toolkits NLP Toolkit list Topic models gibbslda.sourceforge.net Word2vec

57 Acknowledgement Thanks! Thanks for Prof. Zhu s invitation and organization Thanks for your attention Some slides are modified based on the Stanford NLP course slides.

58 References Wei Shen, Jianyong Wang, and Jiawei Han. Entity Linking with a Knowledge Base: Issues, Techniques, and Solutions. Transactions on Knowledge & Data Engineering 27(2): (2015) Mikolov, Tomas; Sutskever, Ilya; Chen, Kai; Corrado, Greg S.; Dean, Jeff (2013). Distributed representations of words and phrases and their compositionality. NIPS D. Blei. Probabilistic topic models. Communications of the ACM, 55(4):77 84, Ee-Peng Lim, Viet-An Nguyen, Nitin Jindal, Bing Liu, Hady Wirawan Lauw. Detecting product review spammers using rating behaviors. CIKM 2010: Yoshua Bengio, Aaron Courville, Pascal Vincent. Representation Learning: A Review and New Perspectives. Jinpeng Wang, Xin Zhao and Xiaoming Li. Mining New Business Opportunities: Identifying Trend related Products by Leveraging Commercial Intents from Microblogs. In EMNLP 13. Xin Zhao, YanweiGuo, Yulan He, Han Jiang, Yuexin Wu and Xiaoming Li. A Demographic-based System for Product Recommendation On Microblogs. In KDD Jinpeng Wang, Gao Cong, Xin Zhao, Xiaoming Li. Mining User Intents in Twitter: A Semi-Supervised Approach to Inferring Intent Categories for Tweets. In AAAI Xin Zhao, Jinpeng Wang, Yulan He, Ji-Rong Wen, Edward Y. Chang, Xiaoming Li. Mining Product Adopter Information from Online Reviews for Improving Product Recommendation. TKDD 10(3): 29 (2016) Xin Zhao, Yanwei Guo, Rui Yan, Yulan He, Xiaoming Li. Timeline generation with social attention. SIGIR 2013:

DETECTING COMMUNITIES BY SENTIMENT ANALYSIS

DETECTING COMMUNITIES BY SENTIMENT ANALYSIS DETECTING COMMUNITIES BY SENTIMENT ANALYSIS OF CONTROVERSIAL TOPICS SBP-BRiMS 2016 Kangwon Seo 1, Rong Pan 1, & Aleksey Panasyuk 2 1 Arizona State University 2 Air Force Research Lab July 1, 2016 OUTLINE

More information

From Tweets to Polls: Linking Text Sentiment to Public Opinion Time Series

From Tweets to Polls: Linking Text Sentiment to Public Opinion Time Series From Tweets to Polls: Linking Text Sentiment to Public Brendan O Connor Opinion Time Series Machine Learning Department Carnegie Mellon University http://brenocon.com Presentation at AAPOR, May 2012 Joint

More information

Data Preprocessing, Sentiment Analysis & NER On Twitter Data.

Data Preprocessing, Sentiment Analysis & NER On Twitter Data. IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727 PP 73-79 www.iosrjournals.org Data Preprocessing, Sentiment Analysis & NER On Twitter Data. Mr.SanketPatil, Prof.VarshaWangikar,

More information

Text, Web, and Social Media Analytics

Text, Web, and Social Media Analytics Kecerdasan Bisnis dalam Praktek Predictive Analytics II Text, Web, and Social Media Analytics Husni Lab. Riset JTIF UTM Sumber awal: http://mail.tku.edu.tw/myday/teaching/1071/bi/1071bi07_business_intelligence.pptx

More information

IMPLEMENTING ARTIFICIAL NEURAL NETWORK IN ENHANCED MICRO BLOG FOR SOCIAL NETWORK, E-COMMERCE AND NEWS CHANNELS

IMPLEMENTING ARTIFICIAL NEURAL NETWORK IN ENHANCED MICRO BLOG FOR SOCIAL NETWORK, E-COMMERCE AND NEWS CHANNELS IMPLEMENTING ARTIFICIAL NEURAL NETWORK IN ENHANCED MICRO BLOG FOR SOCIAL NETWORK, E-COMMERCE AND NEWS CHANNELS 1 Rathidevi. M, 2 Dr.S.V.Sudha 1 PG Scholar, Department of Computer Science and Engineering,

More information

2016 U.S. PRESIDENTIAL ELECTION FAKE NEWS

2016 U.S. PRESIDENTIAL ELECTION FAKE NEWS 2016 U.S. PRESIDENTIAL ELECTION FAKE NEWS OVERVIEW Introduction Main Paper Related Work and Limitation Proposed Solution Preliminary Result Conclusion and Future Work TWITTER: A SOCIAL NETWORK AND A NEWS

More information

Tweet Segmentation Using Correlation & Association

Tweet Segmentation Using Correlation & Association 2017 IJSRST Volume 3 Issue 3 Print ISSN: 2395-6011 Online ISSN: 2395-602X Themed Section: Science and Technology Tweet Segmentation Using Correlation & Association Mr. Umesh A. Patil 1, Miss. Madhuri M.

More information

Sentiment Analysis: A discovery challenge. Bing Liu University of Illinois at Chicago

Sentiment Analysis: A discovery challenge. Bing Liu University of Illinois at Chicago Sentiment Analysis: A discovery challenge Bing Liu University of Illinois at Chicago liub@cs.uic.edu Introduction Opinion mining or sentiment analysis q computational study of opinions, sentiments, appraisal,

More information

ADVANCES in NATURAL and APPLIED SCIENCES

ADVANCES in NATURAL and APPLIED SCIENCES ADVANCES in NATURAL and APPLIED SCIENCES ISSN: 1995-0772 Published BYAENSI Publication EISSN: 1998-1090 http://www.aensiweb.com/anas 2017 June 11(8): pages 537-541 Open Access Journal Tweet Segmentation

More information

TEXT MINING - SENTIMENT ANALYSIS /OPINION MINING

TEXT MINING - SENTIMENT ANALYSIS /OPINION MINING TEXT MINING - SENTIMENT ANALYSIS /OPINION MINING SRI SURYA NULU EMIS-DATA MINING 8331 SOUTHERN METHODIST UNIVERSITY 1 SNULU@SMU.EDU SRISURYANULU@GMAIL.COM AGENDA Introduction Definition of opinion mining

More information

CONNECTING SOCIAL MEDIA TO ECOMMERCE USING MICROBLOGGING AND ARTIFICIAL NEURAL NETWORK

CONNECTING SOCIAL MEDIA TO ECOMMERCE USING MICROBLOGGING AND ARTIFICIAL NEURAL NETWORK CONNECTING SOCIAL MEDIA TO ECOMMERCE USING MICROBLOGGING AND ARTIFICIAL NEURAL NETWORK Ms.S.P.VidhyaPriya 1,B.Gokhila 2, T.Santhiya 3, K.Saranya 4 1 M.E.,Assistant Professor-CSE, Kathir College Of Engineering,

More information

International Journal of Scientific & Engineering Research, Volume 6, Issue 3, March ISSN Web and Text Mining Sentiment Analysis

International Journal of Scientific & Engineering Research, Volume 6, Issue 3, March ISSN Web and Text Mining Sentiment Analysis International Journal of Scientific & Engineering Research, Volume 6, Issue 3, March-2015 672 Web and Text Mining Sentiment Analysis Ms. Anjana Agrawal Abstract This paper describes the key steps followed

More information

Predicting the Odds of Getting Retweeted

Predicting the Odds of Getting Retweeted Predicting the Odds of Getting Retweeted Arun Mahendra Stanford University arunmahe@stanford.edu 1. Introduction Millions of people tweet every day about almost any topic imaginable, but only a small percent

More information

Smartphone Product Review Sentiment Analysis using Logistic Regression

Smartphone Product Review Sentiment Analysis using Logistic Regression I J C T A, 9(26) 2016, pp. 343-349 International Science Press Smartphone Product Review Sentiment Analysis using Logistic Regression M. Lovelin Ponn Felciah 1 and R. Anbuselvi 2 ABSTRACT Sentiment analysis

More information

Sentiment and Intent Analysis for Business Intelligence

Sentiment and Intent Analysis for Business Intelligence Sentiment and Intent Analysis for Business Intelligence Lalit Pathak 1, Sanjana Mulchandani 2, Mayuri Kate 3, Kajal Khatwani 4, Sujata Khedkar 5 U.G. Student, Department of Computer Engineering, VESIT

More information

Using Twitter as a source of information for stock market prediction

Using Twitter as a source of information for stock market prediction Using Twitter as a source of information for stock market prediction Argimiro Arratia argimiro@lsi.upc.edu LSI, Univ. Politécnica de Catalunya, Barcelona (Joint work with Marta Arias and Ramón Xuriguera)

More information

Product improvement based on online reviews from product designer s perspective

Product improvement based on online reviews from product designer s perspective IOP Conference Series: Materials Science and Engineering PAPER OPEN ACCESS Product improvement based on online reviews from product designer s perspective To cite this article: Shugang Li and Jiali Kong

More information

TWITTER SENTIMENT ANALYSIS AND TEXT MINING IN R

TWITTER SENTIMENT ANALYSIS AND TEXT MINING IN R TWITTER SENTIMENT ANALYSIS AND TEXT MINING IN R Rishabh Nigam Department of IT Maharaja Agrasen Institute of Technology, Delhi, India Email- nigamrishabh10@gmail.com Nidhi Sengar Department of IT Maharaja

More information

PRODUCT REVIEW SENTIMENT ANALYSIS WITH TRENDY SYNTHETIC DICTIONARY AND AMBIGUITY MITIGATION

PRODUCT REVIEW SENTIMENT ANALYSIS WITH TRENDY SYNTHETIC DICTIONARY AND AMBIGUITY MITIGATION PRODUCT REVIEW SENTIMENT ANALYSIS WITH TRENDY SYNTHETIC DICTIONARY AND AMBIGUITY MITIGATION Prof. A. N. Nawathe anunawathe@gmail.com Godge Isha Sudhir ishagodge37@gmail.com Dang Poornima Mahesh dang.poornima@gmail.com

More information

Trend Extraction Method using Co-occurrence Patterns from Tweets

Trend Extraction Method using Co-occurrence Patterns from Tweets Information Engineering Express International Institute of Applied Informatics 2016, Vol.2, No.4, 1 10 Trend Extraction Method using Co-occurrence Patterns from Tweets Shotaro Noda and Katsuhide Fujita

More information

Application of Location-Based Sentiment Analysis Using Twitter for Identifying Trends Towards Indian General Elections 2014

Application of Location-Based Sentiment Analysis Using Twitter for Identifying Trends Towards Indian General Elections 2014 Application of Location-Based Sentiment Analysis Using Twitter for Identifying Trends Towards Indian General Elections 2014 Omaima Almatrafi Suhem Parack Bravim Chavan George Mason University George Mason

More information

A SURVEY ON PRODUCT REVIEW SENTIMENT ANALYSIS

A SURVEY ON PRODUCT REVIEW SENTIMENT ANALYSIS A SURVEY ON PRODUCT REVIEW SENTIMENT ANALYSIS Godge Isha Sudhir ishagodge37@gmail.com Arvikar Shruti Sanjay shrutiarvikar89@gmail.com Dang Poornima Mahesh dang.poornima@gmail.com Maske Rushikesh Kantrao

More information

CS47300: Web Information Search and Management

CS47300: Web Information Search and Management CS47300: Web Information Search and Management Sentiment Analysis 27 October 2017 Prof. Chris Clifton Most slides courtesy Prof. Bing Liu Introduction Two main types of textual information. Facts and Opinions

More information

Product Aspect Ranking Based on the Customers Reviews.

Product Aspect Ranking Based on the Customers Reviews. Product Aspect Ranking Based on the Customers Reviews. Kadam Snehal, Dhomse Kalyani, Bhumkar Smita, Janrao Swati, Ambhore Shakuntala BE Scholar, Department of Computer Engineering S.R.E.S. s College of

More information

Estimating the Impact of User Personality Traits on electronic Word-of-Mouth Text-mining Social Media Platforms

Estimating the Impact of User Personality Traits on electronic Word-of-Mouth Text-mining Social Media Platforms Estimating the Impact of User Personality Traits on electronic Word-of-Mouth Text-mining Social Media Platforms Panos Adamopoulos Goizueta Business School Emory University padamop@emory.edu Anindya Ghose

More information

Reaction Paper Regarding the Flow of Influence and Social Meaning Across Social Media Networks

Reaction Paper Regarding the Flow of Influence and Social Meaning Across Social Media Networks Reaction Paper Regarding the Flow of Influence and Social Meaning Across Social Media Networks Mahalia Miller Daniel Wiesenthal October 6, 2010 1 Introduction One topic of current interest is how language

More information

Cyber-Social-Physical Features for Mood Prediction over Online Social Networks

Cyber-Social-Physical Features for Mood Prediction over Online Social Networks DEIM Forum 2017 B1-3 Cyber-Social-Physical Features for Mood Prediction over Online Social Networks Chaima DHAHRI Kazunori MATSUMOTO and Keiichiro HOASHI KDDI Research, Inc 2-1-15 Ohara, Fujimino-shi,

More information

Comparing Twitter and Traditional Media using Topic Models

Comparing Twitter and Traditional Media using Topic Models Singapore Management University Institutional Knowledge at Singapore Management University Research Collection School Of Information Systems School of Information Systems 4-2011 Comparing Twitter and Traditional

More information

COMMTRUST: A MULTI-DIMENSIONAL TRUST MODEL FOR E-COMMERCE APPLICATIONS

COMMTRUST: A MULTI-DIMENSIONAL TRUST MODEL FOR E-COMMERCE APPLICATIONS COMMTRUST: A MULTI-DIMENSIONAL TRUST MODEL FOR E-COMMERCE APPLICATIONS M. Divya 1, Y. Sagar 2 1 M.Tech (Software Engineering) Student, VNR Vignana Jyothi Institute of Engineering & Technology, Hyderabad,

More information

Sentiment Analysis and Political Party Classification in 2016 U.S. President Debates in Twitter

Sentiment Analysis and Political Party Classification in 2016 U.S. President Debates in Twitter Sentiment Analysis and Political Party Classification in 2016 U.S. President Debates in Twitter Tianyu Ding 1 and Junyi Deng 1 and Jingting Li 1 and Yu-Ru Lin 1 1 University of Pittsburgh, Pittsburgh PA

More information

Lumière. A Smart Review Analysis Engine. Ruchi Asthana Nathaniel Brennan Zhe Wang

Lumière. A Smart Review Analysis Engine. Ruchi Asthana Nathaniel Brennan Zhe Wang Lumière A Smart Review Analysis Engine Ruchi Asthana Nathaniel Brennan Zhe Wang Purpose A rapid increase in Internet users along with the growing power of online reviews has given birth to fields like

More information

Collective Supervision of Topic Models for Predicting Surveys with Social Media

Collective Supervision of Topic Models for Predicting Surveys with Social Media Joint work with Adrian Benton, Braden Hancock, and Mark Dredze Collective Supervision of Topic Models for Predicting Surveys with Social Media Michael J. Paul University of Colorado, Boulder February 14,

More information

Public Opinion Mining For Effective Policy Making

Public Opinion Mining For Effective Policy Making Public Opinion Mining For Effective Policy Making Mr. AmitVasant Pandhare 1, Dr. Abrar Alvi 2, 1 Student,Computer Science and Engg. Prof.RamMeghe Institute Of Tech. and Research, Badnera, Amravati, Maharashtra,

More information

HIERARCHICAL LOCATION CLASSIFICATION OF TWITTER USERS WITH A CONTENT BASED PROBABILITY MODEL. Mounika Nukala

HIERARCHICAL LOCATION CLASSIFICATION OF TWITTER USERS WITH A CONTENT BASED PROBABILITY MODEL. Mounika Nukala HIERARCHICAL LOCATION CLASSIFICATION OF TWITTER USERS WITH A CONTENT BASED PROBABILITY MODEL by Mounika Nukala Submitted in partial fulfilment of the requirements for the degree of Master of Computer Science

More information

Automatic Detection of Rumor on Social Network

Automatic Detection of Rumor on Social Network Automatic Detection of Rumor on Social Network Qiao Zhang 1,2, Shuiyuan Zhang 1,2, Jian Dong 3, Jinhua Xiong 2(B), and Xueqi Cheng 2 1 University of Chinese Academy of Sciences, Beijing, China 2 Institute

More information

Predicting Popularity of Messages in Twitter using a Feature-weighted Model

Predicting Popularity of Messages in Twitter using a Feature-weighted Model International Journal of Advanced Intelligence Volume 0, Number 0, pp.xxx-yyy, November, 20XX. c AIA International Advanced Information Institute Predicting Popularity of Messages in Twitter using a Feature-weighted

More information

A SURVEY PAPER ON TWITTER DATA ANALYSIS FOR TERRORIST ATTACKS AND VARIOUS ASPECTS

A SURVEY PAPER ON TWITTER DATA ANALYSIS FOR TERRORIST ATTACKS AND VARIOUS ASPECTS A SURVEY PAPER ON TWITTER DATA ANALYSIS FOR TERRORIST ATTACKS AND VARIOUS ASPECTS Prof. Anjalidevi Pujari 1, Neha Amolik 2, Varsha Ibitwar 3, Apurva Keskar 4 1 Professor, Department of computer engineering,

More information

Weighted Fuzzy Rule Based Sentiment Prediction Analysis on Tweets

Weighted Fuzzy Rule Based Sentiment Prediction Analysis on Tweets , pp.240-244 http://dx.doi.org/10.14257/astl.2017.143.48 Weighted Fuzzy Rule Based Sentiment Prediction Analysis on Tweets Syed Muzamil Basha, Dharmendra Singh Rajput, Iyengar N.Ch.S.N, Ronnie D. Caytiles

More information

Analyzing Self-Driving Cars on Twitter

Analyzing Self-Driving Cars on Twitter Analyzing Self-Driving Cars on Twitter Rizwan Sadiq 1 and Mohsin Khan 2 1 Lecturer at COMSATS Institute of Information Technology, Abbottabad, Pakistan rizwansadiq@ciit.net.pk 2 International Islamic University,

More information

Tweeting Questions in Academic Conferences: Seeking or Promoting Information?

Tweeting Questions in Academic Conferences: Seeking or Promoting Information? Tweeting Questions in Academic Conferences: Seeking or Promoting Information? Xidao Wen, University of Pittsburgh Yu-Ru Lin, University of Pittsburgh Abstract The fast growth of social media has reshaped

More information

KNOWLEDGE DISCOVERY AND TWITTER SENTIMENT ANALYSIS: MINING PUBLIC OPINION AND STUDYING ITS CORRELATION WITH POPULARITY OF INDIAN MOVIES

KNOWLEDGE DISCOVERY AND TWITTER SENTIMENT ANALYSIS: MINING PUBLIC OPINION AND STUDYING ITS CORRELATION WITH POPULARITY OF INDIAN MOVIES INTERNATIONAL JOURNAL OF MANAGEMENT (IJM) Volume 6, Issue 1, January (2015), pp. 686-696 IAEME ISSN 0976-6502 (Print) ISSN 0976-6510 (Online) Volume 6, Issue 1, January (2015), pp. 697-705 IAEME: http://www.iaeme.com/ijm.asp

More information

RANKING RADICALLY IMPLICIT WEB FORUM USERS BY ASPECT SCORE

RANKING RADICALLY IMPLICIT WEB FORUM USERS BY ASPECT SCORE RANKING RADICALLY IMPLICIT WEB FORUM USERS BY ASPECT SCORE Prithivika.M Kothai.K Padmapriya.M Mrs.C.Jackulin,Assistant Professor, Department of CSE, Abstract In the computational linguistics the extraction

More information

Public Opinion Mining on Social Media: A Case Study of Twitter Opinion on Nuclear Power 1

Public Opinion Mining on Social Media: A Case Study of Twitter Opinion on Nuclear Power 1 , pp.224-228 http://dx.doi.org/10.14257/astl.2014.51.51 Public Opinion Mining on Social Media: A Case Study of Twitter Opinion on Nuclear Power 1 DongSung Kim 2 and Jong Woo Kim 2,3 2 222 Wangsimni-ro,

More information

Discovering business insights from the voice of the customer. Medallia s Approach to Text Analytics. by Ji Fang, Ph.D.

Discovering business insights from the voice of the customer. Medallia s Approach to Text Analytics. by Ji Fang, Ph.D. Discovering business insights from the voice of the customer. Medallia s Approach to Text Analytics by Ji Fang, Ph.D. TABLE OF CONTENTS A good text analytics solution... The science behind text analytics...

More information

On utility of temporal embeddings for skill matching. Manisha Verma, PhD student, UCL Nathan Francis, NJFSearch

On utility of temporal embeddings for skill matching. Manisha Verma, PhD student, UCL Nathan Francis, NJFSearch On utility of temporal embeddings for skill matching Manisha Verma, PhD student, UCL Nathan Francis, NJFSearch Skill Trend Importance 1. Constant evolution of labor market yields differences in importance

More information

Published by: PIONEER RESEARCH & DEVELOPMENT GROUP (www.prdg.org) 1

Published by: PIONEER RESEARCH & DEVELOPMENT GROUP (www.prdg.org) 1 Real Time Social Data Analysis for Forecasting Nature Disasters P.Balaji, G.Ilancheziapandian 2, A.Appandairaj 3 PG Student, Department of Computer Science and Engineering, Ganadipathy Tulsi s Jain Engineering

More information

Comparing Twitter and Traditional Media Using Topic Models

Comparing Twitter and Traditional Media Using Topic Models Comparing Twitter and Traditional Media Using Topic Models Wayne Xin Zhao 1, Jing Jiang 2, Jianshu Weng 2,JingHe 1,Ee-PengLim 2, Hongfei Yan 1, and Xiaoming Li 1 1 Peking University, China 2 Singapore

More information

Online Journal of Communication and Media Technologies Volume: 7 Issue: 1 January Social Media Aided Sentiment Analysis in Forecasting

Online Journal of Communication and Media Technologies Volume: 7 Issue: 1 January Social Media Aided Sentiment Analysis in Forecasting Social Media Aided Sentiment Analysis in Forecasting K.Nirmala Devi, Kongu Engineering College, India Abstract User generated contents on web and social media grow rapidly in this emerging information

More information

Inclusion London Guide to Twitter for Campaigners

Inclusion London Guide to Twitter for Campaigners Inclusion London Guide to Twitter for Campaigners 1 Contents Section 1: Twitter Basics... 3 What is Twitter?... 3 How it works... 3 Basics of a tweet... 4 Basics of the Twitter layout... 5 How to tweet...

More information

Predicting User Opinion in E-commerce Website Using Aggregate Ranking Algorithm

Predicting User Opinion in E-commerce Website Using Aggregate Ranking Algorithm Predicting User Opinion in E-commerce Website Using Aggregate Ranking Algorithm Pusarla.Divya Bharathi 1, A.Gauthami Latha 2 P.G. Student, Department of Computer Engineering, Miracle Educational Society

More information

Speech Analytics Transcription Accuracy

Speech Analytics Transcription Accuracy Speech Analytics Transcription Accuracy Understanding Verint s speech analytics transcription and categorization accuracy Verint.com Twitter.com/verint Facebook.com/verint Blog.verint.com Table of Contents

More information

Normalization of Text in Social Media: Analyzing the Need for Pre-processing Techniques and its Roles Nimala.K Dr.Thenmozhi. S Dr.ThamizhArasan.

Normalization of Text in Social Media: Analyzing the Need for Pre-processing Techniques and its Roles Nimala.K Dr.Thenmozhi. S Dr.ThamizhArasan. Volume 119 No. 12 2018, 2033-2037 ISSN: 1314-3395 (on-line version) url: http://www.ijpam.eu ijpam.eu Normalization of Text in Social Media: Analyzing the Need for Pre-processing Techniques and its Roles

More information

Cloud Assisted Trend Analysis of Twitter Data using Hadoop

Cloud Assisted Trend Analysis of Twitter Data using Hadoop ISSN: 2454-132X Impact factor: 4.295 (Volume 4, Issue 2) Available online at: www.ijariit.com Cloud Assisted Trend Analysis of Twitter Data using Hadoop Aman Gupta vasugupta42@gmail.com Ishaan Bhasin ishaanbhasin96@gmail.com

More information

Project Presentations - 2

Project Presentations - 2 Project Presentations - 2 CMSC 498J: Social Media Computing Department of Computer Science University of Maryland Spring 2016 Hadi Amiri hadi@umd.edu Project Titles G8. Trends of Trendy Words David, Kevin,

More information

Stream Clustering of Tweets

Stream Clustering of Tweets Stream Clustering of Tweets Sophie Baillargeon Département de mathématiques et de statistique Université Laval Québec (Québec), Canada G1V 0A6 Email: sophie.baillargeon@mat.ulaval.ca Simon Hallé Thales

More information

Robust Social Event Detection in Twitter. Final Presentation S. E. Bekçe & F. Amira CS533 Spring 2016

Robust Social Event Detection in Twitter. Final Presentation S. E. Bekçe & F. Amira CS533 Spring 2016 1 Robust Social Event Detection in Twitter Final Presentation S. E. Bekçe & F. Amira CS533 Spring 2016 2 Motivation & Goals Protests in Turkey are common but they don t often broadcasted by news agencies

More information

Mining Trending Hash Tags for Arabic Sentiment Analysis

Mining Trending Hash Tags for Arabic Sentiment Analysis Mining Trending Hash Tags for Arabic Sentiment Analysis Yahya AlMurtadha Department of Computer Science Faculty of Computing and Information Technology, University of Tabuk Tabuk, Kingdom of Saudi Arabia

More information

Mining User Intents in Twitter: A Semi-Supervised Approach to Inferring Intent Categories for Tweets

Mining User Intents in Twitter: A Semi-Supervised Approach to Inferring Intent Categories for Tweets Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence Mining User Intents in Twitter: A Semi-Supervised Approach to Inferring Intent Categories for Tweets Jinpeng Wang 1, Gao Cong

More information

The Analysis of Hotel Customer Generated Contents in Weblogs

The Analysis of Hotel Customer Generated Contents in Weblogs 2011 International Conference on Financial Management and Economics IPEDR vol.11 (2011) (2011) IACSIT Press, Singapore The Analysis of Hotel Customer Generated Contents in Weblogs Rueijiau Sung, Chaochang

More information

Communicating with the Modern Citizen

Communicating with the Modern Citizen Communicating with the Modern Citizen Rise of the social citizen Changes how we work 44% of citizens complain via social media 20% expect a response within one hour via social media Always on -- Citizens

More information

Quantising Opinions for Political Tweets Analysis

Quantising Opinions for Political Tweets Analysis Quantising Opinions for Political Tweets Analysis Yulan He 1, Hassan Saif 1, Zhongyu Wei 2, Kam-Fai Wong 2 1 Knowledge Media Institute, The Open University, UK {y.he, h.saif}@open.ac.uk 2 Dept. of Systems

More information

Real-Time News Certification System on Sina Weibo

Real-Time News Certification System on Sina Weibo Real-Time News Certification System on Sina Weibo Xing Zhou 1,2, Juan Cao 1, Zhiwei Jin 1,2,Fei Xie 3,Yu Su 3,Junqiang Zhang 1,2,Dafeng Chu 3,,Xuehui Cao 3 1 Key Laboratory of Intelligent Information Processing

More information

Sentiment analysis using Singular Value Decomposition

Sentiment analysis using Singular Value Decomposition International Journal of Current Engineering and Technology E-ISSN 2277 4106, P-ISSN 2347 5161 2016 INPRESSCO, All Rights Reserved Available at http://inpressco.com/category/ijcet Research Article Veena

More information

Empower Social Influence by Linking the Social Opinion to Social Network

Empower Social Influence by Linking the Social Opinion to Social Network Association for Information Systems AIS Electronic Library (AISeL) PACIS 2017 Proceedings Pacific Asia Conference on Information Systems (PACIS) Summer 7-19-2017 Empower Social Influence by Linking the

More information

Suraj Shinde. Inteligencia Artificial aplicada a la Industria. May 31th - Medellín, Colombia

Suraj Shinde. Inteligencia Artificial aplicada a la Industria. May 31th - Medellín, Colombia Suraj Shinde Inteligencia Artificial aplicada a la Industria May 31th - Medellín, Colombia Artificial Intelligence Overview Field of AI: Machine Learning is about training algorithms to build models to

More information

10 Online Communication Channels

10 Online Communication Channels 10 Online Communication Channels Ppt 11-12 Online communication channels and tools toolkit Spread the word Love it or loathe it, social media is here to stay and should form an essential component of any

More information

Reputation trust Score Evaluation for E-Commerce Using Feedback Rating and Comments

Reputation trust Score Evaluation for E-Commerce Using Feedback Rating and Comments Reputation trust Score Evaluation for E-Commerce Using Feedback Rating and Comments Vinaya Ramesh Firake BE(IT),ME(CSE) Abstract: Reputation trust is one of the important factors of the development of

More information

Classification Model for Intent Mining in Personal Website Based on Support Vector Machine

Classification Model for Intent Mining in Personal Website Based on Support Vector Machine , pp.145-152 http://dx.doi.org/10.14257/ijdta.2016.9.2.16 Classification Model for Intent Mining in Personal Website Based on Support Vector Machine Shuang Zhang, Nianbin Wang School of Computer Science

More information

How hot will it get? Modeling scientific discourse about literature

How hot will it get? Modeling scientific discourse about literature How hot will it get? Modeling scientific discourse about literature Project Aims Natalie Telis, CS229 ntelis@stanford.edu Many metrics exist to provide heuristics for quality of scientific literature,

More information

Predicting Restaurants Rating And Popularity Based On Yelp Dataset

Predicting Restaurants Rating And Popularity Based On Yelp Dataset CS 229 MACHINE LEARNING FINAL PROJECT 1 Predicting Restaurants Rating And Popularity Based On Yelp Dataset Yiwen Guo, ICME, Anran Lu, ICME, and Zeyu Wang, Department of Economics, Stanford University Abstract

More information

Who is Retweeting the Tweeters? Modeling Originating and Promoting Behaviors in Twitter Network

Who is Retweeting the Tweeters? Modeling Originating and Promoting Behaviors in Twitter Network Who is Retweeting the Tweeters? Modeling Originating and Promoting Behaviors in Twitter Network Aek Palakorn Achananuparp, Ee-Peng Lim, Jing Jiang, and Tuan-Anh Hoang Living Analytics Research Centre,

More information

Mining Reviews for Product Comparison and Recommendation

Mining Reviews for Product Comparison and Recommendation + Mining Reviews for Product Comparison and Recommendation Jianshu Sun, Chong Long, Xiaoyan Zhu, and Minlie Huang Abstract Recently, as the amount of customer reviews grows rapidly on product service websites,

More information

Hashtag-centric Immersive Search on Social Media

Hashtag-centric Immersive Search on Social Media Hashtag-centric Immersive Search on Social Media Yuqi Gao, Jitao Sang, Tongwei Ren, Changsheng Xu State Key Laboratory for Novel Software Technology, Nanjing University National Lab of Pattern Recognition,

More information

#bias: Measuring the Tweeting Behavior of Propagandists

#bias: Measuring the Tweeting Behavior of Propagandists Proceedings of the Sixth International AAAI Conference on Weblogs and Social Media #bias: Measuring the Tweeting Behavior of Propagandists Cristian Lumezanu NEC Laboratories America, Georgia Tech lume@nec-labs.com

More information

Twitter Sentiment Analysis on Demonetization tweets in India Using R language

Twitter Sentiment Analysis on Demonetization tweets in India Using R language Impact Factor Value: 4.029 ISSN: 2349-7084 International Journal of Computer Engineering In Research Trends Volume 4, Issue 6, June-2017, pp. 252-258 www.ijcert.org Twitter Sentiment Analysis on Demonetization

More information

Indian Election Trend Prediction Using Improved Competitive Vector Regression Model

Indian Election Trend Prediction Using Improved Competitive Vector Regression Model Indian Election Trend Prediction Using Improved Competitive Vector Regression Model Navya S 1 1 Department of Computer Science and Engineering, University, India Abstract Election result forecasting has

More information

Identifying Dynamic Topics of Interest across Social Networks

Identifying Dynamic Topics of Interest across Social Networks Identifying Dynamic Topics of Interest across Social Networks Mohamed Salaheldin Aly, Abeer Al Korany Department of Computer Science Faculty of Computers and Information Cairo, Egypt Abstract Information

More information

Large Scale High-Precision Topic Modeling on Twitter. Shuang Yang, Alek Kolcz Andy Schlaikjer, Pankaj Gupta

Large Scale High-Precision Topic Modeling on Twitter. Shuang Yang, Alek Kolcz Andy Schlaikjer, Pankaj Gupta Large Scale High-Precision Topic Modeling on Twitter Shuang Yang, Alek Kolcz Andy Schlaikjer, Pankaj Gupta Topic modeling of Tweets Many Use Cases Business intelligence & analytics sports technology government

More information

Classification of Microblogging Messages into General Topics Based on the Content of the Messages with Clustering

Classification of Microblogging Messages into General Topics Based on the Content of the Messages with Clustering Classification of Microblogging Messages into General Topics Based on the Content of the Messages with Clustering Santrupti S. Desai, Manohar Madgi Abstract Since microblogging messages such as tweets

More information

Assistant Professor Neha Pandya Department of Information Technology, Parul Institute Of Engineering & Technology Gujarat Technological University

Assistant Professor Neha Pandya Department of Information Technology, Parul Institute Of Engineering & Technology Gujarat Technological University Feature Level Text Categorization For Opinion Mining Gandhi Vaibhav C. Computer Engineering Parul Institute Of Engineering & Technology Gujarat Technological University Assistant Professor Neha Pandya

More information

SAS Visual Analytics: Text Analytics Using Word Clouds

SAS Visual Analytics: Text Analytics Using Word Clouds ABSTRACT Paper 1687-2018 SAS Visual Analytics: Text Analytics Using Word Clouds Jenine Milum, Atlanta, GA, USA There exists a limit to employee skills and capacity to efficiently analyze volumes of textual

More information

TEXT MINING APPROACH TO EXTRACT KNOWLEDGE FROM SOCIAL MEDIA DATA TO ENHANCE BUSINESS INTELLIGENCE

TEXT MINING APPROACH TO EXTRACT KNOWLEDGE FROM SOCIAL MEDIA DATA TO ENHANCE BUSINESS INTELLIGENCE International Journal of Advance Research In Science And Engineering http://www.ijarse.com TEXT MINING APPROACH TO EXTRACT KNOWLEDGE FROM SOCIAL MEDIA DATA TO ENHANCE BUSINESS INTELLIGENCE R. Jayanthi

More information

Various Techniques for Efficient Retrieval of Contents across Social Networks Based On Events

Various Techniques for Efficient Retrieval of Contents across Social Networks Based On Events Various Techniques for Efficient Retrieval of Contents across Social Networks Based On Events SAarif Ahamed 1 First Year ME (CSE) Department of CSE MIET EC ahamedaarif@yahoocom BAVishnupriya 1 First Year

More information

Final Report: Local Structure and Evolution for Cascade Prediction

Final Report: Local Structure and Evolution for Cascade Prediction Final Report: Local Structure and Evolution for Cascade Prediction Jake Lussier (lussier1@stanford.edu), Jacob Bank (jbank@stanford.edu) ABSTRACT Information cascades in large social networks are complex

More information

SENTIMENT-BASED MODEL FOR REPUTATION SYSTEMS IN AMAZON

SENTIMENT-BASED MODEL FOR REPUTATION SYSTEMS IN AMAZON SENTIMENT-BASED MODEL FOR REPUTATION SYSTEMS IN AMAZON Milad Sharif msharif@stanford.edu Soheil Norouzi snorouzi@stanford.edu 1. INTRODUCTION When buyers purchase products from an online retailer such

More information

A Two-Phase Model for Retweet Number Prediction

A Two-Phase Model for Retweet Number Prediction A Two-Phase Model for Retweet Number Prediction Gang Liu 1, Chuan Shi 1,*, Qing Chen 2, Bin Wu 1, and Jiayin Qi 1 1 Beijing University of Posts and Telecommunications, Beijing, China {liugang519,shichuan,wubin}@bupt.edu.cn,

More information

Analyzing Emotion on Twitter for User Modeling

Analyzing Emotion on Twitter for User Modeling Analyzing Emotion on Twitter for User Modeling Master s Thesis, October 30 th 2013 Zhaolong Li Analyzing Emotion on Twitter for User Modeling THESIS Submitted in the partial

More information

Data Mining in Social Network. Presenter: Keren Ye

Data Mining in Social Network. Presenter: Keren Ye Data Mining in Social Network Presenter: Keren Ye References Kwak, Haewoon, et al. "What is Twitter, a social network or a news media?." Proceedings of the 19th international conference on World wide web.

More information

LE NUOVE FRONTIERE DALL AI ALL AR E L IMPATTO SULLA QUOTIDIANITÀ

LE NUOVE FRONTIERE DALL AI ALL AR E L IMPATTO SULLA QUOTIDIANITÀ LE NUOVE FRONTIERE DALL AI ALL AR E L IMPATTO SULLA QUOTIDIANITÀ Deloitte Analytics & Information Management Torino, 26/03/2018 1 AUGMENTED REALITY FUNDAMENTALS EXAMPLES OF DEEP LEARNING ARTIFICIAL INTELLIGENCE

More information

Knowledge Discovery and Data Mining

Knowledge Discovery and Data Mining Knowledge Discovery and Data Mining Unit # 19 1 Acknowledgement The following discussion is based on the paper Mining Big Data: Current Status, and Forecast to the Future by Fan and Bifet and online presentation

More information

Feature Extraction from Micro-blogs for Comparison of Products and Services

Feature Extraction from Micro-blogs for Comparison of Products and Services Feature Extraction from Micro-blogs for Comparison of Products and Services Peng Zhao, Xue Li, and Ke Wang Nanjing University, 210093 China, The University of Queensland, QLD 4072 Australia, Simon Fraser

More information

Data enrichment and model creation using text mining and other unstructured data

Data enrichment and model creation using text mining and other unstructured data Image: used under license from shutterstock.com Data enrichment and model creation using text mining and other unstructured data Dr. Andreas Nawroth Head of Artificial Intelligence German Data Science

More information

PAST research has shown that real-time Twitter data can

PAST research has shown that real-time Twitter data can Algorithmic Trading of Cryptocurrency Based on Twitter Sentiment Analysis Stuart Colianni, Stephanie Rosales, and Michael Signorotti ABSTRACT PAST research has shown that real-time Twitter data can be

More information

International Journal of Advance Research in Engineering, Science & Technology DUAL SENTIMENTAL ANALYSIS OF USER COMMENTS IN ONLINE REVIEW SYSTEM

International Journal of Advance Research in Engineering, Science & Technology DUAL SENTIMENTAL ANALYSIS OF USER COMMENTS IN ONLINE REVIEW SYSTEM Impact Factor (SJIF): 5.301 International Journal of Advance Research in Engineering, Science & Technology e-issn: 2393-9877, p-issn: 2394-2444 Volume 5, Issue 3, March-2018 DUAL SENTIMENTAL ANALYSIS OF

More information

Community Level Topic Diffusion

Community Level Topic Diffusion Community Level Topic Diffusion Zhiting Hu 1,3, Junjie Yao 2, Bin Cui 1, Eric Xing 1,3 1 Peking Univ., China 2 East China Normal Univ., China 3 Carnegie Mellon Univ. OUTLINE Background Model: COLD Diffusion

More information

International Journal of Advance Engineering and Research Development A SENTIMENT MINING APPROACH TO BIG DATA ANALYTICS COMPARISON STUDY

International Journal of Advance Engineering and Research Development A SENTIMENT MINING APPROACH TO BIG DATA ANALYTICS COMPARISON STUDY Scientific Journal of Impact Factor (SJIF): 5.71 International Journal of Advance Engineering and Research Development Volume 5, Issue 03, March -2018 A SENTIMENT MINING APPROACH TO BIG DATA ANALYTICS

More information

15. Text Data Visualization. Prof. Tulasi Prasad Sariki SCSE, VIT, Chennai

15. Text Data Visualization. Prof. Tulasi Prasad Sariki SCSE, VIT, Chennai 15. Text Data Visualization Prof. Tulasi Prasad Sariki SCSE, VIT, Chennai www.learnersdesk.weebly.com Why Visualize Text? Understanding get the gist of a document Grouping cluster for overview or classifcation

More information

Are Twitter Users Equal in Predicting Elections? A Study of User Groups in Predicting 2012 U.S. Republican Primaries

Are Twitter Users Equal in Predicting Elections? A Study of User Groups in Predicting 2012 U.S. Republican Primaries Wright State University CORE Scholar Kno.e.sis Publications The Ohio Center of Excellence in Knowledge- Enabled Computing (Kno.e.sis) 12-2012 Are Twitter Users Equal in Predicting Elections? A Study of

More information

Finding Opinion Strength Using Fuzzy Logic on Web Reviews

Finding Opinion Strength Using Fuzzy Logic on Web Reviews Animesh Kar 1, Deba Prasad Mandal 2 1 Guru Nanak Institute of Technology, Kolkata, India. karanimesh04@yahoo.co.in 2 Indian Statistical Institute, Kolkata, India. dpmandal@isical.ac.in doi:10.4156/ ijei.vol2.

More information