Mining Social Text Data. ZHAO Xin Renmin University of China
|
|
- Edward Wood
- 5 years ago
- Views:
Transcription
1 Mining Social Text Data ZHAO Xin Renmin University of China
2 Overview Introduction to social text data Basic techniques to manipulate text data Typical text mining applications for social science Hot trend and future direction Conclusion
3 Overview Introduction to social text data Basic techniques to manipulate text data Typical text mining applications for social science Hot trend and future direction Conclusion
4 Social Text Data Various kinds of social media Text data is one of the most important data types Tweets Blogs Comments Profiles Messages
5 Characteristics of social text data User-oriented Social text data is created by users themselves A social networking account corresponds to an offline user in real life Users can generate text contents as they like It provides a valuable data resource to understand social users
6 Characteristics of social text data Opinionated text Not only including facts
7 Characteristics of social text data Informal and noisy text New terms were created by users A traditional vocabulary cannot work
8 Characteristics of social text data Rich-link text On Twitter, we can simply connect ourselves with any celebrity account by using two typical ways: mention and retweet (RT)
9 Characteristics of social text data Rich-type text Multiple types of information are often embedded in the text E.g., Hashtag Check-ins (Location and time) Phone type
10 Overview Introduction to social text data Basic techniques to manipulate text data Typical text mining applications for social science Hot trend and future direction Conclusion
11 Fundamental natural language processing techniques Turn text into semantic units Provide semantic units with meaningful annotations
12 Fundamental natural language processing techniques Turn text into semantic units Provide semantic units with meaningful annotations
13 For English Tokenization Delimiter-based tokenization method Is it a simple task?
14 For Chinese Chinese Segmentation No space characters help 他明天去香港 -> 他 -- 明天 -- 去 -- 香港 (He is going to HK tomorrow.) Nearly a well-done task On standard text collections, the accuracy is very high For social text data, it needs more improving Challenge: informal writing styles and out-ofvocabulary terms E.g., new terms created by social users
15 Fundamental natural language processing techniques Turn text into semantic units Provide semantic units with meaningful annotations
16 Part-of-speech tagging Word-category tagging/labeling/classification
17 Part-of-speech tagging (POS tagging) Word-category tagging
18 Named Entity Recognition (NER) A very important task: find and classify entity mentions in text, for example: The decision by the independent MP Andrew Wilkie to withdraw his support for the minority Labor government sounded dramatic but it should not further threaten its stability. When, after the 2010 election, Wilkie, Rob Oakeshott, Tony Windsor and the Greens agreed to support Labor, they gave just two guarantees: confidence and supply.
19 Named Entity Recognition (NER) A very important task: find and classify entity mentions in text, for example: The decision by the independent MP Andrew Wilkie to withdraw his support for the minority Labor government sounded dramatic but it should not further threaten its stability. When, after the 2010 election, Wilkie, Rob Oakeshott, Tony Windsor and the Greens agreed to support Labor, they gave just two guarantees: confidence and supply.
20 Named Entity Recognition (NER) A very important task: find and classify entity mentions in text, for example: The decision by the independent MP Andrew Wilkie to withdraw his support for the minority Labor government sounded dramatic but it should not further threaten its stability. When, after the 2010 election, Wilkie, Rob Oakeshott, Tony Windsor and the Greens agreed to support Labor, they gave just two guarantees: confidence and supply. Person Date Location Organization
21 Entity Linking President Obama won the 2009 Nobel Peace Prize
22 Semantic Analysis Model Statistical topic models Word embedding models
23 Semantic Analysis Model Statistical topic models Word embedding models
24 Question: Statistical topic models Given a large text dataset, how do we know the major topics in it? We need a way to summarize and compress information
25 Output I: Statistical topic models Topics Can be understood as clusters of words
26 Output II: Statistical topic models Per-document topic distribution Can be understood as document representation
27 Put together Statistical topic models
28 Key ideas in topic models Capturing word co-occurrence If two words frequently co-occur in documents, these two words tend to be captured in top positions of the same topics The number of training documents are supposed to be large A few hundred documents will not work well
29 Semantic Analysis Model Statistical topic models Word embedding model
30 Word embedding Distributional semantics
31 Word embedding Distributional semantics
32 Word2Vec Input: a sequence of word tokens from a vocabulary Requiring a large amount of documents Output: a fixed-length embedding vector for each term in the vocabulary The embedding vector usually has several hundred dimensions with dense values
33 Word embedding Usage: finding similar words E.g., query word= Sweden
34 Interesting Observations China-Beijing = Japan - Tokyo
35 Overview Introduction to social text data Basic techniques to manipulate text data Typical text mining applications for social science Hot trend and future direction Conclusion
36 Typical text mining applications for Data cleaning social science Sentiment analysis User profiling User interest modeling
37 Typical text mining applications for social science Data cleaning Sentiment analysis User profiling User interest modeling
38 Data cleaning Cleaning social text data Very challenging to be perfect in practice E.g., word normalization SG => Singapore HK => Hong Kong But fortunately standard techniques usually work moderately well (with slight modifications) in many cases E.g., word2vec and topic models can capture the above abbreviation patterns to some extent
39 Spam detection Data cleaning Machine learning or rule-based approach The key is to select effective features based on user behaviors Multiple postings in a short period Redundant or nearly redundant postings Irrelevant postings
40 Typical text mining applications for Data cleaning social science Sentiment analysis User profiling User interest modeling
41 Key technical components Opinion extraction I bought an iphone a few days ago. It is such a nice phone. The touch screen is really cool. The voice quality is clear too. What do we see? Opinion holders: persons who make the comments Opinion targets: entities and their features/aspects Sentiments: opinions towards opinion targets
42 Key technical components Polarity identification Simplest form: Is the attitude of this text positive or negative? A classification-based approach Feature selection Opinion lexicon More complex forms: Five-star rating prediction A regression-based approach
43 For ecommerce: Feature-based opinion summary For each extracted aspect, we can derive the corresponding opinion score. 43
44 CALM Dow Jones Using Twitter sentiment for stock price prediction Bollen et al. (2011) The CALM mood predicts DJIA 3 days later 44
45 Twitter sentiment versus Gallup Poll of Consumer Confidence Brendan O'Connor, Ramnath Balasubramanyan, Bryan R. Routledge, and Noah A. Smith From Tweets to Polls: Linking Text Sentiment to Public Opinion Time Series. In ICWSM-2010
46 Typical text mining applications for social science Data cleaning and preprocessing Sentiment analysis User profiling User interest modeling
47 User profiling Direct extraction of demographics from social profiles Users fill in the profiles Sex Age Job Interests.
48 User profiling Implicit inference from social text data E.g., Amazon online reviews I bought my wife a new phone => SEX = MALE I worked in an IT company... => JOB = IT
49 Typical text mining applications for social science Data cleaning and preprocessing Sentiment analysis User profiling User interest modeling
50 User interest modeling Topical expert ranking Given a topic, we need to identify influential experts for that topic Two major components Interest identification and Influence ranking (Individual) Interest identification Find out what topics is a celebrity interested in (Overall) Influence ranking Decide how influential a celebrity is in a given topic
51 User interest modeling Topical expert ranking Topical word cloud Topical celebrity ranking
52 User interest modeling Collective interest or attention modeling Event detection
53 Overview Introduction to social text data Basic techniques to manipulate text data Typical text mining applications for social science Hot trend and future direction Conclusion
54 Deep learning for text mining Deep neural models are promising But relying on tedious parameter tuning and large training datasets Shallow neural models are more practical and easier to use by non-experts E.g., word2vec
55 Future directions for text mining Designing more effective information extraction models Joint utilizing multi-source information Developing large-scale and efficient algorithms
56 Toolkits NLP Toolkit list Topic models gibbslda.sourceforge.net Word2vec
57 Acknowledgement Thanks! Thanks for Prof. Zhu s invitation and organization Thanks for your attention Some slides are modified based on the Stanford NLP course slides.
58 References Wei Shen, Jianyong Wang, and Jiawei Han. Entity Linking with a Knowledge Base: Issues, Techniques, and Solutions. Transactions on Knowledge & Data Engineering 27(2): (2015) Mikolov, Tomas; Sutskever, Ilya; Chen, Kai; Corrado, Greg S.; Dean, Jeff (2013). Distributed representations of words and phrases and their compositionality. NIPS D. Blei. Probabilistic topic models. Communications of the ACM, 55(4):77 84, Ee-Peng Lim, Viet-An Nguyen, Nitin Jindal, Bing Liu, Hady Wirawan Lauw. Detecting product review spammers using rating behaviors. CIKM 2010: Yoshua Bengio, Aaron Courville, Pascal Vincent. Representation Learning: A Review and New Perspectives. Jinpeng Wang, Xin Zhao and Xiaoming Li. Mining New Business Opportunities: Identifying Trend related Products by Leveraging Commercial Intents from Microblogs. In EMNLP 13. Xin Zhao, YanweiGuo, Yulan He, Han Jiang, Yuexin Wu and Xiaoming Li. A Demographic-based System for Product Recommendation On Microblogs. In KDD Jinpeng Wang, Gao Cong, Xin Zhao, Xiaoming Li. Mining User Intents in Twitter: A Semi-Supervised Approach to Inferring Intent Categories for Tweets. In AAAI Xin Zhao, Jinpeng Wang, Yulan He, Ji-Rong Wen, Edward Y. Chang, Xiaoming Li. Mining Product Adopter Information from Online Reviews for Improving Product Recommendation. TKDD 10(3): 29 (2016) Xin Zhao, Yanwei Guo, Rui Yan, Yulan He, Xiaoming Li. Timeline generation with social attention. SIGIR 2013:
DETECTING COMMUNITIES BY SENTIMENT ANALYSIS
DETECTING COMMUNITIES BY SENTIMENT ANALYSIS OF CONTROVERSIAL TOPICS SBP-BRiMS 2016 Kangwon Seo 1, Rong Pan 1, & Aleksey Panasyuk 2 1 Arizona State University 2 Air Force Research Lab July 1, 2016 OUTLINE
More informationFrom Tweets to Polls: Linking Text Sentiment to Public Opinion Time Series
From Tweets to Polls: Linking Text Sentiment to Public Brendan O Connor Opinion Time Series Machine Learning Department Carnegie Mellon University http://brenocon.com Presentation at AAPOR, May 2012 Joint
More informationData Preprocessing, Sentiment Analysis & NER On Twitter Data.
IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727 PP 73-79 www.iosrjournals.org Data Preprocessing, Sentiment Analysis & NER On Twitter Data. Mr.SanketPatil, Prof.VarshaWangikar,
More informationText, Web, and Social Media Analytics
Kecerdasan Bisnis dalam Praktek Predictive Analytics II Text, Web, and Social Media Analytics Husni Lab. Riset JTIF UTM Sumber awal: http://mail.tku.edu.tw/myday/teaching/1071/bi/1071bi07_business_intelligence.pptx
More informationIMPLEMENTING ARTIFICIAL NEURAL NETWORK IN ENHANCED MICRO BLOG FOR SOCIAL NETWORK, E-COMMERCE AND NEWS CHANNELS
IMPLEMENTING ARTIFICIAL NEURAL NETWORK IN ENHANCED MICRO BLOG FOR SOCIAL NETWORK, E-COMMERCE AND NEWS CHANNELS 1 Rathidevi. M, 2 Dr.S.V.Sudha 1 PG Scholar, Department of Computer Science and Engineering,
More information2016 U.S. PRESIDENTIAL ELECTION FAKE NEWS
2016 U.S. PRESIDENTIAL ELECTION FAKE NEWS OVERVIEW Introduction Main Paper Related Work and Limitation Proposed Solution Preliminary Result Conclusion and Future Work TWITTER: A SOCIAL NETWORK AND A NEWS
More informationTweet Segmentation Using Correlation & Association
2017 IJSRST Volume 3 Issue 3 Print ISSN: 2395-6011 Online ISSN: 2395-602X Themed Section: Science and Technology Tweet Segmentation Using Correlation & Association Mr. Umesh A. Patil 1, Miss. Madhuri M.
More informationSentiment Analysis: A discovery challenge. Bing Liu University of Illinois at Chicago
Sentiment Analysis: A discovery challenge Bing Liu University of Illinois at Chicago liub@cs.uic.edu Introduction Opinion mining or sentiment analysis q computational study of opinions, sentiments, appraisal,
More informationADVANCES in NATURAL and APPLIED SCIENCES
ADVANCES in NATURAL and APPLIED SCIENCES ISSN: 1995-0772 Published BYAENSI Publication EISSN: 1998-1090 http://www.aensiweb.com/anas 2017 June 11(8): pages 537-541 Open Access Journal Tweet Segmentation
More informationTEXT MINING - SENTIMENT ANALYSIS /OPINION MINING
TEXT MINING - SENTIMENT ANALYSIS /OPINION MINING SRI SURYA NULU EMIS-DATA MINING 8331 SOUTHERN METHODIST UNIVERSITY 1 SNULU@SMU.EDU SRISURYANULU@GMAIL.COM AGENDA Introduction Definition of opinion mining
More informationCONNECTING SOCIAL MEDIA TO ECOMMERCE USING MICROBLOGGING AND ARTIFICIAL NEURAL NETWORK
CONNECTING SOCIAL MEDIA TO ECOMMERCE USING MICROBLOGGING AND ARTIFICIAL NEURAL NETWORK Ms.S.P.VidhyaPriya 1,B.Gokhila 2, T.Santhiya 3, K.Saranya 4 1 M.E.,Assistant Professor-CSE, Kathir College Of Engineering,
More informationInternational Journal of Scientific & Engineering Research, Volume 6, Issue 3, March ISSN Web and Text Mining Sentiment Analysis
International Journal of Scientific & Engineering Research, Volume 6, Issue 3, March-2015 672 Web and Text Mining Sentiment Analysis Ms. Anjana Agrawal Abstract This paper describes the key steps followed
More informationPredicting the Odds of Getting Retweeted
Predicting the Odds of Getting Retweeted Arun Mahendra Stanford University arunmahe@stanford.edu 1. Introduction Millions of people tweet every day about almost any topic imaginable, but only a small percent
More informationSmartphone Product Review Sentiment Analysis using Logistic Regression
I J C T A, 9(26) 2016, pp. 343-349 International Science Press Smartphone Product Review Sentiment Analysis using Logistic Regression M. Lovelin Ponn Felciah 1 and R. Anbuselvi 2 ABSTRACT Sentiment analysis
More informationSentiment and Intent Analysis for Business Intelligence
Sentiment and Intent Analysis for Business Intelligence Lalit Pathak 1, Sanjana Mulchandani 2, Mayuri Kate 3, Kajal Khatwani 4, Sujata Khedkar 5 U.G. Student, Department of Computer Engineering, VESIT
More informationUsing Twitter as a source of information for stock market prediction
Using Twitter as a source of information for stock market prediction Argimiro Arratia argimiro@lsi.upc.edu LSI, Univ. Politécnica de Catalunya, Barcelona (Joint work with Marta Arias and Ramón Xuriguera)
More informationProduct improvement based on online reviews from product designer s perspective
IOP Conference Series: Materials Science and Engineering PAPER OPEN ACCESS Product improvement based on online reviews from product designer s perspective To cite this article: Shugang Li and Jiali Kong
More informationTWITTER SENTIMENT ANALYSIS AND TEXT MINING IN R
TWITTER SENTIMENT ANALYSIS AND TEXT MINING IN R Rishabh Nigam Department of IT Maharaja Agrasen Institute of Technology, Delhi, India Email- nigamrishabh10@gmail.com Nidhi Sengar Department of IT Maharaja
More informationPRODUCT REVIEW SENTIMENT ANALYSIS WITH TRENDY SYNTHETIC DICTIONARY AND AMBIGUITY MITIGATION
PRODUCT REVIEW SENTIMENT ANALYSIS WITH TRENDY SYNTHETIC DICTIONARY AND AMBIGUITY MITIGATION Prof. A. N. Nawathe anunawathe@gmail.com Godge Isha Sudhir ishagodge37@gmail.com Dang Poornima Mahesh dang.poornima@gmail.com
More informationTrend Extraction Method using Co-occurrence Patterns from Tweets
Information Engineering Express International Institute of Applied Informatics 2016, Vol.2, No.4, 1 10 Trend Extraction Method using Co-occurrence Patterns from Tweets Shotaro Noda and Katsuhide Fujita
More informationApplication of Location-Based Sentiment Analysis Using Twitter for Identifying Trends Towards Indian General Elections 2014
Application of Location-Based Sentiment Analysis Using Twitter for Identifying Trends Towards Indian General Elections 2014 Omaima Almatrafi Suhem Parack Bravim Chavan George Mason University George Mason
More informationA SURVEY ON PRODUCT REVIEW SENTIMENT ANALYSIS
A SURVEY ON PRODUCT REVIEW SENTIMENT ANALYSIS Godge Isha Sudhir ishagodge37@gmail.com Arvikar Shruti Sanjay shrutiarvikar89@gmail.com Dang Poornima Mahesh dang.poornima@gmail.com Maske Rushikesh Kantrao
More informationCS47300: Web Information Search and Management
CS47300: Web Information Search and Management Sentiment Analysis 27 October 2017 Prof. Chris Clifton Most slides courtesy Prof. Bing Liu Introduction Two main types of textual information. Facts and Opinions
More informationProduct Aspect Ranking Based on the Customers Reviews.
Product Aspect Ranking Based on the Customers Reviews. Kadam Snehal, Dhomse Kalyani, Bhumkar Smita, Janrao Swati, Ambhore Shakuntala BE Scholar, Department of Computer Engineering S.R.E.S. s College of
More informationEstimating the Impact of User Personality Traits on electronic Word-of-Mouth Text-mining Social Media Platforms
Estimating the Impact of User Personality Traits on electronic Word-of-Mouth Text-mining Social Media Platforms Panos Adamopoulos Goizueta Business School Emory University padamop@emory.edu Anindya Ghose
More informationReaction Paper Regarding the Flow of Influence and Social Meaning Across Social Media Networks
Reaction Paper Regarding the Flow of Influence and Social Meaning Across Social Media Networks Mahalia Miller Daniel Wiesenthal October 6, 2010 1 Introduction One topic of current interest is how language
More informationCyber-Social-Physical Features for Mood Prediction over Online Social Networks
DEIM Forum 2017 B1-3 Cyber-Social-Physical Features for Mood Prediction over Online Social Networks Chaima DHAHRI Kazunori MATSUMOTO and Keiichiro HOASHI KDDI Research, Inc 2-1-15 Ohara, Fujimino-shi,
More informationComparing Twitter and Traditional Media using Topic Models
Singapore Management University Institutional Knowledge at Singapore Management University Research Collection School Of Information Systems School of Information Systems 4-2011 Comparing Twitter and Traditional
More informationCOMMTRUST: A MULTI-DIMENSIONAL TRUST MODEL FOR E-COMMERCE APPLICATIONS
COMMTRUST: A MULTI-DIMENSIONAL TRUST MODEL FOR E-COMMERCE APPLICATIONS M. Divya 1, Y. Sagar 2 1 M.Tech (Software Engineering) Student, VNR Vignana Jyothi Institute of Engineering & Technology, Hyderabad,
More informationSentiment Analysis and Political Party Classification in 2016 U.S. President Debates in Twitter
Sentiment Analysis and Political Party Classification in 2016 U.S. President Debates in Twitter Tianyu Ding 1 and Junyi Deng 1 and Jingting Li 1 and Yu-Ru Lin 1 1 University of Pittsburgh, Pittsburgh PA
More informationLumière. A Smart Review Analysis Engine. Ruchi Asthana Nathaniel Brennan Zhe Wang
Lumière A Smart Review Analysis Engine Ruchi Asthana Nathaniel Brennan Zhe Wang Purpose A rapid increase in Internet users along with the growing power of online reviews has given birth to fields like
More informationCollective Supervision of Topic Models for Predicting Surveys with Social Media
Joint work with Adrian Benton, Braden Hancock, and Mark Dredze Collective Supervision of Topic Models for Predicting Surveys with Social Media Michael J. Paul University of Colorado, Boulder February 14,
More informationPublic Opinion Mining For Effective Policy Making
Public Opinion Mining For Effective Policy Making Mr. AmitVasant Pandhare 1, Dr. Abrar Alvi 2, 1 Student,Computer Science and Engg. Prof.RamMeghe Institute Of Tech. and Research, Badnera, Amravati, Maharashtra,
More informationHIERARCHICAL LOCATION CLASSIFICATION OF TWITTER USERS WITH A CONTENT BASED PROBABILITY MODEL. Mounika Nukala
HIERARCHICAL LOCATION CLASSIFICATION OF TWITTER USERS WITH A CONTENT BASED PROBABILITY MODEL by Mounika Nukala Submitted in partial fulfilment of the requirements for the degree of Master of Computer Science
More informationAutomatic Detection of Rumor on Social Network
Automatic Detection of Rumor on Social Network Qiao Zhang 1,2, Shuiyuan Zhang 1,2, Jian Dong 3, Jinhua Xiong 2(B), and Xueqi Cheng 2 1 University of Chinese Academy of Sciences, Beijing, China 2 Institute
More informationPredicting Popularity of Messages in Twitter using a Feature-weighted Model
International Journal of Advanced Intelligence Volume 0, Number 0, pp.xxx-yyy, November, 20XX. c AIA International Advanced Information Institute Predicting Popularity of Messages in Twitter using a Feature-weighted
More informationA SURVEY PAPER ON TWITTER DATA ANALYSIS FOR TERRORIST ATTACKS AND VARIOUS ASPECTS
A SURVEY PAPER ON TWITTER DATA ANALYSIS FOR TERRORIST ATTACKS AND VARIOUS ASPECTS Prof. Anjalidevi Pujari 1, Neha Amolik 2, Varsha Ibitwar 3, Apurva Keskar 4 1 Professor, Department of computer engineering,
More informationWeighted Fuzzy Rule Based Sentiment Prediction Analysis on Tweets
, pp.240-244 http://dx.doi.org/10.14257/astl.2017.143.48 Weighted Fuzzy Rule Based Sentiment Prediction Analysis on Tweets Syed Muzamil Basha, Dharmendra Singh Rajput, Iyengar N.Ch.S.N, Ronnie D. Caytiles
More informationAnalyzing Self-Driving Cars on Twitter
Analyzing Self-Driving Cars on Twitter Rizwan Sadiq 1 and Mohsin Khan 2 1 Lecturer at COMSATS Institute of Information Technology, Abbottabad, Pakistan rizwansadiq@ciit.net.pk 2 International Islamic University,
More informationTweeting Questions in Academic Conferences: Seeking or Promoting Information?
Tweeting Questions in Academic Conferences: Seeking or Promoting Information? Xidao Wen, University of Pittsburgh Yu-Ru Lin, University of Pittsburgh Abstract The fast growth of social media has reshaped
More informationKNOWLEDGE DISCOVERY AND TWITTER SENTIMENT ANALYSIS: MINING PUBLIC OPINION AND STUDYING ITS CORRELATION WITH POPULARITY OF INDIAN MOVIES
INTERNATIONAL JOURNAL OF MANAGEMENT (IJM) Volume 6, Issue 1, January (2015), pp. 686-696 IAEME ISSN 0976-6502 (Print) ISSN 0976-6510 (Online) Volume 6, Issue 1, January (2015), pp. 697-705 IAEME: http://www.iaeme.com/ijm.asp
More informationRANKING RADICALLY IMPLICIT WEB FORUM USERS BY ASPECT SCORE
RANKING RADICALLY IMPLICIT WEB FORUM USERS BY ASPECT SCORE Prithivika.M Kothai.K Padmapriya.M Mrs.C.Jackulin,Assistant Professor, Department of CSE, Abstract In the computational linguistics the extraction
More informationPublic Opinion Mining on Social Media: A Case Study of Twitter Opinion on Nuclear Power 1
, pp.224-228 http://dx.doi.org/10.14257/astl.2014.51.51 Public Opinion Mining on Social Media: A Case Study of Twitter Opinion on Nuclear Power 1 DongSung Kim 2 and Jong Woo Kim 2,3 2 222 Wangsimni-ro,
More informationDiscovering business insights from the voice of the customer. Medallia s Approach to Text Analytics. by Ji Fang, Ph.D.
Discovering business insights from the voice of the customer. Medallia s Approach to Text Analytics by Ji Fang, Ph.D. TABLE OF CONTENTS A good text analytics solution... The science behind text analytics...
More informationOn utility of temporal embeddings for skill matching. Manisha Verma, PhD student, UCL Nathan Francis, NJFSearch
On utility of temporal embeddings for skill matching Manisha Verma, PhD student, UCL Nathan Francis, NJFSearch Skill Trend Importance 1. Constant evolution of labor market yields differences in importance
More informationPublished by: PIONEER RESEARCH & DEVELOPMENT GROUP (www.prdg.org) 1
Real Time Social Data Analysis for Forecasting Nature Disasters P.Balaji, G.Ilancheziapandian 2, A.Appandairaj 3 PG Student, Department of Computer Science and Engineering, Ganadipathy Tulsi s Jain Engineering
More informationComparing Twitter and Traditional Media Using Topic Models
Comparing Twitter and Traditional Media Using Topic Models Wayne Xin Zhao 1, Jing Jiang 2, Jianshu Weng 2,JingHe 1,Ee-PengLim 2, Hongfei Yan 1, and Xiaoming Li 1 1 Peking University, China 2 Singapore
More informationOnline Journal of Communication and Media Technologies Volume: 7 Issue: 1 January Social Media Aided Sentiment Analysis in Forecasting
Social Media Aided Sentiment Analysis in Forecasting K.Nirmala Devi, Kongu Engineering College, India Abstract User generated contents on web and social media grow rapidly in this emerging information
More informationInclusion London Guide to Twitter for Campaigners
Inclusion London Guide to Twitter for Campaigners 1 Contents Section 1: Twitter Basics... 3 What is Twitter?... 3 How it works... 3 Basics of a tweet... 4 Basics of the Twitter layout... 5 How to tweet...
More informationPredicting User Opinion in E-commerce Website Using Aggregate Ranking Algorithm
Predicting User Opinion in E-commerce Website Using Aggregate Ranking Algorithm Pusarla.Divya Bharathi 1, A.Gauthami Latha 2 P.G. Student, Department of Computer Engineering, Miracle Educational Society
More informationSpeech Analytics Transcription Accuracy
Speech Analytics Transcription Accuracy Understanding Verint s speech analytics transcription and categorization accuracy Verint.com Twitter.com/verint Facebook.com/verint Blog.verint.com Table of Contents
More informationNormalization of Text in Social Media: Analyzing the Need for Pre-processing Techniques and its Roles Nimala.K Dr.Thenmozhi. S Dr.ThamizhArasan.
Volume 119 No. 12 2018, 2033-2037 ISSN: 1314-3395 (on-line version) url: http://www.ijpam.eu ijpam.eu Normalization of Text in Social Media: Analyzing the Need for Pre-processing Techniques and its Roles
More informationCloud Assisted Trend Analysis of Twitter Data using Hadoop
ISSN: 2454-132X Impact factor: 4.295 (Volume 4, Issue 2) Available online at: www.ijariit.com Cloud Assisted Trend Analysis of Twitter Data using Hadoop Aman Gupta vasugupta42@gmail.com Ishaan Bhasin ishaanbhasin96@gmail.com
More informationProject Presentations - 2
Project Presentations - 2 CMSC 498J: Social Media Computing Department of Computer Science University of Maryland Spring 2016 Hadi Amiri hadi@umd.edu Project Titles G8. Trends of Trendy Words David, Kevin,
More informationStream Clustering of Tweets
Stream Clustering of Tweets Sophie Baillargeon Département de mathématiques et de statistique Université Laval Québec (Québec), Canada G1V 0A6 Email: sophie.baillargeon@mat.ulaval.ca Simon Hallé Thales
More informationRobust Social Event Detection in Twitter. Final Presentation S. E. Bekçe & F. Amira CS533 Spring 2016
1 Robust Social Event Detection in Twitter Final Presentation S. E. Bekçe & F. Amira CS533 Spring 2016 2 Motivation & Goals Protests in Turkey are common but they don t often broadcasted by news agencies
More informationMining Trending Hash Tags for Arabic Sentiment Analysis
Mining Trending Hash Tags for Arabic Sentiment Analysis Yahya AlMurtadha Department of Computer Science Faculty of Computing and Information Technology, University of Tabuk Tabuk, Kingdom of Saudi Arabia
More informationMining User Intents in Twitter: A Semi-Supervised Approach to Inferring Intent Categories for Tweets
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence Mining User Intents in Twitter: A Semi-Supervised Approach to Inferring Intent Categories for Tweets Jinpeng Wang 1, Gao Cong
More informationThe Analysis of Hotel Customer Generated Contents in Weblogs
2011 International Conference on Financial Management and Economics IPEDR vol.11 (2011) (2011) IACSIT Press, Singapore The Analysis of Hotel Customer Generated Contents in Weblogs Rueijiau Sung, Chaochang
More informationCommunicating with the Modern Citizen
Communicating with the Modern Citizen Rise of the social citizen Changes how we work 44% of citizens complain via social media 20% expect a response within one hour via social media Always on -- Citizens
More informationQuantising Opinions for Political Tweets Analysis
Quantising Opinions for Political Tweets Analysis Yulan He 1, Hassan Saif 1, Zhongyu Wei 2, Kam-Fai Wong 2 1 Knowledge Media Institute, The Open University, UK {y.he, h.saif}@open.ac.uk 2 Dept. of Systems
More informationReal-Time News Certification System on Sina Weibo
Real-Time News Certification System on Sina Weibo Xing Zhou 1,2, Juan Cao 1, Zhiwei Jin 1,2,Fei Xie 3,Yu Su 3,Junqiang Zhang 1,2,Dafeng Chu 3,,Xuehui Cao 3 1 Key Laboratory of Intelligent Information Processing
More informationSentiment analysis using Singular Value Decomposition
International Journal of Current Engineering and Technology E-ISSN 2277 4106, P-ISSN 2347 5161 2016 INPRESSCO, All Rights Reserved Available at http://inpressco.com/category/ijcet Research Article Veena
More informationEmpower Social Influence by Linking the Social Opinion to Social Network
Association for Information Systems AIS Electronic Library (AISeL) PACIS 2017 Proceedings Pacific Asia Conference on Information Systems (PACIS) Summer 7-19-2017 Empower Social Influence by Linking the
More informationSuraj Shinde. Inteligencia Artificial aplicada a la Industria. May 31th - Medellín, Colombia
Suraj Shinde Inteligencia Artificial aplicada a la Industria May 31th - Medellín, Colombia Artificial Intelligence Overview Field of AI: Machine Learning is about training algorithms to build models to
More information10 Online Communication Channels
10 Online Communication Channels Ppt 11-12 Online communication channels and tools toolkit Spread the word Love it or loathe it, social media is here to stay and should form an essential component of any
More informationReputation trust Score Evaluation for E-Commerce Using Feedback Rating and Comments
Reputation trust Score Evaluation for E-Commerce Using Feedback Rating and Comments Vinaya Ramesh Firake BE(IT),ME(CSE) Abstract: Reputation trust is one of the important factors of the development of
More informationClassification Model for Intent Mining in Personal Website Based on Support Vector Machine
, pp.145-152 http://dx.doi.org/10.14257/ijdta.2016.9.2.16 Classification Model for Intent Mining in Personal Website Based on Support Vector Machine Shuang Zhang, Nianbin Wang School of Computer Science
More informationHow hot will it get? Modeling scientific discourse about literature
How hot will it get? Modeling scientific discourse about literature Project Aims Natalie Telis, CS229 ntelis@stanford.edu Many metrics exist to provide heuristics for quality of scientific literature,
More informationPredicting Restaurants Rating And Popularity Based On Yelp Dataset
CS 229 MACHINE LEARNING FINAL PROJECT 1 Predicting Restaurants Rating And Popularity Based On Yelp Dataset Yiwen Guo, ICME, Anran Lu, ICME, and Zeyu Wang, Department of Economics, Stanford University Abstract
More informationWho is Retweeting the Tweeters? Modeling Originating and Promoting Behaviors in Twitter Network
Who is Retweeting the Tweeters? Modeling Originating and Promoting Behaviors in Twitter Network Aek Palakorn Achananuparp, Ee-Peng Lim, Jing Jiang, and Tuan-Anh Hoang Living Analytics Research Centre,
More informationMining Reviews for Product Comparison and Recommendation
+ Mining Reviews for Product Comparison and Recommendation Jianshu Sun, Chong Long, Xiaoyan Zhu, and Minlie Huang Abstract Recently, as the amount of customer reviews grows rapidly on product service websites,
More informationHashtag-centric Immersive Search on Social Media
Hashtag-centric Immersive Search on Social Media Yuqi Gao, Jitao Sang, Tongwei Ren, Changsheng Xu State Key Laboratory for Novel Software Technology, Nanjing University National Lab of Pattern Recognition,
More information#bias: Measuring the Tweeting Behavior of Propagandists
Proceedings of the Sixth International AAAI Conference on Weblogs and Social Media #bias: Measuring the Tweeting Behavior of Propagandists Cristian Lumezanu NEC Laboratories America, Georgia Tech lume@nec-labs.com
More informationTwitter Sentiment Analysis on Demonetization tweets in India Using R language
Impact Factor Value: 4.029 ISSN: 2349-7084 International Journal of Computer Engineering In Research Trends Volume 4, Issue 6, June-2017, pp. 252-258 www.ijcert.org Twitter Sentiment Analysis on Demonetization
More informationIndian Election Trend Prediction Using Improved Competitive Vector Regression Model
Indian Election Trend Prediction Using Improved Competitive Vector Regression Model Navya S 1 1 Department of Computer Science and Engineering, University, India Abstract Election result forecasting has
More informationIdentifying Dynamic Topics of Interest across Social Networks
Identifying Dynamic Topics of Interest across Social Networks Mohamed Salaheldin Aly, Abeer Al Korany Department of Computer Science Faculty of Computers and Information Cairo, Egypt Abstract Information
More informationLarge Scale High-Precision Topic Modeling on Twitter. Shuang Yang, Alek Kolcz Andy Schlaikjer, Pankaj Gupta
Large Scale High-Precision Topic Modeling on Twitter Shuang Yang, Alek Kolcz Andy Schlaikjer, Pankaj Gupta Topic modeling of Tweets Many Use Cases Business intelligence & analytics sports technology government
More informationClassification of Microblogging Messages into General Topics Based on the Content of the Messages with Clustering
Classification of Microblogging Messages into General Topics Based on the Content of the Messages with Clustering Santrupti S. Desai, Manohar Madgi Abstract Since microblogging messages such as tweets
More informationAssistant Professor Neha Pandya Department of Information Technology, Parul Institute Of Engineering & Technology Gujarat Technological University
Feature Level Text Categorization For Opinion Mining Gandhi Vaibhav C. Computer Engineering Parul Institute Of Engineering & Technology Gujarat Technological University Assistant Professor Neha Pandya
More informationSAS Visual Analytics: Text Analytics Using Word Clouds
ABSTRACT Paper 1687-2018 SAS Visual Analytics: Text Analytics Using Word Clouds Jenine Milum, Atlanta, GA, USA There exists a limit to employee skills and capacity to efficiently analyze volumes of textual
More informationTEXT MINING APPROACH TO EXTRACT KNOWLEDGE FROM SOCIAL MEDIA DATA TO ENHANCE BUSINESS INTELLIGENCE
International Journal of Advance Research In Science And Engineering http://www.ijarse.com TEXT MINING APPROACH TO EXTRACT KNOWLEDGE FROM SOCIAL MEDIA DATA TO ENHANCE BUSINESS INTELLIGENCE R. Jayanthi
More informationVarious Techniques for Efficient Retrieval of Contents across Social Networks Based On Events
Various Techniques for Efficient Retrieval of Contents across Social Networks Based On Events SAarif Ahamed 1 First Year ME (CSE) Department of CSE MIET EC ahamedaarif@yahoocom BAVishnupriya 1 First Year
More informationFinal Report: Local Structure and Evolution for Cascade Prediction
Final Report: Local Structure and Evolution for Cascade Prediction Jake Lussier (lussier1@stanford.edu), Jacob Bank (jbank@stanford.edu) ABSTRACT Information cascades in large social networks are complex
More informationSENTIMENT-BASED MODEL FOR REPUTATION SYSTEMS IN AMAZON
SENTIMENT-BASED MODEL FOR REPUTATION SYSTEMS IN AMAZON Milad Sharif msharif@stanford.edu Soheil Norouzi snorouzi@stanford.edu 1. INTRODUCTION When buyers purchase products from an online retailer such
More informationA Two-Phase Model for Retweet Number Prediction
A Two-Phase Model for Retweet Number Prediction Gang Liu 1, Chuan Shi 1,*, Qing Chen 2, Bin Wu 1, and Jiayin Qi 1 1 Beijing University of Posts and Telecommunications, Beijing, China {liugang519,shichuan,wubin}@bupt.edu.cn,
More informationAnalyzing Emotion on Twitter for User Modeling
Analyzing Emotion on Twitter for User Modeling Master s Thesis, October 30 th 2013 Zhaolong Li Analyzing Emotion on Twitter for User Modeling THESIS Submitted in the partial
More informationData Mining in Social Network. Presenter: Keren Ye
Data Mining in Social Network Presenter: Keren Ye References Kwak, Haewoon, et al. "What is Twitter, a social network or a news media?." Proceedings of the 19th international conference on World wide web.
More informationLE NUOVE FRONTIERE DALL AI ALL AR E L IMPATTO SULLA QUOTIDIANITÀ
LE NUOVE FRONTIERE DALL AI ALL AR E L IMPATTO SULLA QUOTIDIANITÀ Deloitte Analytics & Information Management Torino, 26/03/2018 1 AUGMENTED REALITY FUNDAMENTALS EXAMPLES OF DEEP LEARNING ARTIFICIAL INTELLIGENCE
More informationKnowledge Discovery and Data Mining
Knowledge Discovery and Data Mining Unit # 19 1 Acknowledgement The following discussion is based on the paper Mining Big Data: Current Status, and Forecast to the Future by Fan and Bifet and online presentation
More informationFeature Extraction from Micro-blogs for Comparison of Products and Services
Feature Extraction from Micro-blogs for Comparison of Products and Services Peng Zhao, Xue Li, and Ke Wang Nanjing University, 210093 China, The University of Queensland, QLD 4072 Australia, Simon Fraser
More informationData enrichment and model creation using text mining and other unstructured data
Image: used under license from shutterstock.com Data enrichment and model creation using text mining and other unstructured data Dr. Andreas Nawroth Head of Artificial Intelligence German Data Science
More informationPAST research has shown that real-time Twitter data can
Algorithmic Trading of Cryptocurrency Based on Twitter Sentiment Analysis Stuart Colianni, Stephanie Rosales, and Michael Signorotti ABSTRACT PAST research has shown that real-time Twitter data can be
More informationInternational Journal of Advance Research in Engineering, Science & Technology DUAL SENTIMENTAL ANALYSIS OF USER COMMENTS IN ONLINE REVIEW SYSTEM
Impact Factor (SJIF): 5.301 International Journal of Advance Research in Engineering, Science & Technology e-issn: 2393-9877, p-issn: 2394-2444 Volume 5, Issue 3, March-2018 DUAL SENTIMENTAL ANALYSIS OF
More informationCommunity Level Topic Diffusion
Community Level Topic Diffusion Zhiting Hu 1,3, Junjie Yao 2, Bin Cui 1, Eric Xing 1,3 1 Peking Univ., China 2 East China Normal Univ., China 3 Carnegie Mellon Univ. OUTLINE Background Model: COLD Diffusion
More informationInternational Journal of Advance Engineering and Research Development A SENTIMENT MINING APPROACH TO BIG DATA ANALYTICS COMPARISON STUDY
Scientific Journal of Impact Factor (SJIF): 5.71 International Journal of Advance Engineering and Research Development Volume 5, Issue 03, March -2018 A SENTIMENT MINING APPROACH TO BIG DATA ANALYTICS
More information15. Text Data Visualization. Prof. Tulasi Prasad Sariki SCSE, VIT, Chennai
15. Text Data Visualization Prof. Tulasi Prasad Sariki SCSE, VIT, Chennai www.learnersdesk.weebly.com Why Visualize Text? Understanding get the gist of a document Grouping cluster for overview or classifcation
More informationAre Twitter Users Equal in Predicting Elections? A Study of User Groups in Predicting 2012 U.S. Republican Primaries
Wright State University CORE Scholar Kno.e.sis Publications The Ohio Center of Excellence in Knowledge- Enabled Computing (Kno.e.sis) 12-2012 Are Twitter Users Equal in Predicting Elections? A Study of
More informationFinding Opinion Strength Using Fuzzy Logic on Web Reviews
Animesh Kar 1, Deba Prasad Mandal 2 1 Guru Nanak Institute of Technology, Kolkata, India. karanimesh04@yahoo.co.in 2 Indian Statistical Institute, Kolkata, India. dpmandal@isical.ac.in doi:10.4156/ ijei.vol2.
More information