A Quantified Approach for Analyzing the User Rating Behaviour in Social Media
|
|
- Jessie Walsh
- 5 years ago
- Views:
Transcription
1 A Quantified Approach for Analyzing the User Rating Behaviour in Social Media P. Surya 1, Dr. B. Umadevi 2 1 Research Scholar, 2 Assistant Professor & Head, P.G & Research Department of Computer Science, Raja Doraisingam Govt. Arts College, Sivagangai, TamilNadu, India Abstract: The technological revolution made a significant change in the society. Today the human society uses the mobile phones not only for the communication but also for sharing their views and other joyful moments. In this venture, the social Medias like face book and WhatsApp plays vital role. Around 2.5 billion people are pressing the like buttons around the world. Through internet the people are spending minimum three hours per day in chatting, poking, and tweeting on the social media. The increased volume in data set is a big problem for the social media and more over the data set of this category are unstructured. It has become as big data in the social. Handling big data is an issue by its characteristics such as volume, velocity, variety, veracity and value. This paper analyses the user rating on three attributes such as likes, photos, status. It makes an analytical view of the users interest towards various things in the social media. Keywords: Facebook, HDFS, SVM, MapReduce. 54 I. INTRODUCTION The Face book social network gained very much popularity among then people around the world. Everyday millions of users share their information in the form of text, images or videos. Facebook engineers or analysts manipulate this large data set using Hadoop. Data set is of one peta byte disk space while 25 CPU cores. Hadoop is used as an open source for distributed framework for distributed storage. It is found by Apache foundation & usually processes large data sets. Hadoop includes distributed file system. The Large files extend in small sets and referred as cluster. The Packets are transferred to the cluster nodes in parallel form. A. Hadoop for Face Book: Hadoop is an open-source software framework used for distributed storage and processing of dataset of big data using the Map Reduce programming model. It consists of computer clusters built from commodity hardware. All the modules in Hadoop are designed with a fundamental assumption that hardware failures are common occurrences and should be automatically handled by the framework [1]. The core of Apache Hadoop consists of a storage part, known as Hadoop Distributed File System (HDFS), and a processing part which is a Map Reduce programming model. Hadoop splits files into large blocks and distributes them across nodes in a cluster. It then transfers packaged code into nodes to process the data in parallel [2]. This approach takes advantage of data locality where nodes manipulate the data they have access to. This allows the dataset to be processed faster and more efficiently than it would be in a more conventional supercomputer architecture that relies on a parallel file system where computation and data are distributed via high-speed networking. B. Hadoop an Overview: The Hadoop framework consists of the following modules. Hadoop Common, it includes libraries and utilities required by the other modules. Hadoop Distributed File System (HDFS) contains distributed filesystem that stores data on commodity machines, providing very high aggregate bandwidth across the cluster. Hadoop YARN is a platform controls computing resources in clusters for managing and also for schedule the applications. The Hadoop Map is a Map Reduce programming model for large-scale data processing. The Hadoop framework has been written in the Java programming language, with some native code in C and command line utilities written as shell scripts [3]. The Map Reduce Java code is common, any programming language can be used with "Hadoop Streaming" to implement the "map" and "reduce" parts of the user's program other projects in the Hadoop ecosystem expose richer user interfaces. C. Hadoop Distributed File System (HDFS): The HDFS is a distributed, scalable, and portable file system written in Java for the Hadoop framework [4]. A Hadoop cluster has nominally a single name node plus a cluster of data nodes, although redundancy options are available for the name node due to its criticality. Each data node serves up blocks of data over the network using a block protocol specific to HDFS. The file system uses TCP/IP sockets for communication [5]. Clients use remote procedure calls (RPC) to communicate with each other. HDFS stores large across multiple machines. It achieves reliability by replicating the data across multiple hosts, and hence theoretically does not require redundant array of independent disks (RAID) storage on hosts [6]. HDFS is not fully POSIX-compliant, because the requirements for a POSIX file-system differ from the target goals of a Hadoop application. HDFS was designed for mostly immutable files and may not be suitable for systems requiring concurrent write-
2 operations. HDFS can be mounted directly with a File system in User space (FUSE) virtual file system on Linux and some other UNIX systems [6]. II. BACKGROUND AND RELATED WORKS Social Media is an umbrella term that describes websites and online tools that people use to connect and share content, experiences, opinions and media [7]. It enables conversations and interactions with people online. Examples of Social Media platforms are Facebook, Twitter and YouTube. While social media is great for staying in touch with friends and family, it also provides businesses of all shapes and sizes with a fantastic opportunity to communicate directly with new and existing customers - and at minimal cost. Both Facebook and Twitter allow small businesses to share descriptions about themselves, photographs, and information about their products and how to buy them, with new and existing customers at the click of a mouse. Recently, Pushpa, GauravGarg has been made a Review on User Behaviour Analysis using KNN and SVM vide Tweetson Big Data[ 8]. Another author Dr. B. Lavanya, B. Divya has been analysed an accident prediction system with huge collection of past records by applying effective predictive data mining techniques such as Support Vector Machine (SVM) and K-Nearest Neighbor (K-NN) which have a greater capacity to handle huge and noisy data that are used to predict accidents with more accuracy[9]. Presently, AnushreePriyadarshini and SonaliAgarwal has been analyzed the impact of penalty and kernel parameters on the performance of parallel SVM [1]. Their experimental results also analyzed that the computation time taken by the SVM with multi-node cluster is less as compared to the single node cluster for large dataset. Currently, Zhanquan Sun, Geoffrey Fox has been made an analysis on parallel SVM based on iterative MapReduce model Twister is studied. Their analysis results show that the parallel SVM based on iterative MapReduce is efficient in data intensive problems [11]. III. METHODOLOGY The effectiveness of well-known sentiment classification algorithms on our novel corpus of Facebook features. The different machine learning techniques such as Naïve Bayes, KNN, Neural Networks and Support Vector Machine are used to determine the different types of features posted by the members of the Facebook. This section deals with the proposed methodology. The Support Vector Machine [12] algorithm is used as the back bone of the method. It is adapted into the process of face book data analysis. The goal of this study is to improve the effectiveness of the proposed methodology to predict the user responses. The proposed algorithm in predicting the face book user will be described in the next section. A. Support Vector Machine: The Support Vector Machine (SVM) Support Vector Machine classification method is a very effective way for classification, and its results are better than other classification algorithms, in general such Naïve Bayes and decision trees, etc. The aim of the SVM is to identify a hyper-plane that separates two classes of data. The chosen hyper-plane creates the largest margin between the two classes to make the points belonging to different classes and also make those points away from the hyper-plane as far as possible. In other word, using SVM classification method is equivalent to solving a constrained optimization problem. Support Vector Machine (SVM) is a classification technique based on statistical learning theory. It is based on the idea of a hyper plane classifier [12]. The goal of SVM is to find a linear optimal hyper plane so that the margin of separation between the two classes is maximized. We choose SVM as the classifier because of its often reported best performance and it has been adopted by many previous text classification studies. The method suggested in this paper is to predict or classify the facebook users are belonging to the process of data mining which is given in Fig 1. There are five main stages in this method. The stages are Data collection, preprocessing, Data Transformation, classification and result interpretation. Data collection is gathering information available on facebook data for the three countries like India, Australia and South Africa during the year 216. During pre-processing [13] stage data cleaning, attributes selection, dimensionality reduction, and data partitioning are applied to get better prediction. Subsequently the Extracted data is transformed for classification. Whereas, in classification stage Data Mining [14] algorithms are used for the classification of data. Normally, at this stage different Data Mining algorithms are executed with different variables and compared to select algorithm [15] which produce best results. Finally, in interpretation stage models obtained from previous stage are analyzed to predict user responses on facebook data. 55
3 IV. EXPERIMENT RESULTS The improved communication technology increases the sharing complexities and reduces the potential applications. The way people wants to exhibit their views, is highly promoted by so many number of social networks. Among them Face book takes place a vital role. Fig 1: Method Proposed for Predicting the User Responses The most universal features of the face book are the Photos, links and status. These are the different perceptions people can share their views, thoughts, and joys and also supports for business promotion. The data has been collected with several attributes. The data set (face book) holds for the year 216 for the three countries India, Australia and South Africa which is given in Fig Fig 2 : Sample Dataset
4 The data set is pre processed by the Hadoop Distributed File System (HDFS). From the data set the country and the essential components or features such as photos, likes and status are extracted which is shown in Fig 3. The extractions are tested through the SVM algorithm and it categories the responses made by the face book members. The results are furnished below in Table I and Fig 4. In Table I and Fig 4 one among the features such as photo is considered of maximum number of users posted in the face book. The graph explains that the photos are most liked only in South Africa by the users. In general the members or users may post so many numbers of useful links among themselves. In Table II and Fig 5 the maximum number of users shared or commented found mostly only in India. At the same time the Table III and Fig 6 explains that status feature and likes is highly posted only by the users of Australia. The Table IV and Fig 7 the overall face book members response towards sharing only the photo is occupies the highest percent among the three Facebook features. From this the highest occupation for South Africa, India and Australia respectively. So the most important to understand from this analysis is that the members are interest only towards sharing their joys and happiness rather than text and information s. Fig 3: Selection of Country for Data Extraction Table I. Photos Interactions for (India, Australia and South Africa) COUNTRY NAME COMMENTS LIKES SHARES SOUTH AFRICA Table II. Links Interactions for (India, Australia and South Africa) COUNTRY NAME COMMENTS LIKES SHARES SOUTH AFRICA
5 User-Responses User-Responses International Journal of Electrical Electronics & Computer Science Engineering Table III. Status Interactions for (India, Australia and South Africa) COUNTRY NAME COMMENTS LIKES SHARES SOUTH AFRICA Table IV. Overall Total Interactions for (India, Australia and South Africa) COUNTRY NAME PHOTOS LINKS STATUS SOUTH AFRICA PHOTOS - Comparison COMMENTS LIKES SHARES Posted SOUTH-AFRICA Fig. 4. Photos Interactions for (India, Australia and South Africa) LINKS - Comparison COMMENTS LIKES SHARES Posted SOUTH-AFRICA Fig. 5. Links Interactions for (India, Australia and South Africa) 58
6 User-Responses User-Responses International Journal of Electrical Electronics & Computer Science Engineering STATUS - Comparison COMMENTS LIKES SHARES Posted SOUTH-AFRICA Fig. 6. Status Interactions for (India, Australia and South Africa) Overall Total-Interactions PHOTOS LINKS STATUS Features SOUTH AFRICA 59 V. CONCLUSION Fig. 7. Overall Total Interactions (India, Australia and South Africa) The research imitates with a scope to determine the maximum benefits for the users gathered through the social network. It starts with various parameters to identify the real interactions behind the face book. The Data mining algorithm SVM supports its maximum to classify the users interest towards sharing of Photos, Links and Status. In this it is identified that the photos sharing takes top most position. The next rank goes to status as well as for links. The research would like to conclude that it has the prime feature is to share the photos rather than other information through the network. VI. [1] [2] REFERENCES [3] P. Sachin Bappalige Feed, An introduction to Apache Hadoop for big data, August 214. [4] Evans, Chris, Big data storage: Hadoop storage basics, June 216.
7 [5] Kumar Gautam, Big Data - Part1, February 216. [6] Pessach and Yaniv, Distributed Storage, Distributed Storage: Concepts, Algorithms, and Implementations, Amazon.com, 213. [7] Bharat Naiknaware, Bindesh Kushwaha, Seema Kawathekar, Social Media Sentiment Analysis using Machine Learning Classifiers,, International Journal of Computer Science & Mobile Computing (IJCSMC), Vol. 6, Issue. 6, June 217, pg [8] Dr. B. Lavanya and B. Divya, Big Data Analysis Using SVM and KNN Data Mining Techniques, International Journal of Computer Science and Mobile Computing (IJCSMC), Vol. 6, Issue. 1, January 217, pg [9] Pushpa & Gaurav Garg, Review on User Behaviour Analysis using KNN/SVM vide Tweetson Big Data, International Journal of Innovative Research in Computer & Communication Engineering, Vol. 5, Issue 4, April 217. [1] Zhanquan Sun and Geoffrey Charles Foxn, Study on Parallel SVM Based on MapReduce, CiteSeer. [11] Safa Ben Hamouda and Jalel Akaichi, Social Networks Text Mining for Sentiment Classification: The case of Facebook statuses updates in the Arabic Spring Era, International Journal of Application or Innovation in Engineering & Management (IJAIEM), Volume 2, Issue 5, May 213. [12] Bo Guo, Rui Zhang, Guang Xu, Chuangming Shi and Li Yang, Predicting Students Performance in Educational Data Mining, IEEE Xplore, 24 March 216. [13] B. Umadevi D. Sundar, Dr. P. Alli, An Optimized Approach to Predict the Stock Market Behavior and Investment Decision Making using Benchmark Algorithms for Naïve Investors, Computational Intelligence and Computing Research (ICCIC), 213 IEEE International Conference on ( IEEE Xplore Digital Library), pg1-5. [14] B. Umadevi, D. Sundar, Dr. P. Alli, A Study on Stock Market Analysis for Stock Selection - Naïve Investors Perspective using Data Mining Technique, International Journal of Computer Applications, ( ), Vol 34 No.3, 211. [15] Dr. B. Umadevi and M. Snehapriya, A Review On Various Data Mining Techniques In Social Media, International Journal of Innovative Research in Computer & Communication Engineering, Vol 5, Issue 4, April
Knowledge Discovery and Data Mining
Knowledge Discovery and Data Mining Unit # 19 1 Acknowledgement The following discussion is based on the paper Mining Big Data: Current Status, and Forecast to the Future by Fan and Bifet and online presentation
More informationSpark, Hadoop, and Friends
Spark, Hadoop, and Friends (and the Zeppelin Notebook) Douglas Eadline Jan 4, 2017 NJIT Presenter Douglas Eadline deadline@basement-supercomputing.com @thedeadline HPC/Hadoop Consultant/Writer http://www.basement-supercomputing.com
More informationNew Approach for scheduling tasks and/or jobs in Big Data Cluster
New Approach for scheduling tasks and/or jobs in Big Data Cluster IT College, Chairperson of MS Dept. Agenda Introduction What is Big Data? The 4 characteristics of Big Data V4s Different Categories of
More informationIntro to Big Data and Hadoop
Intro to Big and Hadoop Portions copyright 2001 SAS Institute Inc., Cary, NC, USA. All Rights Reserved. Reproduced with permission of SAS Institute Inc., Cary, NC, USA. SAS Institute Inc. makes no warranties
More informationCOMP9321 Web Application Engineering
COMP9321 Web Application Engineering Semester 1, 2017 Dr. Amin Beheshti Service Oriented Computing Group, CSE, UNSW Australia Week 11 (Part II) http://webapps.cse.unsw.edu.au/webcms2/course/index.php?cid=2457
More informationData Analytics with MATLAB Adam Filion Application Engineer MathWorks
Data Analytics with Adam Filion Application Engineer MathWorks 2015 The MathWorks, Inc. 1 Case Study: Day-Ahead Load Forecasting Goal: Implement a tool for easy and accurate computation of dayahead system
More informationSunnie Chung. Cleveland State University
Sunnie Chung Cleveland State University Data Scientist Big Data Processing Data Mining 2 INTERSECT of Computer Scientists and Statisticians with Knowledge of Data Mining AND Big data Processing Skills:
More informationAccelerating Your Big Data Analytics. Jeff Healey, Director Product Marketing, HPE Vertica
Accelerating Your Big Data Analytics Jeff Healey, Director Product Marketing, HPE Vertica Recent Waves of Disruption IT Infrastructu re for Analytics Data Warehouse Modernization Big Data/ Hadoop Cloud
More informationE-guide Hadoop Big Data Platforms Buyer s Guide part 1
Hadoop Big Data Platforms Buyer s Guide part 1 Your expert guide to Hadoop big data platforms for managing big data David Loshin, Knowledge Integrity Inc. Companies of all sizes can use Hadoop, as vendors
More informationBy: Shrikant Gawande (Cloudera Certified )
By: Shrikant Gawande (Cloudera Certified ) What is Big Data? For every 30 mins, a airline jet collects 10 terabytes of sensor data (flying time) NYSE generates about one terabyte of new trade data per
More informationABOUT THIS TRAINING: This Hadoop training will also prepare you for the Big Data Certification of Cloudera- CCP and CCA.
ABOUT THIS TRAINING: The world of Hadoop and Big Data" can be intimidating - hundreds of different technologies with cryptic names form the Hadoop ecosystem. This comprehensive training has been designed
More informationBIG DATA AND HADOOP DEVELOPER
BIG DATA AND HADOOP DEVELOPER Approximate Duration - 60 Hrs Classes + 30 hrs Lab work + 20 hrs Assessment = 110 Hrs + 50 hrs Project Total duration of course = 160 hrs Lesson 00 - Course Introduction 0.1
More informationTutorial: Big Data Algorithms and Applications Under Hadoop KUNPENG ZHANG SIDDHARTHA BHATTACHARYYA
Tutorial: Big Data Algorithms and Applications Under Hadoop KUNPENG ZHANG SIDDHARTHA BHATTACHARYYA http://kzhang6.people.uic.edu/tutorial/amcis2014.html August 7, 2014 Schedule I. Introduction to big data
More informationMoving From Contact Center to Customer Engagement
Daitan White Paper Moving From Contact Center to Customer Engagement USING THE CLOUD, BIG DATA AND WEBRTC TO GET THERE Highly Reliable Software Development Services http://www.daitangroup.com Daitan Group
More informationBig Data The Big Story
Big Data The Big Story Jean-Pierre Dijcks Big Data Product Mangement 1 Agenda What is Big Data? Architecting Big Data Building Big Data Solutions Oracle Big Data Appliance and Big Data Connectors Customer
More informationTEXT MINING APPROACH TO EXTRACT KNOWLEDGE FROM SOCIAL MEDIA DATA TO ENHANCE BUSINESS INTELLIGENCE
International Journal of Advance Research In Science And Engineering http://www.ijarse.com TEXT MINING APPROACH TO EXTRACT KNOWLEDGE FROM SOCIAL MEDIA DATA TO ENHANCE BUSINESS INTELLIGENCE R. Jayanthi
More informationBringing the Power of SAS to Hadoop Title
WHITE PAPER Bringing the Power of SAS to Hadoop Title Combine SAS World-Class Analytics With Hadoop s Low-Cost, Distributed Data Storage to Uncover Hidden Opportunities ii Contents Introduction... 1 What
More informationInternational Journal of Advance Engineering and Research Development A SENTIMENT MINING APPROACH TO BIG DATA ANALYTICS COMPARISON STUDY
Scientific Journal of Impact Factor (SJIF): 5.71 International Journal of Advance Engineering and Research Development Volume 5, Issue 03, March -2018 A SENTIMENT MINING APPROACH TO BIG DATA ANALYTICS
More informationComparing Application Performance on HPC-based Hadoop Platforms with Local Storage and Dedicated Storage
Comparing Application Performance on HPC-based Hadoop Platforms with Local Storage and Dedicated Storage Zhuozhao Li *, Haiying Shen *, Jeffrey Denton and Walter Ligon * Department of Computer Science,
More informationPrediction of Personalized Rating by Combining Bandwagon Effect and Social Group Opinion: using Hadoop-Spark Framework
Prediction of Personalized Rating by Combining Bandwagon Effect and Social Group Opinion: using Hadoop-Spark Framework Lu Sun 1, Kiejin Park 2 and Limei Peng 1 1 Department of Industrial Engineering, Ajou
More informationA REVIEW ON HADOOP ARCHITECTURE FOR BIG DATA
International Journal of Research in Engineering, Technology and Science, Volume VI, Special Issue, July 2016 www.ijrets.com, editor@ijrets.com, ISSN 2454-1915 A REVIEW ON HADOOP ARCHITECTURE FOR BIG DATA
More informationResearch of the Social Media Data Analyzing Platform Based on Cloud Mining Yi-Tang ZENG, Yu-Feng ZHANG, Sheng CAO, Li LI, Cheng-Wei ZHANG *
2016 3 rd International Conference on Social Science (ICSS 2016) ISBN: 978-1-60595-410-3 Research of the Social Media Data Analyzing Platform Based on Cloud Mining Yi-Tang ZENG, Yu-Feng ZHANG, Sheng CAO,
More informationFrom Information to Insight: The Big Value of Big Data. Faire Ann Co Marketing Manager, Information Management Software, ASEAN
From Information to Insight: The Big Value of Big Data Faire Ann Co Marketing Manager, Information Management Software, ASEAN The World is Changing and Becoming More INSTRUMENTED INTERCONNECTED INTELLIGENT
More informationKnowledgeSTUDIO. Advanced Modeling for Better Decisions. Data Preparation, Data Profiling and Exploration
KnowledgeSTUDIO Advanced Modeling for Better Decisions Companies that compete with analytics are looking for advanced analytical technologies that accelerate decision making and identify opportunities
More informatione-issn: p-issn:
Available online at www.ijiere.com International Journal of Innovative and Emerging Research in Engineering e-issn: 2394 3343 p-issn: 2394 5494 Real Time Analysis of Social Media Text Using Stream Computing
More informationGot Hadoop? Whitepaper: Hadoop and EXASOL - a perfect combination for processing, storing and analyzing big data volumes
Got Hadoop? Whitepaper: Hadoop and EXASOL - a perfect combination for processing, storing and analyzing big data volumes Contents Introduction...3 Hadoop s humble beginnings...4 The benefits of Hadoop...5
More informationBusiness Intelligence, 4e (Sharda/Delen/Turban) Chapter 1 An Overview of Business Intelligence, Analytics, and Data Science
Business Intelligence, 4e (Sharda/Delen/Turban) Chapter 1 An Overview of Business Intelligence, Analytics, and Data Science 1) Computerized support is only used for organizational decisions that are responses
More informationThe Sysprog s Guide to the Customer Facing Mainframe: Cloud / Mobile / Social / Big Data
Glenn Anderson, IBM Lab Services and Training The Sysprog s Guide to the Customer Facing Mainframe: Cloud / Mobile / Social / Big Data Summer SHARE August 2015 Session 17794 2 (c) Copyright 2015 IBM Corporation
More informationLearning Based Admission Control. Jaideep Dhok MS by Research (CSE) Search and Information Extraction Lab IIIT Hyderabad
Learning Based Admission Control and Task Assignment for MapReduce Jaideep Dhok MS by Research (CSE) Search and Information Extraction Lab IIIT Hyderabad Outline Brief overview of MapReduce MapReduce as
More informationThe Accurate Marketing System Design Based on Data Mining Technology: A New Approach. ZENG Yuling 1, a
International Conference on Advances in Mechanical Engineering and Industrial Informatics (AMEII 2015) The Accurate Marketing System Design Based on Data Mining Technology: A New Approach ZENG Yuling 1,
More informationREVIEW ON PREDICTION OF CHRONIC KIDNEY DISEASE USING DATA MINING TECHNIQUES
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology ISSN 2320 088X IMPACT FACTOR: 5.258 IJCSMC,
More informationDATA SCIENCE: HYPE AND REALITY PATRICK HALL
DATA SCIENCE: HYPE AND REALITY PATRICK HALL About me SAS Enterprise Miner, 2012 Cloudera Data Scientist, 2014 Do you use Kolmogorov Smirnov often? Statistician No, I mix my martinis with gin. Data Scientist
More informationOutline of Hadoop. Background, Core Services, and Components. David Schwab Synchronic Analytics Nov.
Outline of Hadoop Background, Core Services, and Components David Schwab Synchronic Analytics https://synchronicanalytics.com Nov. 1, 2018 Hadoop s Purpose and Origin Hadoop s Architecture Minimum Configuration
More informationBig Data Foundation. 2 Days Classroom Training PHILIPPINES :: MALAYSIA :: VIETNAM :: SINGAPORE :: INDIA
Big Data Foundation 2 Days Classroom Training PHILIPPINES :: MALAYSIA :: VIETNAM :: SINGAPORE :: INDIA Content Big Data Foundation Course Introduction Who we are Course Overview Career Path Course Content
More informationKnowledgeENTERPRISE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK. Advanced Analytics on Spark BROCHURE
FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK Are you drowning in Big Data? Do you lack access to your data? Are you having a hard time managing Big Data processing requirements?
More informationDatameer for Data Preparation: Empowering Your Business Analysts
Datameer for Data Preparation: Empowering Your Business Analysts As businesses strive to be data-driven organizations, self-service data preparation becomes a critical cog in the analytic process. Self-service
More informationPREDICTION OF SOCIAL NETWORK SITES USING WEKA TOOL
PREDICTION OF SOCIAL NETWORK SITES USING WEKA TOOL G.Thirumani Aatthi 1, R.Aishwarya 2, R.Mallika 3, A.Angel 4 1 Assistant Professor, 2,3,4 M.Sc(CS&IT), Department of Computer Science & Information Technology,
More informationExploring Big Data and Data Analytics with Hadoop and IDOL. Brochure. You are experiencing transformational changes in the computing arena.
Brochure Software Education Exploring Big Data and Data Analytics with Hadoop and IDOL You are experiencing transformational changes in the computing arena. Brochure Exploring Big Data and Data Analytics
More informationData Analytics for Semiconductor Manufacturing The MathWorks, Inc. 1
Data Analytics for Semiconductor Manufacturing 2016 The MathWorks, Inc. 1 Competitive Advantage What do we mean by Data Analytics? Analytics uses data to drive decision making, rather than gut feel or
More informationTransforming Analytics with Cloudera Data Science WorkBench
Transforming Analytics with Cloudera Data Science WorkBench Process data, develop and serve predictive models. 1 Age of Machine Learning Data volume NO Machine Learning Machine Learning 1950s 1960s 1970s
More informationIBM SPSS & Apache Spark
IBM SPSS & Apache Spark Making Big Data analytics easier and more accessible ramiro.rego@es.ibm.com @foreswearer 1 2016 IBM Corporation Modeler y Spark. Integration Infrastructure overview Spark, Hadoop
More informationBIG DATA PROCESSING A DEEP DIVE IN HADOOP/SPARK & AZURE SQL DW
BIG DATA PROCESSING A DEEP DIVE IN HADOOP/SPARK & AZURE SQL DW TOPICS COVERED 1 2 Fundamentals of Big Data Platforms Major Big Data Tools Scaling Up vs. Out SCALE UP (SMP) SCALE OUT (MPP) + (n) Upgrade
More informationHadoop Course Content
Hadoop Course Content Hadoop Course Content Hadoop Overview, Architecture Considerations, Infrastructure, Platforms and Automation Use case walkthrough ETL Log Analytics Real Time Analytics Hbase for Developers
More informationStackIQ Enterprise Data Reference Architecture
WHITE PAPER StackIQ Enterprise Data Reference Architecture StackIQ and Hortonworks worked together to Bring You World-class Reference Configurations for Apache Hadoop Clusters. Abstract Contents The Need
More informationBig Data Analytics: Technologies, Opportunities, and Challenges. Murali K. Pusala. Jan 11, 2018
Big Data Analytics: Technologies, Opportunities, and Challenges Murali K. Pusala Jan 11, 2018 CMPS 490 / CSCE 598 Big Data Analytics Description: Essentials of Big Data analytics. Topics include: challenges
More informationINTEGRATION OF MULTI BANK & USER SMART CARD WITH MULTI CLOUD DEPLOYMENT
INTEGRATION OF MULTI BANK & USER SMART CARD WITH MULTI CLOUD DEPLOYMENT R DIVYA 1, K.KAMRUDEEN 2, C.P NIJITHA MAHALAKSMI 3 12 PG Student, Department of Computer Applications, New Prince ShriBhavani College
More informationAnalytical Capability Security Compute Ease Data Scale Price Users Traditional Statistics vs. Machine Learning In-Memory vs. Shared Infrastructure CRAN vs. Parallelization Desktop vs. Remote Explicit vs.
More informationLE NUOVE FRONTIERE DALL AI ALL AR E L IMPATTO SULLA QUOTIDIANITÀ
LE NUOVE FRONTIERE DALL AI ALL AR E L IMPATTO SULLA QUOTIDIANITÀ Deloitte Analytics & Information Management Torino, 26/03/2018 1 AUGMENTED REALITY FUNDAMENTALS EXAMPLES OF DEEP LEARNING ARTIFICIAL INTELLIGENCE
More informationBIG DATA ANALYTICS WITH HADOOP. 40 Hour Course
1 BIG DATA ANALYTICS WITH HADOOP 40 Hour Course OVERVIEW Learning Objectives Understanding Big Data Understanding various types of data that can be stored in Hadoop Setting up and Configuring Hadoop in
More informationSAP Predictive Analytics Suite
SAP Predictive Analytics Suite Tania Pérez Asensio Where is the Evolution of Business Analytics Heading? Organizations Are Maturing Their Approaches to Solving Business Problems Reactive Wait until a problem
More informationReal Applications of Big Data. Challenges and Opportunities
Real Applications of Big Data. Challenges and Opportunities David Gil University of Alicante Polytechnic School - EPSA Department of Computer Technology Introduction to Big Data Projects Benchmarking (weka,
More informationMODEL OF SENTIMENT ANALYSIS FOR SOCIAL MEDIA DATA
MODEL OF SENTIMENT ANALYSIS FOR SOCIAL MEDIA DATA Nurul Atasha Khairuddin, Kamilia Kamardin Advanced Informatics School, Universiti Teknologi Malaysia, Jalan Sultan Yahya Petra, 54100 Kuala Lumpur, Malaysia.
More informationHADOOP ADMINISTRATION
HADOOP ADMINISTRATION PROSPECTUS HADOOP ADMINISTRATION UNIVERSITY OF SKILLS ABOUT ISM UNIV UNIVERSITY OF SKILLS ISM UNIV is established in 1994, past 21 years this premier institution has trained over
More informationCloud Assisted Trend Analysis of Twitter Data using Hadoop
ISSN: 2454-132X Impact factor: 4.295 (Volume 4, Issue 2) Available online at: www.ijariit.com Cloud Assisted Trend Analysis of Twitter Data using Hadoop Aman Gupta vasugupta42@gmail.com Ishaan Bhasin ishaanbhasin96@gmail.com
More informationTop 5 Challenges for Hadoop MapReduce in the Enterprise. Whitepaper - May /9/11
Top 5 Challenges for Hadoop MapReduce in the Enterprise Whitepaper - May 2011 http://platform.com/mapreduce 2 5/9/11 Table of Contents Introduction... 2 Current Market Conditions and Drivers. Customer
More informationThe usage of Big Data mechanisms and Artificial Intelligence Methods in modern Omnichannel marketing and sales
The usage of Big Data mechanisms and Artificial Intelligence Methods in modern Omnichannel marketing and sales Today's IT service providers offer a large set of tools supporting sales and marketing activities
More informationGPU ACCELERATED BIG DATA ARCHITECTURE
INNOVATION PLATFORM WHITE PAPER 1 Today s enterprise is producing and consuming more data than ever before. Enterprise data storage and processing architectures have struggled to keep up with this exponentially
More informationUnderstanding the Behavior of In-Memory Computing Workloads. Rui Hou Institute of Computing Technology,CAS July 10, 2014
Understanding the Behavior of In-Memory Computing Workloads Rui Hou Institute of Computing Technology,CAS July 10, 2014 Outline Background Methodology Results and Analysis Summary 2 Background The era
More informationFINANCE + DEEP LEARNING SKYMIND / DEEPLEARNING4J 2015
FINANCE + DEEP LEARNING SKYMIND / DEEPLEARNING4J 2015 WHICH OUTCOMES MATTER? BASIC APPLICATIONS Anomaly Detection for Compliance Rogue Traders, Fat Fingers Fraud, Money Laundering Detection Trading Strategies
More informationFrom GIS to Location Intelligence
From GIS to Location Intelligence From Science to Big Data to Business Opportunity Joe Francica Managing Director, Location Intelligence solutions @joefrancica Joe.Francica@pb.com Pitney Bowes today. We
More informationAchieving Agility and Flexibility in Big Data Analytics with the Urika -GX Agile Analytics Platform
Achieving Agility and Flexibility in Big Data Analytics with the Urika -GX Agile Analytics Platform Analytics R&D and Product Management Document Version 1 WP-Urika-GX-Big-Data-Analytics-0217 www.cray.com
More informationIndex Terms: Customer Loyalty, SVM, Data mining, Classification, Gaussian kernel
Volume 4, Issue 12, December 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Gaussian Kernel
More informationKeywords- Social media,machine learning (ML), Hadoop framework, Decision making, Data processing with R, Mahout
GLOBAL JOURNAL OF ENGINEERING SCIENCE AND RESEARCHES USAGE OF MACHINE LEARNING AND HADOOP IN SOCIAL MEDIA ANALYTICS Prof. K. Adisesha *1 & Prof. Praveen Moses 2 *1 HOD, Computer Science Department, Bangalore
More informationData Mining Applications with R
Data Mining Applications with R Yanchang Zhao Senior Data Miner, RDataMining.com, Australia Associate Professor, Yonghua Cen Nanjing University of Science and Technology, China AMSTERDAM BOSTON HEIDELBERG
More informationBig Data Application Engineer/ Developer. Specialization in Apache Spark, Kafka, Airflow, HBase
BIG DATA COURSE Big Data Application Engineer/ Developer Specialization in Apache Spark, Kafka, Airflow, HBase In Exclusive Association with 21,347+ Participants 10,000+ Brands 1200+ Trainings 45+ Countries
More informationIdentification of Crop Disease by Predictive Analysis in Hadoop Environment
Volume 118 No. 18 2018, 2805-2809 ISSN: 1311-8080 (printed version); ISSN: 1314-3395 (on-line version) url: http://www.ijpam.eu ijpam.eu Identification of Crop Disease by Predictive Analysis in Hadoop
More informationAugmented Real-time Clinical DataMart. Phani S Srinivasan Ponnapalli, Syneos Health Subrahmanyam Rayaprolu, Syneos Health
Augmented Real-time Clinical DataMart Phani S Srinivasan Ponnapalli, Syneos Health Subrahmanyam Rayaprolu, Syneos Health Agenda Introduction Traditional Clinical Data warehouse vs Digital Data Modern Data
More informationAssistant Professor Neha Pandya Department of Information Technology, Parul Institute Of Engineering & Technology Gujarat Technological University
Feature Level Text Categorization For Opinion Mining Gandhi Vaibhav C. Computer Engineering Parul Institute Of Engineering & Technology Gujarat Technological University Assistant Professor Neha Pandya
More informationEnterprise-Scale MATLAB Applications
Enterprise-Scale Applications Sylvain Lacaze Rory Adams 2018 The MathWorks, Inc. 1 Enterprise Integration Access and Explore Data Preprocess Data Develop Predictive Models Integrate Analytics with Systems
More informationR and Hadoop. Ram Venkat Dawn Analytics
R and Hadoop Ram Venkat Dawn Analytics What is Hadoop? Hadoop is an open source Apache software for running distributed applications on 'big data' It contains a distributed file system (HDFS) and a parallel
More informationDesign and Implementation of Office Automation System based on Web Service Framework and Data Mining Techniques. He Huang1, a
3rd International Conference on Materials Engineering, Manufacturing Technology and Control (ICMEMTC 2016) Design and Implementation of Office Automation System based on Web Service Framework and Data
More informationIntroduction to Real-Time Processing in Apache Apex
Introduction to Real-Time Processing in Apache Apex Harsh Pathak 1, Manas Rathi 2, Aniket Parekh 3 Third Year Students 1,2,3, Department of Computer Engineering, Vishwakarma Institute of Information Technology,
More informationBUSINESS DATA MINING (IDS 572) Please include the names of all team-members in your write up and in the name of the file.
BUSINESS DATA MINING (IDS 572) HOMEWORK 4 DUE DATE: TUESDAY, APRIL 10 AT 3:20 PM Please provide succinct answers to the questions below. You should submit an electronic pdf or word file in blackboard.
More informationNOVEL DATAMINING TECHNIQUE FOR ONLINE SHOPPING
NOVEL DATAMINING TECHNIQUE FOR ONLINE SHOPPING D. Ravi 1, A. Dharshini 2, Keerthaneshwari 3, M. Naveen Kumar 4 1,2,3,4 CSE Department, Kathir College of Engineering Abstract: Opinion mining is a type of
More informationRealising Value from Data
Realising Value from Data Togetherwith Open Source Drives Innovation & Adoption in Big Data BCS Open Source SIG London 1 May 2013 Timings 6:00-6:30pm. Register / Refreshments 6:30-8:00pm, Presentation
More informationInternational Journal of Scientific & Engineering Research, Volume 6, Issue 3, March ISSN Web and Text Mining Sentiment Analysis
International Journal of Scientific & Engineering Research, Volume 6, Issue 3, March-2015 672 Web and Text Mining Sentiment Analysis Ms. Anjana Agrawal Abstract This paper describes the key steps followed
More informationPredictive Analytics Using Support Vector Machine
International Journal for Modern Trends in Science and Technology Volume: 03, Special Issue No: 02, March 2017 ISSN: 2455-3778 http://www.ijmtst.com Predictive Analytics Using Support Vector Machine Ch.Sai
More informationAnalytics in the Cloud, Cross Functional Teams, and Apache Hadoop is not a Thing Ryan Packer, Bank of New Zealand
Paper 2698-2018 Analytics in the Cloud, Cross Functional Teams, and Apache Hadoop is not a Thing Ryan Packer, Bank of New Zealand ABSTRACT Digital analytics is no longer just about tracking the number
More informationContents at a Glance COPYRIGHTED MATERIAL. Introduction... 1 Part I: Getting Started with Big Data... 7
Contents at a Glance Introduction... 1 Part I: Getting Started with Big Data... 7 Chapter 1: Grasping the Fundamentals of Big Data...9 Chapter 2: Examining Big Data Types...25 Chapter 3: Old Meets New:
More informationActive Analytics Overview
Active Analytics Overview The Fourth Industrial Revolution is predicated on data. Success depends on recognizing data as the most valuable corporate asset. From smart cities to autonomous vehicles, logistics
More informationNEXT GENERATION PREDICATIVE ANALYTICS USING HP DISTRIBUTED R
1 A SOLUTION IS NEEDED THAT NOT ONLY HANDLES THE VOLUME OF BIG DATA OR HUGE DATA EASILY, BUT ALSO PRODUCES INSIGHTS INTO THIS DATA QUICKLY NEXT GENERATION PREDICATIVE ANALYTICS USING HP DISTRIBUTED R A
More informationNICE Customer Engagement Analytics - Architecture Whitepaper
NICE Customer Engagement Analytics - Architecture Whitepaper Table of Contents Introduction...3 Data Principles...4 Customer Identities and Event Timelines...................... 4 Data Discovery...5 Data
More informationForecasting Blood Donor Response Using Predictive Modelling Approach
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology ISSN 2320 088X IMPACT FACTOR: 6.199 IJCSMC,
More informationWe create clarity, out of the chaos of digital noise.
We create clarity, out of the chaos of digital noise. Alto-Analytics.com $65 million additional net income earned by a typical Fortune 500 increasing data accessibility 10% 60% increase in operating margin
More informationHadoopWeb: MapReduce Platform for Big Data Analysis
HadoopWeb: MapReduce Platform for Big Data Analysis Saloni Minocha 1, Jitender Kumar 2,s Hari Singh 3, Seema Bawa 4 1Student, Computer Science Department, N.C. College of Engineering, Israna, Panipat,
More informationTapping on the Power of Big Data to Improve the Operation of Power Systems
Tapping on the Power of Big Data to Improve the Operation of Power Systems KA Folly University of Cape Town South Africa Place your company logo here NB: Your company s logo is only permitted on the first
More informationBusiness Intelligence. By Prof.Sushila Aghav-Palwe
Business Intelligence By Prof.Sushila Aghav-Palwe Introduction Business intelligence may be defined as a set of mathematical models and analysis methodologies that exploit the available data to generate
More informationBig Data & Hadoop Advance
Course Durations: 30 Hours About Company: Course Mode: Online/Offline EduNextgen extended arm of Product Innovation Academy is a growing entity in education and career transformation, specializing in today
More informationZeal Education Society s Zeal Institute of Business Administration, Computer Application & Research
Zeal Education Society s Zeal Institute of Business Administration, Computer Application & Research Sr. No. 39, Narhe, Pune -411041, Phone No.:020-67206031, Website: www.zibacar.in (Approved by A.I.C.T.E.,
More informationThe Importance of Secure Analytics & AI
The Importance of 2014 Secure Analytics & AI Rick Hutley Program Director, Professor of Practice, Data Science, University of the Pacific Email: rhutley@pacific.edu Vice President of IoT, Cisco Systems
More informationMicrosoft Big Data. Solution Brief
Microsoft Big Data Solution Brief Contents Introduction... 2 The Microsoft Big Data Solution... 3 Key Benefits... 3 Immersive Insight, Wherever You Are... 3 Connecting with the World s Data... 3 Any Data,
More informationSpecial thanks to Chad Diaz II, Jason Montgomery & Micah Torres
Special thanks to Chad Diaz II, Jason Montgomery & Micah Torres Outline: What cloud computing is The history of cloud computing Cloud Services (Iaas, Paas, Saas) Cloud Computing Service Providers Technical
More informationArchitecting for Real- Time Big Data Analytics. Robert Winters
Architecting for Real- Time Big Data Analytics Robert Winters About Me 2 ROBERT WINTERS Head of Business Intelligence, TravelBird Ten years experience in analytics, five years with Vertica and big data
More informationPreface About the Book
Preface About the Book We are living in the dawn of what has been termed as the "Fourth Industrial Revolution" by the World Economic Forum (WEF) in 2016. The Fourth Industrial Revolution is marked through
More informationOptimization and Visualization of Opinion Mining and Sentiments In Tourism Dashboard: A Case of Statics Centre Abu-Dhabi (SCAD)
Optimization and Visualization of Opinion Mining and Sentiments In Tourism Dashboard: A Case of Statics Centre Abu-Dhabi (SCAD) Hamed Saif Albusaidi 1, Prakash Kumar Udupi 2, Vishal Dattana 3 1 Student,
More informationThe ABCs of Big Data Analytics, Bandwidth and Content
White Paper The ABCs of Big Data Analytics, Bandwidth and Content Richard Treadway and Ingo Fuchs, NetApp March 2012 WP-7147 EXECUTIVE SUMMARY Enterprises are entering a new era of scale, where the amount
More informationBusiness is being transformed by three trends
Business is being transformed by three trends Big Cloud Intelligence Stay ahead of the curve with Cortana Intelligence Suite Business apps People Custom apps Apps Sensors and devices Cortana Intelligence
More informationSmartCare. SPSS Workshop. Rick Durham - North American Advanced Analytics Channel Team IBM Corporation. Date: 5/28/2014
SPSS Workshop Key Presenter Rick Durham - North American Advanced Analytics Channel Team Date: 5/28/2014 Agenda What is Predictive Analytics? What is the architecture of the IBM/SPSS technology stack?
More informationEnhancing JS MR Based Data Visualisation using
Indian Journal of Science and Technology, Vol 8(11), DOI: 10.17485/ijst/2015/v8i11/71780, June 2015 ISSN (Print) : 0974-6846 ISSN (Online) : 0974-5645 Enhancing JS MR Based Data Visualisation using YARN
More informationNational Occupational Standard
National Occupational Standard Overview This unit is about performing research and designing a variety of algorithmic models for internal and external clients 19 National Occupational Standard Unit Code
More information