Big Data Initiatives in China: Opportunities and Challenges
|
|
- Carol Fletcher
- 6 years ago
- Views:
Transcription
1 Big Data Initiatives in China: Opportunities and Challenges Joshua Zhexue Huang Distinguished Professor Director of Big Data Institute College of Computer Science and Software Engineering Shenzhen University
2 Agenda 1. Recent Development of Big Data in China 2. Key Initiatives, Challenges and Opportunities 3. Research and Applications at Big Data Institute, Shenzhen University
3 What is Big Data? Big data is a term for data sets that are so large or complex that traditional data processing applications are inadequate to deal with them (Wikipedia). Big data often refers to the use of predictive analytics, user behavior analytics, or certain other advanced data analytics methods that extract value from data, and seldom to a particular size of data set.
4 Big Data Term and Popularity Big Data term was coined in 1998 by John R. Mashey, Chief Scientist of SGI The term then referred to data size in Gigabytes which will cause stress on infrastructure. On MARCH 29, 2012, Obama Administration announced Big Data Research and Development Initiative and $200 million to invest on big data, which made Big Data popular.
5 Recent Development of Big Data in China - China NSF funded key projects (2010) Massive data mining on cloud computing (2013) Big data oriented machine learning theory and methods (2014) Challenging research problems in big data technology and applications(8 projects) (2015) Five projects on big data (2016) More projects funded in information science and management areas
6 Recent Development of Big Data in China In August of 2012, Chinese Academy of Sciences started a strategic pilot project (1.3 billion in 5 years) Sensing China oriented next generation information Technologies A subproject on big data Research and development of key technologies for sea and cloud data systems 中国科学院图册 V 百科
7 Recent Development of Big Data in China In 2016, Ministry of Science and Technology of China started a special program on Cloud computing and big data which will accomplish 12 tasks in four areas with 400 millions RMB Cloud platform and big data infrastructure Data driven new software on cloud service model Big data analytics, applications and Human like intelligence Cloud convergence of Perceptual cognition and human machine interaction
8 Recent Development of Big Data in China -Ministry of Education of China 85 universities set up a new major on data science and big data technology Some major universities set up special schools, faculties and research institutes on data science and big data Tsinghua University:Tsinghua-Qingdao Data Science Institute Peking University: Beijing University Big Data Technology Inst Fudan University: School of Data Science, Sun Yat-Sen University:School of Data and Computer Science Shenzhen University: Big Data Institute
9 Recent Development of Big Data in China Local governments set up special organizations to promote big data Beijing: Beijing Institute of Big Data Research Guangdong Province: Big Data Bureau Shanghai: Shanghai Data Exchange Center Shenzhen: Shenzhen Research Institute of Big Data, Chinese University of Hong Kong (Shenzhen)
10 Recent Development of Big Data in China -Industry Big Internet Companies are the leaders in big data development and applications. They are also big data owners. Baidu, Alibaba, Tencent (BAT) All industry sectors are interested in big data Technology companies, e.g., Huawei, ZTE Telecommunications, e.g., China Mobile, China Unicom Banks and Insurance companies Manufacturing companies E-commerce companies Logistics service companies
11 Big Data Market in China 0.1 billion compound annual growth rate
12 Big data: a national strategy A decision was made to implement a national strategy for big data At the Third Plenary Session of the 18th Central Committee of the CPC in October The 13th Five-year Plan ( ) further defined that big data is fundamental strategic resources to be developed and utilized. National big data centers and platforms will be established. Key technologies, hardware and software will be innovated and developed, including data collection, storage, cleansing, analysis, mining, visualization, security and privacy protection.
13 Implementation Measures The State Council issued the action outline to promote the development of large data in In January 2016, The National Development and Reform Commission issued a notice on organizing the implementation of major projects to promote the development of big data, supporting projects in four areas: Pilot projects on big data applications Big data sharing Big data infrastructure development Big data standards and exchange systems
14 Agenda 1. Recent Development of Big Data in China 2. Key Initiatives, Challenges and Opportunities 3. Research and Applications at Big Data Institute, Shenzhen University
15 Initiatives to Develop Innovation Driven Economy in China Encourage young people to start their own business and pursue innovation (Mass entrepreneurship and innovation ) Development of big data Internet + action plan Cloud computing service development Internet of Things (including wireless Internet) Artificial Intelligence Made in China 2015 (advanced manufacturing) Internet +
16 Directions Data science disciplines Key technology development Big data platforms Key applications Data resource development Data sharing and open data Human resource training for big data
17 Internet + Manufacturing AI Manufacturing procurement Design Customer Service Intelligent warehouse retail Transportation
18 Technological Challenges Storage cloud storage Communication 4G, 5G Processing cleansing, integration Analysis capability, efficiency Mining methods, tools, platforms Energy consumption
19 Application Challenges Lack of clear business requirements Lack of successful pilots Data availability and data sharing Data security and privacy ROI on big data applications Infrastructure Skills and human resources
20 Opportunities: Big Data Industry Chain Telecom Retail Finance Manufacturing Internet Smart Grid E-commerce Logistics Smart City
21 Agenda 1. Recent Development of Big Data in China 2. Key Initiatives, Challenges and Opportunities 3. Research and Applications at Big Data Institute, Shenzhen University
22 Shenzhen Shenzhen
23 China s first Special Economic Zone (SEZ) Neighboring to Hong Kong Area: 2050 km 2 A major city in South China Population (2014): 11 million Shenzhen University The fourth largest city in GDP in China, GDP per capita in USD: 25,038 GDP Growth (2015): 8.9% Xichong Beach Shenzhen Bay Bridge Night View of Shennan Road East
24 A public university established in The fastest growing university intop 100 Universitiesin China. 26 schools (colleges) 57 undergraduate programs, 70 master's programs 3 doctorate programs. Shenzhen University 34,000 full-time students 27,000 undergraduates, 6,000 postgraduates 1,500 international students. Lake Wenshan South pavilion of the school library
25 Big Data Institute, Shenzhen University Established in research staff 30 students Computer Science Building Three organizations International PhD students Institute Corridor
26 Faculty Members
27 Data Center
28 Internet + Manufacturing accumulates big data AI Manufacturing procurement Design Customer Service Intelligent warehouse retail Transportation
29 Research Problems 1 2 n-4 n-3 n-2 n-1 n f1 f2 f3 f4 f5 Thousands of features Curse of dimensionality 1. Mixed data 2. Noise/missing value 3. Correlation 4. Unbalance 5. Subspace property 6. Uninformative Millions of records Challenge of Big Data Matrix
30 Big Data Analytics Big data refers to the use of predictive analytics, user behavior analytics, or certain other advanced data analytics methods that extract value from data.
31 MapReduce Programming(Divide-and- Conquer) Programming (Map) Master node (Reduce) file file file file file node node node node node output File 文件划 partition
32 MapReduce Iteration K-means Pipeline implementation M R M R M R M R M R M R M R M R M R M R M R M R Input Data????? Map process Assign objects to clusters Reduce process Recompute cluster centers C o n v e r g e? output
33 MapReduce limitation Decision Tree It is difficult to implement recursive algorithm like decision trees in MapReduce
34 Spark RDD Computing Model RDD is a matrix.
35 RDD Divide-and-Conquer
36 Asymptotic Ensemble Learning Framework
37 Randomization of Data Blocks Before randomization After randomization
38 Asymptotic Ensemble Learning Results Learning result from none randomized data blocks Learning result from none randomized data blocks
39 Advantage of Asymptotic Ensemble Learning Sampling without replacement Sampling data blocks instead records increases sampling efficiency Learning partial data(10-20%) to approach the result learnt from the whole data. Significantly reduce computation load Scalability,learning TB or PB data
40 Integrated Big Data Analysis Platform
41 Key Technologies Workflow Engine Cloud Computing Engine Algorithm Library Big Data Analytics Open API Cloud Storage
42 Distributed Machine Learning Algorithm Libraries MapReduce Clustering Classification Regression Association K-Means K-Modes W-K-Means EWKM Decision Tree Random Forests LDA Logistic Regression Random Forest Regression FP-Growth Spark 1. Machine Learning Mllib 2. Graph Analysis GraphX 3. Data streams Dstream 4. QuerySpark SQL
43 Analytical Workflow
44 Manufacturing Big Data Application --Product batch quality problem monitoring system Visualization Impala 数据分析引擎 Applications Vis 数据可视化引擎 xxx xx 引擎 Application Layer Data analysis R 数据挖掘 Hive 数据仓库 Analytics Storm 实时流计算 Spark 数据流处理 Data Warehouse Data cleansing and integration Central DB Local quality data Sqoop 数据迁移 ETL Flume 数据收集工具 Cluster Environment Kettle ETL 工具 HDFS Map/Reduce Runtime System Supl 1 Supl 2 Supl n Fac 1 Fac 2 Fac n Platform Layer Data Layer
45 大数据分析一体化平台 - 应用展示
46 Manufacturing Big Data Application --Product batch quality problem monitoring system 10 Year Product quality monitoring period 50M+ No. of products monitored 2015 Huawei President award Factories 1PB+ Data 80%+ Report Accuracy 100+ Development Team 50+ Products 0% Missing Rate
47 Thank You!!! Questions?
China AI and Big Data Talent Assessment
China AI and Big Data Talent Assessment 2018 AGENDA 01 Demand Analysis KEY HIGHLIGHTS 02 Location Deep Dive Analysis Location Characteristics: Key hotspots, cost analysis and top employers Workloads across
More informationDeloitte School of Analytics. Demystifying Data Science: Leveraging this phenomenon to drive your organisation forward
Deloitte School of Analytics Demystifying Data Science: Leveraging this phenomenon to drive your organisation forward February 2018 Agenda 7 February 2018 8 February 2018 9 February 2018 8:00 9:00 Networking
More informationPost Graduate Program in BIG DATA ENGINEERING. In association with 11 MONTHS ONLINE
Post Graduate Program in BIG DATA ENGINEERING In association with 11 MONTHS ONLINE Contents 1. 2. 3. 4. 5. 6. Why Big Data Program Outline Learning Experience Program Objective Program Curriculum Admissions
More informationTransforming Analytics with Cloudera Data Science WorkBench
Transforming Analytics with Cloudera Data Science WorkBench Process data, develop and serve predictive models. 1 Age of Machine Learning Data volume NO Machine Learning Machine Learning 1950s 1960s 1970s
More informationBIG DATA PROCESSING A DEEP DIVE IN HADOOP/SPARK & AZURE SQL DW
BIG DATA PROCESSING A DEEP DIVE IN HADOOP/SPARK & AZURE SQL DW TOPICS COVERED 1 2 Fundamentals of Big Data Platforms Major Big Data Tools Scaling Up vs. Out SCALE UP (SMP) SCALE OUT (MPP) + (n) Upgrade
More informationABOUT THIS TRAINING: This Hadoop training will also prepare you for the Big Data Certification of Cloudera- CCP and CCA.
ABOUT THIS TRAINING: The world of Hadoop and Big Data" can be intimidating - hundreds of different technologies with cryptic names form the Hadoop ecosystem. This comprehensive training has been designed
More information5th Annual. Cloudera, Inc. All rights reserved.
5th Annual 1 The Essentials of Apache Hadoop The What, Why and How to Meet Agency Objectives Sarah Sproehnle, Vice President, Customer Success 2 Introduction 3 What is Apache Hadoop? Hadoop is a software
More informationCourse Content. The main purpose of the course is to give students the ability plan and implement big data workflows on HDInsight.
Course Content Course Description: The main purpose of the course is to give students the ability plan and implement big data workflows on HDInsight. At Course Completion: After competing this course,
More information20775A: Performing Data Engineering on Microsoft HD Insight
20775A: Performing Data Engineering on Microsoft HD Insight Duration: 5 days; Instructor-led Implement Spark Streaming Using the DStream API. Develop Big Data Real-Time Processing Solutions with Apache
More information20775A: Performing Data Engineering on Microsoft HD Insight
20775A: Performing Data Engineering on Microsoft HD Insight Course Details Course Code: Duration: Notes: 20775A 5 days This course syllabus should be used to determine whether the course is appropriate
More information20775 Performing Data Engineering on Microsoft HD Insight
Duración del curso: 5 Días Acerca de este curso The main purpose of the course is to give students the ability plan and implement big data workflows on HD. Perfil de público The primary audience for this
More informationPowered by Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS
Powered by Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS www.upxacademy.com 1800-123-1260 About us UpX Academy is an ed-tech platform providing advanced professional training in Big Data Analytics
More informationPowered by. Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS
Powered by Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS www.upxacademy.com 1800-123-1260 About us UpX Academy is an ed-tech platform providing advanced professional training in Big Data Analytics
More information20775: Performing Data Engineering on Microsoft HD Insight
Let s Reach For Excellence! TAN DUC INFORMATION TECHNOLOGY SCHOOL JSC Address: 103 Pasteur, Dist.1, HCMC Tel: 08 38245819; 38239761 Email: traincert@tdt-tanduc.com Website: www.tdt-tanduc.com; www.tanducits.com
More informationBIG DATA AND HADOOP DEVELOPER
BIG DATA AND HADOOP DEVELOPER Approximate Duration - 60 Hrs Classes + 30 hrs Lab work + 20 hrs Assessment = 110 Hrs + 50 hrs Project Total duration of course = 160 hrs Lesson 00 - Course Introduction 0.1
More informationIntroduction to Big Data(Hadoop) Eco-System The Modern Data Platform for Innovation and Business Transformation
Introduction to Big Data(Hadoop) Eco-System The Modern Data Platform for Innovation and Business Transformation Roger Ding Cloudera February 3rd, 2018 1 Agenda Hadoop History Introduction to Apache Hadoop
More informationPowered by. Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS
Powered by Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS www.upxacademy.com 1800-123-1260 About us UpX Academy is an ed-tech platform providing advanced professional training in Big Data Analytics
More informationIntroduction to Research at Noah s Ark Lab. Noah s Ark Lab Huawei Technologies Co. Ltd.
Introduction to Research at Noah s Ark Lab Noah s Ark Lab Huawei Technologies Co. Ltd. Noah s Ark Lab Research Areas Machine Learning Data Mining Speech and Language Processing Information and Knowledge
More informationApache Spark 2.0 GA. The General Engine for Modern Analytic Use Cases. Cloudera, Inc. All rights reserved.
Apache Spark 2.0 GA The General Engine for Modern Analytic Use Cases 1 Apache Spark Drives Business Innovation Apache Spark is driving new business value that is being harnessed by technology forward organizations.
More informationAI Use cases and Requirements for telecom network. China Mobile
AI Use cases and Requirements for telecom network China Mobile 2018.04 2 Agenda Motivation to introduce AI Use cases in telecom network Requirements Why do telecom operators need AI? Ovum observation:
More informationMATLAB 汽车大数据分析平台的构建及应用
MATLAB 汽车大数据分析平台的构建及应用 卓金武 MathWorks 中国 steven.zhuo@mathworks.cn 2015 The MathWorks, Inc. 1 牛人如何看汽车大数据分析? Today's cars produce upwards of 25GB of information per hour information is helping us understand
More informationIBM Analytics Unleash the power of data with Apache Spark
IBM Analytics Unleash the power of data with Apache Spark Agility, speed and simplicity define the analytics operating system of the future 1 2 3 4 Use Spark to create value from data-driven insights Lower
More informationPreface About the Book
Preface About the Book We are living in the dawn of what has been termed as the "Fourth Industrial Revolution" by the World Economic Forum (WEF) in 2016. The Fourth Industrial Revolution is marked through
More informationMachine Learning and Analytics. Machine Learning. Data Lake Analytics. HDInsight (Hadoop, Spark, Storm, HBase Managed Clusters) Stream Analytics
微软云上数据平台概括 Data Sources Information Management Big Data Stores Machine Learning and Analytics Intelligence People Data Factory Data Lake Store Machine Learning Cognitive Services Data Catalog SQL Data
More informationBIG WITH BIG DATA ANALYTICS
Powered by Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS www.upxacademy.com 1800-123-1260 About us UpX Academy is an ed-tech platform providing advanced professional training in Big Data Analytics
More informationPowered by Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS
Powered by Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS www.upxacademy.com 1800-123-1260 About us UpX Academy is an ed-tech platform providing advanced professional training in Big Data Analytics
More informationMR TIGER KIU. Leading New ICT, Building A Better Connected World
MR TIGER KIU Leading New ICT, Building A Better Connected World Leading New ICT, Building A Better Connected World Huawei: A Global Leader of ICT Solutions 129 Ranking in Fortune Global 500 (2016 ) 80,000
More informationBIG WITH BIG DATA ANALYTICS
Powered by Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS www.upxacademy.com 1800-123-1260 About us UpX Academy is an ed-tech platform providing advanced professional training in Big Data Analytics
More informationOptimal Infrastructure for Big Data
Optimal Infrastructure for Big Data Big Data 2014 Managing Government Information Kevin Leong January 22, 2014 2014 VMware Inc. All rights reserved. The Right Big Data Tools for the Right Job Real-time
More informationAzure ML Data Camp. Ivan Kosyakov MTC Architect, Ph.D. Microsoft Technology Centers Microsoft Technology Centers. Experience the Microsoft Cloud
Microsoft Technology Centers Microsoft Technology Centers Experience the Microsoft Cloud Experience the Microsoft Cloud ML Data Camp Ivan Kosyakov MTC Architect, Ph.D. Top Manager IT Analyst Big Data Strategic
More informationAzure Data Analytics & Machine Learning Seminar. Daire Cunningham: BI Practice Area Manager
Azure Data Analytics & Machine Learning Seminar Daire Cunningham: BI Practice Area Manager AGENDA 09:00 AM 09:30 AM Registration & Refreshments 09.30AM 10:00 AM 10:00 AM 10:30 AM Welcome & Keynote, Ger
More informationCommon Customer Use Cases in FSI
Common Customer Use Cases in FSI 1 Marketing Optimization 2014 2014 MapR MapR Technologies Technologies 2 Fortune 100 Financial Services Company 104M CARD MEMBERS 3 Financial Services: Recommendation Engine
More informationBIG WITH BIG DATA ANALYTICS
Powered by Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS www.upxacademy.com 1800-123-1260 About us UpX Academy is an ed-tech platform providing advanced professional training in Big Data Analytics
More informationMapR: Solution for Customer Production Success
2015 MapR Technologies 2015 MapR Technologies 1 MapR: Solution for Customer Production Success Big Data High Growth 700+ Customers Cloud Leaders Riding the Wave with Hadoop The Big Data Platform of Choice
More informationSAP Predictive Analytics Suite
SAP Predictive Analytics Suite Tania Pérez Asensio Where is the Evolution of Business Analytics Heading? Organizations Are Maturing Their Approaches to Solving Business Problems Reactive Wait until a problem
More informationARCHITECTURES ADVANCED ANALYTICS & IOT. Presented by: Orion Gebremedhin. Marc Lobree. Director of Technology, Data & Analytics
ADVANCED ANALYTICS & IOT ARCHITECTURES Presented by: Orion Gebremedhin Director of Technology, Data & Analytics Marc Lobree National Architect, Advanced Analytics EDW THE RIGHT TOOL FOR THE RIGHT WORKLOAD
More informationE-guide Hadoop Big Data Platforms Buyer s Guide part 1
Hadoop Big Data Platforms Buyer s Guide part 1 Your expert guide to Hadoop big data platforms for managing big data David Loshin, Knowledge Integrity Inc. Companies of all sizes can use Hadoop, as vendors
More informationConstruction of Regional Logistics Information Platform Based on Cloud Computing
International Conference on Computational Science and Engineering (ICCSE 2015) Construction of Regional Logistics Information Platform Based on Cloud Computing Gang SUN 1,2,a,*, Xiu-You WANG 1,b, Hao WANG
More informationBig Data Application Engineer/ Developer. Specialization in Apache Spark, Kafka, Airflow, HBase
BIG DATA COURSE Big Data Application Engineer/ Developer Specialization in Apache Spark, Kafka, Airflow, HBase In Exclusive Association with 21,347+ Participants 10,000+ Brands 1200+ Trainings 45+ Countries
More informationTutorial: Big Data Algorithms and Applications Under Hadoop KUNPENG ZHANG SIDDHARTHA BHATTACHARYYA
Tutorial: Big Data Algorithms and Applications Under Hadoop KUNPENG ZHANG SIDDHARTHA BHATTACHARYYA http://kzhang6.people.uic.edu/tutorial/amcis2014.html August 7, 2014 Schedule I. Introduction to big data
More informationManaging explosion of data. Cloudera, Inc. All rights reserved.
Managing explosion of data 1 Customer experience expectations are converging on the brand, not channel Consistent across all channels and lines of business Contextualized to present location and circumstances
More informationRotating to the New. How can Manufacturing Companies in China Thrive in the Digital Age. March 2018
Rotating to the New How can Manufacturing Companies in China Thrive in the Digital Age March 2018 The imperative for growth in Chinese Manufacturing Digital as a driver of high performance Going digital
More informationIntermodal Freight Transportation in China.
Intermodal Freight Transportation in China www.sf-express.com What is Intermodal Freight Transportation? 2 Photo: Akira Kodaka What is Intermodal Freight Transportation? Intermodal Freight Transportation
More informationAccelerating Your Big Data Analytics. Jeff Healey, Director Product Marketing, HPE Vertica
Accelerating Your Big Data Analytics Jeff Healey, Director Product Marketing, HPE Vertica Recent Waves of Disruption IT Infrastructu re for Analytics Data Warehouse Modernization Big Data/ Hadoop Cloud
More information1% + 99% = AI Popularization
1% + 99% = AI Popularization Unifying Data Science and Engineering Jason Bissell General Manager, APAC The beginnings of Apache Spark at UC Berkeley AMPLab funded by tech companies: Got a glimpse at their
More informationBIG DATA and DATA SCIENCE
Integrated Program In BIG DATA and DATA SCIENCE CONTINUING STUDIES Table of Contents About the Course...03 Key Features of Integrated Program in Big Data and Data Science...04 Learning Path...05 Key Learning
More informationContext. The NEW data services from UST Global UST GLOBAL - A UNIQUE PARTNER. UST Global Data Services March 2018!1
UST Global Data Services March 2018!1 UST GLOBAL - A UNIQUE PARTNER Context Our Fortune 500 customers have immense amounts of transactional as well as interaction data distributed across a number of business
More information2016 China s Internet Consumption Finance Market Research Report.
2016 China s Internet Consumption Finance Market Research Report www.iresearchchina.com Consumption Contributes to Macroeconomic Restructuring China s National Economy Steadily Grows. Total Retail Sales
More informationArchitecture Overview for Data Analytics Deployments
Architecture Overview for Data Analytics Deployments Mahmoud Ghanem Sr. Systems Engineer GLOBAL SPONSORS Agenda The Big Picture Top Use Cases for Data Analytics Modern Architecture Concepts for Data Analytics
More informationBig Data & Artificial Intelligence ----How to Achieve Accurate Sales
Big Data & Artificial Intelligence ----How to Achieve Accurate Sales Prof. Guangxia Xu Chongqing University of Posts and Telecommunications, Chongqing, China xugx@cqupt.edu.cn 1/30 Outline 1. Background
More informationModernizing Your Data Warehouse with Azure
Modernizing Your Data Warehouse with Azure Big data. Small data. All data. Christian Coté S P O N S O R S The traditional BI Environment The traditional data warehouse data warehousing has reached the
More informationMicrosoft Azure Essentials
Microsoft Azure Essentials Azure Essentials Track Summary Data Analytics Explore the Data Analytics services in Azure to help you analyze both structured and unstructured data. Azure can help with large,
More informationInsights to HDInsight
Insights to HDInsight Why Hadoop in the Cloud? No hardware costs Unlimited Scale Pay for What You Need Deployed in minutes Azure HDInsight Big Data made easy Enterprise Ready Easier and more productive
More informationHadoop Course Content
Hadoop Course Content Hadoop Course Content Hadoop Overview, Architecture Considerations, Infrastructure, Platforms and Automation Use case walkthrough ETL Log Analytics Real Time Analytics Hbase for Developers
More informationVICE PRESIDENT, ARCHITECTURE GENERAL MANAGER, AI PRODUCTS GROUP - INTEL
VICE PRESIDENT, ARCHITECTURE GENERAL MANAGER, AI PRODUCTS GROUP - INTEL Artificial intelligence Machine learning Deep learning Types of analytics/ml (PARTIAL LIST) C l a s s i f i c a t i o n R e g r
More informationSimplifying the Process of Uploading and Extracting Data from Apache Hadoop
Simplifying the Process of Uploading and Extracting Data from Apache Hadoop Rohit Bakhshi, Solution Architect, Hortonworks Jim Walker, Director Product Marketing, Talend Page 1 About Us Rohit Bakhshi Solution
More informationSpark and Hadoop Perfect Together
Spark and Hadoop Perfect Together Arun Murthy Hortonworks Co-Founder @acmurthy Data Operating System Enable all data and applications TO BE accessible and shared BY any end-users Data Operating System
More informationBringing the Power of SAS to Hadoop Title
WHITE PAPER Bringing the Power of SAS to Hadoop Title Combine SAS World-Class Analytics With Hadoop s Low-Cost, Distributed Data Storage to Uncover Hidden Opportunities ii Contents Introduction... 1 What
More informationADVANCED ANALYTICS & IOT ARCHITECTURES
ADVANCED ANALYTICS & IOT ARCHITECTURES Presented by: Orion Gebremedhin Director of Technology, Data & Analytics Marc Lobree National Architect, Advanced Analytics EDW THE RIGHT TOOL FOR THE RIGHT WORKLOAD
More informationData Analytics for Semiconductor Manufacturing The MathWorks, Inc. 1
Data Analytics for Semiconductor Manufacturing 2016 The MathWorks, Inc. 1 Competitive Advantage What do we mean by Data Analytics? Analytics uses data to drive decision making, rather than gut feel or
More informationBig Data Foundation. 2 Days Classroom Training PHILIPPINES :: MALAYSIA :: VIETNAM :: SINGAPORE :: INDIA
Big Data Foundation 2 Days Classroom Training PHILIPPINES :: MALAYSIA :: VIETNAM :: SINGAPORE :: INDIA Content Big Data Foundation Course Introduction Who we are Course Overview Career Path Course Content
More informationIntro to Big Data and Hadoop
Intro to Big and Hadoop Portions copyright 2001 SAS Institute Inc., Cary, NC, USA. All Rights Reserved. Reproduced with permission of SAS Institute Inc., Cary, NC, USA. SAS Institute Inc. makes no warranties
More informationBig Data Introduction
Big Data Introduction Who we are Experts At Your Service Over 50 specialists in IT infrastructure Certified, experienced, passionate Based In Switzerland 100% self-financed Swiss company Over CHF8 mio.
More informationDATA SCIENCE: HYPE AND REALITY PATRICK HALL
DATA SCIENCE: HYPE AND REALITY PATRICK HALL About me SAS Enterprise Miner, 2012 Cloudera Data Scientist, 2014 Do you use Kolmogorov Smirnov often? Statistician No, I mix my martinis with gin. Data Scientist
More informationOfficial Recruitment Partner of Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS. Powered by.
Official Recruitment Partner of Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS Powered by www.upxacademy.com About us UpX Academy is an ed-tech platform providing advanced professional training in Big
More informationHadoop and Analytics at CERN IT CERN IT-DB
Hadoop and Analytics at CERN IT CERN IT-DB 1 Hadoop Use cases Parallel processing of large amounts of data Perform analytics on a large scale Dealing with complex data: structured, semi-structured, unstructured
More informationNew Big Data Solutions and Opportunities for DB Workloads
New Big Data Solutions and Opportunities for DB Workloads Hadoop and Spark Ecosystem for Data Analytics, Experience and Outlook Luca Canali, IT-DB Hadoop and Spark Service WLCG, GDB meeting CERN, September
More informationBig Data in Urban Power Distribution and Consumption Systems. Dr. Dongxia ZHANG 2016 IERE CLP-RI Hong Kong Workshop November 2016
Big Data in Urban Power Distribution and Consumption Systems Dr. Dongxia ZHANG 2016 IERE CLP-RI Hong Kong Workshop 21-24 November 2016 Agenda Ⅰ Ⅱ Data Perspective on Utility Big Overview of CEPRI s Research
More informationLeveraging smart meter data for electric utilities:
Leveraging smart meter data for electric utilities: Comparison of Spark SQL with Hive 5/16/2017 Hitachi, Ltd. OSS Solution Center Yusuke Furuyama Shogo Kinoshita Who are we? Yusuke Furuyama Solutions engineer
More informationHow to build and deploy machine learning projects
How to build and deploy machine learning projects Litan Ilany, Advanced Analytics litan.ilany@intel.com Agenda Introduction Machine Learning: Exploration vs Solution CRISP-DM Flow considerations Other
More informationLeveraging smart meter data for electric utilities:
Leveraging smart meter data for electric utilities: Comparison of Spark SQL with Hive 5/16/2017 Hitachi, Ltd. OSS Solution Center Yusuke Furuyama Shogo Kinoshita Who are we? Yusuke Furuyama Solutions engineer
More informationCloudera Data Science and Machine Learning. Robin Harrison, Account Executive David Kemp, Systems Engineer. Cloudera, Inc. All rights reserved.
Cloudera Data Science and Machine Learning Robin Harrison, Account Executive David Kemp, Systems Engineer 1 This is the age of machine learning. Data volume NO Machine Learning Machine Learning 1950s 1960s
More informationIBM SPSS & Apache Spark
IBM SPSS & Apache Spark Making Big Data analytics easier and more accessible ramiro.rego@es.ibm.com @foreswearer 1 2016 IBM Corporation Modeler y Spark. Integration Infrastructure overview Spark, Hadoop
More informationZHANG Xin. National Center for Climate Change Strategy and International Cooperation
ZHANG Xin National Center for Climate Change Strategy and International Cooperation 2017.9 1 2 3 4 Important requirement and work on Climate Change in China National Greenhous Gas Inventory and methodology
More informationBuilding Enterprise OLAP on Hadoop for Financial Services Industry
Building Enterprise OLAP on Hadoop for Financial Services Industry Luke Han luke@kyligence.io @lukehq Co-founder & CEO of Kyligence Creator & VP of Apache Kylin Microsoft Regional Director & MVP About
More informationData Science, realizing the Hype Cycle. Luigi Di Rito, Director Data Science Team, SAP Center of Excellence
Data Science, realizing the Hype Cycle. Luigi Di Rito, Director Data Science Team, SAP Center of Excellence Data Science, Machine Learning and Artificial Intelligence Deep Learning AREAS OF AI Rule-based
More informationOperational Hadoop and the Lambda Architecture for Streaming Data
Operational Hadoop and the Lambda Architecture for Streaming Data 2015 MapR Technologies 2015 MapR Technologies 1 Topics From Batch to Operational Workloads on Hadoop Streaming Data Environments The Lambda
More informationSAP Machine Learning for Hadoop. Customer
SAP Machine Learning for Hadoop Customer SAP BusinessObjects Predictive Analytics and Big Data 1. Support for end-to-end operational predictive lifecycle on Hadoop 2. Business Analyst Friendly No coding
More informationResearch on the Framework and Data Fusion of an Energy Big-data Platform
1 Paper Number: 17PESGM2652 Panel: Big data for Integrated Energy Systems Research on the Framework and Data Fusion of an Energy Big-data Platform Gengfeng Li, Zhaohong Bie, Jiang Wu, Cheng Li gengfengli@xjtu.edu.cn
More informationAI Solutions and Use Cases Up Close Dolly Wu, Vice President/GM Alfie Lew, Solution Architect
AI Summit 2017 AI Solutions and Use Cases Up Close Dolly Wu, Vice President/GM Alfie Lew, Solution Architect Inspur Quick Overview 4 Business Groups $10.2 Billion FY16 Revenue TOP 4 Server Manufacturer
More informationActive Analytics Overview
Active Analytics Overview The Fourth Industrial Revolution is predicated on data. Success depends on recognizing data as the most valuable corporate asset. From smart cities to autonomous vehicles, logistics
More informationDigital Transformation 2.0
Digital Transformation 2.0 Job roles and skills that every IT Services company must know We have been hearing for quite some time, that the world is going through digital transformation & HR department
More informationThis document (including, without limitation, any product roadmap or statement of direction data) illustrates the planned testing, release and
Shawn Rogers Orchestrating and Managing Enterprise Analytics DISCLAIMER During the course of this presentation, TIBCO or its representatives may make forward-looking statements regarding future events,
More informationThe Internet of Everything and the Research on Big Data. Angelo E. M. Ciarlini Research Head, Brazil R&D Center
The Internet of Everything and the Research on Big Data Angelo E. M. Ciarlini Research Head, Brazil R&D Center A New Industrial Revolution Sensors everywhere: 50 billion connected devices by 2020 Industrial
More informationOfficial Recruitment Partner of Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS. Powered by.
Official Recruitment Partner of Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS Powered by www.upxacademy.com About us UpX Academy is an ed-tech platform providing advanced professional training in Big
More informationJun Pei. 09/ /2009 Bachelor in Management Science and Engineering
Jun Pei School of Management, Hefei University of Technology Tunxi Road 193, Hefei City, Anhui province, China, 230009 Email: peijun@hfut.edu.cn; feiyijun198612@126.com; feiyijun.ufl@gmail.com Phone: (86)
More informationData Analytics. Nagesh Madhwal Client Solutions Director, Consulting, Southeast Asia, Dell EMC
Data Analytics Nagesh Madhwal Client Solutions Director, Consulting, Southeast Asia, Dell EMC Last 15 years IT-centric Traditional Analytics Traditional Applications Rigid Infrastructure Internet Next
More informationFrontiers and Trends SHANGHAI: SHENZHEN: BEIJING: 30 th March Shanghai Tower. 8 th April St. Regis. 12 th April Beijing Marriott Hotel City Wall
Frontiers and Trends SHANGHAI: 30 th March Shanghai Tower SHENZHEN: 8 th April St. Regis BEIJING: 12 th April Beijing Marriott Hotel City Wall PERCEPTION AND IMAGINE ONCE AGAIN DEEPLY CONSIDERING THE FUTURE
More informationKnowledgeENTERPRISE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK. Advanced Analytics on Spark BROCHURE
FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK Are you drowning in Big Data? Do you lack access to your data? Are you having a hard time managing Big Data processing requirements?
More informationMapR Pentaho Business Solutions
MapR Pentaho Business Solutions The Benefits of a Converged Platform to Big Data Integration Tom Scurlock Director, WW Alliances and Partners, MapR Key Takeaways 1. We focus on business values and business
More informationTowards a Big Data-as-a-Service for Legislative Research in the National Assembly Library of Korea. Data Convergence Analysis Division
Towards a Big Data-as-a-Service for Legislative Research in the National Assembly Library of Korea Data Convergence Analysis 1. 2. 1 Amend the NAL Act and the Decree of NAL Organization Establish New
More informationDesign Your Strategy for Digital Transformation with SAP S/4HANA. Allen Li, SAP Greater China July 25, 2016
Design Your Strategy for Digital Transformation with SAP S/4HANA Allen Li, SAP Greater China July 25, 2016 Why Digital transformation? Adapt or die Hyperconnectivity Mobile In-memory computing Internet
More informationBig data is hard. Top 3 Challenges To Adopting Big Data
Big data is hard Top 3 Challenges To Adopting Big Data Traditionally, analytics have been over pre-defined structures Data characteristics: Sales Questions answered with BI and visualizations: Customer
More informationOfficial Recruitment Partner of Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS. Powered by.
Official Recruitment Partner of Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS Powered by www.upxacademy.com About us UpX Academy is an ed-tech platform providing advanced professional training in Big
More informationOfficial Recruitment Partner of Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS. Powered by.
Official Recruitment Partner of Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS Powered by www.upxacademy.com About us UpX Academy is an ed-tech platform providing advanced professional training in Big
More informationPotential for Savings in China s Government Energy Efficiency Procurement Program: Preliminary Findings
Potential for Savings in China s Government Energy Efficiency Procurement Program: Preliminary Findings David Fridley Lawrence Berkeley National Laboratory Berkeley, CA August 2005 1. Background On December
More informationKnowledge Discovery and Data Mining
Knowledge Discovery and Data Mining Unit # 19 1 Acknowledgement The following discussion is based on the paper Mining Big Data: Current Status, and Forecast to the Future by Fan and Bifet and online presentation
More informationThe Global Market for Intelligent Video Analytics
The Global Market for Intelligent Video Analytics 2018 to 2023 Published: Q3 2018 Global Market for Intell Video Analytics 2018 to 2023 Synopsis This report aims to assist all stakeholders and investors
More informationOfficial Recruitment Partner of Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS. Powered by.
Official Recruitment Partner of Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS Powered by www.upxacademy.com About us UpX Academy is an ed-tech platform providing advanced professional training in Big
More informationRedefine Big Data: EMC Data Lake in Action. Andrea Prosperi Systems Engineer
Redefine Big Data: EMC Data Lake in Action Andrea Prosperi Systems Engineer 1 Agenda Data Analytics Today Big data Hadoop & HDFS Different types of analytics Data lakes EMC Solutions for Data Lakes 2 The
More information