Big Data Initiatives in China: Opportunities and Challenges

Size: px
Start display at page:

Download "Big Data Initiatives in China: Opportunities and Challenges"

Transcription

1 Big Data Initiatives in China: Opportunities and Challenges Joshua Zhexue Huang Distinguished Professor Director of Big Data Institute College of Computer Science and Software Engineering Shenzhen University

2 Agenda 1. Recent Development of Big Data in China 2. Key Initiatives, Challenges and Opportunities 3. Research and Applications at Big Data Institute, Shenzhen University

3 What is Big Data? Big data is a term for data sets that are so large or complex that traditional data processing applications are inadequate to deal with them (Wikipedia). Big data often refers to the use of predictive analytics, user behavior analytics, or certain other advanced data analytics methods that extract value from data, and seldom to a particular size of data set.

4 Big Data Term and Popularity Big Data term was coined in 1998 by John R. Mashey, Chief Scientist of SGI The term then referred to data size in Gigabytes which will cause stress on infrastructure. On MARCH 29, 2012, Obama Administration announced Big Data Research and Development Initiative and $200 million to invest on big data, which made Big Data popular.

5 Recent Development of Big Data in China - China NSF funded key projects (2010) Massive data mining on cloud computing (2013) Big data oriented machine learning theory and methods (2014) Challenging research problems in big data technology and applications(8 projects) (2015) Five projects on big data (2016) More projects funded in information science and management areas

6 Recent Development of Big Data in China In August of 2012, Chinese Academy of Sciences started a strategic pilot project (1.3 billion in 5 years) Sensing China oriented next generation information Technologies A subproject on big data Research and development of key technologies for sea and cloud data systems 中国科学院图册 V 百科

7 Recent Development of Big Data in China In 2016, Ministry of Science and Technology of China started a special program on Cloud computing and big data which will accomplish 12 tasks in four areas with 400 millions RMB Cloud platform and big data infrastructure Data driven new software on cloud service model Big data analytics, applications and Human like intelligence Cloud convergence of Perceptual cognition and human machine interaction

8 Recent Development of Big Data in China -Ministry of Education of China 85 universities set up a new major on data science and big data technology Some major universities set up special schools, faculties and research institutes on data science and big data Tsinghua University:Tsinghua-Qingdao Data Science Institute Peking University: Beijing University Big Data Technology Inst Fudan University: School of Data Science, Sun Yat-Sen University:School of Data and Computer Science Shenzhen University: Big Data Institute

9 Recent Development of Big Data in China Local governments set up special organizations to promote big data Beijing: Beijing Institute of Big Data Research Guangdong Province: Big Data Bureau Shanghai: Shanghai Data Exchange Center Shenzhen: Shenzhen Research Institute of Big Data, Chinese University of Hong Kong (Shenzhen)

10 Recent Development of Big Data in China -Industry Big Internet Companies are the leaders in big data development and applications. They are also big data owners. Baidu, Alibaba, Tencent (BAT) All industry sectors are interested in big data Technology companies, e.g., Huawei, ZTE Telecommunications, e.g., China Mobile, China Unicom Banks and Insurance companies Manufacturing companies E-commerce companies Logistics service companies

11 Big Data Market in China 0.1 billion compound annual growth rate

12 Big data: a national strategy A decision was made to implement a national strategy for big data At the Third Plenary Session of the 18th Central Committee of the CPC in October The 13th Five-year Plan ( ) further defined that big data is fundamental strategic resources to be developed and utilized. National big data centers and platforms will be established. Key technologies, hardware and software will be innovated and developed, including data collection, storage, cleansing, analysis, mining, visualization, security and privacy protection.

13 Implementation Measures The State Council issued the action outline to promote the development of large data in In January 2016, The National Development and Reform Commission issued a notice on organizing the implementation of major projects to promote the development of big data, supporting projects in four areas: Pilot projects on big data applications Big data sharing Big data infrastructure development Big data standards and exchange systems

14 Agenda 1. Recent Development of Big Data in China 2. Key Initiatives, Challenges and Opportunities 3. Research and Applications at Big Data Institute, Shenzhen University

15 Initiatives to Develop Innovation Driven Economy in China Encourage young people to start their own business and pursue innovation (Mass entrepreneurship and innovation ) Development of big data Internet + action plan Cloud computing service development Internet of Things (including wireless Internet) Artificial Intelligence Made in China 2015 (advanced manufacturing) Internet +

16 Directions Data science disciplines Key technology development Big data platforms Key applications Data resource development Data sharing and open data Human resource training for big data

17 Internet + Manufacturing AI Manufacturing procurement Design Customer Service Intelligent warehouse retail Transportation

18 Technological Challenges Storage cloud storage Communication 4G, 5G Processing cleansing, integration Analysis capability, efficiency Mining methods, tools, platforms Energy consumption

19 Application Challenges Lack of clear business requirements Lack of successful pilots Data availability and data sharing Data security and privacy ROI on big data applications Infrastructure Skills and human resources

20 Opportunities: Big Data Industry Chain Telecom Retail Finance Manufacturing Internet Smart Grid E-commerce Logistics Smart City

21 Agenda 1. Recent Development of Big Data in China 2. Key Initiatives, Challenges and Opportunities 3. Research and Applications at Big Data Institute, Shenzhen University

22 Shenzhen Shenzhen

23 China s first Special Economic Zone (SEZ) Neighboring to Hong Kong Area: 2050 km 2 A major city in South China Population (2014): 11 million Shenzhen University The fourth largest city in GDP in China, GDP per capita in USD: 25,038 GDP Growth (2015): 8.9% Xichong Beach Shenzhen Bay Bridge Night View of Shennan Road East

24 A public university established in The fastest growing university intop 100 Universitiesin China. 26 schools (colleges) 57 undergraduate programs, 70 master's programs 3 doctorate programs. Shenzhen University 34,000 full-time students 27,000 undergraduates, 6,000 postgraduates 1,500 international students. Lake Wenshan South pavilion of the school library

25 Big Data Institute, Shenzhen University Established in research staff 30 students Computer Science Building Three organizations International PhD students Institute Corridor

26 Faculty Members

27 Data Center

28 Internet + Manufacturing accumulates big data AI Manufacturing procurement Design Customer Service Intelligent warehouse retail Transportation

29 Research Problems 1 2 n-4 n-3 n-2 n-1 n f1 f2 f3 f4 f5 Thousands of features Curse of dimensionality 1. Mixed data 2. Noise/missing value 3. Correlation 4. Unbalance 5. Subspace property 6. Uninformative Millions of records Challenge of Big Data Matrix

30 Big Data Analytics Big data refers to the use of predictive analytics, user behavior analytics, or certain other advanced data analytics methods that extract value from data.

31 MapReduce Programming(Divide-and- Conquer) Programming (Map) Master node (Reduce) file file file file file node node node node node output File 文件划 partition

32 MapReduce Iteration K-means Pipeline implementation M R M R M R M R M R M R M R M R M R M R M R M R Input Data????? Map process Assign objects to clusters Reduce process Recompute cluster centers C o n v e r g e? output

33 MapReduce limitation Decision Tree It is difficult to implement recursive algorithm like decision trees in MapReduce

34 Spark RDD Computing Model RDD is a matrix.

35 RDD Divide-and-Conquer

36 Asymptotic Ensemble Learning Framework

37 Randomization of Data Blocks Before randomization After randomization

38 Asymptotic Ensemble Learning Results Learning result from none randomized data blocks Learning result from none randomized data blocks

39 Advantage of Asymptotic Ensemble Learning Sampling without replacement Sampling data blocks instead records increases sampling efficiency Learning partial data(10-20%) to approach the result learnt from the whole data. Significantly reduce computation load Scalability,learning TB or PB data

40 Integrated Big Data Analysis Platform

41 Key Technologies Workflow Engine Cloud Computing Engine Algorithm Library Big Data Analytics Open API Cloud Storage

42 Distributed Machine Learning Algorithm Libraries MapReduce Clustering Classification Regression Association K-Means K-Modes W-K-Means EWKM Decision Tree Random Forests LDA Logistic Regression Random Forest Regression FP-Growth Spark 1. Machine Learning Mllib 2. Graph Analysis GraphX 3. Data streams Dstream 4. QuerySpark SQL

43 Analytical Workflow

44 Manufacturing Big Data Application --Product batch quality problem monitoring system Visualization Impala 数据分析引擎 Applications Vis 数据可视化引擎 xxx xx 引擎 Application Layer Data analysis R 数据挖掘 Hive 数据仓库 Analytics Storm 实时流计算 Spark 数据流处理 Data Warehouse Data cleansing and integration Central DB Local quality data Sqoop 数据迁移 ETL Flume 数据收集工具 Cluster Environment Kettle ETL 工具 HDFS Map/Reduce Runtime System Supl 1 Supl 2 Supl n Fac 1 Fac 2 Fac n Platform Layer Data Layer

45 大数据分析一体化平台 - 应用展示

46 Manufacturing Big Data Application --Product batch quality problem monitoring system 10 Year Product quality monitoring period 50M+ No. of products monitored 2015 Huawei President award Factories 1PB+ Data 80%+ Report Accuracy 100+ Development Team 50+ Products 0% Missing Rate

47 Thank You!!! Questions?

China AI and Big Data Talent Assessment

China AI and Big Data Talent Assessment China AI and Big Data Talent Assessment 2018 AGENDA 01 Demand Analysis KEY HIGHLIGHTS 02 Location Deep Dive Analysis Location Characteristics: Key hotspots, cost analysis and top employers Workloads across

More information

Deloitte School of Analytics. Demystifying Data Science: Leveraging this phenomenon to drive your organisation forward

Deloitte School of Analytics. Demystifying Data Science: Leveraging this phenomenon to drive your organisation forward Deloitte School of Analytics Demystifying Data Science: Leveraging this phenomenon to drive your organisation forward February 2018 Agenda 7 February 2018 8 February 2018 9 February 2018 8:00 9:00 Networking

More information

Post Graduate Program in BIG DATA ENGINEERING. In association with 11 MONTHS ONLINE

Post Graduate Program in BIG DATA ENGINEERING. In association with 11 MONTHS ONLINE Post Graduate Program in BIG DATA ENGINEERING In association with 11 MONTHS ONLINE Contents 1. 2. 3. 4. 5. 6. Why Big Data Program Outline Learning Experience Program Objective Program Curriculum Admissions

More information

Transforming Analytics with Cloudera Data Science WorkBench

Transforming Analytics with Cloudera Data Science WorkBench Transforming Analytics with Cloudera Data Science WorkBench Process data, develop and serve predictive models. 1 Age of Machine Learning Data volume NO Machine Learning Machine Learning 1950s 1960s 1970s

More information

BIG DATA PROCESSING A DEEP DIVE IN HADOOP/SPARK & AZURE SQL DW

BIG DATA PROCESSING A DEEP DIVE IN HADOOP/SPARK & AZURE SQL DW BIG DATA PROCESSING A DEEP DIVE IN HADOOP/SPARK & AZURE SQL DW TOPICS COVERED 1 2 Fundamentals of Big Data Platforms Major Big Data Tools Scaling Up vs. Out SCALE UP (SMP) SCALE OUT (MPP) + (n) Upgrade

More information

ABOUT THIS TRAINING: This Hadoop training will also prepare you for the Big Data Certification of Cloudera- CCP and CCA.

ABOUT THIS TRAINING: This Hadoop training will also prepare you for the Big Data Certification of Cloudera- CCP and CCA. ABOUT THIS TRAINING: The world of Hadoop and Big Data" can be intimidating - hundreds of different technologies with cryptic names form the Hadoop ecosystem. This comprehensive training has been designed

More information

5th Annual. Cloudera, Inc. All rights reserved.

5th Annual. Cloudera, Inc. All rights reserved. 5th Annual 1 The Essentials of Apache Hadoop The What, Why and How to Meet Agency Objectives Sarah Sproehnle, Vice President, Customer Success 2 Introduction 3 What is Apache Hadoop? Hadoop is a software

More information

Course Content. The main purpose of the course is to give students the ability plan and implement big data workflows on HDInsight.

Course Content. The main purpose of the course is to give students the ability plan and implement big data workflows on HDInsight. Course Content Course Description: The main purpose of the course is to give students the ability plan and implement big data workflows on HDInsight. At Course Completion: After competing this course,

More information

20775A: Performing Data Engineering on Microsoft HD Insight

20775A: Performing Data Engineering on Microsoft HD Insight 20775A: Performing Data Engineering on Microsoft HD Insight Duration: 5 days; Instructor-led Implement Spark Streaming Using the DStream API. Develop Big Data Real-Time Processing Solutions with Apache

More information

20775A: Performing Data Engineering on Microsoft HD Insight

20775A: Performing Data Engineering on Microsoft HD Insight 20775A: Performing Data Engineering on Microsoft HD Insight Course Details Course Code: Duration: Notes: 20775A 5 days This course syllabus should be used to determine whether the course is appropriate

More information

20775 Performing Data Engineering on Microsoft HD Insight

20775 Performing Data Engineering on Microsoft HD Insight Duración del curso: 5 Días Acerca de este curso The main purpose of the course is to give students the ability plan and implement big data workflows on HD. Perfil de público The primary audience for this

More information

Powered by Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS

Powered by Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS Powered by Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS www.upxacademy.com 1800-123-1260 About us UpX Academy is an ed-tech platform providing advanced professional training in Big Data Analytics

More information

Powered by. Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS

Powered by. Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS Powered by Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS www.upxacademy.com 1800-123-1260 About us UpX Academy is an ed-tech platform providing advanced professional training in Big Data Analytics

More information

20775: Performing Data Engineering on Microsoft HD Insight

20775: Performing Data Engineering on Microsoft HD Insight Let s Reach For Excellence! TAN DUC INFORMATION TECHNOLOGY SCHOOL JSC Address: 103 Pasteur, Dist.1, HCMC Tel: 08 38245819; 38239761 Email: traincert@tdt-tanduc.com Website: www.tdt-tanduc.com; www.tanducits.com

More information

BIG DATA AND HADOOP DEVELOPER

BIG DATA AND HADOOP DEVELOPER BIG DATA AND HADOOP DEVELOPER Approximate Duration - 60 Hrs Classes + 30 hrs Lab work + 20 hrs Assessment = 110 Hrs + 50 hrs Project Total duration of course = 160 hrs Lesson 00 - Course Introduction 0.1

More information

Introduction to Big Data(Hadoop) Eco-System The Modern Data Platform for Innovation and Business Transformation

Introduction to Big Data(Hadoop) Eco-System The Modern Data Platform for Innovation and Business Transformation Introduction to Big Data(Hadoop) Eco-System The Modern Data Platform for Innovation and Business Transformation Roger Ding Cloudera February 3rd, 2018 1 Agenda Hadoop History Introduction to Apache Hadoop

More information

Powered by. Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS

Powered by. Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS Powered by Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS www.upxacademy.com 1800-123-1260 About us UpX Academy is an ed-tech platform providing advanced professional training in Big Data Analytics

More information

Introduction to Research at Noah s Ark Lab. Noah s Ark Lab Huawei Technologies Co. Ltd.

Introduction to Research at Noah s Ark Lab. Noah s Ark Lab Huawei Technologies Co. Ltd. Introduction to Research at Noah s Ark Lab Noah s Ark Lab Huawei Technologies Co. Ltd. Noah s Ark Lab Research Areas Machine Learning Data Mining Speech and Language Processing Information and Knowledge

More information

Apache Spark 2.0 GA. The General Engine for Modern Analytic Use Cases. Cloudera, Inc. All rights reserved.

Apache Spark 2.0 GA. The General Engine for Modern Analytic Use Cases. Cloudera, Inc. All rights reserved. Apache Spark 2.0 GA The General Engine for Modern Analytic Use Cases 1 Apache Spark Drives Business Innovation Apache Spark is driving new business value that is being harnessed by technology forward organizations.

More information

AI Use cases and Requirements for telecom network. China Mobile

AI Use cases and Requirements for telecom network. China Mobile AI Use cases and Requirements for telecom network China Mobile 2018.04 2 Agenda Motivation to introduce AI Use cases in telecom network Requirements Why do telecom operators need AI? Ovum observation:

More information

MATLAB 汽车大数据分析平台的构建及应用

MATLAB 汽车大数据分析平台的构建及应用 MATLAB 汽车大数据分析平台的构建及应用 卓金武 MathWorks 中国 steven.zhuo@mathworks.cn 2015 The MathWorks, Inc. 1 牛人如何看汽车大数据分析? Today's cars produce upwards of 25GB of information per hour information is helping us understand

More information

IBM Analytics Unleash the power of data with Apache Spark

IBM Analytics Unleash the power of data with Apache Spark IBM Analytics Unleash the power of data with Apache Spark Agility, speed and simplicity define the analytics operating system of the future 1 2 3 4 Use Spark to create value from data-driven insights Lower

More information

Preface About the Book

Preface About the Book Preface About the Book We are living in the dawn of what has been termed as the "Fourth Industrial Revolution" by the World Economic Forum (WEF) in 2016. The Fourth Industrial Revolution is marked through

More information

Machine Learning and Analytics. Machine Learning. Data Lake Analytics. HDInsight (Hadoop, Spark, Storm, HBase Managed Clusters) Stream Analytics

Machine Learning and Analytics. Machine Learning. Data Lake Analytics. HDInsight (Hadoop, Spark, Storm, HBase Managed Clusters) Stream Analytics 微软云上数据平台概括 Data Sources Information Management Big Data Stores Machine Learning and Analytics Intelligence People Data Factory Data Lake Store Machine Learning Cognitive Services Data Catalog SQL Data

More information

BIG WITH BIG DATA ANALYTICS

BIG WITH BIG DATA ANALYTICS Powered by Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS www.upxacademy.com 1800-123-1260 About us UpX Academy is an ed-tech platform providing advanced professional training in Big Data Analytics

More information

Powered by Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS

Powered by Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS Powered by Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS www.upxacademy.com 1800-123-1260 About us UpX Academy is an ed-tech platform providing advanced professional training in Big Data Analytics

More information

MR TIGER KIU. Leading New ICT, Building A Better Connected World

MR TIGER KIU. Leading New ICT, Building A Better Connected World MR TIGER KIU Leading New ICT, Building A Better Connected World Leading New ICT, Building A Better Connected World Huawei: A Global Leader of ICT Solutions 129 Ranking in Fortune Global 500 (2016 ) 80,000

More information

BIG WITH BIG DATA ANALYTICS

BIG WITH BIG DATA ANALYTICS Powered by Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS www.upxacademy.com 1800-123-1260 About us UpX Academy is an ed-tech platform providing advanced professional training in Big Data Analytics

More information

Optimal Infrastructure for Big Data

Optimal Infrastructure for Big Data Optimal Infrastructure for Big Data Big Data 2014 Managing Government Information Kevin Leong January 22, 2014 2014 VMware Inc. All rights reserved. The Right Big Data Tools for the Right Job Real-time

More information

Azure ML Data Camp. Ivan Kosyakov MTC Architect, Ph.D. Microsoft Technology Centers Microsoft Technology Centers. Experience the Microsoft Cloud

Azure ML Data Camp. Ivan Kosyakov MTC Architect, Ph.D. Microsoft Technology Centers Microsoft Technology Centers. Experience the Microsoft Cloud Microsoft Technology Centers Microsoft Technology Centers Experience the Microsoft Cloud Experience the Microsoft Cloud ML Data Camp Ivan Kosyakov MTC Architect, Ph.D. Top Manager IT Analyst Big Data Strategic

More information

Azure Data Analytics & Machine Learning Seminar. Daire Cunningham: BI Practice Area Manager

Azure Data Analytics & Machine Learning Seminar. Daire Cunningham: BI Practice Area Manager Azure Data Analytics & Machine Learning Seminar Daire Cunningham: BI Practice Area Manager AGENDA 09:00 AM 09:30 AM Registration & Refreshments 09.30AM 10:00 AM 10:00 AM 10:30 AM Welcome & Keynote, Ger

More information

Common Customer Use Cases in FSI

Common Customer Use Cases in FSI Common Customer Use Cases in FSI 1 Marketing Optimization 2014 2014 MapR MapR Technologies Technologies 2 Fortune 100 Financial Services Company 104M CARD MEMBERS 3 Financial Services: Recommendation Engine

More information

BIG WITH BIG DATA ANALYTICS

BIG WITH BIG DATA ANALYTICS Powered by Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS www.upxacademy.com 1800-123-1260 About us UpX Academy is an ed-tech platform providing advanced professional training in Big Data Analytics

More information

MapR: Solution for Customer Production Success

MapR: Solution for Customer Production Success 2015 MapR Technologies 2015 MapR Technologies 1 MapR: Solution for Customer Production Success Big Data High Growth 700+ Customers Cloud Leaders Riding the Wave with Hadoop The Big Data Platform of Choice

More information

SAP Predictive Analytics Suite

SAP Predictive Analytics Suite SAP Predictive Analytics Suite Tania Pérez Asensio Where is the Evolution of Business Analytics Heading? Organizations Are Maturing Their Approaches to Solving Business Problems Reactive Wait until a problem

More information

ARCHITECTURES ADVANCED ANALYTICS & IOT. Presented by: Orion Gebremedhin. Marc Lobree. Director of Technology, Data & Analytics

ARCHITECTURES ADVANCED ANALYTICS & IOT. Presented by: Orion Gebremedhin. Marc Lobree. Director of Technology, Data & Analytics ADVANCED ANALYTICS & IOT ARCHITECTURES Presented by: Orion Gebremedhin Director of Technology, Data & Analytics Marc Lobree National Architect, Advanced Analytics EDW THE RIGHT TOOL FOR THE RIGHT WORKLOAD

More information

E-guide Hadoop Big Data Platforms Buyer s Guide part 1

E-guide Hadoop Big Data Platforms Buyer s Guide part 1 Hadoop Big Data Platforms Buyer s Guide part 1 Your expert guide to Hadoop big data platforms for managing big data David Loshin, Knowledge Integrity Inc. Companies of all sizes can use Hadoop, as vendors

More information

Construction of Regional Logistics Information Platform Based on Cloud Computing

Construction of Regional Logistics Information Platform Based on Cloud Computing International Conference on Computational Science and Engineering (ICCSE 2015) Construction of Regional Logistics Information Platform Based on Cloud Computing Gang SUN 1,2,a,*, Xiu-You WANG 1,b, Hao WANG

More information

Big Data Application Engineer/ Developer. Specialization in Apache Spark, Kafka, Airflow, HBase

Big Data Application Engineer/ Developer. Specialization in Apache Spark, Kafka, Airflow, HBase BIG DATA COURSE Big Data Application Engineer/ Developer Specialization in Apache Spark, Kafka, Airflow, HBase In Exclusive Association with 21,347+ Participants 10,000+ Brands 1200+ Trainings 45+ Countries

More information

Tutorial: Big Data Algorithms and Applications Under Hadoop KUNPENG ZHANG SIDDHARTHA BHATTACHARYYA

Tutorial: Big Data Algorithms and Applications Under Hadoop KUNPENG ZHANG SIDDHARTHA BHATTACHARYYA Tutorial: Big Data Algorithms and Applications Under Hadoop KUNPENG ZHANG SIDDHARTHA BHATTACHARYYA http://kzhang6.people.uic.edu/tutorial/amcis2014.html August 7, 2014 Schedule I. Introduction to big data

More information

Managing explosion of data. Cloudera, Inc. All rights reserved.

Managing explosion of data. Cloudera, Inc. All rights reserved. Managing explosion of data 1 Customer experience expectations are converging on the brand, not channel Consistent across all channels and lines of business Contextualized to present location and circumstances

More information

Rotating to the New. How can Manufacturing Companies in China Thrive in the Digital Age. March 2018

Rotating to the New. How can Manufacturing Companies in China Thrive in the Digital Age. March 2018 Rotating to the New How can Manufacturing Companies in China Thrive in the Digital Age March 2018 The imperative for growth in Chinese Manufacturing Digital as a driver of high performance Going digital

More information

Intermodal Freight Transportation in China.

Intermodal Freight Transportation in China. Intermodal Freight Transportation in China www.sf-express.com What is Intermodal Freight Transportation? 2 Photo: Akira Kodaka What is Intermodal Freight Transportation? Intermodal Freight Transportation

More information

Accelerating Your Big Data Analytics. Jeff Healey, Director Product Marketing, HPE Vertica

Accelerating Your Big Data Analytics. Jeff Healey, Director Product Marketing, HPE Vertica Accelerating Your Big Data Analytics Jeff Healey, Director Product Marketing, HPE Vertica Recent Waves of Disruption IT Infrastructu re for Analytics Data Warehouse Modernization Big Data/ Hadoop Cloud

More information

1% + 99% = AI Popularization

1% + 99% = AI Popularization 1% + 99% = AI Popularization Unifying Data Science and Engineering Jason Bissell General Manager, APAC The beginnings of Apache Spark at UC Berkeley AMPLab funded by tech companies: Got a glimpse at their

More information

BIG DATA and DATA SCIENCE

BIG DATA and DATA SCIENCE Integrated Program In BIG DATA and DATA SCIENCE CONTINUING STUDIES Table of Contents About the Course...03 Key Features of Integrated Program in Big Data and Data Science...04 Learning Path...05 Key Learning

More information

Context. The NEW data services from UST Global UST GLOBAL - A UNIQUE PARTNER. UST Global Data Services March 2018!1

Context. The NEW data services from UST Global UST GLOBAL - A UNIQUE PARTNER. UST Global Data Services March 2018!1 UST Global Data Services March 2018!1 UST GLOBAL - A UNIQUE PARTNER Context Our Fortune 500 customers have immense amounts of transactional as well as interaction data distributed across a number of business

More information

2016 China s Internet Consumption Finance Market Research Report.

2016 China s Internet Consumption Finance Market Research Report. 2016 China s Internet Consumption Finance Market Research Report www.iresearchchina.com Consumption Contributes to Macroeconomic Restructuring China s National Economy Steadily Grows. Total Retail Sales

More information

Architecture Overview for Data Analytics Deployments

Architecture Overview for Data Analytics Deployments Architecture Overview for Data Analytics Deployments Mahmoud Ghanem Sr. Systems Engineer GLOBAL SPONSORS Agenda The Big Picture Top Use Cases for Data Analytics Modern Architecture Concepts for Data Analytics

More information

Big Data & Artificial Intelligence ----How to Achieve Accurate Sales

Big Data & Artificial Intelligence ----How to Achieve Accurate Sales Big Data & Artificial Intelligence ----How to Achieve Accurate Sales Prof. Guangxia Xu Chongqing University of Posts and Telecommunications, Chongqing, China xugx@cqupt.edu.cn 1/30 Outline 1. Background

More information

Modernizing Your Data Warehouse with Azure

Modernizing Your Data Warehouse with Azure Modernizing Your Data Warehouse with Azure Big data. Small data. All data. Christian Coté S P O N S O R S The traditional BI Environment The traditional data warehouse data warehousing has reached the

More information

Microsoft Azure Essentials

Microsoft Azure Essentials Microsoft Azure Essentials Azure Essentials Track Summary Data Analytics Explore the Data Analytics services in Azure to help you analyze both structured and unstructured data. Azure can help with large,

More information

Insights to HDInsight

Insights to HDInsight Insights to HDInsight Why Hadoop in the Cloud? No hardware costs Unlimited Scale Pay for What You Need Deployed in minutes Azure HDInsight Big Data made easy Enterprise Ready Easier and more productive

More information

Hadoop Course Content

Hadoop Course Content Hadoop Course Content Hadoop Course Content Hadoop Overview, Architecture Considerations, Infrastructure, Platforms and Automation Use case walkthrough ETL Log Analytics Real Time Analytics Hbase for Developers

More information

VICE PRESIDENT, ARCHITECTURE GENERAL MANAGER, AI PRODUCTS GROUP - INTEL

VICE PRESIDENT, ARCHITECTURE GENERAL MANAGER, AI PRODUCTS GROUP - INTEL VICE PRESIDENT, ARCHITECTURE GENERAL MANAGER, AI PRODUCTS GROUP - INTEL Artificial intelligence Machine learning Deep learning Types of analytics/ml (PARTIAL LIST) C l a s s i f i c a t i o n R e g r

More information

Simplifying the Process of Uploading and Extracting Data from Apache Hadoop

Simplifying the Process of Uploading and Extracting Data from Apache Hadoop Simplifying the Process of Uploading and Extracting Data from Apache Hadoop Rohit Bakhshi, Solution Architect, Hortonworks Jim Walker, Director Product Marketing, Talend Page 1 About Us Rohit Bakhshi Solution

More information

Spark and Hadoop Perfect Together

Spark and Hadoop Perfect Together Spark and Hadoop Perfect Together Arun Murthy Hortonworks Co-Founder @acmurthy Data Operating System Enable all data and applications TO BE accessible and shared BY any end-users Data Operating System

More information

Bringing the Power of SAS to Hadoop Title

Bringing the Power of SAS to Hadoop Title WHITE PAPER Bringing the Power of SAS to Hadoop Title Combine SAS World-Class Analytics With Hadoop s Low-Cost, Distributed Data Storage to Uncover Hidden Opportunities ii Contents Introduction... 1 What

More information

ADVANCED ANALYTICS & IOT ARCHITECTURES

ADVANCED ANALYTICS & IOT ARCHITECTURES ADVANCED ANALYTICS & IOT ARCHITECTURES Presented by: Orion Gebremedhin Director of Technology, Data & Analytics Marc Lobree National Architect, Advanced Analytics EDW THE RIGHT TOOL FOR THE RIGHT WORKLOAD

More information

Data Analytics for Semiconductor Manufacturing The MathWorks, Inc. 1

Data Analytics for Semiconductor Manufacturing The MathWorks, Inc. 1 Data Analytics for Semiconductor Manufacturing 2016 The MathWorks, Inc. 1 Competitive Advantage What do we mean by Data Analytics? Analytics uses data to drive decision making, rather than gut feel or

More information

Big Data Foundation. 2 Days Classroom Training PHILIPPINES :: MALAYSIA :: VIETNAM :: SINGAPORE :: INDIA

Big Data Foundation. 2 Days Classroom Training PHILIPPINES :: MALAYSIA :: VIETNAM :: SINGAPORE :: INDIA Big Data Foundation 2 Days Classroom Training PHILIPPINES :: MALAYSIA :: VIETNAM :: SINGAPORE :: INDIA Content Big Data Foundation Course Introduction Who we are Course Overview Career Path Course Content

More information

Intro to Big Data and Hadoop

Intro to Big Data and Hadoop Intro to Big and Hadoop Portions copyright 2001 SAS Institute Inc., Cary, NC, USA. All Rights Reserved. Reproduced with permission of SAS Institute Inc., Cary, NC, USA. SAS Institute Inc. makes no warranties

More information

Big Data Introduction

Big Data Introduction Big Data Introduction Who we are Experts At Your Service Over 50 specialists in IT infrastructure Certified, experienced, passionate Based In Switzerland 100% self-financed Swiss company Over CHF8 mio.

More information

DATA SCIENCE: HYPE AND REALITY PATRICK HALL

DATA SCIENCE: HYPE AND REALITY PATRICK HALL DATA SCIENCE: HYPE AND REALITY PATRICK HALL About me SAS Enterprise Miner, 2012 Cloudera Data Scientist, 2014 Do you use Kolmogorov Smirnov often? Statistician No, I mix my martinis with gin. Data Scientist

More information

Official Recruitment Partner of Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS. Powered by.

Official Recruitment Partner of Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS. Powered by. Official Recruitment Partner of Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS Powered by www.upxacademy.com About us UpX Academy is an ed-tech platform providing advanced professional training in Big

More information

Hadoop and Analytics at CERN IT CERN IT-DB

Hadoop and Analytics at CERN IT CERN IT-DB Hadoop and Analytics at CERN IT CERN IT-DB 1 Hadoop Use cases Parallel processing of large amounts of data Perform analytics on a large scale Dealing with complex data: structured, semi-structured, unstructured

More information

New Big Data Solutions and Opportunities for DB Workloads

New Big Data Solutions and Opportunities for DB Workloads New Big Data Solutions and Opportunities for DB Workloads Hadoop and Spark Ecosystem for Data Analytics, Experience and Outlook Luca Canali, IT-DB Hadoop and Spark Service WLCG, GDB meeting CERN, September

More information

Big Data in Urban Power Distribution and Consumption Systems. Dr. Dongxia ZHANG 2016 IERE CLP-RI Hong Kong Workshop November 2016

Big Data in Urban Power Distribution and Consumption Systems. Dr. Dongxia ZHANG 2016 IERE CLP-RI Hong Kong Workshop November 2016 Big Data in Urban Power Distribution and Consumption Systems Dr. Dongxia ZHANG 2016 IERE CLP-RI Hong Kong Workshop 21-24 November 2016 Agenda Ⅰ Ⅱ Data Perspective on Utility Big Overview of CEPRI s Research

More information

Leveraging smart meter data for electric utilities:

Leveraging smart meter data for electric utilities: Leveraging smart meter data for electric utilities: Comparison of Spark SQL with Hive 5/16/2017 Hitachi, Ltd. OSS Solution Center Yusuke Furuyama Shogo Kinoshita Who are we? Yusuke Furuyama Solutions engineer

More information

How to build and deploy machine learning projects

How to build and deploy machine learning projects How to build and deploy machine learning projects Litan Ilany, Advanced Analytics litan.ilany@intel.com Agenda Introduction Machine Learning: Exploration vs Solution CRISP-DM Flow considerations Other

More information

Leveraging smart meter data for electric utilities:

Leveraging smart meter data for electric utilities: Leveraging smart meter data for electric utilities: Comparison of Spark SQL with Hive 5/16/2017 Hitachi, Ltd. OSS Solution Center Yusuke Furuyama Shogo Kinoshita Who are we? Yusuke Furuyama Solutions engineer

More information

Cloudera Data Science and Machine Learning. Robin Harrison, Account Executive David Kemp, Systems Engineer. Cloudera, Inc. All rights reserved.

Cloudera Data Science and Machine Learning. Robin Harrison, Account Executive David Kemp, Systems Engineer. Cloudera, Inc. All rights reserved. Cloudera Data Science and Machine Learning Robin Harrison, Account Executive David Kemp, Systems Engineer 1 This is the age of machine learning. Data volume NO Machine Learning Machine Learning 1950s 1960s

More information

IBM SPSS & Apache Spark

IBM SPSS & Apache Spark IBM SPSS & Apache Spark Making Big Data analytics easier and more accessible ramiro.rego@es.ibm.com @foreswearer 1 2016 IBM Corporation Modeler y Spark. Integration Infrastructure overview Spark, Hadoop

More information

ZHANG Xin. National Center for Climate Change Strategy and International Cooperation

ZHANG Xin. National Center for Climate Change Strategy and International Cooperation ZHANG Xin National Center for Climate Change Strategy and International Cooperation 2017.9 1 2 3 4 Important requirement and work on Climate Change in China National Greenhous Gas Inventory and methodology

More information

Building Enterprise OLAP on Hadoop for Financial Services Industry

Building Enterprise OLAP on Hadoop for Financial Services Industry Building Enterprise OLAP on Hadoop for Financial Services Industry Luke Han luke@kyligence.io @lukehq Co-founder & CEO of Kyligence Creator & VP of Apache Kylin Microsoft Regional Director & MVP About

More information

Data Science, realizing the Hype Cycle. Luigi Di Rito, Director Data Science Team, SAP Center of Excellence

Data Science, realizing the Hype Cycle. Luigi Di Rito, Director Data Science Team, SAP Center of Excellence Data Science, realizing the Hype Cycle. Luigi Di Rito, Director Data Science Team, SAP Center of Excellence Data Science, Machine Learning and Artificial Intelligence Deep Learning AREAS OF AI Rule-based

More information

Operational Hadoop and the Lambda Architecture for Streaming Data

Operational Hadoop and the Lambda Architecture for Streaming Data Operational Hadoop and the Lambda Architecture for Streaming Data 2015 MapR Technologies 2015 MapR Technologies 1 Topics From Batch to Operational Workloads on Hadoop Streaming Data Environments The Lambda

More information

SAP Machine Learning for Hadoop. Customer

SAP Machine Learning for Hadoop. Customer SAP Machine Learning for Hadoop Customer SAP BusinessObjects Predictive Analytics and Big Data 1. Support for end-to-end operational predictive lifecycle on Hadoop 2. Business Analyst Friendly No coding

More information

Research on the Framework and Data Fusion of an Energy Big-data Platform

Research on the Framework and Data Fusion of an Energy Big-data Platform 1 Paper Number: 17PESGM2652 Panel: Big data for Integrated Energy Systems Research on the Framework and Data Fusion of an Energy Big-data Platform Gengfeng Li, Zhaohong Bie, Jiang Wu, Cheng Li gengfengli@xjtu.edu.cn

More information

AI Solutions and Use Cases Up Close Dolly Wu, Vice President/GM Alfie Lew, Solution Architect

AI Solutions and Use Cases Up Close Dolly Wu, Vice President/GM Alfie Lew, Solution Architect AI Summit 2017 AI Solutions and Use Cases Up Close Dolly Wu, Vice President/GM Alfie Lew, Solution Architect Inspur Quick Overview 4 Business Groups $10.2 Billion FY16 Revenue TOP 4 Server Manufacturer

More information

Active Analytics Overview

Active Analytics Overview Active Analytics Overview The Fourth Industrial Revolution is predicated on data. Success depends on recognizing data as the most valuable corporate asset. From smart cities to autonomous vehicles, logistics

More information

Digital Transformation 2.0

Digital Transformation 2.0 Digital Transformation 2.0 Job roles and skills that every IT Services company must know We have been hearing for quite some time, that the world is going through digital transformation & HR department

More information

This document (including, without limitation, any product roadmap or statement of direction data) illustrates the planned testing, release and

This document (including, without limitation, any product roadmap or statement of direction data) illustrates the planned testing, release and Shawn Rogers Orchestrating and Managing Enterprise Analytics DISCLAIMER During the course of this presentation, TIBCO or its representatives may make forward-looking statements regarding future events,

More information

The Internet of Everything and the Research on Big Data. Angelo E. M. Ciarlini Research Head, Brazil R&D Center

The Internet of Everything and the Research on Big Data. Angelo E. M. Ciarlini Research Head, Brazil R&D Center The Internet of Everything and the Research on Big Data Angelo E. M. Ciarlini Research Head, Brazil R&D Center A New Industrial Revolution Sensors everywhere: 50 billion connected devices by 2020 Industrial

More information

Official Recruitment Partner of Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS. Powered by.

Official Recruitment Partner of Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS. Powered by. Official Recruitment Partner of Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS Powered by www.upxacademy.com About us UpX Academy is an ed-tech platform providing advanced professional training in Big

More information

Jun Pei. 09/ /2009 Bachelor in Management Science and Engineering

Jun Pei. 09/ /2009 Bachelor in Management Science and Engineering Jun Pei School of Management, Hefei University of Technology Tunxi Road 193, Hefei City, Anhui province, China, 230009 Email: peijun@hfut.edu.cn; feiyijun198612@126.com; feiyijun.ufl@gmail.com Phone: (86)

More information

Data Analytics. Nagesh Madhwal Client Solutions Director, Consulting, Southeast Asia, Dell EMC

Data Analytics. Nagesh Madhwal Client Solutions Director, Consulting, Southeast Asia, Dell EMC Data Analytics Nagesh Madhwal Client Solutions Director, Consulting, Southeast Asia, Dell EMC Last 15 years IT-centric Traditional Analytics Traditional Applications Rigid Infrastructure Internet Next

More information

Frontiers and Trends SHANGHAI: SHENZHEN: BEIJING: 30 th March Shanghai Tower. 8 th April St. Regis. 12 th April Beijing Marriott Hotel City Wall

Frontiers and Trends SHANGHAI: SHENZHEN: BEIJING: 30 th March Shanghai Tower. 8 th April St. Regis. 12 th April Beijing Marriott Hotel City Wall Frontiers and Trends SHANGHAI: 30 th March Shanghai Tower SHENZHEN: 8 th April St. Regis BEIJING: 12 th April Beijing Marriott Hotel City Wall PERCEPTION AND IMAGINE ONCE AGAIN DEEPLY CONSIDERING THE FUTURE

More information

KnowledgeENTERPRISE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK. Advanced Analytics on Spark BROCHURE

KnowledgeENTERPRISE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK. Advanced Analytics on Spark BROCHURE FAST TRACK YOUR ACCESS TO BIG DATA WITH ANGOSS ADVANCED ANALYTICS ON SPARK Are you drowning in Big Data? Do you lack access to your data? Are you having a hard time managing Big Data processing requirements?

More information

MapR Pentaho Business Solutions

MapR Pentaho Business Solutions MapR Pentaho Business Solutions The Benefits of a Converged Platform to Big Data Integration Tom Scurlock Director, WW Alliances and Partners, MapR Key Takeaways 1. We focus on business values and business

More information

Towards a Big Data-as-a-Service for Legislative Research in the National Assembly Library of Korea. Data Convergence Analysis Division

Towards a Big Data-as-a-Service for Legislative Research in the National Assembly Library of Korea. Data Convergence Analysis Division Towards a Big Data-as-a-Service for Legislative Research in the National Assembly Library of Korea Data Convergence Analysis 1. 2. 1 Amend the NAL Act and the Decree of NAL Organization Establish New

More information

Design Your Strategy for Digital Transformation with SAP S/4HANA. Allen Li, SAP Greater China July 25, 2016

Design Your Strategy for Digital Transformation with SAP S/4HANA. Allen Li, SAP Greater China July 25, 2016 Design Your Strategy for Digital Transformation with SAP S/4HANA Allen Li, SAP Greater China July 25, 2016 Why Digital transformation? Adapt or die Hyperconnectivity Mobile In-memory computing Internet

More information

Big data is hard. Top 3 Challenges To Adopting Big Data

Big data is hard. Top 3 Challenges To Adopting Big Data Big data is hard Top 3 Challenges To Adopting Big Data Traditionally, analytics have been over pre-defined structures Data characteristics: Sales Questions answered with BI and visualizations: Customer

More information

Official Recruitment Partner of Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS. Powered by.

Official Recruitment Partner of Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS. Powered by. Official Recruitment Partner of Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS Powered by www.upxacademy.com About us UpX Academy is an ed-tech platform providing advanced professional training in Big

More information

Official Recruitment Partner of Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS. Powered by.

Official Recruitment Partner of Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS. Powered by. Official Recruitment Partner of Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS Powered by www.upxacademy.com About us UpX Academy is an ed-tech platform providing advanced professional training in Big

More information

Potential for Savings in China s Government Energy Efficiency Procurement Program: Preliminary Findings

Potential for Savings in China s Government Energy Efficiency Procurement Program: Preliminary Findings Potential for Savings in China s Government Energy Efficiency Procurement Program: Preliminary Findings David Fridley Lawrence Berkeley National Laboratory Berkeley, CA August 2005 1. Background On December

More information

Knowledge Discovery and Data Mining

Knowledge Discovery and Data Mining Knowledge Discovery and Data Mining Unit # 19 1 Acknowledgement The following discussion is based on the paper Mining Big Data: Current Status, and Forecast to the Future by Fan and Bifet and online presentation

More information

The Global Market for Intelligent Video Analytics

The Global Market for Intelligent Video Analytics The Global Market for Intelligent Video Analytics 2018 to 2023 Published: Q3 2018 Global Market for Intell Video Analytics 2018 to 2023 Synopsis This report aims to assist all stakeholders and investors

More information

Official Recruitment Partner of Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS. Powered by.

Official Recruitment Partner of Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS. Powered by. Official Recruitment Partner of Tech Mahindra MAKE IT BIG WITH BIG DATA ANALYTICS Powered by www.upxacademy.com About us UpX Academy is an ed-tech platform providing advanced professional training in Big

More information

Redefine Big Data: EMC Data Lake in Action. Andrea Prosperi Systems Engineer

Redefine Big Data: EMC Data Lake in Action. Andrea Prosperi Systems Engineer Redefine Big Data: EMC Data Lake in Action Andrea Prosperi Systems Engineer 1 Agenda Data Analytics Today Big data Hadoop & HDFS Different types of analytics Data lakes EMC Solutions for Data Lakes 2 The

More information