Onur Celebioglu, Engineering Director, HPC & AI Solutions


1 Dell EMC HPC & AI Innovation Lab: AI Technology Update
Onur Celebioglu, Engineering Director, HPC & AI Solutions

2 Machine Learning & Deep Learning ecosystem solving real-world problems
[Diagram: layered solution stack]
Verticals / Use Cases: web recommendation engines, image classification, smart chatbots, disease identification, predictive marketing, fraud detection, smart traffic, threat predictions, inventory management
Consumption Models: buy, build, appliance, solution, services, service providers, systems integrators
Management / Orchestration: Big Data & HPC orchestration, Bright ML, containers orchestration
Virtualization: Bitfusion, accelerator virtualization & pooling
Software & Frameworks: enterprise ISV ML software, open source frameworks, math libraries, BigDL
Processor / Accelerator: Xeon, Xeon Phi, Crest Family, FPGA adapter
Compute / Storage / Networking: R840, R940XA, R740, C4140, T640, C6320p, C6420, big accelerator system
Other labels on the diagram: Core to Edge, IoT, Hyperconverged, In-Memory Analytics, Training, Inference

3 Making Deep Learning Easier
Building Optimal Infrastructure
Developing Best Practices
Industry-focused Examples
Consulting & Support
Community Involvement & Thought Leadership

4 Building Optimal Infrastructure

5 50K ft. View
What goes into building these solutions:
Performance Characterization: servers, storage, networking, software stack
Best Practices for Deep Learning Ecosystems: frameworks and models, use cases, containerization, programming models, performance at scale
Deployment, orchestration, monitoring and management
Ease of use: Data Science Portal, Model Zoo
Customer POCs

6 Developing Best Practices

7 Benchmarking Deep Learning Frameworks
Characterizing the performance of various HW configurations, DL frameworks, math libraries, and parallelization frameworks to save customers time and money, and to inform Dell EMC solution designs.
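A minimal sketch of the kind of timing harness such a characterization might rest on, assuming throughput (samples per second) is the metric of interest. The function name `measure_throughput`, its parameters, and the dummy step are hypothetical illustrations and are not taken from the slides.

```python
import time
from typing import Callable


def measure_throughput(step_fn: Callable[[], object],
                       batch_size: int,
                       warmup_steps: int = 5,
                       timed_steps: int = 20) -> float:
    """Return samples processed per second for a repeated step.

    step_fn is assumed to run one full training or inference step on a
    batch of `batch_size` samples. With GPU frameworks the step should
    block until the device has finished, otherwise the timing is too low.
    """
    # Warm-up steps are excluded so one-time costs (allocation,
    # autotuning, caches) do not distort the measurement.
    for _ in range(warmup_steps):
        step_fn()

    start = time.perf_counter()
    for _ in range(timed_steps):
        step_fn()
    elapsed = time.perf_counter() - start

    if elapsed <= 0:
        raise RuntimeError("step_fn finished too quickly to be timed")
    return batch_size * timed_steps / elapsed


if __name__ == "__main__":
    # Stand-in workload; a real run would call a framework's step here.
    def dummy_step() -> int:
        return sum(i * i for i in range(10_000))

    rate = measure_throughput(dummy_step, batch_size=32)
    print(f"{rate:.1f} samples/sec")
```

Running the same harness across hardware configurations, frameworks, and library builds gives numbers that can be compared directly, provided batch size and step definition are held constant.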

8 Profiling and Optimizing
Determining the best runtime configuration, data formats, software options and environments to get the most out of a particular hardware configuration.

9 Documenting
Blog posts, white papers, technical reports, and research papers provide customers and the community at large with best practices for maximizing performance and value of Dell EMC hardware.

10 Industry-focused Examples

11 State of the Art Medical Image Classification
State-of-the-art Deep Learning models can achieve radiologist-level accuracy in detecting pneumonia from chest x-rays (CheXNet, Stanford, 2017).
[Image labels: Nodule, Pneumonia]
Improved accuracy in 10 of 14 conditions!

12 Natural Language Translation
Traditional recurrent neural networks provide good translation, but are slow to train and not easily parallelizable. HPC and AI Engineering is exploring other topologies which provide similar translation quality while enabling parallel improvement in training time, and giving customers better time to value.
[Chart: seconds per step, Single Node vs. 8 Nodes]
7x faster training using 8 nodes!
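To put the 7x figure in context, a short sketch of the usual speedup and scaling-efficiency arithmetic follows. The function names are hypothetical, and only the 7x speedup and the 8-node count come from the slide; the per-step timings themselves are not given there.

```python
def speedup(single_node_seconds_per_step: float,
            multi_node_seconds_per_step: float) -> float:
    """Ratio of single-node step time to multi-node step time."""
    return single_node_seconds_per_step / multi_node_seconds_per_step


def scaling_efficiency(speedup_factor: float, num_nodes: int) -> float:
    """Fraction of ideal linear scaling achieved (1.0 means perfect)."""
    return speedup_factor / num_nodes


if __name__ == "__main__":
    # 7x faster on 8 nodes, as stated on the slide: 7 / 8 = 0.875
    efficiency = scaling_efficiency(7.0, 8)
    print(f"Scaling efficiency: {efficiency:.1%}")
```

Under these definitions, 7x on 8 nodes is 87.5% of ideal linear scaling.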

13 Customer POCs

14 Helping AeroFarms Feed the World
AeroFarms is developing deep learning and machine learning models to improve quality, increase yield, and eliminate bottlenecks.
HPC and AI Engineering is working closely with AeroFarms, helping them to make sense of their data, design neural networks, and train models to help them revolutionize farming and feed the world.

15 Community Involvement & Thought Leadership

16 Contribution to DL Software Community
Contributions to Open Source Projects
Scale-out DL Training
[Chart values: Singularity, 7; NV-Caffe, 4; Caffe2, 6; TensorFlow, 9; MXNet, 8]

17 Sharing Insights and Thought Leadership

18 Ready Solutions for AI

19 Data Science Portal
Ease of use
Spawner for JupyterHub
Integrated into Slurm scheduler
LDAP for user management
Module environment
Python 2, Python 3 and R support
TensorBoard
Terminal CLI environment
Templates for different ecosystems
Support for NGC containers
Singularity support
Walkthrough video

20 Putting Our Expertise into Customers' Hands
HPC and AI Engineering is putting all of this expertise into the Dell EMC Ready Bundles for Artificial Intelligence:
Dell EMC Ready Bundle for Deep Learning w/ NVIDIA
Dell EMC Ready Bundle for Deep Learning w/ Intel