PRODUCT PRESENTATION

Size: px
Start display at page:

Download "PRODUCT PRESENTATION"

Transcription

1 PRODUCT PRESENTATION Valéry Guilleaume, CEO & Founder September 2017, version 1.6 Public 1

2 NODEUM is software created by MT-C We engineered an open platform which enables our customers to focus on their business value add. The storage Abstraction & Orchestration facilitate a fabric of different hardware and silo technologies such as flash, spinning disk, tapes and cloud. All behind a unique file repository. 2

3 NODEUM IS A DATA FABRIC PLATFORM Software Appliance running on commodity hardware abstracting different classes of storage Speed Very Expensive Accessibility Expensive Flash Disk NODEUM ABSTRACTS ALL OF THESE TECHNOLOGIES Cloud Tapes Service Pay per use Capacity Cheapest 3

4 PROVIDING ENHANCED FILE DATA MANAGEMENT A Toolbox is available to quickly and effortlessly manage & retrieve data REST API WEB SEARCH ENGINE DATA MOVER DATA TAGGING WITH METADATA TOOLBOX DATA PLACEMENT CATALOG FILE ANALYSIS AUTOMATIC PLACEMENT OPTIMIZATION BEHAVIOR ANALYSES 4

5 AND SO WHAT? A VISIONARY VALUE PROPOSITION OUR CUSTOMER s business model includes manipulating, comparing, associating or simulating data to create economic and/or social value - Finding/ retrieving data is critical. - Need for a self service model (quick & easy). - Keeping data to use it over and over (100+TB). WHAT WE DO : 1. We facilitate the data management of our customers, which has a direct impact on their business. more data => more information => more business 2. Great user experience to make it simple and Gartner : IT Market Clock for Storage, 2016 convenient. 5

6 HEALTHCARE GENOMIC CHALLENGE Today, the world is facing a massive explosion of structured and unstructured data ; mainly due to 1 the decline in genome analysis cost (100$ Genome). The challenge is at a magnitude of Exabytes and has a direct impact on the business growth of markets. Note : 1 Exabyte (EB) = 1000 petabytes (PB) = 1 million terabytes (TB) = 1 billion gigabytes (GB) 2 For analysis requirements, data cannot be deleted anymore and the required retention periods are growing (30+ years). Requirement to get control of this data. Storing data is easy, but retrieving data is still 3 complex for users. Too much time is currently spent searching for data instead of researching data. Current solutions are coming from hardware vendors. They provide IT oriented silo disk 4 arrays, with an expensive average cost per raw TB of $1,593*. *(Source: Gartner IT Key Metrics Data (December 2016) WANT TO KNOW MORE? LISTEN THIS PODCAST RECORDED AT BIO-IT WORLD 2017 BY THE CHI 6

7 NEXT GEN SEQUENCING - A GAME CHANGER! The price of DNA sequencing is falling faster than computer disk storage costs, making data fabric an increasingly important tool in genomics 1,000, SILO DISK STORAGE DATA FABRIC 100,000, , , , Hard Disk Storage (MB/$) Doubling time 14 months NGS (bp+$) Doubling time 5 months 10,000, ,000, , , , Pre-NGS (bp+$) Doubling time 19 months Disk Storage (Mbytes/$) DNA sequencing (bp/$) DNA sequencing (bp/$) Source: L. D. Stein Genome Biol. 11, 207 (2010) 7

8 WITH NODEUM the right Data is at the right Place with the right Service level for the right User at the right Cost Flash Disk Tapes Disk Cloud Disk Customers generates up to 300TB per year per sequencer 1 st tier storage: fast but expensive and not structured 2 nd Tier Storage : Nodeum software: researchers can easily find what they need, when they need it Flash Disk Tapes Cloud existing legacy & commodity hardware 8

9 WHY? SAVING PEOPLE S LIVES! Genomic analyses have become much more affordable over the recent years. Research centers and hospitals have acquired specific equipment for these applications, for patient treatment purposes or research. This has resulted in a constant creation digital information. Once generated, this data is made available to scientists and medical staff for analysis and then stored many years for instant retrieval whenever needed. Today, researchers are spending too much of their time searching for data instead of researching data. If a researcher has instant access to data, productivity goes up, more research can be performed, and MORE LIVES CAN BE SAVED! 9

10 NODEUM, A DATA FABRIC CIFS data flow NFS command flow REST Single Namespace NAS CLUSTER APPLIANCE Data Cataloging File Data Analysis Metadata Tagging Web Search Engine Data Retention Data Movement LOGICS Integrity Workflow Manager Data Migration Post Processing Events Data Placement Block - File File Object COMMODITY HARDWARE Flash Disk Tapes Cloud INTERFACES IP / F.C. / SAS F.C. / SAS S3 10

11 Storage Classes abstraction -> one file system NAS External Archive NAS External Connector NAS REST API NFS / SMB NODEFS MASTER CONTROLLER APPLIANCE FiberChannel SAS - IP STORAGE NODE HIGH PERFORMANCE FLASH STORAGE NODE NEARLINE DISK STORAGE NODE HIGH TAPE CAPACITY WARMEST COLDEST 11

12 Workflow Manager Scheduler 100 Waiting Movement Request From Source To Destination Filter : <Defined> Option : Integrity : yes RULE_0005 RULE_0004 RULE_0003 RULE_0002 Priority Queue Processing Processing Processing RULE_ STORAGE NODE Processing Queue 12

13 Pre-Defined Workflows Worfklows Online Storage Offline Archive Data Exchange 13

14 Inline Data Cataloging Inline Global Content Catalog Metadata Tagging Real-time in ingestion Storage copies visualisation 14

15 Workflow Online Storage Tapes & Cloud Drives Slots I/O Nodeum Drive I/O File Catalog Data Management Drive I/O Tape Library add copy on tapes Drive I/O Bucket a S3 Object Storage File Catalog after workflow movement: file in cache file with a copy on tape file with a copy on cloud 15

16 Workflow Offline Archive Tape Vaulting Nodeum Tape Library File Catalog Data Management Drives Slots I/O offline archive to tapes Drive I/O Drive I/O Drive I/O File Catalog after workflow movement: file in cache file in online tape file in offline tape 16

17 Workflow Data Exchange Tapes to Cloud Drives Slots I/O Nodeum Drive I/O File Catalog Data Management Drive I/O Tape Library data exchange from LTFS to cache to cloud Drive I/O Bucket a S3 Object Storage File Catalog after workflow movement: file in cache file on tape file on cloud 17

18 Smart Search Engine 18

19 Storage Trends Analysis 19

20 REST API to Automated your Workflow The Restful API allows for a tight integration with the business workflows within Bio IT, allowing for control of the Data Fabric straight from sequencers and other equipment. This results in an automated and highly efficient way of working coupled with low vault tolerances. EXAMPLE: 1. DNA Sequencer copies Data in NODEUM 2. From DNA Sequencer, API calls to updated Patient Field to John 3. From DNA Sequencer, API calls to updated Source Field to set the Sequencer box 20

21 Customer Case KEY DECISION DRIVERS COLDEST STORAGE NODE HIGH TAPE CAPACITY 1800TB 130TB SCALABILITY Virtualization of storage & archiving, Scalability : starting from 100TB and scale to 1800TB STORAGE NODE NEARLINE DISK 30TB REST API NODEFS Intelligent Data Analysis Dynamic Content Catalog WARMEST NAS REST API NFS / SMB Active Archive, LTFS format, no vendor-lockin Intelligent Workflows Process Easy to use & Intuitive System Multi site & offline vaulting. OFFICE GENETICS LAB 21

22 5 TOP CUSTOMER BENEFITS 1. Find and Retrieve Data is easy and fast 2. Self Service Model 3. Highly Scalable to store and archive data forever 4. Control Cost of Storage Evolution 5. Automate Business Data Management Worfklow WANT TO KNOW MORE? WATCH THE ONCODNA EXPERIENCE WITH NODEUM 22

23 THANK YOU INTERESTED TO KNOW MORE? CONTACT US MT-C S.A. Rue Ernest Solvay 29/A 4000 LIEGE, BELGIUM 23

24 ATIONS ADDITIONAL INFO. PRESS 2016 : ComputerWeekly : Linux file system + LTFS tape = Nodeum's cold storage 2016 : TVBEurope: Nodeum set to virtualise storage 2016 : StorageNewsLetter : MT-C Said Nodeum Certified LTFS by LTO Consortium 2016 : CIO Review : MT-C : A route to Next Generation Storage 2016 : The Silicon Review : Turning entrepreneurial vision for the future of storage into reality: MT-C 2017 : ComputerWeekly : Cancer researcher gets Nodeum LTFS tape NAS 2017 : CHI : Podcasts - Bio-IT World 2017 : Poweredbytape : NODEUM is the Scalable, Hybrid Software-Defined Storage Platform 2017 : LeMag IT : OncoDNA s'appuie sur la solution LTFS de Nodeum pour ses archives 24 24

25 ATIONS ADDITIONAL INFO. PRODUCT NODEUM - Next Storage Generation NODEUM - Business Applications cases - Product Overview CUSTOMER TESTIMONIAL Vidéo : ONCODNA precision medicine application leverage their data in using... Case Study Life Science - NODEUM fr Case Study OncoDNA - NODEUM Case Study IPG - NODEUM 25 25

26 We are Vendor Neutral ADDITIONAL INFO. TECHNOLOGY ALLIANCE VENDOR ALLIANCE CONSORTIUM ALLIANCE 26 26

27 Leverage the LTFS Open Standard ADDITIONAL INFO. Store your data in the Linear Tape File System (LTFS) open standard is the best strategy for supporting data transportability and long-term data accessibility. 27