Implementing Relational Use Cases with mongodb and Pentaho January 16, 2013 Dave Henry (dhenry@pentaho.com) Dave Diegtel (ddiegtel@pentaho.com)
Agenda Pentaho / 10Gen Partnership Background Pentaho for Big Data Relational Use Cases - Pentaho & mongodb Pentaho for mongodb planned enhancements 2
About Pentaho Penta Greek for 5. Based on Pentaho s 5 founders, Headquarters, Orlando, FL.. With office is San Francisco, CA. Achieve positive, disruptive change in Data Integration and BI Markets. Achieved Critical Mass 10,000 production deployments in 185 countries Subscription-based Pay-as-you-go, lower upfront cost, only renew when you achieve your ROI Training and Consulting Stewardship of open source analytics projects (Kettle, Mondrian, and WEKA) Pentaho is based on open source software and benefits from a continuous stream of innovation from the open source community. Pentaho contributes back many new capabilities and provides employment for many of the most skilled and innovative community project leaders. INDUSTRY RECOGNITION OVER 160 PARTNERS GLOBALLY 3
Commercial Innovation Online Retailer Understanding the buying patterns of 5 million users from click stream data stored in Hadoop & HBase Mobile & Digital Media Embedded Pentaho to measure massive volumes of mobile and event data generated from mobile devices stored in MongoDB Gaming Better monetization of premium game features through analyzing large volumes of player data - stored in MongoDB & Infobright Travel & Entertainment Helping thousands of travel partners like expedia.co.uk and thomascook.fr improve promotional targeting using Hbase and Hadoop Social Commerce Better campaign performance through monitoring social media, page clicks and email marketing data stored in HP Vertica Healthcare Embedded Pentaho to better patient care & compliance through analysis of unstructured digital pen data stored in CouchDB
5
Big Data Partnership Announcement May 23, 2012 Working together, Pentaho and 10gen offer the first mongodb-based big data analytics solution to the market. mongodb is a scalable, high-performance, open source NoSQL database featuring document-oriented storage, auto-sharing for horizontal scalability, rich document-based queries and fast in-place updates. This is leveraged by Pentaho s visual interfaces for highperformance data input, output and manipulation, as well as data discovery, visualization and predictive analytics. Target Audience is IT staff, developers, data scientists and business analysts. 6
Using Pentaho with mongodb mongodb Input Complete business analytics In addition to visual data loading and manipulation, Pentaho Business Analytics provides a complete end-to-end analytics suite that includes data discovery and predictive analytics. Productivity With Pentaho, mongodb users have an integrated visual environment to deploy, manage, report, visualize and explore big data. Pentaho s visual interface enables up to a 15X productivity improvement for developing and managing big data. Orchestration By integrating mongodb is woven more tightly into the broader fabric of big data and traditional data sources. 7
Pentaho in the Big Data Fabric Pentaho Business Analytics Data Integration Job Orchestration Workflow 3 rd Party Tools R 3 rd Party BI Tools Applications Scheduling High Performance Visual IDE Hadoop NoSQL Databases Analytic Databases Big Data Mgmt Data Integration Big Analytics
Relational Use Cases Pentaho & mongodb In s & out s (the basics) Using mongodb as the source for a data warehouse dimension (extract-transform-load) OLAP analysis using Pentaho Instaview (extract-transform-present) mongodb as the source for a predictive analytic web service Planned enhancements (new mongodb input step) 9
Product Demo 2012, Pentaho. All Rights Reserved. pentaho.com. Worldwide +1 (866) 660-7555 10
Our Questions for You Do you have BI use cases for mongodb? Do you have SOA use cases requiring the use of 3 rd - party integration tools? Do you have use cases requiring predictive analytics? Fill in the blank: If only I could with mongodb 11
Thank You Join the conversation. You can find us on: http://blog.pentaho.com @Pentaho Facebook.com/Pentaho Pentaho Business Analytics