DataAdapt Active Insight

Solution Highlights

- Accelerated time to value
- Enterprise-ready Apache Hadoop based platform for data processing, warehousing and analytics
- Advanced analytics for structured, semi-structured and unstructured data
- Professional-grade visualization, development and administration tooling to boost productivity
- Application accelerators that help speed implementation and accelerate time to value
- Integration with proven IBM offerings as well as third-party solutions
- Extreme scalability
- Multiple configurations that are professionally designed, prebuilt and validated

ViON's DataAdapt Active Insight enables organizations of any size to harness the power of big data and data analytics. Transform complex data volumes into valuable insights through operational efficiency, advanced analytics, and exploration and discovery. The Active Insight solution is powered by IBM Open Platform for Apache Hadoop and Apache Spark and by IBM BigInsights. It helps organizations integrate data from disparate sources, analyze big data in real time, anticipate future outcomes, and rapidly generate insights for capitalizing on new opportunities.

Operational Efficiency

Hadoop is a powerful platform for analytics and data discovery that can augment data warehouses and unstructured repositories. It can also open the door to new analytics and new ways of doing things, and it can be a more cost-effective data management platform. However, Hadoop can be very challenging to get working: many organizations have spent a year or more getting their Hadoop environment ready for operational use. Active Insight brings the analytics capability of IBM Open Platform for Apache Hadoop, Apache Spark and IBM BigInsights into an integrated system, allowing organizations to deploy a big data solution quickly and efficiently. The simplified administration and intuitive user interface facilitate data ingestion and discovery and reduce the complexity of getting started with Hadoop.

ViON's Active Insight solution makes it easy for the two largest populations of data processing skills available, spreadsheet users and SQL programmers, to create applications and get insights.

Exploration and Discovery

The explosive growth of big data may overwhelm organizations, making it difficult to uncover nuggets of high-value information. Active Insight helps build an environment well suited to exploring and discovering data relationships and correlations that can lead to new insights and improved business results. Data scientists can analyze raw data from big data sources alongside data from the enterprise warehouse and several other sources in a sandbox-like environment. Subsequently, they can combine any newly discovered high-value information with other data to help improve operational and strategic insights and decision making.

The bottom line: with Active Insight, enterprises can finally get their arms around massive amounts of untapped data and mine it for valuable insights in an efficient, optimized and scalable way.

DataAdapt Active Insight Enterprise Capabilities

Active Insight integrates Big SQL, a massively parallel processing (MPP) SQL engine that runs directly on the physical Hadoop Distributed File System (HDFS) cluster rather than using MapReduce, vastly improving performance and SQL execution capabilities over Apache Hive. Big SQL uses standard SQL, so users can access big data in the same way they access other relational data. Active Insight also provides a built-in interactive dashboard for end-user interaction with big data out of the box, and it integrates Big SQL seamlessly into IBM Cognos Business Intelligence for interactive dashboards and activities.

Open & hybrid
Combines Hadoop and Spark for lightning-fast processing of all data. Exploit in the cloud, on-premises, or both to help you be more agile and efficient.

Easy & scalable
Better tooling means less coding and more analytics. Make Hadoop ready for enterprise-scale workloads and performance with governance, data management and analytics tools.

Integrated & seamless
IBM BigInsights helps you integrate SQL, NoSQL, and other data types with Hadoop quickly and easily to enable self-service data access and optimize insight.

Hadoop has many next-generation analytics engines to solve big data problems. Big SQL is a SQL engine for Hadoop that concurrently exploits Hive, HBase and Spark using a single database connection, or even a single query. For this reason, Big SQL is also the ultimate hybrid engine.

Big SQL is also the ultimate platform for data warehouse offload and consolidation, a key use case for many Hadoop users. This is because Big SQL is the first and only SQL-on-Hadoop solution to understand commonly used SQL syntax from other vendors and products such as Oracle, IBM DB2 and IBM PureData System for Analytics. And where data can't be moved to Hadoop, Big SQL provides federated access to RDBMS sources outside of Hadoop with industry-leading IBM Fluid Query technology.

Spark and Hadoop for the open enterprise - with the IBM advantage

Organizations want to spend less time creating an enterprise-ready Hadoop infrastructure, and more time gaining insights and delivering data applications. IBM provides a complete Hadoop solution, including Spark, to scale analytics and applications quickly and easily.

- Benefit from 100% open source Hadoop through the IBM Open Platform, which includes all the Hadoop ecosystem components bundled for you.
- Integrated with Apache Spark for extra processing power.
- Designed for performance and usability, with performance-optimized capabilities, visualization, rich developer tools and powerful analytic functions.
- Built-in modules support a variety of analytics applications and analytics skill sets.
- Delivers management, security and reliability features to support large-scale deployments and accelerate time to value.
- Integrates SQL, NoSQL, and other data types with Hadoop quickly and easily to enable self-service data access and optimize insight.
- Integrated with proven IBM analytics offerings and third-party solutions.
- Provides an ODPi-compliant distribution, which is certified to be interoperable so that you don't have to guess or worry about stability.

Common Hadoop Use Cases

1. Log file(s) analysis or Extract, Transform, and Load (Internet Analysis)
   Commercial products: DataStage, Informatica
   Cost: $
   Cluster requirements: CPU, Memory: LOW; Storage: LOW; Network: LOW

2. Lower cost data storage (Landing Zone, Data Lake)
   Commercial products: EMC, NetApp, XIV, Synology, SAN, NAS
   Cost: $$
   Cluster requirements: CPU, Memory: MED; Storage: HIGH; Network: MED

3. Data warehouse or database (Big SQL, Impala, Hive, Spark SQL, HBase)
   Commercial products: Oracle, DB2, Informix, SQL Server, Netezza, Teradata
   Cost: $$$
   Cluster requirements: CPU, Memory: HIGH; Storage: MED; Network: MED

ViON DataAdapt A1000 System Architecture

System Components      Internet Analysis Entry   Internet Analysis   Data Lake       Complex Analytics
                       (A1020)                   (A1020)             (A1040)         (A1060)
Raw Capacity           140TB                     430TB               1.2PB           750TB

Management Node        3                         3, 4, 6             3, 4, 6         3, 4, 6
  GHz/Cores            2x2.6GHz/16               2x2.6GHz/16         2x2.6GHz/20     2x2.5GHz/24
  Memory               64/1866MHz                64/1866MHz          128/2133MHz     128/2133MHz
  Storage              2x2TB/7.2k                2x2TB/7.2k          2x6TB/7.2k      2x4TB/7.2k

Data Node
  GHz/Cores            2x2.4GHz/12               2x2.4GHz/12         2x2.6GHz/16     2x2.5GHz/24
  Memory               64/1866MHz                64/1866MHz          64/2133MHz      128/2133MHz
  Storage              12x2TB/7.2k               12x2TB/7.2k         12x6TB/7.2k     12x4TB/7.2k

Service Node (Optional), Management Switch, Data Switch: N/A 2 2

- Each configuration contains servers, rack and networking details for BOM creation
- Carefully designed for value in each use case
- Competitive designs that scale and perform
- Latest servers, CPUs, memory, network adapters and switches
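The raw capacity figures in the table are not the same as the amount of data a cluster can hold, because HDFS stores multiple copies of each block. As a rough sizing aid, the sketch below divides raw capacity by a replication factor. It assumes the stock HDFS default of 3 replicas, treats 1.2PB as 1200TB, and introduces a hypothetical `reserve_fraction` parameter for space set aside for temporary or intermediate data. None of these settings are stated in the datasheet, so the results are estimates only.

```python
# Raw capacities taken from the system architecture table, in terabytes.
# Assumption: 1.2PB is treated as 1200TB.
RAW_CAPACITY_TB = {
    "Internet Analysis Entry": 140,
    "Internet Analysis": 430,
    "Data Lake": 1200,
    "Complex Analytics": 750,
}


def usable_capacity_tb(raw_tb, replication_factor=3, reserve_fraction=0.0):
    """Estimate usable HDFS capacity in TB.

    replication_factor: copies HDFS keeps of each block (assumed default: 3).
    reserve_fraction: hypothetical share of raw disk held back for
        temporary and intermediate data (0.0 means none is held back).
    """
    if replication_factor < 1:
        raise ValueError("replication_factor must be at least 1")
    if not 0.0 <= reserve_fraction < 1.0:
        raise ValueError("reserve_fraction must be in [0.0, 1.0)")
    return raw_tb * (1.0 - reserve_fraction) / replication_factor


if __name__ == "__main__":
    for name, raw_tb in RAW_CAPACITY_TB.items():
        usable = usable_capacity_tb(raw_tb)
        print(f"{name}: {raw_tb} TB raw, roughly {usable:.0f} TB usable")
```

A lower replication factor or compression would change these numbers, so actual sizing should be confirmed against the deployed cluster configuration.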

Securing your information

Active Insight provides four options for authentication: No Authentication, Flat File authentication, LDAP authentication and PAM authentication. In addition, the Active Insight installer provides the option to configure HTTPS to potentially provide more security when a user connects to the Active Insight web console.

DataAdapt Active Insight security includes perimeter security, authentication, and authorization. All communication with Hadoop is controlled and moderated. Active Insight includes the following features:

- LDAP and Active Directory integration
- Support for identity federation based on HTTP headers
- Service-level authorization and auditing

Optional protection for big data at rest can also be incorporated, providing data protection and encryption along with monitoring and auditing of all Hadoop activities.

Managed Services

Our Big Data professional services can be performed as needed or as a full-service solution. ViON experts will work with your technical and business staff in a way that trains and empowers your current workers to use and manage your DataAdapt system and helps reduce costs.

Integrated Release Management
- Quarterly patching and updates
- Procedural testing and validation
- Security hot fixes
- Firmware updates as required

Weekly/Monthly Health Checks
- Systems health
- Application health
- Network health

Monthly Reporting
- Custom application and network device reports
- Patch review: installed and revision levels
- Firmware review

Monthly Technical Account Review
- Dedicated Technical Account Manager
- Reporting review and analysis
- Support case review
- Customer satisfaction review

196 Van Buren Street, Herndon, Virginia (571) vion.com