Building a HIPAA Compliant Data Lake on AWS

Similar documents
EBOOK: Cloudwick Powering the Digital Enterprise

Building a Data Lake on AWS

Health Solutions. Commercial Health Solutions Overview EXPANDING INSIGHT. ENSURING VALUE. IMPROVING OUTCOMES.

How to Design a Successful Data Lake

Optum Intelligent EDI. Achieve higher first-pass payment rates and help your organization get paid quickly and accurately.

Guide to Modernize Your Enterprise Data Warehouse How to Migrate to a Hadoop-based Big Data Lake

Cask Data Application Platform (CDAP)

Microsoft Azure Essentials

Smart Mortgage Lending

LIAISON ALLOY HEALTH PLATFORM

Analytics in Healthcare. Preparing for advance healthcare analytics

Big Data Management Best Practices for Data Lakes Philip Russom, Ph.D.

Big Data & Clinical Informatics

WELCOME TO. Cloud Data Services: The Art of the Possible

: Boosting Business Returns with Faster and Smarter Data Lakes

Optum Performance Analytics

How Plans and Providers Can Leverage Their HIPAA Investment for Productively and Profitability

MIGRATING AND MANAGING MICROSOFT WORKLOADS ON AWS WITH DATAPIPE DATAPIPE.COM

Enterprise Information Governance, Archiving & Records management

Copyright - Diyotta, Inc. - All Rights Reserved. Page 2

Integrated Social and Enterprise Data = Enhanced Analytics

Nuance Power PDF is PDF uncompromised.

Sharing current and archived patient data in the EMC Federation Enterprise Hybrid Cloud

Session 87PD, Actuaries in the Cloud: Data Lakes to Downstream Analytics

AWS Architecture Case Study: Real-Time Bidding. Tom Maddox, Solutions Architect

Analytics in the Cloud, Cross Functional Teams, and Apache Hadoop is not a Thing Ryan Packer, Bank of New Zealand

DLT AnalyticsStack. Powering big data, analytics and data science strategies for government agencies

Value of. Clinical and Business Data Analytics for. Healthcare Payers NOUS INFOSYSTEMS LEVERAGING INTELLECT

Health Information. for Government. Maximize the Value of Your Health Information Exchange

The power of the Converge platform lies in the ability to share data across all aspects of risk management over a secure workspace.

YASHAJIT SAHA & ABHISHEK SHARMA, SUBJECT MATTER EXPERTS, RESEARCH & ANALYTICS ADVANCED ANALYTICS: A REMEDY FOR COMMERCIAL SUCCESS IN PHARMA.

A Seven-Step Approach to a Clinically Integrated Network. April 28, 2016 Track B ACOs, Population Health, Affiliation and Other Issues

Health Solutions HEALTH SOLUTIONS OVERVIEW EXPANDING INSIGHT. ENSURING VALUE. IMPROVING OUTCOMES.

Leveraging Collaboration to Assess ICD-10 Readiness and Reduce ICD-10 Operational and Financial Risks

VITERA (GREENWAY) INTERGY 9.0 MEDICAL REVIEW

Social Media in Healthcare Leverage Social Media for Real Business Impact

Turn Data into Business Value

Ensuring a Sustainable Architecture for Data Analytics

Spark and Hadoop Perfect Together

Simplifying Your Modern Data Architecture Footprint

2016 Big Data. Mejores pra cticas en AWS

Achieve Powerful Business Benefits by Streamlining Document Workflows

MDM offers healthcare organizations an agile, affordable solution To deliver high quality patient care and better outcomes

THE ESSENTIAL ELEMENT of a

Transforming Healthcare Communications

CRM and the Future of Healthcare Engagement

E-guide Hadoop Big Data Platforms Buyer s Guide part 1

Continuous Delivery and Immutable Infrastructure at Whistle Labs

The Five Essential Elements of Self-Service Data Integration

The IBM Reference Architecture for Healthcare and Life Sciences

How to Select, Align, Develop, and Retain Highly-Engaged People in Healthcare

Business Analytics and Optimization An IBM Growth Priority

Sr. Sergio Rodríguez de Guzmán CTO PUE

Optum. One. Award-winning intelligent health analytics platform

Reselling Update for AWS: Solution Provider Program & AWS Marketplace

Teradata Next-Generation Healthcare Analytics

Pharmaceutical Industry Polpharma S.A.

What's Shaping the Future of Enterprise Content. Management? JOHN O MELIA

Data Strategy: How to Handle the New Data Integration Challenges. Edgar de Groot

Hadoop in the Cloud. Ryan Lippert, Cloudera Product Cloudera, Inc. All rights reserved.

A Non-Actuarial Look at Predictive Analytics in Health Insurance Past, Present and Future. November 2016 Rajiv Sood

Bringing Big Data to Life: Overcoming The Challenges of Legacy Data in Hadoop

Got Data Silos? Automate Data Ingestion Into Isilon In Support Of Analytics

Functional Requirements for Enterprise Clinical Data Management: Solving Technical Problems, Satisfying User Needs

Venn Health Partners. Venn Health Partners, 906 Oak Tree Ave, Suite R, South Plainfield New Jersey, 07080,

Embracing Disruption: Financial Services and the Microsoft Cloud

GDPR: Centralize Unstructured Data Governance Across On-premises and Cloud

FLINK IN ZALANDO S WORLD OF MICROSERVICES JAVIER LOPEZ MIHAIL VIERU

Applications for Clinical, Financial and Administrative Patient Risk:

Achieving Business Resilience in HEALTHCARE

New Frontiers in Personalized Medicine

Improve your probability. of success

MapR Pentaho Business Solutions

GET MORE VALUE OUT OF BIG DATA

Architecture Overview for Data Analytics Deployments

Transforming Big Data to Business Benefits

BIG Data Analytics AWS Training

BROKER-DEALERS. Protegent CAT for broker-dealers

Preparing For & Managing a RADV Audit

How Cisco IT Modernizes Shopping for IT Services with estore

Bringing the Power of SAS to Hadoop Title

Secure information access is critical & more complex than ever

Spend visibility and shared services Strategies to address growing pains for long-term care organizations

KNOWLEDGE MANAGEMENT: CONNECTIONS TO SHARE EXPERTISE AND GAIN EFFICIENCIES

Provider Network Analytics:

Industrialized Clinical Data Standards Management Speed of automation, Power of accuracy and Transforming clinical data into business intelligence

Customer Data Management in the Automotive Industry: Creating Value

Hospitals and Health Systems: Beginning the Journey to the Cloud with Medical Imaging

Microsoft Big Data. Solution Brief

Going Big Data? You Need A Cloud Strategy

Meeting the New Standard for AWS Managed Services

TigerText Solutions Guide

Metrics That Matter Integration Process Overview

Contact Center Integrations Portfolio. Innovation Collaboration Evolution Results

Adobe Cloud Platform

Over a Hundred Reporting Clients throughout the country. 95%+ Client Retention

EMC Information Infrastructure Solutions for Healthcare Providers. Delivering information to the point of care

Physician Marketing & Outreach Growing Referring Physician Lifetime Value

AWS Life Sciences Competency Consulting Partner Validation Checklist

Solutions to Accelerate Compliance with Affordable Care Act (ACA) Mandates and HIPAA Standards IBM Redbooks Solution Guide

Transcription:

Building a HIPAA Compliant Data Lake on AWS with

2 About Since 1998, has developed healthcare consumer engagement solutions to address some of the industry s greatest challenges from adherence, to prevention, to condition management, to brand loyalty and retention. Pay-for-performance in healthcare incentivizes payers and providers to keep a population under their care healthier. The Pay-for-performance arrangements provide financial incentives to hospitals, physicians, and other healthcare providers to carry out such improvements and help achieve optimal outcomes for patients. This is a departure from fee-for-service, where payments are for each service used., focuses on Health Engagement Management, and acts on behalf of healthcare organizations (e.g. hospitals, clinics, pharmacies, insurance companies, etc.) in order to engage people at the right time, with the right message, and in the right channel to capture relevant metrics to analyze the overall value provided by Healthcare.

3 Their Challenge analyzes more than 300 million outreaches per year, primarily through outbound phone calls with Interactive Voice Response (IVR,) technology but rapidly growing channels such as SMS, email, and in-bound IVR. For Eliza, the outreach results are the questions, the responses form a decision tree, and each question and response are captured as a pair, E.g. <question, response> = < Did you visit your physician in the last 30 days?, Yes > This type of data poses challenges in processing and analyzing for example, you can t have a table with fixed columns to store it. The majority of data at Eliza takes the form of outreach results captured as a set of <attribute> and <attribute value> pairs. Other data sets at Eliza include structured data. This data is received from various systems that include customers, claims data, pharmacy data, Electronics Medical Records (EMR/EHR) data, and enrichment data. Needless to say, there is a considerable amount of variety and quality considerations within the data that Eliza manages. Eliza s organic growth resulted in an incumbent data platform and strategy that were architected to store multiple copies of data across multiple systems, and allowed for variation and modification, which in turn created inconsistent versions across the company. This resulted in higher platform costs because multiple systems contained multiple copies of the data. Their architecture created significant scaling and quality control issues for the company because of a lag between the time something happened and the time to derive value. These problems combined limited Eliza s growth and scale as the complexity of their platform meant extended times for evolution.

4 Building a Data Lake with the Help of NorthBay Solutions NorthBay, an Advanced AWS Partner Network (APN) Consulting Partner and AWS Big Data Competency Partner, was ultimately chosen as the Big Data partner to architect, implement data storage, process infrastructure and improve the overall performance of the Eliza s process when analyzing massive amounts of data. Eliza originally contacted NorthBay to seek help and guidance from an organization that had deep expertise on AWS and had familiarity and experience working with healthcare organizations. NorthBay was one of the first APN Consulting Partners to hold the AWS Big Data competency and their ability to consider and discuss various best practice approaches to solve the data volume, varity and compliance requirements for Eliza s use case quickly earned them Eliza s trust. After evaluating Eliza s use case and business needs, NorthBay architected a Data Lake on AWS, adhering to best practices recommendations, such as creating selfdocumenting and intuitive data paths. General benefits of the Data Lake architectural pattern include: Decoupling Storage and Compute Ingesting & Storing original datasets in native or close to native format Allowing for both real-time and batch processing Consumption of datasets from storage with variable compute Providing metadata, catalog and data discovery for content in the data lake Enabling access to data through entitlements and access control.

5 Solving Key Business Challenges on AWS The key challenges Eliza needed to address arise out of key compliance requirements like the Health Insurance Portability and Accountability Act (HIPAA), the variety of the data ingested, and the need for a common view of the data. Meeting HIPAA Requirements AWS enables customers and partners to build HIPAA-compliant applications. Based on the requirements around Encryption in transit and Encryption at rest and following guidelines mentioned in the whitepaper, as well as various HIPAA compliance guidelines, a number of steps were implemented by NorthBay to ensure the Data Lake was HIPAAcompliant, including spinning up Amazon Elastic MapReduce (EMR) in a dedicated Virtual Private Cloud (VPC), encrypting and decrypting data when needed, and launching a data pipeline orchestration process. EMR resources are provisioned in dedicated VPCs, and most of the processing is done on transient clusters which leverage spot/reserved and on-demand instances. In addition, a long-running cluster was also provisioned for ad-hoc analysis of data. To make the real-time streaming data ingestion HIPAA-compliant for Eliza, NorthBay leveraged Amazon Kinesis Producer Library to encrypt the data prior to putting it in Amazon Kinesis, and then decrypting it before putting it into Amazon Simple Storage Service (Amazon S3). NorthBay also launched a data pipeline orchestration process, which in turns accesses resources in a dedicated VPC. Data Obfuscation, Data Cleansing, and Data Mapping To meet Eliza s interpretation of protecting data under HIPAA, NorthBay established a business rule that when dealing with PII (Personally Identifiable Information) and PHI (Personal Health Information) data. In non-production environments, the PII must be obfuscated or masked before it can be shared with the development teams. Considering the volume and velocity of the data, the obfuscation task itself became a Big Data

problem. To solve this problem NorthBay helped develop an algorithm and data map that reads the data and applies the corresponding obfuscation to protect the data. The data map also provided Eliza a common view across all of their data sources, an issue that they had been struggling with previously. 6 The data received by Eliza is populated by disparate systems and can include free-form entries by consumers/customers, creating inconsistencies among each entry. NorthBay helped Eliza implement an additional process to cleanse the data and bring it to a common format. The schema structure that was put in places allows Eliza to apply multiple data cleansing rules on the same field and choose the order in which the rules are applied. Benefits With NorthBay, Eliza now has the ability to consume any type of data (structured, semistructured, and unstructured) at any scale. With support for their entire storage lifecycle (frequent, infrequent & archived) they are able to store raw copies of input data and have eliminated the need to enforce schema during data load processes. NorthBay also provided Eliza with a Data Catalog and Seach capability, which enables Eliza to access their data quicker and easier than before. Unlike with a traditional data warehouse, NorthBay on AWS enabled Eliza to decouple the resources and therefore the costs associated with compute and storage. General benefits of the Data Lake architectural pattern include: Decoupling Storage and Compute Ingesting & Storing original datasets in native or close to native format Allowing for both real-time and batch processing Consuming of datasets from storage with variable compute Providing metadata, catalog and data discovery for content in the data lake Enabling access to data through entitlements and access control.

7 Conclusion By building a Data Lake to streamline data management, Eliza can quickly react to their clients pressing needs to prove results in closing gaps in care, retaining membership, and adhering to medications which support the outcomes base model.