Next Generation Bioinformatics on the Cloud

Similar documents
DNA. bioinformatics. genomics. personalized. variation NGS. trio. custom. assembly gene. tumor-normal. de novo. structural variation indel.

Globus Genomics at GSI Boston University. Dinanath Sulakhe, Alex Rodriguez

Computational Challenges of Medical Genomics

CyVerse Overview. National Academies Special Topics Summer Institute on Quantitative Biology

ILLUMINA SEQUENCING SYSTEMS

Accelerate High Throughput Analysis for Genome Sequencing with GPU

Transcriptomics analysis with RNA seq: an overview Frederik Coppens

Introduction to iplant Collaborative Jinyu Yang Bioinformatics and Mathematical Biosciences Lab

This topic focuses on how to prepare a customer for support, and how to use the SAP support processes to solve your customer s problems.

Next-generation sequencing technologies

resequencing storage SNP ncrna metagenomics private trio de novo exome ncrna RNA DNA bioinformatics RNA-seq comparative genomics

LARGE DATA AND BIOMEDICAL COMPUTATIONAL PIPELINES FOR COMPLEX DISEASES

Bioinformatics: A perspective

Persistent Systems SanGeniX Solution on HANA

Bioinformatics: A perspective

Globus Genomics: An End-to-End NGS Analysis Service on the Cloud for Researchers and Core Labs

solid S Y S T E M s e q u e n c i n g See the Difference Discover the Quality Genome

Course Presentation. Ignacio Medina Presentation

ACCELERATING GENOMIC ANALYSIS ON THE CLOUD. Enabling the PanCancer Analysis of Whole Genomes (PCAWG) consortia to analyze thousands of genomes

Genomic Data Is Going Google. Ask Bigger Biological Questions

Cisco Smart Software Manager Satellite. Development Licensing Office Mar, 2015

NextSeq 500 System WGS Solution

Automated Service Builder

ArcGIS Data Reviewer: Integrating Data Quality Control into Web Applications. Shankar Chandrasekaran

Niagara Update N4 & Niagara Analytics. January 19, 2018 The Langham Luxury Hotel, Chicago, IL

Sanger vs Next-Gen Sequencing

ngs metagenomics target variation amplicon bioinformatics diagnostics dna trio indel high-throughput gene structural variation ChIP-seq mendelian

SAMPLE TITLE HERE. The Seven Bridges Cloud Ecosystem: Enabling Interoperable Data Access and Analysis. Liz Williams, PhD


Ensembl workshop. Thomas Randall, PhD bioinformatics.unc.edu. handouts, papers, datasets

My Oracle Support Configuration Manager

1000 Insect Transcriptomes Evolution - 1KITE

An innovative approach to genetic testing for improved patient care

Introduction to RNA-Seq in GeneSpring NGS Software

Analytics Behind Genomic Testing

Bluemix Overview. Last Updated: October 10th, 2017

Bioinformatics and computational tools

Introduction to BIOINFORMATICS

G E N OM I C S S E RV I C ES

ENVIROfying the Future Internet THE ENVIRONMENTAL OBSERVATION WEB AND ITS SERVICE APPLICATIONS WITHIN THE FUTURE INTERNET A Mobile Application for

DNA. bioinformatics. epigenetics methylation structural variation. custom. assembly. gene. tumor-normal. mendelian. BS-seq. prediction.

Complete automation for NGS interpretation and reporting with evidence-based clinical decision support

The Internet of Things Wind Turbine Predictive Analytics. Fluitec Wind s Tribo-Analytics System Predicting Time-to-Failure

User Requirement Specifications

Matthew Tinning Australian Genome Research Facility. July 2012

SMRT Analysis Barcoding Overview (v6.0.0)

FILE BASED MEDIA WORKFLOWS IN THE CLOUD MOVING THE WORLD S DATA AT MAXIMUM SPEED

Oracle Government Tech Cloud Service Descriptions

Introduction to the MiSeq

Monitoring & reporting. Scan management. Print management YSOFT SAFEQ 5. Private Cloud. Security & access management

NEXT GENERATION SEQUENCING. Farhat Habib

Introduction to RNAseq Analysis. Milena Kraus Apr 18, 2016

Comparison Table for PTC PDM/PLM Solutions

How much sequencing do I need? Emily Crisovan Genomics Core

Bioinformatics: A perspective

About Strand NGS. Strand Genomics, Inc All rights reserved.

How much sequencing do I need? Emily Crisovan Genomics Core September 26, 2018

WORKSHOP ON RECRUITMENT COSTS SURVEY

CGE Pipeline. Content 1. The User System 2. The Batch Upload 3. The Pipeline 4. The List Tool 5. The Map Tool 6. FuturePlans 7.

European Genome phenome Archive at the European Bioinformatics Institute. Helen Parkinson Head of Molecular Archives

Introduction. Highlights. Prepare Library Sequence Analyze Data

Genomic resources. for non-model systems

Bionano Access 1.1 Software User Guide

Workshop Overview. J Fass UCD Genome Center Bioinformatics Core Monday September 15, 2014

Welcome to the NGS webinar series

Autotask Workplace for Business Verticals

Big Data and Machine Learning for Predictive Maintenance

High peformance computing infrastructure for bioinformatics

Autotask Workplace for Business Verticals

The Expanded Illumina Sequencing Portfolio New Sample Prep Solutions and Workflow

Bionano Access v1.1 Release Notes

Agilent GeneSpring GX 10: Beyond. Pam Tangvoranuntakul Product Manager, GeneSpring October 1, 2008

Cardiology Information Management System. Sentinel

Bionano Access Software User Guide

The New Genome Analyzer IIx Delivering more data, faster, and easier than ever before. Jeremy Preston, PhD Marketing Manager, Sequencing

Science as a Service Accelerating Scientific Discovery using Cloud

Sequencing technologies. Jose Blanca COMAV institute bioinf.comav.upv.es

Mobile BI with Microsoft Tools

White Paper. Non Functional Requirements of Government SaaS. - Ramkumar R S

GPU Technology Conference 2012 May 14-17, 2012 San Jose, California BingQiang WANG, Head of Scalable Computing, BGI

Agilent Genomic Workbench 7.0

CAPTURE-BASED APPROACH FOR COMPREHENSIVE DETECTION OF IMPORTANT ALTERATIONS

Oracle Enterprise Data Quality Product Roadmap and Statement of Direction. October 2016

From Lab Bench to Supercomputer: Advanced Life Sciences Computing. John Fonner, PhD Life Sciences Computing

Experimental Design Microbial Sequencing

Use Case: Salesforce connector

The Journey to Cognos Analytics. Paul Rivera, Eric Smith IBM Analytics Lab Services

Processing Data from Next Generation Sequencing

on-premises to Dynamics 365 (online) Migration

China National Grid --- BioNode. Jun Wang Beijing Genomics Institute

Testing Solutions for Hyper-Connected Apps

De Novo Assembly (Pseudomonas aeruginosa MAPO1 ) Sample to Insight

Meet the iseq 100 System.

Analyzing Data with Power BI

DIGITAL & ONLINE SOLUTIONS FOR CORPORATE PROFESSIONALS

SeqStudio Genetic Analyzer

Next Generation Sequencing

Case Study BONUS CHAPTER 2

SAP Web Intelligence

SciLifeLab Bioinformatics Platform National Bioinformatics Infrastructure Sweden (NBIS)

Transcription:

Next Generation Bioinformatics on the Cloud http://www.easygenomics.com Sifei He Director of BGI Cloud hesifei@genomics.cn Xing Xu, Ph.D Senior Product Manager EasyGenomics BGI xuxing@genomics.cn Contact Us info@easygenomics.com

Agenda Vision and Strategy Problems and Solutions Product Introduction LIVE Demo Future Roadmap Q&A

Trend of Volume and Cost $/Mb D N A S e q u e n c e Human Genome Sequenced Figures adapted from Sboner A, et al.: The real cost of sequencing: higher than you think! Genome Biology 2011, 12:125 Numbers and Images from private research and the open Internet 3

Geological side of the problem + Sequencing is a COMMODITY and happens EVERYWHERE. BGI Images from omicsmaps.com

Interpretation is the KEY Analysis and Interpretation is the KEY Application is the Silver Bullet

Difficulties of Analysis Primary analysis Secondary Analysis Tertiary Analysis Post Tertiary Analysis Base calling Mapping Variant Calling In-depth Annotation Data throughput Data storage Computation intensive Data storage Complicated Algorithms Computation intensive Lack of knowledge

Problems and Solutions Solutions Cloud High Speed Data Exchange Workflows +) Resource Management Problems: Big genomic data Geological distribution Algorithm integration Computational demand 7

EasyGenomics EasyGenomicsis the bioinformatics platform for research and applications on the cloud

EasyGenomics Database, Data management Computational Resources Algorithms, Workflows, Reports High speed connection Web portal, Simple UI EasyGenomicsis the bioinformatics platform for research and applications on the cloud

Bioinformatics Core Algorithms: Carefully chosen, tested and optimized Workflows: Whole genome resequencing, exome resequencing, RNA-Seq, small RNA, de novo Assembly

Enabling Technology Hadoop-based Flexible Computing Best Practice Award for IT Infrastructure Human Genome SOAPdenovo EasyGenomics TM (192 cores) Genome Coverage 86% 86% Assembly Time 70h 55h No. of Servers 1 15 Memory Size 500GB x 1 24 GB x 15 Mode Centralized Distributed 11

Data Management Sample A Analysis I Analysis II Raw Data Sample B Analysis X Project I Sample, Analysis, Project Mimicking real research procedure Automatic management of underlying data structure

High Speed Data Exchange Aspera spatented fasp high-speed file transferring technology 10~100X fasterthan FTP 13

Resource Management Managed Multitenancy Workspace Data Structure Managed Task Safe Backup

Security Access Multitenancy Isolation Compliance Username/Password Biometric access HTTPS, Asperafastp TM Trusted database connection ACL, Data encryption Physical isolation Virtual isolation ISO27000

Introduction to EasyGenomics TM Xing Xu, Ph.D Senior Product Manager

Homepage Navigation Tabs Three task portals Status of recent works Warning and Logging

Project Table Add/Remove Project Project list table Filter and search box Operation short cuts

Analysis Table

Sample Table

Read Upload Read Upload Portal

Upload Raw Data Create a Sample

Upload Raw Reads (Asperaconnect server)

Create a sample Create a Sample

Create a Sample Sequencing information Add Read Group Filter settings Mapping settings

Add read groups Create a Sample

Sample Page Individual report for each lane Summarized report for all lanes

Sequencing Quality Report 28

Mapping Report 29

Data Analysis Portal Create a Analysis

Create an Analysis

Create an Analysis Selected sample(s) One selected sample => Single Analysis Multiple selected samples => Batch Analyses

Create an Analysis Selectable modules Predefined Settings Shortcut

Create an Analysis

Create an Analysis Customizable

Create an Analysis

Create an Analysis

Data Harvest Portal Data Management

Upload Management

Download Management

LIVE DEMO

Sifei He Director of BGI Cloud 42

Applications Complex -Omicsresearch Genetic testing Diagnostics

One More Thing FREE Ref: BOSTON Please Visit BGI Booth @ 213 Subject to T&C

Q & A 45

BACKUP 46