Genomic Data Analysis Services Available for PL-Grid Users

Similar documents
From Lab Bench to Supercomputer: Advanced Life Sciences Computing. John Fonner, PhD Life Sciences Computing

Sanger vs Next-Gen Sequencing

Globus Genomics at GSI Boston University. Dinanath Sulakhe, Alex Rodriguez

Applications of short-read

Course Presentation. Ignacio Medina Presentation

Sequencing applications. Today's outline. Hands-on exercises. Applications of short-read sequencing: RNA-Seq and ChIP-Seq

Research Computing. Information for New and Junior Faculty, Researchers and Graduate Students August, 2016

UAB DNA-Seq Analysis Workshop. John Osborne Research Associate Centers for Clinical and Translational Science

ACC Cyfronet Centre Forum

NOW GENERATION SEQUENCING. Monday, December 5, 11

Gene Regulation Solutions. Microarrays and Next-Generation Sequencing

Next-Generation Sequencing Gene Expression Analysis Using Agilent GeneSpring GX

Long and short/small RNA-seq data analysis

Galaxy Platform For NGS Data Analyses

NGS in Pathology Webinar

Arraygen Technologies Pvt. Ltd.

Research Powered by Agilent s GeneSpring

About Strand NGS. Strand Genomics, Inc All rights reserved.

Introduction to RNA-Seq in GeneSpring NGS Software

Introduction to NGS analyses

Globus Genomics: An End-to-End NGS Analysis Service on the Cloud for Researchers and Core Labs

Agilent Genomics Software Future Directions

Reads to Discovery. Visualize Annotate Discover. Small DNA-Seq ChIP-Seq Methyl-Seq. MeDIP-Seq. RNA-Seq. RNA-Seq.

LARGE DATA AND BIOMEDICAL COMPUTATIONAL PIPELINES FOR COMPLEX DISEASES

C3BI. VARIANTS CALLING November Pierre Lechat Stéphane Descorps-Declère

Transcriptomics analysis with RNA seq: an overview Frederik Coppens

Agilent GeneSpring GX 10: Beyond. Pam Tangvoranuntakul Product Manager, GeneSpring October 1, 2008

Mapping Next Generation Sequence Reads. Bingbing Yuan Dec. 2, 2010

Deakin Research Online

IDENTIFYING A DISEASE CAUSING MUTATION

RNA-sequencing. Next Generation sequencing analysis Anne-Mette Bjerregaard. Center for biological sequence analysis (CBS)

Surely Better Target Enrichment from Sample to Sequencer and Analysis

Surely Better Target Enrichment from Sample to Sequencer

Pioneering Clinical Omics

Introduction to Next Generation Sequencing (NGS) Data Analysis and Pathway Analysis. Jenny Wu

Next Generation Sequencing Data Analysis with BioHPC. Updated for

Next Generation Sequencing

AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE

Experiences in implementing large-scale biomedical workflows on the cloud: Challenges in transitioning to the clinical domain

RNA-Seq analysis workshop

Compute- and Data-Intensive Analyses in Bioinformatics"

Eucalyptus gene assembly

Ecole de Bioinforma(que AVIESAN Roscoff 2014 GALAXY INITIATION. A. Lermine U900 Ins(tut Curie, INSERM, Mines ParisTech

Integrating MATLAB Analytics into Enterprise Applications

Introduction to iplant Collaborative Jinyu Yang Bioinformatics and Mathematical Biosciences Lab

BICF Variant Analysis Tools. Using the BioHPC Workflow Launching Tool Astrocyte

Welcome to the NGS webinar series

Measuring and Understanding Gene Expression

From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow

Assembling a Cassava Transcriptome using Galaxy on a High Performance Computing Cluster

Galaxy for Next Generation Sequencing 初探次世代序列分析平台 蘇聖堯 2013/9/12

Globus Genomics Powered by Galaxy. Ravi K Madduri and Globus Genomics Team

ILLUMINA SEQUENCING SYSTEMS

earray 5.0 Create your own Custom Microarray Design

CNV and variant detection for human genome resequencing data - for biomedical researchers (II)

1. Introduction Gene regulation Genomics and genome analyses

RNA-Seq Module 2 From QC to differential gene expression.

BST 226 Statistical Methods for Bioinformatics David M. Rocke. March 10, 2014 BST 226 Statistical Methods for Bioinformatics 1

Agilent Genomic Workbench 7.0

NextSeq 500 System WGS Solution

HPC 2.0 for Genomics. An Introduction to IBM HPDA Framework & Reference Architecture. Frank Lee, PhD IBM Systems

Introducing QIAseq. Accelerate your NGS performance through Sample to Insight solutions. Sample to Insight

Data Analysis with CASAVA v1.8 and the MiSeq Reporter

A Crash Course in NGS for GI Pathologists. Sandra O Toole

Galaxy. Data intensive biology for everyone. / #usegalaxy

Biology 644: Bioinformatics

Next Generation Sequencing (NGS) Market Size, Growth and Trends ( )

Total RNA isola-on End Repair of double- stranded cdna

The New Genome Analyzer IIx Delivering more data, faster, and easier than ever before. Jeremy Preston, PhD Marketing Manager, Sequencing

G E N OM I C S S E RV I C ES

Centro Nacional de Análisis Genómico. Where are the Bottlenecks of Genome Analysis Today? Teratec. Ecole Polytechnique, Palaiseau, F.

Introducing combined CGH and SNP arrays for cancer characterisation and a unique next-generation sequencing service. Dr. Ruth Burton Product Manager

Welcome! Introduction to High Throughput Genomics December Norwegian Microarray Consortium FUGE Bioinformatics platform

SCALABLE, REPRODUCIBLE RNA-Seq

Next Genera*on Sequencing So2ware for Data Management, Analysis, and Visualiza*on. Session W14

solid S Y S T E M s e q u e n c i n g See the Difference Discover the Quality Genome

Analysis Datasheet Exosome RNA-seq Analysis

QIAseq Targeted Panel Analysis Plugin USER MANUAL

Illumina s Suite of Targeted Resequencing Solutions

Genomic solutions for complex disease

The Human Toxome Project a test case for pathway identification by multiomics. Thomas Hartung

WELCOME. Norma J. Nowak, PhD Executive Director, NY State Center of Excellence in Bioinformatics and Life Sciences (CBLS)

Gene Expression Profiling and Validation Using Agilent SurePrint G3 Gene Expression Arrays

DNA. bioinformatics. genomics. personalized. variation NGS. trio. custom. assembly gene. tumor-normal. de novo. structural variation indel.

oqtans A Galaxy-Integrated Workflow for Quantitative Transcriptome Analysis from NGS Data

ChIP-Seq, mrna-seq, & Resequencing via the Genboree Workench

ACCELERATING GENOMIC ANALYSIS ON THE CLOUD. Enabling the PanCancer Analysis of Whole Genomes (PCAWG) consortia to analyze thousands of genomes

SureSelect Target Enrichment for the Ion Proton TM Next Generation Sequencing System

locuz.com Professional Services Locuz HPC Services

Next-Generation Sequencing. Technologies

CAPTURE-BASED APPROACH FOR COMPREHENSIVE DETECTION OF IMPORTANT ALTERATIONS

Wheat CAP Gene Expression with RNA-Seq

Analysis of RNA-seq Data. Feb 8, 2017 Peikai CHEN (PHD)

To determine MRK-003 IC50 values, cell lines were plated in triplicate in 96-well plates at 3 x

B&DA Committee Bioinformatics and Data Analysis. PAG January 2016

Transcriptome analysis

WebMeV A Cloud Based Platform for Genomic Analysis Yaoyu E. Wang

IMGM Laboratories GmbH. Sales Manager

'Bioinformatics in academia as related to ehealth' - including the "Genomic Virtual Lab"

Nature Methods: doi: /nmeth.3732

Transcription:

Domain-oriented services and resources of Polish Infrastructure for Supporting Computational Science in the European Research Space PLGrid Plus Domain-oriented services and resources of Polish Infrastructure for Supporting Computational Science in the European Research Space PLGrid Plus Genomic Data Analysis Services Available for PL-Grid Users Clinical Genomic Analysis (CGA) Workshop

ACC Cyfronet AGH and PL-Grid Infrastructure 2 Academic Computer Centre Cyfronet AGH Established in 1973 (40 years of experience) Main mission: to provide network, computational power and data storage capabilities for Polish science ~374 TFlops (145@top500), 2.5 PB (disks) and 3.5 PB (tapes) Regular and bigmem nodes, vsmp, GPGPU, FPGA, MPI over Infiniband Details: http://kdm.cyfronet.pl/ PL-Grid Infrastructure for Polish science Five computing centers with Cyfronet as the consortium leader Total: ~588 TFlops and ~5.6 PB (disks) Planned for 1Q2015: >900 TFlops, 8 PB Available free of charge to all Polish scientists and their foreign collaborators Details: http://www.plgrid.pl

Using PL-Grid Infrastructure 3 Register at https://portal.plgrid.pl User verification process based on Polish OPI number Assistants and foreigners are confirmed by Polish PIs Variety of basic and higher level services available after login Local SSH access, cloud computing, middlewares Considerable library of installed applications GATK, MACS, SAMTools, Picard, TopHat, Bowtie, (p)bwa, R/Bioconductor, AutoDock/AutoGrid, BLAST, Clustal, CPMD, Gromacs, NAMD, Matlab, Mathematica Free to compile and install own applications using the shell login Possibility to use own commercial licenses on HPC resources Questions: https://helpdesk.plgrid.pl or helpdesk@plgrid.pl

PLGrid PLUS: Domain-oriented Services, Resources and Tools 4 Preparation of specific computing environments, i.e., solutions, services and extended infrastructure tailored to the needs of different groups of scientists (2012-2014) Life Science among 13 domains of science LS Domain Leader: Kraków LifeScience Klaster Tasks: Analysis of user needs Development of services Procurement and deployment of applications on HPC res. Continuous assistance for the Life Science community

DNA Microarray Integromics Analysis Platform (1/2) 5 https://lifescience.plgrid.pl/ For people who perform biological investigations using DNA microarrays Goal: help to analyze gene expression information and correlate it with other clinical data In development since 1Q2013, first version deployed Analyses available now: normalization, clustering, SAM, T- test, GO-based enrichment, ANNs, PCA, panel filtering Integromics analyses in preparation CCA, PLS (gene expression and lipidomics) Roleswitch, TargetScore (gene expression and mirna) Supported models: Affymetrix, Agilent (support for others is possible in case of demand)

DNA Microarray Integromics Analysis Platform (2/2) 6 Notable features Integration with EBI ArrayExpress (import, MIAME) Sharing experiments with others Importing own data for further analysis Supported languages: PL, EN Manual: https://docs.cyfronet.pl/x/jpaz Cooperation Jagiellonian University Medical Collage, Kraków Medical University of Silesia, Katowice Institute of Oncology, Gliwice

Galaxy NGS Server (1/2) 7 https://galaxy.plgrid.pl/ Galaxy is an open, web-based platform for data intensive biomedical research. Goal: deploy high-performance, high-throughput NGS data analysis solution on top of HPC resources for PL-Grid users Needs a lot of adjustments and in-house add-on development Work started 12.2013, first version planned ~08.2014 Planned integrated tools (list not closed): GATK, SAMtools, Bowtie, TopHat, BWA, bedtools, Cufflinks, Picard, SnpEff/SnpSift, Flexbar, FastQC, MACS References: human, mouse, domestic animals Targeted platforms: Illumina *Seq, Roche 454, Ion Proton

Galaxy NGS Server (2/2) 8 Notable features Full integration with Zeus cluster and large disk arrays PBS and MQ system for effective job queuing and management Secured environment (open for all PL-Grid users, not public ) All major Galaxy features (history, sharing, viewers) enabled Well documented workflows designed by NGS experts Basics (alignment and quality control, trimming, filtering) DNA-Seq, RNA-Seq, variant calling, SNP calling, methylation, exome analysis with annotations Manual: https://docs.cyfronet.pl/x/voas (available when service goes production) Cooperation Institute of Pharmacology, Polish Academy of Sciences, Kraków Jagiellonian University Medical Collage, Kraków National Research Institute of Animal Production, Kraków-Balice

Agilent GeneSpring GX 9 RDP: genespring.plgrid.pl Used with Windows Remote Desktop Integrated with the DNA Integromics Platform for uniform microarray files management 5-year, single-seat license for all registered Polish scientists Manual: https://docs.cyfronet.pl/x/jiq1

PLGData Simple Service to Manage Files on Clusters 10 https://data.plgrid.pl/ Simple file and folder management upload, delete, download, rename, change rights Integrated with Cyfronet s Zeus cluster, accessible for all users Uses GridFTP and HTTPS protocols for secure data transfer Access to group storage for team/project collaboration Manual: https://docs.cyfronet.pl/x/64es

Links, Contact, Partners 11 These resources, services and tools (and much more) are available after registering to PL-Grid https://portal.plgrid.pl/ PL-Grid User Manual https://docs.plgrid.pl/podrecznik_uzytkownika (PL) https://docs.plgrid.pl/display/plgdoc/user+manual (EN) Questions, problems, requests about PL-Grid https://helpdesk.plgrid.pl or helpdesk@plgrid.pl Contact for LifeScience domain services plgrid@lifescience.pl Collaborative effort Academic Computer Centre Cyfronet AGH, Kraków (project leader) Kraków LifeScience Klaster (Life Science domain services leader) 10 expert institutes/laboratories from Małopolska and Śląsk