'Bioinformatics in academia as related to ehealth' - including the "Genomic Virtual Lab"

Similar documents
LARGE DATA AND BIOMEDICAL COMPUTATIONAL PIPELINES FOR COMPLEX DISEASES

USING HPC CLASS INFRASTRUCTURE FOR HIGH THROUGHPUT COMPUTING IN GENOMICS

Nuts & Bolts of Informatics Infrastructure for Clinical Next Generation Sequencing (NGS)

SEQUENCING. M Ataei, PhD. Feb 2016

Introduction to Next Generation Sequencing (NGS) Data Analysis and Pathway Analysis. Jenny Wu

Understanding the science and technology of whole genome sequencing

resequencing storage SNP ncrna metagenomics private trio de novo exome ncrna RNA DNA bioinformatics RNA-seq comparative genomics

Course Presentation. Ignacio Medina Presentation

NGS in Pathology Webinar

Genomic Technologies. Michael Schatz. Feb 1, 2018 Lecture 2: Applied Comparative Genomics

Crash-course in genomics

Massive Analysis of cdna Ends for simultaneous Genotyping and Transcription Profiling in High Throughput

Pioneering Clinical Omics

Introducing QIAseq. Accelerate your NGS performance through Sample to Insight solutions. Sample to Insight

Measuring and Understanding Gene Expression

Bioinformatics Programming and Analysis CSC Dr. Garrett Dancik

AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE

Agilent NGS Solutions : Addressing Today s Challenges

Total genomic solutions for biobanks. Maximizing the value of your specimens.

NGS Data Analysis & Interpretation Ecosystem Analysis Report

Introduction to Bioinformatics

Interpreting Genome Data for Personalised Medicine. Professor Dame Janet Thornton EMBL-EBI

What is Bioinformatics?

DNA. bioinformatics. genomics. personalized. variation NGS. trio. custom. assembly gene. tumor-normal. de novo. structural variation indel.

Genomic Data Is Going Google. Ask Bigger Biological Questions

BIOINFORMATICS FOR DUMMIES MB&C2017 WORKSHOP

European Genome phenome Archive at the European Bioinformatics Institute. Helen Parkinson Head of Molecular Archives

Introduction to Bioinformatics

Scottish Genomes Project: Taking Genomics into Healthcare. Zosia Miedzybrodzka

Session 8. Differential gene expression analysis using RNAseq data

Experiences in implementing large-scale biomedical workflows on the cloud: Challenges in transitioning to the clinical domain

Analytics Behind Genomic Testing

Variant detection analysis in the BRCA1/2 genes from Ion torrent PGM data

Computational Challenges of Medical Genomics

TECHNOLOGIES, PRODUCTS & SERVICES for MOLECULAR DIAGNOSTICS, MDx ABA 298

G E N OM I C S S E RV I C ES

Research Assistant (Complex Trait Genomics)

Sequencing applications. Today's outline. Hands-on exercises. Applications of short-read sequencing: RNA-Seq and ChIP-Seq

Biology 644: Bioinformatics

DNA. bioinformatics. epigenetics methylation structural variation. custom. assembly. gene. tumor-normal. mendelian. BS-seq. prediction.

Targeted Sequencing in the NBS Laboratory

Introduction to RNA-Seq. David Wood Winter School in Mathematics and Computational Biology July 1, 2013

Jefferies Healthcare Conference. Frank Witney, President & CEO

NextSeq 500 System WGS Solution

Forensic genetics: New frontiers

Next Generation Sequencing (NGS) Market Size, Growth and Trends ( )

ASHG 2018 CoLab Schedule CoLab #1 Booth #1441 DATA CLINICAL LABORATORY OTHER NO COLAB

Introduction into single-cell RNA-seq. Kersti Jääger 19/02/2014

Personalised Medicine and #QLDgenomics

Satellite Education Workshop (SW4): Epigenomics: Design, Implementation and Analysis for RNA-seq and Methyl-seq Experiments

Course Overview: Mutation Detection Using Massively Parallel Sequencing

Introduction to RNA-Seq in GeneSpring NGS Software

Personal Genomics Platform White Paper Last Updated November 15, Executive Summary

DNBseq TM SERVICE OVERVIEW Plant and Animal Whole Genome Re-Sequencing

Microarray Informatics

Sanger vs Next-Gen Sequencing

Quality assurance in NGS (diagnostics)

Applications of short-read

A Crash Course in NGS for GI Pathologists. Sandra O Toole

Short Course Instructors

Genetics/Genomics in Public Health. Role of Clinical and Biochemical Geneticists in Public Health

Introduction to NGS. Josef K Vogt Slides by: Simon Rasmussen Next Generation Sequencing Analysis

The MiniSeq System. Explore the possibilities. Discover demonstrated NGS workflows for molecular biology applications.

Accelerate precision medicine with Microsoft Genomics

Undiagnosed Disease Programs and Networks

Agilent Genomics Software Future Directions

Almac Diagnostics. NGS Panels: From Patient Selection to CDx. Dr Katarina Wikstrom Head of US Operations Almac Diagnostics

Data Basics. Josef K Vogt Slides by: Simon Rasmussen Next Generation Sequencing Analysis

The HL7 Clinical Genomics Work Group

Welcome to the NGS webinar series

Get to Know Your DNA. Every Single Fragment.

From Forensic Genetics to Genomics Perspectives for an Integrated Approach to the Use of Genetic Evidence in Criminal Investigations

De Novo Assembly (Pseudomonas aeruginosa MAPO1 ) Sample to Insight

Exome Sequencing Exome sequencing is a technique that is used to examine all of the protein-coding regions of the genome.

Functional Annotation and Prioritization of Whole Exome and Whole Genome Sequencing Variants. Mulin Jun Li

Introduction to Bioinformatics and Gene Expression Technologies

Introduction to Bioinformatics and Gene Expression Technologies

Whole Genome, Exome, or Custom Targeted Sequencing: How do I choose? Aaron Thorner, PhD Clinical Genomics Group Leader

Using Galaxy for the analysis of NGS-derived pathogen genomes in clinical microbiology

CS273B: Deep Learning in Genomics and Biomedicine. Recitation 1 30/9/2016

Complete automation for NGS interpretation and reporting with evidence-based clinical decision support

Introduction to NGS. Simon Rasmussen Associate Professor DTU Bioinformatics Technical University of Denmark 2018

Genomic solutions for complex disease

Explore Impact of Computing Innovations Written Response Submission Template. Submission Requirements 2. Written Responses. Computational Artifact

Introducing combined CGH and SNP arrays for cancer characterisation and a unique next-generation sequencing service. Dr. Ruth Burton Product Manager

1000 Insect Transcriptomes Evolution - 1KITE

Bioinformatics. Dick de Ridder. Tuinbouw Digitaal, 12/11/15

CDC s Advanced Molecular Detection (AMD) Sequence Data Analysis and Management

Introduction to Bioinformatics

The EMBL-Bioinformatics and Data-Intensive Informatics

Assay Validation Services

Using VarSeq to Improve Variant Analysis Research

Undergraduate Research in the Brzustowicz Laboratory Human Genetics Institute Department of Genetics Rutgers University

Alissa Interpret The next evolution of Cartagenia Bench

Types of Databases - By Scope

Accelerate High Throughput Analysis for Genome Sequencing with GPU

Cover Page. The handle holds various files of this Leiden University dissertation.

Introduction to NGS analyses

Next-generation sequencing Technology Overview

Genetics and Bioinformatics

Transcription:

'Bioinformatics in academia as related to ehealth' - including the "Genomic Virtual Lab" Dr Gareth Price Head of Computational Biology Queensland Facility of Advanced Bioinformatics

From Genomes to Systems Literature and ontologies Genomes DNA & RNA sequence Gene expression Protein sequence Protein structure Chemical entities Protein families, motifs and domains Protein interactions Pathways Systems... Bioinformatics Resource

What is Next Gen Sequencing? Next Generation Sequencing: high-throughput sequencing massive parallel short (and now long) read sequencing deep sequencing In reality NGS really just refers to the scale of sequencing. For Example: ABI Sequencers, Venter Institute 2007 1 Human Genome in total (years!) Illumina HiSeq 2000s, BGI 2013 2 Human Genomes per machine (days!)

Big Data: Acquisition, Storage, Distribution, and Analysis terapetaexazetta- 10 12 10 21 Stephens ZD, Lee SY, Faghri F, Campbell RH, Zhai C et al., PLOS Biology (2015)

Submarine Cable Map TeleGeography www.submarinecablemap.com

Data Transfer speeds Datasets potentially very large (many Tbs) Download time is lengthy and at risk of failure and interruption

Overview of NGS data flow Library preparation preparation of samples in the lab Sequencing library run in sequencer either at sequencing facility or in-house Raw reads sequencer generates output "reads" QC quality checks of the reads Trimming trimming reads if required Examine/Filter noise Removal of foreign DNA materials Assembly / Map Construction of genomes using sequencing reads or count transcripts Identification of Branches out to research analysis

Tools Academic and Clinical Freeware Genome Analysis Toolkit (GATK) Virtual Labs / Machines Galaxy R Studio (Bioconductor) Command Line Commercial Agilent Cartagenia Bench Lab for Molecular Pathology Illumina BaseSpace Qiagen CLC-Bio Suite of Analysis Products Ingenuity Pathway Analysis Ingenuity Variant Analysis ANNOVAR ThermoFisher Ion Reporter Google Genomics Microsoft Genomics Oracle Healthcare Precision Medicine http://grouthbio.com/genome_software_service.php

We observed different biases toward specific types of SNP genotyping errors by the different variant callers

Health Solutions to the use Genomic Information American College of Medical Genetics 2015: doi:10.1038/gim.2015.30 Benign, Likely Benign, Of Unknown Significance, Likely Pathogenic and Pathogenic NIH National Human Genome Research Institute: Division of Genomic Medicine Undiagnosed Rare Disorders, GWAS studies, report formats Global Alliance for Genomics and Health (GA4GH) policy-framing and technical standards-setting organization, seeking to enable responsible genomic data sharing within a human rights framework

Genomics Virtual Labs Virtual Machines with pre-installed suite of tools for performing bioinformatics analyses Public instances all around the world gvl.org.au usegalaxy.org.au [currently Galaxy Qld] GVL provides compute and storage Galaxy houses reference data, public data and indices Galaxy houses your uploaded data, computed data and results Direct import from public repositories Data visualisation options Data sharing (with and without data duplication) Big community Easy registration Services Rstudio Galaxy Command line Data Genome Indices Reference data User data

Galaxy is a workflow engine A Galaxy workflow is a series of tools and dataset actions that run in sequence as a batch operation Input Noodle Tool box 12

Genomic Information Management Improving the health of Queenslanders by delivering genomic medicine Leads David Hansen, AeHRC, CSIRO John Pearson, QIMRB MRI Collaborators Dominique Gorse, QCIF Paul Leo, QUT Pamela Pollock, QUT Cas Simons, UQ Nic Waddell, QIMRB MRI Amanda Spurdle, QIMRB MRI Sunil Lakhani, PQ & UQ Naomi Wray, IMB, QBI & UQ Hugo Leroux, AeHRC, CSIRO Alejandro Metke, AeHRC, CSIRO

Contacts www.qfab.org contact@qfab.org training@qfab.org g.price@qfab.org 14

Balancing Sharing with Identification 3D facial structure Voice Biological age Height Weight BMI Eye colour Skin colour Sex Hair Colour Baldness

Sex + Ancestry + SNPs + Age + BMI limitations in statistical power (n=1,061) individually, each model provided limited information about an individual s identity multiple prediction models enabled matching between genomes and phenotypic profiles with good accuracy Over time, predictions will get more precise thus, the results of this work will be of greater consideration in the current discussion on genome privacy protection. Left Observed Right - Predicted How we protect against identification? Mask known predictor sites?