Bioinformatics for the 100,000 Genomes Project

Size: px
Start display at page:

Download "Bioinformatics for the 100,000 Genomes Project"

Transcription

1 Bioinformatics for the 100,000 Genomes Project Augusto Rendón Director of Bioinformatics Genomics England Principal Research Associate University of Cambridge Barcelona,

2 Outline Introduction to the UK s 100,000 genomes project Analyses in rare diseases Analyses in cancer Bioinformatics Platform Data models and flows Databases Interpretation

3 Inception of the 100,000 genomes project (2012, 2014) If we get this right, we could transform how we diagnose and treat our most complex diseases not only here but across the world (December 2012) I am determined to do all I can to support the health and scientific sector to unlock the power of DNA, turning an important scientific breakthrough into something that will help deliver better tests, better drugs and above all better care for patients. (August 2014)

4 Goals of the Genomics England project Sequence 100,000 genomes Cancer and rare genetic disease Capture data delivered electronically, store it securely and analyse it within an English data centre (reading library) Combine genomes with extracted clinical information for analysis, interpretation, and aggregation Create capacity, capability and legacy in personalised medicine for the UK 1. To bring benefit to NHS patients 3. To kickstart the development of a UK genomics industry 2. To enable new scientific discovery and medical insights 4. To create an ethical and transparent programme based on consent

5 Genomics England project

6 Recruitment and clinical interface via 13 GMCs, Scotland and Northern Ireland + Genomic Medicine Centres Networks of NHS hospitals including genomics labs 13 Lead organisation plus 71 Local Delivery Partners Contracted by NHS England Cover recruitment, data and return of results Scotland Doing own sequencing Northern Ireland Similar to a GMC Contracted by NI payer

7 Feedback to participants 7

8 Additional finding Genes Requirements: A treatable or preventable condition. Reliably detected by next generation sequencing. Each gene will have a curated list of high confidence, high penetrance variants. Other conditions may be added if clinically appropriate and technically feasible.

9 Participants recruited in RD About 400 RD participants currently recruited per week 5,000 participants recruited to the RD pilot Family Size *Data from Main Programme

10 Recruitment by tumour type Sarcoma, 44, 4% Renal, 65, 6% Testicular Germ Cell Tumours, 1, 0% Adult Glioma, 19, 2% Bladder, 28, 3% Prostate, 95, 9% Breast, 321, 29% Ovarian, 91, 8% Malignant Melanoma, 3, 0% Lung, 139, 13% Childhood, 1, 0% Endometrial Carcinoma, 20, 2% Colorectal, 264, 24% 10

11 >14,896 genomes sequenced (Nov 1) N Bases x 10 9 (Q30-nodup) Median %Autosomal coverage >= 15X = 97.4% About 1.4 PB of data % Autosomal coverage >=15x(Q30-nodup) Germline data only

12 Analyses in Rare Diseases Genetic based test

13 Checks of reported data vs genetics Sex checks Coverage-based (WGS) X chromosome heterozygosity and Y chromosome genotyping rate (array) Predicted minor karyotypes include XO, XXY, XYY Relatedness checks Mendelian inconsistency rate (where at least one parent sequenced) Estimated identity by descent sharing for all pairs in cohort and working on a family only workflow - PLINK and PC-Relate Can identify rare phenomena, e.g. large-scale uniparental isodisomy

14 Coverage based sex checks

15 Relatedness checking 15

16 Analyses in Cancer Assessing the quality of sample preparation protocols

17 Fresh Frozen (FF) vs Formalin Fixed Paraffin Embedded (FFPE) FF Costly and not widely available Difficult to capture tumour High quality DNA FFPE Routinely used Digital pathology for tumour selection Low quality and quantity of DNA

18 AT/CG dropout effect on copy number variant calling FFPE GC dropout FFPE AT dropout FF AT dropout FF GC dropout AT rich GC rich AT dropout GC dropout FF sample low coverage for GC-rich regions FFPE sample trend is reversed with poor coverage of AT-rich regions

19 Fresh frozen and FFPE paired samples: ability to call CNVs FF AT drop Pur ity RMSD cov FFPE AT drop Pur ity RMSD cov NA

20 FFPE also affects small variant calling Proportion of variants Overlapping SNVs in FF and FFPE samples from paired VAF < 5% filtered out

21 Comparing sequence quality metrics across labs GMC1 Other GMCs After standardising on optimised FFPE protocol

22 Bioinformatics platform

23 GEL bioinformatics platform Design Goals Scalability: able to operate on several hundred whole genomes per day Traceability: able to keep the provenance of every artefact produced in the process Knowledge accumulation: able to capture and aggregate the knowledge, decisions captured during the interpretation in order to generate better knowledge bases Service oriented: components talk to each other via well defined APIs and data formats

24 Hospitals Genomics England Interp. providers Genome intake service Workflow management Clinical Data intake service Tracking GxP associations Interpretation Metadata Reference Knowledge Variants Interpretation platform services

25 Same data model, many manifestations How to ensure that all the data is coherently stored and easily retrievable? Inspired by Model-Driven Architecture approaches Models(schemas) controlled in github including boiler plate functions to validate data against model Documentation auto-generated out of the model Services communicate using JSON derived from the model Data written against the schema auto-generated from the model in the metadata store using document stores

26 Data models in the platform Use of avro for its interface definition language, JSON out of the box, automatic code generation of classes to handle these data Models (and auxiliary libraries) available here: s/tree/releases/schemas/idls Documentation for the master branch here: ndex.html Bioinformatics models here: For reads and variants we use protocol buffers compatible with GA4GH standards

27

28 InterpretedGenomeRD

29 Bertha: Distributed workflow management system (really an enterprise service bus for genomic data) DeliveryAPI Auditor Message Broker Job Scheduler Grid Consumer Restarts Scatter-gather Single and group processes Multiple concurrent workflows (work in progress) Orchestrator Producer publishes Exchange routes Queue consumes Consumer Tracking DB Dashboard

30 Workflow diagramme bertha_default Intake QC Delivery API Integrity Check Sequence received IntakeAPI MD5 Check Filtered Bamstats Unfiltered Bamstats Q30 Bamstats VCF QC Data intake Validate BAM Picard Plot Filtered Bamstats Generate Filtered Metrics Bamstats Plot Unfiltered Bamstats Generate Q30 Metrics Bamstats Fix Permissions QC Stats Post-processing Intake QC Check Point Merge Array Genotypes Single Sample QC & Processing Single Sample QC & Processing Sex Check Cross Sample Contamination Somatic VCF re-headering Cross Species Contamination Tumour Cross Sample Contamination Depth of Coverage Concordance check Single-Sample QC Check Point Multi Sample QC Multi-sample QC Mendelian Inconsistency Rate Identity by Decent Multi-Sample QC Check Point Consent Check Point Analysis Inbreeding Coefficient Mutation Burden SNV & Indel Refinement Actionable Mutation Coverage Viral Insertions Mutation Signature Tumour Clonality Tumour Purity Tumour Ploidy Variant Calling Homozygosity Runs Analysis Variant Normalisation Variant Annotation Interpretation Request Dispatched Interpretation Dispatch Variant Tiering Exomiser

31

32 Interpretation approach Virtual Gene Panels Initially assigned by a clinician Working on automated panel suggestions Variant filtering Allele Frequency: variant is rare Segregation: variant segregates with condition in family Panel membership (including mode of inheritance) Different for cancer Interpretation Automated pathogenicity scoring Manual review Several manual QC points

33 Panelapp: Crowdsourcing curation of gene disease associations

34 Status of Panels 190 panels 97 >=v1 panels 3,512 genes 435 registered reviewers 15,149 gene level reviews Recognised by the UK genetic testing network Curation reaching a point of diminishing returns Number of reviews Reviewers

35 Automated panel suggestion Disease X HP1 HP2 HP3 HP4 Diseases, core HP terms and panels HP1 HP2 HP3 HP4 HP5 X Y Z Panel X G1,G2,G3 Panel Y G1,G4,G5 Panel Z G6,G7,G8 Diseases, HP annotation and genes X Y Z G1,G2,G3 G1,G4,G5 G6,G7,G8 Also get a QC score for how phenotypically similar patient is to recruited disease HP2 HP3 HP4 G6 G7

36 RD pilot benchmarking 1831 participants with HPO terms, assigned panels (2674 total) and core disease 847/1831 (46%) have exactly same panels 728/1831 (40%) have same panels plus 1 or 2 extra 256/1831 (14%) are missing some of medical review 1200 panels Gene gains or losses Frequency November More 36 Bin

37 Filtering in the rare diseases programme

38 Filtering in the cancer programme Frequency filters: exclude common variants (1000G, ExAC, GEL) Consequence filters: exclude synonymous variants Domain 1 Domain 2 Domain 3 Variants in a virtual panel of actionable genes (between 20 and 40). Actionable genes are defined as genes with short variants associated with therapeutic, prognostic or diagnostic actions by GenomOncology (My Cancer Genome) Matching at the variant level Variants in the genes from Cancer Gene Census genes. Variants in all other genes Two part reports: Actionable and Interesting

39 Structural variants Mutational density Coverage and copy number Mutational signatures Supplementary analysis Mutation context Hypermutation rain plots

40 Bioinformatics platform components Cellbase Reference data store/annotation Engine OpenCGA Catalog: metadata and clinical data store Storage: variant database InterpretationPlatform Interpretation service: manage various producers and consumers Interpretation warehouse (under construction): stores and serves interpretation data

41 OpenCB family of applications Genome Browser Variant Analysis Data Discovery Interface Layer Cellbase OpenCGA Catalog OpenCGA Storage MongoDB MongoDB MongoDB HBASE PosixFS

42 Cellbase Knowledge base management Uses Ensembl, Uniprot, IntAct, ClinVar, etc. Current database engine is MongoDB JSON outputs against well defined model Supports annotation against local DBs Annotates about 10,000 variants/second per instance Python and R APIs

43 Annotation against Cellbase

44 CellBase VEP 82 Consequence type benchmark (1kG phase 3, 83M variants) VEP annotations: 346M CellBase annotations: 346M Coincidence at SO term level (346M annotations) Annotations provided by VEP and not provided by CellBase: 3364 (99.999% coincidence) Annotations provided by CellBase and not provided by VEP: 4918 (99.999% coincidence) 60% Due to differences on mirna data sources 39% Difficulties with VEP output format parsing Coincidence at variant level (83M variants) Variants with conflicting annotation: 4990 (99.994% coincidence)

45 Annotation for phased MNVs and CNVs Support for CNVs new in CellBase 4.5 Beta Main challenge: support imprecise calling - match against already reported CNVs (population frequencies, clinical variants) Same annotation data as for the rest of variants: consequence type, population frequencies, etc. Example CNV Support for MNVs and phased variants from CellBase 4.0 Consequence type depends on variants affecting the same codon Variants are assigned a phase set (phased VCFs include the PS tag) - all variants on the same phase set shall be processed together

46 Annotation of MNVs Example: 17:270550: AA CA G : TG CA A Example MNV Decompose into single phased variants members of the same phase set: { { "id": "17:270550:A:T", "result": [ { "codon": "caa/ctg", "proteinvariantannotation": { "reference": "GLN", "alternate": "LEU" }, "sequenceontologyterms": [ { "accession": "SO: ", "name": "missense_variant" } "id": "17:270551:A:G", "result": [ { "codon": "caa/ctg", "proteinvariantannotation": { "reference": "GLN", "alternate": "LEU" }, "sequenceontologyterms": [ { "accession": "SO: ", "name": "missense_variant" } { "id": "17:270554:G:A", "result": [ { "codon": "cag/caa", "proteinvariantannotation": { "reference": "GLN", "alternate": "GLN" }, "sequenceontologyterms": [ { "accession": "SO: ", "name": "synonymous_variant" }

47 OpenCGA - Catalog Metadata store and A&A for OpenCGA Manages roles, groups, acls Audit log LDAP integration Arbitrary schemas (annotation sets)

48 OpenCGA - Storage 6 node Hadoop cluster: Transform: 97 min Load: 80 sec Merge: 84 sec Millisecond response times for regional queries Whole genome filtering queries for all individuals within seconds Extensive capabilities to query across genotype and phenotype relationships

49 Aspiration to be fully GA4GH compatible from v1.0

50 Platform for interpretation (under construction)

51

52 Key (personal) learnings There is great strength in multidisciplinary teams with specialisation, but those individuals that can span both biology/genetics and software engineer are pivotal the connect the specialist Good software engineering practices also apply to bioinformatics, to name a few: designing, documenting, Testing, support and service. Skipping them don t really save you time I have become a big fan of using well established technologies with rich ecosystems (e.g. hadoop) rather than inventing new formats, data structures, toolchains

53 Final thoughts The future in human genetics will be underpinned by academic/industrial partnerships; both the task and the benefits are too big to go at it alone Genomic Medicine is just one of the pilots of a digital revolution in health care where artificial intelligence will complement/replace the diagnostic journey But genomics is the easy part, clinical data is the real challenge

100,000 Genomes Project Rare Disease Programme. Dr Richard Scott Clinical Lead for Rare Disease, Genomics England

100,000 Genomes Project Rare Disease Programme. Dr Richard Scott Clinical Lead for Rare Disease, Genomics England 100,000 Genomes Project Rare Disease Programme Dr Richard Scott Clinical Lead for Rare Disease, Genomics England Recruitment to 16 th June 31,231 rare disease participants Average = 490 participants /

More information

The 100,000 Genomes Project

The 100,000 Genomes Project The 100,000 Genomes Project Dr Tom Fowler, Deputy Chief Scientist Genomics England Jillian Hastings Ward, Chair, National 100K Participant Panel The Kings Fund, 6th October 2017 Our Mission 100,000 whole

More information

Genomics and personalised medicine

Genomics and personalised medicine Genomics and personalised medicine Dr Tom Fowler, Deputy Chief Scientist & Director of Public Health WHO Symposium on the Future of Digital Health Systems in the European Region February 2019 About me

More information

The 100,000 Genomes Project

The 100,000 Genomes Project The 100,000 Genomes Project Louise J Jones Genomics England James Fisher Director, Genomics Implementation Unit North East & North Cumbria Cancer Alliance Launch Event 30 th March 2017 The first human

More information

Delivering Genomic Medicine at a Population Level: beyond the 100,000 Genomes Project

Delivering Genomic Medicine at a Population Level: beyond the 100,000 Genomes Project Delivering Genomic Medicine at a Population Level: beyond the 100,000 Genomes Project J Louise Jones Barts Cancer Institute Barts Health NHS Trust Genomics England What is Genomic Medicine? an emerging

More information

Genomics in the NHS. Professor Sue Chief Scientific Officer for England

Genomics in the NHS. Professor Sue Chief Scientific Officer for England Genomics in the NHS Professor Sue Hill @CSOSue Chief Scientific Officer for England Nov 2017 100,000 Genomes Project: overview PRINCIPLES 100,000 genomes from Rare Disease (families) & Cancer (people &

More information

Introducing combined CGH and SNP arrays for cancer characterisation and a unique next-generation sequencing service. Dr. Ruth Burton Product Manager

Introducing combined CGH and SNP arrays for cancer characterisation and a unique next-generation sequencing service. Dr. Ruth Burton Product Manager Introducing combined CGH and SNP arrays for cancer characterisation and a unique next-generation sequencing service Dr. Ruth Burton Product Manager Today s agenda Introduction CytoSure arrays and analysis

More information

Validation of EBUS sampling

Validation of EBUS sampling Functional Area Cancer Document Key PAR-SOP-003 Document Owner Clare Craig Status FINAL Document Author Clare Craig Version 1.0 Document Reviewer(s) Shirley Henderson Version Date Kay Lawson Next Review

More information

The 100,000 Genomes Project Genomics Collaboration Event Finnish Residence

The 100,000 Genomes Project Genomics Collaboration Event Finnish Residence The 100,000 Genomes Project Genomics Collaboration Event Finnish Residence Prof Mark Caulfield FMedSci Chief Scientist for Genomics England Barts Heart Centre William Harvey Research Institute Queen Mary

More information

Validation of cytology media for whole genome sequencing

Validation of cytology media for whole genome sequencing Validation of cytology media for whole genome sequencing Functional Area Cancer Document Key PAR-SOP-004 Document Owner Clare Craig Status FINAL Document Author Clare Craig Version 1.0 Document Reviewer(s)

More information

The 100,000 Genomes Project Rare Disease Programme

The 100,000 Genomes Project Rare Disease Programme The 100,000 Genomes Project Rare Disease Programme Dr Richard Scott Clinical Lead for Rare Disease, Genomics England Consultant in Clinical Genetics, Great Ormond Street The 100,000 Genomes Project History

More information

Scottish Genomes Project: Taking Genomics into Healthcare. Zosia Miedzybrodzka

Scottish Genomes Project: Taking Genomics into Healthcare. Zosia Miedzybrodzka Scottish Genomes Project: Taking Genomics into Healthcare Zosia Miedzybrodzka DNA GENOME IS ALL OF OUR DNA: CODING & NON-CODING Sequencing of Whole Genomes at Scale in Scotland Edinburgh Genomics Sequencing

More information

Bioinformatics group update. Joo Wook Ahn, Guy s & St Thomas 26/06/ ACGS Summer Meeting

Bioinformatics group update. Joo Wook Ahn, Guy s & St Thomas 26/06/ ACGS Summer Meeting Bioinformatics group update Joo Wook Ahn, Guy s & St Thomas 26/06/2017 - ACGS Summer Meeting Recent activity... Bi-annual group meeting Nov 16 - Bristol // June 17 - Leeds // Nov 17 - Oxford ACGS Genomics

More information

ACCELERATING GENOMIC ANALYSIS ON THE CLOUD. Enabling the PanCancer Analysis of Whole Genomes (PCAWG) consortia to analyze thousands of genomes

ACCELERATING GENOMIC ANALYSIS ON THE CLOUD. Enabling the PanCancer Analysis of Whole Genomes (PCAWG) consortia to analyze thousands of genomes ACCELERATING GENOMIC ANALYSIS ON THE CLOUD Enabling the PanCancer Analysis of Whole Genomes (PCAWG) consortia to analyze thousands of genomes Enabling the PanCancer Analysis of Whole Genomes (PCAWG) consortia

More information

Accessible answers. Targeted sequencing: accelerating and amplifying answers for oncology research

Accessible answers. Targeted sequencing: accelerating and amplifying answers for oncology research Accessible answers Targeted sequencing: accelerating and amplifying answers for oncology research Help advance precision medicine Accelerate results with Ion Torrent NGS Life without cancer. This is our

More information

NHS ENGLAND BOARD PAPER

NHS ENGLAND BOARD PAPER NHS ENGLAND BOARD PAPER Paper: PB.30.03.2017/06 Title: Creating a genomic medicine service to lay the foundations to deliver personalised interventions and treatments Lead Director: Professor Sir Bruce

More information

European Genome phenome Archive at the European Bioinformatics Institute. Helen Parkinson Head of Molecular Archives

European Genome phenome Archive at the European Bioinformatics Institute. Helen Parkinson Head of Molecular Archives European Genome phenome Archive at the European Bioinformatics Institute Helen Parkinson Head of Molecular Archives What is EMBL-EBI? International, non-profit research institute Part of the European Molecular

More information

An Automated Pipeline for NGS Testing and Reporting in a Commercial Molecular Pathology Lab: The Genoptix Case

An Automated Pipeline for NGS Testing and Reporting in a Commercial Molecular Pathology Lab: The Genoptix Case Cartagenia Bench Lab Case Study An Automated Pipeline for NGS Testing and Reporting in a Commercial Molecular Pathology Lab: The Genoptix Case At a Glance In this case study, you will learn: How the Molecular

More information

2017 HTS-CSRS COMMUNITY PUBLIC WORKSHOP

2017 HTS-CSRS COMMUNITY PUBLIC WORKSHOP 2017 HTS-CSRS COMMUNITY PUBLIC WORKSHOP GenomeNext Overview Olympus Platform The Olympus Platform provides a continuous workflow and data management solution from the sequencing instrument through analysis,

More information

Sample to Insight. Dr. Bhagyashree S. Birla NGS Field Application Scientist

Sample to Insight. Dr. Bhagyashree S. Birla NGS Field Application Scientist Dr. Bhagyashree S. Birla NGS Field Application Scientist bhagyashree.birla@qiagen.com NGS spans a broad range of applications DNA Applications Human ID Liquid biopsy Biomarker discovery Inherited and somatic

More information

FFPE in your NGS Study

FFPE in your NGS Study FFPE in your NGS Study Richard Corbett Canada s Michael Smith Genome Sciences Centre Vancouver, British Columbia Dec 6, 2017 Our mandate is to advance knowledge about cancer and other diseases and to use

More information

Personal Genomics Platform White Paper Last Updated November 15, Executive Summary

Personal Genomics Platform White Paper Last Updated November 15, Executive Summary Executive Summary Helix is a personal genomics platform company with a simple but powerful mission: to empower every person to improve their life through DNA. Our platform includes saliva sample collection,

More information

Implementing ACMG guidelines on sequence variant interpretation: software-assisted variant curation and filtering

Implementing ACMG guidelines on sequence variant interpretation: software-assisted variant curation and filtering Cartagenia Bench Lab CASE STUDY Implementing ACMG guidelines on sequence variant interpretation: software-assisted variant curation and filtering Authors: Berivan Baskin 1, PhD, FACMG, FCCMG; Tina Smets

More information

Course Presentation. Ignacio Medina Presentation

Course Presentation. Ignacio Medina Presentation Course Index Introduction Agenda Analysis pipeline Some considerations Introduction Who we are Teachers: Marta Bleda: Computational Biologist and Data Analyst at Department of Medicine, Addenbrooke's Hospital

More information

BOARD PAPER - NHS ENGLAND. Purpose of Paper: To inform the Board of the development of an NHS England Personalised Medicine Strategy.

BOARD PAPER - NHS ENGLAND. Purpose of Paper: To inform the Board of the development of an NHS England Personalised Medicine Strategy. Paper: PB.24.09.15/05 Title: Personalised Medicine Strategy. From: Sir Bruce Keogh, National Medical Director. BOARD PAPER - NHS ENGLAND Purpose of Paper: To inform the Board of the development of an NHS

More information

The experience of Andalusia in ehealth and big data --- Big data in health - IMI s HARMONY project, European Parliament, Brussels, 19 June 2018

The experience of Andalusia in ehealth and big data --- Big data in health - IMI s HARMONY project, European Parliament, Brussels, 19 June 2018 The experience of Andalusia in ehealth and big data --- Big data in health - IMI s HARMONY project, European Parliament, Brussels, 19 June 2018 Joaquín Dopazo Área de Bioinformática, Fundación Progreso

More information

The NHS approach to personalised medicine in respiratory disease. Professor Sue Chief Scientific Officer for England

The NHS approach to personalised medicine in respiratory disease. Professor Sue Chief Scientific Officer for England The NHS approach to personalised medicine in respiratory disease Professor Sue Hill @CSOSue Chief Scientific Officer for England Jul 2017 Genomics is probably the biggest breakthrough in the last 50 years.

More information

Oncomine cfdna Assays Part III: Variant Analysis

Oncomine cfdna Assays Part III: Variant Analysis Oncomine cfdna Assays Part III: Variant Analysis USER GUIDE for use with: Oncomine Lung cfdna Assay Oncomine Colon cfdna Assay Oncomine Breast cfdna Assay Catalog Numbers A31149, A31182, A31183 Publication

More information

Variant calling workflow for the Oncomine Comprehensive Assay using Ion Reporter Software v4.4

Variant calling workflow for the Oncomine Comprehensive Assay using Ion Reporter Software v4.4 WHITE PAPER Oncomine Comprehensive Assay Variant calling workflow for the Oncomine Comprehensive Assay using Ion Reporter Software v4.4 Contents Scope and purpose of document...2 Content...2 How Torrent

More information

New Frontiers in Personalized Medicine

New Frontiers in Personalized Medicine New Frontiers in Personalized Medicine Oracle Open World Shanghai 2013 Neil de Crescenzo SVP and GM 1 Safe Harbor Statement The following is intended to outline our general product direction. It is intended

More information

Variant prioritization in NGS studies: Annotation and Filtering "

Variant prioritization in NGS studies: Annotation and Filtering Variant prioritization in NGS studies: Annotation and Filtering Colleen J. Saunders (PhD) DST/NRF Innovation Postdoctoral Research Fellow, South African National Bioinformatics Institute/MRC Unit for Bioinformatics

More information

Almac Diagnostics. NGS Panels: From Patient Selection to CDx. Dr Katarina Wikstrom Head of US Operations Almac Diagnostics

Almac Diagnostics. NGS Panels: From Patient Selection to CDx. Dr Katarina Wikstrom Head of US Operations Almac Diagnostics Almac Diagnostics NGS Panels: From Patient Selection to CDx Dr Katarina Wikstrom Head of US Operations Almac Diagnostics Overview Almac Diagnostics Overview Benefits and Challenges of NGS Panels for Subject

More information

High Cross-Platform Genotyping Concordance of Axiom High-Density Microarrays and Eureka Low-Density Targeted NGS Assays

High Cross-Platform Genotyping Concordance of Axiom High-Density Microarrays and Eureka Low-Density Targeted NGS Assays High Cross-Platform Genotyping Concordance of Axiom High-Density Microarrays and Eureka Low-Density Targeted NGS Assays Ali Pirani and Mohini A Patil ISAG July 2017 The world leader in serving science

More information

Data Mining for Genomic- Phenomic Correlations

Data Mining for Genomic- Phenomic Correlations Data Mining for Genomic- Phenomic Correlations Joyce C. Niland, Ph.D. Associate Director & Chair, Information Sciences Rebecca Nelson, Ph.D. Lead, Data Mining Section City of Hope National Medical Center

More information

BICF Variant Analysis Tools. Using the BioHPC Workflow Launching Tool Astrocyte

BICF Variant Analysis Tools. Using the BioHPC Workflow Launching Tool Astrocyte BICF Variant Analysis Tools Using the BioHPC Workflow Launching Tool Astrocyte Prioritization of Variants SNP INDEL SV Astrocyte BioHPC Workflow Platform Allows groups to give easy-access to their analysis

More information

DNA. Clinical Trials. Research RNA. Custom. Reports CLIA CAP GCP. Tumor Genomic Profiling Services for Clinical Trials

DNA. Clinical Trials. Research RNA. Custom. Reports CLIA CAP GCP. Tumor Genomic Profiling Services for Clinical Trials Tumor Genomic Profiling Services for Clinical Trials Custom Reports DNA RNA Focused Gene Sets Clinical Trials Accuracy and Content Enhanced NGS Sequencing Extended Panel, Exomes, Transcriptomes Research

More information

Introduction to Next Generation Sequencing (NGS) Andrew Parrish Exeter, 2 nd November 2017

Introduction to Next Generation Sequencing (NGS) Andrew Parrish Exeter, 2 nd November 2017 Introduction to Next Generation Sequencing (NGS) Andrew Parrish Exeter, 2 nd November 2017 Topics to cover today What is Next Generation Sequencing (NGS)? Why do we need NGS? Common approaches to NGS NGS

More information

MedSavant: An open source platform for personal genome interpretation

MedSavant: An open source platform for personal genome interpretation MedSavant: An open source platform for personal genome interpretation Marc Fiume 1, James Vlasblom 2, Ron Ammar 3, Orion Buske 1, Eric Smith 1, Andrew Brook 1, Sergiu Dumitriu 2, Christian R. Marshall

More information

Implementing ACMG guidelines on sequence variant interpretation: software-assisted variant curation and filtering

Implementing ACMG guidelines on sequence variant interpretation: software-assisted variant curation and filtering Cartagenia Bench Lab Case Study Implementing ACMG guidelines on sequence variant interpretation: software-assisted variant curation and filtering At a Glance In this case study, you will learn: How Uppsala

More information

Next Generation Molecular Diagnostics in Scotland. Dr Mark Drummond Haematologist, Beatson Cancer Centre

Next Generation Molecular Diagnostics in Scotland. Dr Mark Drummond Haematologist, Beatson Cancer Centre Next Generation Molecular Diagnostics in Scotland Dr Mark Drummond Haematologist, Beatson Cancer Centre Outline Precision Medicine Regulatory and Funding background in Scotland (briefly) The Gap in the

More information

QIAGEN s NGS Solutions for Biomarkers NGS & Bioinformatics team QIAGEN (Suzhou) Translational Medicine Co.,Ltd

QIAGEN s NGS Solutions for Biomarkers NGS & Bioinformatics team QIAGEN (Suzhou) Translational Medicine Co.,Ltd QIAGEN s NGS Solutions for Biomarkers NGS & Bioinformatics team QIAGEN (Suzhou) Translational Medicine Co.,Ltd 1 Our current NGS & Bioinformatics Platform 2 Our NGS workflow and applications 3 QIAGEN s

More information

Implementing ACMG guidelines on sequence variant interpretation: software-assisted variant curation and filtering

Implementing ACMG guidelines on sequence variant interpretation: software-assisted variant curation and filtering Alissa Interpret The next evolution of Cartagenia Bench Case Study: Implementing ACMG guidelines on sequence variant interpretation: software-assisted variant curation and filtering At a Glance In this

More information

Advanced healthcare solutions for

Advanced healthcare solutions for Advanced Software For Breakthrough Applications Robert Nagle Delivering Breakthroughs Advanced technology for developers of data-intensive applications Advanced healthcare solutions for providers, communities

More information

BaseSpace Knowledge Network Variant interpretation is simplified with organized biomarker content curated from large public databases.

BaseSpace Knowledge Network Variant interpretation is simplified with organized biomarker content curated from large public databases. BaseSpace Knowledge Network Variant interpretation is simplified with organized biomarker content curated from large public databases. Introduction Estimates of human genome variants implicated in disease

More information

An Automated Pipeline for NGS Testing and Reporting in a Commercial Molecular Pathology Lab: The Genoptix Case

An Automated Pipeline for NGS Testing and Reporting in a Commercial Molecular Pathology Lab: The Genoptix Case Bench Lab CASE STUDY An Automated Pipeline for NGS Testing and Reporting in a Commercial Molecular Pathology Lab: The Genoptix Case Authors: Matthew J. McGinniss 1, PhD FACMG, Executive Director, Molecular

More information

Welcome to the NGS webinar series

Welcome to the NGS webinar series Welcome to the NGS webinar series Webinar 1 NGS: Introduction to technology, and applications NGS Technology Webinar 2 Targeted NGS for Cancer Research NGS in cancer Webinar 3 NGS: Data analysis for genetic

More information

Using VarSeq to Improve Variant Analysis Research

Using VarSeq to Improve Variant Analysis Research Using VarSeq to Improve Variant Analysis Research June 10, 2015 G Bryce Christensen Director of Services Questions during the presentation Use the Questions pane in your GoToWebinar window Agenda 1 Variant

More information

Genomics for All: From 100,000 genomes to a national NHS Genomic Medicine Service

Genomics for All: From 100,000 genomes to a national NHS Genomic Medicine Service Genomics for All: From 100,000 genomes to a national NHS Genomic Medicine Service Professor Dame Sue Hill @CSOsue Chief Scientific Officer for England SRO Genomics, NHS England September 2018 Genomics:

More information

SureSelect XT HS. Target Enrichment

SureSelect XT HS. Target Enrichment SureSelect XT HS Target Enrichment What Is It? SureSelect XT HS joins the SureSelect library preparation reagent family as Agilent s highest sensitivity hybrid capture-based library prep and target enrichment

More information

CAPTURE-BASED APPROACH FOR COMPREHENSIVE DETECTION OF IMPORTANT ALTERATIONS

CAPTURE-BASED APPROACH FOR COMPREHENSIVE DETECTION OF IMPORTANT ALTERATIONS CAPTURE-BASE APPROACH FOR COMPREHENSIVE ETECTION OF IMPORTANT ALTERATIONS SEQUENCE MUTATIONS MICROSATELLITE INSTABILITY AMPLIFICATIONS GENOMIC REARRANGEMENTS For Research Use Only. Not for iagnostic Purposes.

More information

Ion S5 and Ion S5 XL Systems

Ion S5 and Ion S5 XL Systems Ion S5 and Ion S5 XL Systems Targeted sequencing has never been simpler Explore the Ion S5 and Ion S5 XL Systems Adopting next-generation sequencing (NGS) in your lab is now simpler than ever The Ion S5

More information

Biobanking for the 100,000 genomes project. BBMRI-UKCTC satellite meeting 29 th September

Biobanking for the 100,000 genomes project. BBMRI-UKCTC satellite meeting 29 th September Biobanking for the 100,000 genomes project BBMRI-UKCTC satellite meeting 29 th September The challenge Scale Quality Patient to sample/data to clinician Impact of pre-analytical variables vs cost of process

More information

Ion S5 and Ion S5 XL Systems

Ion S5 and Ion S5 XL Systems Ion S5 and Ion S5 XL Systems Targeted sequencing has never been simpler Explore the Ion S5 and Ion S5 XL Systems Adopting next-generation sequencing (NGS) in your lab is now simpler than ever The Ion S5

More information

Single Nucleotide Variant Analysis. H3ABioNet May 14, 2014

Single Nucleotide Variant Analysis. H3ABioNet May 14, 2014 Single Nucleotide Variant Analysis H3ABioNet May 14, 2014 Outline What are SNPs and SNVs? How do we identify them? How do we call them? SAMTools GATK VCF File Format Let s call variants! Single Nucleotide

More information

Prostate Cancer Genetics: Today and tomorrow

Prostate Cancer Genetics: Today and tomorrow Prostate Cancer Genetics: Today and tomorrow Henrik Grönberg Professor Cancer Epidemiology, Deputy Chair Department of Medical Epidemiology and Biostatistics ( MEB) Karolinska Institutet, Stockholm IMPACT-Atanta

More information

G E N OM I C S S E RV I C ES

G E N OM I C S S E RV I C ES GENOMICS SERVICES ABOUT T H E N E W YOR K G E NOM E C E N T E R NYGC is an independent non-profit implementing advanced genomic research to improve diagnosis and treatment of serious diseases. Through

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION doi:10.1038/nature13127 Factors to consider in assessing candidate pathogenic mutations in presumed monogenic conditions The questions itemized below expand upon the definitions in Table 1 and are provided

More information

Your Best Data: Teaming QIAGEN Chemistry & Bioinformatics to Drive Samples to Insight

Your Best Data: Teaming QIAGEN Chemistry & Bioinformatics to Drive Samples to Insight Your Best Data: Teaming QIAGEN Chemistry & Bioinformatics to Drive Samples to Insight Aysel Heckel Director Clinical Solutions Sales Dr. Anne Arens Field Application Scientist Course on Variant Detection

More information

Agilent NGS Solutions : Addressing Today s Challenges

Agilent NGS Solutions : Addressing Today s Challenges Agilent NGS Solutions : Addressing Today s Challenges Charmian Cher, Ph.D Director, Global Marketing Programs 1 10 years of Next-Gen Sequencing 2003 Completion of the Human Genome Project 2004 Pyrosequencing

More information

Training materials.

Training materials. Training materials - Ensembl training materials are protected by a CC BY license - http://creativecommons.org/licenses/by/4.0/ - If you wish to re-use these materials, please credit Ensembl for their creation

More information

Developing Data Models and Standards to Support Use Cases

Developing Data Models and Standards to Support Use Cases Developing Data Models and Standards to Support Use Cases Robert R. Freimuth, PhD ClinGen/DECIPHER Meeting May 27, 2015 2014 MFMER slide-1 Interoperability Semantic Requires a common understanding of the

More information

Assignment 9: Genetic Variation

Assignment 9: Genetic Variation Assignment 9: Genetic Variation Due Date: Friday, March 30 th, 2018, 10 am In this assignment, you will profile genome variation information and attempt to answer biologically relevant questions. The variant

More information

Cancer Genetics Solutions

Cancer Genetics Solutions Cancer Genetics Solutions Cancer Genetics Solutions Pushing the Boundaries in Cancer Genetics Cancer is a formidable foe that presents significant challenges. The complexity of this disease can be daunting

More information

Market engagement day introduction DRAFT may be subject to change

Market engagement day introduction DRAFT may be subject to change Market engagement day introduction DRAFT may be subject to change Professor Sue Hill @CSOsue Chief Scientific Officer for England SRO for Genomics, NHS England Sep 2017 What will be covered today 1. Brief

More information

Genomic solutions for complex disease

Genomic solutions for complex disease Genomic solutions for complex disease Power your with our genomic solutions Access a breadth of applications. Gain a depth of insights. To enhance their understanding of complex disease, researchers are

More information

Global Screening Array (GSA)

Global Screening Array (GSA) Technical overview - Infinium Global Screening Array (GSA) with optional Multi-disease drop in (MD) The Infinium Global Screening Array (GSA) combines a highly optimized, universal genome-wide backbone,

More information

Creating a national Genomic Medicine Service. Professor Sue Chief Scientific Officer for England

Creating a national Genomic Medicine Service. Professor Sue Chief Scientific Officer for England Creating a national Genomic Medicine Service Professor Sue Hill @CSOsue Chief Scientific Officer for England Jun 2017 Into the next generation science moving beyond genomics alone DNA HGP 100,000 Genomes

More information

NGS in Pathology Webinar

NGS in Pathology Webinar NGS in Pathology Webinar NGS Data Analysis March 10 2016 1 Topics for today s presentation 2 Introduction Next Generation Sequencing (NGS) is becoming a common and versatile tool for biological and medical

More information

Exome Sequencing Exome sequencing is a technique that is used to examine all of the protein-coding regions of the genome.

Exome Sequencing Exome sequencing is a technique that is used to examine all of the protein-coding regions of the genome. Glossary of Terms Genetics is a term that refers to the study of genes and their role in inheritance the way certain traits are passed down from one generation to another. Genomics is the study of all

More information

Nature Genetics: doi: /ng Supplementary Figure 1

Nature Genetics: doi: /ng Supplementary Figure 1 Supplementary Figure 1 Processing of mutations and generation of simulated controls. On the left, a diagram illustrates the manner in which covariate-matched simulated mutations were obtained, filtered

More information

Accelerate precision medicine with Microsoft Genomics

Accelerate precision medicine with Microsoft Genomics Accelerate precision medicine with Microsoft Genomics Copyright 2018 Microsoft, Inc. All rights reserved. This content is for informational purposes only. Microsoft makes no warranties, express or implied,

More information

MICROARRAYS+SEQUENCING

MICROARRAYS+SEQUENCING MICROARRAYS+SEQUENCING The most efficient way to advance genomics research Down to a Science. www.affymetrix.com/downtoascience Affymetrix GeneChip Expression Technology Complementing your Next-Generation

More information

TOTAL CANCER CARE: CREATING PARTNERSHIPS TO ADDRESS PATIENT NEEDS

TOTAL CANCER CARE: CREATING PARTNERSHIPS TO ADDRESS PATIENT NEEDS TOTAL CANCER CARE: CREATING PARTNERSHIPS TO ADDRESS PATIENT NEEDS William S. Dalton, PhD, MD CEO, M2Gen & Director, Personalized Medicine Institute, Moffitt Cancer Center JULY 15, 2013 MOFFITT CANCER CENTER

More information

Implementing a National Genomic Medicine Service for the NHS: building on the legacy of the 100,000 Genomes Project

Implementing a National Genomic Medicine Service for the NHS: building on the legacy of the 100,000 Genomes Project Implementing a National Genomic Medicine Service for the NHS: building on the legacy of the 100,000 Genomes Project A joint event by the All-Party Parliamentary Health Group and the All- Party Parliamentary

More information

Prioritization: from vcf to finding the causative gene

Prioritization: from vcf to finding the causative gene Prioritization: from vcf to finding the causative gene vcf file making sense A vcf file from an exome sequencing project may easily contain 40-50 thousand variants. In order to optimize the search for

More information

C3BI. VARIANTS CALLING November Pierre Lechat Stéphane Descorps-Declère

C3BI. VARIANTS CALLING November Pierre Lechat Stéphane Descorps-Declère C3BI VARIANTS CALLING November 2016 Pierre Lechat Stéphane Descorps-Declère General Workflow (GATK) software websites software bwa picard samtools GATK IGV tablet vcftools website http://bio-bwa.sourceforge.net/

More information

Oral Cleft Targeted Sequencing Project

Oral Cleft Targeted Sequencing Project Oral Cleft Targeted Sequencing Project Oral Cleft Group January, 2013 Contents I Quality Control 3 1 Summary of Multi-Family vcf File, Jan. 11, 2013 3 2 Analysis Group Quality Control (Proposed Protocol)

More information

Detection of Rare Variants in Degraded FFPE Samples Using the HaloPlex Target Enrichment System

Detection of Rare Variants in Degraded FFPE Samples Using the HaloPlex Target Enrichment System Detection of Rare Variants in Degraded FFPE Samples Using the HaloPlex Target Enrichment System Application Note Author Linus Forsmark Henrik Johansson Agilent Technologies Inc. Santa Clara, CA USA Abstract

More information

Gene Regulation Solutions. Microarrays and Next-Generation Sequencing

Gene Regulation Solutions. Microarrays and Next-Generation Sequencing Gene Regulation Solutions Microarrays and Next-Generation Sequencing Gene Regulation Solutions The Microarrays Advantage Microarrays Lead the Industry in: Comprehensive Content SurePrint G3 Human Gene

More information

SNP calling and VCF format

SNP calling and VCF format SNP calling and VCF format Laurent Falquet, Oct 12 SNP? What is this? A type of genetic variation, among others: Family of Single Nucleotide Aberrations Single Nucleotide Polymorphisms (SNPs) Single Nucleotide

More information

Whole Genome Sequencing in Cancer Diagnostics (research) Nederlandse Pathologiedagen 19 & 20 November 2015

Whole Genome Sequencing in Cancer Diagnostics (research) Nederlandse Pathologiedagen 19 & 20 November 2015 Whole Genome Sequencing in Cancer Diagnostics (research) Nederlandse Pathologiedagen 19 & 20 November 2015 Dr. I.J. Nijman Disclosure slide (Potential) conflict of interest None For this meeting relevant

More information

Target Enrichment Strategies for Next Generation Sequencing

Target Enrichment Strategies for Next Generation Sequencing Target Enrichment Strategies for Next Generation Sequencing Anuj Gupta, PhD Agilent Technologies, New Delhi Genotypic Conference, Sept 2014 NGS Timeline Information burst Nearly 30,000 human genomes sequenced

More information

Redefine what s possible with the Axiom Genotyping Solution

Redefine what s possible with the Axiom Genotyping Solution Redefine what s possible with the Axiom Genotyping Solution From discovery to translation on a single platform The Axiom Genotyping Solution enables enhanced genotyping studies to accelerate your research

More information

Q4. WORK STREAMS Discovery, Cloud, Large Scale Genomics. DRIVER PROJECTS ELIXIR Beacon, EVA/EGA/ENA. WORK STREAMS Discovery

Q4. WORK STREAMS Discovery, Cloud, Large Scale Genomics. DRIVER PROJECTS ELIXIR Beacon, EVA/EGA/ENA. WORK STREAMS Discovery 2018 - Q4 Search The GA4GH Search API enables a search engine for genomic and clinical data by providing specification for query language across genomic, phenotypic, and clinical data that can be used

More information

SureSelect Clinical Research Exome V2. Optimized for Rare Diseases

SureSelect Clinical Research Exome V2. Optimized for Rare Diseases SureSelect Clinical Research Exome V2 Optimized for Rare Diseases SureSelect Clinical Research Exome V2 Definitive nswers Where It Matters Most he SureSelect Clinical Research Exome V2 is the newest version

More information

Information Builders Enterprise Information Management Solution Transforming data into business value Fateh NAILI Enterprise Solutions Manager

Information Builders Enterprise Information Management Solution Transforming data into business value Fateh NAILI Enterprise Solutions Manager Information Builders Enterprise Information Management Solution Transforming data into business value Fateh NAILI Enterprise Solutions Manager June 20 th, 2017 1 Agenda Introduction Information Builders

More information

CITATION FILE CONTENT / FORMAT

CITATION FILE CONTENT / FORMAT CITATION 1) For any resultant publications using single samples please cite: Matthew A. Field, Vicky Cho, T. Daniel Andrews, and Chris C. Goodnow (2015). "Reliably detecting clinically important variants

More information

From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow

From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow Technical Overview Import VCF Introduction Next-generation sequencing (NGS) studies have created unanticipated challenges with

More information

Comments on Use of Databases for Establishing the Clinical Relevance of Human Genetic Variants

Comments on Use of Databases for Establishing the Clinical Relevance of Human Genetic Variants Division of Dockets Management (HFA-305) Food and Drug Administration 5630 Fishers Lane, Rm. 1061 Rockville, MD 20852 The American Society of Human Genetics 9650 Rockville Pike Bethesda, MD 20814 24 December

More information

HaloPlex HS. Get to Know Your DNA. Every Single Fragment. Kevin Poon, Ph.D.

HaloPlex HS. Get to Know Your DNA. Every Single Fragment. Kevin Poon, Ph.D. HaloPlex HS Get to Know Your DNA. Every Single Fragment. Kevin Poon, Ph.D. Sr. Global Product Manager Diagnostics & Genomics Group Agilent Technologies For Research Use Only. Not for Use in Diagnostic

More information

5-Minute Guide to. Cognitive Search for the Life Sciences

5-Minute Guide to. Cognitive Search for the Life Sciences 5-Minute Guide to Cognitive Search for the Life Sciences 5-Minute Guide to Cognitive Search for the Life Sciences Cognitive search has its roots in enterprise search, which goes back to at least the 1960s.

More information

AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE

AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE ACCELERATING PROGRESS IS IN OUR GENES AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE GENESPRING GENE EXPRESSION (GX) MASS PROFILER PROFESSIONAL (MPP) PATHWAY ARCHITECT (PA) See Deeper. Reach Further. BIOINFORMATICS

More information

ILLUMINA SEQUENCING SYSTEMS

ILLUMINA SEQUENCING SYSTEMS ILLUMINA SEQUENCING SYSTEMS PROVEN QUALITY. TRUSTED SOLUTIONS. Every day, researchers are using Illumina next-generation sequencing (NGS) systems to better understand human health and disease, as well

More information

Introduction to Bioinformatics

Introduction to Bioinformatics Introduction to Bioinformatics Richard Corbett Canada s Michael Smith Genome Sciences Centre Vancouver, British Columbia June 28, 2017 Our mandate is to advance knowledge about cancer and other diseases

More information

Centro Nacional de Análisis Genómico. Where are the Bottlenecks of Genome Analysis Today? Teratec. Ecole Polytechnique, Palaiseau, F.

Centro Nacional de Análisis Genómico. Where are the Bottlenecks of Genome Analysis Today? Teratec. Ecole Polytechnique, Palaiseau, F. Centro Nacional de Análisis Genómico Where are the Bottlenecks of Genome Analysis Today? Teratec Ecole Polytechnique, Palaiseau, F Ivo Glynne Gut 29.06.2016 The genomehenge Sequencing capacity >1000 Gbases/day

More information

VALIDATION OF HLA TYPING BY NGS

VALIDATION OF HLA TYPING BY NGS VALIDATION OF HLA TYPING BY NGS Eric T. Weimer, Ph.D., D(ABMLI) Assistant Professor, Pathology and Laboratory Medicine Associate Director, Clinical Flow Cytometry, HLA, and Immunology Laboratories CONFLICT

More information

Corporate Medical Policy

Corporate Medical Policy Corporate Medical Policy Proteogenomic Testing for Patients with Cancer (GPS Cancer Test) File Name: Origination: Last CAP Review: Next CAP Review: Last Review: proteogenomic_testing_for_patients_with_cancer_gps_cancer_test

More information

Genomics: Human variation

Genomics: Human variation Genomics: Human variation Lecture 1 Introduction to Human Variation Dr Colleen J. Saunders, PhD South African National Bioinformatics Institute/MRC Unit for Bioinformatics Capacity Development, University

More information

New Frontiers of Genetic Profiling Achieve Higher Sensitivity and Greater Insights with Molecular Barcodes, Long Read Capture and Optimized Exomes

New Frontiers of Genetic Profiling Achieve Higher Sensitivity and Greater Insights with Molecular Barcodes, Long Read Capture and Optimized Exomes New Frontiers of Genetic Profiling Achieve Higher Sensitivity and Greater Insights with Molecular Barcodes, Long Read Capture and Optimized Exomes Jennifer Jones, PhD Senior Field Application Scientist

More information

Reads to Discovery. Visualize Annotate Discover. Small DNA-Seq ChIP-Seq Methyl-Seq. MeDIP-Seq. RNA-Seq. RNA-Seq.

Reads to Discovery. Visualize Annotate Discover. Small DNA-Seq ChIP-Seq Methyl-Seq. MeDIP-Seq. RNA-Seq. RNA-Seq. Reads to Discovery RNA-Seq Small DNA-Seq ChIP-Seq Methyl-Seq RNA-Seq MeDIP-Seq www.strand-ngs.com Analyze Visualize Annotate Discover Data Import Alignment Vendor Platforms: Illumina Ion Torrent Roche

More information