Whole-Genome Sequencing (WGS) for Food Safety

1 Whole-Genome Sequencing (WGS) for Food Safety
Errol Strain, Ph.D.
Director, Biostatistics and Bioinformatics Staff
Center for Food Safety and Applied Nutrition
U.S. Food and Drug Administration
IFSH Meeting 5/22/2017

2 FDA Regulatory Use Cases
1. Do these new bacterial isolates from environmental/product testing match any clinical isolates in the DB? Is this product/facility causing illness?
2. Do these new clinical isolates match any environmental/food isolates in the DB? Should we test product/swab a facility?
3. Are isolates collected at different points in time from the same facility a match? Is there a problem w/ a resident pathogen, harborage?

3 GenomeTrakr Data Flow
[Figure labels: Salmonella; Listeria; GenomeTrakr Labs & Collaborators]

4 NGS-Based Surveillance (prior to NCBI Pathogen Detection)
Initial Clustering: PFGE, K-mer, MASH, BLAST. Goal: find a group of closely related isolates
NCBI Missing
SNP Pipeline: Find phylogenetically informative SNPs, FASTA alignment
FDA
Construct Phylogeny
FDA
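The k-mer step of initial clustering can be illustrated with a small sketch. It computes an exact Jaccard distance between the k-mer sets of two sequences. The function names and the default k are assumptions made for illustration. MASH estimates a similar quantity from MinHash sketches rather than from full k-mer sets, so this is not a reimplementation of that tool.

```python
from __future__ import annotations


def kmers(seq: str, k: int = 5) -> set[str]:
    """Return the set of all length-k substrings of a sequence."""
    seq = seq.upper()
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def jaccard_distance(a: str, b: str, k: int = 5) -> float:
    """1 - |A & B| / |A | B| over the k-mer sets of the two sequences."""
    ka, kb = kmers(a, k), kmers(b, k)
    union = ka | kb
    if not union:
        # Both sequences are shorter than k: treat them as indistinguishable.
        return 0.0
    return 1.0 - len(ka & kb) / len(union)


if __name__ == "__main__":
    # Hypothetical toy sequences; real inputs would be assemblies or reads.
    print(jaccard_distance("ACGT", "ACGA", k=2))
```

Isolates whose pairwise distance falls below a chosen cutoff would then be grouped and passed on to SNP analysis.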

5 NCBI Pathogen Detection

6 CFSAN vs NCBI SNPs

7 Scientific Evidence: Daubert Standard
1. Empirical testing: whether the theory or technique is falsifiable, refutable, and/or testable.
2. Whether it has been subjected to peer review and publication. Specific/target studies for pathogens have been published. Multiple software packages for mapping and calling SNPs.
3. The known or potential error rate. Well characterized at read level, less so for cluster analysis.
4. The existence and maintenance of standards and controls concerning its operation. Proficiency testing efforts through Global Microbial Identifier and also FDA GenomeTrakr network.
5. The degree to which the theory and technique is generally accepted by a relevant scientific community. Acceptance facilitated by open database (NCBI/SRA).

8 Why Build A Pipeline?
1. Regulatory Use and/or Accredited Labs
   - NCBI methods not public and peer-reviewed
   - Chain of custody: local computation
   - Results needed immediately
2. Pathogen and/or data not at NCBI
   - Mycobacterium, Legionella*
   - Food industry private data

9 What Kind of Pipeline? SNPs vs wgMLST
Unit of Measure
   - SNPs: single nucleotide substitutions (other types of mutations are excluded)
   - wgMLST: allele, a variant of a gene. Variation could arise from a number of sources, including SNPs, insertions, deletions, etc.
Requirements
   - SNPs: complete or high-quality reference genome for mapping
   - wgMLST: database of named alleles, must be actively maintained
Pros
   - SNPs: extremely high resolution; methods have been published and validated
   - wgMLST: relatively fast, not directly dependent upon reference genome
Cons
   - SNPs: requires reference genome, computationally intense, requires local bioinformatics expertise
   - wgMLST: allele database must be centralized, cannot compute novel wgMLST types locally; wgMLST schemas not easy to publicly access
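The two units of measure can be contrasted with a short sketch. It assumes two aligned sequences of equal length for the SNP case and two locus-to-allele-ID dictionaries for the wgMLST case. The function names, the toy data, and the choice to skip non-ACGT sites and uncalled loci are illustrative assumptions, not the behavior of any particular pipeline or allele database.

```python
from typing import Dict, Optional

BASES = set("ACGT")


def snp_distance(seq_a: str, seq_b: str) -> int:
    """Count aligned positions where both bases are unambiguous and differ."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(
        1
        for x, y in zip(seq_a.upper(), seq_b.upper())
        if x in BASES and y in BASES and x != y
    )


def allele_distance(
    profile_a: Dict[str, Optional[int]], profile_b: Dict[str, Optional[int]]
) -> int:
    """Count loci called in both profiles whose allele IDs differ."""
    shared = profile_a.keys() & profile_b.keys()
    return sum(
        1
        for locus in shared
        if profile_a[locus] is not None
        and profile_b[locus] is not None
        and profile_a[locus] != profile_b[locus]
    )


if __name__ == "__main__":
    # One substitution; the N at the last position is ignored.
    print(snp_distance("ACGTN", "ACCTA"))
    # locus2 differs; locus3 is uncalled in the first profile and is skipped.
    print(allele_distance({"locus1": 1, "locus2": 5, "locus3": None},
                          {"locus1": 1, "locus2": 7, "locus3": 2}))
```

A single allele change may hide one SNP, several SNPs, or an indel, which is one reason the two measures do not share thresholds.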

10 FDA Pipeline Requirements
1. Public, Peer-Reviewed
   - Results may be subject to legal scrutiny
   - Accessible to FDA-regulated industries
2. Reproducible
3. Documentation & Validation
4. Platform independent (fastq)
5. Run Locally

11 Background: CFSAN SNP Pipeline
Mapping/Aligning (66+), SNP Detection (16+)
Tools shown: Samtools, SOAPsnp, GATK, Bowtie2, VarScan, SNVer, SHORE, SMALT, MaCH, IMPUTE2, CLC Bio, QualitySNPng, DNABaser, SNPdetector, FreeBayes, SolSNP, DNAStar

12 CFSAN SNP Pipeline
Documentation:
Source Code:
Pettengill JB, Luo Y, Davis S, Chen Y, Gonzalez-Escalona N, Ottesen A, Rand H, Allard MW, Strain E. (2014) An evaluation of alternative methods for constructing phylogenies from whole genome sequence data: a case study with Salmonella. PeerJ 2:e620
Davis S, Pettengill JB, Luo Y, Payne J, Shpuntoff A, Rand H, Strain E. (2015) CFSAN SNP Pipeline: an automated method for constructing SNP matrices from next-generation sequence data. PeerJ Computer Science 1:e

13 FDA\CFSAN Validation Efforts
1. Technical Performance
   - Accuracy: Salmonella LT2 and Agona SL
2. Intralaboratory variation, sequencing platform
   - Salmonella Montevideo (180+ runs), PacBio vs short reads
3. Interlaboratory variation
   - Salmonella Braenderup BAA-664 (PFGE control), ISO/CEN WG, GenomeTrakr PT set (Salmonella & Listeria), Global Microbial Identifier PT
4. Bioinformatics Pipeline
   - Software validation, benchmark bioinformatic data sets
   - Collaborations w/ Canada, CDC, NIH/NCBI

14 Proficiency Testing
GenomeTrakr
   - 2014, 2015: Each lab in the GT network sequenced the same set of 8 strains.
   - CFSAN PT analysis returned. Manuscript in preparation
GMI (yearly since 2013)
   - 2016 PT has wet and dry lab components
   - 2016 PT includes K. pneumoniae, L. mono, C. jejuni, E. coli
PulseNet/GenomeTrakr harmonized PT
   - Early

15 CFSAN Workflow

17 Min-diff
Minimum SNP distance to an isolate of a different sample type: Food/Environmental vs Clinical (or Microbe)
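The min-diff definition can be sketched as follows, assuming pairwise SNP distances are already available as a dictionary keyed by isolate pairs and each isolate carries a sample-type label. The function name, the data layout, and the isolate names are hypothetical and chosen only for illustration.

```python
from typing import Dict, Optional, Tuple


def min_diff(
    isolate: str,
    snp_dist: Dict[Tuple[str, str], int],
    sample_type: Dict[str, str],
) -> Optional[int]:
    """Smallest SNP distance from `isolate` to any isolate of another sample type.

    Returns None when no isolate of a different type has a recorded distance.
    """
    own_type = sample_type[isolate]
    candidates = []
    for (a, b), dist in snp_dist.items():
        if isolate not in (a, b):
            continue
        other = b if a == isolate else a
        if sample_type[other] != own_type:
            candidates.append(dist)
    return min(candidates) if candidates else None


if __name__ == "__main__":
    # Hypothetical isolates and distances.
    types = {"food1": "food/env", "food2": "food/env",
             "clin1": "clinical", "clin2": "clinical"}
    dist = {("food1", "clin1"): 8, ("food1", "clin2"): 40,
            ("food1", "food2"): 2, ("clin1", "clin2"): 35,
            ("clin1", "food2"): 9, ("clin2", "food2"): 41}
    print(min_diff("food1", dist, types))
```

In this toy data the nearest neighbor of food1 overall is food2 at 2 SNPs, but min-diff ignores it because both are food/environmental isolates.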

18 8 SNPs: Check SNP Cluster

20 CFSAN Workflow
   - CFSAN SNP Pipeline is run on NCBI SNP cluster
   - Reference: prefer complete genomes, drafts work almost as well
   - High-density SNP regions are filtered: >3 SNPs in 1000 bases, phages/recombination/etc.
   - Phylogenetic inference: Maximum Likelihood
   - Ambiguous sites are treated as missing data

21 Which Reference?
[Figure: % divergence axis (0%, 0.1%, 0.5%, 1%, 2.5%, 5%) with labels 10 SNP Difference, Strain, Subtype, PFGE, Serotype, Subspecies, Species]

22 CFSAN SNP Pipeline: Listeria Draft vs PacBio Genome
[Figure panels: High-Quality Draft; Complete (PacBio)]

23 Interpretation
SNP Distance: How close are the isolates? No single threshold for all species/types; rough guides:
1. <=20 SNPs: match, virtually identical
2. 21-100 SNPs: inconclusive
3. >100 SNPs: exclude
Bootstrapping: Do the isolates form a unique cluster w/ >= 95% support? Is the cluster distinct from other isolates in the tree?
Results are critically evaluated and not used blindly
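The rough guides above can be written down as a small helper. This is only a sketch of the stated rules of thumb: the middle band (21 to 100 SNPs) is inferred from the two stated bounds, the function names are hypothetical, and the slide itself stresses that no single threshold holds for all species and that results are not used blindly.

```python
def classify_snp_distance(snp_distance: int) -> str:
    """Map a pairwise SNP distance to the rough interpretation bands."""
    if snp_distance < 0:
        raise ValueError("SNP distance cannot be negative")
    if snp_distance <= 20:
        return "match"
    if snp_distance <= 100:
        return "inconclusive"
    return "exclude"


def is_well_supported(bootstrap_percent: float, threshold: float = 95.0) -> bool:
    """True when a cluster's bootstrap support meets the >= 95% guide."""
    return bootstrap_percent >= threshold


if __name__ == "__main__":
    for d in (3, 20, 21, 100, 101):
        print(d, classify_snp_distance(d))
    print(is_well_supported(97.0))
```

A "match" label here would still need to be read together with bootstrap support, tree topology, and epidemiological evidence.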

24 Forensic Needs
   - WGS (SRA) Database: random survey of bacteria not possible, need to continue to grow database and curate genotypes
   - Thresholds for SNPs vs wgMLST: 1 SNP 1 INDEL 1 Recombination
   - Well-documented wgMLST databases

25 Example: E. coli & Flour

28 0-3 SNPs to clinical isolates; 0-3 SNPs to other food/env isolates

30 CFSAN SNP Pipeline

31 Future of GenomeTrakr & CFSAN SNP Pipeline
1. Local or web-based QA/QC and identification tools
   - Detect sample mix-ups and low quality before data is submitted to NCBI/SRA, fix problems more quickly
2. Continue to build WGS databases
   - Better thresholds for identity, increase odds of finding a match
3. Local SNP pipeline analysis
   - Accredited labs don't have to send out data

32 Snapshot of Data
Columns: 3/1 to 4/30 SNP/ERD Clusters; # SNP Clusters; % isolates in SNP clusters (3/2017); Total
Campylobacter
E.coli/Shigella (56%) 1132
Listeria (89%) 356
Salmonella (86%) 2100
* 2 or more isolates within 50 SNPs
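The footnote's cluster definition (2 or more isolates within 50 SNPs) can be sketched with single-linkage grouping over pairwise distances. Single linkage via union-find is an assumption made here for illustration, as are the function name and the toy isolates. This is not a reproduction of the NCBI clustering code.

```python
from typing import Dict, Iterable, List, Set, Tuple


def snp_clusters(
    isolates: Iterable[str],
    snp_dist: Dict[Tuple[str, str], int],
    threshold: int = 50,
) -> List[Set[str]]:
    """Group isolates linked by chains of pairwise distances <= threshold.

    Only groups with two or more isolates are returned.
    """
    isolates = list(isolates)
    parent = {name: name for name in isolates}

    def find(x: str) -> str:
        # Follow parent links to the root, shortening the path on the way.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for (a, b), dist in snp_dist.items():
        if dist <= threshold:
            parent[find(a)] = find(b)

    groups: Dict[str, Set[str]] = {}
    for name in isolates:
        groups.setdefault(find(name), set()).add(name)
    return [members for members in groups.values() if len(members) >= 2]


if __name__ == "__main__":
    # A and C are 55 SNPs apart but join the same cluster through B.
    dist = {("A", "B"): 10, ("B", "C"): 45, ("A", "C"): 55,
            ("A", "D"): 300, ("B", "D"): 310, ("C", "D"): 290}
    print(snp_clusters(["A", "B", "C", "D"], dist))
```

Under single linkage a cluster can contain pairs farther apart than the threshold, which is why within-cluster SNP distances are still examined afterwards.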

33 Acknowledgements
FDA: Center for Food Safety and Applied Nutrition; Center for Veterinary Medicine; Office of Regulatory Affairs
National Institutes of Health: National Center for Biotechnology Information
State Health and University Labs: Alaska, Arizona, California, Florida, Hawaii, Maryland, Minnesota, New Mexico, New York, South Dakota, Texas, Virginia, Washington
USDA/FSIS Eastern Laboratory
CDC Enteric Diseases Laboratory
INEI-ANLIS Carlos Malbran Institute, Argentina
Centre for Food Safety, University College Dublin, Ireland
Food Environmental Research Agency, UK
Public Health England, UK
WHO
Illumina
Pac Bio
CLC Bio
Other independent collaborators


More information

Use of Whole Genome Sequence Analysis to Improve Food Safety and. SUMMARY: The Food Safety and Inspection Service (FSIS), with

Use of Whole Genome Sequence Analysis to Improve Food Safety and. SUMMARY: The Food Safety and Inspection Service (FSIS), with This document is scheduled to be published in the Federal Register on 09/22/2017 and available online at https://federalregister.gov/d/2017-20247, and on FDsys.gov Billing Code 3410-DM-P DEPARTMENT OF

More information

Next Generation Sequencing Applications in Food Safety and Quality

Next Generation Sequencing Applications in Food Safety and Quality Next Generation Sequencing Applications in Food Safety and Quality Our science National and international centre of excellence for interdisciplinary investigation and problem solving across plant and bee

More information

Comparative Genomics Background and Strategy

Comparative Genomics Background and Strategy Comparative Genomics Background and Strategy Faction 1 3/29/2017 Comparative Genomics is Comparison of structure and feature composition Comparison of derived metrics Li et al., 2015; Chaudhry & Patil,

More information

The Sentieon Genomic Tools Improved Best Practices Pipelines for Analysis of Germline and Tumor-Normal Samples

The Sentieon Genomic Tools Improved Best Practices Pipelines for Analysis of Germline and Tumor-Normal Samples The Sentieon Genomic Tools Improved Best Practices Pipelines for Analysis of Germline and Tumor-Normal Samples Andreas Scherer, Ph.D. President and CEO Dr. Donald Freed, Bioinformatics Scientist, Sentieon

More information

SNP calling. Jose Blanca COMAV institute bioinf.comav.upv.es

SNP calling. Jose Blanca COMAV institute bioinf.comav.upv.es SNP calling Jose Blanca COMAV institute bioinf.comav.upv.es SNP calling Genotype matrix Genotype matrix: Samples x SNPs SNPs and errors A change in a read may due to: Sample contamination Cloning or PCR

More information

Single Nucleotide Variant Analysis. H3ABioNet May 14, 2014

Single Nucleotide Variant Analysis. H3ABioNet May 14, 2014 Single Nucleotide Variant Analysis H3ABioNet May 14, 2014 Outline What are SNPs and SNVs? How do we identify them? How do we call them? SAMTools GATK VCF File Format Let s call variants! Single Nucleotide

More information

Pioneering Clinical Omics

Pioneering Clinical Omics Pioneering Clinical Omics Clinical Genomics Strand NGS An analysis tool for data generated by cutting-edge Next Generation Sequencing(NGS) instruments. Strand NGS enables read alignment and analysis of

More information

Data Basics. Josef K Vogt Slides by: Simon Rasmussen Next Generation Sequencing Analysis

Data Basics. Josef K Vogt Slides by: Simon Rasmussen Next Generation Sequencing Analysis Data Basics Josef K Vogt Slides by: Simon Rasmussen 2017 Generalized NGS analysis Sample prep & Sequencing Data size Main data reductive steps SNPs, genes, regions Application Assembly: Compare Raw Pre-

More information

GALAXY TRAKR FOR STATE PUBLIC HEALTH BIOINFORMATICS INTRODUCTORY TRAININGS, DATA ANALYTICS, & BIOINFORMATICS COLLABORATIONS

GALAXY TRAKR FOR STATE PUBLIC HEALTH BIOINFORMATICS INTRODUCTORY TRAININGS, DATA ANALYTICS, & BIOINFORMATICS COLLABORATIONS GALAXY TRAKR FOR STATE PUBLIC HEALTH BIOINFORMATICS INTRODUCTORY TRAININGS, DATA ANALYTICS, & BIOINFORMATICS COLLABORATIONS Kevin G. Libuit, M.S. Senior Informatics Scientist Division of Consolidated Laboratory

More information

Experimental Design Microbial Sequencing

Experimental Design Microbial Sequencing Experimental Design Microbial Sequencing Matthew L. Settles Genome Center Bioinformatics Core University of California, Davis settles@ucdavis.edu; bioinformatics.core@ucdavis.edu General rules for preparing

More information

NGS aspect to the investigation into a Legionella outbreak linked to a cooling tower

NGS aspect to the investigation into a Legionella outbreak linked to a cooling tower NGS aspect to the investigation into a Legionella outbreak linked to a cooling tower What Dangers are Lurking in Your Water? NextGen Sequencing Applications for Waterborne Threats June 8, 2016 Kimberlee

More information

Can whole genome sequencing replace AST?

Can whole genome sequencing replace AST? Can whole genome sequencing replace AST? Matthew J Ellington Antimicrobial Resistance & Healthcare Associated Infections (AMRHAI) Reference Unit Crown copyright EUCAST Subcommittee on the role of whole

More information

From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow

From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow Technical Overview Import VCF Introduction Next-generation sequencing (NGS) studies have created unanticipated challenges with

More information

Why can GBS be complicated? Tools for filtering, error correction and imputation.

Why can GBS be complicated? Tools for filtering, error correction and imputation. Why can GBS be complicated? Tools for filtering, error correction and imputation. Edward Buckler USDA-ARS Cornell University http://www.maizegenetics.net Many Organisms Are Diverse Humans are at the lower

More information

Updates from CDC: Cluster Detection and Reporting Guidelines

Updates from CDC: Cluster Detection and Reporting Guidelines National Center for Emerging and Zoonotic Infectious Diseases Updates from CDC: Cluster Detection and Reporting Guidelines Molly Leeper Salmonella Database Manager PulseNet Western Regional Meeting February

More information

CAPTURE-BASED APPROACH FOR COMPREHENSIVE DETECTION OF IMPORTANT ALTERATIONS

CAPTURE-BASED APPROACH FOR COMPREHENSIVE DETECTION OF IMPORTANT ALTERATIONS CAPTURE-BASE APPROACH FOR COMPREHENSIVE ETECTION OF IMPORTANT ALTERATIONS SEQUENCE MUTATIONS MICROSATELLITE INSTABILITY AMPLIFICATIONS GENOMIC REARRANGEMENTS For Research Use Only. Not for iagnostic Purposes.

More information

Setting Standards and Raising Quality for Clinical Bioinformatics. Joo Wook Ahn, Guy s & St Thomas 04/07/ ACGS summer scientific meeting

Setting Standards and Raising Quality for Clinical Bioinformatics. Joo Wook Ahn, Guy s & St Thomas 04/07/ ACGS summer scientific meeting Setting Standards and Raising Quality for Clinical Bioinformatics Joo Wook Ahn, Guy s & St Thomas 04/07/2016 - ACGS summer scientific meeting 1. Best Practice Guidelines Draft guidelines circulated to

More information

Cory Brouwer, Ph.D. Xiuxia Du, Ph.D. Anthony Fodor, Ph.D.

Cory Brouwer, Ph.D. Xiuxia Du, Ph.D. Anthony Fodor, Ph.D. Cory Brouwer, Ph.D. Dr. Cory R. Brouwer is Director of the Bioinformatics Services Division and Associate Professor of Bioinformatics and Genomics at UNC Charlotte. He and his team provide a wide range

More information

Current Needs for Bioforensics SCIENCE AND TECHNOLOGY BRANCH DISCOVER DEVELOP DELIVER

Current Needs for Bioforensics SCIENCE AND TECHNOLOGY BRANCH DISCOVER DEVELOP DELIVER Current Needs for Bioforensics SCIENCE AND TECHNOLOGY BRANCH DISCOVER DEVELOP DELIVER 1 State Sponsored Programs, Bioterrorism, Biocrime and Evolving Technology - Requires a Responsive Bioforensic Capability

More information

Shannon pipeline plug-in: For human mrna splicing mutations CLC bio Genomics Workbench plug-in CLC bio Genomics Server plug-in Features and Benefits

Shannon pipeline plug-in: For human mrna splicing mutations CLC bio Genomics Workbench plug-in CLC bio Genomics Server plug-in Features and Benefits Shannon pipeline plug-in: For human mrna splicing mutations CLC bio Genomics Workbench plug-in CLC bio Genomics Server plug-in Features and Benefits Cytognomix introduces a line of Shannon pipeline plug-ins

More information

Using the NCBI Pathogen Detection Portal to Aid in Surveillance of Enteric Pathogens

Using the NCBI Pathogen Detection Portal to Aid in Surveillance of Enteric Pathogens Using the NCBI Pathogen Detection Portal to Aid in Surveillance of Enteric Pathogens Bill Wolfgang Bill Klimke Samantha Wirth March 8, 2018 william.wolfgang@health.ny.gov samantha.wirth@health.ny.gov klimke@ncbi.nlm.nih.gov

More information

Evolutionary Genetics: Part 1 Polymorphism in DNA

Evolutionary Genetics: Part 1 Polymorphism in DNA Evolutionary Genetics: Part 1 Polymorphism in DNA S. chilense S. peruvianum Winter Semester 2012-2013 Prof Aurélien Tellier FG Populationsgenetik Color code Color code: Red = Important result or definition

More information

Top 5 Lessons Learned From MAQC III/SEQC

Top 5 Lessons Learned From MAQC III/SEQC Top 5 Lessons Learned From MAQC III/SEQC Weida Tong, Ph.D Division of Bioinformatics and Biostatistics, NCTR/FDA Weida.tong@fda.hhs.gov; 870 543 7142 1 MicroArray Quality Control (MAQC) An FDA led community

More information

Food Safety and Inspection Service A review of laboratory-based regulatory and response activities.

Food Safety and Inspection Service A review of laboratory-based regulatory and response activities. Food Safety and Inspection Service A review of laboratory-based regulatory and response activities. Objective Provide an overview of recently implemented regulatory testing programs for non-o157 shiga-toxin

More information

Analytics Behind Genomic Testing

Analytics Behind Genomic Testing A Quick Guide to the Analytics Behind Genomic Testing Elaine Gee, PhD Director, Bioinformatics ARUP Laboratories 1 Learning Objectives Catalogue various types of bioinformatics analyses that support clinical

More information

Supplementary Figures and Data

Supplementary Figures and Data Supplementary Figures and Data Whole Exome Screening Identifies Novel and Recurrent WISP3 Mutations Causing Progressive Pseudorheumatoid Dysplasia in Jammu and Kashmir India Ekta Rai 1, Ankit Mahajan 2,

More information

Introductie en Toepassingen van Next-Generation Sequencing in de Klinische Virologie. Sander van Boheemen Medical Microbiology

Introductie en Toepassingen van Next-Generation Sequencing in de Klinische Virologie. Sander van Boheemen Medical Microbiology Introductie en Toepassingen van Next-Generation Sequencing in de Klinische Virologie Sander van Boheemen Medical Microbiology Next-generation sequencing Next-generation sequencing (NGS), also known as

More information

Introduction to Bioinformatics

Introduction to Bioinformatics Introduction to Bioinformatics IMBB 2017 RAB, Kigali - Rwanda May 02 13, 2017 Joyce Nzioki Plan for the Week Introduction to Bioinformatics Raw sanger sequence data Introduction to CLC Bio Quality Control

More information

LARGE DATA AND BIOMEDICAL COMPUTATIONAL PIPELINES FOR COMPLEX DISEASES

LARGE DATA AND BIOMEDICAL COMPUTATIONAL PIPELINES FOR COMPLEX DISEASES 1 LARGE DATA AND BIOMEDICAL COMPUTATIONAL PIPELINES FOR COMPLEX DISEASES Ezekiel Adebiyi, PhD Professor and Head, Covenant University Bioinformatics Research and CU NIH H3AbioNet node Covenant University,

More information

Whole Genome Sequencing of Food Isolates

Whole Genome Sequencing of Food Isolates Whole Genome Sequencing of Food Isolates Ai Kataoka, - GMA Danielle Montoya, M.S. - Noblis May 23, 2017 Project Objectives Promote genomic testing in the Food Industry To have participants gain familiarity

More information