Bioinformatics in Public Health Laboratories Duncan MacCannell PhD
|
|
- Wesley Bates
- 6 years ago
- Views:
Transcription
1 Bioinformatics in Public Health Laboratories Duncan MacCannell PhD National Center for Emerging and Zoonotic Infectious Diseases Office of Advanced Molecular Detection
2 What is bioinformatics? 2 The use of computer systems to analyze and interpret complex biological datasets in order to better inform public health action.
3 Why is it necessary? 3
4 NGS: A Disruptive Technology Cost to sequence 1 million basepairs VOLUME OF RAW DATA Maximum output per instrument 4 kilobytes (10 3 bytes) terabytes (10 12 bytes)
5 5
6 NGS Sequencing Technologies Illumina MiSeq//NextSeq/HiSeq IonTorrent PGM/Proton/S5 SHORT READ SEQUENCING 10X Genomics GemCode Illumina TruSeq SLR (moleculo) SYNTHETIC LONG READ 75 to 400bp readlengths* Millions to billions of reads Various error models Relatively inexpensive (~$60/isolate) Issues: Resolving complex structures, phasing Pacific BioSciences RS II Pacific BioSciences Sequel* Oxford Nanopore MinION LONG READ SEQUENCING >3 20kbp readlengths* Hundreds of thousands of reads* High error rates have presented challenges* Roughly $600/isolate (RSII; Nanopore TBD) Other adv: SMRT; methyl; phasing; closing 6
7 NGS Applications in Microbiology Library Data Info. ACAATTTGTGCATAACATGTGGACAGTTTTAATCACATGTGGGTAAATAGTTGTCCACATTTGCTTTTTT TGTCGAAAACCCTATCTCATATACAAACGACGTTTTTAGGTTTTAAAATACGTTTCGTATAAATATACAT TTTATATTTATTAGGTTGTACATTTGTTGCGCAACCTTATTCTTTTACCATCTTAGTAAAGGAGGGACAC CTTTGGAAAATATCTCTGATTTATGGAATAGTGCCTTAAAAGAATTAGAAAAAAAGGTAAGCAAGCCTAG TTATGAAACATGGTTAAAATCAACAACGGCTCATAACTTGAAGAAAGACGTATTAACGATTACAGCTCCA AATGAATTTGCTCGTGACTGGCTAGAATCTCATTACTCAGAACTTATTTCGGAAACACTATACGATTTAA CAGGGGCAAAATTAGCAATTCGCTTTATTATTCCCCAAAGTCAATCGGAAGAGGACATTGATCTTCCTCC AGTTAAGCGGAATCCAGCACAAGATGATTCAGCTCATTTACCACAGAGCATGTTAAATCCAAAATATACA TTTGATACATTTGTTATCGGCTCTGGTAACCGTTTTGCCCATGCAGCTTCATTAGCTGTAGCCGAGGCGC CAGCTAAAGCGTATAATCCACTCTTTATTTATGGGGGAGTTGGGCTTGGAAAGACGCATTTAATGCACGC AATTGGTCATTATGTAATTGAACATAATCCAAATGCAAAAGTTGTATATTTATCATCAGAAAAATTCACG AATGAATTTATTAACTCTATTCGTGATAATAAAGCTGTTGATTTTCGTAATAAATATCGCAACGTAGATG 7 Input: DNA/RNA Source: Genomic Amplicon Whole sample Host/vector/ pathogen/ environment NGS Workflow: Platforms Chemistry Perf. char. Labor/TaT Expertise Cost Increasingly Universal Workflows Working to establish standardized sequencing workflows for a wide range of pathogens. Sample intake Prep/staging Extraction Conversion Library prep Sequencing Bioinformatics Workflow: Hardware/software Specialized skillsets Algorithms/pipelines Pathogen databases Data analysis/interpret/ Integration/visualization A Moving Target Rapidly evolving technology space. Changing hardware and COTS/OSS capabilities. Lots of choice, but lack of consistent standards. BIG DATA. New workforce and skillset is required. File hashes/versioning QA/QC Security Validated methods/databases Skills/proficiency Process logging/audit Standards Reporting Pathogen and application specific, standard and/or compliant assays Output: Information From Sequence Data Comparative Genomics Identification High res Straintyping/Subtyping Cluster identification Molecular evolution Genotypic characterization Virulence, AR, signatures Functional annotation Diagnostic dev/validation Minor populations, quasispecies Host/pathogen expression Metagenomics Pathogen identification/discovery Culture independent diagnostics Microbial ecology/diversity Many results from a single dataset. Faster and cheaper than serial tests.
8 Impact of NGS on Infectious Disease Research Bacteria, fungi and parasites Genomics revolutionary: phylogenetics, molecular evolution and transmission dynamics; broad based characterization; open ended inquiry; insight into poorly characterized genus/species; AR/virulence. Metagenomics (discovery, diagnostics, ecology/population, non culturable) Transcriptomics (host/pathogen/vector) Viruses Deep sequencing (minor var, quasispec, diversity, AR) Metagenomics (discovery, diagnostics, ecology/population, non culturable) Transcriptomics (host/pathogen/vector) 8
9 9
10 10
11 Run QC & Metrics Demultiplexing Read QC FastQC, CLC, FastXToolkit, etc 11 PAN GENOME, wgmlst, AMR PREDICTION, FUNC ANNOTATION Trimming/Filtering Trimmomatic, CLC, kraken, bowtie2,... Reference Mapping Alignment bwa Variant Calling samtools, gatk, varscan, freebayes, Variant Filter/Annot MLST/AMR srst2 Tree Building RAxML, PHYML, Fasttree, ksnp De Novo Assembly WG Alignment mauve, parsnp Annotation/FP prokka, pgaap, etc Tree Building Harvest, ksnp, MLST/AMR abricate, mlst Comparative Genomics Hat tip: N. Loman
12 What does the data actually look like? 3 SAMPLES Illumina Paired End ~47GB compressed ID,flowcell,barcode,lane,pair SEQUENCE DATA Identifier for each read. (Syntax varies Illumina) INSTRUMENT: HISEQ RUN ID: 165 FLOWCELL ID: C1CKRACXX FLOWCELL LANE: 2 TILE: 2201 X,Y: 1257,1980 PAIR: 1 FILTERING: N BARCODE: GAGTGG 1 2 Per base quality score: ASCII 64 or 94 levels LOWEST! (HEX 21) HIGHEST ~ (HEX7E) M 100bp reads in CDD5 Further reading:
13 CLCbio Genomics Workbench 13 For demonstration; does not imply endorsement.
14 NGS Quality Assessment: FastQC 14
15 Genome Assembly and Mapping 15 INCLUDES: Sequence(s) of interest, Contaminants MAY NOT INCLUDE: Poorly sequenceable regions MAY NOT RESOLVE: repeats, rearrangements, complex structure
16 16 Mapped assembly For demonstration purposes only. All copyrights belong to their respective owners.
17 Reference Guided (Mapped) Assembly Contig 1 Contig 2 1X Reference Sequence/Genome 17 Coverage 18X Low sequence coverage UNMAPPED READS 1. Sequences not present in the reference. 2. Plasmids or other extrachromosomal. 3. DNA Structural Variation/Rearrangement ADVANTAGES: Relatively fast, well suited to highly conserved genomes. DISADVANTAGES: Issues with high diversity, mobile elements, linear reference Example software: BWA ( breseq (
18 Mapping Example: MTB (ref: h37rv) 18 Mapping parameters to consider: mismatch, insertion, deletion, extension, cutoffs, algorithm
19 Example Application: WG SNP Typing Reference Sequence/Core Genome 1 A C T AG A A T C C T T AG AC Advantages: adaptable, highly discriminatory, good for cluster investigations. Disadvantages: not well suited for surveillance or studies where reference or allele set may shift over time; limited additional data. T T ACTAGA ACTAGT TCTACT Example software: SNIPPY ( LYVE SET ( SET), SNP Pipeline ( Biostatistics/snp pipeline), others.
20 20 De Novo Assembly For demonstration purposes only. All copyrights belong to their respective owners.
21 De Novo Assembly Contig 1 Contig 2 Contig 3 21 Contig 4 Contig 5 Contig 6 Contig 7 PLASMID ADVANTAGES: Reference agnostic: assembles all the reads it can. Various algorithms. DISADVANTAGES: Doesn t always get things right. Particularly with complex repeats. Example software: SPAdes ( List:
22 Example De Novo: MTB contigs; 126 contigs >1.5kbases in length 4.35Mbases (h37rv=4.41mbases); N50: 59kbases
23 Example application: wgmlst 23 Conventional MLST Sequencing and assignment of alleles for 6 or 7 bacterial housekeeping genes. Sequence type on the basis of allelic differences. wgmlst (PulseNet, others) Expanded scheme: Typically includes all relevant open reading frames across core or pangenome Denovo assembly, sequence typing of >1000 loci Advantages: discriminatory; reproducible; results in consistent, hierarchical nomenclature; corollary information; data are reasonably portable. Disadvantages: may have decreased discriminatory power, particularly with clustered genomic variations; requires initial dev/curation of allele db; computationally intensive. Example software: SRST2 (
24 Getting Started with Bioinformatics For demonstration purposes only. 24
25 Setting the Stage: Scientific Computing 25 General requirements: High performance computing: o Scientific workstation (8+ cores, >16GB RAM) Linux or OSX pref o Access to high performance computing cluster(s) and private/public cloud resources helpful Storage capacity (10TB+) o Backup & data management plan o Access for instruments & users High speed networking (>1Gb) o Instruments, storage, compute Bare bones: workstation, VM. Bioinformatics, sysadmin and software engineering expertise
26 Bioinformatics Software: F/OSS versus COTS 26 Free and Open Source Software (F/OSS) The source code for many bioinformatic tools are publicly released & linked to publication. o Various licenses, govern use/modification/attribution o Use of DOIs for sourcecode and software Multiple public repositories: o Github, SourceForge, project/laboratory websites. Most tools written for Linux/OSX command line, some windows and crossplatform Friendliness of the code varies; ongoing support. Commercial off the shelf (COTS) BioNumerics, CLCBio, DNAStar NGEN, Geneious Professional, MATLAB, SAS JMP Genomics, Vendor supported, many provide a GUI for F/OSS algorithms Much more user friendly than most CLI tools, features Some software has hybrid model (eg: non commercial use)
27 Bioinformatic Software Packages YES YES YES 27 Wyres et al. Pathogens 2014, 3(2), ; doi: /pathogens
28 Run QC & Metrics Demultiplexing Read QC FastQC, CLC, FastXToolkit, etc 28 PAN GENOME, wgmlst, AMR PREDICTION, FUNC ANNOTATION Trimming/Filtering Trimmomatic, CLC, kraken, bowtie2,... Reference Mapping Alignment bwa Variant Calling samtools, gatk, varscan, freebayes, Variant Filter/Annot MLST/AMR srst2 Tree Building RAxML, PHYML, Fasttree, ksnp De Novo WG Alignment mauve, parsnp Annotation/FP prokka, pgaap, etc Tree Building Harvest, ksnp, MLST/AMR abricate, mlst Comparative Genomics Hat tip: N. Loman
29 Pipelines: Automating Workflows 29
30 Pipelines: Automating Workflows 30 Development considerations: OS and hardware environment, dependencies, parallization, flexibility, modularity, documentation, support,
31 CLCbio Genomics: Workflows 31
32 BioNumerics: Workflows 32
33 Geneious: Workflows 33
34 GALAXY 34
35 Common Bioinformatics Platforms: Next Generation PulseNet LIMS Database managers and end users Sequencer Raw sequences PH databases Closed to the public Nomenclature server Allele databases Currently CDC (Ideally public domain) Genus/species Serotype Pathotype Virulence Resistance 7 gene MLST emlst cgmlst wgmlst Data pathway Analysis request External storage NCBI, ENA, BaseSpace Calculation engine Trimming, mapping, de novo assembly, SNP detection, allele detection Currently CDC (Ideally public domain) A number of other CDC programs are evaluating the use of BioNumerics infrastructure: mening, TB, GC, legionella, strep, HAIs, malaria, HIV 35
36 AMD Portal: Standardizing Data Flow STATE PHL PHL USER COMMUNITY AMD PORTAL INFLUENZA TB Etc. HPC CLUSTERS STORAGE ARRAY (3.6PB) CDC High Performance Computing CoE CDC USER COMMUNITY 36
37 Developing A Bioinformatics Workforce for Public Health 37 Like microbiology and epidemiology, good bioinformatics is increasingly critical to good PH. Challenges with recruitment and retention Skilled bioinformaticians are a highly sought after resource. Career bioinformaticians with relevant public health/microbiology experience are an extremely rare breed. It is more common to find microbiologists with on the job bioinformatics experience expand/enable through training. Limited state/federal workforce support (job series, competencies, training ) for new technical labor categories Technical and informatics challenges (compute/storage/net) Interdisciplinary communication is vital: translating between program scientific needs and bioinformatics technical specialists. Team based approaches, hub and spoke model. AMD training and capacity building for PHL: stay tuned!
38 38
39 39 Application deadline for class: April 1,
40 Links and Further Reading 40 Further reading Loman NJ, Pallen M. Twenty years of bacterial genome sequencing. Nat Rev Microbiol Dec;13(12): doi: /nrmicro3565 Kwong JC, McCallum N, Sintchenko V, Howden BP. Whole genome sequencing in clinical and public health microbiology. Pathology Apr;47(3): doi: /PAT Free and open source software: NCBI Genome Workbench ( SRST2 ( Prokka/snippy/arbricate ( BWA ( GATK ( SPAdes ( HARVEST ( ksnp ( PhyloViz ( Samtools ( FreeBayes ( BIGSdb ( RAxML ( RAxML); Mauve ( GEPHI ( BioPerl ( BioPython ( R ( project.org) Commercial software: BioNumerics ( maths.com); CLCbio Genomics Workbench ( DNAStar Lasergene NGEN ( Geneious Professional (
41 Advanced Molecular Detection FOLLOW US ON THE WEB: FOLLOW US ON 41
42 42
43 Questions? For more information please contact Centers for Disease Control and Prevention 1600 Clifton Road NE, Atlanta, GA Telephone: CDC INFO ( )/TTY: E mail: fms2@cdc.gov Web: The findings and conclusions in this report are those of the authors and do not necessarily represent the official position of the Centers for Disease Control and Prevention. National Center for Emerging and Zoonotic Infectious Diseases Office of Advanced Molecular Detection
CDC s Advanced Molecular Detection (AMD) Sequence Data Analysis and Management
CDC s Advanced Molecular Detection (AMD) Sequence Data Analysis and Management Scott Sammons Technology Officer Office of Advanced Molecular Detection National Center for Emerging and Zoonotic Infectious
More informationIntroduction to NGS Analysis Tools
National Center for Emerging and Zoonotic Infectious Diseases Introduction to NGS Analysis Tools Heather Carleton, PhD, MPH Team Lead, Enteric Diseases Bioinformatics, Enteric Diseases Laboratory Branch,
More informationThe Basics of Understanding Whole Genome Next Generation Sequence Data
The Basics of Understanding Whole Genome Next Generation Sequence Data Heather Carleton-Romer, MPH, Ph.D. ASM-CDC Infectious Disease and Public Health Microbiology Postdoctoral Fellow PulseNet USA Next
More informationThe Basics of Understanding Whole Genome Next Generation Sequence Data
The Basics of Understanding Whole Genome Next Generation Sequence Data Heather Carleton, MPH, Ph.D. ASM-CDC Infectious Disease and Public Health Microbiology Postdoctoral Fellow PulseNet USA Next Generation
More informationBioinformatics and Public Health
Bioinformatics and Public Health Scott Sammons Technology Officer Office of Advanced Molecular Detection (detail) Team Lead, Bioinformatics NCEZID/DSR/BCFB January 12, 2015 National Center for Emerging
More informationValidating Bionumerics 7.6: A strategic approach from Oregon
Validating Bionumerics 7.6: A strategic approach from Oregon Karim Morey, MS, M(ASCP) Oregon State Public Health Laboratory PulseNet West Coast Regional Meeting February 2019 Outline Compliance requirements
More informationNext generation sequencing in diagnostic laboratories: opportunities and challenges
Next generation sequencing in diagnostic laboratories: opportunities and challenges Vitali Sintchenko Marie Bashir Institute for Emerging Infectious Diseases & Biosecurity Declaration No conflict of interest
More informationSetting the Course: Virginia's experience navigating information technology and bioinformatics needs for whole genome sequencing
Setting the Course: Virginia's experience navigating information technology and bioinformatics needs for whole genome sequencing Lauren Turner, Ph.D. Virginia Division of Consolidated Laboratory Services
More informationIntroduction to PulseNet WGS Tools in BioNumerics v7.6
National Center for Emerging and Zoonotic Infectious Diseases Introduction to PulseNet WGS Tools in BioNumerics v7.6 Steven Stroika PulseNet CDC PulseNet/OutbreakNet Regional Meeting February 2019 Overview
More informationIntroductie en Toepassingen van Next-Generation Sequencing in de Klinische Virologie. Sander van Boheemen Medical Microbiology
Introductie en Toepassingen van Next-Generation Sequencing in de Klinische Virologie Sander van Boheemen Medical Microbiology Next-generation sequencing Next-generation sequencing (NGS), also known as
More informationRue Juliette Wytsmanstraat Brussels Belgium T F
Kevin Vanneste, PhD Bioinformatics Platform Platform Biotechnology and Molecular Biology Department Expertise, Service Provision and Customer Relations Collaboration between the EURL-VTEC and the Platform
More informationNext Generation Sequencing. Tobias Österlund
Next Generation Sequencing Tobias Österlund tobiaso@chalmers.se NGS part of the course Week 4 Friday 13/2 15.15-17.00 NGS lecture 1: Introduction to NGS, alignment, assembly Week 6 Thursday 26/2 08.00-09.45
More informationData Intensive Biomedical Research: The EU RL VTEC efforts to take up the NGS challenge. EU RL for E. coli Annual Workshop 2015
Data Intensive Biomedical Research: The EU RL VTEC efforts to take up the NGS challenge EU RL for E. coli Annual Workshop 2015 NGS adoption: Worldwide Source: Omicsmap.com November, 2015 Data Production
More informationBioinformatics- Data Analysis
Bioinformatics- Data Analysis Erin H. Graf, PhD, D(ABMM) Infectious Disease Diagnostics Laboratory, Children s Hospital of Philadelphia Department of Pathology and Laboratory Medicine, University of Pennsylvania
More informationDeveloping Tools for Rapid and Accurate Post-Sequencing Analysis of Foodborne Pathogens. Mitchell Holland, Noblis
Developing Tools for Rapid and Accurate Post-Sequencing Analysis of Foodborne Pathogens Mitchell Holland, Noblis Agenda Introduction Whole Genome Sequencing Analysis Pipeline Sequence Alignment SNPs and
More informationEURL WORKING GROUP ON WHOLE GENOME SEQUENCING AND PULSENET INTERNATIONAL
EURL WORKING GROUP ON WHOLE GENOME SEQUENCING AND PULSENET INTERNATIONAL EURL-Campylobacter workshop, 9/10-2018 Joakim Skarin, SVA Objectives of the WG-NGS To promote the use of NGS across the EURL networks
More informationCurrent status of universal whole genome sequencing of Mycobacterium tuberculosis in the United States
Current status of universal whole genome sequencing of Mycobacterium tuberculosis in the United States Lauren Cowan, PhD Medical Consultant Meeting San Antonio, TX November 29-30, 2018 1 EXCELLENCE EXPERTISE
More informationTranscriptomics analysis with RNA seq: an overview Frederik Coppens
Transcriptomics analysis with RNA seq: an overview Frederik Coppens Platforms Applications Analysis Quantification RNA content Platforms Platforms Short (few hundred bases) Long reads (multiple kilobases)
More information2014 APHL Next Generation Sequencing (NGS) Survey
APHL would like you to complete the Next Generation Sequencing (NGS) in Public Health Laboratories Survey. The purpose of this survey is to collect information on current capacities for NGS testing and
More informationTruSPAdes: analysis of variations using TruSeq Synthetic Long Reads (TSLR)
tru TruSPAdes: analysis of variations using TruSeq Synthetic Long Reads (TSLR) Anton Bankevich Center for Algorithmic Biotechnology, SPbSU Sequencing costs 1. Sequencing costs do not follow Moore s law
More informationThe Genome Analysis Centre. Building Excellence in Genomics and Computa5onal Bioscience
Building Excellence in Genomics and Computa5onal Bioscience Resequencing approaches Sarah Ayling Crop Genomics and Diversity sarah.ayling@tgac.ac.uk Why re- sequence plants? To iden
More informationIntroduction to Next Generation Sequencing (NGS) Data Analysis and Pathway Analysis. Jenny Wu
Introduction to Next Generation Sequencing (NGS) Data Analysis and Pathway Analysis Jenny Wu Outline Introduction to NGS data analysis in Cancer Genomics NGS applications in cancer research Typical NGS
More informationUsing Galaxy for the analysis of NGS-derived pathogen genomes in clinical microbiology
Using Galaxy for the analysis of NGS-derived pathogen genomes in clinical microbiology Anthony Underwood*, Paul-Michael Agapow, Michel Doumith and Jonathan Green. Bioinformatics Unit, Health Protection
More informationExperimental Design. Sequencing. Data Quality Control. Read mapping. Differential Expression analysis
-Seq Analysis Quality Control checks Reproducibility Reliability -seq vs Microarray Higher sensitivity and dynamic range Lower technical variation Available for all species Novel transcript identification
More informationWhole Genome Sequence Data Quality Control and Validation
Whole Genome Sequence Data Quality Control and Validation GoSeqIt ApS / Ved Klædebo 9 / 2970 Hørsholm VAT No. DK37842524 / Phone +45 26 97 90 82 / Web: www.goseqit.com / mail: mail@goseqit.com Table of
More informationCBC Data Therapy. Metagenomics Discussion
CBC Data Therapy Metagenomics Discussion General Workflow Microbial sample Generate Metaomic data Process data (QC, etc.) Analysis Marker Genes Extract DNA Amplify with targeted primers Filter errors,
More informationUsing New ThiNGS on Small Things. Shane Byrne
Using New ThiNGS on Small Things Shane Byrne Next Generation Sequencing New Things Small Things NGS Next Generation Sequencing = 2 nd generation of sequencing 454 GS FLX, SOLiD, GAIIx, HiSeq, MiSeq, Ion
More informationAnalytics Behind Genomic Testing
A Quick Guide to the Analytics Behind Genomic Testing Elaine Gee, PhD Director, Bioinformatics ARUP Laboratories 1 Learning Objectives Catalogue various types of bioinformatics analyses that support clinical
More informationESCMID Online Lecture Library. by author
ESCMID WS Rapid NGS for Characterization and Typing of Resistant Gram-Negative Bacilli 7-9 October 2015 João André Carriço, Microbiology Institute and Instituto de Medicina Molecular, Faculty of Medicine,
More informationFrom Bands to Base Pairs: Implementation of WGS in a PulseNet Laboratory
From Bands to Base Pairs: Implementation of WGS in a PulseNet Laboratory Sara Wagner Microbiologist WI State Lab of Hygiene InFORM Meeting Nov 19, 2015 Objectives Describe WGS implementation at WSLH What
More informationLARGE DATA AND BIOMEDICAL COMPUTATIONAL PIPELINES FOR COMPLEX DISEASES
1 LARGE DATA AND BIOMEDICAL COMPUTATIONAL PIPELINES FOR COMPLEX DISEASES Ezekiel Adebiyi, PhD Professor and Head, Covenant University Bioinformatics Research and CU NIH H3AbioNet node Covenant University,
More informationCanada's IRIDA platform for genomic epidemiology. Gary Van Domselaar Chief, Bioinformatics National Microbiology Lab Public Health Agency of Canada
Canada's IRIDA platform for genomic epidemiology Gary Van Domselaar Chief, Bioinformatics National Microbiology Lab Public Health Agency of Canada Integrated Rapid Infectious Disease Analysis informatics
More informationG E N OM I C S S E RV I C ES
GENOMICS SERVICES ABOUT T H E N E W YOR K G E NOM E C E N T E R NYGC is an independent non-profit implementing advanced genomic research to improve diagnosis and treatment of serious diseases. Through
More informationC3BI. VARIANTS CALLING November Pierre Lechat Stéphane Descorps-Declère
C3BI VARIANTS CALLING November 2016 Pierre Lechat Stéphane Descorps-Declère General Workflow (GATK) software websites software bwa picard samtools GATK IGV tablet vcftools website http://bio-bwa.sourceforge.net/
More informationDevelopment and Implementation of a Quality System for Next-Generation Sequencing
Development and Implementation of a Quality System for Next-Generation Sequencing Lauren Turner, PhD Lead Scientist Virginia Division of Consolidated Laboratory Services DCLS Phased Implementation of NGS
More informationTargeted Sequencing in the NBS Laboratory
Targeted Sequencing in the NBS Laboratory Christopher Greene, PhD Newborn Screening and Molecular Biology Branch Division of Laboratory Sciences Gene Sequencing in Public Health Newborn Screening February
More informationA year in clinical bioinformatics
Division of Clinical Microbiology A year in clinical bioinformatics Helena Seth-Smith, PhD October 2018 ICCMg " the application of next generation sequencing to clinical samples in order to recover information
More informationIntroduction to DNA-Sequencing
informatics.sydney.edu.au sih.info@sydney.edu.au The Sydney Informatics Hub provides support, training, and advice on research data, analyses and computing. Talk to us about your computing infrastructure,
More informationAaron Liston, Oregon State University Botany 2012 Intro to Next Generation Sequencing Workshop
Output (bp) Aaron Liston, Oregon State University Growth in Next-Gen Sequencing Capacity 3.5E+11 2002 2004 2006 2008 2010 3.0E+11 2.5E+11 2.0E+11 1.5E+11 1.0E+11 Adapted from Mardis, 2011, Nature 5.0E+10
More informationNext-Generation Sequencing. Technologies
Next-Generation Next-Generation Sequencing Technologies Sequencing Technologies Nicholas E. Navin, Ph.D. MD Anderson Cancer Center Dept. Genetics Dept. Bioinformatics Introduction to Bioinformatics GS011062
More informationsolid S Y S T E M s e q u e n c i n g See the Difference Discover the Quality Genome
solid S Y S T E M s e q u e n c i n g See the Difference Discover the Quality Genome See the Difference With a commitment to your peace of mind, Life Technologies provides a portfolio of robust and scalable
More informationILLUMINA SEQUENCING SYSTEMS
ILLUMINA SEQUENCING SYSTEMS PROVEN QUALITY. TRUSTED SOLUTIONS. Every day, researchers are using Illumina next-generation sequencing (NGS) systems to better understand human health and disease, as well
More informationDNA concentration and purity were initially measured by NanoDrop 2000 and verified on Qubit 2.0 Fluorometer.
DNA Preparation and QC Extraction DNA was extracted from whole blood or flash frozen post-mortem tissue using a DNA mini kit (QIAmp #51104 and QIAmp#51404, respectively) following the manufacturer s recommendations.
More informationNOW GENERATION SEQUENCING. Monday, December 5, 11
NOW GENERATION SEQUENCING 1 SEQUENCING TIMELINE 1953: Structure of DNA 1975: Sanger method for sequencing 1985: Human Genome Sequencing Project begins 1990s: Clinical sequencing begins 1998: NHGRI $1000
More informationWhole Genome Sequencing for food safety FSA Chief Scientific Advisor Report and 2013 Listeria pilot study
Whole Genome Sequencing for food safety FSA Chief Scientific Advisor Report and 2013 Listeria pilot study Dr Edward Hayes Date: July 2016, Version 1 Foodborne Pathogens 280,000 cases of Campylobacter,
More informationAnalysing genomes and transcriptomes using Illumina sequencing
Analysing genomes and transcriptomes using Illumina uencing Dr. Heinz Himmelbauer Centre for Genomic Regulation (CRG) Ultrauencing Unit Barcelona The Sequencing Revolution High-Throughput Sequencing 2000
More informationPractical quality control for whole genome sequencing in clinical microbiology
Practical quality control for whole genome sequencing in clinical microbiology John WA Rossen, PhD, MMM Department of Medical Microbiology, University of Groningen, UMCG, Groningen, The Netherlands Disclosure
More informationWhole-Genome Sequencing (WGS) for Food Safety
Whole-Genome Sequencing (WGS) for Food Safety Errol Strain, Ph.D. Director, Biostatistics and Bioinformatics Staff Center for Food Safety and Applied Nutrition U.S. Food Drug Administration IFSH Meeting
More informationCourse Presentation. Ignacio Medina Presentation
Course Index Introduction Agenda Analysis pipeline Some considerations Introduction Who we are Teachers: Marta Bleda: Computational Biologist and Data Analyst at Department of Medicine, Addenbrooke's Hospital
More informationDeep Sequencing technologies
Deep Sequencing technologies Gabriela Salinas 30 October 2017 Transcriptome and Genome Analysis Laboratory http://www.uni-bc.gwdg.de/index.php?id=709 Microarray and Deep-Sequencing Core Facility University
More informationNext-Generation Sequencing Services à la carte
Next-Generation Sequencing Services à la carte www.seqme.eu ngs@seqme.eu SEQme 2017 All rights reserved The trademarks and names of other companies and products mentioned in this brochure are the property
More informationFrom Lab Bench to Supercomputer: Advanced Life Sciences Computing. John Fonner, PhD Life Sciences Computing
From Lab Bench to Supercomputer: Advanced Life Sciences Computing John Fonner, PhD Life Sciences Computing A Decade s Progress in DNA Sequencing 2003: ABI 3730 Sequencer Human Genome: $2.7 Billion, 13
More informationNextSeq 500 System WGS Solution
NextSeq 500 System WGS Solution An accessible, high-quality whole-genome sequencing solution for any species. Highlights High-Quality, High-Coverage Genome Illumina chemistry offers highest read quality
More informationWhole Genome Sequencing for Enteric Pathogen Surveillance and Outbreak Investigations
Whole Genome Sequencing for Enteric Pathogen Surveillance and Outbreak Investigations Anne Maki, Manager, Enteric, Environmental, Molecular Surveillance and Bacterial Sexually Transmitted Infections, Public
More informationDe Novo and Hybrid Assembly
On the PacBio RS Introduction The PacBio RS utilizes SMRT technology to generate both Continuous Long Read ( CLR ) and Circular Consensus Read ( CCS ) data. In this document, we describe sequencing the
More informationWelcome to the NGS webinar series
Welcome to the NGS webinar series Webinar 1 NGS: Introduction to technology, and applications NGS Technology Webinar 2 Targeted NGS for Cancer Research NGS in cancer Webinar 3 NGS: Data analysis for genetic
More informationOvercome limitations with RNA-Seq
Buyer s Guide Simple, customized RNA-Seq workflows Evaluating options for next-generation RNA sequencing Overcome limitations with RNA-Seq Next-generation sequencing (NGS) has revolutionized the study
More informationExperimental Design Microbial Sequencing
Experimental Design Microbial Sequencing Matthew L. Settles Genome Center Bioinformatics Core University of California, Davis settles@ucdavis.edu; bioinformatics.core@ucdavis.edu General rules for preparing
More informationDNA. bioinformatics. genomics. personalized. variation NGS. trio. custom. assembly gene. tumor-normal. de novo. structural variation indel.
DNA Sequencing T TM variation DNA amplicon mendelian trio genomics NGS bioinformatics tumor-normal custom SNP resequencing target validation de novo prediction personalized comparative genomics exome private
More informationLooking Ahead: Improving Workflows for SMRT Sequencing
Looking Ahead: Improving Workflows for SMRT Sequencing Jonas Korlach FIND MEANING IN COMPLEXITY Pacific Biosciences, the Pacific Biosciences logo, PacBio, SMRT, and SMRTbell are trademarks of Pacific Biosciences
More informationIntroduction to the MiSeq
Introduction to the MiSeq 2011 Illumina, Inc. All rights reserved. Illumina, illuminadx, BeadArray, BeadXpress, cbot, CSPro, DASL, Eco, Genetic Energy, GAIIx, Genome Analyzer, GenomeStudio, GoldenGate,
More informationSMRT Analysis Barcoding Overview (v6.0.0)
SMRT Analysis Barcoding Overview (v6.0.0) Introduction This document applies to PacBio RS II and Sequel Systems using SMRT Link v6.0.0. Note: For information on earlier versions of SMRT Link, see the document
More informationSequencing technologies. Jose Blanca COMAV institute bioinf.comav.upv.es
Sequencing technologies Jose Blanca COMAV institute bioinf.comav.upv.es Outline Sequencing technologies: Sanger 2nd generation sequencing: 3er generation sequencing: 454 Illumina SOLiD Ion Torrent PacBio
More informationIntroduction. Highlights. Prepare Library Sequence Analyze Data
BaseSpace Sequence Hub Genomics cloud computing expands opportunities for biological discovery, making analysis and storage of next-generation sequencing data accessible to any scientist. Highlights Centralized
More informationMicrobial sequencing solutions
Microbial sequencing solutions Scalable, simple, fast TARGETED GENOME Sequencing f every lab, every budget, every application Ion Trent semiconduct sequencing Ion Trent technology has pioneered an entirely
More informationNGS in Pathology Webinar
NGS in Pathology Webinar NGS Data Analysis March 10 2016 1 Topics for today s presentation 2 Introduction Next Generation Sequencing (NGS) is becoming a common and versatile tool for biological and medical
More informationVTEC strains typing: from traditional methods to NGS
VTEC strains typing: from traditional methods to NGS 2 nd course on bioinformatics tools for Next Generation Sequencing data mining: use of bioinformatics tools for typing pathogenic E. coli ISS, Rome
More informationOverview of CIDT Challenges and Opportunities
Overview of CIDT Challenges and Opportunities Peter Gerner-Smidt, MD, DSc Enteric Diseases Laboratory Branch InFORM II Phoenix, AZ, 19 November 2015 National Center for Emerging and Zoonotic Infectious
More informationAssay Validation Services
Overview PierianDx s assay validation services bring clinical genomic tests to market more rapidly through experimental design, sample requirements, analytical pipeline optimization, and criteria tuning.
More informationCNV and variant detection for human genome resequencing data - for biomedical researchers (II)
CNV and variant detection for human genome resequencing data - for biomedical researchers (II) Chuan-Kun Liu 劉傳崑 Senior Maneger National Center for Genome Medican bioit@ncgm.sinica.edu.tw Abstract Common
More informationData Analysis with CASAVA v1.8 and the MiSeq Reporter
Data Analysis with CASAVA v1.8 and the MiSeq Reporter Eric Smith, PhD Bioinformatics Scientist September 15 th, 2011 2010 Illumina, Inc. All rights reserved. Illumina, illuminadx, Solexa, Making Sense
More informationWelcome to the Bioinformatics Workshop!
Welcome to the Bioinformatics Workshop! UCD Genome Center Bioinformatics Core training.bioinformatics@ucdavis.edu bioinformatics.core@ucdavis.edu August 29, 2016 First Notes Non-UCD Folks can log into
More informationChallenges and opportunities for whole genome sequencing based surveillance of antibiotic resistance
Challenges and opportunities for whole genome sequencing based surveillance of antibiotic resistance Prof. Willem van Schaik Professor in Microbiology and Infection Institute of Microbiology and Infection
More informationDe Novo Assembly of High-throughput Short Read Sequences
De Novo Assembly of High-throughput Short Read Sequences Chuming Chen Center for Bioinformatics and Computational Biology (CBCB) University of Delaware NECC Third Skate Genome Annotation Workshop May 23,
More informationFUTURE PROSPECTS IN MOLECULAR INFECTIOUS DISEASES DIAGNOSIS
FUTURE PROSPECTS IN MOLECULAR INFECTIOUS DISEASES DIAGNOSIS Richard L. Hodinka, Ph.D. University of South Carolina School of Medicine Greenville Greenville Health System, Greenville, SC hodinka@greenvillemed.sc.edu
More informationSanger vs Next-Gen Sequencing
Tools and Algorithms in Bioinformatics GCBA815/MCGB815/BMI815, Fall 2017 Week-8: Next-Gen Sequencing RNA-seq Data Analysis Babu Guda, Ph.D. Professor, Genetics, Cell Biology & Anatomy Director, Bioinformatics
More informationHigh Throughput Sequencing the Multi-Tool of Life Sciences. Lutz Froenicke DNA Technologies and Expression Analysis Cores UCD Genome Center
High Throughput Sequencing the Multi-Tool of Life Sciences Lutz Froenicke DNA Technologies and Expression Analysis Cores UCD Genome Center Complementary Approaches Illumina Still-imaging of clusters (~1000
More informationTREE CODE PRODUCT BROCHURE
TREE CODE PRODUCT BROCHURE Single Molecule, Real-Time (SMRT) Sequencing technology offers: Long read sequencing ~10 Gb with 20 kb average read lengths for WGS ~20 Gb with 40 kb average read length for
More informationBeef Industry Safety Summit Renaissance Austin Hotel 9721 Arboretum Blvd. Austin, TX March 1-3
1 USDA, Food Safety and Inspection Service Beef Industry Safety Summit - 2016 Renaissance Austin Hotel 9721 Arboretum Blvd. Austin, TX 78759 March 1-3 Uday Dessai MPH, MS, PhD Senior Public Health Advisor
More informationDetecting Clusters and Reporting Results
National Center for Emerging and Zoonotic Infectious Diseases Detecting Clusters and Reporting Results Beth Tolar Salmonella Database Coordinator PulseNet Central Regional Meeting March 2019 Update to
More informationSequencing technologies. Jose Blanca COMAV institute bioinf.comav.upv.es
Sequencing technologies Jose Blanca COMAV institute bioinf.comav.upv.es Outline Sequencing technologies: Sanger 2nd generation sequencing: 3er generation sequencing: 454 Illumina SOLiD Ion Torrent PacBio
More informationDE NOVO WHOLE GENOME ASSEMBLY AND SEQUENCING OF THE SUPERB FAIRYWREN. (Malurus cyaneus) JOSHUA PEÑALBA LEO JOSEPH CRAIG MORITZ ANDREW COCKBURN
DE NOVO WHOLE GENOME ASSEMBLY AND SEQUENCING OF THE SUPERB FAIRYWREN (Malurus cyaneus) JOSHUA PEÑALBA LEO JOSEPH CRAIG MORITZ ANDREW COCKBURN ... 2014 2015 2016 2017 ... 2014 2015 2016 2017 Synthetic
More informationNGS, omics and applied bioinformatics at CVI The first steps in NGS
NGS, omics and applied bioinformatics at CVI The first steps in NGS Alex Bossers, Freddy de Bree, and Frank Harders et al. Department of Infection Biology Pathogenomics and Applied Bioinformatics group
More informationHigh Cross-Platform Genotyping Concordance of Axiom High-Density Microarrays and Eureka Low-Density Targeted NGS Assays
High Cross-Platform Genotyping Concordance of Axiom High-Density Microarrays and Eureka Low-Density Targeted NGS Assays Ali Pirani and Mohini A Patil ISAG July 2017 The world leader in serving science
More informationNext Generation Sequencing Lecture Saarbrücken, 19. March Sequencing Platforms
Next Generation Sequencing Lecture Saarbrücken, 19. March 2012 Sequencing Platforms Contents Introduction Sequencing Workflow Platforms Roche 454 ABI SOLiD Illumina Genome Anlayzer / HiSeq Problems Quality
More informationSequencing technologies. Jose Blanca COMAV institute bioinf.comav.upv.es
Sequencing technologies Jose Blanca COMAV institute bioinf.comav.upv.es Outline Sequencing technologies: Sanger 2nd generation sequencing: 3er generation sequencing: 454 Illumina SOLiD Ion Torrent PacBio
More informationRequest for Proposal for Bioinformatics Services in Support of Next Generation/Whole Genome DNA Sequencing
Request for Proposal for Bioinformatics Services in Support of Next Generation/Whole Genome DNA Sequencing Proposal Deadline: 4:00pm PST, Friday, March 9, 2018 Revised: 4:00 p.m. PST, Friday, March 23,
More informationIntroduction to CGE tools
Introduction to CGE tools Pimlapas Leekitcharoenphon (Shinny) Research Group of Genomic Epidemiology, DTU-Food. WHO Collaborating Centre for Antimicrobial Resistance in Foodborne Pathogens and Genomics.
More informationThe MiniSeq System. Explore the possibilities. Discover demonstrated NGS workflows for molecular biology applications.
The MiniSeq System. Explore the possibilities. Discover demonstrated NGS workflows for molecular biology applications. Let your work flow with Illumina NGS. The MiniSeq System delivers powerful and cost-effective
More informationGENOMICS WORKFLOW SOLUTIONS THAT GO WHERE THE SCIENCE LEADS. Genomics Solutions Portfolio
GENOMICS WORKFLOW SOLUTIONS THAT GO WHERE THE SCIENCE LEADS Genomics Solutions Portfolio WORKFLOW SOLUTIONS FROM EXTRACTION TO ANALYSIS Application-based answers for every step of your workflow Scientists
More informationGALAXY TRAKR FOR STATE PUBLIC HEALTH BIOINFORMATICS INTRODUCTORY TRAININGS, DATA ANALYTICS, & BIOINFORMATICS COLLABORATIONS
GALAXY TRAKR FOR STATE PUBLIC HEALTH BIOINFORMATICS INTRODUCTORY TRAININGS, DATA ANALYTICS, & BIOINFORMATICS COLLABORATIONS Kevin G. Libuit, M.S. Senior Informatics Scientist Division of Consolidated Laboratory
More informationPacBio. The world s first single molecule, real-time DNA sequencer
PacBio The world s first single molecule, real-time DNA sequencer A revolutionary third generation DNA sequencing system incorporating novel single molecule sequencing with unprecedented readlengths to
More informationDe novo whole genome assembly
De novo whole genome assembly Qi Sun Bioinformatics Facility Cornell University Sequencing platforms Short reads: o Illumina (150 bp, up to 300 bp) Long reads (>10kb): o PacBio SMRT; o Oxford Nanopore
More informationwww.illumina.com/hiseq www.illumina.com FOR RESEARCH USE ONLY 2012 2014 Illumina, Inc. All rights reserved. Illumina, BaseSpace, cbot, CSPro, Genetic Energy, HiSeq, Nextera, TruSeq, the pumpkin orange
More informationQuestionnaire on the use of High Throughput Sequencing, Bioinformatics and Computational Genomics (HTS-BCG) in the OIE Reference Centre network
Questionnaire on the use of High Throughput Sequencing, Bioinformatics and Computational Genomics (HTS-BCG) in the OIE Reference Centre network Massimo Palmarini MRC-University of Glasgow Centre for Virus
More informationGenome Resequencing. Rearrangements. SNPs, Indels CNVs. De novo genome Sequencing. Metagenomics. Exome Sequencing. RNA-seq Gene Expression
Genome Resequencing De novo genome Sequencing SNPs, Indels CNVs Rearrangements Metagenomics RNA-seq Gene Expression Splice Isoform Abundance High Throughput Short Read Sequencing: Illumina Exome Sequencing
More informationFrom Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow
From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow Technical Overview Import VCF Introduction Next-generation sequencing (NGS) studies have created unanticipated challenges with
More informationGENOMICS WORKFLOW SOLUTIONS THAT GO WHERE THE SCIENCE LEADS. Genomics Solutions Portfolio
GENOMICS WORKFLOW SOLUTIONS THAT GO WHERE THE SCIENCE LEADS Genomics Solutions Portfolio WORKFLOW SOLUTIONS FROM EXTRACTION TO ANALYSIS Application-based answers for every step of your workflow Scientists
More informationDNBseq TM SERVICE OVERVIEW Plant and Animal Whole Genome Re-Sequencing
TM SERVICE OVERVIEW Plant and Animal Whole Genome Re-Sequencing Plant and animal whole genome re-sequencing (WGRS) involves sequencing the entire genome of a plant or animal and comparing the sequence
More informationIntroduction to iplant Collaborative Jinyu Yang Bioinformatics and Mathematical Biosciences Lab
Introduction to iplant Collaborative Jinyu Yang Bioinformatics and Mathematical Biosciences Lab May/27 th /2016 About iplant What is iplant Collaborative? 1. A virtual organization 2. Created by a cooperative
More information