Streaming algorithms for real-time analysis of Oxford Nanopore sequencing data

Size: px
Start display at page:

Download "Streaming algorithms for real-time analysis of Oxford Nanopore sequencing data"

Transcription

1 Streaming algorithms for real-time analysis of Oxford Nanopore sequencing data Minh Duc Cao Institute for Molecular Bioscience The University of Queensland London Calling 26 May 27, 26 Minh Duc Cao, The University of Queensland London Calling 26

2 Streaming algorithms Real-time analysis: Answer the biological questions quickly e.g., infection diagnosis Run sequencing only until the answers are obtained Decide complementary experiments Save time, save money Streaming algorithms: Process data input as a stream Continuously make inference and update the confident level Are robust to noise and scalable to massive data sets Minh Duc Cao, The University of Queensland London Calling 26

3 Real-time analysis workflow DNA extraction Library preparation Sequencing setup 2 hours 2.5 hours.5 hours Run simultaneously Scaffold Assemblies Fastq extraction Species typing Basecalling (Metrichor) Strain typing MinION sequencing Resistance profile Minh Duc Cao, The University of Queensland London Calling 26

4 Fastq extraction DNA extraction Library preparation Sequencing setup 2 hours 2.5 hours.5 hours Fastq extraction Basecalling (Metrichor) MinION sequencing Scaffold Species Strain Resistance Assemblies typing typing profile (Cao et al, 25): Bioinformatics, DOI:.93/bioinformatics/btv658 Minh Duc Cao, The University of Queensland London Calling 26

5 Scaffold and complete genome assemblies Stream of long reads BWA MEM Pre assemblies aligning repeats con nuing process Stream of alignment records pairing Stream of bridges connec ng Extending scaffolds output in real me (Cao et al, 26): biorxiv, DOI:./54783 Minh Duc Cao, The University of Queensland London Calling 26

6 MinION sequencing Sequence two K. pneumoniae strains with the MinION: Strain Phred Quality Scores Read Length Distribution Emp. errors BAA-246 Del: 9.5% (NDM- strain) Ins: 6.3% Mis: 5.3% Chemistry R7 Unaligned: Sep % 33-X Coverage 3883 Del: 7.9% (type strain) Ins: 6.% Mis: 2.9% Chemistry R7.3 Unaligned: Dec 24 % 8-X coverage Minh Duc Cao, The University of Queensland London Calling 26

7 Scaffolding and completing genome results Contigs K. pneumoniae BAA-246 (33X) K. pneumoniae 3883 (8X) Coverage (-fold) Coverage (-fold) 3.. S. cerevisae W33 (9X) Coverage (-fold).6.3 N5 (Mb) Minh Duc Cao, The University of Queensland London Calling 26

8 Scaffolding and completing genome results Contigs K. pneumoniae BAA-246 (33X) K. pneumoniae 3883 (8X) Coverage (-fold) Coverage (-fold) 3.. S. cerevisae W33 (9X) Coverage (-fold).6.3 N5 (Mb) Methods #Contig N5 (Mb) Errors /Kb CPUhrs #Contig N5 (Mb) Errors /Kb CPUhrs #Contig N5 (Mb) Misassemblies Misassemblies Misassemblies Errors /Kb CPUhrs SPAdes Hybrid SSPACE LINK npscarf NaS Nanocorr Minh Duc Cao, The University of Queensland London Calling 26

9 Pathogenicity island reconstruction SapF SapD SapC inta munim YqaJ rect ParA LexA CII ydau dnac IS26 aada sul ebr GCN5-like IS6 IS26 hin ltra umud umuc FRG Contigs Sequence data required to join 5Mb 5Mb 54Mb 54Mb 54Mb 54Mb 54Mb 65Mb 65Mb Minh Duc Cao, The University of Queensland London Calling 26

10 Pathogen identification (Cao et al, 25): biorxiv, DOI:./9356 Minh Duc Cao, The University of Queensland London Calling 26

11 Bacteria identification Proportion.5 Species typing K. pneumoniae ATCC BAA-246 2, 2 3 4,5, 5 K. pneumoniae ATCC , ,5, 5 Mixture 75% E.coli+25% S. aureus 2, ,5, 5 Yield (reads) Sequencing yield K. pneumoniae K. pneumoniae E. coli E. coli S. aureus Minh Duc Cao, The University of Queensland London Calling 26

12 Bacteria identification Proportion.5 Species typing K. pneumoniae ATCC BAA-246 2, 2 3 4,5, 5 K. pneumoniae ATCC , ,5, 5 Mixture 75% E.coli+25% S. aureus 2, ,5, 5 Yield (reads) Sequencing yield K. pneumoniae K. pneumoniae E. coli E. coli S. aureus Strain typing K. pneumoniae ATCC BAA-246 3, K. pneumoniae ATCC , Mixture 75% E.coli+25% S. aureus e) Mixture 75% E.coli+25% S. aureus 3, 3, Probability.5 2,,.5 2,,.5 2,,.5 2,, Yield (reads) K. pneumoniae ST K. pneumoniae ST3 E. coli ST73 E. coli ST526 S. aureus ST243 S. aureus ST2 S. aureus ST46 S. aureus ST29 Minh Duc Cao, The University of Queensland London Calling 26

13 Antibiotic resistance identification K. pneumoniae BAA-246 (NDM- strain): 27 genes Time genes Sens. Spec. Data (mins) (%) (%) (reads) 3 mpha 228 blashv stra blatem strb blactx blalen 263 sul2 blaoxa aac3 aac6 blacmy blacfe blalat blabil Time genes Sens. Spec. Data (mins) (%) (%) (reads) 9 QnrB 3844 aada oqxa teta oqxb dfra blaokp rmtc sul 322 sul fosa blandm Minh Duc Cao, The University of Queensland London Calling 26

14 A glimpse of R9 flowcells Sequence on R9 flowcell the mixture of E. coli ESBL (4%), E. faecium VRE-VanA (2%) S. aureus MRSA (2%) S. aureus VRSA (2%) Minh Duc Cao, The University of Queensland London Calling 26

15 A glimpse of R9 flowcells Time genes Data (mins) (reads) erma 23 ermg spc nora 5 Van 252 VanH VanS 2 dfra 2968 aac6 VanX tetu aada sul sul3 aadd 3 fusb 3964 mpha 4 VanY 5754 cat VanZ VanA Time genes Data (mins) (reads) 5 tetm 684 tets 2 aph msrc meca 5 catpc fosa blaoxa 24 blactx 538 cfr 48 blaz 242 vgaa 78 vgaalc 2684 ermc aac3 blacmy blalat blabil 2 dfra Minh Duc Cao, The University of Queensland London Calling 26

16 Summary and outlook Scaffold and complete bacterial assemblies with < 3-fold coverage Identify pathogen species and strain with reads (<.5 hours sequencing) Detect antibiotic resistance profile in a few hours of sequencing We expect the times to be significantly shortened: Higher throughput with upcoming models: MinION MkII, PromethION. Quicker library preparation Software available Minh Duc Cao, The University of Queensland London Calling 26

17 Acknowledgements Lachlan Coin Matthew Cooper Devika Ganesamoorthy Son Nguyen Alysha Elliott Tania Duarte Vivian Zhang Nick Hamilton Derek Benson Dominique Gorse Michael Thang Minh Duc Cao, The University of Queensland London Calling 26

Practical quality control for whole genome sequencing in clinical microbiology

Practical quality control for whole genome sequencing in clinical microbiology Practical quality control for whole genome sequencing in clinical microbiology John WA Rossen, PhD, MMM Department of Medical Microbiology, University of Groningen, UMCG, Groningen, The Netherlands Disclosure

More information

Scaffolding and Completing Genome Assemblies in Real-time with Nanopore Sequencing

Scaffolding and Completing Genome Assemblies in Real-time with Nanopore Sequencing biorxiv preprint first posted online May. 22, 16; doi: http://dx.doi.org/.11/04783. The copyright holder for this preprint Scaffolding and Completing Genome Assemblies in Real-time with Nanopore Sequencing

More information

Ganwu Li Assistant Professor Veterinary Diagnostic Lab at Iowa State University

Ganwu Li Assistant Professor Veterinary Diagnostic Lab at Iowa State University Applications of NGS in Clinical Microbiology Ganwu Li Assistant Professor Veterinary Diagnostic Lab at Iowa State University Detect and Identify Pathogens Total reads FastQC and host reads remove Clean

More information

Understanding The rising tide of resistance.

Understanding The rising tide of resistance. Understanding The rising tide of resistance Tolemanma@cardiff.ac.uk Talk Overview Antibiotic resistance and bacterial fitness Decay in bacteria CTX-M genes and parallels to NDM-1 Some reasons for the success

More information

GALAXY TRAKR FOR STATE PUBLIC HEALTH BIOINFORMATICS INTRODUCTORY TRAININGS, DATA ANALYTICS, & BIOINFORMATICS COLLABORATIONS

GALAXY TRAKR FOR STATE PUBLIC HEALTH BIOINFORMATICS INTRODUCTORY TRAININGS, DATA ANALYTICS, & BIOINFORMATICS COLLABORATIONS GALAXY TRAKR FOR STATE PUBLIC HEALTH BIOINFORMATICS INTRODUCTORY TRAININGS, DATA ANALYTICS, & BIOINFORMATICS COLLABORATIONS Kevin G. Libuit, M.S. Senior Informatics Scientist Division of Consolidated Laboratory

More information

MinION, GridION, how does Nanopore technology meet the needs of our users?

MinION, GridION, how does Nanopore technology meet the needs of our users? MinION, GridION, how does Nanopore technology meet the needs of our users? Journée Long Reads GeT 28 Novembre 2017 Catherine Zanchetta & Maxime Manno get@genotoul.fr @get_genotoul 1 Wet Lab Nanopore technology

More information

Mate-pair library data improves genome assembly

Mate-pair library data improves genome assembly De Novo Sequencing on the Ion Torrent PGM APPLICATION NOTE Mate-pair library data improves genome assembly Highly accurate PGM data allows for de Novo Sequencing and Assembly For a draft assembly, generate

More information

Next- gen sequencing. STAMPS 2015 Hilary G. Morrison Joe Vineis, Nora Downey, Be>e Hecox- Lea, Kim Finnegan

Next- gen sequencing. STAMPS 2015 Hilary G. Morrison Joe Vineis, Nora Downey, Be>e Hecox- Lea, Kim Finnegan Next- gen sequencing STAMPS 2015 Hilary G. Morrison Joe Vineis, Nora Downey, Be>e Hecox- Lea, Kim Finnegan QuesIons What is the difference between standard and next- gen sequencing? How is next- gen sequencing

More information

AUDREY FARBOS JEREMIE POSCHMANN PAUL O NEILL KONRAD PASZKIEWICZ KAREN MOORE

AUDREY FARBOS JEREMIE POSCHMANN PAUL O NEILL KONRAD PASZKIEWICZ KAREN MOORE We provide: AUDREY FARBOS JEREMIE POSCHMANN PAUL O NEILL KONRAD PASZKIEWICZ KAREN MOORE State of the art genomics and bioinformatics analysis Training in experimental techniques, analysis and modelling

More information

Data Basics. Josef K Vogt Slides by: Simon Rasmussen Next Generation Sequencing Analysis

Data Basics. Josef K Vogt Slides by: Simon Rasmussen Next Generation Sequencing Analysis Data Basics Josef K Vogt Slides by: Simon Rasmussen 2017 Generalized NGS analysis Sample prep & Sequencing Data size Main data reductive steps SNPs, genes, regions Application Assembly: Compare Raw Pre-

More information

De Novo and Hybrid Assembly

De Novo and Hybrid Assembly On the PacBio RS Introduction The PacBio RS utilizes SMRT technology to generate both Continuous Long Read ( CLR ) and Circular Consensus Read ( CCS ) data. In this document, we describe sequencing the

More information

Real world applications of whole genome sequencing. Henrik Hasman Bacteria, parasites and fungi Statens Serum Institut, Denmark

Real world applications of whole genome sequencing. Henrik Hasman Bacteria, parasites and fungi Statens Serum Institut, Denmark Real world applications of whole genome sequencing Henrik Hasman Bacteria, parasites and fungi Statens Serum Institut, Denmark REAL WORLD APPLICATIONS or application of WGS for outbreak detection, national

More information

Whole Genome Sequencing for food safety FSA Chief Scientific Advisor Report and 2013 Listeria pilot study

Whole Genome Sequencing for food safety FSA Chief Scientific Advisor Report and 2013 Listeria pilot study Whole Genome Sequencing for food safety FSA Chief Scientific Advisor Report and 2013 Listeria pilot study Dr Edward Hayes Date: July 2016, Version 1 Foodborne Pathogens 280,000 cases of Campylobacter,

More information

Hybrid Error Correction and De Novo Assembly with Oxford Nanopore

Hybrid Error Correction and De Novo Assembly with Oxford Nanopore Hybrid Error Correction and De Novo Assembly with Oxford Nanopore Michael Schatz Jan 13, 2015 PAG Bioinformatics @mike_schatz / #PAGXXIII Oxford Nanopore MinION Thumb drive sized sequencer powered over

More information

Outline General NGS background and terms 11/14/2016 CONFLICT OF INTEREST. HLA region targeted enrichment. NGS library preparation methodologies

Outline General NGS background and terms 11/14/2016 CONFLICT OF INTEREST. HLA region targeted enrichment. NGS library preparation methodologies Eric T. Weimer, PhD, D(ABMLI) Assistant Professor, Pathology & Laboratory Medicine, UNC School of Medicine Director, Molecular Immunology Associate Director, Clinical Flow Cytometry, HLA, and Immunology

More information

RNA sequencing with the MinION at Genoscope

RNA sequencing with the MinION at Genoscope RNA sequencing with the MinION at Genoscope Jean-Marc Aury jmaury@genoscope.cns.fr @J_M_Aury December 13, 2017 RNA workshop, Genoscope Overview Genoscope Overview MinION sequencing at Genoscope RNA-Seq

More information

High Throughput Sequencing Technologies. UCD Genome Center Bioinformatics Core Monday 15 June 2015

High Throughput Sequencing Technologies. UCD Genome Center Bioinformatics Core Monday 15 June 2015 High Throughput Sequencing Technologies UCD Genome Center Bioinformatics Core Monday 15 June 2015 Sequencing Explosion www.genome.gov/sequencingcosts http://t.co/ka5cvghdqo Sequencing Explosion 2011 PacBio

More information

Introductie en Toepassingen van Next-Generation Sequencing in de Klinische Virologie. Sander van Boheemen Medical Microbiology

Introductie en Toepassingen van Next-Generation Sequencing in de Klinische Virologie. Sander van Boheemen Medical Microbiology Introductie en Toepassingen van Next-Generation Sequencing in de Klinische Virologie Sander van Boheemen Medical Microbiology Next-generation sequencing Next-generation sequencing (NGS), also known as

More information

Human Genome Sequencing Over the Decades The capacity to sequence all 3.2 billion bases of the human genome (at 30X coverage) has increased

Human Genome Sequencing Over the Decades The capacity to sequence all 3.2 billion bases of the human genome (at 30X coverage) has increased Human Genome Sequencing Over the Decades The capacity to sequence all 3.2 billion bases of the human genome (at 30X coverage) has increased exponentially since the 1990s. In 2005, with the introduction

More information

Current'Advances'in'Sequencing' Technology' James'Gurtowski' Schatz'Lab'

Current'Advances'in'Sequencing' Technology' James'Gurtowski' Schatz'Lab' Current'Advances'in'Sequencing' Technology' James'Gurtowski' Schatz'Lab' Outline' 1. Assembly'Review' 2. Pacbio' Technology'Overview' Data'CharacterisFcs' Algorithms' Results' 'Assemblies' 3. Oxford'Nanopore'

More information

Ten Minute, Reagent-Free identification of Bacteria Containing Resistance Genes Using a Rapid Intrinsic Fluorescence Method

Ten Minute, Reagent-Free identification of Bacteria Containing Resistance Genes Using a Rapid Intrinsic Fluorescence Method 548 Ten Minute, Reagent-Free identification of Bacteria Containing Resistance Genes Using a Rapid Intrinsic Fluorescence Method R. Rozen-Sadowsky 1, A. Shinderman 1, D. Gohman 1, D. Shimonov 1, Y. Gluckman

More information

Next Gen Sequencing. Expansion of sequencing technology. Contents

Next Gen Sequencing. Expansion of sequencing technology. Contents Next Gen Sequencing Contents 1 Expansion of sequencing technology 2 The Next Generation of Sequencing: High-Throughput Technologies 3 High Throughput Sequencing Applied to Genome Sequencing (TEDed CC BY-NC-ND

More information

Next Generation Sequences & Chloroplast Assembly. 8 June, 2012 Jongsun Park

Next Generation Sequences & Chloroplast Assembly. 8 June, 2012 Jongsun Park Next Generation Sequences & Chloroplast Assembly 8 June, 2012 Jongsun Park Table of Contents 1 History of Sequencing Technologies 2 Genome Assembly Processes With NGS Sequences 3 How to Assembly Chloroplast

More information

Matthew Tinning Australian Genome Research Facility. July 2012

Matthew Tinning Australian Genome Research Facility. July 2012 Next-Generation Sequencing: an overview of technologies and applications Matthew Tinning Australian Genome Research Facility July 2012 History of Sequencing Where have we been? 1869 Discovery of DNA 1909

More information

Introduction to NGS Analysis Tools

Introduction to NGS Analysis Tools National Center for Emerging and Zoonotic Infectious Diseases Introduction to NGS Analysis Tools Heather Carleton, PhD, MPH Team Lead, Enteric Diseases Bioinformatics, Enteric Diseases Laboratory Branch,

More information

Next Generation Sequencing. Jeroen Van Houdt - Leuven 13/10/2017

Next Generation Sequencing. Jeroen Van Houdt - Leuven 13/10/2017 Next Generation Sequencing Jeroen Van Houdt - Leuven 13/10/2017 Landmarks in DNA sequencing 1953 Discovery of DNA double helix structure 1977 A Maxam and W Gilbert "DNA seq by chemical degradation" F Sanger"DNA

More information

The Genome Analysis Centre. Building Excellence in Genomics and Computa5onal Bioscience

The Genome Analysis Centre. Building Excellence in Genomics and Computa5onal Bioscience Building Excellence in Genomics and Computa5onal Bioscience Resequencing approaches Sarah Ayling Crop Genomics and Diversity sarah.ayling@tgac.ac.uk Why re- sequence plants? To iden

More information

Next-generation sequencing Technology Overview

Next-generation sequencing Technology Overview Next-generation sequencing Technology Overview UQ Winter School 2018 Christopher Noune, PhD AGRF Melbourne christopher.noune@agrf.org.au What is NGS? Ion Torrent PGM (Thermo-Fisher) MiSeq (Illumina) High-Throughput

More information

Challenges and opportunities for whole genome sequencing based surveillance of antibiotic resistance

Challenges and opportunities for whole genome sequencing based surveillance of antibiotic resistance Challenges and opportunities for whole genome sequencing based surveillance of antibiotic resistance Prof. Willem van Schaik Professor in Microbiology and Infection Institute of Microbiology and Infection

More information

Using New ThiNGS on Small Things. Shane Byrne

Using New ThiNGS on Small Things. Shane Byrne Using New ThiNGS on Small Things Shane Byrne Next Generation Sequencing New Things Small Things NGS Next Generation Sequencing = 2 nd generation of sequencing 454 GS FLX, SOLiD, GAIIx, HiSeq, MiSeq, Ion

More information

NEXT GENERATION SEQUENCING. Farhat Habib

NEXT GENERATION SEQUENCING. Farhat Habib NEXT GENERATION SEQUENCING HISTORY HISTORY Sanger Dominant for last ~30 years 1000bp longest read Based on primers so not good for repetitive or SNPs sites HISTORY Sanger Dominant for last ~30 years 1000bp

More information

Lecture 7. Next-generation sequencing technologies

Lecture 7. Next-generation sequencing technologies Lecture 7 Next-generation sequencing technologies Next-generation sequencing technologies General principles of short-read NGS Construct a library of fragments Generate clonal template populations Massively

More information

Whole Genome Sequence Data Quality Control and Validation

Whole Genome Sequence Data Quality Control and Validation Whole Genome Sequence Data Quality Control and Validation GoSeqIt ApS / Ved Klædebo 9 / 2970 Hørsholm VAT No. DK37842524 / Phone +45 26 97 90 82 / Web: www.goseqit.com / mail: mail@goseqit.com Table of

More information

TruSPAdes: analysis of variations using TruSeq Synthetic Long Reads (TSLR)

TruSPAdes: analysis of variations using TruSeq Synthetic Long Reads (TSLR) tru TruSPAdes: analysis of variations using TruSeq Synthetic Long Reads (TSLR) Anton Bankevich Center for Algorithmic Biotechnology, SPbSU Sequencing costs 1. Sequencing costs do not follow Moore s law

More information

Nature Biotechnology: doi: /nbt Supplementary Figure 1. Read Complexity

Nature Biotechnology: doi: /nbt Supplementary Figure 1. Read Complexity Supplementary Figure 1 Read Complexity A) Density plot showing the percentage of read length masked by the dust program, which identifies low-complexity sequence (simple repeats). Scrappie outputs a significantly

More information

Get to Know Your DNA. Every Single Fragment.

Get to Know Your DNA. Every Single Fragment. HaloPlex HS NGS Target Enrichment System Get to Know Your DNA. Every Single Fragment. High sensitivity detection of rare variants using molecular barcodes How Does Molecular Barcoding Work? HaloPlex HS

More information

Sequencing techniques

Sequencing techniques Sequencing techniques Workshop on Whole Genome Sequencing and Analysis, 2-4 Oct. 2017 Learning objective: After this lecture, you should be able to account for different techniques for whole genome sequencing

More information

Illumina (Solexa) Throughput: 4 Tbp in one run (5 days) Cheapest sequencing technology. Mismatch errors dominate. Cost: ~$1000 per human genme

Illumina (Solexa) Throughput: 4 Tbp in one run (5 days) Cheapest sequencing technology. Mismatch errors dominate. Cost: ~$1000 per human genme Illumina (Solexa) Current market leader Based on sequencing by synthesis Current read length 100-150bp Paired-end easy, longer matepairs harder Error ~0.1% Mismatch errors dominate Throughput: 4 Tbp in

More information

Terabases of long-read sequence data, analysed in real time. Available now

Terabases of long-read sequence data, analysed in real time. Available now Terabases of long-read sequence data, analysed in real time Available now The PromethION is a real game changer. Combining ultra-long reads with high sequence output for the production of contiguous, highquality

More information

Next Generation Sequencing. Tobias Österlund

Next Generation Sequencing. Tobias Österlund Next Generation Sequencing Tobias Österlund tobiaso@chalmers.se NGS part of the course Week 4 Friday 13/2 15.15-17.00 NGS lecture 1: Introduction to NGS, alignment, assembly Week 6 Thursday 26/2 08.00-09.45

More information

Sequencing Theory. Brett E. Pickett, Ph.D. J. Craig Venter Institute

Sequencing Theory. Brett E. Pickett, Ph.D. J. Craig Venter Institute Sequencing Theory Brett E. Pickett, Ph.D. J. Craig Venter Institute Applications of Genomics and Bioinformatics to Infectious Diseases GABRIEL Network Agenda Sequencing Instruments Sanger Illumina Ion

More information

De novo whole genome assembly

De novo whole genome assembly De novo whole genome assembly Qi Sun Bioinformatics Facility Cornell University Sequencing platforms Short reads: o Illumina (150 bp, up to 300 bp) Long reads (>10kb): o PacBio SMRT; o Oxford Nanopore

More information

Introduction to metagenome assembly. Bas E. Dutilh Metagenomic Methods for Microbial Ecologists, NIOO September 18 th 2014

Introduction to metagenome assembly. Bas E. Dutilh Metagenomic Methods for Microbial Ecologists, NIOO September 18 th 2014 Introduction to metagenome assembly Bas E. Dutilh Metagenomic Methods for Microbial Ecologists, NIOO September 18 th 2014 Sequencing specs* Method Read length Accuracy Million reads Time Cost per M 454

More information

Whole-genome sequencing (WGS) of microbes employing nextgeneration sequencing (NGS) technologies enables pathogen

Whole-genome sequencing (WGS) of microbes employing nextgeneration sequencing (NGS) technologies enables pathogen Application Note Microbial whole-genome sequencing A novel, single-tube enzymatic fragmentation and library construction method enables fast turnaround times and improved data quality for microbial whole-genome

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION SUPPLEMENTARY INFORMATION Supplementary Note 1. Stress-testing the encoder/decoder. We summarize our experience decoding two files that were sequenced with nanopore technologyerror! Reference source not

More information

Detection and characterization of extended spectrum β-lactamase producing Escherichia coli from poultry of eastern India

Detection and characterization of extended spectrum β-lactamase producing Escherichia coli from poultry of eastern India Detection and characterization of extended spectrum β-lactamase producing Escherichia coli from poultry of eastern India Dr. Samiran Bandyopadhyay Scientist Indian Veterinary Research Institute Eastern

More information

High Throughput Sequencing Technologies. J Fass UCD Genome Center Bioinformatics Core Monday September 15, 2014

High Throughput Sequencing Technologies. J Fass UCD Genome Center Bioinformatics Core Monday September 15, 2014 High Throughput Sequencing Technologies J Fass UCD Genome Center Bioinformatics Core Monday September 15, 2014 Sequencing Explosion www.genome.gov/sequencingcosts http://t.co/ka5cvghdqo Sequencing Explosion

More information

High Throughput Sequencing Technologies. J Fass UCD Genome Center Bioinformatics Core Tuesday December 16, 2014

High Throughput Sequencing Technologies. J Fass UCD Genome Center Bioinformatics Core Tuesday December 16, 2014 High Throughput Sequencing Technologies J Fass UCD Genome Center Bioinformatics Core Tuesday December 16, 2014 Sequencing Explosion www.genome.gov/sequencingcosts http://t.co/ka5cvghdqo Sequencing Explosion

More information

Metagenomic 3C, full length 16S amplicon sequencing on Illumina, and the diabetic skin microbiome

Metagenomic 3C, full length 16S amplicon sequencing on Illumina, and the diabetic skin microbiome Also: Sunaina Melissa Gardiner UTS Catherine Burke UTS Michael Liu UTS Chris Beitel UTS, UC Davis Matt DeMaere UTS Metagenomic 3C, full length 16S amplicon sequencing on Illumina, and the diabetic skin

More information

Bioinformatic analysis of Illumina sequencing data for comparative genomics Part I

Bioinformatic analysis of Illumina sequencing data for comparative genomics Part I Bioinformatic analysis of Illumina sequencing data for comparative genomics Part I Dr David Studholme. 18 th February 2014. BIO1033 theme lecture. 1 28 February 2014 @davidjstudholme 28 February 2014 @davidjstudholme

More information

Quality assurance in NGS (diagnostics)

Quality assurance in NGS (diagnostics) Quality assurance in NGS (diagnostics) Chris Mattocks National Genetics Reference Laboratory (Wessex) Research Diagnostics Quality assurance Any systematic process of checking to see whether a product

More information

Nanopore sequencing How it works

Nanopore sequencing How it works Product 1 Nanopore sequencing How it works The nanopore processes the length of DNA or RNA presented to it. The user can control fragment length through the library preparation protocol utilised. (e.g.

More information

Next-Generation Sequencing. Technologies

Next-Generation Sequencing. Technologies Next-Generation Next-Generation Sequencing Technologies Sequencing Technologies Nicholas E. Navin, Ph.D. MD Anderson Cancer Center Dept. Genetics Dept. Bioinformatics Introduction to Bioinformatics GS011062

More information

CBC Data Therapy. Metatranscriptomics Discussion

CBC Data Therapy. Metatranscriptomics Discussion CBC Data Therapy Metatranscriptomics Discussion Metatranscriptomics Extract RNA, subtract rrna Sequence cdna QC Gene expression, function Institute for Systems Genomics: Computational Biology Core bioinformatics.uconn.edu

More information

A Guide to Consed Michelle Itano, Carolyn Cain, Tien Chusak, Justin Richner, and SCR Elgin.

A Guide to Consed Michelle Itano, Carolyn Cain, Tien Chusak, Justin Richner, and SCR Elgin. 1 A Guide to Consed Michelle Itano, Carolyn Cain, Tien Chusak, Justin Richner, and SCR Elgin. Main Window Figure 1. The Main Window is the starting point when Consed is opened. From here, you can access

More information

Nanopore long read sequencing for detection of point mutations and structural variants

Nanopore long read sequencing for detection of point mutations and structural variants Nanopore long read sequencing for detection of point mutations and structural variants Viapath Genetics, Guy s Hospital, London Presented by: Kezia Brown Nanopore for long read sequencing Why do we need

More information

Whole Human Genome Sequencing Report This is a technical summary report for PG DNA

Whole Human Genome Sequencing Report This is a technical summary report for PG DNA Whole Human Genome Sequencing Report This is a technical summary report for PG0002601-DNA Physician and Patient Information Physician name: Vinodh Naraynan Address: Suite 406 222 West Thomas Road Phoenix

More information

HLA-Typing Strategies

HLA-Typing Strategies HLA-Typing Strategies Cologne, 13.5.2017 Joannis Mytilineos MD, PhD Department of Transplantation Immunology Institute for Clinical Transfusion Medicine and Immunogenetics German Red Cross Blood Transfusion

More information

Our goal is to enable the analysis of any living thing, by any person, in any environment.

Our goal is to enable the analysis of any living thing, by any person, in any environment. Technology 1 2 Our goal is to enable the analysis of any living thing, by any person, in any environment. Nanopore DNA and direct RNA sequencing has been performed on board the International Space Station.

More information

BIOINFORMATICS ORIGINAL PAPER

BIOINFORMATICS ORIGINAL PAPER BIOINFORMATICS ORIGINAL PAPER Vol. 27 no. 21 2011, pages 2957 2963 doi:10.1093/bioinformatics/btr507 Genome analysis Advance Access publication September 7, 2011 : fast length adjustment of short reads

More information

Introduction to Bioinformatics

Introduction to Bioinformatics Introduction to Bioinformatics Alla L Lapidus, Ph.D. SPbSU St. Petersburg Term Bioinformatics Term Bioinformatics was invented by Paulien Hogeweg (Полина Хогевег) and Ben Hesper in 1970 as "the study of

More information

Oxford Nanopore Sequencing and de novo Assembly of a Eukaryotic Genome Supplemental Notes and Figures

Oxford Nanopore Sequencing and de novo Assembly of a Eukaryotic Genome Supplemental Notes and Figures Oxford Nanopore Sequencing and de novo Assembly of a Eukaryotic Genome Supplemental Notes and Figures Table of Contents Supplemental Note 1. Flowcell performance... 2 Supplemental Note 2. Nanopore sequencing

More information

SUPPLEMENTARY MATERIAL FOR THE PAPER: RASCAF: IMPROVING GENOME ASSEMBLY WITH RNA-SEQ DATA

SUPPLEMENTARY MATERIAL FOR THE PAPER: RASCAF: IMPROVING GENOME ASSEMBLY WITH RNA-SEQ DATA SUPPLEMENTARY MATERIAL FOR THE PAPER: RASCAF: IMPROVING GENOME ASSEMBLY WITH RNA-SEQ DATA Authors: Li Song, Dhruv S. Shankar, Liliana Florea Table of contents: Figure S1. Methods finding contig connections

More information

Genetics Lecture 21 Recombinant DNA

Genetics Lecture 21 Recombinant DNA Genetics Lecture 21 Recombinant DNA Recombinant DNA In 1971, a paper published by Kathleen Danna and Daniel Nathans marked the beginning of the recombinant DNA era. The paper described the isolation of

More information

Faction 2: Genome Assembly Lab and Preliminary Data

Faction 2: Genome Assembly Lab and Preliminary Data Faction 2: Genome Assembly Lab and Preliminary Data [Computational Genomics 2017] Christian Colon, Erisa Sula, David Lu, Tian Jin, Lijiang Long, Rohini Mopuri, Bowen Yang, Saminda Wijeratne, Harrison Kim

More information

SUMMARY. Key words: antibioticresistance, Enterobacteriaceae, ESBL, CTX-M,

SUMMARY. Key words: antibioticresistance, Enterobacteriaceae, ESBL, CTX-M, SUMMARY Key words: antibioticresistance, Enterobacteriaceae, ESBL, CTX-M, The doctoral thesis entitled Prevalence of Enterobacteriaceae producing extendedspectrum beta-lactamases (ESBL) isolated from broilers

More information

Understanding the science and technology of whole genome sequencing

Understanding the science and technology of whole genome sequencing Understanding the science and technology of whole genome sequencing Dag Undlien Department of Medical Genetics Oslo University Hospital University of Oslo and The Norwegian Sequencing Centre d.e.undlien@medisin.uio.no

More information

Aaron Liston, Oregon State University Botany 2012 Intro to Next Generation Sequencing Workshop

Aaron Liston, Oregon State University Botany 2012 Intro to Next Generation Sequencing Workshop Output (bp) Aaron Liston, Oregon State University Growth in Next-Gen Sequencing Capacity 3.5E+11 2002 2004 2006 2008 2010 3.0E+11 2.5E+11 2.0E+11 1.5E+11 1.0E+11 Adapted from Mardis, 2011, Nature 5.0E+10

More information

Sequencing technologies. Jose Blanca COMAV institute bioinf.comav.upv.es

Sequencing technologies. Jose Blanca COMAV institute bioinf.comav.upv.es Sequencing technologies Jose Blanca COMAV institute bioinf.comav.upv.es Outline Sequencing technologies: Sanger 2nd generation sequencing: 3er generation sequencing: 454 Illumina SOLiD Ion Torrent PacBio

More information

Workflow of de novo assembly

Workflow of de novo assembly Workflow of de novo assembly Experimental Design Clean sequencing data (trim adapter and low quality sequences) Run assembly software for contiging and scaffolding Evaluation of assembly Several iterations:

More information

Nanopore-based DNA sequencing in clinical microbiology: preliminary

Nanopore-based DNA sequencing in clinical microbiology: preliminary Nanopore sequencing in clinical routine 1 1 2 Nanopore-based DNA sequencing in clinical microbiology: preliminary assessment of basic requirements 3 4 5 6 7 Håvard Harstad a, Rafi Ahmad b*, Anders Bredberg

More information

Supplementary Figure 1. Design of the control microarray. a, Genomic DNA from the

Supplementary Figure 1. Design of the control microarray. a, Genomic DNA from the Supplementary Information Supplementary Figures Supplementary Figure 1. Design of the control microarray. a, Genomic DNA from the strain M8 of S. ruber and a fosmid containing the S. ruber M8 virus M8CR4

More information

Targeted Sequencing in the NBS Laboratory

Targeted Sequencing in the NBS Laboratory Targeted Sequencing in the NBS Laboratory Christopher Greene, PhD Newborn Screening and Molecular Biology Branch Division of Laboratory Sciences Gene Sequencing in Public Health Newborn Screening February

More information

NextGen Sequencing and Target Enrichment

NextGen Sequencing and Target Enrichment NextGen Sequencing and Target Enrichment Laurent FARINELLI 7 September 2010 Agilent 3rd Analytic Forum Basel, Switzerland Outline The illumina HiSEQ 2000 system Applications Target enrichment Outlook 7

More information

Next Generation Sequencing Applications in Food Safety and Quality

Next Generation Sequencing Applications in Food Safety and Quality Next Generation Sequencing Applications in Food Safety and Quality Our science National and international centre of excellence for interdisciplinary investigation and problem solving across plant and bee

More information

De Novo Assembly of High-throughput Short Read Sequences

De Novo Assembly of High-throughput Short Read Sequences De Novo Assembly of High-throughput Short Read Sequences Chuming Chen Center for Bioinformatics and Computational Biology (CBCB) University of Delaware NECC Third Skate Genome Annotation Workshop May 23,

More information

Course summary. Today. PCR Polymerase chain reaction. Obtaining molecular data. Sequencing. DNA sequencing. Genome Projects.

Course summary. Today. PCR Polymerase chain reaction. Obtaining molecular data. Sequencing. DNA sequencing. Genome Projects. Goals Organization Labs Project Reading Course summary DNA sequencing. Genome Projects. Today New DNA sequencing technologies. Obtaining molecular data PCR Typically used in empirical molecular evolution

More information

Identification of Bacterial Pathogens and Antimicrobial Resistance Directly from Clinical Urines by Nanopore-Based Metagenomic Sequencing

Identification of Bacterial Pathogens and Antimicrobial Resistance Directly from Clinical Urines by Nanopore-Based Metagenomic Sequencing 1 2 Identification of Bacterial Pathogens and Antimicrobial Resistance Directly from Clinical Urines by Nanopore-Based Metagenomic Sequencing 3 4 5 6 K. Schmidt 1, S. Mwaigwisya 1, L. C. Crossman 2, M.

More information

Finishing of Fosmid 1042D14. Project 1042D14 is a roughly 40 kb segment of Drosophila ananassae

Finishing of Fosmid 1042D14. Project 1042D14 is a roughly 40 kb segment of Drosophila ananassae Schefkind 1 Adam Schefkind Bio 434W 03/08/2014 Finishing of Fosmid 1042D14 Abstract Project 1042D14 is a roughly 40 kb segment of Drosophila ananassae genomic DNA. Through a comprehensive analysis of forward-

More information

TREE CODE PRODUCT BROCHURE

TREE CODE PRODUCT BROCHURE TREE CODE PRODUCT BROCHURE Single Molecule, Real-Time (SMRT) Sequencing technology offers: Long read sequencing ~10 Gb with 20 kb average read lengths for WGS ~20 Gb with 40 kb average read length for

More information

Reference genomes and common file formats

Reference genomes and common file formats Reference genomes and common file formats Overview Reference genomes and GRC Fasta and FastQ (unaligned sequences) SAM/BAM (aligned sequences) Summarized genomic features BED (genomic intervals) GFF/GTF

More information

Corynebacterium pseudotuberculosis genome sequencing: Final Report

Corynebacterium pseudotuberculosis genome sequencing: Final Report Summary To provide an invaluable resource to assist in the development of diagnostics and vaccines against caseous lymphadenitis (CLA), the sequencing of the genome of a virulent, United Kingdom Corynebacterium

More information

Building a platinum human genome assembly from single haplotype human genomes. Karyn Meltz Steinberg PacBio UGM December,

Building a platinum human genome assembly from single haplotype human genomes. Karyn Meltz Steinberg PacBio UGM December, Building a platinum human genome assembly from single haplotype human genomes Karyn Meltz Steinberg PacBio UGM December, 2015 @KMS_Meltzy Single haplotype from hydatidiform mole Enucleated egg (no maternal

More information

Incorporating Molecular ID Technology. Accel-NGS 2S MID Indexing Kits

Incorporating Molecular ID Technology. Accel-NGS 2S MID Indexing Kits Incorporating Molecular ID Technology Accel-NGS 2S MID Indexing Kits Molecular Identifiers (MIDs) MIDs are indices used to label unique library molecules MIDs can assess duplicate molecules in sequencing

More information

Reference genomes and common file formats

Reference genomes and common file formats Reference genomes and common file formats Dóra Bihary MRC Cancer Unit, University of Cambridge CRUK Functional Genomics Workshop September 2017 Overview Reference genomes and GRC Fasta and FastQ (unaligned

More information

Next Generation Sequencing Lecture Saarbrücken, 19. March Sequencing Platforms

Next Generation Sequencing Lecture Saarbrücken, 19. March Sequencing Platforms Next Generation Sequencing Lecture Saarbrücken, 19. March 2012 Sequencing Platforms Contents Introduction Sequencing Workflow Platforms Roche 454 ABI SOLiD Illumina Genome Anlayzer / HiSeq Problems Quality

More information

1. A brief overview of sequencing biochemistry

1. A brief overview of sequencing biochemistry Supplementary reading materials on Genome sequencing (optional) The materials are from Mark Blaxter s lecture notes on Sequencing strategies and Primary Analysis 1. A brief overview of sequencing biochemistry

More information

Molecular susceptibility testing

Molecular susceptibility testing Molecular susceptibility testing Dr Andrew Ginn Supervising Scientist Antimicrobial Resistance Reference Laboratory ICPMR, Westmead Hospital Resistance genes Gram negatives Transmissible; e.g. ESBLs, MBLs,

More information

De novo Genome Assembly

De novo Genome Assembly De novo Genome Assembly A/Prof Torsten Seemann Winter School in Mathematical & Computational Biology - Brisbane, AU - 3 July 2017 Introduction The human genome has 47 pieces MT (or XY) The shortest piece

More information

The Journey of DNA Sequencing. Chromosomes. What is a genome? Genome size. H. Sunny Sun

The Journey of DNA Sequencing. Chromosomes. What is a genome? Genome size. H. Sunny Sun The Journey of DNA Sequencing H. Sunny Sun What is a genome? Genome is the total genetic complement of a living organism. The nuclear genome comprises approximately 3.2 * 10 9 nucleotides of DNA, divided

More information

Experimental Design Microbial Sequencing

Experimental Design Microbial Sequencing Experimental Design Microbial Sequencing Matthew L. Settles Genome Center Bioinformatics Core University of California, Davis settles@ucdavis.edu; bioinformatics.core@ucdavis.edu General rules for preparing

More information

DNA METHYLATION RESEARCH TOOLS

DNA METHYLATION RESEARCH TOOLS SeqCap Epi Enrichment System Revolutionize your epigenomic research DNA METHYLATION RESEARCH TOOLS Methylated DNA The SeqCap Epi System is a set of target enrichment tools for DNA methylation assessment

More information

High throughput omics and BIOINFORMATICS

High throughput omics and BIOINFORMATICS High throughput omics and BIOINFORMATICS Giuseppe D'Auria Seville, February 2009 Genomes from isolated bacteria $ $ $ $ $ $ $ $ $$ $ $ $ $ $ $ $ se q se uen q c se uen ing q c se uen ing qu c en ing c

More information

High-throughput genome scaffolding from in vivo DNA interaction frequency

High-throughput genome scaffolding from in vivo DNA interaction frequency correction notice Nat. Biotechnol. 31, 1143 1147 (213) High-throughput genome scaffolding from in vivo DNA interaction frequency Noam Kaplan & Job Dekker In the version of this supplementary file originally

More information

Contact us for more information and a quotation

Contact us for more information and a quotation GenePool Information Sheet #1 Installed Sequencing Technologies in the GenePool The GenePool offers sequencing service on three platforms: Sanger (dideoxy) sequencing on ABI 3730 instruments Illumina SOLEXA

More information

A Roadmap to the De-novo Assembly of the Banana Slug Genome

A Roadmap to the De-novo Assembly of the Banana Slug Genome A Roadmap to the De-novo Assembly of the Banana Slug Genome Stefan Prost 1 1 Department of Integrative Biology, University of California, Berkeley, United States of America April 6th-10th, 2015 Outline

More information

2 nd Genera-on ( NextGen ) Sequencing Technologies

2 nd Genera-on ( NextGen ) Sequencing Technologies 2 nd Genera-on ( NextGen ) Sequencing Technologies Jay Shendure Read Length is Not As Important For Resequencing % of Paired K-mers with Uniquely Assignable Location 100% 90% 80% 70% 60% 50% 40% 30% 20%

More information

Food Safety (Bio-)Informatics

Food Safety (Bio-)Informatics Food Safety (Bio-)Informatics Henk C. den Bakker Assistant Professor in Bioinformatics and Epidemiology Center for Food Safety University of Georgia hcd82599@uga.edu Overview Short introduction of Food

More information

Whole Genome, Exome, or Custom Targeted Sequencing: How do I choose? Aaron Thorner, PhD Clinical Genomics Group Leader

Whole Genome, Exome, or Custom Targeted Sequencing: How do I choose? Aaron Thorner, PhD Clinical Genomics Group Leader Whole Genome, Exome, or Custom Targeted Sequencing: How do I choose? Aaron Thorner, PhD Clinical Genomics Group Leader Center for Cancer Genome Discovery (CCGD) Dana-Farber Cancer Institute d Outline Center

More information

Comparative Bioinformatics. BSCI348S Fall 2003 Midterm 1

Comparative Bioinformatics. BSCI348S Fall 2003 Midterm 1 BSCI348S Fall 2003 Midterm 1 Multiple Choice: select the single best answer to the question or completion of the phrase. (5 points each) 1. The field of bioinformatics a. uses biomimetic algorithms to

More information