Streaming algorithms for real-time analysis of Oxford Nanopore sequencing data
|
|
- Joshua Booth
- 6 years ago
- Views:
Transcription
1 Streaming algorithms for real-time analysis of Oxford Nanopore sequencing data Minh Duc Cao Institute for Molecular Bioscience The University of Queensland London Calling 26 May 27, 26 Minh Duc Cao, The University of Queensland London Calling 26
2 Streaming algorithms Real-time analysis: Answer the biological questions quickly e.g., infection diagnosis Run sequencing only until the answers are obtained Decide complementary experiments Save time, save money Streaming algorithms: Process data input as a stream Continuously make inference and update the confident level Are robust to noise and scalable to massive data sets Minh Duc Cao, The University of Queensland London Calling 26
3 Real-time analysis workflow DNA extraction Library preparation Sequencing setup 2 hours 2.5 hours.5 hours Run simultaneously Scaffold Assemblies Fastq extraction Species typing Basecalling (Metrichor) Strain typing MinION sequencing Resistance profile Minh Duc Cao, The University of Queensland London Calling 26
4 Fastq extraction DNA extraction Library preparation Sequencing setup 2 hours 2.5 hours.5 hours Fastq extraction Basecalling (Metrichor) MinION sequencing Scaffold Species Strain Resistance Assemblies typing typing profile (Cao et al, 25): Bioinformatics, DOI:.93/bioinformatics/btv658 Minh Duc Cao, The University of Queensland London Calling 26
5 Scaffold and complete genome assemblies Stream of long reads BWA MEM Pre assemblies aligning repeats con nuing process Stream of alignment records pairing Stream of bridges connec ng Extending scaffolds output in real me (Cao et al, 26): biorxiv, DOI:./54783 Minh Duc Cao, The University of Queensland London Calling 26
6 MinION sequencing Sequence two K. pneumoniae strains with the MinION: Strain Phred Quality Scores Read Length Distribution Emp. errors BAA-246 Del: 9.5% (NDM- strain) Ins: 6.3% Mis: 5.3% Chemistry R7 Unaligned: Sep % 33-X Coverage 3883 Del: 7.9% (type strain) Ins: 6.% Mis: 2.9% Chemistry R7.3 Unaligned: Dec 24 % 8-X coverage Minh Duc Cao, The University of Queensland London Calling 26
7 Scaffolding and completing genome results Contigs K. pneumoniae BAA-246 (33X) K. pneumoniae 3883 (8X) Coverage (-fold) Coverage (-fold) 3.. S. cerevisae W33 (9X) Coverage (-fold).6.3 N5 (Mb) Minh Duc Cao, The University of Queensland London Calling 26
8 Scaffolding and completing genome results Contigs K. pneumoniae BAA-246 (33X) K. pneumoniae 3883 (8X) Coverage (-fold) Coverage (-fold) 3.. S. cerevisae W33 (9X) Coverage (-fold).6.3 N5 (Mb) Methods #Contig N5 (Mb) Errors /Kb CPUhrs #Contig N5 (Mb) Errors /Kb CPUhrs #Contig N5 (Mb) Misassemblies Misassemblies Misassemblies Errors /Kb CPUhrs SPAdes Hybrid SSPACE LINK npscarf NaS Nanocorr Minh Duc Cao, The University of Queensland London Calling 26
9 Pathogenicity island reconstruction SapF SapD SapC inta munim YqaJ rect ParA LexA CII ydau dnac IS26 aada sul ebr GCN5-like IS6 IS26 hin ltra umud umuc FRG Contigs Sequence data required to join 5Mb 5Mb 54Mb 54Mb 54Mb 54Mb 54Mb 65Mb 65Mb Minh Duc Cao, The University of Queensland London Calling 26
10 Pathogen identification (Cao et al, 25): biorxiv, DOI:./9356 Minh Duc Cao, The University of Queensland London Calling 26
11 Bacteria identification Proportion.5 Species typing K. pneumoniae ATCC BAA-246 2, 2 3 4,5, 5 K. pneumoniae ATCC , ,5, 5 Mixture 75% E.coli+25% S. aureus 2, ,5, 5 Yield (reads) Sequencing yield K. pneumoniae K. pneumoniae E. coli E. coli S. aureus Minh Duc Cao, The University of Queensland London Calling 26
12 Bacteria identification Proportion.5 Species typing K. pneumoniae ATCC BAA-246 2, 2 3 4,5, 5 K. pneumoniae ATCC , ,5, 5 Mixture 75% E.coli+25% S. aureus 2, ,5, 5 Yield (reads) Sequencing yield K. pneumoniae K. pneumoniae E. coli E. coli S. aureus Strain typing K. pneumoniae ATCC BAA-246 3, K. pneumoniae ATCC , Mixture 75% E.coli+25% S. aureus e) Mixture 75% E.coli+25% S. aureus 3, 3, Probability.5 2,,.5 2,,.5 2,,.5 2,, Yield (reads) K. pneumoniae ST K. pneumoniae ST3 E. coli ST73 E. coli ST526 S. aureus ST243 S. aureus ST2 S. aureus ST46 S. aureus ST29 Minh Duc Cao, The University of Queensland London Calling 26
13 Antibiotic resistance identification K. pneumoniae BAA-246 (NDM- strain): 27 genes Time genes Sens. Spec. Data (mins) (%) (%) (reads) 3 mpha 228 blashv stra blatem strb blactx blalen 263 sul2 blaoxa aac3 aac6 blacmy blacfe blalat blabil Time genes Sens. Spec. Data (mins) (%) (%) (reads) 9 QnrB 3844 aada oqxa teta oqxb dfra blaokp rmtc sul 322 sul fosa blandm Minh Duc Cao, The University of Queensland London Calling 26
14 A glimpse of R9 flowcells Sequence on R9 flowcell the mixture of E. coli ESBL (4%), E. faecium VRE-VanA (2%) S. aureus MRSA (2%) S. aureus VRSA (2%) Minh Duc Cao, The University of Queensland London Calling 26
15 A glimpse of R9 flowcells Time genes Data (mins) (reads) erma 23 ermg spc nora 5 Van 252 VanH VanS 2 dfra 2968 aac6 VanX tetu aada sul sul3 aadd 3 fusb 3964 mpha 4 VanY 5754 cat VanZ VanA Time genes Data (mins) (reads) 5 tetm 684 tets 2 aph msrc meca 5 catpc fosa blaoxa 24 blactx 538 cfr 48 blaz 242 vgaa 78 vgaalc 2684 ermc aac3 blacmy blalat blabil 2 dfra Minh Duc Cao, The University of Queensland London Calling 26
16 Summary and outlook Scaffold and complete bacterial assemblies with < 3-fold coverage Identify pathogen species and strain with reads (<.5 hours sequencing) Detect antibiotic resistance profile in a few hours of sequencing We expect the times to be significantly shortened: Higher throughput with upcoming models: MinION MkII, PromethION. Quicker library preparation Software available Minh Duc Cao, The University of Queensland London Calling 26
17 Acknowledgements Lachlan Coin Matthew Cooper Devika Ganesamoorthy Son Nguyen Alysha Elliott Tania Duarte Vivian Zhang Nick Hamilton Derek Benson Dominique Gorse Michael Thang Minh Duc Cao, The University of Queensland London Calling 26
Practical quality control for whole genome sequencing in clinical microbiology
Practical quality control for whole genome sequencing in clinical microbiology John WA Rossen, PhD, MMM Department of Medical Microbiology, University of Groningen, UMCG, Groningen, The Netherlands Disclosure
More informationScaffolding and Completing Genome Assemblies in Real-time with Nanopore Sequencing
biorxiv preprint first posted online May. 22, 16; doi: http://dx.doi.org/.11/04783. The copyright holder for this preprint Scaffolding and Completing Genome Assemblies in Real-time with Nanopore Sequencing
More informationGanwu Li Assistant Professor Veterinary Diagnostic Lab at Iowa State University
Applications of NGS in Clinical Microbiology Ganwu Li Assistant Professor Veterinary Diagnostic Lab at Iowa State University Detect and Identify Pathogens Total reads FastQC and host reads remove Clean
More informationUnderstanding The rising tide of resistance.
Understanding The rising tide of resistance Tolemanma@cardiff.ac.uk Talk Overview Antibiotic resistance and bacterial fitness Decay in bacteria CTX-M genes and parallels to NDM-1 Some reasons for the success
More informationGALAXY TRAKR FOR STATE PUBLIC HEALTH BIOINFORMATICS INTRODUCTORY TRAININGS, DATA ANALYTICS, & BIOINFORMATICS COLLABORATIONS
GALAXY TRAKR FOR STATE PUBLIC HEALTH BIOINFORMATICS INTRODUCTORY TRAININGS, DATA ANALYTICS, & BIOINFORMATICS COLLABORATIONS Kevin G. Libuit, M.S. Senior Informatics Scientist Division of Consolidated Laboratory
More informationMinION, GridION, how does Nanopore technology meet the needs of our users?
MinION, GridION, how does Nanopore technology meet the needs of our users? Journée Long Reads GeT 28 Novembre 2017 Catherine Zanchetta & Maxime Manno get@genotoul.fr @get_genotoul 1 Wet Lab Nanopore technology
More informationMate-pair library data improves genome assembly
De Novo Sequencing on the Ion Torrent PGM APPLICATION NOTE Mate-pair library data improves genome assembly Highly accurate PGM data allows for de Novo Sequencing and Assembly For a draft assembly, generate
More informationNext- gen sequencing. STAMPS 2015 Hilary G. Morrison Joe Vineis, Nora Downey, Be>e Hecox- Lea, Kim Finnegan
Next- gen sequencing STAMPS 2015 Hilary G. Morrison Joe Vineis, Nora Downey, Be>e Hecox- Lea, Kim Finnegan QuesIons What is the difference between standard and next- gen sequencing? How is next- gen sequencing
More informationAUDREY FARBOS JEREMIE POSCHMANN PAUL O NEILL KONRAD PASZKIEWICZ KAREN MOORE
We provide: AUDREY FARBOS JEREMIE POSCHMANN PAUL O NEILL KONRAD PASZKIEWICZ KAREN MOORE State of the art genomics and bioinformatics analysis Training in experimental techniques, analysis and modelling
More informationData Basics. Josef K Vogt Slides by: Simon Rasmussen Next Generation Sequencing Analysis
Data Basics Josef K Vogt Slides by: Simon Rasmussen 2017 Generalized NGS analysis Sample prep & Sequencing Data size Main data reductive steps SNPs, genes, regions Application Assembly: Compare Raw Pre-
More informationDe Novo and Hybrid Assembly
On the PacBio RS Introduction The PacBio RS utilizes SMRT technology to generate both Continuous Long Read ( CLR ) and Circular Consensus Read ( CCS ) data. In this document, we describe sequencing the
More informationReal world applications of whole genome sequencing. Henrik Hasman Bacteria, parasites and fungi Statens Serum Institut, Denmark
Real world applications of whole genome sequencing Henrik Hasman Bacteria, parasites and fungi Statens Serum Institut, Denmark REAL WORLD APPLICATIONS or application of WGS for outbreak detection, national
More informationWhole Genome Sequencing for food safety FSA Chief Scientific Advisor Report and 2013 Listeria pilot study
Whole Genome Sequencing for food safety FSA Chief Scientific Advisor Report and 2013 Listeria pilot study Dr Edward Hayes Date: July 2016, Version 1 Foodborne Pathogens 280,000 cases of Campylobacter,
More informationHybrid Error Correction and De Novo Assembly with Oxford Nanopore
Hybrid Error Correction and De Novo Assembly with Oxford Nanopore Michael Schatz Jan 13, 2015 PAG Bioinformatics @mike_schatz / #PAGXXIII Oxford Nanopore MinION Thumb drive sized sequencer powered over
More informationOutline General NGS background and terms 11/14/2016 CONFLICT OF INTEREST. HLA region targeted enrichment. NGS library preparation methodologies
Eric T. Weimer, PhD, D(ABMLI) Assistant Professor, Pathology & Laboratory Medicine, UNC School of Medicine Director, Molecular Immunology Associate Director, Clinical Flow Cytometry, HLA, and Immunology
More informationRNA sequencing with the MinION at Genoscope
RNA sequencing with the MinION at Genoscope Jean-Marc Aury jmaury@genoscope.cns.fr @J_M_Aury December 13, 2017 RNA workshop, Genoscope Overview Genoscope Overview MinION sequencing at Genoscope RNA-Seq
More informationHigh Throughput Sequencing Technologies. UCD Genome Center Bioinformatics Core Monday 15 June 2015
High Throughput Sequencing Technologies UCD Genome Center Bioinformatics Core Monday 15 June 2015 Sequencing Explosion www.genome.gov/sequencingcosts http://t.co/ka5cvghdqo Sequencing Explosion 2011 PacBio
More informationIntroductie en Toepassingen van Next-Generation Sequencing in de Klinische Virologie. Sander van Boheemen Medical Microbiology
Introductie en Toepassingen van Next-Generation Sequencing in de Klinische Virologie Sander van Boheemen Medical Microbiology Next-generation sequencing Next-generation sequencing (NGS), also known as
More informationHuman Genome Sequencing Over the Decades The capacity to sequence all 3.2 billion bases of the human genome (at 30X coverage) has increased
Human Genome Sequencing Over the Decades The capacity to sequence all 3.2 billion bases of the human genome (at 30X coverage) has increased exponentially since the 1990s. In 2005, with the introduction
More informationCurrent'Advances'in'Sequencing' Technology' James'Gurtowski' Schatz'Lab'
Current'Advances'in'Sequencing' Technology' James'Gurtowski' Schatz'Lab' Outline' 1. Assembly'Review' 2. Pacbio' Technology'Overview' Data'CharacterisFcs' Algorithms' Results' 'Assemblies' 3. Oxford'Nanopore'
More informationTen Minute, Reagent-Free identification of Bacteria Containing Resistance Genes Using a Rapid Intrinsic Fluorescence Method
548 Ten Minute, Reagent-Free identification of Bacteria Containing Resistance Genes Using a Rapid Intrinsic Fluorescence Method R. Rozen-Sadowsky 1, A. Shinderman 1, D. Gohman 1, D. Shimonov 1, Y. Gluckman
More informationNext Gen Sequencing. Expansion of sequencing technology. Contents
Next Gen Sequencing Contents 1 Expansion of sequencing technology 2 The Next Generation of Sequencing: High-Throughput Technologies 3 High Throughput Sequencing Applied to Genome Sequencing (TEDed CC BY-NC-ND
More informationNext Generation Sequences & Chloroplast Assembly. 8 June, 2012 Jongsun Park
Next Generation Sequences & Chloroplast Assembly 8 June, 2012 Jongsun Park Table of Contents 1 History of Sequencing Technologies 2 Genome Assembly Processes With NGS Sequences 3 How to Assembly Chloroplast
More informationMatthew Tinning Australian Genome Research Facility. July 2012
Next-Generation Sequencing: an overview of technologies and applications Matthew Tinning Australian Genome Research Facility July 2012 History of Sequencing Where have we been? 1869 Discovery of DNA 1909
More informationIntroduction to NGS Analysis Tools
National Center for Emerging and Zoonotic Infectious Diseases Introduction to NGS Analysis Tools Heather Carleton, PhD, MPH Team Lead, Enteric Diseases Bioinformatics, Enteric Diseases Laboratory Branch,
More informationNext Generation Sequencing. Jeroen Van Houdt - Leuven 13/10/2017
Next Generation Sequencing Jeroen Van Houdt - Leuven 13/10/2017 Landmarks in DNA sequencing 1953 Discovery of DNA double helix structure 1977 A Maxam and W Gilbert "DNA seq by chemical degradation" F Sanger"DNA
More informationThe Genome Analysis Centre. Building Excellence in Genomics and Computa5onal Bioscience
Building Excellence in Genomics and Computa5onal Bioscience Resequencing approaches Sarah Ayling Crop Genomics and Diversity sarah.ayling@tgac.ac.uk Why re- sequence plants? To iden
More informationNext-generation sequencing Technology Overview
Next-generation sequencing Technology Overview UQ Winter School 2018 Christopher Noune, PhD AGRF Melbourne christopher.noune@agrf.org.au What is NGS? Ion Torrent PGM (Thermo-Fisher) MiSeq (Illumina) High-Throughput
More informationChallenges and opportunities for whole genome sequencing based surveillance of antibiotic resistance
Challenges and opportunities for whole genome sequencing based surveillance of antibiotic resistance Prof. Willem van Schaik Professor in Microbiology and Infection Institute of Microbiology and Infection
More informationUsing New ThiNGS on Small Things. Shane Byrne
Using New ThiNGS on Small Things Shane Byrne Next Generation Sequencing New Things Small Things NGS Next Generation Sequencing = 2 nd generation of sequencing 454 GS FLX, SOLiD, GAIIx, HiSeq, MiSeq, Ion
More informationNEXT GENERATION SEQUENCING. Farhat Habib
NEXT GENERATION SEQUENCING HISTORY HISTORY Sanger Dominant for last ~30 years 1000bp longest read Based on primers so not good for repetitive or SNPs sites HISTORY Sanger Dominant for last ~30 years 1000bp
More informationLecture 7. Next-generation sequencing technologies
Lecture 7 Next-generation sequencing technologies Next-generation sequencing technologies General principles of short-read NGS Construct a library of fragments Generate clonal template populations Massively
More informationWhole Genome Sequence Data Quality Control and Validation
Whole Genome Sequence Data Quality Control and Validation GoSeqIt ApS / Ved Klædebo 9 / 2970 Hørsholm VAT No. DK37842524 / Phone +45 26 97 90 82 / Web: www.goseqit.com / mail: mail@goseqit.com Table of
More informationTruSPAdes: analysis of variations using TruSeq Synthetic Long Reads (TSLR)
tru TruSPAdes: analysis of variations using TruSeq Synthetic Long Reads (TSLR) Anton Bankevich Center for Algorithmic Biotechnology, SPbSU Sequencing costs 1. Sequencing costs do not follow Moore s law
More informationNature Biotechnology: doi: /nbt Supplementary Figure 1. Read Complexity
Supplementary Figure 1 Read Complexity A) Density plot showing the percentage of read length masked by the dust program, which identifies low-complexity sequence (simple repeats). Scrappie outputs a significantly
More informationGet to Know Your DNA. Every Single Fragment.
HaloPlex HS NGS Target Enrichment System Get to Know Your DNA. Every Single Fragment. High sensitivity detection of rare variants using molecular barcodes How Does Molecular Barcoding Work? HaloPlex HS
More informationSequencing techniques
Sequencing techniques Workshop on Whole Genome Sequencing and Analysis, 2-4 Oct. 2017 Learning objective: After this lecture, you should be able to account for different techniques for whole genome sequencing
More informationIllumina (Solexa) Throughput: 4 Tbp in one run (5 days) Cheapest sequencing technology. Mismatch errors dominate. Cost: ~$1000 per human genme
Illumina (Solexa) Current market leader Based on sequencing by synthesis Current read length 100-150bp Paired-end easy, longer matepairs harder Error ~0.1% Mismatch errors dominate Throughput: 4 Tbp in
More informationTerabases of long-read sequence data, analysed in real time. Available now
Terabases of long-read sequence data, analysed in real time Available now The PromethION is a real game changer. Combining ultra-long reads with high sequence output for the production of contiguous, highquality
More informationNext Generation Sequencing. Tobias Österlund
Next Generation Sequencing Tobias Österlund tobiaso@chalmers.se NGS part of the course Week 4 Friday 13/2 15.15-17.00 NGS lecture 1: Introduction to NGS, alignment, assembly Week 6 Thursday 26/2 08.00-09.45
More informationSequencing Theory. Brett E. Pickett, Ph.D. J. Craig Venter Institute
Sequencing Theory Brett E. Pickett, Ph.D. J. Craig Venter Institute Applications of Genomics and Bioinformatics to Infectious Diseases GABRIEL Network Agenda Sequencing Instruments Sanger Illumina Ion
More informationDe novo whole genome assembly
De novo whole genome assembly Qi Sun Bioinformatics Facility Cornell University Sequencing platforms Short reads: o Illumina (150 bp, up to 300 bp) Long reads (>10kb): o PacBio SMRT; o Oxford Nanopore
More informationIntroduction to metagenome assembly. Bas E. Dutilh Metagenomic Methods for Microbial Ecologists, NIOO September 18 th 2014
Introduction to metagenome assembly Bas E. Dutilh Metagenomic Methods for Microbial Ecologists, NIOO September 18 th 2014 Sequencing specs* Method Read length Accuracy Million reads Time Cost per M 454
More informationWhole-genome sequencing (WGS) of microbes employing nextgeneration sequencing (NGS) technologies enables pathogen
Application Note Microbial whole-genome sequencing A novel, single-tube enzymatic fragmentation and library construction method enables fast turnaround times and improved data quality for microbial whole-genome
More informationSUPPLEMENTARY INFORMATION
SUPPLEMENTARY INFORMATION Supplementary Note 1. Stress-testing the encoder/decoder. We summarize our experience decoding two files that were sequenced with nanopore technologyerror! Reference source not
More informationDetection and characterization of extended spectrum β-lactamase producing Escherichia coli from poultry of eastern India
Detection and characterization of extended spectrum β-lactamase producing Escherichia coli from poultry of eastern India Dr. Samiran Bandyopadhyay Scientist Indian Veterinary Research Institute Eastern
More informationHigh Throughput Sequencing Technologies. J Fass UCD Genome Center Bioinformatics Core Monday September 15, 2014
High Throughput Sequencing Technologies J Fass UCD Genome Center Bioinformatics Core Monday September 15, 2014 Sequencing Explosion www.genome.gov/sequencingcosts http://t.co/ka5cvghdqo Sequencing Explosion
More informationHigh Throughput Sequencing Technologies. J Fass UCD Genome Center Bioinformatics Core Tuesday December 16, 2014
High Throughput Sequencing Technologies J Fass UCD Genome Center Bioinformatics Core Tuesday December 16, 2014 Sequencing Explosion www.genome.gov/sequencingcosts http://t.co/ka5cvghdqo Sequencing Explosion
More informationMetagenomic 3C, full length 16S amplicon sequencing on Illumina, and the diabetic skin microbiome
Also: Sunaina Melissa Gardiner UTS Catherine Burke UTS Michael Liu UTS Chris Beitel UTS, UC Davis Matt DeMaere UTS Metagenomic 3C, full length 16S amplicon sequencing on Illumina, and the diabetic skin
More informationBioinformatic analysis of Illumina sequencing data for comparative genomics Part I
Bioinformatic analysis of Illumina sequencing data for comparative genomics Part I Dr David Studholme. 18 th February 2014. BIO1033 theme lecture. 1 28 February 2014 @davidjstudholme 28 February 2014 @davidjstudholme
More informationQuality assurance in NGS (diagnostics)
Quality assurance in NGS (diagnostics) Chris Mattocks National Genetics Reference Laboratory (Wessex) Research Diagnostics Quality assurance Any systematic process of checking to see whether a product
More informationNanopore sequencing How it works
Product 1 Nanopore sequencing How it works The nanopore processes the length of DNA or RNA presented to it. The user can control fragment length through the library preparation protocol utilised. (e.g.
More informationNext-Generation Sequencing. Technologies
Next-Generation Next-Generation Sequencing Technologies Sequencing Technologies Nicholas E. Navin, Ph.D. MD Anderson Cancer Center Dept. Genetics Dept. Bioinformatics Introduction to Bioinformatics GS011062
More informationCBC Data Therapy. Metatranscriptomics Discussion
CBC Data Therapy Metatranscriptomics Discussion Metatranscriptomics Extract RNA, subtract rrna Sequence cdna QC Gene expression, function Institute for Systems Genomics: Computational Biology Core bioinformatics.uconn.edu
More informationA Guide to Consed Michelle Itano, Carolyn Cain, Tien Chusak, Justin Richner, and SCR Elgin.
1 A Guide to Consed Michelle Itano, Carolyn Cain, Tien Chusak, Justin Richner, and SCR Elgin. Main Window Figure 1. The Main Window is the starting point when Consed is opened. From here, you can access
More informationNanopore long read sequencing for detection of point mutations and structural variants
Nanopore long read sequencing for detection of point mutations and structural variants Viapath Genetics, Guy s Hospital, London Presented by: Kezia Brown Nanopore for long read sequencing Why do we need
More informationWhole Human Genome Sequencing Report This is a technical summary report for PG DNA
Whole Human Genome Sequencing Report This is a technical summary report for PG0002601-DNA Physician and Patient Information Physician name: Vinodh Naraynan Address: Suite 406 222 West Thomas Road Phoenix
More informationHLA-Typing Strategies
HLA-Typing Strategies Cologne, 13.5.2017 Joannis Mytilineos MD, PhD Department of Transplantation Immunology Institute for Clinical Transfusion Medicine and Immunogenetics German Red Cross Blood Transfusion
More informationOur goal is to enable the analysis of any living thing, by any person, in any environment.
Technology 1 2 Our goal is to enable the analysis of any living thing, by any person, in any environment. Nanopore DNA and direct RNA sequencing has been performed on board the International Space Station.
More informationBIOINFORMATICS ORIGINAL PAPER
BIOINFORMATICS ORIGINAL PAPER Vol. 27 no. 21 2011, pages 2957 2963 doi:10.1093/bioinformatics/btr507 Genome analysis Advance Access publication September 7, 2011 : fast length adjustment of short reads
More informationIntroduction to Bioinformatics
Introduction to Bioinformatics Alla L Lapidus, Ph.D. SPbSU St. Petersburg Term Bioinformatics Term Bioinformatics was invented by Paulien Hogeweg (Полина Хогевег) and Ben Hesper in 1970 as "the study of
More informationOxford Nanopore Sequencing and de novo Assembly of a Eukaryotic Genome Supplemental Notes and Figures
Oxford Nanopore Sequencing and de novo Assembly of a Eukaryotic Genome Supplemental Notes and Figures Table of Contents Supplemental Note 1. Flowcell performance... 2 Supplemental Note 2. Nanopore sequencing
More informationSUPPLEMENTARY MATERIAL FOR THE PAPER: RASCAF: IMPROVING GENOME ASSEMBLY WITH RNA-SEQ DATA
SUPPLEMENTARY MATERIAL FOR THE PAPER: RASCAF: IMPROVING GENOME ASSEMBLY WITH RNA-SEQ DATA Authors: Li Song, Dhruv S. Shankar, Liliana Florea Table of contents: Figure S1. Methods finding contig connections
More informationGenetics Lecture 21 Recombinant DNA
Genetics Lecture 21 Recombinant DNA Recombinant DNA In 1971, a paper published by Kathleen Danna and Daniel Nathans marked the beginning of the recombinant DNA era. The paper described the isolation of
More informationFaction 2: Genome Assembly Lab and Preliminary Data
Faction 2: Genome Assembly Lab and Preliminary Data [Computational Genomics 2017] Christian Colon, Erisa Sula, David Lu, Tian Jin, Lijiang Long, Rohini Mopuri, Bowen Yang, Saminda Wijeratne, Harrison Kim
More informationSUMMARY. Key words: antibioticresistance, Enterobacteriaceae, ESBL, CTX-M,
SUMMARY Key words: antibioticresistance, Enterobacteriaceae, ESBL, CTX-M, The doctoral thesis entitled Prevalence of Enterobacteriaceae producing extendedspectrum beta-lactamases (ESBL) isolated from broilers
More informationUnderstanding the science and technology of whole genome sequencing
Understanding the science and technology of whole genome sequencing Dag Undlien Department of Medical Genetics Oslo University Hospital University of Oslo and The Norwegian Sequencing Centre d.e.undlien@medisin.uio.no
More informationAaron Liston, Oregon State University Botany 2012 Intro to Next Generation Sequencing Workshop
Output (bp) Aaron Liston, Oregon State University Growth in Next-Gen Sequencing Capacity 3.5E+11 2002 2004 2006 2008 2010 3.0E+11 2.5E+11 2.0E+11 1.5E+11 1.0E+11 Adapted from Mardis, 2011, Nature 5.0E+10
More informationSequencing technologies. Jose Blanca COMAV institute bioinf.comav.upv.es
Sequencing technologies Jose Blanca COMAV institute bioinf.comav.upv.es Outline Sequencing technologies: Sanger 2nd generation sequencing: 3er generation sequencing: 454 Illumina SOLiD Ion Torrent PacBio
More informationWorkflow of de novo assembly
Workflow of de novo assembly Experimental Design Clean sequencing data (trim adapter and low quality sequences) Run assembly software for contiging and scaffolding Evaluation of assembly Several iterations:
More informationNanopore-based DNA sequencing in clinical microbiology: preliminary
Nanopore sequencing in clinical routine 1 1 2 Nanopore-based DNA sequencing in clinical microbiology: preliminary assessment of basic requirements 3 4 5 6 7 Håvard Harstad a, Rafi Ahmad b*, Anders Bredberg
More informationSupplementary Figure 1. Design of the control microarray. a, Genomic DNA from the
Supplementary Information Supplementary Figures Supplementary Figure 1. Design of the control microarray. a, Genomic DNA from the strain M8 of S. ruber and a fosmid containing the S. ruber M8 virus M8CR4
More informationTargeted Sequencing in the NBS Laboratory
Targeted Sequencing in the NBS Laboratory Christopher Greene, PhD Newborn Screening and Molecular Biology Branch Division of Laboratory Sciences Gene Sequencing in Public Health Newborn Screening February
More informationNextGen Sequencing and Target Enrichment
NextGen Sequencing and Target Enrichment Laurent FARINELLI 7 September 2010 Agilent 3rd Analytic Forum Basel, Switzerland Outline The illumina HiSEQ 2000 system Applications Target enrichment Outlook 7
More informationNext Generation Sequencing Applications in Food Safety and Quality
Next Generation Sequencing Applications in Food Safety and Quality Our science National and international centre of excellence for interdisciplinary investigation and problem solving across plant and bee
More informationDe Novo Assembly of High-throughput Short Read Sequences
De Novo Assembly of High-throughput Short Read Sequences Chuming Chen Center for Bioinformatics and Computational Biology (CBCB) University of Delaware NECC Third Skate Genome Annotation Workshop May 23,
More informationCourse summary. Today. PCR Polymerase chain reaction. Obtaining molecular data. Sequencing. DNA sequencing. Genome Projects.
Goals Organization Labs Project Reading Course summary DNA sequencing. Genome Projects. Today New DNA sequencing technologies. Obtaining molecular data PCR Typically used in empirical molecular evolution
More informationIdentification of Bacterial Pathogens and Antimicrobial Resistance Directly from Clinical Urines by Nanopore-Based Metagenomic Sequencing
1 2 Identification of Bacterial Pathogens and Antimicrobial Resistance Directly from Clinical Urines by Nanopore-Based Metagenomic Sequencing 3 4 5 6 K. Schmidt 1, S. Mwaigwisya 1, L. C. Crossman 2, M.
More informationFinishing of Fosmid 1042D14. Project 1042D14 is a roughly 40 kb segment of Drosophila ananassae
Schefkind 1 Adam Schefkind Bio 434W 03/08/2014 Finishing of Fosmid 1042D14 Abstract Project 1042D14 is a roughly 40 kb segment of Drosophila ananassae genomic DNA. Through a comprehensive analysis of forward-
More informationTREE CODE PRODUCT BROCHURE
TREE CODE PRODUCT BROCHURE Single Molecule, Real-Time (SMRT) Sequencing technology offers: Long read sequencing ~10 Gb with 20 kb average read lengths for WGS ~20 Gb with 40 kb average read length for
More informationReference genomes and common file formats
Reference genomes and common file formats Overview Reference genomes and GRC Fasta and FastQ (unaligned sequences) SAM/BAM (aligned sequences) Summarized genomic features BED (genomic intervals) GFF/GTF
More informationCorynebacterium pseudotuberculosis genome sequencing: Final Report
Summary To provide an invaluable resource to assist in the development of diagnostics and vaccines against caseous lymphadenitis (CLA), the sequencing of the genome of a virulent, United Kingdom Corynebacterium
More informationBuilding a platinum human genome assembly from single haplotype human genomes. Karyn Meltz Steinberg PacBio UGM December,
Building a platinum human genome assembly from single haplotype human genomes Karyn Meltz Steinberg PacBio UGM December, 2015 @KMS_Meltzy Single haplotype from hydatidiform mole Enucleated egg (no maternal
More informationIncorporating Molecular ID Technology. Accel-NGS 2S MID Indexing Kits
Incorporating Molecular ID Technology Accel-NGS 2S MID Indexing Kits Molecular Identifiers (MIDs) MIDs are indices used to label unique library molecules MIDs can assess duplicate molecules in sequencing
More informationReference genomes and common file formats
Reference genomes and common file formats Dóra Bihary MRC Cancer Unit, University of Cambridge CRUK Functional Genomics Workshop September 2017 Overview Reference genomes and GRC Fasta and FastQ (unaligned
More informationNext Generation Sequencing Lecture Saarbrücken, 19. March Sequencing Platforms
Next Generation Sequencing Lecture Saarbrücken, 19. March 2012 Sequencing Platforms Contents Introduction Sequencing Workflow Platforms Roche 454 ABI SOLiD Illumina Genome Anlayzer / HiSeq Problems Quality
More information1. A brief overview of sequencing biochemistry
Supplementary reading materials on Genome sequencing (optional) The materials are from Mark Blaxter s lecture notes on Sequencing strategies and Primary Analysis 1. A brief overview of sequencing biochemistry
More informationMolecular susceptibility testing
Molecular susceptibility testing Dr Andrew Ginn Supervising Scientist Antimicrobial Resistance Reference Laboratory ICPMR, Westmead Hospital Resistance genes Gram negatives Transmissible; e.g. ESBLs, MBLs,
More informationDe novo Genome Assembly
De novo Genome Assembly A/Prof Torsten Seemann Winter School in Mathematical & Computational Biology - Brisbane, AU - 3 July 2017 Introduction The human genome has 47 pieces MT (or XY) The shortest piece
More informationThe Journey of DNA Sequencing. Chromosomes. What is a genome? Genome size. H. Sunny Sun
The Journey of DNA Sequencing H. Sunny Sun What is a genome? Genome is the total genetic complement of a living organism. The nuclear genome comprises approximately 3.2 * 10 9 nucleotides of DNA, divided
More informationExperimental Design Microbial Sequencing
Experimental Design Microbial Sequencing Matthew L. Settles Genome Center Bioinformatics Core University of California, Davis settles@ucdavis.edu; bioinformatics.core@ucdavis.edu General rules for preparing
More informationDNA METHYLATION RESEARCH TOOLS
SeqCap Epi Enrichment System Revolutionize your epigenomic research DNA METHYLATION RESEARCH TOOLS Methylated DNA The SeqCap Epi System is a set of target enrichment tools for DNA methylation assessment
More informationHigh throughput omics and BIOINFORMATICS
High throughput omics and BIOINFORMATICS Giuseppe D'Auria Seville, February 2009 Genomes from isolated bacteria $ $ $ $ $ $ $ $ $$ $ $ $ $ $ $ $ se q se uen q c se uen ing q c se uen ing qu c en ing c
More informationHigh-throughput genome scaffolding from in vivo DNA interaction frequency
correction notice Nat. Biotechnol. 31, 1143 1147 (213) High-throughput genome scaffolding from in vivo DNA interaction frequency Noam Kaplan & Job Dekker In the version of this supplementary file originally
More informationContact us for more information and a quotation
GenePool Information Sheet #1 Installed Sequencing Technologies in the GenePool The GenePool offers sequencing service on three platforms: Sanger (dideoxy) sequencing on ABI 3730 instruments Illumina SOLEXA
More informationA Roadmap to the De-novo Assembly of the Banana Slug Genome
A Roadmap to the De-novo Assembly of the Banana Slug Genome Stefan Prost 1 1 Department of Integrative Biology, University of California, Berkeley, United States of America April 6th-10th, 2015 Outline
More information2 nd Genera-on ( NextGen ) Sequencing Technologies
2 nd Genera-on ( NextGen ) Sequencing Technologies Jay Shendure Read Length is Not As Important For Resequencing % of Paired K-mers with Uniquely Assignable Location 100% 90% 80% 70% 60% 50% 40% 30% 20%
More informationFood Safety (Bio-)Informatics
Food Safety (Bio-)Informatics Henk C. den Bakker Assistant Professor in Bioinformatics and Epidemiology Center for Food Safety University of Georgia hcd82599@uga.edu Overview Short introduction of Food
More informationWhole Genome, Exome, or Custom Targeted Sequencing: How do I choose? Aaron Thorner, PhD Clinical Genomics Group Leader
Whole Genome, Exome, or Custom Targeted Sequencing: How do I choose? Aaron Thorner, PhD Clinical Genomics Group Leader Center for Cancer Genome Discovery (CCGD) Dana-Farber Cancer Institute d Outline Center
More informationComparative Bioinformatics. BSCI348S Fall 2003 Midterm 1
BSCI348S Fall 2003 Midterm 1 Multiple Choice: select the single best answer to the question or completion of the phrase. (5 points each) 1. The field of bioinformatics a. uses biomimetic algorithms to
More information