Next Gen Sequencing. Expansion of sequencing technology. Contents

Similar documents
DNA Sequencing by Ion Torrent. Marc Lavergne CHEM 4590

Introduction to Next Generation Sequencing (NGS)

Lecture 7. Next-generation sequencing technologies

Data Basics. Josef K Vogt Slides by: Simon Rasmussen Next Generation Sequencing Analysis

Wheat CAP Gene Expression with RNA-Seq

Welcome to the NGS webinar series

DNA-Sequencing. Technologies & Devices. Matthias Platzer. Genome Analysis Leibniz Institute on Aging - Fritz Lipmann Institute (FLI)

DNA-Sequencing. Technologies & Devices. Matthias Platzer. Genome Analysis Leibniz Institute on Aging - Fritz Lipmann Institute (FLI)

Introductie en Toepassingen van Next-Generation Sequencing in de Klinische Virologie. Sander van Boheemen Medical Microbiology

High Throughput Sequencing Technologies. UCD Genome Center Bioinformatics Core Monday 15 June 2015

Next-generation sequencing technologies

High Throughput Sequencing Technologies. J Fass UCD Genome Center Bioinformatics Core Monday June 16, 2014

Functional Genomics Research Stream. Research Meetings: November 2 & 3, 2009 Next Generation Sequencing

Sequencing techniques

Matthew Tinning Australian Genome Research Facility. July 2012

Overview of Next Generation Sequencing technologies. Céline Keime

High Throughput Sequencing Technologies. J Fass UCD Genome Center Bioinformatics Core Monday September 15, 2014

A Crash Course in NGS for GI Pathologists. Sandra O Toole

Aaron Liston, Oregon State University Botany 2012 Intro to Next Generation Sequencing Workshop

Third Generation Sequencing

Using New ThiNGS on Small Things. Shane Byrne

Next-generation sequencing Technology Overview

Ultrasequencing: Methods and Applications of the New Generation Sequencing Platforms

Next Generation Sequencing. Tobias Österlund

High throughput DNA Sequencing. An Equal Opportunity University!

Outline General NGS background and terms 11/14/2016 CONFLICT OF INTEREST. HLA region targeted enrichment. NGS library preparation methodologies

Next Generation Sequencing. Simon Rasmussen Assistant Professor Center for Biological Sequence analysis Technical University of Denmark

Genome 373: High- Throughput DNA Sequencing. Doug Fowler

DNA Sequencing and Assembly

Sequencing technologies. Jose Blanca COMAV institute bioinf.comav.upv.es

Ultrasequencing: methods and applications of the new generation sequencing platforms

Next Generation Sequencing. Jeroen Van Houdt - Leuven 13/10/2017

NextGen Sequencing Technologies Sequencing overview

NEXT GENERATION SEQUENCING. Farhat Habib

Human genome sequence

1. Introduction Gene regulation Genomics and genome analyses

The Journey of DNA Sequencing. Chromosomes. What is a genome? Genome size. H. Sunny Sun

you can see that if if you look into the you know the capability kilobases per day, per machine kind of calculation if you do.

Research school methods seminar Genomics and Transcriptomics

Introduction to the MiSeq

Gene Expression Technology

Chapter 7. DNA Microarrays

Mate-pair library data improves genome assembly

Deep Sequencing technologies

Sequencing technologies. Jose Blanca COMAV institute bioinf.comav.upv.es

Introduction to Bioinformatics and Gene Expression Technologies

Introduction to Bioinformatics and Gene Expression Technologies

Human Genome Sequencing Over the Decades The capacity to sequence all 3.2 billion bases of the human genome (at 30X coverage) has increased

Sequencing technologies. Jose Blanca COMAV institute bioinf.comav.upv.es

Reading Lecture 8: Lecture 9: Lecture 8. DNA Libraries. Definition Types Construction

High Throughput Sequencing Technologies. J Fass UCD Genome Center Bioinformatics Core Tuesday December 16, 2014

Basics of RNA-Seq. (With a Focus on Application to Single Cell RNA-Seq) Michael Kelly, PhD Team Lead, NCI Single Cell Analysis Facility

Transcriptome analysis

Next Generation Sequencing (NGS)

DNA-Sequencing. Technologies & Devices

Contact us for more information and a quotation

Novel methods for RNA and DNA- Seq analysis using SMART Technology. Andrew Farmer, D. Phil. Vice President, R&D Clontech Laboratories, Inc.

BIOINFORMATICS 1 SEQUENCING TECHNOLOGY. DNA story. DNA story. Sequencing: infancy. Sequencing: beginnings 26/10/16. bioinformatic challenges

Next Generation Sequencing Lecture Saarbrücken, 19. March Sequencing Platforms

High Throughput Sequencing the Multi-Tool of Life Sciences. Lutz Froenicke DNA Technologies and Expression Analysis Cores UCD Genome Center

Next-Generation Sequencing. Technologies

Experimental Design. Sequencing. Data Quality Control. Read mapping. Differential Expression analysis

Get to Know Your DNA. Every Single Fragment.

Next-generation sequencing and quality control: An introduction 2016

CM581A2: NEXT GENERATION SEQUENCING PLATFORMS AND LIBRARY GENERATION

Understanding the science and technology of whole genome sequencing

Transcriptomics analysis with RNA seq: an overview Frederik Coppens

NB536: Bioinformatics

DNA-Sequencing. Technologies & Devices

RNA Sequencing. Next gen insight into transcriptomes , Elio Schijlen

The New Genome Analyzer IIx Delivering more data, faster, and easier than ever before. Jeremy Preston, PhD Marketing Manager, Sequencing

Introduction to NGS. Josef K Vogt Slides by: Simon Rasmussen Next Generation Sequencing Analysis

Next generation sequencing techniques" Toma Tebaldi Centre for Integrative Biology University of Trento

Illumina Sequencing Error Profiles and Quality Control

RNA-Seq data analysis course September 7-9, 2015

Sequencing Theory. Brett E. Pickett, Ph.D. J. Craig Venter Institute

Analytics Behind Genomic Testing

The Iso-Seq Method: Transcriptome Sequencing Using Long Reads

DNA concentration and purity were initially measured by NanoDrop 2000 and verified on Qubit 2.0 Fluorometer.

Genome Resequencing. Rearrangements. SNPs, Indels CNVs. De novo genome Sequencing. Metagenomics. Exome Sequencing. RNA-seq Gene Expression

Next Generation Sequencing Technologies

Genome Sequencing. I: Methods. MMG 835, SPRING 2016 Eukaryotic Molecular Genetics. George I. Mias

High Throughput Sequencing the Multi-Tool of Life Sciences. Lutz Froenicke DNA Technologies and Expression Analysis Cores UCD Genome Center

Sequence Assembly and Alignment. Jim Noonan Department of Genetics

TECH NOTE Ligation-Free ChIP-Seq Library Preparation

Differential gene expression analysis using RNA-seq

Next- gen sequencing. STAMPS 2015 Hilary G. Morrison Joe Vineis, Nora Downey, Be>e Hecox- Lea, Kim Finnegan

Wet-lab Considerations for Illumina data analysis

Course summary. Today. PCR Polymerase chain reaction. Obtaining molecular data. Sequencing. DNA sequencing. Genome Projects.

Ecole de Bioinforma(que AVIESAN Roscoff 2014 GALAXY INITIATION. A. Lermine U900 Ins(tut Curie, INSERM, Mines ParisTech

NEXT GENERATION SEQUENCING Whole Gene Sequencing

The Expanded Illumina Sequencing Portfolio New Sample Prep Solutions and Workflow

Incorporating Molecular ID Technology. Accel-NGS 2S MID Indexing Kits

Genetics and Genomics in Medicine Chapter 3. Questions & Answers

Concepts and methods in sequencing and genome assembly

Chapter 6 - Molecular Genetic Techniques

Microarrays: since we use probes we obviously must know the sequences we are looking at!

DATA FORMATS AND QUALITY CONTROL

Genomic resources. for non-model systems

TECH NOTE Pushing the Limit: A Complete Solution for Generating Stranded RNA Seq Libraries from Picogram Inputs of Total Mammalian RNA

Transcription:

Next Gen Sequencing Contents 1 Expansion of sequencing technology 2 The Next Generation of Sequencing: High-Throughput Technologies 3 High Throughput Sequencing Applied to Genome Sequencing (TEDed CC BY-NC-ND 4.0) 4 Short Read Sequencing by Synthesis 4.1 Illumina 4.2 Ion Torrent 5 Single Molecule Real Time Sequencing 5.1 Pac Bio 6 Oxford Nanopore 7 Sequence output 8 Assembly and Alignment 9 RNA-Seq 10 Advanced Video of Variant Calling from NGS to Decipher a Genetic Susceptibility Expansion of sequencing technology Traditional sequencing of genomes was a long and tedious process that cloned fragments of genomic DNA into plasmids to generate a genomic DNA library (gdna). These plasmids were individually sequenced using Sanger sequencing methodology and computational was performed to identify overlapping pieces, like a jigsaw puzzle. This assembly would result in a draft scaffold. 1 / 15

As technology improved, the cost of sequencing genomes became less expensive. This technology outpaced the Moore s Law, a semiconductor projection about the the speed of computers as time progressed. A dramatic price decrease in cost of genome sequencing occurred around 2008 due to technical advances. 2 / 15

As the cost of genome sequencing decreased, a dramatic increase in genome deposition into Genbank was observed. These deposits reflected small genomes of bacteria and archaea. The decrease in per nucleotide sequencing cost came from the parallelization of sequencing. Whereas Sanger Sequencing is capable of sequencing one stretch at a time, a parallel assembly of sequencing reactions has lead to high throughput sequencing often dubbed Next Generation Sequencing (NGS). The Next Generation of Sequencing: High-Throughput Technologies 3 / 15

High Throughput Sequencing Applied to Genome Sequencing (TEDed CC BY-NC-ND 4.0) Short Read Sequencing by Synthesis Illumina Illumina short read sequencing uses flow cell technology where oligonucleotides complimentary to adapter primers are physically seeded. 4 / 15

Flow cell surface with the adapter oligonucleotides.fragmented DNA sequences are adapted with primers through ligation and hybridized to the flow cell. To increase the signal from sequencing, the short DNA sequences are amplified through a process called bridge amplification or cluster generation. 5 / 15

Cluster generation through bridge amplification. A low numb er of PCR cycles is used. Cluster generation aids in subsequent signal/noise determination.the flow cell undergoes successive rounds of flooding with a fluorescent nucleotide, permitted to incorporate with a DNA polymerase and washed away. After each flood/wash cycle, fluorescent signals are measured to indicate the incorporation. Specific 6 / 15

locations of fluorescence are tracked and consolidated to indicate the sequence at each registered point. Each flow cycle introduces a fluorescent nucleotide for incorporation. Ion Torrent Fragmented DNA is ligated to adapter sequences and adhered onto microbeads. The beads are embedded into microwells on a semiconductor. Ion Torrent performs the sequencing reactions in an unbuffered solution since the semiconductor acts as a ph meter to identify nucleotide incorporation. Standard nucleotides are flooded onto the chip and incorporated. Because nucleotide incorporation creates a proton (H+), a microenvironment of low ph is detected in the 7 / 15

unbuffered solution. Single Molecule Real Time Sequencing Pac Bio Pac Bio uses nanowells with covalent bonded DNA polymerase to sequence individual molecules of DNA. Fluorescent nucleotides are incorporated during synthesis reactions and a real-time incorporation can be measured. Pac Bio sequencing has the advantage of sequencing fragments of 10-20kb, in stark contrast to the short read methods. Oxford Nanopore 8 / 15

Oxford Nanopore utilizes the protein alpha-hemolysin integrated onto a semiconductor chip. The pore size of the protein is the correct ssize for a single DNA molecule to fit through. A DNA Polymerase molecule is linked to the opening of the pore where the replicated DNA is fed through. As the DNA traverses the pore, the voltage changes are measured and mapped to the qualities of specific bases. https://youtu.be/bnz880v52rq Sequence output A sample ab1 file displaying the base calls, the chromatograms and the quality scores for each base. Notice the poor quality in the red box and the corresponding peaks/bases The output file of next generation sequencing methods utilize the fastq format. Like a fasta file, there is a header that describes the sequence. The first line is the header or title line which begins with @ (remember that fasta begins with > ). The second line is the actual raw sequence (once again similar to fasta). The third line has no meaning while the fourth line is filled with symbols as long as the sequence line. This last line is the quality score of the base call. As with the Sanger sequencing, there may be ambiguity with the base call of the sequence and the certainty is maintained in the quality score. 9 / 15

Sample fastq file displaying 5 short read sequences Phred scores were developed to assess the quality of the base calls arising from fluorescent Sanger sequencing during the Human Genome Project. The phred program scans the peaks of the chromatogram and scores based on certainty or accuracy of the call. The scores are logarithmically based and scores greater than 20 represent greater than 99% accuracy of the base call. 10 / 15

Using the phred scores embedded in the last line of fastq files, poor quality reads can be removed. Using a program like FastQC permits the assessment of the reads and produces graphical representation of quality. FastQC quality output illustrating the Phred score for each base call. This short read sequence of about 100 nucleotides has all bases made at greater than 30, or > 99.9% accuracy. Assembly and Alignment Sequences from short reads must be assembled into a usable sequence. To do so, a reference genome may aid in the assembly after adapter sequences are trimmed using automated 11 / 15

methods. In the case that there is no reference genome, a related species may be used or a more computationally intensive process of de novo assembly must take place. With de novo assembly, it may be useful to have some long reads performed with PacBio to create scaffolds for generating the assembly into contiguous sequences, or contigs. RNA-Seq 12 / 15

RT-PCR and RT-qPCR can be used to measure the abundance of specific transcripts in a fairly low throughput way. Leveraging the the concept of Reverse Transcription and coupling that to high-throughput sequencing technologies, transcripts can be sequenced and mapped to a genome to depict the quantity of transcripts as represented by number of reads. 13 / 15

Given sufficient read coverage, novel splice isoforms can also be identified as different exonexon junctions are identified. The general workflow of RNA-Seq analysis follows: 14 / 15

Advanced Video of Variant Calling from NGS to Decipher a Genetic Susceptibility 15 / 15 Powered by TCPDF (www.tcpdf.org)