High Throughput Sequencing the Multi-Tool of Life Sciences. Lutz Froenicke DNA Technologies and Expression Analysis Cores UCD Genome Center

Similar documents
High Throughput Sequencing the Multi-Tool of Life Sciences. Lutz Froenicke DNA Technologies and Expression Analysis Cores UCD Genome Center

Wet-lab Considerations for Illumina data analysis

Genome Resequencing. Rearrangements. SNPs, Indels CNVs. De novo genome Sequencing. Metagenomics. Exome Sequencing. RNA-seq Gene Expression

High Throughput Sequencing the Multi-Tool of Life Sciences. Lutz Froenicke DNA Technologies and Expression Analysis Cores UCD Genome Center

How to Prepare for your Next Gen-Sequencing Project. Lutz Froenicke DNA Technologies and Expression Analysis Cores UCD Genome Center

Considerations for Illumina library preparation. Henriette O Geen June 20, 2014 UCD Genome Center

Deep Sequencing technologies

Next-generation sequencing technologies

Novel methods for RNA and DNA- Seq analysis using SMART Technology. Andrew Farmer, D. Phil. Vice President, R&D Clontech Laboratories, Inc.

Transcriptomics analysis with RNA seq: an overview Frederik Coppens

02 Agenda Item 03 Agenda Item

DNA concentration and purity were initially measured by NanoDrop 2000 and verified on Qubit 2.0 Fluorometer.

Wheat CAP Gene Expression with RNA-Seq

G E N OM I C S S E RV I C ES

Experimental Design. Dr. Matthew L. Settles. Genome Center University of California, Davis

Integrated NGS Sample Preparation Solutions for Limiting Amounts of RNA and DNA. March 2, Steven R. Kain, Ph.D. ABRF 2013


RNA Sequencing. Next gen insight into transcriptomes , Elio Schijlen

The Genome Analysis Centre. Building Excellence in Genomics and Computa5onal Bioscience

NextGen Sequencing Technologies Sequencing overview

High Throughput Sequencing Technologies. J Fass UCD Genome Center Bioinformatics Core Monday September 15, 2014

SMARTer Ultra Low RNA Kit for Illumina Sequencing Two powerful technologies combine to enable sequencing with ultra-low levels of RNA

High Throughput Sequencing Technologies. J Fass UCD Genome Center Bioinformatics Core Monday June 16, 2014

CM581A2: NEXT GENERATION SEQUENCING PLATFORMS AND LIBRARY GENERATION

High Throughput Sequencing Technologies. J Fass UCD Genome Center Bioinformatics Core Tuesday December 16, 2014

The New Genome Analyzer IIx Delivering more data, faster, and easier than ever before. Jeremy Preston, PhD Marketing Manager, Sequencing

Introduction Bioo Scientific

Overview of Next Generation Sequencing technologies. Céline Keime

Welcome to the NGS webinar series

Illumina s Suite of Targeted Resequencing Solutions

Sample to Insight. Dr. Bhagyashree S. Birla NGS Field Application Scientist

Next-Generation Sequencing. Technologies

High Throughput Sequencing Technologies. UCD Genome Center Bioinformatics Core Monday 15 June 2015

SO YOU WANT TO DO A: RNA-SEQ EXPERIMENT MATT SETTLES, PHD UNIVERSITY OF CALIFORNIA, DAVIS

The Expanded Illumina Sequencing Portfolio New Sample Prep Solutions and Workflow

Lecture 7. Next-generation sequencing technologies

Template Preparation FIND MEANING IN COMPLEXITY. Copyright 2014 by Pacific Biosciences of California, Inc. All rights reserved.

Matthew Tinning Australian Genome Research Facility. July 2012

Genomic & RNA Profiling Core Facility

Bioinformatics Advice on Experimental Design

Bi 8 Lecture 4. Ellen Rothenberg 14 January Reading: from Alberts Ch. 8

Incorporating Molecular ID Technology. Accel-NGS 2S MID Indexing Kits

Surely Better Target Enrichment from Sample to Sequencer

Illumina Sequencing Overview

RAPID, ROBUST & RELIABLE

Obtain superior NGS library performance with lower input amounts using the NEBNext Ultra II Directional RNA Library Prep Kit for Illumina

Obtain superior NGS library performance with lower input amounts using the NEBNext Ultra II Directional RNA Library Prep Kit for Illumina

Applications of Next Generation Sequencing in Metagenomics Studies

Gene Expression Technology

resequencing storage SNP ncrna metagenomics private trio de novo exome ncrna RNA DNA bioinformatics RNA-seq comparative genomics

Introductory Next Gen Workshop

Next-generation sequencing technologies

Research school methods seminar Genomics and Transcriptomics

Aaron Liston, Oregon State University Botany 2012 Intro to Next Generation Sequencing Workshop

RNA-Seq data analysis course September 7-9, 2015

Increased transcription detection with the NEBNext Single Cell/Low Input RNA Library Prep Kit

Next Generation Sequencing. Tobias Österlund

Bacterial Iso-Seq Transcript Sequencing Using the SMARTer PCR cdna Synthesis Kit and BluePippin Size-Selection System

Parts of a standard FastQC report

Cancer Genetics Solutions

TECH NOTE Pushing the Limit: A Complete Solution for Generating Stranded RNA Seq Libraries from Picogram Inputs of Total Mammalian RNA

TREE CODE PRODUCT BROCHURE

P HENIX. PHENIX PCR Enzyme Guide Tools For Life Science Discovery RESEARCH PRODUCTS

Applications and Sample Preparation for SOLiD Runs

Single Cell Genomics

SureSelect Target Enrichment for the Ion Proton TM Next Generation Sequencing System

Overcome limitations with RNA-Seq

Introducing QIAseq. Accelerate your NGS performance through Sample to Insight solutions. Sample to Insight

ACCEL-NGS 2S DNA LIBRARY KITS

How much sequencing do I need? Emily Crisovan Genomics Core

Next Generation Sequencing. Jeroen Van Houdt - Leuven 13/10/2017

How much sequencing do I need? Emily Crisovan Genomics Core September 26, 2018

New Frontiers of Genetic Profiling Achieve Higher Sensitivity and Greater Insights with Molecular Barcodes, Long Read Capture and Optimized Exomes

Obtaining DNA from degraded samples for NGS sequencing

Outline General NGS background and terms 11/14/2016 CONFLICT OF INTEREST. HLA region targeted enrichment. NGS library preparation methodologies

Single Cell Genomics

Whole genome Bisulfite Sequencing for Methylation Analysis Preparing Samples for the Illumina Sequencing Platform

Plant Breeding and Agri Genomics. Team Genotypic 24 November 2012

Surely Better Target Enrichment from Sample to Sequencer and Analysis

Next-generation sequencing and quality control: An introduction 2016

GENOMICS WORKFLOW SOLUTIONS THAT GO WHERE THE SCIENCE LEADS. Genomics Solutions Portfolio

A Genomics (R)evolution: Harnessing the Power of Single Cells

solid S Y S T E M s e q u e n c i n g See the Difference Discover the Quality Genome

NEXT GENERATION SEQUENCING. Innovative solutions for library

Basics of RNA-Seq. (With a Focus on Application to Single Cell RNA-Seq) Michael Kelly, PhD Team Lead, NCI Single Cell Analysis Facility

Human Genome Sequencing Over the Decades The capacity to sequence all 3.2 billion bases of the human genome (at 30X coverage) has increased

Admera Health RUO Services

Chapter 7. DNA Microarrays

Joint RuminOmics/Rumen Microbial Genomics Network Workshop

Next Gen Sequencing. Expansion of sequencing technology. Contents

High-quality stranded RNA-seq libraries from single cells using the SMART-Seq Stranded Kit Product highlights:

Get to Know Your DNA. Every Single Fragment.

SMARTer for NGS. SMARTer Solutions 다카라코리아바이오메디칼

Next-Generation Sequencing: custom solutions

Exploring new frontiers with next-generation sequencing

NextSeq 500 System WGS Solution

Preparing normalized cdna libraries for transcriptome sequencing (Illumina HiSeq)

SOLiD Total RNA-Seq Kit SOLiD RNA Barcoding Kit

Reading Lecture 8: Lecture 9: Lecture 8. DNA Libraries. Definition Types Construction

Multiplexed Strand-specific RNA-Seq Library Preparation for Illumina Sequencing Platforms

Transcription:

High Throughput Sequencing the Multi-Tool of Life Sciences Lutz Froenicke DNA Technologies and Expression Analysis Cores UCD Genome Center

DNA Technologies & Expression Analysis Cores HT Sequencing (Illumina & PacBio) Illumina microarray (expression analysis, genotyping) consultations introducing new technologies to the campus shared equipment teaching (workshops)

Hi-C Lieberman-Aiden 2010

Hi-C Lieberman-Aiden 2010

Dovetail Sequencing (Putnam 2015)

Bisulfite Conversion METHYL-Seq

ChIP-Seq Chromatin Immuno Preciptation DNA/Protein interactions e.g. transcription factors

DRIP-Seq R-loops: three-stranded structure Genome-wide mapping of RNA-DNA hybrids

Complementary Approaches Illumina Still-imaging of clusters (~1000 clonal molecules) PacBio Movie recordings fluorescence of single molecules Short reads - 2x300 bp Miseq Repeats are mostly not analyzable High output - up to 100 Gb per lane Up to 60 kb, N50 21 kb spans retro elements up to 1,3 Gb per SMRT-cell High accuracy ( < 0.5 %) Error rate 15 % Considerable base composition bias Very affordable De novo assemblies of thousands of scaffolds No base composition bias Costs 5 to 10 times higher Near perfect genome assemblies

Genome Resequencing De novo genome Sequencing SNPs, Indels CNVs Rearrangements Metagenomics RNA-seq Gene Expression Splice Isoform Abundance High Throughput Short Read Sequencing: Illumina Exome Sequencing DNA Methylation ChIP-SEQ 3D Organization Genotyping Small RNA

Genome Resequencing Rearrangements De novo genome Sequencing RNA-seq Gene Expression Splice Isoform Abundance SNPs, Indels CNVs Long Read Sequencing PacBio Metageno mics Exome Sequencing DNA Methylation ChIP-SEQ 3D Organization Genotyping Small RNA

Illumina sequencing workflow Library Construction Cluster Formation Sequencing Data Analysis

Fragmentation Mechanical shearing: BioRuptor Covaris Enzymatic: Fragmentase, RNAse3 Chemical: Mg2+, Zn2+ DNA, RNA DNA, RNA RNA

DNA library construction 5 5 5 5 Fragmented DNA End Repair 5 P OH HO P 5 Blunt End Fragments A Tailing 5 P A A P 5 Single Overhang Fragments T T Adapter Ligation DNA Fragments with Adapter Ends

Enrichment of library fragments 5 5 PCR Amplification

Illumina Sequencing Technology Sequencing By Synthesis (SBS) Technology 3 5 DNA (0.1-1.0 ug) Library preparation Single Cluster molecule generation array A C T C T G C T G A A G 5 T G C T A C G A T A C C C G A T C G A T Sequencing

TruSeq Chemistry: Flow Cell 8 channels Surface of flow cell coated with a lawn of oligo pairs

Sequencing 1.6 Billion Clusters Per Flow Cell 20 Microns 100 Microns 21

Sequencing 100 Microns 22

Patterned Flowcell

Hiseq 3000: 478 million nanowells per lane

What will go wrong? cluster identification bubbles synthesis errors:

What will go wrong? synthesis errors: Phasing & Pre-Phasing problems

HS2500 PE250 - Q20

Intensities - HS3K PE100

HS3K PE150

FASTQC

FASTQC - Nextera

FASTQC

FASTQC

FASTQC

FASTQC

FASTQC

FASTQC

FASTQC

FASTQC

If you can put adapters on it, we can sequence it! single-stranded Adapter Ligation

Optional: PCR-free libraries PCR-free library: OR if concentration allows Reduction of PCR bias against e.g. GC rich or AT rich regions, especially for metagenomic samples Library enrichment by PCR: Ideal combination: high input and low cycle number; low-bias polymerase

Quantitation & QC methods Intercalating dye methods (PicoGreen, Qubit, etc.): Specific to dsdna, accurate at low levels of DNA Great for pooling of indexed libraries to be sequenced in one lane Requires standard curve generation, many accurate pipetting steps Bioanalyzer: Quantitation is good for rough estimate Invaluable for library QC High-sensitivity DNA chip allows quantitation of low DNA levels qpcr Most accurate quantitation method More labor-intensive Must be compared to a control

Library QC by Bioanalyzer Predominant species of appropriate MW Minimal primer dimer or adapter dimers Minimal higher MW material

Library QC by Bioanalyzer ~ 125 bp Beautiful 100% Adapters Beautiful

Library QC ~125 bp Examples for successful libraries Adapter contamination at ~125 bp

Recommended RNA input Library prep kit mrna (TruSeq) Directional mrna (TruSeq) Apollo324 library robot (strand specific) Small RNA (TruSeq) Ribo depletion (Epicentre) SMARTer Ultra Low RNA (Clontech) Ovation RNA seq V2, Single Cell RNA seq (NuGen) Starting material 100 ng 4 μg total RNA 1 5 μg total RNA or 50 ng mrna 100 ng mrna 1 μg total RNA 1 5 μg total RNA 100 pg 10 ng 10 ng 100 ng

Standard RNA-Seq library protocol QC of total RNA to assess integrity Removal of rrna (most common) mrna isolation rrna depletion Fragmentation of RNA Reverse transcription and secondstrand cdna synthesis Ligation of adapters PCR Amplification Purify, QC and Quantify

18S (2500b), 28S (4000b)

RNA integrity <> reproducibility Chen et al. 2014

Considerations in choosing an RNA-Seq method Transcript type: - mrna, extent of degradation - small/micro RNA Strandedness: - un-directional ds cdna library - directional library Input RNA amount: - 0.1-4ug original total RNA - linear amplification from 0.5-10ng RNA Complexity: - original abundance - cdna normalization for uniformity Boundary of transcripts: - identify 5 and/or 3 ends - poly-adenylation sites - Degradation, cleavage sites

Is strand-specific information important? Standard library (non-directional) antisense sense Neu1

Strand-specific RNA-seq Standard library (non-directional) Antisense non-coding RNA Sense transcripts Informative for non-coding RNAs and antisense transcripts Essential when NOT using polya selection (mrna) No disadvantage to preserving strand specificity

RNA-seq for DGE Differential Gene Expression (DGE) 50 bp single end reads 30 million reads per sample (eukaryotes) 10 mill. reads > 80% of annotated genes 30 mill.. reads > 90% of annotated genes 10 million reads per sample (bacteria)

Molecular indexing for precision counts

Molecular indexing for precision counts

Other RNA-seq Transcriptome assembly: 300 bp paired end plus 100 bp paired end Long non coding RNA studies: 100 bp paired end 60-100 million reads Splice variant studies: 100 bp paired end 60-100 million reads

RNA-seq cheap and dirty - 3 Tag-sequencing - Micro-array-like data - Quant-Seq (coming soon) - Brad-Seq (Townsley 2015)

RNA-seq targeted sequencing: - Capture-seq (Mercer et al. 2014) - Nimblegen and Illumina - Low quality DNA (FFPE) - Lower read numbers 10 million reads - Targeting lowly expressed genes.

RNA-seq reproducibility Two big studies multi-center studies (2014) High reproducibility of data given: - same library prep kits, same protocols - same RNA samples - RNA isolation protocols have to be identical - robotic library preps?

http://pacificbiosciences.com THIRD GENERATION DNA SEQUENCING Single Molecule Real Time (SMRT ) sequencing Sequencing of single DNA molecule by single polymerase Very long reads: average reads over 8 kb, up to 30 kb High error rate (~13%). Complementary to short accurate reads of Illumina

70 nm aperture Zero Mode Waveguide

Damien Pelt

First Sequencing of CGG-repeat Alleles in Human Fragile X Syndrome using PacBio RS Sequencer Paul Hagerman, Biochemistry and Molecular Medicine, SOM. Single-molecule sequencing of pure CGG array, - first for disease-relevant allele. Loomis et al. (2012) Genome Research. - applicable to many other tandem repeat disorders. Direct genomic DNA sequencing of methyl groups, - direct epigenetic sequencing (paper under review). Discovered 100% bias toward methylation of 20 CGGrepeat allele in female, first direct methylated DNA sequencing in human CGG disease. 36 CGG 95 DoD STTR award with PacBio. Basis of R01 applications. C A G T Nucleotide position

Iso-Seq Pacbio Sequence full length transcripts no assembly High accuracy (except very long transcripts) More than 95% of genes show alternate splicing On average more than 5 isoforms/gene Precise delineation of transcript isoforms ( PCR artifacts? chimeras?)

C 1 Single cell capture

10X Genomics

25x linked read coverage

60 kb deletion

Thank you!