March 20-23, 2010 Sacramento, CA

Similar documents
Targeted Sequencing Using Droplet-Based Microfluidics. Keith Brown Director, Sales

ACCEL-NGS 2S DNA LIBRARY KITS

The Agilent Technologies SureSelect Platform for Target Enrichment

Next-Generation Sequencing. Technologies

Target Enrichment Strategies for Next Generation Sequencing

Gene Expression Profiling and Validation Using Agilent SurePrint G3 Gene Expression Arrays

Cancer Genetics Solutions

Gene Regulation Solutions. Microarrays and Next-Generation Sequencing

Incorporating Molecular ID Technology. Accel-NGS 2S MID Indexing Kits

Welcome to the NGS webinar series

Lecture #1. Introduction to microarray technology

Single Cell Genomics

Next Generation Sequencing. Target Enrichment

Bioinformatics Advice on Experimental Design

Lab methods: Exome / Genome. Ewart de Bruijn

Gene Expression Technology

A Genomics (R)evolution: Harnessing the Power of Single Cells

Implementation of Automated Sample Quality Control in Whole Exome Sequencing

Introductory Next Gen Workshop

Introduction Bioo Scientific

Personal Genomics Platform White Paper Last Updated November 15, Executive Summary

SMARTer Ultra Low RNA Kit for Illumina Sequencing Two powerful technologies combine to enable sequencing with ultra-low levels of RNA

Targeted Sequencing in the NBS Laboratory

Sequencing technologies. Jose Blanca COMAV institute bioinf.comav.upv.es

How much sequencing do I need? Emily Crisovan Genomics Core

High Cross-Platform Genotyping Concordance of Axiom High-Density Microarrays and Eureka Low-Density Targeted NGS Assays

Illumina TruSeq RNA Access Library Prep Kit Automated on the Biomek FX P Dual-Hybrid Liquid Handler

High-Resolution Oligonucleotide- Based acgh Analysis of Single Cells in Under 24 Hours

Sequencing technologies. Jose Blanca COMAV institute bioinf.comav.upv.es

Sequence Assembly and Alignment. Jim Noonan Department of Genetics

NEBNext. for Ion Torrent LIBRARY PREPARATION KITS

Complementary Technologies for Precision Genetic Analysis

Third Generation Sequencing

Jenny Gu, PhD Strategic Business Development Manager, PacBio

To sequence or not to sequence is not a question anymore. BUT

Next Generation Sequencing Lecture Saarbrücken, 19. March Sequencing Platforms

Introduction to genome biology

Whole Transcriptome Analysis of Illumina RNA- Seq Data. Ryan Peters Field Application Specialist

Analysis of Differential Gene Expression in Cattle Using mrna-seq

Gene Expression on the Fluidigm BioMark HD

MiSeq. system applications

scgem Workflow Experimental Design Single cell DNA methylation primer design

RADSeq Data Analysis. Through STACKS on Galaxy. Yvan Le Bras Anthony Bretaudeau Cyril Monjeaud Gildas Le Corguillé

Efficiency in Next-Generation Sequencing for Public Health

Introduction to Bioinformatics and Gene Expression Technologies

RIPTIDE HIGH THROUGHPUT RAPID LIBRARY PREP (HT-RLP)

Data Analysis with CASAVA v1.8 and the MiSeq Reporter

Long and short/small RNA-seq data analysis

DNA-Sequencing. Technologies & Devices. Matthias Platzer. Genome Analysis Leibniz Institute on Aging - Fritz Lipmann Institute (FLI)

Regulation of eukaryotic transcription:

DNA-Sequencing. Technologies & Devices. Matthias Platzer. Genome Analysis Leibniz Institute on Aging - Fritz Lipmann Institute (FLI)

Molecular Markers CRITFC Genetics Workshop December 9, 2014

Automated size selection of NEBNext Small RNA libraries with the Sage Pippin Prep

QIAGEN s NGS Solutions for Biomarkers NGS & Bioinformatics team QIAGEN (Suzhou) Translational Medicine Co.,Ltd

SOLiD Total RNA-Seq Kit SOLiD RNA Barcoding Kit

Expression Array System

RNA-Sequencing analysis

HiSeqTM 2000 Sequencing System

Methods of Biomaterials Testing Lesson 3-5. Biochemical Methods - Molecular Biology -

Getting the Most from Your DNA Analysis from Purification to Downstream Analysis. Eric B. Vincent, Ph.D. February 2013

Considerations for Illumina library preparation. Henriette O Geen June 20, 2014 UCD Genome Center

Assay Design Considerations, Optimization and Validation

Outline. Array platform considerations: Comparison between the technologies available in microarrays

Getting high-quality cytogenetic data is a SNP.

From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow

Recent technology allow production of microarrays composed of 70-mers (essentially a hybrid of the two techniques)

Outline. General principles of clonal sequencing Analysis principles Applications CNV analysis Genome architecture

Application Note. Genomics. Abstract. Authors. Kirill Gromadski Ruediger Salowsky Susanne Glueck Agilent Technologies Waldbronn, Germany

Functional Genomics Research Stream. Research Meeting: June 19, 2012 SYBR Green qpcr, Research Update

2100 Bioanalyzer. Overview & News. Ralph Beneke Dec 2010

Using Low Input of Poly (A) + RNA and Total RNA for Oligonucleotide Microarrays Application

H3A - Genome-Wide Association testing SOP

Reduce microrna RT-qPCR Assay Costs by More Than 10-fold Without Compromising Results

De novo metatranscriptome assembly and coral gene expression profile of Montipora capitata with growth anomaly

Axiom mydesign Custom Array design guide for human genotyping applications

What is a microarray

Ecole de Bioinforma(que AVIESAN Roscoff 2014 GALAXY INITIATION. A. Lermine U900 Ins(tut Curie, INSERM, Mines ParisTech

Introduction to gene expression microarray data analysis

Performance characteristics of the High Sensitivity DNA kit for the Agilent 2100 Bioanalyzer

SNP GENOTYPING WITH iplex REAGENTS AND THE MASSARRAY SYSTEM

Microarray Technique. Some background. M. Nath

Title: High-quality genome assembly of channel catfish, Ictalurus punctatus

Exploring genomic databases: Practical session "

Roche Sequencing Story

Ensembl Tools. EBI is an Outstation of the European Molecular Biology Laboratory.

Microarray Gene Expression Analysis at CNIO

Chips and next-generation sequencing tools for diagnosis in NMDs

De Novo Assembly of High-throughput Short Read Sequences

RNA-Seq Workshop AChemS Sunil K Sukumaran Monell Chemical Senses Center Philadelphia

DNA Arrays Affymetrix GeneChip System

SPH 247 Statistical Analysis of Laboratory Data

MHC Region. MHC expression: Class I: All nucleated cells and platelets Class II: Antigen presenting cells

IMGM Laboratories GmbH. Sales Manager

TECHNICAL BULLETIN. SeqPlex DNA Amplification Kit for use with high throughput sequencing technologies. Catalog Number SEQX Storage Temperature 20 C

PROTOCOL: Illumina Barcoded Paired-End Capture Library Preparation

TruSeq Enrichment Guide FOR RESEARCH USE ONLY

Dapprich et al. BMC Genomics (2016) 17:486 DOI /s

Hybridization capture of DNA libraries using xgen Lockdown Probes and Reagents

HLA and Next Generation Sequencing it s all about the Data

Transcription:

Comparison of Commercially Available Target Enrichment Methods for Next Generation Sequencing with the Illumina Platform March 20-23, 2010 Sacramento, CA Anoja Perera, Scottie Adams, David Bintzler, Kip Bodi, Ken Dewar, Deborah Grove, Jan Kieleczawa, Robert Lyons, Aaron Noll, Sushmita Singh, Robert Steen, Michael Zianni i

Why do capture? Using Illumina HiSeq 2000: 1 run = 10 days = 200 Gb => 1 Human Genome at a cost of ~$10,000 -GWAS - Exome sequencing - Candidate gene sequencing Capture Methods Stanford.edu

Many different methods 1. Multiplex PCR Some key features: High specificity and coverage 200-20 20,000000 primer pairs target 10Kb to 10Mb Genomic DNA input: 2 μg Requires capital equipment 2. Hybridization approaches; in-solution or array based

Agilent SureSelect Some key features: earray: free of charge custom array design tool RNA probes: 100 bases One array can capture up to 3.3Mb after masking Genomic DNA input: 3μg or less No capital equipment! Sample cost: ~$1000 Automation friendly and Scalable If done manually many hands- on steps http://www.chem.agilent.com/library/brochures/5990-3532en_lo%20cms.pdf

Some key features: Febit HybSelect Microfluidic Biochip contains 8 separate micro channels for independent capture Requires capital equipment or use of a service provider 30 min hands-on time Genomic DNA : 1-5 μg Oligo length: 60-mer One array can capture up to 125Kb after masking Sample cost: over $2000 http://www.febit.com/microarray-sequencing/services/hybselect/

NimbleGen Array Capture Some key features: Requires a hybridization and elution systems or use of a service provider Charged array design (waived if more than 5 arrays) Genomic DNA input: 3μg or less One array capture 5-30 Mb after masking Validated arrays for human studies Sample cost: ~$1000 2010: In-solution capture is available http://www.nimblegen.com/products/seqcap/

2009/10 DSRG study Coriell DNA: The Human Reference Genetic Material Repository DNA Sample (catalog ID: NS12911) http://huref.jcvi.org/ Two types of regions selected (total ~3.5Mb): 1. 2Mb continuous region 2. 31 individual genes* * The genes selected ranged widely in regards to size (2kb to 400kb), exon numbers, GC content, number of transcripts and repetitive nature of the sequences. All companies were provided with ensembl gene IDs and genomic locations.

2009/10 DSRG study DSRG: Select DNA and regions *** Illumina paired-end sample prep kit provided to all participants! Agilent Febit NimbleGen Illumina library prep + In-solution capture Illumina library prep + Array-based capture Illumina library prep + Array-based capture DSRG: Sequence samples on Illumina GAII at 2 different centers Data Analysis at 2 different centers

QC and Illumina Run Statistics Samples were run on the Agilent High Sensitivity chip to assess the quantity and quality Samples were loaded in equal nm concentrations for each technology on two Illumina paired-end end flowcells Two lanes were dedicated for each technology and the flowcells were run at different centers Illumina Primary Analysis Elizabeth Ketterer, Kendra Young

Data Analysis 1. Combined the datasets from both sequencing centers for each replicate 2. Filtered each data set so that sequences have quality score > 10 for 100% of the bases 3. Mapped reads against the hg19/grch37 genome using bowtie 0.12.0 4. Normalized the data sets to equal sizes 5. A series of perl scripts were then used to calculate coverage per position in every targeted region, creating a coverage map 6. Coverage maps were imported into the R statistical computing environment (2.1.0), to find the sensitivity, specificity, and reproducibility for each sample 7. Plots and figures were generated using the "ggplot2" library and MS Excel Kip Bodi, Aaron Noll

Sensitivity: How much was captured? a g e e a s t 1 X c o v e r a a s e s w i t h a t l e % T a r g e t e d b 100 90 80 70 60 50 40 30 20 10 0 Agilent: Replicate 1 Agilent: Replicate 2 Febit: Lane 1 Febit: Lane 2 Nimblegen: Replicate 1 Technology Nimblegen: Replicate 2 Converted hg18 genome build coordinates to hg19 co-ordinates Determine the overlapping regions between the two different targeted regions Reanalyze all the data for only the overlapping regions. 3.5Mb 2.1Mb Kip Bodi, Aaron Noll

Sensitivity: How much of the 2.1Mb was captured by at least 1 read? 100.00% % Targeted ba ases with at le east 1X cover reage 90.00% 80.00% 70.00% 60.00% 50.00% 40.00% 30.00% 00% 20.00% 10.00% 0.00% Agilent: Replicate1 Agilent: Replicate2 Febit: Lane1 Febit: Lane2 NimbleGen: Replicate1 NimbleGen: Replicate2 Capture method Kip Bodi, Aaron Noll

Specificity: How many of the reads mapped to the intended targets? 100.0% 90.0% 80.0% % Sequenc ces that are on ta arget 70.0% 60.0% 50.0% 40.0% 30.0% 20.0% 10.0% 0.0% Agilent: Replicate1 Agilent: Replicate2 Febit: Lane1 Febit: Lane2 NimbleGen: Replicate1 NimbleGen: Replicate2 Capture method Kip Bodi

% Coverage vs. Read Depth 100.00% 00% 90.00% 80.00% 70.00% % Cover rage 60.00% 50.00% 40.00% 30.00% 00% 20.00% 10.00% Agilent Febit NimbleGen 0.00% 1X 5X 10X 20X 30X 40X 50X 60X 70X 80X 90X 100X Read Depth Kip Bodi, Aaron Noll

Coverage of Overlapping Genes Aaron Noll

Coverage of the Continuous Region Kip Bodi

In Summary Description Agilent Febit NimbleGen Cost per sample Sample input requirement Quality of captured sample Reproducibility Sensitivity Specificity Coverage Scalability In-solution Array Array

Future Directions Repeat Agilent capture with probes designed to the 3.5Mb region of hg19 Febit capture: What went wrong? Repeat? Determine reasons for low coverage areas that are common to all three technologies Perform SNP analysis - check how well each technology can detect SNPs in a given region - look for false positives and negatives

Many Thanks!!! Agilent/Vector Biotech Alexander Wong Fred Ernani Garrick Peters Katie Weaver Ken Olinger Nick Mapara Febit Han Lee Jack Leonard Natalie Oliveira NimbleGen Daniel Burgess Lance Brown Michael Frawley Xinmin Zhang Illumina Jonathan Pinter Data Analysis: Aaron Noll Kip Bodi Others: Christine Brennan Constance Esposito Elizabeth Ketterer Karen Staehling Kendra Young DSRG David Bintzler Deborah Grove Jan Kieleczawa Ken Dewar Michael Zianni Robert Lyons Robert Steen Scottie Adams Sushmita Singh