Gene Expression Profiling and Validation Using Agilent SurePrint G3 Gene Expression Arrays

Similar documents
Gene Regulation Solutions. Microarrays and Next-Generation Sequencing

Using Low Input of Poly (A) + RNA and Total RNA for Oligonucleotide Microarrays Application

High-Resolution Oligonucleotide- Based acgh Analysis of Single Cells in Under 24 Hours

Technical Review. Real time PCR

Cancer Genetics Solutions

Detection of the TMPRSS2:ERG fusion transcript

Expression Array System

Agilent s Mx3000P and Mx3005P

New Stringent Two-Color Gene Expression Workflow Enables More Accurate and Reproducible Microarray Data

Optimizing FFPE DNA preparation for use in SureSelect XT2

High Cross-Platform Genotyping Concordance of Axiom High-Density Microarrays and Eureka Low-Density Targeted NGS Assays

Gene Expression Technology

Introduction To Real-Time Quantitative PCR (qpcr)

Gene Expression on the Fluidigm BioMark HD

MMLV Reverse Transcriptase 1st-Strand cdna Synthesis Kit

3.1.4 DNA Microarray Technology

Next Generation Sequencing. Target Enrichment

Microarray Technique. Some background. M. Nath

Recent technology allow production of microarrays composed of 70-mers (essentially a hybrid of the two techniques)

P HENIX. PHENIX PCR Enzyme Guide Tools For Life Science Discovery RESEARCH PRODUCTS

Next Gen Sequencing. Expansion of sequencing technology. Contents

Next-Generation Sequencing Gene Expression Analysis Using Agilent GeneSpring GX

Complementary Technologies for Precision Genetic Analysis

Agilent RNA Spike-In Kit

INTRODUCTION TO REVERSE TRANSCRIPTION PCR (RT-PCR) ABCF 2016 BecA-ILRI Hub, Nairobi 21 st September 2016 Roger Pelle Principal Scientist

Lecture #1. Introduction to microarray technology

PrimePCR Assay Validation Report

USB HotStart-IT. for increased specificity and consistent results. PCR, qpcr and qrt-pcr

qpcr Quantitative PCR or Real-time PCR Gives a measurement of PCR product at end of each cycle real time

RT 2 Easy First Strand Handbook

PCR Add-on Kit for Illumina Instruction Manual

DNA Arrays Affymetrix GeneChip System

SuperScript IV Reverse Transcriptase as a better alternative to AMV-based enzymes

Introduction to Real-Time PCR: Basic Principles and Chemistries

T7-Based RNA Amplification Protocol (in progress)

ReverTra Ace qpcr RT Master Mix

HiSeqTM 2000 Sequencing System

HiPer RT-PCR Teaching Kit

QIAGEN s NGS Solutions for Biomarkers NGS & Bioinformatics team QIAGEN (Suzhou) Translational Medicine Co.,Ltd

Stratagene QPCR Human Reference Total RNA

SYBR Green Realtime PCR Master Mix

QUANTITATIVE RT-PCR PROTOCOL (SYBR Green I) (Last Revised: April, 2007)

GeneChip Eukaryotic Small Sample Target Labeling Assay Version II *

B r1 US M

PrimePCR Assay Validation Report

De novo metatranscriptome assembly and coral gene expression profile of Montipora capitata with growth anomaly

Single Cell Genomics

Chapter 17. PCR the polymerase chain reaction and its many uses. Prepared by Woojoo Choi

RT 2 Profiler PCR Array: Web-based data analysis tutorial

Axiom mydesign Custom Array design guide for human genotyping applications

GENE EXPRESSION REAGENTS MARKETS (SAMPLE COPY, NOT FOR RESALE)

Mir-X mirna First-Strand Synthesis and SYBR qrt-pcr

Methods of Biomaterials Testing Lesson 3-5. Biochemical Methods - Molecular Biology -

Introductory Next Gen Workshop

Bioinformatics Advice on Experimental Design

From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow

Gene Expression Analysis Superior Solutions for any Project

QPCR Solutions. Our QPCR solutions meet your current research needs, offer flexibility for future research goals, and match your budget.

Welcome to the NGS webinar series

TECHNICAL BULLETIN. SeqPlex DNA Amplification Kit for use with high throughput sequencing technologies. Catalog Number SEQX Storage Temperature 20 C

RNAseq Differential Gene Expression Analysis Report

REAL TIME PCR USING SYBR GREEN

Instructions for Use. RealStar Lassa Virus RT-PCR Kit /2017 EN

Same TaqMan Assay quality, new format

mmu-mir-200a-3p Real-time RT-PCR Detection and U6 Calibration Kit User Manual MyBioSource.com Catalog # MBS826230

SOLiD Total RNA-Seq Kit SOLiD RNA Barcoding Kit

TaqMan Advanced mirna Assays

1. COMPONENTS. PyroStart Fast PCR Master Mix (2X) (#K0211 for 250 reactions of 20µl) 2. STORAGE 3. DESCRIPTION

User Manual. NGS Library qpcr Quantification Kit (Illumina compatible)

Targeted Sequencing Using Droplet-Based Microfluidics. Keith Brown Director, Sales

A systematic guideline for developing the best real-time PCR primers

Reverse transcription-pcr (rt-pcr) Dr. Hani Alhadrami

HUMAN EMBRYONIC STEM CELL (hesc) ASSESSMENT BY REAL-TIME POLYMERASE CHAIN REACTION (PCR)

Quant One Step RT-PCR Kit

Real-Time PCR Validations

Single Cell Genomics (SCG): Market Size, Segmentation, Growth, Competition and Trends ( )

Molecular Markers CRITFC Genetics Workshop December 9, 2014

Next Generation Sequencing. Jeroen Van Houdt - Leuven 13/10/2017

measuring gene expression December 5, 2017

mmu-mir-34a Real-time RT-PCR Detection Kit User Manual

Molecular Cell Biology - Problem Drill 11: Recombinant DNA

Computational Biology I LSM5191

Introduction Bioo Scientific

Performance characteristics of the High Sensitivity DNA kit for the Agilent 2100 Bioanalyzer

Introduction to BioMEMS & Medical Microdevices DNA Microarrays and Lab-on-a-Chip Methods

Ion S5 and Ion S5 XL Systems

User Manual. VisiBlue qpcr mix colorant. Version 1.3 September 2012 For use in quantitative real-time PCR

EECS730: Introduction to Bioinformatics

Score winning cdna yields with SuperScript III RT

TaqPath ProAmp Master Mixes

Globin Block Modules for QuantSeq Instruction Manual

RT 2 Profiler PCR Array System

BioBank TM cdna Kit. Instructions for the use of BioBank TM cdna in real-time PCR. BioBank control cdna

Absolute Human Telomere Length Quantification qpcr Assay Kit (AHTLQ) Catalog # reactions

Next-Generation Sequencing. Technologies

PrimeScript RT reagent Kit (Perfect Real Time)

Evaluating the Agilent 4200 TapeStation System for High Throughput Sequencing Quality Control

Data and Metadata Models Recommendations Version 1.2 Developed by the IHEC Metadata Standards Workgroup

Functional Genomics Overview RORY STARK PRINCIPAL BIOINFORMATICS ANALYST CRUK CAMBRIDGE INSTITUTE 18 SEPTEMBER 2017

Amplification: Consumables. Reagent Comparison Guide for Real-Time PCR

Transcription:

Gene Expression Profiling and Validation Using Agilent SurePrint G3 Gene Expression Arrays Application Note Authors Bahram Arezi, Nilanjan Guha and Anne Bergstrom Lucas Agilent Technologies Inc. Santa Clara, CA USA Abstract Gene expression analysis is currently carried out using three methods that provide complementary data: microarrays, RNA-Seq, and RT-qPCR. Microarrays provide comprehensive whole-genome coverage in single experiments, with rapid data acquisition and results at a low price per sample. RNA-Seq provides the most comprehensive data across all RNA transcripts present in a sample. However, RNA-Seq also requires the largest sample amounts, the most time, and the highest investment. RT-qPCR provides the fastest and the most cost effective solution for reliable expression analysis, but it is most suitable for experiments involving a small number of genes across a large number of samples. In this study, we compare Agilent SurePrint G3 gene expression microarray data with the results achieved using RT-qPCR and RNA-Seq. The data demonstrates excellent correlation between G3 microarray data and these two complementary techniques. A key factor in achieving strong correlation between microarrays and RT-qPCR data is the design of the primers and probes to the same target sequence on the microarray. This study also demonstrates the coverage of Agilent s gene expression workfl ow from discovery through biological validation.

Introduction Agilent Technologies offers a complete solution for gene expression analysis, from microarray profi ling to reverse transcriptase quantitative polymerase chain reaction (RT-qPCR) for follow-up biological validation across many samples (Figure 1). SurePrint G3 Gene Expression microarrays provide updated content and higher throughput for whole-genome analysis with eight samples per slide, enabling the evaluation of a broad spectrum of coding and non-coding RNAs in a single experiment. 6 4 3 7 2 1 1) Array selection and creation 2) Catalog and custom arrays 3) Sample prep and QA 4) Sample labeling ) Scanning and image analysis 6) Data analysis 7) RT-qPCR validation earray RT-qPCR is a commonly used tool for corroborating microarray gene expression results and further interrogating targeted genes of interest. However, microarray and RT-qPCR results sometimes disagree due to ineffi cient primer design or the varying effi ciencies of reverse transcriptase. RT-qPCR analysis using Agilent s Brilliant III qpcr reagents, IDT PrimeTime assays, and Agilent s Mx300P QPCR System provides reliable expression analysis as demonstrated here. Primers and probes can be synthesized to fi t the same target sequence from the microarray. By designing the nuclease dual-labeled probes to the G3 Gene Expression array probe sequence, one can ensure that the RT-qPCR assay is interrogating the same transcript as the microarray. This specifi c design criterion, combined with sensitive and reproducible microarray and RT-qPCR systems from Agilent, overcomes some of the common issues seen when correlating gene expression data across platforms. Figure 1. Agilent s complete gene expression analysis workfl ow offers solutions from microarrays through RT-qPCR validation. Applying next-generation sequencing to RNA samples (RNA-Seq) to detect and measure expression levels has recently become a common tool for discovery in gene expression studies. The method provides high-throughput quantitative measurement for transcriptome profi ling and effectively corroborates microarray results. Here, we assessed the reproducibility of gene expression predictions for technical replicates using Agilent SurePrint G3 8x60K microarrays and the Agilent Low Input Quick Amp (LIQA) Labeling Kit. Similarly, the correlation of RT-qPCR data from technical replicates was also evaluated. Both studies demonstrate that these approaches are highly reliable for the measurement of gene expression values. Furthermore, a comparison of results between microarrays and RT-qPCR and between microarrays and RNA-Seq shows a close correlation, indicating that Agilent s microarrays and RT-qPCR solution give accurate and reproducible results. 2

Experimental Samples The RNA samples used for microarray, RT-qPCR, and RNA-Seq analysis were the Agilent Universal Human Reference RNA (MAQC A, 740000) and Ambion Brain Reference RNA (MAQC B, AM600). The quality of the RNA samples were confi rmed with the Agilent 2100 Bioanalyzer (G2940CA) using an Agilent RNA 6000 Nano LabChip Kit (067-111). Microarray Analysis Twenty-fi ve ng of total RNA was amplifi ed and labeled using the Agilent Low Input Quick Amp Labeling Kit, One-Color (190-230), and labeled RNA was hybridized to Agilent SurePrint G3 Human Gene Expression 8x60K Microarrays (G481A). Agilent Feature Extraction Image Analysis Software (Version 10.7.3) was used to extract data from raw microarray image fi les. Data visualization and analysis was performed using GeneSpring GX (Version 11.0) software. RT-qPCR Analysis For RT-qPCR, assay primers and probes (PrimeTime Mini qpcr Assay) were ordered from Integrated DNA Technologies (IDT). IDT s Oligo Analyzer program was used to design the primers and probe sets. The probes were labeled with 6-FAM at the -end and Iowa Black FQ at the 3 -end. The Agilent Affi nityscript QPCR cdna Synthesis Kit (6009) and Brilliant III Ultra-Fast QPCR Master Mix (600880) were used for quantitative RT-PCR. All amplifi cations were carried out on the Agilent Mx300P QPCR system (401449). cdnas were synthesized using Agilent Affi nityscript QPCR cdna Synthesis Kits. qpcr assays were performed using Brilliant III Ultra-Fast QPCR Master Mix using the synthesized cdna. Duplicate reactions were run on an Agilent Mx300P using the Comparative Quantitation experiment within the MxPro Software. The differences in gene expression between the two sources of RNA were analyzed and reported as the normalized fold change of target to control. Microarray and RT-qPCR data were analyzed using Agilent MxPro software and exported for correlation analysis in GeneSpring. 3

RNA-Seq Analysis The RNA samples were prepared using Illumina mrna Sequencing Kit (RS-100-0801) with 1 μg of total RNA input. The resulting cdna was validated using the Agilent 2100 Bioanalyzer DNA-1000 Kit (067-104) to ensure that the fi nal product was approximately 200 bp. The validated cdna library underwent cluster amplifi cation on single read fl ow cells using the Illumina Single Read Cluster Generation Kit v2 (GD-203-2001-1), followed by sequencing with the Illumina 36 Cycle Sequencing Kit v3 (FC-104-3002) on the Illumina Genome Analyzer IIx. The single end sequencing was performed with short 36 bp reads. The raw sequencing data was aligned using the GERALD software and processed with the Illumina Consensus Assessment of Sequence and Variation (CASAVA) software to convert raw image data into intensity scores, base calls, and quality score alignments. Microarray and RT-qPCR Comparison The Agilent MxPro text output fi le was modifi ed with column headers Sample, Detector, Task, and Ct Avg for uploading into the Agilent GeneSpring GX program. Both the microarray and the RT-qPCR data were baseline transformed to MAQC A. The RT-qPCR entity list was translated into Agilent Single Color technology (Gene Spring Technology 27220) using normalized signal values. An entity list was created for the microarray data in which only the genes studied by RT-qPCR were included. In cases where multiple probes on the microarray were available for a single gene, which occurs when there are multiple splice variants at the 3 end, only the probe that matched the RT-qPCR primer design was used for comparison. Correlation plots were made using Plot List Associated Values in GeneSpring with the translated entity list. RNA-Seq Comparison to Microarray Microarray data was extracted from the raw image fi les using Agilent Feature Extraction software (version 10..1) and the text output fi les were imported into Agilent GeneSpring GX software (version 11.0). In GeneSpring GX, the microarray data was 7 th percentile normalized and baseline transformed to the MAQC A samples to create log 2 MAQC B/MAQC A ratios. The biological probes were fi ltered in GeneSpring to remove non-detected probes. The fi ltered microarray data, along with the sequencing text output fi les and RT-qPCR text fi les, were imported into Microsoft Access. A Microsoft Access query was used to generate the sequencing log 2 MAQC B/MAQC A ratios and was used to compare the sequencing and microarray data. The resulting query was visualized in Spotfi re DecisionSite (version 19.1.0.89). 4

Results and Discussion A Gene expression studies rely on the sensitive detection of genes as well as the accurate representation of gene expression levels. Microarrays offer a simple, fast, and affordable approach to genome wide expression analysis. SurePrint G3 Gene Expression Microarrays are found to be highly reliable, with reproducibility demonstrated by a correlation of R > 0.99 for signals across technical replicates in one-color assays (Figure 2). This high reproducibility spanned a dynamic range greater than orders of magnitude, providing dependable measurement of both low and high expressors. MAQC A, Replicate 2 10 0 - R = 0.993-0 10 MAQC A, Replicate 1 B 10 R = 0.993 MAQC B, Replicate 2 0 - - 0 10 MAQC B, Replicate 1 Figure 2. Correlation of microarrays using scatter plots to show the reproducibility of signal intensities between two technical replicates from MAQC A (A) and MAQC B (B).

The comparison of the ratios of the MAQC B/MAQC A sample between the RT-qPCR predictions, with levels determined by microarray analysis, indicates a very high concordance between both techniques (Figure 3) with a correlation of R > 0.99 between the two orthogonal assays. Ratios calculated from expression levels based on RNA-Seq data were compared to microarray expression predictions and found to be highly concordant as well, with a correlation of R > 0.90 (Figure 4). Microarray Log 2 Ratio B/A 1 10 0 - R = 0.992 m = 0.947-10 -1-1 -10-0 10 1 qrt-pcr Log 2 Ratio B/A Figure 3. Correlation plot of Microarray and RT-qPCR log ratios demonstrates that Agilent microarrays provide accurate gene expression values. Both the microarray and RT-qPCR data were baseline transformed to MAQC A and log 2 ratios were plotted. The microarray entity list contained only the genes studied by RT-qPCR. 10 M = 0.972 R = 0.900 N = 13702 RNA-Seq, 1µg 0 - -10-10 - 0 10 8x60K, 2 ng, 1-color Figure 4. Comparison of SurePrint G3 microarrays and RNA-Seq data. The log 2 ratios of microarray and RNA-Seq data, baseline transformed to MAQC A, were plotted. 6

The correlation of Agilent SurePrint G3 gene expression microarray data with RT-qPCR and RNA-Seq results confi rms that each provides an appropriate and complementary method for gene expression analysis (Figure ). While RNA-Seq data provides the most comprehensive results across all RNA transcripts present in a sample, it also requires the highest RNA input amount, the longest time to collect data, the most complex data analysis, and the highest cost. Therefore, RNA-Seq is well-suited to discovery projects, but less optimal for profi ling and high-throughput studies. In contrast, microarrays provide comprehensive whole-genome coverage in single experiments, with rapid results and low cost per sample. This makes microarrays highly suitable for profi ling of hundreds or thousands of targets, or for validating results identifi ed in RNA-Seq experiments. The possibility of creating custom gene expression microarrays using the Agilent SurePrint G3 gene expression platform opens a window for validating novel information attained through a discovery project using RNA-Seq. Alternatively, when only a handful of genes require profi ling or validation, RT-qPCR provides the fastest and the most cost-effective solution for reliable expression analysis. Conclusions Agilent SurePrint G3 Gene Expression Microarray data tightly correlates with RT-qPCR data generated on the Agilent Mx300P QPCR System using Brilliant III Ultra-Fast QPCR reagents. A key factor in achieving strong correlation of data is designing primers and probes to the same target sequence from the microarray. Data from both platforms was easily imported and analyzed in GeneSpring GX to further streamline the workfl ow. We also demonstrate the consistency of the microarray data with RNA-Seq results. Overall, this study establishes the reliability of Agilent s gene expression product offerings in giving accurate and reproducible results with complementary technologies. Discovery Profiling Validation RNA - Seq Microarrays qrt - PCR Starting Material 1 µg 10 ng ng Sample Prep Total Time 11. h 6. h 1 h Sample Prep Hands-on TIme 3.7 h 1. h 30 min Sample to Data Total Time ~ 1 week 1. d 2 h Analysis Challenging Simple Simple Price per sample $$$ $$ $ Figure. Workfl ow comparing three different gene expression analysis platforms. 7

www.agilent.com/genomics/ microarrays For Research Use Only. Not for use in diagnostic procedures. This information is subject to change without notice. Agilent Technologies, Inc., 2012 Published in the USA, March 30, 2012 990-993EN