Quality Measures for CytoChip Microarrays

Similar documents
Getting high-quality cytogenetic data is a SNP.

Detecting copy-neutral LOH in cancer using Agilent SurePrint G3 Cancer CGH+SNP Microarrays

High-Resolution Oligonucleotide- Based acgh Analysis of Single Cells in Under 24 Hours

Bioinformatics Advice on Experimental Design

User Bulletin. Veriti 96-Well Thermal Cycler AmpFlSTR Kit Validation. Overview

Using the Trio Workflow in Partek Genomics Suite v6.6

3.1.4 DNA Microarray Technology

High-throughput scale. Desktop simplicity.

3C bioscience. ProbeSure Genotyping Master Mix User Guide. P a g e 1 ProbeSure User Guide v1.0

New Stringent Two-Color Gene Expression Workflow Enables More Accurate and Reproducible Microarray Data

CS273B: Deep Learning in Genomics and Biomedicine. Recitation 1 30/9/2016

Lecture #1. Introduction to microarray technology

Using Low Input of Poly (A) + RNA and Total RNA for Oligonucleotide Microarrays Application

Microarray Technique. Some background. M. Nath

DNA Arrays Affymetrix GeneChip System

MiSeq. system applications

Complementary Technologies for Precision Genetic Analysis

Runs of Homozygosity Analysis Tutorial

Data Analysis on the ABI PRISM 7700 Sequence Detection System: Setting Baselines and Thresholds. Overview. Data Analysis Tutorial

IQFISH on Dako Omnis. Panel for Lung Cancer. Dako FAST RESULTS. ALK, ROS1, RET and MET IQFISH. Dako Omnis. Agilent Pathology Solutions

Gene Expression Technology

Recent technology allow production of microarrays composed of 70-mers (essentially a hybrid of the two techniques)

Please purchase PDFcamp Printer on to remove this watermark. DNA microarray

EECS730: Introduction to Bioinformatics

TruSeq DNA PCR-Free Library Prep Protocol Guide

Introduction to gene expression microarray data analysis

TaqPath ProAmp Master Mixes

TruSeq Nano DNA Library Prep Protocol Guide

AMPURE PCR PURIFICATION PAGE 1 OF 7

Gap hunting to characterize clustered probe signals in Illumina methylation array data

Typical probes. Slides per pack Aminosilane. Long oligo- Slide AStar None D surface. nucleotides

Multiplex Assay Design

Autozygosity by difference a method for locating autosomal recessive mutations. Geoff Pollott

Microarray. Slide Selection Chart... J2. Epoxide-coated Slides... J3. GAPS II-coated Slides... J5. Corning Cover Glass... J6

NovaSeq Series. Custom Primers Guide

Humboldt Universität zu Berlin. Grundlagen der Bioinformatik SS Microarrays. Lecture

DNA Microarray Technology

Applied Biosystems 7500 Fast, 7500 and 7300 Real-Time PCR Systems

Multiplex PCR reaction

Gene Expression Profiling and Validation Using Agilent SurePrint G3 Gene Expression Arrays

Chapter 20: Biotechnology

Introduction to BioMEMS & Medical Microdevices DNA Microarrays and Lab-on-a-Chip Methods

Axiom mydesign Custom Array design guide for human genotyping applications

PERFORMANCE MADE EASY REAL-TIME PCR

Philippe Hupé 1,2. The R User Conference 2009 Rennes

Introduction. Highlights. Prepare Library Sequence Analyze Data

Reviewed: Hamilton. Contents; Overview. 2.0 Methods 3.0 Notes 4.0 Acknowledgements & References

Chromosome Analysis Suite 3.0 (ChAS 3.0)

Sanger Sequencing: Troubleshooting Guide

7.014 Quiz III 4/22/05. Write your name on this page and your initials on all the other pages in the space provided.

Introduction to Bioinformatics. Fabian Hoti 6.10.

Whole Transcriptome Analysis of Illumina RNA- Seq Data. Ryan Peters Field Application Specialist

Gene Regulation Solutions. Microarrays and Next-Generation Sequencing

PowerPlex 5-Dye Matrix Standards, 3100/3130

TruSight HLA Sequencing Panel Protocol Guide

DNA Genotyping from Human FFPE Samples Reliable and Reproducible

Paper DP04 Statistical Analysis of Gene Expression Micro Arrays John Schwarz, University of Louisville, Louisville, KY

DEPArray Technology. Sorting and Recovery of Rare Cells

SNP GENOTYPING WITH iplex REAGENTS AND THE MASSARRAY SYSTEM

Finishing Drosophila Ananassae Fosmid 2728G16

STR Profiling Matching Criteria: Establishment and Importance of a Cell Line Database

Genome-wide association studies (GWAS) Part 1

Next Generation Sequencing Lecture Saarbrücken, 19. March Sequencing Platforms

Revolutionizing genetics in the cattle industry. Extensive toolset. Proven technology. Collaborative approach.

Sequence assembly. Jose Blanca COMAV institute bioinf.comav.upv.es

Targeted Sequencing Using Droplet-Based Microfluidics. Keith Brown Director, Sales

Genome Sequencing-- Strategies

Introduction to Bioinformatics and Gene Expression Technologies

Introduction To Real-Time Quantitative PCR (qpcr)

Genome Sequence Assembly

Linkage & Genetic Mapping in Eukaryotes. Ch. 6

QIAGEN Whole Genome Amplification REPLI-g Eliminating Sample Limitations, Potential Use for Reference Material

BIOLOGY 163 LABORATORY. RESTRICTION MAPPING OF PLASMID DNA (Revised Fall 2017)

LightScanner Hi-Res Melting Comparison of Six Master Mixes for Scanning and Small Amplicon and LunaProbes Genotyping

Technical Note. Performance Review of the GeneChip AutoLoader for the Affymetrix GeneChip Scanner Introduction

Frequently Asked Questions

Analysis of Microarray Data

ANALYSING QUANTITATIVE DATA

Outline. Array platform considerations: Comparison between the technologies available in microarrays

H3A - Genome-Wide Association testing SOP

Single Cell Washing and Handling. Introduction 1 Preparation 3 Cell Wash Protocol 5 Technical Assistance 7

PowerPlex 4C Matrix Standard

I.QC (quality control of your qpcr)

Lecture Four. Molecular Approaches I: Nucleic Acids

Analysis of Microarray Data

Analysis of a Tiling Regulation Study in Partek Genomics Suite 6.6

Biology Genetics Practice Quiz

Learning Objectives :

Expression Array System

Comparison of the Promega Power Y and AB AmpFlSTR YFiler kits and the Subsequent Validation of YFiler.

Next Gen Sequencing. Expansion of sequencing technology. Contents

Exploring the Genetic Basis of Congenital Heart Defects

Multiplex PCR. Bruce Applegate, Michael Kane and Sergei Savikhin. Analysis of multiplex PCR reactions. Detect multiple target DNA sequences

PowerPlex 5C Matrix Standards, 310

MICROARRAY HYBRIDISATION PROTOCOL 10/05 (for both PCR and Oligo arrays on UltraGAPS slides)

DO NOT OPEN UNTIL TOLD TO START

Chromosome-scale scaffolding of de novo genome assemblies based on chromatin interactions. Supplementary Material

High Cross-Platform Genotyping Concordance of Axiom High-Density Microarrays and Eureka Low-Density Targeted NGS Assays

Transcription:

Quality Measures for CytoChip Microarrays How to evaluate CytoChip Oligo data quality in BlueFuse Multi software. Data quality is one of the most important aspects of any microarray experiment. This technical note outlines the ways that users can check the quality of their microarray data, to assess laboratory procedures, before proceeding with downstream analyses. Checking the data quality enables users to assess whether the genomic changes seen in a sample are real or artefacts. All CytoChip Oligo, CytoChip Focus Constitutional, and SNP microarrays come with BlueFuse Multi software. Quality control (QC) metrics are displayed in the report for each experiment. The composite image for each array can also be viewed to check for any major artefacts (Subarray QC shortcut). In addition, a confidence call is given to the imbalance calling of the chromosomes. It is also possible to view QC metrics across a database to assess data quality over time (detailed in the BlueFuse Multi Reference Manual). This document should be used as a guide only. It remains the responsibility of the laboratory personel to make the final decision on the usability of array data. For any further assistance, contact Illumina technical support at techsupport@illumina.com. Overview of QC Metrics QC Measures In BlueFuse Multi, the report displays QC measures that are relevant to the selected experiment. These measures should be consulted together with the Subarray QC shortcut and the confidence values for calling when reviewing results. SD Autosome/Robust is a measure of the dispersion of log 2 ratio of all clones on the array. A small value indicates that the log 2 ratios of the autosome clones are tightly clustered around zero, which is their expected value. It is calculated on the normalized but unsmoothed data. SD autosome is calculated using all autosomal clones on the array; SD robust is calculated using the middle 66% of the data, allowing the removal of outliers. The SD robust value is used by the software for region calling to determine gain and loss thresholds. An SD value that is too low suggests a problem with washing or hybridization, resulting in lots of non-specific signals left on the array. % Included Clones percentage of all clones that were not excluded due to inconsistency between clone replicates on an array. This measure is used for bacterial artificial chromosome (BAC) arrays (CytoChip Focus Constitutional and 24sure). If the level is low, it may indicate a major artefact on the array. There is no exclusion on oligo arrays; therefore, this value is always 100%. metric can give an indication of how well the DNA has labeled with fluorescent dyes. More importantly, extremely high values can indicate over-scanning of the microarray image or poor washing so that there is lots of non-specific signal left on the array. The balance between channels can be assessed but the Cy5 signal tends to give a higher intensity than Cy3. Major differences in the channels may indicate a labellng or a scanner problem. SBR Ch1/Ch2 the overall signal-to-background ratio for each fluorescent channel. This value is calculated by dividing the background-corrected fluorescent signal intensity from the DNA clones by the raw background signal level. SBR indicates how clearly the spots can be detected above the background. The SBR metric gives an indication of how well the labeling and washing steps have been performed. DLR Raw/Fused derivative log ratio (DLR) measures the spread of the difference in log 2 ratios between all the pairs of adjacent clones along the genome. For a simple profile, this statistic will be very similar to autosome standard deviation. The presence of many step changes in a complex profile will cause the autosome standard deviation to become inflated, whereas the DLR metric is much less sensitive to this problem. DLR fused is the DLR calculated on the normalized but unsmoothed data. Additional SNP QC Metrics CytoChip Oligo SNP and CytoChip Cancer SNP array designs include DNA probes that measure the presence of biallelic SNPs in a sample, in addition to those that assess copy-number imbalance. In the report for SNP arrays, three metrics relate to the performance of SNP calling in the experiment. These metrics are: SNPs on Array the total number of SNPs on the array design (CytoChip Oligo SNP = 26,541; CytoChip Cancer SNP = 59,593). Reference SNPs the number of SNP loci where the reference sample has a known genotype and which is not homozygous for the digested allele. All other SNP loci are excluded from analysis. For any particular reference DNA, approximately one-third of loci are expected to be homozygous for the digested allele. % High-Confidence SNPs the percentage of Reference SNPs where the genotype call has a confidence value of 95% or more. This measure indicates the overall quality of the SNP data and mainly corresponds to how well the peaks of the SNP histogram are resolved from each other. Overlapping peaks leads to low-confidence SNP calls and a low value for the percentage of high-confidence SNPs. Mean Spot Amplitude Ch1/Ch2 the mean fluorescent signal intensities for the two channels; channel 1 = sample (typically Cy3) and channel 2 = reference (typically Cy5). This metric is variable due to the differences between available scanners. The mean spot amplitude

Oligo Microarray QC Guidance Oligo QC Metrics Basic QC criteria settings can be used as a guide to assess the performance of an oligo microarray experiment before analyzing the results (Table 1). Illumina oligo arrays include: CytoChip Oligo, CytoChip ISCA, CytoChip Cancer, CytoChip Focus 8x60K, 4x180K, Table 1: Basic QC Metrics SD Autosome/Robust Good 0.08 0.15 Too low < 0.07 Too high > 0.17 % Included Clones 100% for oligo designs Mean Spot Amplitude (signal)* Good 400 2,000 Too low < 400 Too high > 2,500 SBR Good 5 20 Too low < 3 Too high > 30 DLR Raw/Fused Good < 0.15 *Scanner-dependent OK < 0.20 Fail > 0.24 and oligo SNP array designs. The CytoChip Focus 4x180K platform requires less DNA (200 ng) for hybridization; as a result, the QC metrics for mean spot amplitude are different (Table 2). These values are based on preliminary results and may be modified once there is more experience using the platform. Table 2: QC Metrics for CytoChip Focus 4x180K Arrays SD Autosome/Robust Good 0.08 0.15 Too low < 0.07 Too high > 0.17 % Included Clones 100% for oligo designs Mean Spot Amplitude (signal)* Good 200+ Too low < 200 Too high > 2,500 SBR Good 5 20 Too low < 2 Too high > 30 DLR Raw/Fused Good < 0.15 *Scanner-dependent OK < 0.20 Fail > 0.24

Figure 1: Fused Chart for Good Oligo Array A Good Quality Oligo Array Result Figure 1 shows a fused chart plot from BlueFuse Multi and is an example of a result from a good experiment. There is clear separation of normal regions and regions of imbalance. For this type of result, there is no need to rely on QC metrics. Figure 2: Composite Image for Good Oligo Array Figure 2 shows the composite image for the array, which can be viewed using the Image QC shortcut in the software. Oligo slides have QC measures incorporated during microarray manufacture and are supposed to have dark corners with a few bright spots in the extreme corners. Figure 3 is an image of a scan that shows the corner of the microarray. The bright spots are visible, with clear dark corners and no spots visible in this region. There is also a very even overall background on this image. If the dark corners are visible, it is indicative of non-specific fluorescent signal, perhaps due to a washing issue. Table 3 shows the QC metrics for a good oligo array. Table 3: QC Metrics for Good Oligo Array Parameter Value Comment SD Autosome/Robust 0.12/0.09 Good robust SD score; below 0.1 is a very good result % Included Clones 100.00 No replicates on the array can be ignored Mean Spot Amplitude Ch1/Ch2 325.87/429.86 Low side of mean signal but acceptable SBR Ch1/Ch2 8.02/9.62 SBRs fall within the optimal range DLR Raw/Fused 0.15/0.14 Good fused DLR value; any value below 0.15 is a very good result

Figure 3: Fused Chart for Marginal Oligo Array A Marginal Quality Oligo Array Result Figure 3 shows a fused chart plot from BlueFuse Multi and is an example of a result from a difficult-to-call oligo array. QC metrics must be taken into account when analyzing this array. On closer examination (right panel), large changes are called but would require extra care when checking through the imbalances. Figure 4: Composite Image for Marginal Oligo Array Figure 4 is a scan image that shows the corner of this microarray. It is clear that the image has been over-scanned, as there are very bright signals. It is not likely to be a wash problem as spots in the dark corners are still not visible. Table 4 shows the QC metrics for the marginal oligo array. Table 4: QC Metrics for Marginal Oligo Array Parameter Value Comment SD Autosome/Robust 0.19/0.15 Anything below 0.15 should be interpretable for large changes; this result is borderline % Included Clones 100.00 No replicates on the array can be ignored Mean Spot Amplitude Ch1/Ch2 6250.15/7278.70 Massively over-scanned and will cause suppression of some ratios, which can result in under-calling SBR Ch1/Ch2 63.63/22.52 SBRs are artificially high due to the over-scanning of the arrays. This result would have been enhanced with a reduced scanner setting. DLR Raw/Fused 0.21/0.21 Any value below 0.2 should be interpretable for large changes; this result is borderline

Figure 5: Fused Chart for Failed Oligo Array A Failed Quality Oligo Array Result Figure 5 shows a fused chart plot from BlueFuse Multi and is an example of a result from a failed oligo array. The data are very noisy, which makes it difficult to determine the true calls. Figure 6: Composite Image for Failed Oligo Array Figure 6 is a scan image that shows the corner of this microarray. The array looks reasonable, the background looks low, and the dark corners are acceptable. Table 5 shows the QC metrics for the failed oligo array. Table 5: QC Metrics for Failed Oligo Array Parameter Value Comment SD Autosome/Robust 0.27/0.19 Very poor robust SD score; any value above 0.17 is a bad result and almost impossible to call accurately % Included Clones 100.00 No replicates on the array can be ignored Mean Spot Amplitude Ch1/Ch2 711.75/1151.61 Signal is around optimal, so not the cause of the noise SBR Ch1/Ch2 8.55/7.70 SBRs within acceptable range DLR Raw/Fused 0.28/0.28 Very high DLR, probably a DNA-related issue

Figure 7: Fused Chart for Good Focus Array CytoChip Focus BAC QC Guidance CytoChip Focus Constitutional (BAC Array) QC Metrics These basic QC criteria settings can be used as a guide to assess CytoChip Focus Constitutional microarray results (Table 6). A Good Quality CytoChip Focus Array Result Figure 7 shows a fused chart plot from BlueFuse Multi and is an example of a result from a good experiment. In normal regions, the spots are tight to the log 2 ratio 0 axis. There is clear demarcation of Figure 8: Composite Image for Good Focus Array regions of imbalance. For this type of result, there is no need to rely on QC metrics. Figure 8 is the scan image for this CytoChip Focus Constitutional array. It shows a clean image, and even dark background with no wash-related artefacts or other flaws. Table 7 shows the QC metrics for the good Focus array. Table 6: Basic QC Metrics SD Autosome/Robust Good 0.05 0.11 Too low < 0.05 Too high > 0.17 % Included Clones Good > 95% Mean Spot Amplitude Good 700 3,000 (signal)* Too low < 500 Too high > 3,000 SBR Good 3 12 Too low < 3 Too high > 15 DLR Raw/Fused Good < 0.2 OK < 0.22 Fail > 0.24 *Scanner-dependent Table 7: QC Metrics for Good Focus Array Parameter Value Comment SD Autosome/Robust 0.06/0.06 Very good robust SD score; below 0.1 is a good result % Included Clones 99.29 Near-perfect clone inclusion, less than 1% excluded Mean Spot Amplitude Ch1/Ch2 1453.22/2713.96 Good signal SBR Ch1/Ch2 5.91/6.05 SBRs well within the acceptable range DLR Raw/Fused 0.05/0.04 Excellent fused DLR value; any value below 0.15 is a very good result

Figure 9: Fused Chart for Bad Focus Array A Bad Quality CytoChip Focus Array Result Figure 9 shows a fused chart plot from BlueFuse Multi and is an example of a result from a bad or failed experiment. In general, the array results look very noisy; the resulting called imbalances cannot be trusted. Figure 10: Composite Image for Bad Focus Array The scan image of this CytoChip Focus Constitutional array (Figure 10) shows that the slide has not been washed effectively. There is lots of background and wash artefacts. It is possible that the array dried out during hybridization. There are also speckles on the array that indicate the labeled DNA pellet was not resuspended properly in hybridization buffer. Also, the image looks quite red. Table 8 shows the QC metrics for the bad Focus array. Table 8: QC Metrics for Bad Focus Array Parameter Value Comment SD Autosome/Robust 0.35/0.32 Very poor robust SD score; any value above 0.17 is a bad result and almost impossible to call accurately % Included Clones 48.46 Over half the clones have been excluded, leading to uninterpretable results Mean Spot Amplitude Ch1/Ch2 39.51/1152.80 Signal far too low in Ch1 (Cy3) and may be one of the causes of the noise SBR Ch1/Ch2 0.18/4.16 SBR far too low in Ch1 (Cy3) due to low signal DLR Raw/Fused 0.55/0.49 DLR far too high

Figure 11: Fused Charts for Good SNP Data SNP Array QC Guidance Figure 12: Histogram for Good SNP Data For CytoChip Oligo SNP and CytoChip Cancer SNP, use the oligo QC metrics detailed in this document to assess the copy-number aspect of each experiment. Use this section to assess the quality of the SNP probe performance and the resulting confidence in the UPD/LOH regions called. In addition to the general oligo QC metrics and standard BlueFuse Multi views, examine the Fused SNP Copy Number Chart. Additional QC metrics and the SNP histogram provide essential QC information. Good Quality SNP Data Both the fused chart and fused SNP copy-number chart (Figure 11) show clear array results. There is good separation of SNPs in the chart, shown by the clearly separated SNP data points at copynumber values 0 (AA, uncut), 1 (AB, one allele cut), and 2 (BB, both alleles cut), with few spots plotted in between. The QC metrics in the report show a good percentage of highconfidence SNPs: % High-confidence SNPs: 90.88 SNPs on array: 9,911 Reference SNPs: 6,088 Finally, viewing the SNP Histogram (Figure 12) shows clearly defined peaks that represent 0, 1, and 2 copies of SNPs.

Figure 13: Fused Charts for Poor SNP Data Poor Quality SNP Data Figure 14: Histogram for Poor SNP Data Both the fused chart and fused SNP copy-number chart (Figure 13) show noisy/borderline array results. In the SNP chart, there is good separation of SNP data points between 0 and 1 copy number, but not between 1 and 2. The QC metrics in the report show a borderline percentage of highconfidence SNPs: % High-confidence SNPs: 66.14 SNPs on array: 59,593 Reference SNPs: 36,436 Finally, viewing the SNP Histogram (Figure 14) shows clearly separated peaks that represent 0 and 1, but not 1 and 2 copies of SNPs. This result may indicate incomplete digestion of the DNA.

Figure 15: Fused Charts for Failed SNP Data Failed Quality SNP Data Figure 16: Histogram for Failed SNP Data Both the fused chart and fused SNP copy-number chart (Figure 15) show noisy array results. In the SNP chart, there is no clear separation at 0, 1, or 2. The QC metrics in the report show a low percentage of highconfidence SNPs: % High-confidence SNPs: 22.67 SNPs on array: 59,593 Reference SNPs: 36,436 Viewing the SNP Histogram (Figure 16) shows no separation of the peaks and BlueFuse Multi has failed to fit the peaks. This result indicates poor or failed digestion of the genomic DNA prior to labeling and hybridization.

Illumina 1.800.809.4566 toll-free (U.S.) +1.858.202.4566 tel techsupport@illumina.com www.illumina.com FOR RESEARCH USE ONLY 2014 Illumina, Inc. All rights reserved. Illumina, BlueFuse, CytoChip, the pumpkin orange color, and the Genetic Energy streaming bases design are trademarks or registered trademarks of Illumina, Inc. All other brands and names contained herein are the property of their respective owners. Pub. No. 1570-2014-038 Current as of 09 December 2014