Use of Spike-ins for Sample Tracking in Agilent Array CGH

Similar documents
High-Resolution Oligonucleotide- Based acgh Analysis of Single Cells in Under 24 Hours

Single Cell Copy Number Screening Using the GenetiSure Pre-Screen Kit

SEE WHAT S BEYOND. HaloPlex NGS Target Enrichment SureFISH Probes SurePrint CGH+SNP Microarrays

The Agilent Total RNA Isolation Kit

Agilent RNA Spike-In Kit

Labeling Protocol for mytags Immortal Libraries

Gene Expression Profiling of Prokaryotic Samples using Low Input Quick Amp WT Kit

Implementation of Automated Sample Quality Control in Whole Exome Sequencing

SURESELECTXT LOW INPUT TARGET ENRICHMENT

Hybridization capture of DNA libraries using xgen Lockdown Probes and Reagents

Complete Success Begins with Sample Quality Control. Agilent 4150 and 4200 TapeStation Systems

SureSelect XT HS. Target Enrichment

Evaluating the Agilent 4200 TapeStation System for High Throughput Sequencing Quality Control

Detection of Rare Variants in Degraded FFPE Samples Using the HaloPlex Target Enrichment System

Agilent SurePrint G3 Human Catalog CGH Microarrays

Get to Know Your DNA. Every Single Fragment.

Cancer Genetics Solutions

HyperCap, an automatable workflow on the Agilent Bravo B

Biomek Automated Genomic Sample Prep Accelerates Research

Automation of xgen hybridization capture on the Sciclone G3 NGS Workstation

hpsc Genetic Analysis Kit

scgem Workflow Experimental Design Single cell DNA methylation primer design

High Resolution Microarray Scanner used to Compare CGH Labeling Methods

Absolute Human Telomere Length Quantification qpcr Assay Kit (AHTLQ) Catalog # reactions

HELINI Hepatitis B virus [HBV] Real-time PCR Kit (Genotype A to H)

Magnis NGS Prep System. Your lab s automated library preparation solution for next-generation sequencing

Automation of the IDT xgen Lockdown Panels on the Sciclone G3 NGS Workstation

Wu et al., Determination of genetic identity in therapeutic chimeric states. We used two approaches for identifying potentially suitable deletion loci

Protocol for submitting gdna for Illumina 16S rrna gene sequencing, V4 region

A Recommended Procedure for Real-Time Quantitative TaqMan PCR for Roundup Ready Canola RT73 Monsanto Biotechnology Regulatory Sciences

A Systematic Approach to Optimize Real-Time Quantitative RT-qPCR Experiments with the Agilent 2200 TapeStation System

HALOPLEX PCR TARGET ENRICHMENT & LIBRARY PREPARATION PROTOCOL. Version 1.0.3, April 2011 For research use only

TECHNICAL BULLETIN. SeqPlex DNA Amplification Kit for use with high throughput sequencing technologies. Catalog Number SEQX Storage Temperature 20 C

Absolute Mouse Telomere Length Quantification qpcr Assay Kit (AMTLQ) Catalog #M reactions

Executive Summary. clinical supply services

Performance of the Agilent D1000 and the Agilent High Sensitivity D1000 ScreenTape Assay for the Agilent 4200 TapeStation System

Comparison of ExoSAP-IT and ExoSAP-IT Express reagents to alternative PCR cleanup methods

GreenMasterMix (2X) b i o s c i e n c e. G E N A X X O N b i o s c i e n c e. High ROX (500nM)

QUANTITATIVE RT-PCR PROTOCOL (SYBR Green I) (Last Revised: April, 2007)

Analysis of DNA sequence copy number in archival formalin-fixed, paraffinembedded. Dione K. Bailey Anniek De Witte October 9, 2007

AmpF STR NGM PCR Amplification Kit - Overview

Event COT102 Cotton. Real-time, Event-specific Polymerase Chain Reaction Method

Roche Molecular Biochemicals Technical Note No. LC 12/2000

CytoChip Oligo Summary Protocol

Copy Number Analysis of Archival FFPE Tumor Samples by Oligo Array CGH

Detecting copy-neutral LOH in cancer using Agilent SurePrint G3 Cancer CGH+SNP Microarrays

Complete protocol in 110 minutes Enzymatic fragmentation without sonication One-step fragmentation/tagging to save time

Reference Gene Panel. Human - Short Assays Version 1 - April 2016 Probe protocol. User manual. Order #:

NUCLEIC ACID PURIFICATION KITS FAST SIMPLE QUALITY CONVENIENT REPRODUCIBLE

Impact of gdna Integrity on the Outcome of DNA Methylation Studies

High Performance Labeling Kits for cdna and Oligo Microarrays

Functional Genomics Research Stream. Research Meeting: June 19, 2012 SYBR Green qpcr, Research Update

Fully Automated Library Quantification for Illumina Sequencing on the NGS STAR

Agilent SureCycler 8800

Archaea V3-5 16S rrna Amplicon Sequencing

Relative Mouse Telomere Length Quantification qpcr Assay Kit (RMTLQ) Catalog #M reactions

Updated August well High-multiplexing Tagmentation and Amplification Robyn Tanny August Company Kit Catalog Number.

Cold Fusion Cloning Kit. Cat. #s MC100A-1, MC101A-1. User Manual

High specificity and sensitivity for fungal DNA allows reliable quantification in a background of nonfungal. Contents

HELINI White spot Syndrome virus [WSSV] Real-time PCR Kit

Forensic News. FAS Corner April The Maximizing Data Quality Series Part 2: DNA Quantitation

QIAGEN Supplementary Protocol

Cat. # RR391A. For Research Use. Probe qpcr Mix. Product Manual. v201610da

Precipio, Inc. Instructions for Use. PIK3CA Exon 9 Mutation Analysis using ICE COLD-PCR for Detection with High Resolution Melting

Efficient Method for Isolation of High Quality Concentrated Cellular RNA with Extremely Low Levels of Genomic DNA Contamination Application

PAXgene Blood DNA PAXgene Blood DNA Tube (IVD) for clinical use PAXgene Blood DNA System (RUO) for research use. Explore more at

HaloPlex HS. Get to Know Your DNA. Every Single Fragment. Kevin Poon, Ph.D.

Amplicon Sequencing Template Preparation

PrimePCR Assay Validation Report

Procedure & Checklist - Preparing Asymmetric SMRTbell Templates

ThruPLEX -FD Prep Kit Instruction Manual. Single Tube Library Preparation for Illumina NGS Platforms

PrimePCR Assay Validation Report

Absolute Human Telomere Length and Mitochondrial DNA Copy Number Dual Quantification qpcr Assay Kit (AHDQ) Catalog # reactions

Relative Rat Telomere Length Quantification qpcr Assay Kit (RRTLQ) Catalog #R reactions

Guidelines for Developing Robust and Reliable PCR Assays

Application Protocol: DNA quantification by using Absolute Quantitative PCR (qpcr)

10 RXN 50 RXN 500 RXN

PrimePCR Assay Validation Report

Internal controls for real-time PCR-based pathogen identification

Amplicon size (bp) E ± SD

Amplicon Library Preparation Method Manual. GS FLX Titanium Series October 2009

Surely Better Target Enrichment from Sample to Sequencer

2014 Hands-On Introductory Workshop on Array CGH

Getting the Most from Your DNA Analysis from Purification to Downstream Analysis. Eric B. Vincent, Ph.D. February 2013

General Product Insert. End-Point PCR/RT-PCR Kit Product# EPxxxxx

Purpose: To sequence 16S microbial DNA isolated from animal fecal matter and perform compositional analysis of bacterial communities.

User Bulletin. Veriti 96-Well Thermal Cycler AmpFlSTR Kit Validation. Overview

Premix Ex Taq (Probe qpcr)

PrimePCR Assay Validation Report

A comparative performance evaluation of illustra Ready-To-Go GenomiPhi V3 and illustra GenomiPhi V2 DNA amplification kits

Accurately and reproducibly quantify as little as 20 fg of bacterial DNA using real-time PCR.

Complementary Technologies for Precision Genetic Analysis

QPCR Designed for You. Superior Instruments, Reagents, and Support.

Sanger Sequencing Portfolio

Best Quantification Practices with the Agilent 5200 Fragment Analyzer System

THUNDERBIRD SYBR qpcr Mix

T7-Based RNA Amplification Protocol (in progress)

Brian Williams, Richele Gwirtz Version 1.0, December 29, 2003

Transcription:

Use of Spike-ins for Sample Tracking in Agilent Array CGH Application Note Authors Srirangan Sampath, Ph.D., FACMG and Heather Klein PreventionGenetics Iman Kishawi, Ph.D. Agilent Technologies, Inc. Abstract This Application Note describes a method that allows sample tracking through introduction of DNA spike-ins to human DNA samples before wet lab handling. Adding the spike-ins to samples ensures the identity of the sample throughout the process, and reduces sample mix-up in high-throughput DNA microarrays. In principle, this method can be applied to many other molecular genetic methodologies including NGS. DNA spike-ins contain one or more PCR-generated amplicons that are approximately 400 bp in length. The amplicons are designed to span sequences of unique and nondisease associated regions in the human genome. Agilent CytoGenomics software identifi es sample identity by detecting the signal intensity of spike-in probes in individual arrays.

Introduction Many laboratories process multiple samples on any given day, and human error can contribute to mistakes in the processing and reporting of results for samples. Sample mix-up errors constitute the most serious pre-analytic mistakes 1-. In addition, the chances of a specimen mix-up during the analysis process increases with the number of experimental steps (multistep or multiday experiments). DNA microarray applications, such as array comparative genomic hybridization (acgh), are a multistep process. The wet lab portion includes restriction-digestion (for arrays containing SNP probes), fluorescent-labeling of genomic DNA (gdna), cleanup and quantification of labeled gdna, and hybridization of labeled gdna to microarray slides. Sample mix-up at any of these stages can lead to erroneous interpretation of results. Human error can also occur with data analysis if sample attributes are associated with the wrong array. A spike-in is a calibrated reference standard made up of one or more DNA amplicons of known concentn and identity. Amplicons are added in large excess to each DNA sample being tested. Fluorescent labeling of the spiked amplicon generates a bright fluorescent signal after hybridizing to probes that correspond to the genomic loci of spike-ins. Agilent CytoGenomics software v4 or higher reports the identity of the spike-in in each sample. This reporting is done by calculating the log of the sample signal to the reference signal at any given probe. The software identifies the identity of the spike-in based on a default log equal to, or greater than one. A default threshold of one corresponds to two times more in a spiked DNA sample versus a reference DNA sample where no spike-ins were added. The spike-in identity is entered into the CytoGenomics software by the user, and the software reports if the spike-in identity does or does not match the user-inputted sample attribute. Methods Reagents and supplies needed Spike-in identity The 50 unique amplicons (Table 1) are PCR-generated using the forward and reverse primers shown in the table. The probes corresponding to each amplicon (Table 1) are printed on the CGH arrays. The software recognizes and identifies them by their probe names. Individual probes that hybridize to the spike-ins are printed 5 times on the array. The data presented in Figure 1 show the average log of spike-ins from replicated probes. Spike-in probes are part of the HD Agilent probe database. Spike-in controls can be added to any array by selecting the spike-in probe group in SureDesign, and replicating it 5 times depending on available space on the array. Material Vendor Part number KAPA HS MasterMix Kapa Biosystems KK1606 Oligonucleotide primer mixes IDT Custom (5 nmole DNA oligo) Nanodrop Thermo Scientific Human Reference Genomic DNA Agilent European Male 5190-470 1X TE buffer IDT N/A Eppendorf Mastercycler ProS Eppendorf 9500000 Agilent TapeStation 00 Agilent Technologies G964AA Agilent D1000 ScreenTape Agilent Technologies 5067-558

Table 1. Forward and reverse primers for each of the 50 DNA amplicons that are used as spike-ins. The expected sizes for each amplicon, and the Agilent probe names are provided. Band Probe_name FP_seq RP_seq Amplicon size 1q.1 A_16_P5405 CATTCAATGTTACCTTCCAA AGCCACTCATTTTTAACACC 487 A_16_P154054 AATGTACAAACTGGAAACC TTTCTCTCTCACAGGAATAA 4 A_16_P15405605 TTCTCCTTAAGCTGGTTTAA GGAAAACTGGTTCAGTAGAC 500 q1. A_18_P15801 TTATCTCTTGCCTTCTTTTC AGACAATAGACTGCAAATCC 50 A_16_P1595047 CCTACCACCTTGTGAAGA GCACTGTACACTGAAAATTG 45 q1. A_16_P1675641 CCAAGATGCTTAGAAAAATG GGCATCAAATCTCTGGTT 444 A_16_P167586 AGCTTCAGGTAAGTTTTTGG ACACAGAAGCATCTGTTCTT 451 A_18_P145811 TTACTCCCAAAGTCTATTGC TGTAACCAGTGAGATAAGGC 475 4q1. A_18_P14767506 GGTTGTGGGTTTACATTCAA AAGGGTGACCTAGAGCACA 48 A_18_P14771686 GAATCTGAAGACAGTCCAGT TTTAACAACCCTTCACTTC 516 A_16_P6800196 GAACTCAGTGCATGAACAAG TCTGAGGACAGAGCTGTATC 5q14. A_18_P156957 GCTGTTCTCCAAACACTAAT TTACATCGAGCTAACCTGTC 41 A_18_P1567 TTCATTCAAGAGCACAGAAA ACAAACTAGGGCAAAACAG 49 A_16_P75066 GTCATTTAAATTTTCTTGGC TAATCTAGGTGGTGGTTACC 487 6q14. A_18_P15761 CAGAAGCTCTTTGGTTTAAT GCTGCTTTACAAAATACTGC 41 A_18_P1570616 CCTTGGGTTCAATGAAACCT GGCACAGAAGAAAGGGACAG 95 A_14_P101769 GAGCCCACCCTATAAACATT CAGATCTTGGCAAGACTCAG 44 8q. A_16_P1844814 CCTTTAGGAATTACCCAGTA TTAGGAAGCAGTAATGAGG 465 A_16_P1844955 GTTGACATGCTGGGAAATC TGCCCATACAGCTGTTCTCT 64 A_16_P0199664 AGTTGTTTCAGAAGGCCTCT AGTGACAGGAATGAGGCTG 66 9q1. A_18_P16960445 GATGTGCACACATGGACC AGACATCTGCTCAGGGAGTT 47 A_16_P015618 CAATCTAAAATCAAGGAAAT ATACAATGTGACAGATGCTA 55 A_16_P1870465 TTGTTTGCCTAGAAAACAGG TCACTTTTTCAATCCCACTT 45 10q5.1 A_16_P1905809 GGATATTTCTGTCCATTGAT CACAGTGGTATTAAACAAGG 47 A_18_P10889615 GGCATTGTCTCTTCTTGCTA TAACAGAACCCAAACTCGTG 47 A_16_P19058540 TCATAATTACCATCATAGCA AAATCAGCGATATTAATAGC 416 11q.1 A_16_P1940901 GGCAAAAGAGTGAGACTC CCTTCCCTTAATAGCCTAAG 468 A_16_P1941057 AGATAGCCTGCTAGGATTTG GTGATGGATACCCCATTTAC 41 A_18_P1104159 CCATTACTTCATTTCTGCCT TTGGTCACAGAACATTGAAT 411 14q1. A_16_P005587 TTATTTCCTTGCTATAGTGG GAAACAACTCAGAAAGCTAG 64 A_16_P4017845 TGGAACAAGAACTTGTGAGC GCCAAAATGCATAATTTCA 471 A_18_P11991646 TCTCTGTGCATCACCCTATT GGCAAAGTTCAAGTCTCTGG 404 15q1.1 A_18_P105990 GATTCCCGTCTGTGTGGTCT TCTTATTTTTGACTCTGAGATTGTGG 51 A_18_P10640 TGGTGCCTAAGAAAGACATTCA TTCCTTCCCAAAGTGGCTTA 87 A_16_P4088779 TCCCAAAGAATTGTCAATGG TTTGGTTTCCCATCAAGGTC 00 16q1 A_18_P1484550 AACATTGAGGATGGAGCAG CAAAAATGGGGTCTTATCTG 41 A_18_P14805 TGTCAATGTAAAAATGCAAG ACAGCAAAGGTCTTCTATTC 46 A_16_P049408 GTAACAGAGGGCTAGTCTTG AGTTTGAGTGGTGACTGATT 445 18q1. A_18_P18458 GCCTTTATTGATTCCTGCTT TGGGGCTCACCTAAAAGTT 5 A_18_P184410 GGTGTCTCATCAGCAGGG GATTCCTATTTTGCTCAGGC 459 A_18_P184688 CCTTAGGTCCACATGACATG GTGCAGTCAAGCTTGAAGTC 45 19q1 A_16_P099167 AAGTTTCCTTTTGATTAGGT ACTAGGAGAAAATATTTGCC 4 A_16_P044188 AACAAACAAACAAAAAACGC CACAGCTTCTTCATCTGACC 41 A_18_P10145 GAAGAGGTCACATATGTGAG ATTACCAAACAAGATTGTTG 97 0q1 A_18_P1795069 GCATTCCAGCTTAAAGGA ACATTTAAAAGATGGTGCAA 408 A_18_P179864 GCATAGCCTTGAAAAAATCC TGTGGAATGGATAAATTGCC 48 A_16_P119167 TTCATGGACTCAATAGTCCC TGAGCTTGAAGAGGAAAGAG 4 1q1. A_16_P414187 CCTGTCAGCAGGTAGAAGGA TCTCATGAAGTAGGCAGAGGAA 56 A_16_P116858 TTTTGCAAGTAAAGCAATTTTAGG GCTCTGCTGGACACCTTGAT 99 A_18_P1889716 CAAATTGATGAAATCCTTATTTGC AAACCATCAAATACAGGAAAAGAAA 86

Figure 1. Spike-in M is assigned three probes. Three different amplicons can be spiked into a DNA sample. CytoGenomics software can identify the identity of the three amplicons as Spike-in M. 4

Spike-in genen DNA spike-ins are amplicons generated from PCR of human genomic sequences using a reference human genome (for example, Agilent European Male). Tables and provide the conditions of the PCR reactions. The amplification efficiency differs for each of the 50 amplicons, and may need some level of PCR optimization in the hands of users. The PCR efficiency and identity of the amplicons were checked by running -µl aliquots of all amplicons on an Agilent TapeStation 00 using Agilent D1000 ScreenTape. The quality and identity of each amplicon were confirmed by the presence of a correct sized single band. The PCR products were purified using the Qiagen QIAquick PCR Purification Kit (catalog # 8106) using the manufacturer s protocol. Table. Setup of PCR reactions. Reagents Volume/rxn (µl) KAPA HS MasterMix 0.5 Amplicon F/R Primers, 5 µm.75 Template, 5 ng/µl 6 Total reaction volume 0 Table. PCR Conditions are listed below (7 cycles). Conditions Time 95 C minutes 95 C 0 seconds 55 C 0 seconds 7 C 1 minute 7 C 5 minutes A B C D E F G H Depending on the amount of spike-in needed, each of the 50 amplicons may be amplified in replicates of four or five PCR reactions. Replicates from each amplicon were pooled before Qiagen PCR Purification. Pooled amplicons can be eluted with 0 µl of elution buffer. The purified amplicons were checked again on the TapeStation 00, using the D1000 ScreenTape to ensure proper cleanup and size (Figure ). Figure. To ensure unique amplification of the target region, PCR amplicons are evaluated on the Agilent TapeStation 00 using the Agilent D1000 ScreenTape. Each spike-in is given a unique identifier (in this case, alphabets spike-in A-H for eight amplicons). 5

The purified amplicons were quantified on a Nanodrop. When pools of 4 5 replicate amplicons were eluted in 0 µl of elution buffer, the concentn was usually > 100 ng/µl; hence, the spike-in stock solution was normalized to 40 ng/µl in 1X TE buffer (ph 8). A spike-in working solution was made from the stock solution by diluting it ~1:,000 with 1X TE buffer (ph 8). At this dilution, the spike-in probe fluorescent signal was significantly above background noise in an acgh experiment. The spike-in working solution dilution can be determined empirically by performing microarray experiments at different dilutions. Aliquots of the spike-in stock (40 ng/µl) can be stored at 0 C indefinitely. Working solution (40 ng/µl diluted at ~1:,000 with 1X TE buffer) can be stored at 4 C for a week without appreciable degradation. For optimum performance, avoid repeated freeze-thaw cycles. During the analysis setup in the CytoGenomics software, the user needs to assign a spike-in identifier to an array (or sample). The spike-in can be assigned from the drop down menu, or as part of the sample attribute file with a column header Spike-In. After data analysis, the CytoGenomics software reports the observed and expected value of each spike-in. Spike-in identity in each sample is reported as match or no match based on the user inputted spike-in identifier in the software (Figure ). Additionally, the software reports the log corresponding to each spike-in. Note that based on this procedure, 50 spike-ins can be generated using one spike-in amplicon per sample. This number can go much higher if combinations of spike-in amplicons are added to each sample. This experiment describes using one amplicon per sample. For each experiment, two microliters of DNA spike-in working solution are added to the test sample gdna before labeling with Cy5 (red signal). A different DNA spike-in is used for each test sample. Spike-ins are not added to the reference DNA (labeled with Cy). Consequently, probes corresponding to spike-ins have a high red signal and low green signal. Using spike-ins in an acgh experiment Spike-in identifiers can be associated with the amplicon probe name in the CytoGenomics software. To identify a sample, one or more amplicons can be added to the sample before a wet lab procedure. An identifier can represent one probe name if one PCR amplicon is spiked into the sample. An identifier can represent more than one probe if multiple PCR amplicons are spiked into the sample. For example, Spike-in A represents a single probe, and spike-in M represents more than one probe (Figure 1). Two microliters of spike-in control is added to DNA sample before processing. Figure. Agilent CytoGenomics software identifies the identity of the spike-in controls. 6

Spike-in analysis gprocessedsignal and rprocessedsignal values from each acgh experiment were used for plotting. Four plots are shown in Figure 4, each with a different spike-in added to the test DNA sample. For each sample, the log of rprocessedsignal/gprocessedsignal for all 50 spikes (in trilpicate) are plotted. Each of the 50 spike-ins is represented by a different color. There is very little variation between triplicate values for all spike-ins. As can be seen in Figure 4, the log values differ for individual spike-ins despite the fact that the amplicons are diluted to the same concentn of 40 ng/µl. Log 4 1 0-1 A 0 50 100 150 Index Log 6 B 5 4 1 0 0 50 100 150 Index C 6 D 5 Log 1 Log 4 0 1 0 0 50 100 150 Index 0 50 100 150 Index Figure 4. Log is plotted for all 50 amplicons. Greater than 1 log value represents the added DNA spike. 7

Conclusion In conclusion, 50 unique amplicons can be used as spike-ins for sample tracking. The spike-ins can be one or more amplicons that are introduced into DNA samples before wet lab processing. The log of spike-ins should be equal to or higher than 1, under the dilution conditions described. The data are highly reproducible, and give the user confidence that there was no mix-up of samples during wet lab or analysis processing. References 1. Douglas, J. A.; Boehnke, M.; Lange, K. A. Multipoint method for detecting genotyping errors and mutations in sibling-pair linkage data. Am. J. Hum. Genet. 000, 66(4), 187-197. doi: 10.1086/0861. [PMID:1079757]. Jun, G.; et al. Detecting and estimating contamination of human DNA samples in sequencing and array-based genotype data. Am. J. Hum. Genet. 01, 91(5), 89-848. doi: 10.1016/j. ajhg.01.09.004. [PMID: 106]. Burrell, A.; Foy, C.; Burns, M. Applicability of Three Alternative Instruments for Food Authenticity Analysis:GMO Identification. Biotechnology Research International 011, vol. 011, Article ID 88, 8 pages, doi:10.4061/011/88 For more information and to learn more www.genomics.agilent.com Find an Agilent customer center in your country www.agilent.com/chem/contactus U.S. and Canada 800.7.9770 agilent_inquiries@agilent.com Asia Pacifi c adinquiry_aplsca@agilent.com Europe info_agilent@agilent.com www.agilent.com/genomics For Research Use Only. Not for use in diagnostic procedures. This information is subject to change without notice. PR7000-011 Agilent Technologies, Inc., 016 Published in the USA, March 15, 016 5991-6661EN