Integrated Transcriptomics and Metabolomics Study of Retinoblastoma Using Agilent Microarrays and LC/MS/GC/MS Platforms

Similar documents
Expand Your Research with Metabolomics and Proteomics. Christine Miller Omics Market Manager ASMS 2017

Instrumental Solutions for Metabolomics

Benefits of 2D-LC/MS/MS in Pharmaceutical Bioanalytics

AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE

Clone Selection Using the Agilent 1290 Infinity Online 2D-LC/MS Solution

Monitoring of Mammalian Cell Culture Media with HILIC LC/MS

Analysis of Monoclonal Antibody (mab) Using Agilent 1290 Infinity LC System Coupled to Agilent 6530 Accurate-Mass Quadrupole Time-of-Flight (Q-TOF)

Metabolic Changes in Lung Tissue of Tuberculosis-Infected Mice Using GC/Q-TOF with Low Energy EI

The Agilent Metabolomics Dynamic MRM Database and Method

Separation of Native Monoclonal Antibodies and Identification of Charge Variants:

Agilent Solutions for Metabolomics YOUR PATH TO SUCCESS

Robustness of the Agilent Ultivo Triple Quadrupole LC/MS for Routine Analysis in Food Safety

Metabolomics: From Samples to Softwares

Metabolomics of Carbon-Fixing Mutants of Cyanobacteria by GC/Q-TOF

Fast and Efficient Peptide Mapping of a Monoclonal Antibody (mab): UHPLC Performance with Superficially Porous Particles

PLRP-S Polymeric Reversed-Phase Column for LC/MS Separation of mabs and ADC

Agilent Solutions for Metabolomics EMERGING INSIGHTS

Gene Expression Profiling of Prokaryotic Samples using Low Input Quick Amp WT Kit

Application of Agilent AdvanceBio Desalting-RP Cartridges for LC/MS Analysis of mabs A One- and Two-dimensional LC/MS Study

Agilent Software Tools for Mass Spectrometry Based Multi-omics Studies

Analysis of Nucleosides Using an Agilent Infinity II High Speed UHPLC with the 6130 Single Quadrupole Mass Selective Detector

Precise Characterization of Intact Monoclonal Antibodies by the Agilent 6545XT AdvanceBio LC/Q-TOF

Pharmaceutical LC/MS Solutions from Agilent Technologies

N-Glycan Profiling Analysis of a Monoclonal Antibody Using UHPLC/FLD/Q-TOF

Heart-cut 2D-LC/MS approach for pharmaceutical impurity identification using an Agilent 6540 Q-TOF LC/MS System

A Highly Accurate Mass Profiling Approach to Protein Biomarker Discovery Using HPLC-Chip/ MS-Enabled ESI-TOF MS

MassHunter Profinder: Batch Processing Software for High Quality Feature Extraction of Mass Spectrometry Data

AGILENT SOLUTIONS FOR METABOLOMICS

Application Note. Authors. Abstract. Ravindra Gudihal Agilent Technologies India Pvt. Ltd. Bangalore India

Application Note. Abstract. Authors. Introduction. Environmental

EPA Method 540: Selected Organic Contaminants Using Agilent Plexa Cartridges and the Agilent 6460 Triple Quadrupole LC/MS

Quantification of Host Cell Protein Impurities Using the Agilent 1290 Infinity II LC Coupled with the 6495B Triple Quadrupole LC/MS System

Introduction. Benefits of the SWATH Acquisition Workflow for Metabolomics Applications

Reproducible LC/MS Peptide Separation Using Agilent AdvanceBio Peptide Plus Columns

Peptide Mapping of Glycoprotein Erythropoietin by HILIC LC/MS and RP-LC/MS

Determination of Table 3 Compounds by LC/MS/MS Chemours Fluoroproducts Analytical Method Revision date: 4/26/2018

Technical Overview. Author. Abstract. Edgar Naegele Agilent Technologies, Inc. Waldbronn, Germany

EVALUATION OF D DIFFERENT EXTRACTION METHODS OF PESTICIDE RESIDUES IN SPICES BY

Fast and High-Resolution Reversed-Phase Separation of Synthetic Oligonucleotides

Screening for Emerging Chemical Contaminants in Water Using LC/Q-TOF and Mass Profiler Professional Software

Cell Clone Selection Using the Agilent Bio-Monolith Protein A Column and LC/MS

ENVIRONMENTAL ANALYSIS. Solutions for Your Analytical Business Markets and Applications Programs. Solution Note

Minimizing Sample Carryover Using the Multiwash Function of the Agilent 1290 Infinity II Multisampler

Research Powered by Agilent s GeneSpring

Confident Pesticides Analysis with the Agilent LC/Triple Quadrupole and TOF/QTOF Solutions

Experimental Design and MS Workflows for Omics Applications. David A. Weil, Ph.D. Senior Application Scientist Schaumburg, IL

High-Throughput LC/MS Purification of Pharmaceutical Impurities

Profiling Glycosylation of Monoclonal Antibodies at Three Levels Using the Agilent 6545XT AdvanceBio LC/Q TOF

The Use of High Resolution Accurate Mass GC/Q-TOF and Chemometrics in the Identification of Environmental Pollutants in Wastewater Effluents

Characterize Fab and Fc Fragments by Cation-Exchange Chromatography

Fiehn GC/MS Metabolomics RTL Library IDENTIFY MORE METABOLITES

Gene Regulation Solutions. Microarrays and Next-Generation Sequencing

Automated QbD-Based Method Development and Validation of Oxidative Degraded Atorvastatin

Clearly better LC/MS solutions

SGN-6106 Computational Systems Biology I

Characterize mab Charge Variants by Cation-Exchange Chromatography

Application Note. Abstract. Authors. Environmental

Application Note. Author. Abstract. Biotherapeutics and Biologics. Sonja Schneider Agilent Technologies, Inc. Waldbronn, Germany

Cell Culture Optimization Using an Agilent Bio-Monolith Protein A Column and LC/MS

Agilent Metabolomics Laboratory: The breadth of tools you need for successful metabolomics research

Application Note. Authors. Abstract. Ravindra Gudihal Agilent Technologies India Pvt. Ltd. Bangalore, India

Peptide Mapping: A Quality by Design (QbD) Approach

Disulfide Linkage Analysis of IgG1 using an Agilent 1260 Infinity Bio inert LC System with an Agilent ZORBAX RRHD Diphenyl sub 2 µm Column

Agilent s NEW MassHunter Profinder

Agilent 6000 Series LC/MS Solutions. Confidence in Quantitative and Qualitative Analysis

Metabolomics: Techniques and Applications ABRF

Developing Purification Strategies for the Agilent 1260 Infinity II Preparative LC/MSD System

FROM DISCOVERY TO INSIGHT

A Comprehensive Workflow to Optimize and Execute Protein Aggregate Studies

Automated Scouting of Stationary and Mobile Phases Using the Agilent 1290 Infinity II Method Development Solution

Next-Generation Sequencing Gene Expression Analysis Using Agilent GeneSpring GX

Reducing Cycle Time for Charge Variant Analysis of Monoclonal Antibodies

Gene Expression Profiling and Validation Using Agilent SurePrint G3 Gene Expression Arrays

EPA Method 543: Selected Organic Contaminants by Online SPE LC/MS/MS Using the Agilent Flexible Cube

Functional Genomic Study of Exogenous n-butanol Stress in Escherichia coli. Supplementary Data

Automated alternating column regeneration on the Agilent 1290 Infinity LC

Puzzled About LCMS? Sample. Sensitivity in Mass Spec Analysis. Prep. Adapting LC-UV. Optimizing and Maintaining Your Mass Spec LCMS

Application Note. Authors. Abstract. Introduction. Pharmaceutical

Application Note # ET-20 BioPharma Compass: A fully Automated Solution for Characterization and QC of Intact and Digested Proteins

Quantification of genotoxic "Impurity D" in Atenolol by LC/ESI/MS/MS with Agilent 1200 Series RRLC and 6410B Triple Quadrupole LC/MS

Key Words Q Exactive Focus, SIEVE Software, Biomarker, Discovery, Metabolomics

Minimize Method Development Time for Pesticide Screening Using LC/MS Application Kit

Put the PRO in Protein Characterization

Xevo G2-S QTof and TransOmics: A Multi-Omics System for the Differential LC/MS Analysis of Proteins, Metabolites, and Lipids

Monitoring for Microcystins in Raw Water Supply Reservoirs Using the Agilent 6410 Triple Quadrupole LC/MS

Monoclonal Antibody Charge Variants Identification by Fully Automated Capillary Isoelectric Focusing Mass Spectrometry

Certified Reference Material - Certificate of Analysis

Title: Non-absorbable apple procyanidins prevent obesity associated with gut microbial

Rapid Peptide Catabolite ID using the SCIEX Routine Biotransform Solution

Agilent GeneSpring GX 10: Beyond. Pam Tangvoranuntakul Product Manager, GeneSpring October 1, 2008

Monitoring for Microcystins in Raw Water Supply Reservoirs Using the Agilent 6410 Triple Quadrupole LC/MS

Isotopic Resolution of Chromatographically Separated IdeS Subunits Using the X500B QTOF System

A Comprehensive Approach for Monoclonal Antibody N-linked Glycan Analysis from Sample Preparation to Data Analysis

Charge Heterogeneity Analysis of Rituximab Innovator and Biosimilar mabs

Rapid Soft Spot Analysis using the SCIEX Routine Biotransform Solution

The Agilent 6545 Q-TOF LC/MS System. The Answers You Need:

The Agilent 6545 Q-TOF LC/MS System. The Answers You Need:

Analysis of Monoclonal Antibody N-glycans by Fluorescence Detection and Robust Mass Selective Detection Using the Agilent LC/MSD XT

Jet Stream Proteomics for Sensitive and Robust Standard Flow LC/MS

Certified Reference Material - Certificate of Analysis

Transcription:

Integrated Transcriptomics and Metabolomics Study of Retinoblastoma Using Agilent Microarrays and LC/MS/GC/MS Platforms Application Note Authors Nilanjan Guha, Deepak S.A., Syed Lateef, Seetaraman Gundimeda, Arunkumar Padmanaban, and Upendra Simha Agilent Technologies, Inc. Ashwin Mallipatna, Vishnu Suresh Babu, and Arkasubhra Ghosh Narayana Nethralaya Abstract This Application Note illustrates a multi-omics approach combining transcriptomics and metabolomics to study molecular events in the progression of retinoblastoma (Rb). On a set of tissue samples affected with Rb, we performed mrna and mirna gene expression using microarrays followed by pathway analysis to identify gene enrichment that would enable functional characterization of Rb. Transcriptomics data were collected using Agilent SurePrint G3 Human microarrays and a SureScan microarray scanner. Metabolomics data were obtained from aqueous humor, vitreous humor, and tear samples of Rb using Agilent 7200 GC/Q-TOF and Agilent 6550 ifunnel Q-TOF LC/MS systems. After feature extraction and processing using Agilent MassHunter Software, differential and multi-omics analyses were performed using Agilent GeneSpring software suite. This study demonstrates how a comprehensive understanding of a disease mechanism can be obtained using Agilent mass spectrometry (GC/MS and LC/MS) and microarray technologies along with Agilent Mass Profiler Professional and GeneSpring GX software.

Introduction Retinoblastoma occurs when both copies of the RB1 gene in a child s retina are inactivated. While much literature exists regarding pathogenesis and genetic changes in retinoblastoma, there is still a lack of understanding of disease mechanism. In this study, we acquired multi-omics data using a combination of an Agilent portfolio of instruments, software tools, and sample preparation kits to acquire, integrate, and analyze multi-omics data. The study samples were classified with a risk number based on the stage of disease progression. Total RNA extracted from nine fresh frozen Rb and two control tissue samples were analyzed using Agilent SurePrint G3 Gene Expression microarrays to define a gene expression pattern. The Rb sample expressions were compared to two control samples. Metabolite extraction was carried out from aqueous humor, vitreous humor, and tears of Rb and control samples. The extracted metabolites were subjected to analysis using LC/MS and GC/MS. Figure 1 shows the various Agilent products used in this multi-omic study. Data generated from transcriptomics (mrna and mirna) and metabolomics analyses were analyzed using Agilent GeneSpring 13.1 software. Experimental Gene expression and mirna microarray analysis Total RNA was extracted from Rb and control tissues using the Agilent Absolutely RNA mirna Kit (Cat# 400814). The quality of isolated RNA was determined on an Agilent 2200 TapeStation system (G2964AA) using an Agilent RNA ScreenTape assay (5067 5576). mrna labeling and microarray processing was performed as detailed in the One Color Microarray Based Gene Expression Analysis (version 6.9, p/n G4140-90040). mirna labeling was done using an Agilent mirna Complete Labeling and Hyb Kit (Cat# 5190-0456). Genomics Metabolomics Extraction kits Columns Electrophoresis platform LC/MS GC/MS QPCR Microarray GeneSpring Software Biological pathways Figure1. Agilent products used in the study. 2

The gene expression and mirna data were extracted using Agilent Feature Extraction Software (11.5.1.1) and analyzed using Agilent GeneSpring GX 13.1. In both mrna and mirna analyses, transcripts exhibiting P 0.05 and fold changes greater than or equal to two were considered to be differentially expressed. Figure 2 outlines the genomics workflow. Differential and pathway analysis of gene expression and mirna data Gene Expression microarray data were analyzed using the mrna and mirna workflows in GeneSpring GX 13.1. Signal intensities for each probe were normalized to 75 th percentile without baseline transformation for gene expression analysis. No normalization and baseline transformation was performed for mirna analysis. Differential gene and mirna expression analysis was performed using the Filter on Volcano Plot option in GeneSpring. The analysis was carried out using a Moderated T-test unpaired statistical method with the Benjamini Hochberg FDR method; p-values were computed asymptotically. The differential mirna list was used to identify the gene targets by using TargetScan, a mirna target identification database incorporated within GeneSpring. The validated targets of the differential mirnas were identified using mirwalk 1. RNA QC Control and Rb tissue Agilent 2100 Bioanalyzer and RNA 6000 Nano kit Agilent Absolutely RNA mirna Kit RNA Extraction Total RNA Labeling mirna Agilent Low Input Quick Amp Labeling Kit Hybridization Agilent SurePrint G3 Human GE 8x60 K V2 Agilent SureHyb Microarray Hybridization chamber and backing slides Agilent mirna Complete Labeling and Hyb Kit Hybridization Agilent SurePrint G3 Human v16 mirna 8x60K Agilent Microarray Wash Buffer kits Wash Scanning and feature extraction Agilent Surescan Microarray scanner Data analysis GE Data analysis Agilent GeneSpring 13.1 mirna Data analysis Figure 2. Schematic showing gene expression and mirna workflow. Agilent products used in the workflow are highlighted in blue boxes. 3

Pathway analysis was carried out using the Pathway Architect Module in GeneSpring 13.1. The differentially expressed gene entity list (p 0.05 and fold change 2.0) was selected for pathway analysis. The list of validated targets was used for mirna pathway analysis. Curated pathways from KEGG were used for pathway analysis. Metabolomics analysis using LC/MS and GC/MS The metabolites from each of the samples were extracted using a mixture of methanol and ethanol (1:1 v/v). A 500 µl aliquot of this solvent mixture was added to 25 µl of each aqueous humor/vitreous humor/tear sample with appropriate amounts of internal standards, glucose (1-13 C), tyrosine (1 13 C), cholesterol (3,4-13 C 2 ), and palmitic acid (1-13 C). The extract containing internal standards was vortexed for 20 seconds followed by vigorous shaking for 15 minutes at 4 C. The samples were then centrifuged at 4 C for 5 minutes at 10,000 rpm. Two 180 µl aliquots of this supernatant were placed into separate glass vials and dried. One vial was used for GC/MS analysis, and the other vial was used for LC/MS analysis. Figure 3 shows a summary of the workflow for monophasic extraction followed by analysis using LC/MS and GC/MS platforms. LC/MS sample preparation and data analysis To the dried sample, 60 µl of 0.5 ppm epicatechin in methanol was added, and vortexed for 10 seconds followed by sonication for 30 seconds. Another 60 µl of 0.2 % acetic acid in water was added to the sample, vortexed for 10 seconds, and sonicated for 30 seconds. The reconstituted samples were subjected to MS and data-dependent MS/MS acquisition using an Agilent 1290 Infinity LC System coupled to an Agilent 6550 Accurate mass QTOF LC-MS system with a dual Agilent Jet Stream source. The Q TOF was tuned for low mass and fragile molecule tune. 25 µl of vitreous/ aqueous humor and tears Metabolite extraction: Monophasic 50:50 methanol:ethanol LC/MS analysis Derivatization followed by GC/MS analysis Agilent 6550 Q-TOF Column 1 Agilent ZORBAX RRHD SB-Aq Column 2 Agilent Poroshell 120 HILIC Mode Both positive and negative MS and MS/MS analysis LC/MS GC/MS GC Agilent 7200 MS Agilent 5975C Column Agilent DB-5MS (p/n 122-5532G) Method Ref manual G1676-80000 Agilent MassHunter Qualitative Analysis Software and Agilent MassHunter Profinder Software Agilent METLIN PCDL Data analysis Data analysis Agilent Unknown Analysis Software (Fiehn RTL library) Agilent GeneSpring 13.1-MPP Differential Analysis METLIN search using ID Browser KEGG Pathway Searches using Pathway Architect Statistics Figure 3. Summary of the workflow for monophasic extraction followed by LC/MS and GC/MS. 4

The reference solution was prepared in ethanol:acetonitrile:water (750:200:50(v/v)) containing 0.1 % acetic acid by adding an appropriate amount of Agilent HP921 reference solution. The reference solution was added continuously during the run using an external pump at the flow rate of 0.1 ml/min. Ions at m/z 64.0158 and m/z 922.0098 were used as reference ions for calibration in positive mode. Ions at m/z 68.9957, m/z 119.0363, and m/z 980.0164 were used as reference ions in negative mode. Data were acquired using electrospray ionization (ESI) in positive and negative ion modes using modified polar reverse phase C18 and HILIC columns. Tables 1A and 1B show the details of chromatographic and mass spectrometric parameters. Features were found using Agilent MassHunter Profinder, and were searched against the Agilent METLIN database and confirmed by matching them against the Agilent METLIN MS/MS library. Agilent MassHunter Profinder (v. B.06.00) software was used for processing LC/MS data. The accurate mass MS data were processed using the recursive MFE tool in the Profinder software. The aligned compounds were exported to an Agilent Mass Profiler Professional module of the Agilent GeneSpring Multi-Omics Analysis software. Agilent Mass Profiler Professional software (MPP) was used for statistical comparison of the LC/MS data from Rb and control samples. The differential list was exported as a.cef file. The.cef file, which contains the mass and retention time of the differential compounds data, was used as a database to process the data-dependent MS/MS files. The MS/MS data were processed using the Find by Formula algorithm of the Agilent MassHunter Qualitative Analysis Software (v B.07.00), pulling out those features found only in the differential list. The spectral pattern generated was searched against the Agilent METLIN MS/MS library. Table 1A. The chromatographic parameters used in the LC/MS analysis. LC parameters Column Agilent ZORBAX RRHD SB-Aq 3.0 50 mm, 1.8 µm column (p/n 857700-314) Agilent Poroshell 120 HILIC Plus, 3.0 50 mm, 2.7 µm column (p/n 699975-301) Ionization mode Positive and negative MS and MS/MS Positive and negative MS and MS/MS Mobile phase LC Gradient A) Water with 0.2 % acetic acid B) Methanol with 0.2 % acetic acid Time (min) 1.0 12.0 13.0 15.0 15.1 20.0 % mobile phase B 5.0 35.0 95.0 95.0 5.0 5.0 A) (9:1) Acetonitrile:water with 50 mm ammonium acetate B) (5:4:1) Acetonitrile:water:water with 50 mm ammonium acetate Time (min) 3.0 12.0 15.0 15.1 20.0 Table 1B. The MS source parameters used in MS and MS/MS data acquisition. Agilent 6550 MS parameters Injection volume 5 µl Flow rate Thermostated column temperature 40 C 0.3 ml/min Gas temperature 175 C Drying gas flow Nebulilzer 15 L/min 45 psig Sheath gas temperature 200 C Sheath gas flow VCap Nozzle voltage Fragmentor 175 Skimmer 65 Octopole RF peak 750 Min range Max range MS Scan rate in MS mode MS Scan rate in MS/MS mode MS/MS scan rate Isolation width (MS/MS) Collision energy 12 L/min 3,500 V 1,000 V 50 m/z 1,200 m/z 1 spectra/sec 6 spectra/sec 3 spectra/sec Medium (~4 amu) 10, 20, and 40 V % mobile phase B 0.0 100.0 100.0 0.0 0.0 5

GC/MS sample preparation and data analysis The dried samples were subjected to derivatization as described elsewhere 2. An Agilent Fiehn GC/MS Metabolomics Standards Kit (p/n 400505) was used to perform derivatization. An Agilent 7890C GC coupled to an Agilent 7200 GC/Q-TOF system was used for data acquisition. D27 myristic acid was used for retention time locking. Data were acquired using an EI source and an Agilent DB-5ms column (p/n 122-5532G). Table 2 shows the GC/MS chromatographic and acquisitions parameters. The 7200 GC/Q-TOF data were processed using MassHunter Unknowns Analysis Software (version B.07.00). This software uses mass spectral deconvolution, which automatically finds peaks and deconvolutes spectra from coeluting compounds using model ion traces. The spectral information was matched with the retention time locked Agilent-Fiehn library and retention time index with respect to FAME mix (Agilent Fiehn GC-MS Metabolomics Standards Kit, p/n 400505). The matched compound list was exported to MPP for further analysis. Protein estimation Three microliters of each ocular fluid sample was added to 7 µl of water containing 0.9% sodium chloride to estimate the protein concentration using the Bio-Rad DC Protein Assay following the manufacturer guidelines. A standard curve was generated using BSA, and the total protein of the extract was determined using UV measurement at 750 nm. This protein concentration was used to normalize the LC/MS metabolomics data during MPP analysis. Multi-omics data analysis The metabolomics and gene microarray results were combined and analyzed using a pathway-centric approach. Genomics and metabolomics data were covisualized in the pathway context using the Multi-Omics Analysis tool of GeneSpring 13.1. This enabled simultaneous viewing of the differential entities from both gene expression and metabolomics. The data integration of different omics data in GeneSpring helps to gain a better understanding of the interrelationship between changes in expression of individual biochemical entities. Reagents and materials LC/MS grade isopropanol, methanol, and acetonitrile were purchased from Fluka (Germany). Milli-Q water (Millipore Elix 10 model, USA) was used for mobile phase preparation. The additives, ammonium fluoride, acetic acid, ammonium formate, formic acid, and ammonium acetate, were procured from Fluka (Germany). Table 2. The Agilent 7200 GC/Q-TOF chromatography and MS experimental parameters. GC parameters Column 2nd transfer column Injection volume 1 µl Inlet Inlet temperature 250 C Carried gas and flow Oven temperature program Transfer line temperature 290 C Agilent 7200 Q-TOF parameters Ionization mode Source temperature 230 C Quadrupole temperature 150 C m/z scan Spectral acquisition rate Agilent DB-5MS, 30 mm 0.2 5mm, 0.25 µm, Guard column 10 m (p/n 122-5532G) Deactivated fused silica, 0.7 m 150 µm, 0 µm (p/n 122-5532) at constant pressure of 3 psi Multimode operated in split less mode Helium at 1.8636 ml/min, contact flow 60 C for 1 minute 10 C/min to 325 C 10 minute hold EI 50 to 600 m/z 5 spectra/sec, 2,679 transients/spectrum 6

Results and Discussion mrna Approximately 1,600 genes were differentially expressed (p 0. 05 and FC 10) between control and Rb samples. Pathway analysis revealed many key pathways to be differentially altered in Rb affected samples. Figure 4 displays many key entities of the photo transduction pathway to be differentially expressed. The pathway in GeneSpring is illustrated in two ways: first by coloring the rectangles surrounding the differentially expressed genes, and second by displaying histograms near entities to indicate relative levels of genes between Rb and controls. For example, Figure 4 shows down-regulation of transducine (Gt), with red and blue bars representing the expression levels in control and Rb samples, respectively. Phototransduction Plasma membrane Dark Disc Disc membrane Control Rb RK Rec Rh Ca 2+ Arr Gt Transduction RGS9 PDE cgmp GC Ca 2+ GC inhibition GCAP NCKX Ca 2+ K + Dark adaptation 4Na + Vertebrate rod photoreceptor cell CaM Ca 2+ Ca 2+ Depolarization CNG Open Na + cgmp Light Disc Plasma membrane Disc membrane Light hn Rec + P RK Ca 2+ Rh Arr Gt Transduction cgmp RGS9 Visual pigment Rhodopsin (Rh) hn Metarhodopsin II Rhodopsin (Interaction with Gt) Activation Inactivation Enzymatic regeneration Opsin 11-cis-Retinal all-trans-retinal Opsin PDE RK Arr cgmp GC GCAP Mg 2+ Vertebrate rod photoreceptor cell Ca 2+ GC activation K + CaM NCKX Light adaptation 4Na + Retinol metabolism in animals Hyperpolarization (light response) CNG cgmp Figure 4. Gene expression pathway analysis showing the photo transduction pathway to be significantly affected. 7

mirna We could identify 18 mirnas to be significantly regulated (p 0.05 and FC 10) in Rb samples relative to the controls. Figure 5 displays a volcano plot showing differentially expressed mirnas. Metabolomics analysis Metabolomics analysis was conducted using both LC/MS (positive and negative, HILIC and C18 columns) and GC/MS methods. LC/MS and GC/MS produce nonoverlapping results, allowing different classes of compounds to be identified with confidence. For LC/MS, the choice of two different columns along with both ionization modes gave broader coverage of metabolites. HILIC columns are an ideal choice for the analysis of polar compounds, and C18 columns for nonpolar compounds. The metabolomics experiments were conducted on aqueous humor, vitreous humor, and tear samples from the same set of samples used for gene expression and mirna. Of the 102, 117, and 75 total entities identified from aqueous, vitreous, and tear samples, respectively by GC/MS, 32, 32, and 23 differential metabolites were identified by spectral search against the Agilent Fiehn GC/MS library. The identified metabolites included mostly carbohydrates and amino acids. GC/MS and gene expression Log 10 (corrected Pvalue) 4 3 2 1 multi omics analysis was performed using the differential gene entity list from Rb and control tissues samples, and identified differential metabolite list from aqueous humor, vitreous humor, and tear samples independently. Table 3 shows the predominate pathways revealed by combined GC/MS and gene expression multi-omics in tears. -6-4 -2 0 2 4 6 Log 2 (fold change) Figure 5. Volcano plot showing 18 differentially regulated mirna between Rb and controls. Of the 1,000 validated and reported targets of these 18 mirnas, 12 were found to be differentially expressed in the mrna data from this study. Table 3. Predominant pathways revealed by combined GC-MS and gene expression multi-omic analysis in tears Pathway p-value (Gene expression) Matched entities (Gene expression) Pathway entities of experiment type (Gene expression) Matched entities (Tears_GC_ Normalization) Fructose and mannose metabolism 0.0882 3 32 3 53 Starch and sucrose metabolism 0.0651 4 56 3 51 ABC transporters 0.1777 3 44 3 122 Aminoacyl-tRNA biosynthesis 0.7726 1 66 2 52 Protein digestion and absorption 0.3250 4 89 2 47 Carbohydrate digestion and absorption 0.0031 6 45 2 27 Pathway entities of experiment type (Tears_ GC_Normalization) 8

Pathway analysis and visualization can be extended to multi-omics analysis as well. Figure 6 shows the combined analysis of transcriptomics data from tissues and metabolomics using GC/MS from aqueous humor of the same samples visualized on the valine biosynthesis pathway. The histograms next to the yellow bar and the blue bar show the relative levels of genes and metabolites, respectively. GC/MS studies showed high levels of valine to be present in aqueous humor Rb samples. The valine synthesis pathway also shows significantly low levels of the branched chain amino-acid transaminase 1 enzyme (BCAT1)(EC 2.6.1.42), in Rb samples. The deficiency of the enzyme is responsible for the up regulation of valine, which is also evident from the pathway figure. Valine, leucine, and isoleucine biosynthesis Glycine, serine, and threonine metabolism D-erythro- 3-methylmalate 2-Methylmaleate 4.2.1.35 4.2.1.35 (R)-2-Methylmalate Gene expression enrichment Aqueous humor-gc normalized enrichment Threonine 4.3.1.19 1.1.1.85 2-Oxobutanoate 2.2.1.6 2.2.1.6 (S)-2-Aceto- 2-hydroxybutanoate (S)-2-Acetolactate 23.1.182 Pyruvate Acetyl-CoA Gene expression Control Rb AH_GC_Normalized Control Rb 1.1.1.86 (R)-3-Hydroxy-3-methyl- 2-oxopentanoate 5.4.99.3 1.1.1.86 5.4.99.3 3-Hydroxy-3-methyl- 2-oxobutanoate Pyruvate metabolism (R)-2,3-Dihydroxy- 3-methylpentanoate 1.1.1.86 (R)-2,3-Dihydroxy- 3-methylbutanoate 1.1.1.86 (S)-3-Methyl- 2-oxopentanoate 4.2.1.9 2-Oxoisovalerate 4.2.1.9 2.6.1.42 1.4.1.9 2.6.1.42 1.4.1.9 2.3.3.13 2.6.1.66 (2R,3S)-3-2-Isopropylmalate Isopropylmalate 4.2.1.33 4.2.1.33 (2S)-2- Isopropylmalate 1.1.1.85 L-Isoleucine L-Valine Valine, leucine, and isoleucine degradation (2S)-2-Isopropyl- 3-oxosuccinate spontaneous 4-Methyl- 2-oxopentanoate 2.6.1.6 2.6.1.42 1.4.1.9 L-Leucine Figure 6. Multi-omics analysis of aqueous humor GC/MS and tissue gene expression reveals the valine, leucine and isoleucine biosynthesis pathway to be significantly down-regulated. 9

The differentially expressed entities between Rb and control samples across aqueous humor, vitreous humor, and tear samples were matched by searching against the METLIN LC/MS accurate mass database, resulting in approximately 1,300 annotated metabolites. Figure 7 shows the combined multi-omics analysis of LC/MS and gene expression data visualized on the purine metabolism pathway. LC/MS studies of the tear samples showed significantly lower levels of inosine and uric acid in the tears of Rb versus the control group. Figure 7. LC/MS and gene expression multi-omics analysis shows purine metabolism to be significantly affected in tumor samples. 10

Figure 8 shows the MS/MS spectra of inosine as identified and confirmed using the Agilent METLIN MS/MS library. Several associated genes in the pathway were also found to be differentially regulated. Conclusions This study demonstrates the value of using a multi-omics approach to understanding the biological pathways in disease progression. The data were acquired using an Agilent multi omics portfolio including reagents, instrumentation, and software tools designed for transcriptomics and metabolomics. Agilent GeneSpring, Agilent Mass Profiler Professional, and Agilent Pathway Architect software offer new biological insights from complex biological datasets. The software enables researchers to perform discovery experiments by co-analyzing data from different omics such as transcriptomics and metabolomics. Counts Counts Counts Cpd40: -ESI Product ion 10 2 1.0 0.5 0 135.0312 92.9272 267.0737 59.0142 208.9266 314.3148 365.0170 414.2394 10 2 Cpd40: -ESI Product ion 1 B 135.0312 92.9272 267.0737 59.0142 208.9266 314.3148 365.0170 414.2394 0-1 10 2 Inosine C 10 H 12 N 4 O 5 Product ion 1.0 C 135.0299 0.5 0 A 92.0254 267.0735 60 80 100 120 140 160 180 200 220 240 260 280 300 320 340 360 380 400 420 440 Mass-to-charge (m/z) Figure 8. MS/MS Library spectral match of inosine A) spectra taken from the sample, B) mirror plot combining the MS/MS from the library and from the unknown sample C) spectra from Agilent METLIN MS/MS library. Combining data from transcriptomics and metabolomics revealed previously unknown pathways related to retinoblastoma. For example, the valine, leucine, and isoleucine biosynthesis pathway has never been implicated in retinoblastoma. In addition to the high levels of valine detected by our GC/MS experiments, gene expression results also showed the enzyme branched chain amino-acid transaminase 1 (BCAT1), an enzyme required for valine metabolism, to be significantly lower in Rb samples. Our study opens a possibility for follow up studies, and demonstrates a unified approach to discovering new biological insights an approach made possible by integrating data from different omics in the same sample. 11

References 1. Dweep, H., et al. mirwalk - database: prediction of possible mirna binding sites by walking the genes of 3 genomes, J. of Biomedical Informatics 2011, 44, pp 839-7. 2. Palazoglu, M., and Fiehn, O. Metabolite Identification in Blood Plasma Using GC/MS and the Agilent Fiehn GC/MS Metabolomics RTL Library. Agilent Technologies Application Note, publication number 5990-3638EN, 2009. www.agilent.com/chem For Research Use Only. Not for use in diagnostic procedures. This information is subject to change without notice. Agilent Technologies, Inc., 2015 Published in the USA, October 1, 2015 5991-6215EN