Gene Signature Lab: Exploring integrative LINCS (ilincs) Data and Signatures Analysis Portal & Other LINCS Resources

Similar documents
Knowledge-Guided Analysis with KnowEnG Lab

LINCS: Example of Handling Multimodal data and Integration. Ajay Pillai NHGRI

Gene expression connectivity mapping and its application to Cat-App

Smart India Hackathon

Next-Generation Sequencing Gene Expression Analysis Using Agilent GeneSpring GX

Whole Transcriptome Analysis of Illumina RNA- Seq Data. Ryan Peters Field Application Specialist

AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE

IPA Advanced Training Course

QIAGEN s NGS Solutions for Biomarkers NGS & Bioinformatics team QIAGEN (Suzhou) Translational Medicine Co.,Ltd

Bioinformatics Analysis of Nano-based Omics Data

Introduction to Microarray Analysis

From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow

Analysis of a Tiling Regulation Study in Partek Genomics Suite 6.6

resequencing storage SNP ncrna metagenomics private trio de novo exome ncrna RNA DNA bioinformatics RNA-seq comparative genomics

About Strand NGS. Strand Genomics, Inc All rights reserved.

Developing an Accurate and Precise Companion Diagnostic Assay for Targeted Therapies in DLBCL

IPA : Maximizing the Biological Interpretation of Gene, Transcript & Protein Expression Data with IPA

MOLECULAR BIOLOGY OF EUKARYOTES 2016 SYLLABUS

RNA-Seq analysis using R: Differential expression and transcriptome assembly

ACCELERATING GENOMIC ANALYSIS ON THE CLOUD. Enabling the PanCancer Analysis of Whole Genomes (PCAWG) consortia to analyze thousands of genomes

Corporate Medical Policy

Terminology for personalized medicine

Next Generation Sequencing Data Analysis with BioHPC. Updated for

Introducing a Highly Integrated Approach to Translational Research: Biomarker Data Management, Data Integration, and Collaboration

Stefano Monti. Workshop Format

PAREXEL GENOMIC MEDICINE SERVICES. Applying genomics to enhance your drug development journey

Personalized Medicine

Bayesian Networks as framework for data integration

Introduction to Bioinformatics

Introduction to Next Generation Sequencing (NGS) Data Analysis and Pathway Analysis. Jenny Wu

Introduction to Bioinformatics and Gene Expression Technologies

Goals of pharmacogenomics

Protein Synthesis: Transcription and Translation

Assembling Protein Molecules

BBISR Do it Yourself Workshops Bioinformatics & Biostatistics Tools for Cancer Research

Ontologies - Useful tools in Life Sciences and Forensics

Identifying Signaling Pathways. BMI/CS 776 Spring 2016 Anthony Gitter

Welcome to the NGS webinar series

Themes: RNA and RNA Processing. Messenger RNA (mrna) What is a gene? RNA is very versatile! RNA-RNA interactions are very important!

If Dna Has The Instructions For Building Proteins Why Is Mrna Needed

Runs of Homozygosity Analysis Tutorial

Agilent Genomic Workbench 7.0

Fundamentals of Bioinformatics: computation, biology, computational biology

Application of Deep Learning to Drug Discovery

Nature Methods: doi: /nmeth Supplementary Figure 1. Validation of RaPID with EDEN15

"Stratification biomarkers in personalised medicine"

Quick reference guide

DNA Microarray Technology

DNA. bioinformatics. epigenetics methylation structural variation. custom. assembly. gene. tumor-normal. mendelian. BS-seq. prediction.

Nature Methods: doi: /nmeth.3732

Gene Regulation Solutions. Microarrays and Next-Generation Sequencing

Nature Genetics: doi: /ng Supplementary Figure 1

Supplemental Table 1 Gene Symbol FDR corrected p-value PLOD1 CSRP2 PFKP ADFP ADM C10orf10 GPI LOX PLEKHA2 WIPF1

Data Intensive Scientific Discovery Vijay Chandru

Towards unbiased biomarker discovery

Exercise1 ArrayExpress Archive - High-throughput sequencing example

Whole genome sequencing in drug discovery research: a one fits all solution?

SureSilencing sirna Array Technology Overview

Cytomics in Action: Cytokine Network Cytometry

In silico prediction of novel therapeutic targets using gene disease association data

Preanalytical Processing: The Biospecimen Quality Imperative

Advanced Bioinformatics Biostatistics & Medical Informatics 776 Computer Sciences 776 Spring 2018

Genetic Basis of Development & Biotechnologies

Theoretische Biologie

Green Center Computational Core ChIP- Seq Pipeline, Just a Click Away

Introduction to RNA-Seq. David Wood Winter School in Mathematics and Computational Biology July 1, 2013

Optimization of RNAi Targets on the Human Transcriptome Ahmet Arslan Kurdoglu Computational Biosciences Program Arizona State University

Top 5 Lessons Learned From MAQC III/SEQC

What we ll do today. Types of stem cells. Do engineered ips and ES cells have. What genes are special in stem cells?

SENIOR BIOLOGY. Blueprint of life and Genetics: the Code Broken? INTRODUCTORY NOTES NAME SCHOOL / ORGANISATION DATE. Bay 12, 1417.

The Effect of cdna on Tumor Cells Growth on Nude Mice

Do engineered ips and ES cells have similar molecular signatures?

Introduction to Bioinformatics

Exam MOL3007 Functional Genomics

The first and only fully-integrated microarray instrument for hands-free array processing

PCR Arrays. An Advanced Real-time PCR Technology to Empower Your Pathway Analysis

Predictive and Causal Modeling in the Health Sciences. Sisi Ma MS, MS, PhD. New York University, Center for Health Informatics and Bioinformatics

Gene Network Central (GNC) Pro Tutorial

MICROARRAYS+SEQUENCING

China National Grid --- BioNode. Jun Wang Beijing Genomics Institute

Gene List Enrichment Analysis - Statistics, Tools, Data Integration and Visualization

Gene-Level Analysis of Exon Array Data using Partek Genomics Suite 6.6

Exam 2 3/19/07 P. Sengupta BISC 4A

Metabolic collateral vulnerabilities of MTAP-deleted cancers as therapeutic opportunities Keystone on Tumor Metabolism 2017

Clinician s Guide to Actionable Genes and Genome Interpretation

Year III Pharm.D Dr. V. Chitra

Terminology: chromosome; gene; allele; proteins; enzymes

Cancer Genetics Solutions

Gene Expression Technology

Form for publishing your article on BiotechArticles.com this document to

WORKDAY: Appraising Performance General Workflow

Applications of Big Data in Evidence-Based Medicine

3.1.4 DNA Microarray Technology

Analysis of Microarray Data

Multi-omics in biology: integration of omics techniques

Preanalytical Variables in Blood Collection: Impact on Precision Medicine

Feature Selection of Gene Expression Data for Cancer Classification: A Review

Welcome to R&D Day! Christine Lindenboom VP, Investor Relations & Corporate Communications

Corporate Presentation. February 2, 2018

1. Page 90: Cellular Metabolism Explain what the everyday use of the word metabolism means to you.

Transcription:

Gene Signature Lab: Exploring integrative LINCS (ilincs) Data and Signatures Analysis Portal & Other LINCS Resources Jarek Meller, PhD BD2K-LINCS Data Coordination and Integration Center University of Cincinnati Gene Signature Lab, Comp. Genomics Course, IGB 607

Outline A couple of quick reminders: CMAP & LINCS Interacting with Big Omics Data using ilincs (Medvedovic et al.) Part I: deriving and interpreting genomic signatures P53 ER Part II: searching for drug targets and drugs Exploring other LINCS-related tools (Enrichr, L1000CDS2, Ma ayan et al.) Gene Signature Lab, Comp. Genomics Course, IGB 607

LINCS: Extending Connectivity Map Negative correlation with disease transcriptional signature Potential of the drug to reverse the disease process J Lamb et al. Science 2006;313:1929-1935

cell types LINCS Cube Cancer cell lines ips cells Primary cells Transcriptomic (L1000, RNA-seq) Proteomic Phosphoproteomic Morphoplogical Proliferation, apoptosis, Perturbations Chemical perturbagens (~30,000 x doses) Genetic perturbations (~30,000 x shrnas) Microenvironment perturbations Disease http://lincsproject.org

Towards Using CMAP/LINCS as Resources for Personalized Precision Medicine NOTE that small molecules with negatively correlating signatures with respect to an individual tumor signature (characterized by some mutations and some up- and down-regulated genes) could potentially be used to identify drugs to treat that particular tumor! This can be viewed as reversing the signature of the tumor This and other applications can be greatly facilitated by highly integrative and intuitive tools that enable seamless interaction with Big Omics Data, such as LINCS ilincs 5

ilincs: Linking Datasets and Signatures with Online Analysis What are my genes/proteins doing in other datasets? Constructing and analyzing signatures from transcriptomics and proteomics datasets Analyzing and mining perturbation and disease signatures ilincs.org, Mario Medvedovic et al., University of Cincinnati

ilincs Team ilincs.org, Mario Medvedovic et al., University of Cincinnati

ilincs Demo I: p53 Signature in Breast Tumors Gene Signature Lab, Comp. Genomics Course, IGB 607

Getting started Go to http://www.ilincs.org/ilincs/ Select Datasets workflow by either clicking on Datasets in the top bar or data sets icon below icon Select All Data sets and TCGA (click on Choose button to the right); select the 3 rd data set from the top (919 BRCAs) 9

Exploratory analysis Explore Heatmap Download Data ilincs.org, Mario Medvedovic et al., University of Cincinnati

Note that NAs can be effectively classified ilincs.org, Mario Medvedovic et al., University of Cincinnati

Let us generate p53 signature ilincs.org, Mario Medvedovic et al., University of Cincinnati

Gene Signature Lab, Comp. Genomics Course, IGB 607

P53 signature can be used to reclassify wt and mutants JM - http://folding.chmcc.org 14

Work around to generate the correct heatmap: Use the signature to re-analyze the same data set.

Work around to generate the correct heatmap

Work around to generate the correct heatmap

Avi Ma yan et al., Mount Sinai School of Medicine

Gene Signature Lab, Comp. Genomics Course, IGB 607

Big p53 signature 21

Dataset Analysis Workflow Enrichment analysis via Enricher Pathway analysis vis SPIA algorithm LINCS RNA-seq dataset Differential gene expression signature Small molecule CD signatures L1000CDS2 LINCS RPPA dataset TCGA RNA-seq BC dataset Connected TF binding and L1000 KD signatures ilincs.org, Mario Medvedovic et al., University of Cincinnati

ilincs Datasets 3,600 Datasets TCGA Transcriptomics ENCODE TF Binding Data P100 + GCP Proteomics ilincs.org, Mario Medvedovic et al., University of Cincinnati

Signatures Workflow Finding signatures Analyze Connected Signatures ilincs.org, Mario Medvedovic et al., University of Cincinnati

ilincs Signatures ilincs.org, Mario Medvedovic et al., University of Cincinnati

Genes Workflow Finding genes Dataset workflow Signatures workflow ilincs.org, Mario Medvedovic et al., University of Cincinnati

ilincs Demo II: ER Signature in Cell Lines vs. Breast Tumors Go to http://www.ilincs.org/ilincs/ Select Datasets workflow by either clicking on Datasets in the top bar or data sets icon below icon Select LINCS Data sets and select the last data set Oregon Health Sciences 54 mrna-seq samples from cell lines (click on Analyze button to the right) Click on Generate a Signature Select Grouping variable as ER Define groups as + and - (ER positive and ER negative cell lines) Click Create signature Select Use differentially expressed genes to analyze another set (work around) and choose the same Oregon Health Sciences data set and select Statistical analysis of genes and select ER again as the grouping variable, open heatmap Do the same, but this time find the TCGA BRCA data set and generate heatmap Gene Signature Lab, Comp. Genomics Course, IGB 607

Cell lines cluster largely by ER status; unassigned cell lines can be predicted to have either negative or positive ER status. Note that genes were selected to make that happen this is not a truly unsupervised approach. ilincs.org, Mario Medvedovic et al., University of Cincinnati

ilincs.org, Mario Medvedovic et al., University of Cincinnati

Going back to the page with ER signature: Step-by-step instructions one more time Go to http://www.ilincs.org/ilincs/ Select Datasets workflow by either clicking on Datasets in the top bar or data sets icon below icon Select LINCS Data sets and select the last data set Oregon Health Sciences 54 mrna-seq samples from cell lines (click on Analyze button to the right) Click on Generate a Signature Select Grouping variable as ER Define groups as + and - (ER positive and ER negative cell lines) Click Create signature Click Enrichr to perform enrichment analysis

Going back to the page with ER signature

Gene Signature Lab, Comp. Genomics Course, IGB 607

Avi Ma yan et al., Mount Sinai School of Medicine

ilincs Demo III: Reversing ER Signature Gene Signature Lab, Comp. Genomics Course, IGB 607

1 3 2

Searching by Gene Knockdown Signatures

Group Analysis of Raloxifen Signatures

Concordant vs. Discordant Signatures

Searching for Novel ER(-pathway) Inhibitors (concordance>0.4)

Caveats: Potentially Small Overlap with L1000 Gene Set for User Defined Signatures

Caveats: Current Sparse Cube Transcriptomics HMS LINCS Proteomics Microenvironment DTox http://lincsproject.org/ NeuroLINCS

Take home messages: i) Potential gold mine for hypothesis generation and mechanistic insights ii) Use with utmost caution, do not over-interpret, validate iii) Please be somewhat patient with the tools they keep getting better For the rest f the lab, try to reproduce as much as possible one more time. Gene Signature Lab, Comp. Genomics Course, IGB 607