Inference of predictive gene interaction networks

Similar documents
Controversy In Pharmacogenomics

WebMeV A Cloud Based Platform for Genomic Analysis Yaoyu E. Wang

BIG DATA IN HEALTH CARE AND BIOMEDICAL RESEARCH. John Quackenbush Dana-Farber Cancer Institute Harvard School of Public Health

Inferring Gene Networks from Microarray Data using a Hybrid GA p.1

Le nouveau Master ULB en Data Science et Big Data

Package BAGS. September 21, 2014

Introduction to Bioinformatics

Introduction to BIOINFORMATICS

Systems Biology-Informed Risk Assessment Suggested Background Reading and Additional Information. June 14-15, 2012 Washington, DC

USING NETWORKS TO UNDERSTAND THE GENOTYPE-PHENOTYPE CONNECTION. John Quackenbush Dana-Farber Cancer Institute Harvard TH Chan School of Public Health

Cancer Informatics. Comprehensive Evaluation of Composite Gene Features in Cancer Outcome Prediction

Emerging Impacts on Artificial Intelligence on Healthcare IT Session 300, February 20, 2017 James Golden, Ph.D., Christopher Ross, MBA

PREDICTION AND SIMULATION OF MULTI-TARGET THERAPIES FOR TRIPLE NEGATIVE BREAST CANCER THROUGH A NETWORK-BASED DATA INTEGRATION APPROACH

Predictive and Causal Modeling in the Health Sciences. Sisi Ma MS, MS, PhD. New York University, Center for Health Informatics and Bioinformatics

Statistical Inference and Reconstruction of Gene Regulatory Network from Observational Expression Profile

The Future of HealthCare Information Technology

COS 597c: Topics in Computational Molecular Biology. DNA arrays. Background

Network System Inference

On the Asymptotic Distribution of Likelihood Ratio Test when Parameters Lie on the Boundary

Dana-Farber Cancer Institute Speeds Medical Research with Advanced Data Warehouse

Computational Biology

Editorial. Current Computational Models for Prediction of the Varied Interactions Related to Non-Coding RNAs

Functional profiling of metagenomic short reads: How complex are complex microbial communities?

Survival Outcome Prediction for Cancer Patients based on Gene Interaction Network Analysis and Expression Profile Classification

Basics of RNA-Seq. (With a Focus on Application to Single Cell RNA-Seq) Michael Kelly, PhD Team Lead, NCI Single Cell Analysis Facility

Machine Learning. HMM applications in computational biology

Gene expression connectivity mapping and its application to Cat-App

advanced analysis of gene expression microarray data aidong zhang World Scientific State University of New York at Buffalo, USA

Machine learning applications in genomics: practical issues & challenges. Yuzhen Ye School of Informatics and Computing, Indiana University

Deep learning sequence-based ab initio prediction of variant effects on expression and disease risk

Department of Systems Biotechnology

Statistical Modeling of Epigenomics and Gene Regulation Workshop Program

IPA Advanced Training Course

Random matrix analysis for gene co-expression experiments in cancer cells

Machine Learning in Computational Biology CSC 2431

Genetic tests are available for hundreds of disorders. DNA testing can pinpoint the exact genetic basis of a disorder.

Translational Medicine in the Era of Big Data: Hype or Real?

Signaling Hypergraph. Set V of nodes proteins, small molecules, etc.

Biomedical Big Data and Precision Medicine

Uncovering differentially expressed pathways with protein interaction and gene expression data

Single-cell sequencing

WaterlooClarke: TREC 2015 Total Recall Track

GeneQuery: A phenotype search tool based on gene co-expression clustering. Alexander Predeus 21-oct-2015

MANIFESTO OF STUDIES 2012

Knowledge-Guided Analysis with KnowEnG Lab

Computational Challenges of Medical Genomics

Inferring Gene-Gene Interactions and Functional Modules Beyond Standard Models

A New Algorithm for Protein-Protein Interaction Prediction

Our website:

BIOINFORMATICS AND SYSTEM BIOLOGY (INTERNATIONAL PROGRAM)

Microarrays & Gene Expression Analysis

Identifying Signaling Pathways. BMI/CS 776 Spring 2016 Anthony Gitter

DNA Matters Constructing Genes, Enzymes and Pathways by the Plate

Single-cell RNA-sequencing

Drug Repurposing. Motivation:

Presidential Commission for the Study of Bioethical Issues 2 nd meeting September

Genetics and Bioinformatics

Prostate Cancer Genetics: Today and tomorrow

Our view on cdna chip analysis from engineering informatics standpoint

Pioneering Clinical Omics

University of Athens - Medical School. pmedgr. The Greek Research Infrastructure for Personalized Medicine

The flow diagram below shows part of a process to produce a protein, using genetically modified plants.

Title: Genome-Wide Predictions of Transcription Factor Binding Events using Multi- Dimensional Genomic and Epigenomic Features Background

Machine Learning in Pharmaceutical Research

Form for publishing your article on BiotechArticles.com this document to

DOUBLE-STRAND DNA BREAK PREDICTION USING EPIGENOME MARKS AT KILOBASE RESOLUTION

Data analysis in cell-based functional assay

Feature selection methods for mining bioinformatics data

Towards Gene Network Estimation with Structure Learning

Stefano Monti. Workshop Format

Single-Cell Genomics Deep insights into complex biology

From Bench to Bedside: Role of Informatics. Nagasuma Chandra Indian Institute of Science Bangalore

Personalized Human Genome Sequencing

Chromatin signature identifies monoallelic gene expression across mammalian cell types

Learning theory: SLT what is it? Parametric statistics small number of parameters appropriate to small amounts of data

DNA Based Disease Prediction using pathway Analysis

SPO GRANTS (2001) PHASES

Investimento & Inovação na Saúde

Lecture 8: Predicting and analyzing metagenomic composition from 16S survey data

Research Powered by Agilent s GeneSpring

ADVANCES IN INTERNATIONAL COMPARATIVE MANAGEMENT VOLUME 7

Lesson Overview. Studying the Human Genome. Lesson Overview Studying the Human Genome

Human Genome Editing: Science, Ethics, and Governance Highlights for Industry Stakeholders

KnetMiner USER TUTORIAL

BIRKBECK COLLEGE (University of London)

THE ERA OF INDIVIDUAL GENOMES. Sandra Viz Lasheras Advanced Genetics ( )

ChIP-Seq, mrna-seq, & Resequencing via the Genboree Workench

Functional Genomics and Single-Cell Genomics. Christine Queitsch Department of Genome Sciences

Functional genomics + Data mining

Gene Expression Data Analysis

An Overview of Probabilistic Methods for RNA Secondary Structure Analysis. David W Richardson CSE527 Project Presentation 12/15/2004

Differential expression analysis for sequencing count data. Simon Anders

A Propagation-based Algorithm for Inferring Gene-Disease Associations

1.2 Geometry of a simplified two dimensional model molecule Discrete flow of empty space illustrated for two dimensional disks.

TUTORIAL. Revised in Apr 2015

NMI Natural and Medical Sciences Institute at the University of Tübingen

From Proteomics to Systems Biology. Integration of omics - information

2. Materials and Methods

Introduction Reference System Requirements GIST (Gibbs sampler Infers Signal Transduction)... 4

Protein-Protein-Interaction Networks. Ulf Leser, Samira Jaeger

Transcription:

Inference of predictive gene interaction networks Benjamin Haibe-Kains DFCI/HSPH December 15, 2011 The Computational Biology and Functional Genomics Laboratory at the Dana-Farber Cancer Institute and Harvard School of Public Health Benjamin Haibe-Kains (DFCI/HSPH) R/Bioconductor Course December 15, 2011 1 / 12

Background Phenotypes result from biological networks, not individual genes New biotechnologies allow us to analyze multiple genes in parallel: next generation sequencing gene expression profiling Chip-seq... Understand the complex interactions between genes and the behavior of a network is fundamental to bring new biological insights to make useful predictions Benjamin Haibe-Kains (DFCI/HSPH) R/Bioconductor Course December 15, 2011 2 / 12

Relevance Aim: Infer reliable predictive gene interaction networks from gene expression data Beyond biological understanding, such networks would be efficient tools for: predicting the response of an organism (cancer patient) to perturbations (targeted therapies) identifying the key genes to target for significantly decreasing a pathway activity optimizing combination of drugs and therapy regiments Benjamin Haibe-Kains (DFCI/HSPH) R/Bioconductor Course December 15, 2011 3 / 12

Gene Interaction Network Genes are represented as nodes gene h gene b gene f Interactions are represented by edges gene a gene c Edges can be directed to show causal interactions gene d Edges are not necessarily direct interactions gene g gene e Benjamin Haibe-Kains (DFCI/HSPH) R/Bioconductor Course December 15, 2011 4 / 12

Gene Interaction Network and Perturbation High expression gene h gene b gene f gene h gene b gene e Low expression gene a gene c Perturbation gene a gene c gene d gene d gene g gene e gene g gene e Benjamin Haibe-Kains (DFCI/HSPH) R/Bioconductor Course December 15, 2011 5 / 12

Challenges in network inference... and how we address them 1 Problem complexity: seed the search for the best network by using prior biological knowledge about gene interactions Predictive Networks web application 2 Curse of dimensionality: development of a local regression-based network inference to enable analysis of hundreds of genes in parallel 3 Lack of validation: development of performance criteria to assess the quality of network models 4 Lack of software: implementation of network inference methods and related tools in R predictionet package Benjamin Haibe-Kains (DFCI/HSPH) R/Bioconductor Course December 15, 2011 6 / 12

Challenges in network inference... and how we address them 1 Problem complexity: seed the search for the best network by using prior biological knowledge about gene interactions Predictive Networks web application 2 Curse of dimensionality: development of a local regression-based network inference to enable analysis of hundreds of genes in parallel 3 Lack of validation: development of performance criteria to assess the quality of network models 4 Lack of software: implementation of network inference methods and related tools in R predictionet package Benjamin Haibe-Kains (DFCI/HSPH) R/Bioconductor Course December 15, 2011 6 / 12

Challenges in network inference... and how we address them 1 Problem complexity: seed the search for the best network by using prior biological knowledge about gene interactions Predictive Networks web application 2 Curse of dimensionality: development of a local regression-based network inference to enable analysis of hundreds of genes in parallel 3 Lack of validation: development of performance criteria to assess the quality of network models 4 Lack of software: implementation of network inference methods and related tools in R predictionet package Benjamin Haibe-Kains (DFCI/HSPH) R/Bioconductor Course December 15, 2011 6 / 12

Challenges in network inference... and how we address them 1 Problem complexity: seed the search for the best network by using prior biological knowledge about gene interactions Predictive Networks web application 2 Curse of dimensionality: development of a local regression-based network inference to enable analysis of hundreds of genes in parallel 3 Lack of validation: development of performance criteria to assess the quality of network models 4 Lack of software: implementation of network inference methods and related tools in R predictionet package Benjamin Haibe-Kains (DFCI/HSPH) R/Bioconductor Course December 15, 2011 6 / 12

Predictive Networks web application https://compbio.dfci.harvard.edu/predictivenetworks/ Benjamin Haibe-Kains (DFCI/HSPH) R/Bioconductor Course December 15, 2011 7 / 12

Regression-based network inference Priors Gene expression data MRMR Predictive Networks web application Undirected graph causality Inference Directed graph Directed graph Network topology (linear) regression Predictive models Benjamin Haibe-Kains (DFCI/HSPH) R/Bioconductor Course December 15, 2011 8 / 12

Performance criteria for network inference We implement a cross-validation framework to assess: edge-specific stability: what are the interactions inferred in most of the cross-validation folds? gene-specific prediction score: what are the genes whose expression can be well predicted by their parent/source genes? Benjamin Haibe-Kains (DFCI/HSPH) R/Bioconductor Course December 15, 2011 9 / 12

Predictionet R package https://github.com/bhaibeka/predictionet Benjamin Haibe-Kains (DFCI/HSPH) R/Bioconductor Course December 15, 2011 10 / 12

Acknowledgements Computational Biology and Functional Genomics Laboratory Dana-Farber Cancer Institute, USA Renee Rubio Kathleen Fleming Niall Prendergast Raktim Sinha Daniel Schlauch Alejandro Quiroz Megha Padi Stefan Bentink John Quackenbush Machine Learning Group Université Libre de Bruxelles, Belgium Catharina Olsen Gianluca Bontempi Ontario Cancer Institute Universty of Toronto, Canada Entagen Entagen, USA Christopher Bouton Erik Bakke James Hardwick Amira Djebbari Benjamin Haibe-Kains (DFCI/HSPH) R/Bioconductor Course December 15, 2011 11 / 12