Network System Inference

Similar documents
Experimental / Data Technology

CS 5984: Topics and Schedule

BIOINFORMATICS AND SYSTEM BIOLOGY (INTERNATIONAL PROGRAM)

ECS 234: Genomic Data Integration ECS 234

Uncovering differentially expressed pathways with protein interaction and gene expression data

BIOINF/BENG/BIMM/CHEM/CSE 184: Computational Molecular Biology. Lecture 2: Microarray analysis

Bayesian Variable Selection and Data Integration for Biological Regulatory Networks

Engineering Genetic Circuits

From Proteomics to Systems Biology. Integration of omics - information

Towards Gene Network Estimation with Structure Learning

Research proposal. Title: Development and Implementation of a Data Model for Pathway Mapping. Student: Linghao Yi

Protein-Protein-Interaction Networks. Ulf Leser, Samira Jaeger

Inferring Gene Networks from Microarray Data using a Hybrid GA p.1

Protein-Protein-Interaction Networks. Ulf Leser, Samira Jaeger

Microarrays & Gene Expression Analysis

Technical University of Denmark

Random matrix analysis for gene co-expression experiments in cancer cells

ECS 234: Introduction to Computational Functional Genomics ECS 234

Outline and learning objectives. From Proteomics to Systems Biology. Integration of omics - information

Data representation for clinical data and metadata

Computational Biology

Inferring Cellular Networks Using Probabilis6c Graphical Models. Jianlin Cheng, PhD University of Missouri 2010

Learning theory: SLT what is it? Parametric statistics small number of parameters appropriate to small amounts of data

Design Principles in Synthetic Biology

Introduction to Bioinformatics

PREDICTION AND SIMULATION OF MULTI-TARGET THERAPIES FOR TRIPLE NEGATIVE BREAST CANCER THROUGH A NETWORK-BASED DATA INTEGRATION APPROACH

Introduction to BIOINFORMATICS

Proteomics. Manickam Sugumaran. Department of Biology University of Massachusetts Boston, MA 02125

ROAD TO STATISTICAL BIOINFORMATICS CHALLENGE 1: MULTIPLE-COMPARISONS ISSUE

Gene expression analysis. Biosciences 741: Genomics Fall, 2013 Week 5. Gene expression analysis

Computational Genomics. Reconstructing signaling and dynamic regulatory networks

ECS 234: Introduction to Computational Functional Genomics ECS 234

Including prior knowledge in shrinkage classifiers for genomic data

Microarray Informatics

Capabilities & Services

CMSE 520 BIOMOLECULAR STRUCTURE, FUNCTION AND DYNAMICS

Bayesian Networks as framework for data integration

The application of hidden markov model in building genetic regulatory network

Inference of Gene Relations from Microarray Data by Abduction

CS 5854: Predictive Models of Transcriptional Regulatory Networks

Computational Genomics. Regulatory Machinery of Gene Expression

Machine learning applications in genomics: practical issues & challenges. Yuzhen Ye School of Informatics and Computing, Indiana University

Bioinformatics : Gene Expression Data Analysis

Embeddable Sensor/Actuator Networks for Biological Systems

Examination Assignments

Machine Learning. HMM applications in computational biology

Identifying Signaling Pathways. BMI/CS 776 Spring 2016 Anthony Gitter

The Two-Hybrid System

This place covers: Methods or systems for genetic or protein-related data processing in computational molecular biology.

Loosely Coupled Architecture for Bio-Network Reverse Engineering

Experimental Techniques 2

Biology 644: Bioinformatics

Biomedical Sciences Graduate Program

Protein-Protein-Interaction Networks. Ulf Leser, Samira Jaeger

Our view on cdna chip analysis from engineering informatics standpoint

MANIFESTO OF STUDIES 2012

From Bench to Bedside: Role of Informatics. Nagasuma Chandra Indian Institute of Science Bangalore

VALLIAMMAI ENGINEERING COLLEGE

Research Powered by Agilent s GeneSpring

Learning Bayesian Network Models of Gene Regulation

CS 6824: New Directions in Computational Systems Biology

IPA Advanced Training Course

SIMS2003. Instructors:Rus Yukhananov, Alex Loguinov BWH, Harvard Medical School. Introduction to Microarray Technology.

/ Computational Genomics. Time series analysis

China National Grid --- BioNode. Jun Wang Beijing Genomics Institute

MANIFESTO OF STUDIES 2013

Short Course Instructors

Statistical Methods for Network Analysis of Biological Data

Péter Antal Ádám Arany Bence Bolgár András Gézsi Gergely Hajós Gábor Hullám Péter Marx András Millinghoffer László Poppe Péter Sárközy BIOINFORMATICS

Pathways from the Genome to Risk Factors and Diseases via a Metabolomics Causal Network. Azam M. Yazdani, PhD

Following text taken from Suresh Kumar. Bioinformatics Web - Comprehensive educational resource on Bioinformatics. 6th May.2005

27041, Week 02. Review of Week 01

6.047 / Computational Biology: Genomes, Networks, Evolution Fall 2008

GREG GIBSON SPENCER V. MUSE

Gene function prediction. Computational analysis of biological networks. Olga Troyanskaya, PhD

CENTER FOR BIOTECHNOLOGY

Bioinformatics and Life Sciences Standards and Programming for Heterogeneous Architectures

Networks of Signal Transduction and Regulation in Cellular Systems

Era with Computational Biology/Toxicology

Microarray Informatics

Judy Wieber. Department of Computational Biology. May 28, 2008

Syllabus for BIOS 101, SPRING 2013

Stefano Monti. Workshop Format

Grand Challenges in Computational Biology

Cory Brouwer, Ph.D. Xiuxia Du, Ph.D. Anthony Fodor, Ph.D.

Accuracy of the Bayesian Network Algorithms for Inferring Gene Regulatory Networks

Neural Networks and Applications in Bioinformatics. Yuzhen Ye School of Informatics and Computing, Indiana University

Introduction to Microarray Analysis

How to deal with the microarray results.

Progress and Future Directions in Integrated Systems Toxicology. Mary McBride Agilent Technologies

Bioinformatics. Dick de Ridder. Tuinbouw Digitaal, 12/11/15

Neural Networks and Applications in Bioinformatics

Computational Genomics

and Promoter Sequence Data

Impact of Retinoic acid induced-1 (Rai1) on Regulators of Metabolism and Adipogenesis

2017 Qualifying Examination

BIOINFORMATICS Introduction

Integrating Biological Databases in the Context of Transcriptional Regulatory Networks

Computational Methods for Systems Biology: Analysis of High-Throughput Measurements and Modeling of Genetic Regulatory Networks

Welcome! Introduction to High Throughput Genomics December Norwegian Microarray Consortium FUGE Bioinformatics platform

Transcription:

Network System Inference Francis J. Doyle III University of California, Santa Barbara Douglas Lauffenburger Massachusetts Institute of Technology WTEC Systems Biology Final Workshop March 11, 2005 What is Systems Inference? Estimation of interactions of elements in a network given time series data of activities of different nodes (e.g., gene interactions from expression data) Goals Hypothesis generation Design of experiment Understanding cellular function Unravel design principles Data sources Large scale deletion projects DNA microarray ChiP Assay 1

Spectrum of Network Modeling [Stelling, 2005] Complexity Gene-Protein Sequence Protein biochemistry Cell dynamics Coverage Genomics Systems analysis of networks 2

Network Model Identification Problem Basic transcription model ( ) log Y eg = α fg X ef + n eg f F( g) e = experiment, f = factor, g = gene X = TF level, Y = expression, α = connection strength F = connection knowledge (i.e., localization constraint) not considering details of clustering methods, etc. Basic signal transduction model Rate of generation x& =Ω + Sr µ x gen (, ) r = f x p Rate of degradation Rate of inter-conversion Applications of Network Inference Metabolic networks Signaling networks Development models Gene regulatory networks Physical interaction networks 3

Tools and Mathematical Approaches Many Approaches Boolean networks + optimization [Ideker et al., 2000] Signed directed graphs + graph theory (SDGs) [Kyoda et al., 2000] Bayesian networks + machine learning [Pe er et al., 2001] Acyclic graphs + graph theory [Wagner, 2001] S-functions + decomposition [Kimura et al., 2004] Technical refinements that are promising innovations Exploit sparsity Exploit modularity Refined design of experiment Representations S-functions, Bayesian nets, SDGs, etc. Motifs and Modules Interconnection of function, dynamics & network topology Role in robustness compensation [Stelling et al., 2004] Recurring regulatory motifs Switches, oscillators, filters, memory, amplifiers [Wolf & Arkin, 2000] Technical approaches Pattern matching Constrained optimization Others Modules Larger size, inter-connectivity, autonomy Confers: robustness evolvability [Lee et al., 2002; Shen-Orr et al., 2002] 4

Modules in Yeast Molecular Network [Tanay et al., 2004] Issue of Dynamics Largely static interactions maps are constructed Explosion of parameters for dynamic network models Limited dynamic data available Isolated exceptions circadian rhythm work of Ueda (RIKEN, Kobe) [Ueda et al., Nature Genetics, 2005] 5

Iterations Between Models and Experiments Very few groups report studies with multiple modelexperiment iterations e.g., Klingmüller (Germany), Nobel (UK) Convergence is an issue Reporting modeling failures is an issue Multiple groups are at an early stage Important issues Model validation/invalidation Identifiability Design of experiment [Kitano, 2002] Validation, Verification, Consistency, etc. Validation or verification is critical step in any model identification problem [Ljung, 1999] Matching of data (to date): consistency In practice, only invalidation is possible [Poolla et al., 1994] Contradiction w/ data is often the most valuable role of a model Model discrimination can suggest new experiments Competing hypotheses can be resolved Data sets can be invalidated Various statistical tools for model invalidation Measure of error Confidence intervals Likelihood ratios 6

Identifiability Issues General questions [Ljung, 1999] Will identification procedure yield unique parameter set? Is resulting model equal to true system? Can experimental conditions lead to model discrimination? Design of experiment issues inputs (ligand, environmental, knockouts, etc.) measurements (localization information, expression, etc.) perturbation richness ; duration of experiment; sampling protocol Few applications in biological network inference Statistical mechanics approaches [Brown & Sethna, 2003] Formal identifiability of gene networks [Zak et al., 2003] Design of experiment [Faller, et al., 2003] Measurement selection [Stelling & Gilles, 2005] Role of Perturbations vs. Data [Stelling & Gilles, MPI, Magdeburg, Germany] 7

Regional Highlights - Japan Kitano Lab Systems Biology Markup Language [SBML] Difference-based regulation finding-minimum equivalent gene network RIKEN Cooperativity coevolutionary inference algorithm Receptor Tyrosine Kinase Regulatory Networks Consortium Ueda lab profiling of circadian regulation Miyano Lab (U. Tokyo) Bayesian networks and non-parametric regression for yeast gene networks Collaboration with pharma company for drug studies Computational Biology Research Center Gene regulatory network inference (lung cancer studies) KEGG (U. Kyoto) Portability of network data SBML, GON, etc. Reconstruction of networks via kernel methods (incl. dynamics) Ito Lab (U. Tokyo) Heterogeneous measurements for network inference (MS proteomics, FRET metabolomics, ChiP & GATC-PCR TF-binding, MS phosphorylation, MS protein complex) Regional Highlights - Europe MPI Dynamics of Complex Systems Design of experiment, model iterations, identifiability, perturbations Collaborative Research Center for Theoretical Biology (Humboldt U.) Dynamic modeling, driving new microarray data applied to Ras pathway German Cancer Research Center Klingmueller: design of experiment, novel hypothesis testing, model discrimination application to Jak/STAT pathway to predict unobservable behaviors Reuss: Bioinformatics studies of cytochrome p450 Oxford University Center for Mathematical Biology (Armitage): bottom-up approach to pathway analysis in histidine sensing Dept. of Physiology (Noble): challenging the bottom-up approach, advocating a combination that starts in the middle 8

General Observations Problems Many people using limited data to regress lots of coefficients Little true validation out there Limited success with dynamic network inference Promise Many, many approaches to fit models to data ( regression ) Motifs and modules (structure) are being factored into inference methods Nicer interplay possible between network modelers (static) and dynamic modelers with portability of tools and data Lots of curricular development in this area (bioinformatics) 9