Bayesian Decomposition

Similar documents
Relationship of Gene s Types and Introns

Biochemistry study of the molecular basis of life

CELL BIOLOGY BIOL3030, 3 credits Fall 2012, Aug 20, Dec 14, 2012 Tuesday/Thursday 8:00 am-9:15 am Bowman-Oddy Laboratories 1049

REGULATION OF PROTEIN SYNTHESIS. II. Eukaryotes

Chapter 18: Regulation of Gene Expression. 1. Gene Regulation in Bacteria 2. Gene Regulation in Eukaryotes 3. Gene Regulation & Cancer

FACULTY OF BIOCHEMISTRY AND MOLECULAR MEDICINE

Pennsylvania 15260, USA and 6 Department of Bioinformatics and Biosystems Technology, Technical University of Applied Sciences Wildau, Hochschulring

Scientific Method. Name: NetID: Exam 1 Version 1 September 12, 2017 Dr. A. Pimentel

Sequence Based Function Annotation. Qi Sun Bioinformatics Facility Biotechnology Resource Center Cornell University

Higher Human Biology Unit 1: Human Cells Pupils Learning Outcomes

Chapter 4: How Cells Work

Nucleic acids AP Biology

Biomedical Sciences (Medical) (BMSC)

Division Ave. High School Ms. Foglia AP Biology. Nucleic acids. AP Biology Nucleic Acids. Information storage

Nucleotide Metabolism Biochemistry by Lippincott pp

The Integrated Biomedical Sciences Graduate Program

Chapter 5: Microbial Metabolism (Part I)

AGRO/ANSC/BIO/GENE/HORT 305 Fall, 2016 Overview of Genetics Lecture outline (Chpt 1, Genetics by Brooker) #1

7.06 Cell Biology QUIZ #3

MICB688L/MOCB639 Advanced Cell Biology Exam II

8/21/2014. From Gene to Protein

Read and take notes on pages

GENE REGULATION IN PROKARYOTES

Lecture 9 Controlling gene expression

Functional Genomics Overview RORY STARK PRINCIPAL BIOINFORMATICS ANALYST CRUK CAMBRIDGE INSTITUTE 18 SEPTEMBER 2017

Gene Expression: Transcription

Genetics module. DNA Structure, Replication. The Genetic Code; Transcription and Translation. Principles of Heredity; Gene Mapping

Assembling Protein Molecules

Chapter 8 Lecture Outline. Transcription, Translation, and Bioinformatics

Chapter 3. DNA Replication & The Cell Cycle

BIOLOGY LTF DIAGNOSTIC TEST DNA to PROTEIN & BIOTECHNOLOGY

BIOCHEMISTRY, BIOPHYSICS, AND MOLECULAR BIOLOGY (BBMB)

Chapter 2 Molecules to enzymes - Short answer [72 marks]

Gut microbial colonization in day old chicks. Henri Woelders DirkJan Schokker Mari A. Smits

Introduction to Microarray Data Analysis and Gene Networks. Alvis Brazma European Bioinformatics Institute

AP Biology Gene Expression/Biotechnology REVIEW

Microarray Analysis of Gene Expression in Huntington's Disease Peripheral Blood - a Platform Comparison. CodeLink compatible

Chapter 11: Regulation of Gene Expression

BIOTECHNOLOGY. Course Syllabus. Section A: Engineering Mathematics. Subject Code: BT. Course Structure. Engineering Mathematics. General Biotechnology

Peptide libraries: applications, design options and considerations. Laura Geuss, PhD May 5, 2015, 2:00-3:00 pm EST

SCI204: Honors Biology

Chapter 3 Nucleic Acids, Proteins, and Enzymes

Protein Synthesis & Gene Expression

If Dna Has The Instructions For Building Proteins Why Is Mrna Needed

Click here to read the case study about protein synthesis.

Page 1. Name: 1) Which letter indicates a cell structure that directly controls the movement of molecules into and out of the cell?

CHAPTER 21 LECTURE SLIDES

Genetic Engineering Challenge How can scientists develop a type of rice that could prevent vitamin A deficiency? 1

Fig Ch 17: From Gene to Protein

Fundamentals of Bioinformatics: computation, biology, computational biology

CHAPTER 11 DNA NOTES PT. 4: PROTEIN SYNTHESIS TRANSCRIPTION & TRANSLATION

DNA Binding Domains: Structural Motifs. Effector Domain. Zinc Fingers. Zinc Fingers, continued. Zif268

Computational Biology I LSM5191 (2003/4)

Molecular Biology Primer. CptS 580, Computational Genomics, Spring 09

Today s lecture: Types of mutations and their impact on protein function

Temperature sensitive gene expression: Two rules.

Division Ave. High School AP Biology

T and B cell gene rearrangement October 17, Ram Savan

Time allowed: 2 hours Answer ALL questions in Section A, ALL PARTS of the question in Section B and ONE question from Section C.

Developmental Biology BY1101 P. Murphy

Concepts and Methods in Developmental Biology

PCR Arrays. An Advanced Real-time PCR Technology to Empower Your Pathway Analysis

Chapter 16: Gene Expression from Biology by OpenStax College is licensed under a Creative Commons Attribution 3.0 Unported license.

Lecture 3 (FW) January 28, 2009 Cloning of DNA; PCR amplification Reading assignment: Cloning, ; ; 330 PCR, ; 329.

Transcription Eukaryotic Cells

LECTURE: 22 IMMUNOGLOBULIN DIVERSITIES LEARNING OBJECTIVES: The student should be able to:

GENETIC BASIS OF ANTIBODY STRUCTURE AND DIVERSITY. Steven J. Norris, Ph.D

Prokaryotic Transcription

Self-test Quiz for Chapter 12 (From DNA to Protein: Genotype to Phenotype)

Hole s Essentials of Human Anatomy & Physiology

Review of Protein (one or more polypeptide) A polypeptide is a long chain of..

What is DNA??? DNA = Deoxyribonucleic acid IT is a molecule that contains the code for an organism s growth and function

The hidden transcriptome: discovery of novel, stress-responsive transcription in Daphnia pulex

BIOLOGY. Chapter 16 GenesExpression

2. Outline the levels of DNA packing in the eukaryotic nucleus below next to the diagram provided.

Genomics and Gene Recognition Genes and Blue Genes

CHAPTER 20 DNA TECHNOLOGY AND GENOMICS. Section A: DNA Cloning

Analysis of Microarray Data

CHAPTER 13 LECTURE SLIDES

Frequency of Keyword Totals - (All LE Regents Exams)

leading the way in research & development

NUCLEUS. Fig. 2. Various stages in the condensation of chromatin

Transcription in Eukaryotes

Introduction to BIOINFORMATICS

Chapter 7 Outline. Microbial Physiology Introduction 5/22/2011

Optimizing Synthetic DNA for Metabolic Engineering Applications. Howard Salis Penn State University

Nucleic Acids, Proteins, and Enzymes

Cells and Tissues. Overview CELLS

IPA Advanced Training Course

BABELOMICS: Microarray Data Analysis

BIOINFORMATICS IN AQUACULTURE. Aleksei Krasnov AKVAFORSK (Ås, Norway) Bergen, September 21, 2007

Regulation of gene expression. (Lehninger pg )

Gene Expression Technology

Metabolic Networks. Ulf Leser and Michael Weidlich

Ch 10.4 Protein Synthesis

DNA DNA Profiling 18. Discuss the stages involved in DNA profiling 19. Define the process of DNA profiling 20. Give two uses of DNA profiling

GENE REGULATION slide shows by Kim Foglia modified Slides with blue edges are Kim s

Estimating Cell Cycle Phase Distribution of Yeast from Time Series Gene Expression Data

Computational Biology I LSM5191

Transcription:

Bayesian Decomposition Michael Ochs

Making Proteins

A Closer Look at Translation Post-Translational Modification RNA Splicing mirna

Identifying Pathways A 1 2 3 B C D A B C D Bioinformatics www.promega.com

Goal of Analysis Take measurements of thousands of genes, some of which are responding to stimuli of interest And find the correct set of basis vectors that link to pathways Bioinformatics 1 2 3 * * * * then identify * the pathways *

BD: Matrix Decomposition gene 1 gene N condition 1 Data condition M = The behavior of one gene can be explained as a mixture of patterns Distribution of Patterns gene 1 gene N pattern 1 pattern k ** ** X condition 1 with different behaviors condition M pattern 1 pattern k Patterns of Behavior

Patterns as Basis Vectors

BD with Knowledge of Classes gene 1 gene N condition 1 Data condition M = Distribution of Patterns gene 1 gene N pattern 1 pattern k ** ** X condition 1 condition M 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 * * pattern 1 Patterns of Behavior pattern k

BD Structure Atomic Domains Allow Encoding of Biological Information Markov Chain Monte Carlo is used to explore possible sets of distributions and patterns

Project Normal Data Download Data from CAMDA Site Adjust for Background Measurement Take Ratios Calc Mean and SDOM for Each Ratio Eliminate M3T and M4T Data Eliminate 24 Points with Only 1 Data Pt 99% 4 Pts, 1% 3 Pts, 0.1% 2 Pts

Filtering of Genes Eliminated all ESTs Annotated Remaining Genes from Gene Ontology on Unigene Name Annotated all Genes on Clone ID 24% Changed Unigene Cluster 948 Clones Had GO Process Information

Updating Annotations: ASAP http://bioinformatics.fccc.edu/

Bayesian Decomposition Encoded 3 Known Patterns Kidney, 6 Conditions Liver, 6 Conditions Testis, 4 Conditions Allowed 1-3 Additional Patterns Account for Behavior Unrelated to Tissue Specific Expression

Fitting the Data

Four Patterns 0.3 0.25 Kidney Liver Testis Background 0.2 0.15 0.1 0.05 0 Bioinformatics 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16

Five Patterns 0.3 0.25 0.2 Kidney Liver Testis Background 1 Background 2 0.15 0.1 0.05 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16

Four vs Five Patterns

Gene Ontology Identify Genes Only in One Pattern See if Pattern Enhanced in GO Identify Genes in a Pattern 3σ above Zero in Distribution Look at GO Assignments Identify Genes Lacking in Pattern Eliminate Background (Genes > 70%) Look for Genes Not in Pattern (3σ)

Genes Only in Kidney by GO neurotransmitter transport * chloride transport * receptor mediated endocytosis enzyme linked receptor protein signaling pathway * transmembrane receptor protein tyrosine kinase signaling pathway * vitamin/cofactor transport * vitamin B12 transport inorganic anion transport * anion transport * neuropeptide signaling pathway endocytosis * From Old Annotations Sodium transport, vesiclemediated transport, amino acid transport, folate transport, homophilic cell adhesion, cell-cell adhesion, monovalent inorganic cation transport metal ion transport > 10x Enhancement

Genes Only in Liver by GO antigen processing antigen processing, endogenous antigen via MHC class I" cellular defense response response to drug drug susceptibility/resistance * cell-cell adhesion * homophilic cell adhesion * response to abiotic stimulus response to chemical substance response to pest/pathogen/parasite protein targeting From Old Annotations small molecule transport, histogenesis and organogenesis, embryogenesis and morphogenesis, lipid metabolism > 10x Enhancement

Genes Only in Testis by GO DNA recombination meiotic recombination reproduction * gametogenesis * spermatogenesis * regulation of transcription from Pol II promoter microtubule-based movement microtubule-based process development * > 10x Enhancement From Old Annotations nuclear organization and biogenesis, chromosome organization and biogenesis, cell organization and biosynthesis

Kidney Genes, 3σ, > 2 fold amino acid metabolism inflammatory response mitotic cell cycle amine metabolism anion transport nitrogen metabolism perception of abiotic stimulus perception of light cell-cell adhesion homophilic cell adhesion S phase of mitotic cell cycle endocytosis G-protein coupled receptor protein signaling pathway

Testis Genes, 3σ, >4 fold reproduction gametogenesis spermatogenesis regulation of cell shape and cell size mitotic cell cycle microtubule-based movement protein folding S phase of mitotic cell cycle

Liver Genes, 3σ, >3 fold amino acid metabolism response to drug drug susceptibility/resistance energy pathways energy derivation by oxidation of organic compounds main pathways of carbohydrate metabolism catabolic carbohydrate metabolism response to abiotic stimulus response to chemical substance sensory perception morphogenesis organogenesis tricarboxylic acid cycle

Genes Absent in Patterns obsolete Absent in Kidney monosaccharide metabolism regulation of transcription from Pol II promoter regulation of cell shape and cell size biological_process unknown reproduction gametogenesis spermatogenesis microtubule-based process Absent in Liver reproduction gametogenesis spermatogenesis cell differentiation actin filament-based process actin cytoskeleton organization and biogenesis microtubule-based movement

Genes Absent in Background 1 biological_process unknown obsolete protein modification protein targeting actin filament-based process actin cytoskeleton organization and biogenesis endocytosis regulation of transcription from Pol II promoter reproduction gametogenesis spermatogenesis mitotic cell cycle

Genes Present in Two Tissues Kidney/Liver not Testis cell-cell adhesion homophilic cell adhesion defense response immune response amino acid metabolism amine metabolism perception of abiotic stimulus perception of light Kidney/Testis not Liver mitotic cell cycle

Acknowledgements This Work Tom Moloshok DJ Datta (Cambridge) Andrew Kossenkov Bill Speier (JHU) Colleagues J. Robert Beck Frank Manion Programming Jeffrey Grant Elizabeth Goralczyk Luke Somers Others G. Parmigiani (JHU) T. Brown (Columbia) E. Korotkov (RAS)