Systems Biology. Nicolás Loira Center for Mathematical Modeling November 2014

Size: px
Start display at page:

Download "Systems Biology. Nicolás Loira Center for Mathematical Modeling November 2014"

Transcription

1 Systems Biology Nicolás Loira Center for Mathematical Modeling November 2014

2 Overview Intro to Systems Biology Metabolic Models

3 Systems Biology

4 Parts and pieces 4

5

6 Systems

7 Sub-systems

8 Systemic view of a Cell Signaling networks Regulatory networks Metabolic networks 8

9 Metabolic Networks

10 Signaling networks

11 Regulatory networks [Calzone, 2008]

12 Network Inference Network inference regression procedure a 0,i a 1,i a 2,i a 3,i a 4,i x x x x _ x x 4 + x 2 x 3

13 Scientific & Biotech Applications 1. Contextualization of data 2. Guidance of metabolic engineering 3. Directing hypothesis-driven discovery 4. Discovery of multi-species relationships 5. Network property discovery

14 Bioinfo. sub-fields -- Sequence analysis -- Genome annotation -- Computational evolutionary biology -- Literature analysis -- Analysis of gene expression -- Analysis of regulation -- Analysis of protein expression -- Analysis of molecules (metabolomics) -- Comparative genomics -- Modeling biological systems -- High-throughput image analysis -- Structural Bioinformatics

15 Tools from Math, CS, IT ---- algorithms, computational biology ---- databases & information system ---- web, web services ---- software engineering ---- HPC (High Performance Computing) ---- data mining ---- image processing ---- modeling and simulation ---- discrete mathematics (eg: graphs, logic) ---- control and system theory ---- statistics

16 engineering vs. reverse engineering biology: reverse engineering systems knowledge Reverse engineering is the process of discovering the technological principles of a device, object, or system through analysis of its structure, function, and operation. Engineering is the discipline, art, skill and profession of acquiring and applying scientific, mathematical,economic, social, and practical knowledge, in order to design and build structures, machines, devices, systems, materials and processes

17 Biology and models -- biology: reverse engineering systems models -- text, symbols, standards, language, math -- models: static vs dynamic ---- static: data and their relationships ---- dynamic: fluxes and multi-agent -- genomics -- prokaryote vs eukaryote

18 Models Descriptive Predictive Generic *choose two

19 Descriptive Models vs Black Box

20 Predictive Models experimental evidence biological model Simulate Simulation results modified model Simulate Simulation results Do experiment

21 Generic Models Specific model Generic model

22 Iterative model improvement experimental evidence biological model Simulate Simulation results Validate model accuracy

23 Metabolic Models

24 Definition of Metabolic Pathways l A chemical reaction interconverts chemical compounds (analogous to a production rule) A + B = C + D l An enzyme is a protein that accelerates chemical reactions. Each enzyme is encoded by one or more genes. l A pathway is a linked set of reactions (analogous to a chain of rules) A C E

25 Pathways

26 What is a Metabolic Pathway? A pathway is a conceptual unit of the metabolism An ordered set of interconnected, directed biochemical reactions A pathway forms a coherent unit: Boundaries defined at high-connectivity substrates Regulated as a single unit Evolutionarily conserved across organisms as a single unit Performs a single cellular function Historically grouped together as a unit All reactions in a single organism

27 Genome-scale Metabolic Networks Compartments Transports Influx Outflux Biomass Chains of reactions 27

28 Elements of Met.Networks a Chemical Reactions b Genes->Proteins->Reactions c cytosol extracellular nucl. mitocondria

29 Stoichiometry is the measuring of metabolites in a chemical reaction Reaction2: 2 M1 + 3 M2 => 1 M3 + 4 M4 ( )

30 A network of reactions can be described with an Stoichiometric Matrix R1 R2 R3 R4 R5 S = M1 M2 M3 M Systems Biology: Properties of Reconstructed Networks (Palsson, 2006)

31 Instantiation of a Reaction reaction Organism1 (Source) R1 S 1, S 2, S 3 genes Organism2 (Target) R1 T 1??? 31

32 Gene association S 1 S 2 S 3 Γ= {S 1,S 2,S 3 } β= and S 1 or Reaction R Gene association: S 1 and (S 2 or S 3 ) S 2 S 3 32

33 KEGG

34 KEGG Color Mapper

35 Current metabolic models of S.cerevisiae Model ID Publication Genes Reactions Metabolites Compart -ments iff708 (Förster, 03) 708 1, ind750 (Duarte, 04) 750 1, ill672 (Kuepfer, 05) 672 1, iin800 (Nookaew, 08) 800 1,446 1,118 4 ijm832 (Herrgård, 08) 832 1,857 2, ijm832 no compartments (Herrgård, 08) 832 1,573 1,748 2 imm904 (Mo, 09) 904 1,577 1,392 8

36 SBML beginning of model definition list of function definitions (optional) list of unit definitions (optional) list of compartment types (optional) list of species types (optional) list of compartments (optional) list of species (optional) list of parameters (optional) list of initial assignments (optional) list of rules (optional) list of constraints (optional) list of reactions (optional) list of events (optional) end of model definition

37 Visualization tools Cytoscape CellDesigner

38 Current reconstructions Genome-scale models are hard and expensive to build Most reconstructions are for bacteria There are no tools to correctly reconstruct models for eukaryotes [Oberhardt, 2009]

39 [Thiele, Nature de novo reconstruction Protocols, 2010]

40 Auto: Pathway Tools Software: PathoLogic l Computational creation of new Pathway/Genome Databases l Transforms genome into Pathway Tools schema and layers inferred information above the genome l Predicts operons l Predicts metabolic network l Predicts pathway hole fillers l Infers transport reactions [Slides by Peter Karp]

41 Pathway Tools Software: Pathway/Genome Editors l Interactively update PGDBs with graphical editors l Support geographically distributed teams of curators with object database system l Gene editor l Protein editor l Reaction editor l Compound editor l Pathway editor l Operon editor l Publication editor

42 Pathway Tools Software: Pathway/Genome Navigator l Querying, visualization of pathways, chromosomes, operons l Analysis operations l Pathway visualization of gene-expression data l Global comparisons of metabolic networks l Comparative genomics l WWW publishing of PGDBs l Desktop operation

43

44 Auto: Pantograph Re-using existing models Saccharomyces cerevisiae Saccharomyces paradoxus Saccharomyces mikatae Saccharomyces kudriavzevii Saccharomyces bayanus Saccharomyces exiguus Saccharomyces servazzii Saccharomyces castellii Candida glabrata Zygosaccharomyces rouxii Kluyveromyces thermotolerans We need methods that can use an existing model to provide a base to build models for other organisms This should decrease the amount of work needed to build a metabolic model Kluyveromyces waltii Saccharomyces kluyveri Kluyveromyces lactis Kluyveromyces marxianus Ashbya gossypii 44

45 Model Organism 1 Auto: Pantograph Genes Organism 1 ] PANTOGRAPH METHOD Model Organism 2 Genes organism 2

46 Pantograph Workflow Target annotated genes Source annotated genes Reaction Instantiator Instantiated Reactions curator editediteditedit Source metabolic model Model Instantiator target draft model Curation target model accuracy Validation target curated model experimental evidence 46

47 Reconstructions with Pantograph [Loira et al., 2012] Yarrowia lipolytica (INRA Micalis, France) Biofuels Nannochloropsis salina (AUSTRAL Biotech, Chile) Ectocarpus siliculosus (INRIA Dylis, France) Biomining Acidithiobacillus ferrooxidans (MATHomics, CMM, U.Chile)

48 Gap Filling Automatic procedures can generate models with gaps Gaps can exist because: A U U The organism doesn t have that reaction C? D Metabolic network U Candidates to fill gap The tools could not identify an homolog for the genes The organism have a different way to produce this enzymatic reaction

49 Manual Curation Models reconstructed by automatic methods still require manual curation To Fix non-obvious gaps (but obvious for an expert in the organism) To Add relevant knowledge from the literature

50 Manual curation Target annotated genes Source annotated genes Reaction Instantiator Instantiated Reactions curator editediteditedit Source model Model Instantiator target draft model Curation target model accuracy Validation target curated model experimental evidence 50

51 Simulation and predictions of metabolism Models can be used to predict behavior Model Simulate Predict Experimental Validate Data

52 Fluxes Fluxes account for the number of times a reaction Representation of the variation of Metabolites in a time unit, given some flux: S * v = dx/dt S: Stoichiometric Matrix v: Flux Vector happens x: Metabolite vector x=(m1 M2 M3 M4... )

53 Steady State Steady-state mean number of internal metabolites is constant in time S * v = dx/dt = 0 Biological Systems tend to stabilize with time in a Steady State

54 Flux Balance Analysis (FBA) Flux Balance Analysis (FBA) is useful to search fluxes that maximize a function Input Substrates Metabolic Model Look for optimal path to maximize production Cell Products Max f() (biomass, ethanol, etc.)

55 FBA solutions are restricted by constrains Constraints Model Biomass Function

56 FBA can be used to maximize biomass production Biomass = ,3betaDglcn ala amp arg asn asp atp cmp cys damp dcmp dgmp dtmp ergst gln glu gly glycogen gmp h2o his ile leu lys mannan met pa pc pe phe pro ps ptd1ino ser so thr tre triglyc trp tyr ump val zymst

57 From a constrained flux problem we build an LP problem Maximize: ω define metabolites in biomass function Subject to: α and β are bounds to fluxes

58 We use FBA to measure effects of reaction deletion in biomass Non-lethal From LP B Growth Deletion Max Biomass{< B No Growth Lethal Deletion

59 Genetic conditions (Gene KOs) Knocking out a gene and measuring growth growth (OD) Viable [Joice, 2006] (threshold=⅓ of average OD) Lethal time

60 Simulation vs experimental evidence A simulation is accurate if predicts (lack of) growth correctly Experimental Growth Growth True Positive Simulation FBA No Growth False Positive No Growth Growth No Growth False Negative True Negative

61 Metabolic model accuracy Example: Model ind750, Biomass Function ind750 Against experimental results from (Winzeler,1999) Rich Media Minimal Media Accuracy= (TP+TN) (TP+TN+FP+FN) TP TN 1 6 FP 3 4 FN 9 4 Accuracy Accuracy= (Tp+Tn)/(tp+tn+fp+fn) sensitivity= tp/(tp+fn) specificity= tn/(tn+fp) geom.mean= sqr(sens*specif)

62 Accuracy report Model Simulator MATLAB COBRA Toolbox Literature Prediction Experiment Prediction Prediction Prediction VALIDATOR Experiment Experiment Experiment Prediction Experiment Accuracy report

63 Detailed validation Y. lipolytica S. cerevisiae Gene Exp. Simul. Ref Media [36] Ethanol knocked YALI0C24101g locus ortholog YGL062W name PYC1 Growth + Growth Result - FN [36] Aspartate YALI0C24101g YGL062W PYC1 + + TP [36] Glutamate YALI0C24101g YGL062W PYC1 + - FN [36] YNBD YALI0C16885g YER065C ICL1 + + TP [36] Ethanol YALI0C16885g YER065C ICL1 - - TN [36] Aspartate YALI0C16885g YER065C ICL1 + + TP [36] Glutamate YALI0C16885g YER065C ICL1 + - FN [36] YNBD YALI0C24101g YGL062W ICL1 YALI0C16885g YER065C PYC1 - - TN [36] Ethanol YALI0C24101g YGL062W ICL1 YALI0C16885g YER065C PYC1 - - TN [36] Aspartate YALI0C24101g YGL062W ICL1 YALI0C16885g YER065C PYC1 + + TP YALI0C24101g YGL062W ICL1 63

64 Y. lipolytica Model validation 152 experiments 98 simulations 39 True Positives 16 False Positives 25 True Negatives18 False Negatives 64

65 Tools used for this analysis COBRA toolbox: for FluxBalanceAnalysis Matlab glpk/glpkmex: for solving LP libsbml/sbmltools: for SBML handling Pantograph

66 Validation and iterative improvement Target annotated genes Source annotated genes Reaction Instantiator Instantiated Reactions curator editediteditedit Source model Model Instantiator target draft model Curation target model accuracy Validation target curated model experimental evidence 66

67 FIN Nicolás Loira Center for Mathematical Modeling November 2014

PEPS ExoMod 27/02/17. NeoGene: Caractérisation de nouveaux gènes par expansion du code génétique chez les levures du genre Lachancea

PEPS ExoMod 27/02/17. NeoGene: Caractérisation de nouveaux gènes par expansion du code génétique chez les levures du genre Lachancea PEPS ExoMod 27/02/17 NeoGene: Caractérisation de nouveaux gènes par expansion du code génétique chez les levures du genre Lachancea Gilles Fischer - Biology of Genomes Shixin Ye-Lehmann - Genetic code

More information

03-511/711 Computational Genomics and Molecular Biology, Fall

03-511/711 Computational Genomics and Molecular Biology, Fall 03-511/711 Computational Genomics and Molecular Biology, Fall 2011 1 Problem Set 0 Due Tuesday, September 6th This homework is intended to be a self-administered placement quiz, to help you (and me) determine

More information

Multiple Gap-Filling of Flux-Balance Models

Multiple Gap-Filling of Flux-Balance Models Multiple Gap-Filling of Flux-Balance Models Mario Latendresse and Peter D. Karp Bioinformatics Research Group SRI International, Menlo Park, USA COBRA 2011, Reykjavik, Iceland, June 25 Outline 1 The Problem

More information

Algorithms in Bioinformatics ONE Transcription Translation

Algorithms in Bioinformatics ONE Transcription Translation Algorithms in Bioinformatics ONE Transcription Translation Sami Khuri Department of Computer Science San José State University sami.khuri@sjsu.edu Biology Review DNA RNA Proteins Central Dogma Transcription

More information

Basic modeling approaches for biological systems -II. Mahesh Bule

Basic modeling approaches for biological systems -II. Mahesh Bule Basic modeling approaches for biological systems -II Mahesh Bule Modeling Metabolism Metabolism is general term for two kind of reactions Catabolic reactions (breakdown of complex compounds to get energy

More information

NIH Public Access Author Manuscript Nat Biotechnol. Author manuscript; available in PMC 2011 June 6.

NIH Public Access Author Manuscript Nat Biotechnol. Author manuscript; available in PMC 2011 June 6. NIH Public Access Author Manuscript Published in final edited form as: Nat Biotechnol. 2010 March ; 28(3): 245 248. doi:10.1038/nbt.1614. What is flux balance analysis? Jeffrey D. Orth, University of California

More information

From DNA Sequences to Genome-Scale Metabolic Models (to Network Projects in CellNetAnalyzer)

From DNA Sequences to Genome-Scale Metabolic Models (to Network Projects in CellNetAnalyzer) From DNA Sequences to Genome-Scale Metabolic Models (to Network Projects in CellNetAnalyzer) Sven Thiele Max Planck Institute for Dynamics of Complex Technical Systems, Germany Introduction A genome-scale

More information

CFSSP: Chou and Fasman Secondary Structure Prediction server

CFSSP: Chou and Fasman Secondary Structure Prediction server Wide Spectrum, Vol. 1, No. 9, (2013) pp 15-19 CFSSP: Chou and Fasman Secondary Structure Prediction server T. Ashok Kumar Department of Bioinformatics, Noorul Islam College of Arts and Science, Kumaracoil

More information

EE550 Computational Biology

EE550 Computational Biology EE550 Computational Biology Week 1 Course Notes Instructor: Bilge Karaçalı, PhD Syllabus Schedule : Thursday 13:30, 14:30, 15:30 Text : Paul G. Higgs, Teresa K. Attwood, Bioinformatics and Molecular Evolution,

More information

Handling the heterogeneity of genomic and metabolic networks data within flexible workflows with the PADMet toolbox

Handling the heterogeneity of genomic and metabolic networks data within flexible workflows with the PADMet toolbox Handling the heterogeneity of genomic and metabolic networks data within flexible workflows with the PADMet toolbox Marie Chevallier, Meziane Aite, Jeanne Got, Guillaume Collet, Nicolas Loira, María Paz

More information

In silico measurements of twist and bend. moduli for beta solenoid protein self-

In silico measurements of twist and bend. moduli for beta solenoid protein self- In silico measurements of twist and bend moduli for beta solenoid protein self- assembly units Leonard P. Heinz, Krishnakumar M. Ravikumar, and Daniel L. Cox Department of Physics and Institute for Complex

More information

Metabolic modeling (4cr)

Metabolic modeling (4cr) 582605 Metabolic modeling (4cr) Lecturer: prof. Juho Rousu Course assistant: Markus Heinonen Lectures: Tuesdays and Fridays, 14.15-16, B119 Exercises: 16.03.-24.04. Tuesdays 16.15-18, C221 Course topics:

More information

Amino Acid Sequences and Evolutionary Relationships

Amino Acid Sequences and Evolutionary Relationships Amino Acid Sequences and Evolutionary Relationships One technique used to determine evolutionary relationships is to study the biochemical similarity of organisms. Though molds, aardvarks, and humans appear

More information

Amino Acid Sequences and Evolutionary Relationships

Amino Acid Sequences and Evolutionary Relationships Amino Acid Sequences and Evolutionary Relationships Pre-Lab Discussion Homologous structures -- those structures believed to have a common origin but not necessarily a common function -- provide some of

More information

Metabolic Networks. Ulf Leser and Michael Weidlich

Metabolic Networks. Ulf Leser and Michael Weidlich Metabolic Networks Ulf Leser and Michael Weidlich This Lecture Introduction Systems biology & modelling Metabolism & metabolic networks Network reconstruction Strategy & workflow Mathematical representation

More information

7.014 Quiz II Handout

7.014 Quiz II Handout 7.014 Quiz II Handout Quiz II: Wednesday, March 17 12:05-12:55 54-100 **This will be a closed book exam** Quiz Review Session: Friday, March 12 7:00-9:00 pm room 54-100 Open Tutoring Session: Tuesday,

More information

7.014 Solution Set 4

7.014 Solution Set 4 7.014 Solution Set 4 Question 1 Shown below is a fragment of the sequence of a hypothetical bacterial gene. This gene encodes production of HWDWN, protein essential for metabolizing sugar yummose. The

More information

Inference in Metabolic Network Models using Flux Balance Analysis

Inference in Metabolic Network Models using Flux Balance Analysis Inference in Metabolic Network Models using Flux Balance Analysis BMI/CS 776 www.biostat.wisc.edu/bmi776/ Mark Craven craven@biostat.wisc.edu Spring 2011 Goals for Lecture the key concepts to understand

More information

Basic concepts of molecular biology

Basic concepts of molecular biology Basic concepts of molecular biology Gabriella Trucco Email: gabriella.trucco@unimi.it What is life made of? 1665: Robert Hooke discovered that organisms are composed of individual compartments called cells

More information

Metabolism and metabolic networks. Metabolism is the means by which cells acquire energy and building blocks for cellular material

Metabolism and metabolic networks. Metabolism is the means by which cells acquire energy and building blocks for cellular material Metabolism and metabolic networks Metabolism is the means by which cells acquire energy and building blocks for cellular material Metabolism is organized into sequences of biochemical reactions, metabolic

More information

ENZYMES AND METABOLIC PATHWAYS

ENZYMES AND METABOLIC PATHWAYS ENZYMES AND METABOLIC PATHWAYS This document is licensed under the Attribution-NonCommercial-ShareAlike 2.5 Italy license, available at http://creativecommons.org/licenses/by-nc-sa/2.5/it/ 1. Enzymes build

More information

Zool 3200: Cell Biology Exam 3 3/6/15

Zool 3200: Cell Biology Exam 3 3/6/15 Name: Trask Zool 3200: Cell Biology Exam 3 3/6/15 Answer each of the following questions in the space provided; circle the correct answer or answers for each multiple choice question and circle either

More information

466 Asn (N) to Ala (A) Generate beta dimer Interface

466 Asn (N) to Ala (A) Generate beta dimer Interface Table S1: Amino acid changes to the HexA α-subunit to convert the dimer interface from α to β and to introduce the putative GM2A binding surface from β- onto the α- subunit Residue position (α-numbering)

More information

11 questions for a total of 120 points

11 questions for a total of 120 points Your Name: BYS 201, Final Exam, May 3, 2010 11 questions for a total of 120 points 1. 25 points Take a close look at these tables of amino acids. Some of them are hydrophilic, some hydrophobic, some positive

More information

Basic concepts of molecular biology

Basic concepts of molecular biology Basic concepts of molecular biology Gabriella Trucco Email: gabriella.trucco@unimi.it Life The main actors in the chemistry of life are molecules called proteins nucleic acids Proteins: many different

More information

BIOINFORMATICS AND SYSTEM BIOLOGY (INTERNATIONAL PROGRAM)

BIOINFORMATICS AND SYSTEM BIOLOGY (INTERNATIONAL PROGRAM) BIOINFORMATICS AND SYSTEM BIOLOGY (INTERNATIONAL PROGRAM) PROGRAM TITLE DEGREE TITLE Master of Science Program in Bioinformatics and System Biology (International Program) Master of Science (Bioinformatics

More information

BIOSTAT516 Statistical Methods in Genetic Epidemiology Autumn 2005 Handout1, prepared by Kathleen Kerr and Stephanie Monks

BIOSTAT516 Statistical Methods in Genetic Epidemiology Autumn 2005 Handout1, prepared by Kathleen Kerr and Stephanie Monks Rationale of Genetic Studies Some goals of genetic studies include: to identify the genetic causes of phenotypic variation develop genetic tests o benefits to individuals and to society are still uncertain

More information

DNA.notebook March 08, DNA Overview

DNA.notebook March 08, DNA Overview DNA Overview Deoxyribonucleic Acid, or DNA, must be able to do 2 things: 1) give instructions for building and maintaining cells. 2) be copied each time a cell divides. DNA is made of subunits called nucleotides

More information

Dr. R. Sankar, BSE 631 (2018)

Dr. R. Sankar, BSE 631 (2018) Pauling, Corey and Branson Diffraction of DNA http://www.nature.com/scitable/topicpage/dna-is-a-structure-that-encodes-biological-6493050 In short, stereochemistry is important in determining which helices

More information

Using DNA sequence, distinguish species in the same genus from one another.

Using DNA sequence, distinguish species in the same genus from one another. Species Identification: Penguins 7. It s Not All Black and White! Name: Objective Using DNA sequence, distinguish species in the same genus from one another. Background In this activity, we will observe

More information

He who asks is a fool for five minutes, but he who does not ask remains a fool forever. Today. Admin Stuff. CSE527 Computational Biology

He who asks is a fool for five minutes, but he who does not ask remains a fool forever. Today. Admin Stuff. CSE527 Computational Biology CSE527 Computational Biology http://www.cs.washington.edu/527 He who asks is a fool for five minutes, but he who does not ask remains a fool forever. Larry Ruzzo Autumn 2006 -- Chinese Proverb UW CSE Computational

More information

ELLI HEIKKINEN ALGORITHM FOR IDENTIFYING GENETIC MODIFICATIONS FOR OPTIMIZATION OF MICROBIAL PRODUCTION STRAINS. Bachelor of Science Thesis

ELLI HEIKKINEN ALGORITHM FOR IDENTIFYING GENETIC MODIFICATIONS FOR OPTIMIZATION OF MICROBIAL PRODUCTION STRAINS. Bachelor of Science Thesis ELLI HEIKKINEN ALGORITHM FOR IDENTIFYING GENETIC MODIFICATIONS FOR OPTIMIZATION OF MICROBIAL PRODUCTION STRAINS Bachelor of Science Thesis Examiner: Associate Professor Heikki Huttunen, Supervisor: Research

More information

Protein Structure Analysis

Protein Structure Analysis BINF 731 Protein Structure Analysis http://binf.gmu.edu/vaisman/binf731/ Secondary Structure: Computational Problems Secondary structure characterization Secondary structure assignment Secondary structure

More information

7.013 Problem Set 3 FRIDAY October 8th, 2004

7.013 Problem Set 3 FRIDAY October 8th, 2004 MIT Biology Department 7.012: Introductory Biology - Fall 2004 Instructors: Professor Eric Lander, Professor Robert. Weinberg, Dr. laudette ardel Name: T: 7.013 Problem Set 3 FRIDY October 8th, 2004 Problem

More information

36. The double bonds in naturally-occuring fatty acids are usually isomers. A. cis B. trans C. both cis and trans D. D- E. L-

36. The double bonds in naturally-occuring fatty acids are usually isomers. A. cis B. trans C. both cis and trans D. D- E. L- 36. The double bonds in naturally-occuring fatty acids are usually isomers. A. cis B. trans C. both cis and trans D. D- E. L- 37. The essential fatty acids are A. palmitic acid B. linoleic acid C. linolenic

More information

Pathway Tools Schema and Semantic Inference Layer: Pathways and the Overview. SRI International Bioinformatics

Pathway Tools Schema and Semantic Inference Layer: Pathways and the Overview. SRI International Bioinformatics Pathway Tools Schema and Semantic Inference Layer: Pathways and the Overview 1 Outline Pathways Representation of Pathways Querying Pathways Programmatically How Pathway Diagrams are Generated Future Work:

More information

a) Give the sequence of the mrna transcribed from this gene and indicate the 5 and 3 ends of the mrna.

a) Give the sequence of the mrna transcribed from this gene and indicate the 5 and 3 ends of the mrna. ame: Section : 7.014 Problem Set 4 nswers to this problem set are to be turned in at the box outside 68120 by 11:45 am, Wednesday, March 9. Problem sets will not be accepted late. Solutions will be posted

More information

Amino Acid Sequences and Evolutionary Relationships. How do similarities in amino acid sequences of various species provide evidence for evolution?

Amino Acid Sequences and Evolutionary Relationships. How do similarities in amino acid sequences of various species provide evidence for evolution? Amino Acid Sequences and Evolutionary Relationships Name: How do similarities in amino acid sequences of various species provide evidence for evolution? An important technique used in determining evolutionary

More information

Nucleic acid and protein Flow of genetic information

Nucleic acid and protein Flow of genetic information Nucleic acid and protein Flow of genetic information References: Glick, BR and JJ Pasternak, 2003, Molecular Biotechnology: Principles and Applications of Recombinant DNA, ASM Press, Washington DC, pages.

More information

Problem Set Unit The base ratios in the DNA and RNA for an onion (Allium cepa) are given below.

Problem Set Unit The base ratios in the DNA and RNA for an onion (Allium cepa) are given below. Problem Set Unit 3 Name 1. Which molecule is found in both DNA and RNA? A. Ribose B. Uracil C. Phosphate D. Amino acid 2. Which molecules form the nucleotide marked in the diagram? A. phosphate, deoxyribose

More information

Dynamic Programming Algorithms

Dynamic Programming Algorithms Dynamic Programming Algorithms Sequence alignments, scores, and significance Lucy Skrabanek ICB, WMC February 7, 212 Sequence alignment Compare two (or more) sequences to: Find regions of conservation

More information

He who asks is a fool for five minutes, but he who does not ask remains a fool forever. Tonight. Admin Stuff. CSEP590A Computational Biology

He who asks is a fool for five minutes, but he who does not ask remains a fool forever. Tonight. Admin Stuff. CSEP590A Computational Biology CSEP590A Computational Biology http://www.cs.washington.edu/csep590a He who asks is a fool for five minutes, but he who does not ask remains a fool forever. Larry Ruzzo Summer 2006 -- Chinese Proverb UW

More information

Lecture 11: Gene Prediction

Lecture 11: Gene Prediction Lecture 11: Gene Prediction Study Chapter 6.11-6.14 1 Gene: A sequence of nucleotides coding for protein Gene Prediction Problem: Determine the beginning and end positions of genes in a genome Where are

More information

Station 1 DNA Evidence

Station 1 DNA Evidence Station 1 DNA Evidence Cytochrome-c is a protein found in the mitochondria that is used in cellular respiration. This protein consists of a chain of 104 amino acids. The chart below shows the amino acid

More information

MetaFlux in Pathway Tools (Short Tutorial)

MetaFlux in Pathway Tools (Short Tutorial) MetaFlux in Pathway Tools (Short Tutorial) Mario Latendresse SRI International 22 September 2015 Outline 1 Introduction to Flux Balance Analysis What is Flux Balance Analysis (FBA)? Overview of MetaFlux

More information

Network System Inference

Network System Inference Network System Inference Francis J. Doyle III University of California, Santa Barbara Douglas Lauffenburger Massachusetts Institute of Technology WTEC Systems Biology Final Workshop March 11, 2005 What

More information

DNA and the Double Helix in the Fifties: Papers Published in Nature which mention DNA and the Double Helix

DNA and the Double Helix in the Fifties: Papers Published in Nature which mention DNA and the Double Helix DNA and the Double Helix in the Fifties: Papers Published in Nature 1950-1960 which mention DNA and the Double Helix DNA paper Mention double helix 50 40 30 20 10 1950 1951 1952 1953 1954 1955 1956 1957

More information

Using Pathway Tools & Matlab for Flux Balance Analysis. Kent Peterson 25 Aug. 2009

Using Pathway Tools & Matlab for Flux Balance Analysis. Kent Peterson 25 Aug. 2009 Using Pathway Tools & Matlab for Flux Balance Analysis Kent Peterson 25 Aug. 2009 Summary Flux Balance Analysis (FBA) Workflow: from Ptools to Matlab, and back again A simple example What next? 2 Let s

More information

Sequence Alignment and Phylogenetic Tree Construction of Malarial Parasites

Sequence Alignment and Phylogenetic Tree Construction of Malarial Parasites 72 Sequence Alignment and Phylogenetic Tree Construction of Malarial Parasites Sk. Mujaffor 1, Tripti Swarnkar 2, Raktima Bandyopadhyay 3 M.Tech (2 nd Yr.), ITER, S O A University mujaffor09 @ yahoo.in

More information

1/4/18 NUCLEIC ACIDS. Nucleic Acids. Nucleic Acids. ECS129 Instructor: Patrice Koehl

1/4/18 NUCLEIC ACIDS. Nucleic Acids. Nucleic Acids. ECS129 Instructor: Patrice Koehl NUCLEIC ACIDS ECS129 Instructor: Patrice Koehl Nucleic Acids Nucleotides DNA Structure RNA Synthesis Function Secondary structure Tertiary interactions Wobble hypothesis DNA RNA Replication Transcription

More information

NUCLEIC ACIDS. ECS129 Instructor: Patrice Koehl

NUCLEIC ACIDS. ECS129 Instructor: Patrice Koehl NUCLEIC ACIDS ECS129 Instructor: Patrice Koehl Nucleic Acids Nucleotides DNA Structure RNA Synthesis Function Secondary structure Tertiary interactions Wobble hypothesis DNA RNA Replication Transcription

More information

Engineering Genetic Circuits

Engineering Genetic Circuits Engineering Genetic Circuits I use the book and slides of Chris J. Myers Lecture 0: Preface Chris J. Myers (Lecture 0: Preface) Engineering Genetic Circuits 1 / 19 Samuel Florman Engineering is the art

More information

Bioinformatics. ONE Introduction to Biology. Sami Khuri Department of Computer Science San José State University Biology/CS 123A Fall 2012

Bioinformatics. ONE Introduction to Biology. Sami Khuri Department of Computer Science San José State University Biology/CS 123A Fall 2012 Bioinformatics ONE Introduction to Biology Sami Khuri Department of Computer Science San José State University Biology/CS 123A Fall 2012 Biology Review DNA RNA Proteins Central Dogma Transcription Translation

More information

Solutions to Problem Set 1

Solutions to Problem Set 1 MIT Department of Biology 7.014 Introductory Biology, Spring 004 Question 1 Solutions to 7.014 Problem Set 1 a) Describe the conditions of the atmosphere on prebiotic earth and how these conditions differ

More information

The Stringent Response

The Stringent Response The Stringent Response When amino acids are limiting a response is triggered to shut down a wide range of biosynthetic processes This process is called the Stringent Response It results in the synthesis

More information

De novo sequencing in the identification of mass data. Wang Quanhui Liu Siqi Beijing Institute of Genomics, CAS

De novo sequencing in the identification of mass data. Wang Quanhui Liu Siqi Beijing Institute of Genomics, CAS De novo sequencing in the identification of mass data Wang Quanhui Liu Siqi Beijing Institute of Genomics, CAS The difficulties in mass data analysis Although the techniques of genomic sequencing are being

More information

Kyoto Encyclopedia of Genes and Genomes (KEGG)

Kyoto Encyclopedia of Genes and Genomes (KEGG) NPTEL Biotechnology -Systems Biology Kyoto Encyclopedia of Genes and Genomes (KEGG) Dr. M. Vijayalakshmi School of Chemical and Biotechnology SASTRA University Joint Initiative of IITs and IISc Funded

More information

6-Foot Mini Toober Activity

6-Foot Mini Toober Activity Big Idea The interaction between the substrate and enzyme is highly specific. Even a slight change in shape of either the substrate or the enzyme may alter the efficient and selective ability of the enzyme

More information

First&year&tutorial&in&Chemical&Biology&(amino&acids,&peptide&and&proteins)&! 1.&!

First&year&tutorial&in&Chemical&Biology&(amino&acids,&peptide&and&proteins)&! 1.&! First&year&tutorial&in&Chemical&Biology&(amino&acids,&peptide&and&proteins& 1.& a. b. c. d. e. 2.& a. b. c. d. e. f. & UsingtheCahn Ingold Prelogsystem,assignstereochemicaldescriptorstothe threeaminoacidsshownbelow.

More information

Materials Protein synthesis kit. This kit consists of 24 amino acids, 24 transfer RNAs, four messenger RNAs and one ribosome (see below).

Materials Protein synthesis kit. This kit consists of 24 amino acids, 24 transfer RNAs, four messenger RNAs and one ribosome (see below). Protein Synthesis Instructions The purpose of today s lab is to: Understand how a cell manufactures proteins from amino acids, using information stored in the genetic code. Assemble models of four very

More information

Module 3. Lecture 5. Regulation of Gene Expression in Prokaryotes

Module 3. Lecture 5. Regulation of Gene Expression in Prokaryotes Module 3 Lecture 5 Regulation of Gene Expression in Prokaryotes Recap So far, we have looked at prokaryotic gene regulation using 3 operon models. lac: a catabolic operon which displays induction via negative

More information

Computational Methods for Protein Structure Prediction

Computational Methods for Protein Structure Prediction Computational Methods for Protein Structure Prediction Ying Xu 2017/12/6 1 Outline introduction to protein structures the problem of protein structure prediction why it is possible to predict protein structures

More information

From Bench to Bedside: Role of Informatics. Nagasuma Chandra Indian Institute of Science Bangalore

From Bench to Bedside: Role of Informatics. Nagasuma Chandra Indian Institute of Science Bangalore From Bench to Bedside: Role of Informatics Nagasuma Chandra Indian Institute of Science Bangalore Electrocardiogram Apparent disconnect among DATA pieces STUDYING THE SAME SYSTEM Echocardiogram Chest sounds

More information

He who asks is a fool for five minutes, but he who does not ask remains a fool forever. Today. Admin Stuff. CSE527 Computational Biology

He who asks is a fool for five minutes, but he who does not ask remains a fool forever. Today. Admin Stuff. CSE527 Computational Biology CSE527 Computational Biology http://www.cs.washington.edu/527 He who asks is a fool for five minutes, but he who does not ask remains a fool forever. Larry Ruzzo Autumn 2007 -- Chinese Proverb UW CSE Computational

More information

Systems biology of genetic interactions in yeast metabolism

Systems biology of genetic interactions in yeast metabolism Systems biology of genetic interactions in yeast metabolism Balázs Papp Evolutionary Systems Biology Group Biological Research Centre of the HAS Szeged, Hungary Genetic interaction (GI) refers to the nonindependence

More information

Solutions to 7.02 Quiz II 10/27/05

Solutions to 7.02 Quiz II 10/27/05 Solutions to 7.02 Quiz II 10/27/05 Class Average = 83 Standard Deviation = 9 Range Grade % 87-100 A 43 74-86 B 39 55-73 C 17 > 54 D 1 Question 1 (56 points) While studying deep sea bacteria, you discover

More information

DE NOVO GENOME ASSEMBLY OF THE AFRICAN CATFISH (CLARIAS GARIEPINUS)

DE NOVO GENOME ASSEMBLY OF THE AFRICAN CATFISH (CLARIAS GARIEPINUS) DE NOVO GENOME ASSEMBLY OF THE AFRICAN CATFISH (CLARIAS GARIEPINUS) Kovács B. a,, Barta E. c, Pongor S. L. b, Uri Cs. a, Patócs A. b, Orbán L. d, Müller T. a, Urbányi B. a a Department of Aquaculture,

More information

Alpha-helices, beta-sheets and U-turns within a protein are stabilized by (hint: two words).

Alpha-helices, beta-sheets and U-turns within a protein are stabilized by (hint: two words). 1 Quiz1 Q1 2011 Alpha-helices, beta-sheets and U-turns within a protein are stabilized by (hint: two words) Value Correct Answer 1 noncovalent interactions 100% Equals hydrogen bonds (100%) Equals H-bonds

More information

Functional profiling of metagenomic short reads: How complex are complex microbial communities?

Functional profiling of metagenomic short reads: How complex are complex microbial communities? Functional profiling of metagenomic short reads: How complex are complex microbial communities? Rohita Sinha Senior Scientist (Bioinformatics), Viracor-Eurofins, Lee s summit, MO Understanding reality,

More information

A study on the robustness of strain optimization algorithms

A study on the robustness of strain optimization algorithms A study on the robustness of strain optimization algorithms Paulo Vilaça, Paulo Maia, and Miguel Rocha Abstract In recent years, there have been considerable advances in the use of genome-scale metabolic

More information

Laboratory Evolution of Robust and Enantioselective Baeyer-Villiger Monooxygenases for Asymmetric Catalysis

Laboratory Evolution of Robust and Enantioselective Baeyer-Villiger Monooxygenases for Asymmetric Catalysis Laboratory Evolution of Robust and Enantioselective Baeyer-Villiger Monooxygenases for Asymmetric Catalysis Induced fit docking model Manfred T. Reetz* and Sheng Wu Max-Planck-Institut für Kohlenforschung

More information

Introduction to Bioinformatics

Introduction to Bioinformatics Introduction to Bioinformatics If the 19 th century was the century of chemistry and 20 th century was the century of physic, the 21 st century promises to be the century of biology...professor Dr. Satoru

More information

Protein NMR II. Lecture 5

Protein NMR II. Lecture 5 Protein NMR II Lecture 5 Standard and NMR chemical shifts in proteins Residue N A A B O Ala 123.8 4.35 52.5 19.0 177.1 ys 118.8 4.65 58.8 28.6 174.8 Asp 120.4 4.76 54.1 40.8 177.2 Glu 120.2 4.29 56.7 29.7

More information

Pacific Symposium on Biocomputing 4: (1999)

Pacific Symposium on Biocomputing 4: (1999) Applications of Knowledge Discovery to Molecular Biology: Identifying Structural Regularities in Proteins Shaobing Su, Diane J. Cook, and Lawrence B. Holder University of Texas at Arlington sandy su@sabre.com,

More information

IB HL Biology Test: Topics 1 and 3

IB HL Biology Test: Topics 1 and 3 October 26, 2011 IB HL Biology Test: Topics 1 and 3 Multiple Choice Identify the choice that best completes the statement or answers the question. 1. What conditions must be met for the t-test to be applied?

More information

Cambridge International Examinations Cambridge International Advanced Subsidiary and Advanced Level

Cambridge International Examinations Cambridge International Advanced Subsidiary and Advanced Level Cambridge International Examinations Cambridge International Advanced Subsidiary and Advanced Level *2249654089* BIOLOGY 9700/21 Paper 2 AS Level Structured Questions October/November 2016 1 hour 15 minutes

More information

Granby Transcription and Translation Services plc

Granby Transcription and Translation Services plc ompany Resources ranby Transcription and Translation Services plc has invested heavily in the Protein Synthesis business. mongst the resources available to new recruits are: the latest cellphones which

More information

Supplementary Data for Monti, et al.

Supplementary Data for Monti, et al. Supplementary Data for Monti, et al. Supplementary Figure S1 Legend to Supplementary Figure S1 Tumor spectrum associated with germline p53 alleles (restricted to the 7 most frequent tissue targets). Structural

More information

A New Database of Genetic and. Molecular Pathways. Minoru Kanehisa. sequencing projects have been. Mbp) and for several bacteria including

A New Database of Genetic and. Molecular Pathways. Minoru Kanehisa. sequencing projects have been. Mbp) and for several bacteria including Toward Pathway Engineering: A New Database of Genetic and Molecular Pathways Minoru Kanehisa Institute for Chemical Research, Kyoto University From Genome Sequences to Functions The Human Genome Project

More information

Case 7 A Storage Protein From Seeds of Brassica nigra is a Serine Protease Inhibitor

Case 7 A Storage Protein From Seeds of Brassica nigra is a Serine Protease Inhibitor Case 7 A Storage Protein From Seeds of Brassica nigra is a Serine Protease Inhibitor Focus concept Purification of a novel seed storage protein allows sequence analysis and determination of the protein

More information

From code to translation

From code to translation From code to translation What could be the role of the first peptides? Ádám Kun & Ádám Radványi Dpt. Plant Systematics, Ecology and Theoretical Biology, Eötvös University, Budapest, Hungary Parmenides

More information

MATH 5610, Computational Biology

MATH 5610, Computational Biology MATH 5610, Computational Biology Lecture 2 Intro to Molecular Biology (cont) Stephen Billups University of Colorado at Denver MATH 5610, Computational Biology p.1/24 Announcements Error on syllabus Class

More information

Bioinformatika a výpočetní biologie. KFC/BIN VIII. Systémová biologie

Bioinformatika a výpočetní biologie. KFC/BIN VIII. Systémová biologie Bioinformatika a výpočetní biologie KFC/BIN VIII. Systémová biologie RNDr. Karel Berka, Ph.D. Univerzita Palackého v Olomouci Syllabus Pathway Analysis KEGG Whole cell simulation Pathway analysis Caveats

More information

Cambridge International Examinations Cambridge International Advanced Subsidiary and Advanced Level

Cambridge International Examinations Cambridge International Advanced Subsidiary and Advanced Level ambridge International Examinations ambridge International Advanced Subsidiary and Advanced Level *8744875516* BIOLOGY 9700/22 Paper 2 AS Level Structured Questions October/November 2016 1 hour 15 minutes

More information

Key Concept Translation converts an mrna message into a polypeptide, or protein.

Key Concept Translation converts an mrna message into a polypeptide, or protein. 8.5 Translation VOBLRY translation codon stop codon start codon anticodon Key oncept Translation converts an mrn message into a polypeptide, or protein. MIN IDES mino acids are coded by mrn base sequences.

More information

Today He who asks is a fool for five minutes, but he who does not ask remains a fool forever.

Today He who asks is a fool for five minutes, but he who does not ask remains a fool forever. CSE527 Computational Biology http://www.cs.washington.edu/527 Larry Ruzzo Autumn 2009 UW CSE Computational Biology Group Today He who asks is a fool for five minutes, but he who does not ask remains a

More information

Cambridge International Examinations Cambridge International Advanced Subsidiary and Advanced Level

Cambridge International Examinations Cambridge International Advanced Subsidiary and Advanced Level ambridge International Examinations ambridge International Advanced Subsidiary and Advanced Level *8744875516* BIOLOGY 9700/22 Paper 2 AS Level Structured Questions October/November 2016 1 hour 15 minutes

More information

Finding your Future. 10/21/2014 ECE Graduate Seminar

Finding your Future. 10/21/2014 ECE Graduate Seminar Finding your Future H. Scott Hinton, FIEEE, FOSA Professor, ECE Department President, Utah State University Research Foundation Director, USTAR Synthetic Biomanufacturing Institute 1 Following Opportunities

More information

Grand Challenges in Computational Biology

Grand Challenges in Computational Biology Grand Challenges in Computational Biology Kimmen Sjölander UC Berkeley Reconstructing the Tree of Life CITRIS-INRIA workshop 24 May, 2011 Prediction of biological pathways and networks Human microbiome

More information

CS 4491/CS 7990 SPECIAL TOPICS IN BIOINFORMATICS

CS 4491/CS 7990 SPECIAL TOPICS IN BIOINFORMATICS 1 CS 4491/CS 7990 SPECIAL TOPICS IN BIOINFORMATICS * Some contents are adapted from Dr. Jean Gao at UT Arlington Mingon Kang, PhD Computer Science, Kennesaw State University 2 Genetics The discovery of

More information

Supplemental Table 1. Amino acid sequences of synthetic kisspeptins

Supplemental Table 1. Amino acid sequences of synthetic kisspeptins Supplemental Data Supplemental Table 1. Amino acid sequences of synthetic kisspeptins Kisspeptins Symbol Sequence Human kisspeptin-10 H-10 Tyr-Asn-Trp-Asn-Ser-Phe-Gly-Leu-Arg-Phe-NH 2 Rodent/Xenopus 1a

More information

ORFs and genes. Please sit in row K or forward

ORFs and genes. Please sit in row K or forward ORFs and genes Please sit in row K or forward https://www.flickr.com/photos/teseum/3231682806/in/photostream/ Question: why do some strains of Vibrio cause cholera and others don t? Methods Mechanisms

More information

Basic Concepts of Human Genetics

Basic Concepts of Human Genetics Basic Concepts of Human Genetics The genetic information of an individual is contained in 23 pairs of chromosomes. Every human cell contains the 23 pair of chromosomes. One pair is called sex chromosomes

More information

BME205: Lecture 2 Bio systems. David Bernick

BME205: Lecture 2 Bio systems. David Bernick BME205: Lecture 2 Bio systems David Bernick Bioinforma;cs Infer pa>erns from life biological sequences structures molecular pathways. Suggest hypotheses from inferred pa>erns Structure and Func;on Novel

More information

KEGG: Kyoto Encyclopedia of Genes and Genomes

KEGG: Kyoto Encyclopedia of Genes and Genomes 1999 Oxford University Press Nucleic Acids Research, 1999, Vol. 27, No. 1 29 34 KEGG: Kyoto Encyclopedia of Genes and Genomes Hiroyuki Ogata, Susumu Goto, Kazushige Sato, Wataru Fujibuchi, Hidemasa Bono

More information

Following text taken from Suresh Kumar. Bioinformatics Web - Comprehensive educational resource on Bioinformatics. 6th May.2005

Following text taken from Suresh Kumar. Bioinformatics Web - Comprehensive educational resource on Bioinformatics. 6th May.2005 Bioinformatics is the recording, annotation, storage, analysis, and searching/retrieval of nucleic acid sequence (genes and RNAs), protein sequence and structural information. This includes databases of

More information

ECS 129: Structural Bioinformatics March 15, 2016

ECS 129: Structural Bioinformatics March 15, 2016 Notes: ES 129: Structural Bioinformatics March 15, 2016 1) The final exam is open book, open notes. 2) The final is divided into 2 parts, and graded over 100 points (with 8 points extra credit) 3) You

More information

Thr Gly Tyr. Gly Lys Asn

Thr Gly Tyr. Gly Lys Asn Your unique body characteristics (traits), such as hair color or blood type, are determined by the proteins your body produces. Proteins are the building blocks of life - in fact, about 45% of the human

More information

ABSTRACT COMPUTER EVOLUTION OF GENE CIRCUITS FOR CELL- EMBEDDED COMPUTATION, BIOTECHNOLOGY AND AS A MODEL FOR EVOLUTIONARY COMPUTATION

ABSTRACT COMPUTER EVOLUTION OF GENE CIRCUITS FOR CELL- EMBEDDED COMPUTATION, BIOTECHNOLOGY AND AS A MODEL FOR EVOLUTIONARY COMPUTATION ABSTRACT COMPUTER EVOLUTION OF GENE CIRCUITS FOR CELL- EMBEDDED COMPUTATION, BIOTECHNOLOGY AND AS A MODEL FOR EVOLUTIONARY COMPUTATION by Tommaso F. Bersano-Begey Chair: John H. Holland This dissertation

More information

Additional Case Study: Amino Acids and Evolution

Additional Case Study: Amino Acids and Evolution Student Worksheet Additional Case Study: Amino Acids and Evolution Objectives To use biochemical data to determine evolutionary relationships. To test the hypothesis that living things that are morphologically

More information