Folding simulation: self-organization of 4-helix bundle protein. yellow = helical turns

Similar documents
Level 2 Biology, 2017

Biomolecules: lecture 6

1. DNA, RNA structure. 2. DNA replication. 3. Transcription, translation

Fishy Amino Acid Codon. UUU Phe UCU Ser UAU Tyr UGU Cys. UUC Phe UCC Ser UAC Tyr UGC Cys. UUA Leu UCA Ser UAA Stop UGA Stop

Important points from last time

Biomolecules: lecture 6

Protein Synthesis. Application Based Questions

ANCIENT BACTERIA? 250 million years later, scientists revive life forms

Degenerate Code. Translation. trna. The Code is Degenerate trna / Proofreading Ribosomes Translation Mechanism

CONVERGENT EVOLUTION. Def n acquisition of some biological trait but different lineages

(a) Which enzyme(s) make 5' - 3' phosphodiester bonds? (c) Which enzyme(s) make single-strand breaks in DNA backbones?

Bioinformatics CSM17 Week 6: DNA, RNA and Proteins

iclicker Question #28B - after lecture Shown below is a diagram of a typical eukaryotic gene which encodes a protein: start codon stop codon 2 3

p-adic GENETIC CODE AND ULTRAMETRIC BIOINFORMATION

How life. constructs itself.

7.016 Problem Set 3. 1 st Pedigree

Human Gene,cs 06: Gene Expression. Diversity of cell types. How do cells become different? 9/19/11. neuron

A Zero-Knowledge Based Introduction to Biology

Honors packet Instructions

Chemistry 121 Winter 17

Just one nucleotide! Exploring the effects of random single nucleotide mutations

The combination of a phosphate, sugar and a base forms a compound called a nucleotide.

UNIT I RNA AND TYPES R.KAVITHA,M.PHARM LECTURER DEPARTMENT OF PHARMACEUTICS SRM COLLEGE OF PHARMACY KATTANKULATUR

Codon Bias with PRISM. 2IM24/25, Fall 2007

Deoxyribonucleic Acid DNA. Structure of DNA. Structure of DNA. Nucleotide. Nucleotides 5/13/2013

Chapter 10. The Structure and Function of DNA. Lectures by Edward J. Zalisko

Basic Biology. Gina Cannarozzi. 28th October Basic Biology. Gina. Introduction DNA. Proteins. Central Dogma.

PROTEIN SYNTHESIS Study Guide

Protein Synthesis: Transcription and Translation

Enduring Understanding

UNIT (12) MOLECULES OF LIFE: NUCLEIC ACIDS

Describe the features of a gene which enable it to code for a particular protein.

CISC 1115 (Science Section) Brooklyn College Professor Langsam. Assignment #6. The Genetic Code 1

CMPS 6630: Introduction to Computational Biology and Bioinformatics. Gene Prediction

Emergence of the Canonical Genetic Code

Bioinformation by Biomedical Informatics Publishing Group

BIOL591: Introduction to Bioinformatics Comparative genomes to look for genes responsible for pathogenesis

INTRODUCTION TO THE MOLECULAR GENETICS OF THE COLOR MUTATIONS IN ROCK POCKET MICE

Evolution of protein coding sequences

7.013 Problem Set

PGRP negatively regulates NOD-mediated cytokine production in rainbow trout liver cells

DNA sentences. How are proteins coded for by DNA? Materials. Teacher instructions. Student instructions. Reflection

PRINCIPLES OF BIOINFORMATICS

7.013 Exam Two

IMAGE HIDING IN DNA SEQUENCE USING ARITHMETIC ENCODING Prof. Samir Kumar Bandyopadhyay 1* and Mr. Suman Chakraborty

Molecular Level of Genetics

SUPPLEMENTARY INFORMATION

It has not escaped our notice that the specific paring we have postulated immediately suggest a possible copying mechanism for the genetic material

CHAPTER 12- RISE OF GENETICS I. DISCOVERY OF DNA A. GRIFFITH (1928) 11/15/2016

Lecture 19A. DNA computing

Keywords: DNA methylation, deamination, codon usage, genome, genomics

Today in Astronomy 106: the important polymers and from polymers to life

If stretched out, the DNA in chromosome 1 is roughly long.

Chapter 10. The Structure and Function of DNA. Lectures by Edward J. Zalisko

Problem Set 3

Today in Astronomy 106: polymers to life

Bioinformatics 1. Sepp Hochreiter. Biology, Sequences, Phylogenetics Part 1. Bioinformatics 1: Biology, Sequences, Phylogenetics

Bioinformatics of 18 Fungal Genomes

Chapter 3: Information Storage and Transfer in Life

Trends in the codon usage patterns of Chromohalobacter salexigens genes

Materials Protein synthesis kit. This kit consists of 24 amino acids, 24 transfer RNAs, four messenger RNAs and one ribosome (see below).

Daily Agenda. Warm Up: Review. Translation Notes Protein Synthesis Practice. Redos

Keywords: Staphylococcal phage, Synonymous codon usage, Translational selection, Mutational bias, Phage therapy.

comparing acrylamide gel patterns of restriction enzyme digests of plasmid pbr322 with those of pbr322/fpv2-22, the plasmid

Genes & Inheritance Series: Set 1. Copyright 2005 Version: 2.0

Mechanisms of Genetics

MOLECULAR EVOLUTION AND PHYLOGENETICS

The complete amino acid sequence of human fibroblast interferon as deduced using synthetic oligodeoxyribonucleotide primers of reverse transcriptase

Expression analysis of genes responsible for amino acid biosynthesis in halophilic bacterium Salinibacter ruber

Gene Prediction. Srivani Narra Indian Institute of Technology Kanpur

UNIT (12) MOLECULES OF LIFE: NUCLEIC ACIDS

Q1: Find the secret message in the sentences. Q2: Can I delete a w letter in these sentences?

DNA Base Data Hiding Algorithm Mohammad Reza Abbasy, Pourya Nikfard, Ali Ordi, and Mohammad Reza Najaf Torkaman

A 2D graphical representation of the sequences of DNA based on triplets and its application

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Translating the Genetic Code. DANILO V. ROGAYAN JR. Faculty, Department of Natural Sciences

DNA Begins the Process

Station 1: DNA Structure Use the figure above to answer each of the following questions. 1.This is the subunit that DNA is composed of. 2.

The Final Exam will be: Monday, May 17 9:00 am - 12:00 noon Johnson

Biology'AHA' 4 th 'Marking'Period'' Benchmark'Review'

Cloning and sequence analysis of cdna for rat angiotensinogen (angiotensin/recombinant DNA/DNA sequence/blood pressure)

Selection, history and chemistry: the three faces of the genetic code

7-9/99 Neuman Chapter 23

7.014 Quiz II 3/18/05. Write your name on this page and your initials on all the other pages in the space provided.

G+C content. 1 Introduction. 2 Chromosomes Topology & Counts. 3 Genome size. 4 Replichores and gene orientation. 5 Chirochores.

NAME:... MODEL ANSWER... STUDENT NUMBER:... Maximum marks: 50. Internal Examiner: Hugh Murrell, Computer Science, UKZN

Chapter 13 From Genes to Proteins

Worksheet: Mutations Practice

ORFs and genes. Please sit in row K or forward

Homework. A bit about the nature of the atoms of interest. Project. The role of electronega<vity

Chapter 7 DNA Structure and Gene Function

BIOLOGY. Gene Expression. Gene to Protein. Protein Synthesis Overview. The process in which the information coded in DNA is used to make proteins

1 How DNA Makes Stuff

The Genetic Code Degeneration I: Rules Governing the Code Degeneration and the Spatial Organization of the Codon Informative Properties.

Review of Central Dogma; Simple Mendelian Inheritance

Quiz 2 on Wednesday 4/3 from 11-noon. Bring IDs to exam. Review Session 4/1 from 7-9 pm in Tutoring Session 4/2 from 4-6 pm in

ENZYMES AND METABOLIC PATHWAYS

Flow of Genetic Information

Synonymous codon usage in Pseudomonas aeruginosa PA01

BIOSTAT516 Statistical Methods in Genetic Epidemiology Autumn 2005 Handout1, prepared by Kathleen Kerr and Stephanie Monks

Transcription:

Folding simulation: self-organization of 4-helix bundle protein yellow = helical turns

Protein structure Protein: heteropolymer chain made of amino acid residues R + H 3 N - C - COO - H φ ψ Chain of amino acid residues 20 different amino acids More than 50,000 different proteins in human body alone

Protein structure Hierarchical levels of structure: α-helix β-strand primary secondary tertiary myoglobin Linear chain of amino acids Local regular structures 3-D compact structure with long-range contacts The biological function is determined by shape. The shape is determined by primary sequence. How? ( Protein Folding Problem )

Protein Folding Protein folding : Primary sequence Native state S<0 Random coil Highly organized Compact 3-d structure Huge variation in the possible primary sequence: 20 N (20 different amino acids, N is # of amino acids in a chain) Most sequences do not fold; primary sequence must be carefully chosen Methods for finding primary sequences that fold to specific shapes: Evolution: trial and error, requires lots of time Engineering: Understand underlying principles of Self-organization

Protein Folding Problem Proteins are long (>50) chains of amino acid residues Biological functioning requires protein chain to fold to very specific compact shape: native state Chain is very flexible: each amino acid has internal degrees of freedom (Φ, Ψ, sidechain, e.g. 4 states each) > 64 configurations Ex: Myoglobin (153 amino acids) 64 153 = 10 276 configs Paradox: Even with super-fast sampling rate 10-12 sec/config 10 264 seconds (10 256 yrs) to randomly find native state. (degeneracy of native state reduces this to 10 118 years) Actual real protein folding times: milli-seconds!! How?? Folding must be a guided deterministic process, not random. Configuration space is frustrated, ultra-metric. Different initial configurations converge to native state. How does primary sequence determine folding dynamics. Interactions are non-linear: Anti-chaotic dynamics!?

Energy Landscape Funnel shaped, different initial configurations guide system to the same native state. Anti-Chaos? Is this a valid and useful approach? (B. Gerstman and Y. Garbourg, Journal of Polymer Science B: Polymer Physics, 36, 2761-2769, 1998.) -- 0 Many other axes are necessary to represent all the structural degrees of freedom. Which are most important?

Ultimate Physics Aim: Determine which aspects of 1-D sequence of amino acids in peptide chain determine efficient folding pathway and final shape (native state). Immediate Aim: Determine if formalism of non-linear dynamics is useful for investigating protein folding. This work: Can formalism of non-linear dynamics show that large scale un-folding is deterministic (and is it mathematically anti-chaotic) and distinguish random thermal fluctuations? Use data from lattice simulations of protein unfolding (realistic folding simulations of full proteins not available) First check to confirm that model realistically simulates protein dynamics. Compare results from model for characteristics that have been experimentally measured; e.g. Heat Capacity

Why use computer model? The system is complex - Huge number of degrees of structural freedom - Many terms in the Hamiltonian - System is not solvable analytically Monte Carlo simulations are very useful for these kinds of systems - Interested in relaxation times (non-equilibrium dynamics), as well as final configurations (equilibrium).

Lattice model and interaction Hamiltonian Red: backbone Green: side chain Interaction Hamiltonian: H = i j> i a ij ss p E ij ssp + a ij bb E bb + a ij rep E rep close enough contact (or preferred state)? Yes: a = 1; No: a = 0. ss : sidechain-sidechain bb: backbone-backbone l : local m : cooperative p : hydrophobic or polar or hydrophilic + l a l i E l + a m i E m m

Protein Configuration Energy Determined by Interaction Hamiltonian i,j : amino acid residue number in the primary sequence. a ss ij : are sidechains of i and j close enough to interact; yes = 1, no = 0. E ij ssp : sidechain-sidechain energy (p = 1 hydrophobic-hydrophobic, p = 2 hydrophilic-hydrophilic, p = 3 hydrophobic-hydrophilic). a ij bb : are backbones i and j close enough to interact; y=1, n=0. E bb : a l i : E l : a m i : E m : backbone-backbone interaction energy ( hydrogen bond, dipole, soft core repulsion combined together) are residues i-1, i, i+1, arranged so that i is in its preferred user-defined local configuration (i.e. α-helix, β-sheet, turn); y=1, n=0. local propensity energy. are residues i-1, i, i+1, i+2 arranged so that i and i+1 are both in the same preferred local configuration; y=1, n=0 medium range (cooperative) propensity energy

Nucleus DNA

DNA Packaging Inside Nucleus Nucleosome Supercoil Chromosome Protein Scaffold

Double Helix (Partially Disrupted) 2 nanometers 1 turn = 10 base pairs = 3.4 nanometers Minor Groove Major Groove

NUCLEOTIDES Phosphate Base Sugar

TRIPLET CODONS

GENETIC CODE Initiation Codon Termination Codons U C A G UUU Phe UCU Ser UAU Tyr UGU Cys U UUC Phe UCC Ser UAC Tyr UGC Cys UUA Leu UCA Ser UAA UGA UUG Leu UCG Ser UGA UAG UGG Trp First Base in Codon C CUU Leu CCU Pro CAU His CGU Arg CUC Leu CCC Pro CAC His CGC Arg CUA Leu CCA Pro CAA Gln CGA Arg CUG Leu CCG Pro CAG Gln CGG Arg Third Base in Codon AUU Ile ACU Thr AAU Asn AGU Ser A AUC Ile ACC Thr AAC Asn AGC Ser AUA Ile ACA Thr AAA Lys AGA Arg AUG Met ACG Thr AAG Lys AGG Arg GUU Val GCU Ala GAU Asp GGU Gly G GUC Val GCC Ala GAC Asp GGC Gly GUA Val GCA Ala GAA Glu GGA Gly GUG Val GCG Ala GAG Glu GGG Gly