Structural bioinformatics

Size: px
Start display at page:

Download "Structural bioinformatics"

Transcription

1 Structural bioinformatics Why structures? The representation of the molecules in 3D is more informative New properties of the molecules are revealed, which can not be detected by sequences Eran Eyal Plant Sciences Department Weizmann Institute of Science Similar sequence Similar sequence Similar structure Similar sequence Similar function Similar sequence Similar structure Source of data: Crystal structures NMR models Other PDB The PDB database is the main repository for the processing and distribution of 3D biological macromolecular structure data

2 PDB content growth XRay Crystallography Data Source Clone/Express/Purify Crystallize XRay diffraction data + Solve phase problem Interpret electron density map Coordinates of atoms in protein molecule

3 NMR Spectroscopy Data Source Xray crystallography NMR information about spatiallyclosed atoms list of distance constraints + dihedral angles constraints multiple models of protein structure Atomic resolution Good Reasonable Hydrogens Rarely determined Determined Molecule size No restriction Small proteins Dynamics Snapshot Multi models Membrane proteins Problematic Procedure Very long long Coordinates of atoms in protein molecule What information is included in the PDB? File Format Protein description Literature Data about the experiment Sequence Header section Structure (atomic coordinates) Connectivity Coordinate section

4 JRNL AUTH L.J.HARRIS,S.B.LARSON,K.W.HASEL,A.MCPHERSON JRNL TITL REFINED STRUCTURE OF AN INTACT IGG2A MONOCLONAL JRNL TITL 2 ANTIBODY HEADER IMMUNOGLOBULIN 25OCT96 1IGT JRNL REF BIOCHEMISTRY V COMPND MOLECULE: IGG2A INTACT ANTIBODY MAB231; JRNL REFN ASTM BICHAW US ISSN SOURCE MOUSE (MUS MUSCULUS, STRAIN BALB/C) KEYWDS INTACT IMMUNOGLOBULIN V REGION C REGION, IMMUNOGLOBULIN EXPDTA XRAY DIFFRACTION AUTHOR L.J.HARRIS,S.B.LARSON,K.W.HASEL,A.MCPHERSON REVDAT 1 07JUL97 1IGT 0 REMARK 2 RESOLUTION. 2.8 ANGSTROMS. REMARK 470 THE FOLLOWING RESIDUES HAVE MISSING ATOMS (M=MODEL NUMBER; REMARK 470 RES=RESIDUE NAME; C=CHAIN IDENTIFIER; SSEQ=SEQUENCE NUMBER; REMARK 470 I=INSERTION CODE): REMARK 470 M RES CSSEQI ATOMS REMARK 470 LEU A 6 CG CD1 CD2 REMARK 470 ARG A 8 CG CD NE CZ NH1 NH2 HELIX 1 1 PRO A 80 ASP A 82 5 SHEET 1 A 4 LEU A 4 SER A 7 0 SHEET 2 A 4 ILE A 19 HIS A 24 1 N HIS A 24 O THR A SHEET 3 A 4 GLY A 70 ILE A 75 1 N ILE A 75 O ILE A SHEET 4 A 4 PHE A 62 SER A 67 1 N SER A 67 O GLY A SSBOND 1 CYS A 23 CYS A 88 CRYST P 12

5 SEQRES SEQRES SEQRES 1 A 214 ASP ILE VAL LEU THR GLN SER PRO SER SER LEU SER 2 A 214 SER LEU GLY ASP THR ILE THR ILE THR CYS HIS ALA 3 A 214 GLN ASN ILE ASN VAL TRP LEU SER TRP TYR GLN GLN Atom Atom Res Res X Y Z Occ Bfact No name No ATOM 1 N ASP A ATOM 2 CA ASP A HET NAG D 1 26 HETNAM NAG NACETYLDGLUCOSAMINE FORMUL 5 NAG 8(C8 H15 N1 O6) HETATM 3568 CA CA HETATM 3569 O HOH HETATM 3570 O HOH HETATM 3571 O HOH HETATM 3572 O HOH CONECT CONECT CONECT Visualization Molecular graphics What do we need? Rotation & translation Color specific parts of the molecule Labeling of residues and atoms Geometrical measurements (distances & angles) Schematic representation: Atoms/Bonds/Secondary structures, Molecular surfaces Compare structures Saving pictures

6 Representation of molecules (1) Stickmodel Ball & Stick Ball size: 0 Stick size: 0.2 Ball size: 0.4 Stick size: 0.2 Molecular surfaces Spacefilled model Ball size: 0.8 Stick size: 0 Representation of molecules (2) Backbone only connections between Calpha atoms Schematic Surface helix cylinder strand arrow How to search in the PDB? The OCA browser developed in the WIS by Jaime Prilusky is the best interface to the PDB. Entries can be retrieved by variety of criteria such

7 Problems in the PDB database Missing data Quality of data Format problems residue numbers Independence of data is doubtful Structural analysis of proteins Examination of atomic interactions Examination of secondary structures Cavities Buried/exposed regions Analysis of ligands Topics in structural bioinformatics Structural alignment Structural classification Secondary structure prediction Structure prediction Molecular docking Molecular dynamics

8 Structural alignment why to compare protein structures? Structures are more conserved in evolution than sequences. Two homologous proteins have the same overall structure. It is possible that 2 proteins without detectable similarity will have the same structure. In the twilight zone of sequence similarity, structural alignment might help to correctly determine the relations between 2 proteins Structural similarity is therefore more sensitive method than sequence alignment to determine protein function What properties of protein might be used to detect structural similarity to other proteins? Structural classification All " All! sequence Type and number of secondary structures (sheets, helices) Structural arrangement of secondary structures Structural attributes of individual amino acids Distances between amino acids in the protein!/ "!+"

9 Secondary structure prediction Prediction of tertiary structures based on the amino acid sequence is still very difficult task. Prediction of more local structural properties is easier The most known classification databases are: SCOP CATH Prediction of secondary structures is important and more feasible Prediction of secondary structures is a bridge between the linear information and the 3D structure Programs in this field often employ different types of machine learning approaches ACHYTTEKRGGSGTKKREA Building 3D models of proteins ACHYTTEKRGGSGTKKREA HHHHHHHHOOOOOSSSSSS

10 Building by homology (Homology modelling) Fold recognition (Threading) Alignment with proteins of known structure The sequence: M A A G Y A V L S M A A A A A T S K G G G A Y F F Y A D E L Y G V V V V L I V L S D E S + Known protein folds structural model structural model Ab initio Building by homology The sequence M A A G Y A V L S There are millions of proteins but only several thousands different folds. If we can find a similar protein with a known structure we can use the fold of that structure as the basic template to the structure of our protein. structural model Positions of loop and side chains will be constructed in the second stage

11 Find proteins with known structure which are similar to your sequence build alignment Build structural model Check the model Finish Construction of loops might be done by: Using database of loops. The loops are classified according to their length, the geometry of their edges and their sequence Without any use of previous data, using physical and chemical principles

12 Several web pages for homology modeling COMPOSER felix.bioccam.ac.uksoftbase.html MODELLER guitar.rockefeller.edu/modeller/modeller.html WHAT IF SWISSMODEL SwissModel

13 Modeller Advanced program for homology modeling Based on distance constraints Implemented in several popular modelling packages such as InsightII The source is available for unix platforms at the above URL Threading (fold recognition) The input sequence is threaded on different folds from library of known folds Using scoring functions we get a score for the compatability between the sequence and the structure Statisticaly significant score tells that the input protein adopts similar 3D structure to that of the examined fold

14 This method is less accurate but could be applied for.more cases When the fold of our protein is not represented in the database we can not get a correct solution using this method. The most important part is the accuracy of the scoring function which evaluate the compatibility of a structure and a sequence. H bond donor H bond acceptor Glycin Hydrophobic Input: sequence Library of folds of known proteins H bond donor H bond acceptor Glycin Hydrophobic Web sites for fold recognition Profiles: 3DPSSM Libra I UCLA DOE Contact potentials 123D S=2 Z= 1 S=5 Z=1.5 S=20 Z=5 Profit

15 Abinitio methods for modelling Great theoretical interest but not practical The basic idea is to build empirical function that simulates real physical forces and potentials of chemical contacts If we will have perfect function and we will be able to scan all the possible conformations, then we will be able to detect the correct fold Docking: finding the binding orientation of two molecules with known structures According to the molecules involved: ProteinLigand docking ProteinProtein docking Specific docking algorithms usually designed to deal with one of these problems but not with both (different contact area, flexibility, level of representation, etc.) Local docking Global docking Why? Understanding interactions, roles of specific amino acids, design of mutations and changes of activity. Comparison of affinities of different molecules Drug design

Dr. R. Sankar, BSE 631 (2018)

Dr. R. Sankar, BSE 631 (2018) Pauling, Corey and Branson Diffraction of DNA http://www.nature.com/scitable/topicpage/dna-is-a-structure-that-encodes-biological-6493050 In short, stereochemistry is important in determining which helices

More information

Virtual bond representation

Virtual bond representation Today s subjects: Virtual bond representation Coordination number Contact maps Sidechain packing: is it an instrumental way of selecting and consolidating a fold? ASA of proteins Interatomic distances

More information

Computational Methods for Protein Structure Prediction

Computational Methods for Protein Structure Prediction Computational Methods for Protein Structure Prediction Ying Xu 2017/12/6 1 Outline introduction to protein structures the problem of protein structure prediction why it is possible to predict protein structures

More information

CFSSP: Chou and Fasman Secondary Structure Prediction server

CFSSP: Chou and Fasman Secondary Structure Prediction server Wide Spectrum, Vol. 1, No. 9, (2013) pp 15-19 CFSSP: Chou and Fasman Secondary Structure Prediction server T. Ashok Kumar Department of Bioinformatics, Noorul Islam College of Arts and Science, Kumaracoil

More information

Cryo-electron microscopy

Cryo-electron microscopy Cryo-electron microscopy Liao et al., Nature 504, 107 (2013) TRPV1 receptor (receptor for capsaicin making chili hot ) 3.4 Å resolution breaking side-chain resolution barrier (PDB: 3J5P) Protein Structure

More information

Homology Modelling. Thomas Holberg Blicher NNF Center for Protein Research University of Copenhagen

Homology Modelling. Thomas Holberg Blicher NNF Center for Protein Research University of Copenhagen Homology Modelling Thomas Holberg Blicher NNF Center for Protein Research University of Copenhagen Why are Protein Structures so Interesting? They provide a detailed picture of interesting biological features,

More information

Structural Bioinformatics (C3210) Conformational Analysis Protein Folding Protein Structure Prediction

Structural Bioinformatics (C3210) Conformational Analysis Protein Folding Protein Structure Prediction Structural Bioinformatics (C3210) Conformational Analysis Protein Folding Protein Structure Prediction Conformational Analysis 2 Conformational Analysis Properties of molecules depend on their three-dimensional

More information

Fundamentals of Protein Structure

Fundamentals of Protein Structure Outline Fundamentals of Protein Structure Yu (Julie) Chen and Thomas Funkhouser Princeton University CS597A, Fall 2005 Protein structure Primary Secondary Tertiary Quaternary Forces and factors Levels

More information

Programme Good morning and summary of last week Levels of Protein Structure - I Levels of Protein Structure - II

Programme Good morning and summary of last week Levels of Protein Structure - I Levels of Protein Structure - II Programme 8.00-8.10 Good morning and summary of last week 8.10-8.30 Levels of Protein Structure - I 8.30-9.00 Levels of Protein Structure - II 9.00-9.15 Break 9.15-11.15 Exercise: Building a protein model

More information

Protein Structure Prediction

Protein Structure Prediction Homology Modeling Protein Structure Prediction Ingo Ruczinski M T S K G G G Y F F Y D E L Y G V V V V L I V L S D E S Department of Biostatistics, Johns Hopkins University Fold Recognition b Initio Structure

More information

Residue Contact Prediction for Protein Structure using 2-Norm Distances

Residue Contact Prediction for Protein Structure using 2-Norm Distances Residue Contact Prediction for Protein Structure using 2-Norm Distances Nikita V Mahajan Department of Computer Science &Engg GH Raisoni College of Engineering, Nagpur LGMalik Department of Computer Science

More information

6-Foot Mini Toober Activity

6-Foot Mini Toober Activity Big Idea The interaction between the substrate and enzyme is highly specific. Even a slight change in shape of either the substrate or the enzyme may alter the efficient and selective ability of the enzyme

More information

Basic concepts of molecular biology

Basic concepts of molecular biology Basic concepts of molecular biology Gabriella Trucco Email: gabriella.trucco@unimi.it Life The main actors in the chemistry of life are molecules called proteins nucleic acids Proteins: many different

More information

Homology Modelling. Thomas Holberg Blicher NNF Center for Protein Research University of Copenhagen

Homology Modelling. Thomas Holberg Blicher NNF Center for Protein Research University of Copenhagen Homology Modelling Thomas Holberg Blicher NNF Center for Protein Research University of Copenhagen Why are Protein Structures so Interesting? They provide a detailed picture of interesting biological features,

More information

Molecular Structures

Molecular Structures Molecular Structures 1 Molecular structures 2 Why is it important? Answers to scientific questions such as: What does the structure of protein X look like? Can we predict the binding of molecule X to Y?

More information

Molecular Structures

Molecular Structures Molecular Structures 1 Molecular structures 2 Why is it important? Answers to scientific questions such as: What does the structure of protein X look like? Can we predict the binding of molecule X to Y?

More information

Traditional approaches to 3-D structure determination

Traditional approaches to 3-D structure determination Homology Modeling: A Brief Introduction W. Ross Ellington Biology-IMB Traditional approaches to 3-D structure determination X-ray crystallography: Sufficient protein? Natural protein is often heterogeneous

More information

LIST OF ACRONYMS & ABBREVIATIONS

LIST OF ACRONYMS & ABBREVIATIONS LIST OF ACRONYMS & ABBREVIATIONS ALA ARG ASN ATD CRD CYS GLN GLU GLY GPCR HIS hstr ILE LEU LYS MET mglur1 NHDC PDB PHE PRO SER T1R2 T1R3 TMD TRP TYR THR 7-TM VFTM ZnSO 4 Alanine, A Arginine, R Asparagines,

More information

Packing of Secondary Structures

Packing of Secondary Structures 7.88 Lecture Notes - 5 7.24/7.88J/5.48J The Protein Folding and Human Disease Packing of Secondary Structures Packing of Helices against sheets Packing of sheets against sheets Parallel Orthogonal Table:

More information

Algorithms in Bioinformatics ONE Transcription Translation

Algorithms in Bioinformatics ONE Transcription Translation Algorithms in Bioinformatics ONE Transcription Translation Sami Khuri Department of Computer Science San José State University sami.khuri@sjsu.edu Biology Review DNA RNA Proteins Central Dogma Transcription

More information

Problem: The GC base pairs are more stable than AT base pairs. Why? 5. Triple-stranded DNA was first observed in 1957. Scientists later discovered that the formation of triplestranded DNA involves a type

More information

Comparative Modeling Part 1. Jaroslaw Pillardy Computational Biology Service Unit Cornell Theory Center

Comparative Modeling Part 1. Jaroslaw Pillardy Computational Biology Service Unit Cornell Theory Center Comparative Modeling Part 1 Jaroslaw Pillardy Computational Biology Service Unit Cornell Theory Center Function is the most important feature of a protein Function is related to structure Structure is

More information

03-511/711 Computational Genomics and Molecular Biology, Fall

03-511/711 Computational Genomics and Molecular Biology, Fall 03-511/711 Computational Genomics and Molecular Biology, Fall 2011 1 Problem Set 0 Due Tuesday, September 6th This homework is intended to be a self-administered placement quiz, to help you (and me) determine

More information

Structure formation and association of biomolecules. Prof. Dr. Martin Zacharias Lehrstuhl für Molekulardynamik (T38) Technische Universität München

Structure formation and association of biomolecules. Prof. Dr. Martin Zacharias Lehrstuhl für Molekulardynamik (T38) Technische Universität München Structure formation and association of biomolecules Prof. Dr. Martin Zacharias Lehrstuhl für Molekulardynamik (T38) Technische Universität München Motivation Many biomolecules are chemically synthesized

More information

Molecular Modeling 9. Protein structure prediction, part 2: Homology modeling, fold recognition & threading

Molecular Modeling 9. Protein structure prediction, part 2: Homology modeling, fold recognition & threading Molecular Modeling 9 Protein structure prediction, part 2: Homology modeling, fold recognition & threading The project... Remember: You are smarter than the program. Inspecting the model: Are amino acids

More information

Docking. Why? Docking : finding the binding orientation of two molecules with known structures

Docking. Why? Docking : finding the binding orientation of two molecules with known structures Docking : finding the binding orientation of two molecules with known structures Docking According to the molecules involved: Protein-Ligand docking Protein-Protein docking Specific docking algorithms

More information

Basic concepts of molecular biology

Basic concepts of molecular biology Basic concepts of molecular biology Gabriella Trucco Email: gabriella.trucco@unimi.it What is life made of? 1665: Robert Hooke discovered that organisms are composed of individual compartments called cells

More information

BIRKBECK COLLEGE (University of London)

BIRKBECK COLLEGE (University of London) BIRKBECK COLLEGE (University of London) SCHOOL OF BIOLOGICAL SCIENCES M.Sc. EXAMINATION FOR INTERNAL STUDENTS ON: Postgraduate Certificate in Principles of Protein Structure MSc Structural Molecular Biology

More information

Immune system IgGs. Carla Cortinas, Eva Espigulé, Guillem Lopez-Grado, Margalida Roig, Valentina Salas. Group 2

Immune system IgGs. Carla Cortinas, Eva Espigulé, Guillem Lopez-Grado, Margalida Roig, Valentina Salas. Group 2 Immune system IgGs Carla Cortinas, Eva Espigulé, Guillem Lopez-Grado, Margalida Roig, Valentina Salas Group 2 Index 1. Introduction 1.1. 1.2. 1.3. 1.4. 2. Immunoglobulins IgG formation IgG subclasses Structural

More information

Problem Set Unit The base ratios in the DNA and RNA for an onion (Allium cepa) are given below.

Problem Set Unit The base ratios in the DNA and RNA for an onion (Allium cepa) are given below. Problem Set Unit 3 Name 1. Which molecule is found in both DNA and RNA? A. Ribose B. Uracil C. Phosphate D. Amino acid 2. Which molecules form the nucleotide marked in the diagram? A. phosphate, deoxyribose

More information

Suppl. Figure 1: RCC1 sequence and sequence alignments. (a) Amino acid

Suppl. Figure 1: RCC1 sequence and sequence alignments. (a) Amino acid Supplementary Figures Suppl. Figure 1: RCC1 sequence and sequence alignments. (a) Amino acid sequence of Drosophila RCC1. Same colors are for Figure 1 with sequence of β-wedge that interacts with Ran in

More information

In silico measurements of twist and bend. moduli for beta solenoid protein self-

In silico measurements of twist and bend. moduli for beta solenoid protein self- In silico measurements of twist and bend moduli for beta solenoid protein self- assembly units Leonard P. Heinz, Krishnakumar M. Ravikumar, and Daniel L. Cox Department of Physics and Institute for Complex

More information

Protein 3D Structure Prediction

Protein 3D Structure Prediction Protein 3D Structure Prediction Michael Tress CNIO ?? MREYKLVVLGSGGVGKSALTVQFVQGIFVDE YDPTIEDSYRKQVEVDCQQCMLEILDTAGTE QFTAMRDLYMKNGQGFALVYSITAQSTFNDL QDLREQILRVKDTEDVPMILVGNKCDLEDER VVGKEQGQNLARQWCNCAFLESSAKSKINVN

More information

Protein NMR II. Lecture 5

Protein NMR II. Lecture 5 Protein NMR II Lecture 5 Standard and NMR chemical shifts in proteins Residue N A A B O Ala 123.8 4.35 52.5 19.0 177.1 ys 118.8 4.65 58.8 28.6 174.8 Asp 120.4 4.76 54.1 40.8 177.2 Glu 120.2 4.29 56.7 29.7

More information

Amino Acids and Proteins

Amino Acids and Proteins Various Functions of Proteins SB203 Amino Acids and Proteins Jirundon Yuvaniyama, Ph.D. Department of Biochemistry Faculty of Science Mahidol University Enzymes Transport proteins utrient and storage proteins

More information

Ali Yaghi. Tamara Wahbeh. Mamoun Ahram

Ali Yaghi. Tamara Wahbeh. Mamoun Ahram 28 Ali Yaghi Tamara Wahbeh Mamoun Ahram This sheet is a continuation of protein purification methods. Isoelectric focusing Separation of proteins based on Isoelectric points(charge),and it is a horizontal

More information

Ch Biophysical Chemistry

Ch Biophysical Chemistry Ch 247.53. Biophysical Chemistry Nina Rosario L. Rojas 2012-2013 sem 1 Review of Protein Structure Why structure? Primary, secondary, tertiary structure Disulfide bonds scheme 2 STRUCTURE- REGULAR STRUCTURE

More information

Pacific Symposium on Biocomputing 4: (1999)

Pacific Symposium on Biocomputing 4: (1999) Applications of Knowledge Discovery to Molecular Biology: Identifying Structural Regularities in Proteins Shaobing Su, Diane J. Cook, and Lawrence B. Holder University of Texas at Arlington sandy su@sabre.com,

More information

Introduction to protein structure analysis and prediction

Introduction to protein structure analysis and prediction Introduction to protein structure analysis and prediction Mónica Chagoyen monica.chagoyen@cnb.csic.es Protein sequence analysis and prediction service Centro Nacional de Biotecnologia (CNB-CSIC) 24-26

More information

Introduction to Proteins

Introduction to Proteins Introduction to Proteins Lecture 4 Module I: Molecular Structure & Metabolism Molecular Cell Biology Core Course (GSND5200) Matthew Neiditch - Room E450U ICPH matthew.neiditch@umdnj.edu What is a protein?

More information

Protein Structure Analysis

Protein Structure Analysis BINF 731 Protein Structure Analysis http://binf.gmu.edu/vaisman/binf731/ Secondary Structure: Computational Problems Secondary structure characterization Secondary structure assignment Secondary structure

More information

Bioinformatics. ONE Introduction to Biology. Sami Khuri Department of Computer Science San José State University Biology/CS 123A Fall 2012

Bioinformatics. ONE Introduction to Biology. Sami Khuri Department of Computer Science San José State University Biology/CS 123A Fall 2012 Bioinformatics ONE Introduction to Biology Sami Khuri Department of Computer Science San José State University Biology/CS 123A Fall 2012 Biology Review DNA RNA Proteins Central Dogma Transcription Translation

More information

Solving Structure Based Design Problems using Discovery Studio 1.7 Building a Flexible Docking Protocol

Solving Structure Based Design Problems using Discovery Studio 1.7 Building a Flexible Docking Protocol Solving Structure Based Design Problems using Discovery Studio 1.7 Building a Flexible Docking Protocol C. M. (Venkat) Venkatachalam Fellow, Life Sciences Dipesh Risal Marketing, Life Sciences Overview

More information

Bioinformatics & Protein Structural Analysis. Bioinformatics & Protein Structural Analysis. Learning Objective. Proteomics

Bioinformatics & Protein Structural Analysis. Bioinformatics & Protein Structural Analysis. Learning Objective. Proteomics The molecular structures of proteins are complex and can be defined at various levels. These structures can also be predicted from their amino-acid sequences. Protein structure prediction is one of the

More information

First&year&tutorial&in&Chemical&Biology&(amino&acids,&peptide&and&proteins)&! 1.&!

First&year&tutorial&in&Chemical&Biology&(amino&acids,&peptide&and&proteins)&! 1.&! First&year&tutorial&in&Chemical&Biology&(amino&acids,&peptide&and&proteins& 1.& a. b. c. d. e. 2.& a. b. c. d. e. f. & UsingtheCahn Ingold Prelogsystem,assignstereochemicaldescriptorstothe threeaminoacidsshownbelow.

More information

Biochemistry Prof. S. DasGupta Department of Chemistry Indian Institute of Technology Kharagpur. Lecture - 5 Protein Structure - III

Biochemistry Prof. S. DasGupta Department of Chemistry Indian Institute of Technology Kharagpur. Lecture - 5 Protein Structure - III Biochemistry Prof. S. DasGupta Department of Chemistry Indian Institute of Technology Kharagpur Lecture - 5 Protein Structure - III This is lecture number three on protein structure. (Refer Slide Time:

More information

1) The penicillin family of antibiotics, discovered by Alexander Fleming in 1928, has the following general structure: O O

1) The penicillin family of antibiotics, discovered by Alexander Fleming in 1928, has the following general structure: O O ame: TF ame: LS1a Fall 06 Problem Set #3 Due Friday 10/13 at noon in your TF s drop box on the 2 nd floor of the Science Center All questions including the (*extra*) ones should be turned in 1) The penicillin

More information

From code to translation

From code to translation From code to translation What could be the role of the first peptides? Ádám Kun & Ádám Radványi Dpt. Plant Systematics, Ecology and Theoretical Biology, Eötvös University, Budapest, Hungary Parmenides

More information

Protein Data Bank and Structure Display with PyMOL

Protein Data Bank and Structure Display with PyMOL Protein Data Bank and Structure Display with PyMOL Ching-Shu Suen ( 孫慶姝 ) Supervisor: Dr. Ming-Jing Hwang ( 黃明經 ) Bioinformatics Service Support BioIT, IBMS ( 生醫所 ) Academia Sinica Outline About the RCSB

More information

Chapter 8. One-Dimensional Structural Properties of Proteins in the Coarse-Grained CABS Model. Sebastian Kmiecik and Andrzej Kolinski.

Chapter 8. One-Dimensional Structural Properties of Proteins in the Coarse-Grained CABS Model. Sebastian Kmiecik and Andrzej Kolinski. Chapter 8 One-Dimensional Structural Properties of Proteins in the Coarse-Grained CABS Model Abstract Despite the significant increase in computational power, molecular modeling of protein structure using

More information

1. DNA replication. (a) Why is DNA replication an essential process?

1. DNA replication. (a) Why is DNA replication an essential process? ame Section 7.014 Problem Set 3 Please print out this problem set and record your answers on the printed copy. Answers to this problem set are to be turned in to the box outside 68120 by 5:00pm on Friday

More information

Molecular design principles underlying β-strand swapping. in the adhesive dimerization of cadherins

Molecular design principles underlying β-strand swapping. in the adhesive dimerization of cadherins Supplementary information for: Molecular design principles underlying β-strand swapping in the adhesive dimerization of cadherins Jeremie Vendome 1,2,3,5, Shoshana Posy 1,2,3,5,6, Xiangshu Jin, 1,3 Fabiana

More information

Protein Structure Prediction. christian studer , EPFL

Protein Structure Prediction. christian studer , EPFL Protein Structure Prediction christian studer 17.11.2004, EPFL Content Definition of the problem Possible approaches DSSP / PSI-BLAST Generalization Results Definition of the problem Massive amounts of

More information

Research in Structural Bioinformatics and Molecular Biophysics. OUTLINE: What is it and why is it useful? EXAMPLES: b. Improving enzyme s function.

Research in Structural Bioinformatics and Molecular Biophysics. OUTLINE: What is it and why is it useful? EXAMPLES: b. Improving enzyme s function. Research in Structural Bioinformatics and Molecular Biophysics OUTLINE: What is it and why is it useful? EXAMPLES: a. Biomolecular surface story. b. Improving enzyme s function. c. Folding proteins. Alexey

More information

Unit 1. DNA and the Genome

Unit 1. DNA and the Genome Unit 1 DNA and the Genome Gene Expression Key Area 3 Vocabulary 1: Transcription Translation Phenotype RNA (mrna, trna, rrna) Codon Anticodon Ribosome RNA polymerase RNA splicing Introns Extrons Gene Expression

More information

11 questions for a total of 120 points

11 questions for a total of 120 points Your Name: BYS 201, Final Exam, May 3, 2010 11 questions for a total of 120 points 1. 25 points Take a close look at these tables of amino acids. Some of them are hydrophilic, some hydrophobic, some positive

More information

Supplementary Table 1: List of CH3 domain interface residues in the first chain (A) and

Supplementary Table 1: List of CH3 domain interface residues in the first chain (A) and Supplementary Tables Supplementary Table 1: List of CH3 domain interface residues in the first chain (A) and their side chain contacting residues in the second chain (B) a Interface Res. in Contacting

More information

1/4/18 NUCLEIC ACIDS. Nucleic Acids. Nucleic Acids. ECS129 Instructor: Patrice Koehl

1/4/18 NUCLEIC ACIDS. Nucleic Acids. Nucleic Acids. ECS129 Instructor: Patrice Koehl NUCLEIC ACIDS ECS129 Instructor: Patrice Koehl Nucleic Acids Nucleotides DNA Structure RNA Synthesis Function Secondary structure Tertiary interactions Wobble hypothesis DNA RNA Replication Transcription

More information

NUCLEIC ACIDS. ECS129 Instructor: Patrice Koehl

NUCLEIC ACIDS. ECS129 Instructor: Patrice Koehl NUCLEIC ACIDS ECS129 Instructor: Patrice Koehl Nucleic Acids Nucleotides DNA Structure RNA Synthesis Function Secondary structure Tertiary interactions Wobble hypothesis DNA RNA Replication Transcription

More information

Alpha-helices, beta-sheets and U-turns within a protein are stabilized by (hint: two words).

Alpha-helices, beta-sheets and U-turns within a protein are stabilized by (hint: two words). 1 Quiz1 Q1 2011 Alpha-helices, beta-sheets and U-turns within a protein are stabilized by (hint: two words) Value Correct Answer 1 noncovalent interactions 100% Equals hydrogen bonds (100%) Equals H-bonds

More information

BMB/Bi/Ch 170 Fall 2017 Problem Set 1: Proteins I

BMB/Bi/Ch 170 Fall 2017 Problem Set 1: Proteins I BMB/Bi/Ch 170 Fall 2017 Problem Set 1: Proteins I Please use ray-tracing feature for all the images you are submitting. Use either the Ray button on the right side of the command window in PyMOL or variations

More information

Supplementary Figure 1

Supplementary Figure 1 Supplementary Figure 1 2 Supplementary Figure 1: Sequence alignment of HsHSD17B8 and HsCBR4 of with KAR orthologs. The secondary structure elements as calculated by DSSP and residue numbers are displayed

More information

Solutions to Problem Set 1

Solutions to Problem Set 1 MIT Department of Biology 7.014 Introductory Biology, Spring 004 Question 1 Solutions to 7.014 Problem Set 1 a) Describe the conditions of the atmosphere on prebiotic earth and how these conditions differ

More information

Protein Structure Prediction by Constraint Logic Programming

Protein Structure Prediction by Constraint Logic Programming MPRI C2-19 Protein Structure Prediction by Constraint Logic Programming François Fages, Constraint Programming Group, INRIA Rocquencourt mailto:francois.fages@inria.fr http://contraintes.inria.fr/ Molecules

More information

Supplementary Figure 1. Electron microscopy of gb-698glyco/1g2 Fab complex. a)

Supplementary Figure 1. Electron microscopy of gb-698glyco/1g2 Fab complex. a) Supplementary Figure 1. Electron microscopy of gb-698glyco/1g2 Fab complex. a) Representative images of 2D class averages of gb-698glyc bound to 1G2 Fab. Top views of the complex were underrepresented

More information

IV107 Bioinformatika I

IV107 Bioinformatika I IV107 Bioinformatika I Přednáška 5 Katedra informačních technologií Masarykova Univerzita Brno Jaro 2011 Předchozí týden Struktura genu prokaryotického eukaryotického Porovnání sekvencí globální (Needleman

More information

Solutions to 7.02 Quiz II 10/27/05

Solutions to 7.02 Quiz II 10/27/05 Solutions to 7.02 Quiz II 10/27/05 Class Average = 83 Standard Deviation = 9 Range Grade % 87-100 A 43 74-86 B 39 55-73 C 17 > 54 D 1 Question 1 (56 points) While studying deep sea bacteria, you discover

More information

CSE : Computational Issues in Molecular Biology. Lecture 19. Spring 2004

CSE : Computational Issues in Molecular Biology. Lecture 19. Spring 2004 CSE 397-497: Computational Issues in Molecular Biology Lecture 19 Spring 2004-1- Protein structure Primary structure of protein is determined by number and order of amino acids within polypeptide chain.

More information

Protein Folding Problem I400: Introduction to Bioinformatics

Protein Folding Problem I400: Introduction to Bioinformatics Protein Folding Problem I400: Introduction to Bioinformatics November 29, 2004 Protein biomolecule, macromolecule more than 50% of the dry weight of cells is proteins polymer of amino acids connected into

More information

Lecture 1 - Introduction to Structural Bioinformatics

Lecture 1 - Introduction to Structural Bioinformatics Lecture 1 - Introduction to Structural Bioinformatics Motivation and Basics of Protein Structure Prof. Haim J. Wolfson 1 Most of the Protein Structure slides courtesy of Hadar Benyaminy. Prof. Haim J.

More information

Supplementary Data for Monti, et al.

Supplementary Data for Monti, et al. Supplementary Data for Monti, et al. Supplementary Figure S1 Legend to Supplementary Figure S1 Tumor spectrum associated with germline p53 alleles (restricted to the 7 most frequent tissue targets). Structural

More information

Molecular Docking Study of Some Novel Nitroimidazo[1,2-b]pyridazine

Molecular Docking Study of Some Novel Nitroimidazo[1,2-b]pyridazine DOI:10.7598/cst2016.1227 Chemical Science Transactions ISSN:2278-3458 2016, 5(3), 700-710 RESEARCH ARTICLE Molecular Docking Study of Some Novel Nitroimidazo[1,2-b]pyridazine based Heterocyclics on Methicillin

More information

The Structure Lectures

The Structure Lectures The Structure Lectures Boris Steipe boris.steipe@utoronto.ca http://biochemistry.utoronto.ca/steipe Departments of Biochemistry and Molecular and Medical Genetics Program in Proteomics and Bioinformatics

More information

Amino Acid Sequences and Evolutionary Relationships

Amino Acid Sequences and Evolutionary Relationships Amino Acid Sequences and Evolutionary Relationships Pre-Lab Discussion Homologous structures -- those structures believed to have a common origin but not necessarily a common function -- provide some of

More information

Cristian Micheletti SISSA (Trieste)

Cristian Micheletti SISSA (Trieste) Cristian Micheletti SISSA (Trieste) michelet@sissa.it Mar 2009 5pal - parvalbumin Calcium-binding protein HEADER CALCIUM-BINDING PROTEIN 25-SEP-91 5PAL 5PAL 2 COMPND PARVALBUMIN (ALPHA LINEAGE) 5PAL 3

More information

Amino Acid Sequences and Evolutionary Relationships

Amino Acid Sequences and Evolutionary Relationships Amino Acid Sequences and Evolutionary Relationships One technique used to determine evolutionary relationships is to study the biochemical similarity of organisms. Though molds, aardvarks, and humans appear

More information

Description of Changes and Corrections for PDB File Format Version 4.0. Provisional Document April 12, 2011

Description of Changes and Corrections for PDB File Format Version 4.0. Provisional Document April 12, 2011 Description of Changes and Corrections for PDB File Format Version 4.0 Provisional Document April 12, 2011 The wwpdb has reviewed the PDB archive and created a new set of corrected files that will be released

More information

KEMM15 Lecture note in structural bioinformatics: A practical guide. S Al-Karadaghi, Biochemistry & Structural Biology, Lund University

KEMM15 Lecture note in structural bioinformatics: A practical guide. S Al-Karadaghi, Biochemistry & Structural Biology, Lund University KEMM15 Lecture note in structural bioinformatics: A practical guide S Al-Karadaghi, Biochemistry & Structural Biology, Lund University 1 BASICS OF PROTEIN STRUCTURE 3 SOME DEFINITIONS 3 THE 20 AMINO ACIDS

More information

7.014 Solution Set 4

7.014 Solution Set 4 7.014 Solution Set 4 Question 1 Shown below is a fragment of the sequence of a hypothetical bacterial gene. This gene encodes production of HWDWN, protein essential for metabolizing sugar yummose. The

More information

ECS 129: Structural Bioinformatics March 15, 2016

ECS 129: Structural Bioinformatics March 15, 2016 Notes: ES 129: Structural Bioinformatics March 15, 2016 1) The final exam is open book, open notes. 2) The final is divided into 2 parts, and graded over 100 points (with 8 points extra credit) 3) You

More information

BIOINFORMATICS Introduction

BIOINFORMATICS Introduction BIOINFORMATICS Introduction Mark Gerstein, Yale University bioinfo.mbb.yale.edu/mbb452a 1 (c) Mark Gerstein, 1999, Yale, bioinfo.mbb.yale.edu What is Bioinformatics? (Molecular) Bio -informatics One idea

More information

Aipotu II: Biochemistry

Aipotu II: Biochemistry Aipotu II: Biochemistry Introduction: The Biological Phenomenon Under Study In this lab, you will continue to explore the biological mechanisms behind the expression of flower color in a hypothetical plant.

More information

466 Asn (N) to Ala (A) Generate beta dimer Interface

466 Asn (N) to Ala (A) Generate beta dimer Interface Table S1: Amino acid changes to the HexA α-subunit to convert the dimer interface from α to β and to introduce the putative GM2A binding surface from β- onto the α- subunit Residue position (α-numbering)

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION Structure of a tyrosyl-trna synthetase splicing factor bound to a group I intron RNA Paul J. Paukstelis 1, Jui-Hui Chen 2, Elaine Chase 2, Alan M. Lambowitz 1,*, and Barbara L. Golden 2,*,. 1 Institute

More information

Proteins and their 3 D Structure

Proteins and their 3 D Structure Structural BioInformatics Laboratory: SBI Proteins and their 3 D Structure Embrapa Informática Agropecuária Cidade Universitária - UNICAMP Campinas, SP Sequence Structure Function Role Blast Lexical STING

More information

Amino Acid Sequences and Evolutionary Relationships. How do similarities in amino acid sequences of various species provide evidence for evolution?

Amino Acid Sequences and Evolutionary Relationships. How do similarities in amino acid sequences of various species provide evidence for evolution? Amino Acid Sequences and Evolutionary Relationships Name: How do similarities in amino acid sequences of various species provide evidence for evolution? An important technique used in determining evolutionary

More information

EE550 Computational Biology

EE550 Computational Biology EE550 Computational Biology Week 1 Course Notes Instructor: Bilge Karaçalı, PhD Syllabus Schedule : Thursday 13:30, 14:30, 15:30 Text : Paul G. Higgs, Teresa K. Attwood, Bioinformatics and Molecular Evolution,

More information

CS 4491/CS 7990 SPECIAL TOPICS IN BIOINFORMATICS

CS 4491/CS 7990 SPECIAL TOPICS IN BIOINFORMATICS 1 CS 4491/CS 7990 SPECIAL TOPICS IN BIOINFORMATICS * Some contents are adapted from Dr. Jean Gao at UT Arlington Mingon Kang, PhD Computer Science, Kennesaw State University 2 Genetics The discovery of

More information

Protein design. CS/CME/Biophys/BMI 279 Oct. 20 and 22, 2015 Ron Dror

Protein design. CS/CME/Biophys/BMI 279 Oct. 20 and 22, 2015 Ron Dror Protein design CS/CME/Biophys/BMI 279 Oct. 20 and 22, 2015 Ron Dror 1 Optional reading on course website From cs279.stanford.edu These reading materials are optional. They are intended to (1) help answer

More information

7.014 Problem Set 4 Answers to this problem set are to be turned in. Problem sets will not be accepted late. Solutions will be posted on the web.

7.014 Problem Set 4 Answers to this problem set are to be turned in. Problem sets will not be accepted late. Solutions will be posted on the web. MIT Department of Biology 7.014 Introductory Biology, Spring 2005 Name: Section : 7.014 Problem Set 4 Answers to this problem set are to be turned in. Problem sets will not be accepted late. Solutions

More information

2018 Protein Modeling Exam Key

2018 Protein Modeling Exam Key 2018 Protein Modeling Exam Key Multiple Choice: 1. Which of the following amino acids has a negative charge at ph 7? a. Gln b. Glu c. Ser d. Cys 2. Which of the following is an example of secondary structure?

More information

Dynamic Programming Algorithms

Dynamic Programming Algorithms Dynamic Programming Algorithms Sequence alignments, scores, and significance Lucy Skrabanek ICB, WMC February 7, 212 Sequence alignment Compare two (or more) sequences to: Find regions of conservation

More information

Nucleic acid and protein Flow of genetic information

Nucleic acid and protein Flow of genetic information Nucleic acid and protein Flow of genetic information References: Glick, BR and JJ Pasternak, 2003, Molecular Biotechnology: Principles and Applications of Recombinant DNA, ASM Press, Washington DC, pages.

More information

Collagen. 7.88J Protein Folding. Prof. David Gossard October 20, 2003

Collagen. 7.88J Protein Folding. Prof. David Gossard October 20, 2003 Collagen 7.88J Protein Folding Prof. David Gossard October 20, 2003 PDB Acknowledgements The Protein Data Bank (PDB - http://www.pdb.org/) is the single worldwide repository for the processing and distribution

More information

Zool 3200: Cell Biology Exam 3 3/6/15

Zool 3200: Cell Biology Exam 3 3/6/15 Name: Trask Zool 3200: Cell Biology Exam 3 3/6/15 Answer each of the following questions in the space provided; circle the correct answer or answers for each multiple choice question and circle either

More information

What s New in Discovery Studio 2.5.5

What s New in Discovery Studio 2.5.5 What s New in Discovery Studio 2.5.5 Discovery Studio takes modeling and simulations to the next level. It brings together the power of validated science on a customizable platform for drug discovery research.

More information

Retrieving and Viewing Protein Structures from the Protein Data Base

Retrieving and Viewing Protein Structures from the Protein Data Base Retrieving and Viewing Protein Structures from the Protein Data Base 7.88J Protein Folding Prof. David Gossard September 15, 2003 1 PDB Acknowledgements The Protein Data Bank (PDB - http://www.pdb.org/)

More information

Visualizing proteins with PyMol

Visualizing proteins with PyMol Visualizing proteins with PyMol Structural Bioinformatics GENOME 541 Spring 2018 Lecture 1: Protein Structure Frank DiMaio (dimaio@uw.edu) Obtaining PyMOL We will use an educational build of PyMOL that

More information

Computational Methods for Protein Structure Prediction and Fold Recognition... 1 I. Cymerman, M. Feder, M. PawŁowski, M.A. Kurowski, J.M.

Computational Methods for Protein Structure Prediction and Fold Recognition... 1 I. Cymerman, M. Feder, M. PawŁowski, M.A. Kurowski, J.M. Contents Computational Methods for Protein Structure Prediction and Fold Recognition........................... 1 I. Cymerman, M. Feder, M. PawŁowski, M.A. Kurowski, J.M. Bujnicki 1 Primary Structure Analysis...................

More information

Protein design. CS/CME/BioE/Biophys/BMI 279 Oct. 24, 2017 Ron Dror

Protein design. CS/CME/BioE/Biophys/BMI 279 Oct. 24, 2017 Ron Dror Protein design CS/CME/BioE/Biophys/BMI 279 Oct. 24, 2017 Ron Dror 1 Outline Why design proteins? Overall approach: Simplifying the protein design problem Protein design methodology Designing the backbone

More information