Retrieving and Viewing Protein Structures from the Protein Data Base

Similar documents
Ligand Depot: A Data Warehouse for Ligands Bound to Macromolecules

Collagen. 7.88J Protein Folding. Prof. David Gossard October 20, 2003

If you wish to have extra practice with swiss pdb viewer or to familiarize yourself with how to use the program here is a tutorial:


Protein Bioinformatics Part I: Access to information

Protein structure. Wednesday, October 4, 2006

CS612 - Algorithms in Bioinformatics

Computational methods in bioinformatics: Lecture 1

Structural bioinformatics

DNA Repair Protein Exercise

Bioinformatics Tools. Stuart M. Brown, Ph.D Dept of Cell Biology NYU School of Medicine

Searching and Analysing 3D Protein-ligand Structures using PDBe web services. Abhik Mukhopadhyay.

Biochemistry Proteins

DNA Glycosylase Exercise

Molecular Modeling Lecture 8. Local structure Database search Multiple alignment Automated homology modeling

Exploring Proteins and Nucleic Acids with Jmol

Are specialized servers better at predicting protein structures than stand alone software?

BIOLOGY 200 Molecular Biology Students registered for the 9:30AM lecture should NOT attend the 4:30PM lecture.

Bioinformatics of the Green Fluorescent Proteins

BMB/Bi/Ch 170 Fall 2017 Problem Set 1: Proteins I

Protein Structure Databases, cont. 11/09/05

A Neural Network Method for Identification of RNA-Interacting Residues in Protein

CSE : Computational Issues in Molecular Biology. Lecture 19. Spring 2004

Biochemistry Prof. S. DasGupta Department of Chemistry Indian Institute of Technology Kharagpur. Lecture - 5 Protein Structure - III

Molecular Structures

Project Manual Bio3055. Lung Cancer: Cytochrome P450 1A1

Molecular Structures

Introduction to Bioinformatics

Structure formation and association of biomolecules. Prof. Dr. Martin Zacharias Lehrstuhl für Molekulardynamik (T38) Technische Universität München

Comparative Modeling Part 1. Jaroslaw Pillardy Computational Biology Service Unit Cornell Theory Center

Packing of Secondary Structures

ELE4120 Bioinformatics. Tutorial 5

Giri Narasimhan. CAP 5510: Introduction to Bioinformatics. ECS 254; Phone: x3748

Biological databases an introduction

Introduction to Bioinformatics Introduction to Bioinformatics

Molecular Modeling 9. Protein structure prediction, part 2: Homology modeling, fold recognition & threading

Computational Analysis of Broad Complex Zinc-Finger Transcription Factors

CMSE 520 BIOMOLECULAR STRUCTURE, FUNCTION AND DYNAMICS

CS273: Algorithms for Structure Handout # 5 and Motion in Biology Stanford University Tuesday, 13 April 2004

This practical aims to walk you through the process of text searching DNA and protein databases for sequence entries.

BIRKBECK COLLEGE (University of London)

Plant Proteomics Tutorial and Online Resources. Manish Raizada University of Guelph

What s New in Discovery Studio 2.5.5

Homology Modelling. Thomas Holberg Blicher NNF Center for Protein Research University of Copenhagen

Extracting Database Properties for Sequence Alignment and Secondary Structure Prediction

Bioinformatics Prof. M. Michael Gromiha Department of Biotechnology Indian Institute of Technology, Madras. Lecture - 5a Protein sequence databases

Hmwk 6. Nucleic Acids

GMOL: An Interactive Tool for 3D Genome Structure Visualization

G4120: Introduction to Computational Biology

Project Manual Bio3055. Lung Cancer: K-Ras 2

Protein Folding Problem I400: Introduction to Bioinformatics

Open the modeling database of three-dimensional protein structures to the public in the web on a worldwide level

Two Mark question and Answers

Problem Set #1

BIOINF525: INTRODUCTION TO BIOINFORMATICS LAB SESSION 1

Computational Analysis of Broad Complex Zinc- Finger Transcription Factors

Getting to know RNA Polymerase: A Tutorial For Exploring Proteins in VMD By Sharlene Denos (UIUC)

(Refer Slide Time: 00:15)

Lecture 1 - Introduction to Structural Bioinformatics

Exploring Suboptimal Sequence Alignments and Scoring Functions in Comparative Protein Structural Modeling

Introduction to molecular docking

DNA and RNA are both made of nucleotides. Proteins are made of amino acids. Transcription can be reversed but translation cannot.

NCBI web resources I: databases and Entrez

Structural Bioinformatics (C3210) Conformational Analysis Protein Folding Protein Structure Prediction

COMPARATIVE ANALYSIS AND INSILICO CHARACTERIZATION OF LEA, DREB AND OsDHODH1 GENES INVOLVED IN DROUGHT RESISTANCE OF ORYZA SATIVA

Highly-efficient T4 DNA ligase-based SNP analysis using a ligation fragment containing a modified nucleobase at the end

ProBiS: a web server for detection of structurally similar protein binding sites

How is smoking linked to lung cancer? Learn about one of the genetic risk factors for lung cancer (the KRAS oncogene) using bioinformatics tools

Bi 8 Lecture 7. Ellen Rothenberg 26 January Reading: Ch. 3, pp ; panel 3-1

Overview. Secondary Structure. Tertiary Structure

The Gene Ontology Annotation (GOA) project application of GO in SWISS-PROT, TrEMBL and InterPro

The Need for Scientific. Data Annotation. Alick K Law, Ph.D., M.B.A. Marketing Manager IBM Life Sciences.

RCSB Protein Data Bank Advisory Committee. Teleconference October 19, 2017

Sequence Databases and database scanning

S Basics for biosystems of the Cell PATENTING OF PROTEIN STRUCTURES AND PROTEOMICS INVENTIONS IN THE EUROPEAN PATENT OFFICE

The Nucleic Acid Database: A Resource for Nucleic Acid Science

Bioinformatics & Protein Structural Analysis. Bioinformatics & Protein Structural Analysis. Learning Objective. Proteomics

Total Test Questions: 66 Levels: Grades Units of Credit: 1.0 STANDARD 2. Demonstrate appropriate use of personal protective devices.

Protein Bioinformatics PH Final Exam

Recursive protein modeling: a divide and conquer strategy for protein structure prediction and its case study in CASP9

Protein Structure Prediction. christian studer , EPFL

Lecture 1a Introduction to Protein Structures - Molecular Graphics Tool. Software Widely Used by Scientific Community

Protein Peeling 2: a web server to convert protein structures into series of Protein Units.

Proteins Higher Order Structures

BCH222 - Greek Key β Barrels

Bioinformatics for Cell Biologists

The Gene Gateway Workbook

Introduction to Proteins

PRESENTING SEQUENCES 5 GAATGCGGCTTAGACTGGTACGATGGAAC 3 3 CTTACGCCGAATCTGACCATGCTACCTTG 5

Research in Structural Bioinformatics and Molecular Biophysics. OUTLINE: What is it and why is it useful? EXAMPLES: b. Improving enzyme s function.

RCSB Protein Data Bank Advisory Committee Report of October 2 nd 2010 Annual Meeting Rutgers University, New Brunswick, New Jersey

Following text taken from Suresh Kumar. Bioinformatics Web - Comprehensive educational resource on Bioinformatics. 6th May.2005

Introduction to Cellular Biology and Bioinformatics. Farzaneh Salari

BIO 311C Spring Lecture 16 Monday 1 March

Reading for lecture 2

Step-by-Step Description of ELISA

PeppyChains: Simplifying the assembly of 3D- printed generic protein models

2 Introduction to BI Launch Pad

Tutorial. Visualize Variants on Protein Structure. Sample to Insight. November 21, 2017

Introduction to Bioinformatics for Medical Research. Gideon Greenspan TA: Oleg Rokhlenko. Lecture 1

Transcription:

Retrieving and Viewing Protein Structures from the Protein Data Base 7.88J Protein Folding Prof. David Gossard September 15, 2003 1

PDB Acknowledgements The Protein Data Bank (PDB - http://www.pdb.org/) is the single worldwide repository for the processing and distribution of 3-D biological macromolecular structure data. Berman, H. M., J. Westbrook, Z. Feng, G. Gilliland, T. N. Bhat, H. Weissig, I. N. Shindyalov, and P. E. Bourne. The Protein Data Bank. Nucleic Acids Research 28 (2000): 235-242. (PDB Advisory Notice on using materials available in the archive: http://www.pdb.org/pdb/static.do?p=general_information/about_ pdb/pdb_advisory.html) Data used in the Retrieving, Viewing Protein Structures from the Protein Data Base Lecture Notes for 7.88J - Protein Folding PDB site data on page 4 ( PDB Growth ) and 7 ( Not all Structures are Different ). PDB screenshots: pages 9 14. Pages 20-22 contain screen shots from ExPASy (SwissPDB): Appel, R. D., A. Bairoch, and D. F. Hochstrasser. A new generation of information retrieval tools for biologists: the example of the ExPASy WWW server. Trends. Biochem. Sci. 19 (1994): 258-260. (ExPASy (Expert Protein Analysis System) proteomics server disclaimer: http://us.expasy.org/disclaimer.html) 2

Protein Data Base Established in 1971 Funded by NSF, DOE, NIH Operated by Rutgers, SDSC, NIST Purpose: Make protein structure data available to the entire scientific community In the beginning: less than a dozen protein structures Currently has 22,333 protein structures Growing at 20% per year New structures 50 times larger than those in 1971 are commonplace 3

PDB Growth 4

Why the Knee in the Curve? Engineered bacteria as a source of proteins Improved crystal-growing conditions More intense sources of X-rays Cryogenic treatment of crystals Improved detectors & data collection New method - NMR: Accounts for 15% of new structures in PDB Enables determination of structure of proteins in solution Protein Structures: From Famine to Feast, Berman, et.al. American Scientist v.90, p.350-359, July-August 2002 5

Why is the PDB Important? Collective Leverage for Understanding molecular machinery Rational drug design Engineering new molecules Structural genomics 6

Not all Structures are Different PDB Growth in New Folds 7

Structure vs Sequence New protein structures are being solved more slowly than new protein sequences are being solved Currently, known protein sequences outnumber known protein structures The sequence-structure gap continues to widen number Known Sequences Known Structures time 8

PDB Website http://www.rcsb.org/pdb/ Enter what you know 9

Query Result Browser Which one do I want? Let s look at this one 10

Yep, that s the right one Structure Explorer View it Download it 11

View Structure Static Images 12

Download/Display Display the file header Download the file (Select this file format) 13

Header Information 14

Visualizing Proteins High complexity Multiple levels of structure Important properties are distributed throughout the 3D structure No single/simple point at which to look 15

Visualization Objectives Structure Backbone; secondary, tertiary & quaternary Side chain groups Hydrophobic, charged, polar, acidic/base, etc. Cross-links Hydrogen bonds, disulfide bonds Surfaces VanderWaals, solvent-accessible Charge distributions, distances & angles, etc. 16

Display Conventions Wireframe Spacefill Ribbon Molecular Surface 17

Visualization Tools Viewers (free) 1960 s : MAGE, RasMol, Chime 2003 : SwissPDB, Protein Explorer, Cn3D, etc. Operating systems Unix, Windows, Mac Our choice (arbitrary) : Chime (plug-in to NETSCAPE) SwissPDB (stand-alone) 18

Important URL s Protein Data Base http://www.rcsb.org/pdb/ Chime https://en.wikipedia.org/wiki/mdl_chime SwissPDB http://www.expasy.ch/spdbv/ History of Visualization of Macromolecules http://www.umass.edu/microbio/rasmol/history.htm 19

SwissPDB 20

SwissPDB Toolbar Center Translate Zoom Rotate Distance between two atoms Angle between three atoms Measure omega, phi and psi angles Provenance of an atom Display groups a certain distance from an atom 21

Control Panel Chain Helix/sheet Residue Main chain Color target Color Side chain Label Surface Ribbon 22

Point of Information Today s lecture material is: a subset of the information available to you in online tutorials presented to get you started quickly and to shorten the learning curve not exhaustive or even sufficient => should be augmented by actually working through the online tutorials 23

MIT OpenCourseWare http://ocw.mit.edu 7.88J / 5.48J / 10.543J Protein Folding and Human Disease Spring 2015 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.