Introduction to Bioinformatics Introduction to Bioinformatics

Size: px
Start display at page:

Download "Introduction to Bioinformatics Introduction to Bioinformatics"

Transcription

1 Dr. rer. nat. Gong Jing Cancer Research center Medicine School of Shandong University

2 Chapter 5 Structure 2

3 Protein Structure When you study a protein, you are usually interested in its function. Function Structure Sequence Motif 3

4 Protein Structure There are four distinct levels of protein structure: Primary structure amino acid sequence Secondary structure regular sub-structures Tertiary structure 3-dimentional structure Quaternary structure complex of protein molecules 4

5 Protein Structure 5

6 Protein Structure - Primary Structure V G S A Primary structure the amino acid sequence of a protein >sp P06968 DUT_ECOLI MKKIDVKILDPRVGKEFPLPTYATSGSAGLDLRACLNDAVELAPGDTTLVPTGL AIHIADPSLAAMMLPRSGLGHKHGIVLGNLVGLIDSDYQGQLMISVWNRGQDSF TIQPGERIAQMIFVPVVQAEFNLVEDFDATDRGEGGFGHSGRQ 6

7 Protein Structure - Primary Structure >sp P06968 DUT_ECOL MKKIDVKILDPRVGKEFPL PTYATSGSAGLDLRACLND AVELAPGDTTLVPTGLAIH IADPSLAAMMLPRSGLGHK HGIVLGNLVGLIDSDYQGQ LMISVWNRGQDSFTIQPGE RIAQMIFVPVVQAEFNLVE DFDATDRGEGGFGHSGRQ Chapter 2 Chapter 3 sequence search sequence analysis 7

8 Protein Structure - Secondary Structure 90 o Helices: Where residues seem to be following the shape of a spring. The most common are the so-called alpha helices. 90 o Beta-strands: Where residues are in line and almost completely extended. Beta-sheets consist of beta-strands connected laterally, ming a generally twisted, pleated sheet. Random coils (loop): When the amino-acid chain is neither helices nor Beta-strands. Turn: in those cases where the chain makes a sharp turn (90 or more). 8

9 How to get secondary structures? 1. from PDB/DSSP structure known proteins The DSSP (Definition of Secondary Structure of Proteins) program was designed to standardize secondary structure assignment. DSSP is a database of secondary structure assignments all protein entries in the Protein Data Bank (PDB). DSSP does not predict secondary structure. Home page : alpha-helix Protein Structure - Secondary Structure coil beta-sheet 9

10 How to get secondary structures? 1. from PDB/DSSP structure known proteins Protein Structure - Secondary Structure secondary structures part of a DSSP file 10

11 How to get secondary structures? 1. from PDB/DSSP structure known proteins Protein Structure - Secondary Structure secondary structures H = alpha helix B = residue in isolated beta-bridge E = extended strand, participates in beta ladder G = 3-helix (3/10 helix) I = 5 helix (pi helix) T = hydrogen bonded turn S = bend Blank = loop or irregular part of a DSSP file 11

12 Protein Structure - Secondary Structure How to get secondary structures? 1. from PDB/DSSP structure known proteins

13 Protein Structure - Secondary Structure How to get secondary structures? 1. from PDB/DSSP structure known proteins

14 How to get secondary structures? 1. from PDB/DSSP structure known proteins Protein Structure - Secondary Structure 14

15 Protein Structure - Secondary Structure How to get secondary structures? 1. from PDB/DSSP structure known proteins primary structure secondary structure 15

16 Protein Structure - Secondary Structure How to get secondary structures? 1. from PDB/DSSP structure known proteins

17 Protein Structure - Secondary Structure How to get secondary structures? 1. from PDB/DSSP structure known proteins PDB ID in lower case 17

18 How to get secondary structures? 1. from PDB/DSSP structure known proteins Protein Structure - Secondary Structure A FASTA matted file ("ss.txt") generated using DSSP with sequences and secondary structures of all entries in PDB is available at: (compressed) 18

19 How to get secondary structures? Protein Structure - Secondary Structure 2. from predictions structure unknown proteins Predicting the secondary structure of proteins was one of the hottest goals of the 1990s. Nowadays, fairly good servers are available to accurately predict the secondary structure of any protein that may interest you. If your protein has enough homologues in the current databases, you can expect its secondary structure prediction to be close to 80% accurate. However, bear in mind that this is only a prediction; as with all predictions, it can be more or less inaccurate. PSIPRED - one of the most accurate secondary structure prediction servers. 19

20 How to get secondary structures? Protein Structure - Secondary Structure 2. from predictions structure unknown proteins

21 How to get secondary structures? Protein Structure - Secondary Structure 2. from predictions structure unknown proteins 3cig.fasta This server returns its results by , usually in less than 30 minutes. Give a name if you need help remembering what the analysis was all about. 21

22 Protein Structure - Secondary Structure How to get secondary structures? Prediction (Pred) : the 2. from predictions structure unknown proteins predicted conmation each residue. Confidence (Conf) : the reliability of the prediction each position

23 How to get secondary structures? Protein Structure - Secondary Structure 2. from predictions structure unknown proteins The result will be sent to the address you provided. Note : The server does not accept free, Web-based e- mail addresses (such as those from MSN s Hotmail or Google s Gmail). No results will be returned. 23

24 Protein Structure - Secondary Structure How to get secondary structures? DSSP 2. from predictions structure unknown proteins PSIPRED 51 LAILDAGFNSISKLEPELCQILPLLKVLNLQHNELSQISDQTFVFCTNLT CSEEECCSSCCCCCCTHHHHHSTTCCEEECTTCCCCCCCTTSTTSCCSCC CCEEECCCCCCCCCCHHHHCCCCCCCEEECCCCCCCCCCCCCCCCCCCCC 101 ELDLMSNSIHKIKSNPFKNQKNLIKLDLSHNGLSSTKLGTGVQLENLQEL EEECTTSCCCCCCSCTTTTCTTCCEEECCSSCCCCCCCCSSSCCTTCCEE EEEEECCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCCCEE 151 LLAKNKILALRSEELEFLGNSSLRKLDLSSNPLKEFSPGCFQTIGKLFAL ECCSSCCCEECSGGGGGGTTCEESEEECCSCCCCEECTTTTTTSSEECEE EEECCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCCCCCEE over 90% correctly predicted positions 24

25 This level of structure describes how regions of secondary structure fold together - the 3D arrangement of a polypeptide chain, including alpha-helices, beta-sheets, and any other loops and folds. Tertiary structure results from interactions between side chains, or between side chains and the polypeptide backbone, which are often distant in sequence. The tertiary structure of a protein is defined by its atomic coordinates

26 How to get tertiary structures? 1. Search 3D Structure in PDB e.g. PDB ID, protein name, author 26 26

27 How to get tertiary structures? 1. Search 3D Structure in PDB 27

28 How to get tertiary structures? 1. Search 3D Structure in PDB 28

29 How to get tertiary structures? 1. Search 3D Structure in PDB 29

30 How to get tertiary structures? 2. Search PDB Structure in Dali helsinki.fi/dali_server PDB File 30 30

31 How to get tertiary structures? 2. Search PDB Structure in Dali helsinki.fi/dali_server 31

32 How to get tertiary structures? 2. Search PDB Structure in Dali helsinki.fi/dali_server 32 32

33 How to get tertiary structures? 1. Search PDB Structure in Dali helsinki.fi/dali_server 33 33

34 How to get tertiary structures? 1. Search PDB Structure in Dali helsinki.fi/dali_server 34 34

35 How to get tertiary structures? 1. Search PDB Structure in Dali helsinki.fi/dali_server Superimposed structures shown by Jmol Applet mol1a 3CIG 1ZIW 35

36 How to get tertiary structures? 3. 3D Molecular Viewer Jmol online (PDB) 36

37 How to get tertiary structures? 1. Experimental Methods Search Structure in PDB Press left button Press left button to rotate the molecule Roll middle button to zoom in/out Click right button to access Jmol menu 37

38 How to get tertiary structures? 3. 3D Molecular Viewer Jmol online (PDB) Cartoon Secondary Structure None Backbone By chain None 38

39 How to get tertiary structures? 3. 3D Molecular Viewer VMD Maestro 3H6X.pdb Pymol 39

40 How to get tertiary structures? 3. 3D Molecular Viewer --- Jmol 40

41 How to get tertiary structures? 3. 3D Molecular Viewer --- RasMol

42 How to get tertiary structures? 3. 3D Molecular Viewer --- CanVasMol 42

43 How to get tertiary structures? 3. 3D Molecular Viewer --- Cn3D Structure/CN3D/cn3d.shtml 43 43

44 How to get tertiary structures? 3. 3D Molecular Viewer --- Molegro mmv-product.php 44

45 How to get tertiary structures? 3. 3D Molecular Viewer --- MGL Tools

46 How to get tertiary structures? 3. 3D Molecular Viewer --- Maestro products/14/

47 How to get tertiary structures? 3. 3D Molecular Viewer --- PyMOL

48 How to get tertiary structures? 3. 3D Molecular Viewer --- PyMOL

49 How to get tertiary structures? 3. 3D Molecular Viewer --- Swiss-PDBViewer

50 How to get tertiary structures? 3. 3D Molecular Viewer --- JMV 50

51 How to get tertiary structures? 3. 3D Molecular Viewer --- VMD 51

52 How to get tertiary structures? 3. 3D Molecular Viewer --- VMD Free! 52

53 How to get tertiary structures? 3. 3D Molecular Viewer --- VMD More details under VMD tutorial: Tutorials/vmd/tutorialhtml/index.html 53

54 How to get tertiary structures? 4. Experimental Methods The first 3D structure of a protein was determined in 1958 by Drs. Kendrew and Perutz using X-ray crystallography X-ray Crystallography 7929 Max Ferdinand Perutz ( ) nobel prize 1962 John Cowdery Kendrew ( ) nobel prize 1962 Nuclear Magnetic Resonance (NMR) 54

55 How to get tertiary structures? 4. Experimental Methods X-ray Crystallography Crystallography can solve structures of relatively large molecules. X-ray crystallography is now used routinely by scientists to determine how a pharmaceutical drug interacts with its protein target and what changes might improve it. However, intrinsic membrane proteins remain challenging to crystallize because they require detergents or other means to solubilize them in isolation, and such detergents often interfere with crystallization. Such membrane proteins are a large component of the genome and include many proteins of great physiological importance, such as ion channels and receptors. 55

56 How to get tertiary structures? 4. Experimental Methods Why X-ray, not Ultraviolet or other? When using light to measure an object, the wavelength of the light needs to be similar to the size of the object. X-rays, with wavelengths of approximately 0.5 to 1.5 angstroms, can measure the distance between atoms. Visible light, with a wavelength of 4,000 to 7,000 angstroms, is used in ordinary light microscopes because it can measure objects with the size of cellular components. 56

57 How to get tertiary structures? 4. Experimental Methods Solution-state NMR is restricted to relatively small ones (less than 70 kda). It is used to obtain inmation about the structure and dynamics of proteins. Protein NMR techniques are continually being used and improved in both academia and the biotech industry. Not one, but many structures $ 60,0000 Nuclear Magnetic Resonance (NMR) 57

58 How to get tertiary structures? 4. Experimental Methods SDU Experts in X-ray Crystallography Prof. SUN Jinpeng Ph.D. Institute of Biochemistry and Molecular Biology School of Medicine, SDU Prof. GU Lichuan Ph.D. State Key Laboratory of Microbial Technology School of Life Sciences, SDU 58

59 How to get tertiary structures? 4. Experimental Methods Advantages & Disadvantages The crystal structure is necessary, only that proteins which can be crystallized are examinable. (disadvantage of X-ray) (advantage of NMR) There is no chance direct determination of secondary structures and especially domain movements. (disadvantage of X-ray) (advantage of NMR) The hydrogen in the molecules are not examinable since it has only one electron. (disadvantage of X-ray) (advantage of NMR) The highest molecular mass which was examined successfully is just a 70kDa protein-complex. (disadvantage of NMR) (advantage of X-ray) The cost of the experimental implementation is increasing with the higher strength and the complexity of the determination. (disadvantage of X-ray) (disadvantage of NMR) 59

60 How to get tertiary structures? 4. Experimental Methods Advantages & Disadvantages The crystal structure is necessary, only that proteins which can be crystallized are examinable. (disadvantage of X-ray) (advantage of NMR) Time-consuming There is no chance direct determination of secondary structures and especially domain movements. (disadvantage of X-ray) (advantage of NMR) Labor-consuming The hydrogen in the molecules are not examinable since it has only one electron. (disadvantage of X-ray) (advantage of NMR) Expensive The highest molecular mass which was examined successfully is just a 70kDa protein-complex. (disadvantage of NMR) (advantage of X-ray) The cost of the experimental implementation is increasing with the higher strength and the complexity of the determination. (disadvantage of X-ray) (disadvantage of NMR) 60

61 How to get tertiary structures? 5. Computational Methods MEAKIVKVLDSSRCEDGFGKKRKRAASYAAYVTGV SCAKLQNVPPPNGQCQIPDKRRRLEGENKLSAYE NRSGKALVRYYTYFKKTGIAKRVMMYENGEWNDL PEHVICAIQNELEEKSAAIEFKLCGHSFILDFLHMQR LDMETGAKTPLAWIDNAGKCFFPEIYESDERTNYC HHKCVEDPKQNAPHDIKLRLEIDVNGGETPRLNLE ECSDESGDNMMDDVPLAQRSSNEHYDEATEDSC SRKLEAAVSKWDETDAIVVSGAKLTGSEVLDKDAV KKMFAVGTASLGHVPVLDVGRFSSEIAEARLALFQ KQVEITKKHRGDANVRYAWLPAKREVLSAVMMQG LGVGGAFIRKSIYGVGIHLTAADCPYFSARYCDVDE NGVRYMVLCRVIMGNMELLRGDKAQFFSGGEEYD NGVDDIESPKNYIVWNINMNTHIFPEFVVRFKLSNL PNAEGNLIAKRDNSGVTLEGPKDLPPQLESNQGAR GSGSANSVGSSTTRPKSPWMPFPTLFAAISHKVAE NDMLLINADYQQLRDKKMTRAEFVRKLRVIVGDDL LRSTITTLQNQPKSKEIPGSIRDHEEGAGGL input output 61

62 How to get tertiary structures? 5. Computational Methods AB initio Homology Modeling Threading Ensemble 62

63 How to get tertiary structures? 5. Computational Methods AB initio predicts structures based on physical principles. Indeed, the protein amino acid sequence already contains all inmation needed to create a correctly folded protein [Anfinsen, 1937, Science]. All the energetic conmations involved in the protein folding process are calculated by energy functions to find the structure with the lowest free energy. 63

64 How to get tertiary structures? 5. Computational Methods --- AB initio : QUARK ed.umich.edu/quark quark.fasta It takes 1-2 days. 64

65 How to get tertiary structures? 5. Computational Methods --- AB initio : QUARK ed.umich.edu/quark 65

66 How to get tertiary structures? 5. Computational Methods --- AB initio : QUARK ed.umich.edu/quark 66

67 How to get tertiary structures? 5. Computational Methods --- AB initio : QUARK ed.umich.edu/quark Homology Modeling is based on the assumption that similar sequences among evolutionarily related proteins share an overall structural similarity [Marti-Renom, 2000, Annu Rev Biophys Biomol Struct].? 67

68 How to get tertiary structures? 5. Computational Methods Homology Modeling is based on the assumption that similar sequences among evolutionarily related proteins share an overall structural similarity [Marti-Renom, 2000, Annu Rev Biophys Biomol Struct]. 68

69 How to get tertiary structures? 5. Computational Methods --- Homology Modeling 1. Identification of a homologue(s) (sequence identity >= 30%) of known structure as template(s) modeling the target sequence. 69

70 How to get tertiary structures? 5. Computational Methods --- Homology Modeling 2. Generating and improving an alignment of the target sequence with the template sequence(s). Usually the knowledge based manual adjustment is necessary. 70

71 How to get tertiary structures? 5. Computational Methods --- Homology Modeling 3. Building coordinates of the three-dimensional model through automatic modeling programs, based on the alignment. 71

72 How to get tertiary structures? 5. Computational Methods --- Homology Modeling 4. Evaluation of the model and then to repeat the previous steps according the evaluation results until the model becomes satisfactory. 72

73 How to get tertiary structures? 5. Computational Methods --- Homology Modeling : SWISS-MODEL 73 73

74 How to get tertiary structures? 5. Computational Methods --- Homology Modeling : SWISS-MODEL swiss_model.fasta 74

75 How to get tertiary structures? 5. Computational Methods --- Homology Modeling : SWISS-MODEL 75

76 How to get tertiary structures? 5. Computational Methods --- Homology Modeling Homology modeling is most accurate when the target and template have similar sequences. 76

77 How to get tertiary structures? 5. Computational Methods --- Homology Modeling Two proteins with a high level of sequence identity will nevertheless have not exactly identical backbone conmations. [Gong et.al. BMC Struct Biol, 2012, 12:23] 77

78 How to get tertiary structures? 5. Computational Methods --- Homology Modeling Homology modeling cannot be used when the sequence identity between the target and template is < 30%. 78

79 How to get tertiary structures? 5. Computational Methods Threading (fold recognition) The prediction is made by "threading" (i.e. placing, aligning) each amino acid in the target sequence to a position in the template structure, and evaluating how well the target fits the template. After the best-fit template is selected, the structural model of the target is built based on the alignment along with the threading path within the chosen template. 79

80 How to get tertiary structures? 5. Computational Methods Threading (fold recognition) Protein threading is based on two basic observations: that the number of different folds in nature is fairly small (approximately 1000); and that 90% of the new structures submitted to the PDB in the past three years have similar structural folds to ones already in the PDB. Threading method doesn t depends on the sequence identity between target and template. 80

81 How to get tertiary structures? 5. Computational Methods --- Threading ed.umich.edu/i-tasser 81

82 How to get tertiary structures? 5. Computational Methods --- Threading ed.umich.edu/i-tasser itasser.fasta Free Registration 82

83 How to get tertiary structures? 5. Computational Methods --- Threading ed.umich.edu/i-tasser 83

84 How to get tertiary structures? 5. Computational Methods --- Threading ed.umich.edu/i-tasser 84

85 How to get tertiary structures? 5. Computational Methods --- Threading ed.umich.edu/i-tasser Part I 85

86 How to get tertiary structures? 5. Computational Methods --- Threading ed.umich.edu/i-tasser Part II 86

87 How to get tertiary structures? 5. Computational Methods --- Threading ed.umich.edu/i-tasser Part III 87

88 How to get tertiary structures? 5. Computational Methods --- Threading ed.umich.edu/i-tasser Part IV 88

89 How to get tertiary structures? 5. Computational Methods --- Homology Modeling / Threading Input Modeller is a partly automated protein structure prediction program. Output 89 89

90 How to get tertiary structures? 5. Computational Methods --- AB Initio + Homology Modeling Robetta provides both ab initio and homology modeling models of protein domains. Domains without a detectable PDB homolog are modeled with the Rosetta de novo protocol. The procedure is fully automated

91 How to get tertiary structures? 5. Computational Methods Homologues in PDB (>30%)? Best-fit template (energy function)? < 200 AA? 91

92 How to get tertiary structures? 5. Computational Methods --- MQAPs When building a protein model, with or without the aid of experimental inmation, it is often necessary to use an independent measure to evaluate the correctness of the model. This is the role of Model Quality Assessment programs (MQAPs). Different types of MQAPs have been developed during the last decades. A model on hand, finish? a random model (a low quality model) = maybe 92

93 How to get tertiary structures? 5. Computational Methods --- MQAPs I-TASSER Swiss-Model 93

94 How to get tertiary structures? 5. Computational Methods --- MQAPs I-TASSER (C-score) : C-score is typically in the range of [-5,2], where a C- score of higher value signifies a model with a high confidence and vice-versa. Swiss-Model (QMEAN4) : QMEAN4 is a reliability score the whole model which can be used in order to compare and rank alternative models of the same target. The quality estimate ranges between 0 and 1 with higher values better models. Quark (TM-score) : TM-score is a recently proposed scale measuring the structural similarity between two structures. A TM-score >0.5 indicates a model of correct topology and a TM-score<0.17 means a random similarity. 94

95 How to get tertiary structures? 5. Computational Methods --- MQAPs : SAVES 95

96 How to get tertiary structures? 5. Computational Methods --- MQAPs : SAVES >85% of the residues have an averaged 3D- 1D score > 0.2 means a good model. 96

97 How to get tertiary structures? 5. Computational Methods --- MQAPs : ProQ predicted secondary structure required 97

98 How to get tertiary structures? 5. Computational Methods --- MQAPs : MetaMQAPII 98

99 How to get tertiary structures? 5. Computational Methods --- MQAPs : ModFOLD One job takes minutes. 99

100 How to get tertiary structures? 5. Computational Methods --- Model Refinement : ModLoop freely available academic users one loop between residues 15 and 22, no chain identifier, another loop between residues 98 and 103 in chain B

101 How to get tertiary structures? 5. Computational Methods --- Model Refinement : MDS Molecular Dynamic Simulation (MDS) is a computer simulation of physical movements of atoms and molecules. Free! Free!

102 How to get tertiary structures? 5. Computational Methods --- Model Refinement : MDS Molecular Dynamic Simulation (MDS) is a computer simulation of physical movements of atoms and molecules. 500-aa protein, 1 ns (10-9 s), 120 Cores : 5 hours a meaningful simulation > 20ns 102

103 Protein Structure - Quaternary Structure How to get quaternary structures? 1. Experimentally determined molecular complexes Many proteins are actually assemblies of more than one polypeptide chain. Quaternary structure can play important functional roles multi-subunit proteins. More recently, people refer to protein-protein interaction when discussing quaternary structure of proteins and consider all assemblies of proteins as protein complexes. crystal structure of the mouse TLR3 ECD- dsrna-2:1 complex (PDB ID: 3CIY) crystal structure of the human TLR4 ECD-human MD2-E.coli LPS complex (PDB ID: 3FIX) 103

104 Protein Structure - Quaternary Structure How to get quaternary structures? 2. Protein-Protein interaction databases DIP (the Database of Interacting Proteins) : experimentally determined interactions between proteins BioGRID (the Biological General Repository Interaction Datasets) : collections of protein and genetic interactions from major model organism species STRING - Known and Predicted Protein-Protein Interactions etc. 104

105 Protein Structure - Quaternary Structure How to get quaternary structures? 3. Protein-Protein docking Macromolecular docking describes the way in which two large molecules approach and interact with each other. Docking between molecules may be likened to docking a ship at a wharf. Docking programs try all possible conmations and use scoring functions to rank them. Scoring functions including: shape complementarily residue contacts desolvation hydrophobicity electrostatics energy are usually exploited in ranking the docking conmations. 105

106 Protein Structure - Quaternary Structure How to get quaternary structures? 3. Protein-Protein docking Two kinds of protein-protein docking: Rigid Docking 刚性对接 relate to the molecules as rigid objects that cannot change their spatial shape during the docking process. When substantial conmational change occurs within the components at the time of complex mation, rigid-body docking is inadequate. Flexible Docking 柔性对接 model changes in internal geometry of the interacting partners that may occur when a complex is med. However, scoring all possible conmational changes is prohibitively expensive in computing time. 106

107 Protein Structure - Quaternary Structure How to get quaternary structures? 3. Protein-Ligand docking This problems involves a large molecule (the protein - also called the receptor ) and a small molecule (the ligand) and is very useful in developing medicines. Rigid Docking - the ligand will be kept completely rigid during the orientation step. Flexible Docking the ligand will be allowed to be flexible. This type of docking allows the ligand to structurally rearrange in response to the receptor. 107

108 Protein Structure - Quaternary Structure How to get quaternary structures? 4. Docking Program --- HADDOCK 108

109 Protein Structure - Quaternary Structure How to get quaternary structures? 4. Docking Program --- AutoDock 109

110 Protein Structure - Quaternary Structure How to get quaternary structures? 4. Docking Program --- AutoDock Vina 110

111 Protein Structure - Quaternary Structure How to get quaternary structures? 4. Docking Program --- AutoDock

112 112

113 Protein Structure - Quaternary Structure How to get quaternary structures? 4. Docking Program --- AutoDock More details under AutoDock Tutorials: s.edu/faqs-help/tutorial 113

114 Protein Structure - Quaternary Structure How to get quaternary structures? 4. Docking Program --- ZDOCK Almighty Online Server Almighty Online Server zdock1.pdb zdock2.pdb gongj@inmatik.u Uploading the receptor and ligand PDB files to the ZDOCK Server Please note that you must use an academic account as the ZDOCK Server is only available to academic users. 114

115 Protein Structure - Quaternary Structure How to get quaternary structures? 4. Docking Program --- ZDOCK Choosing residues to block from or ce into the binding site Suppose, we had inmation from the literature that residues 732 in zdock1 is crucial binding. So this example we use this inmation to choose residue 732 and ce it into the binding site. 115

116 Protein Structure - Quaternary Structure How to get quaternary structures? 4. Docking Program --- ZDOCK Confirm residues selected 116

117 Protein Structure - Quaternary Structure How to get quaternary structures? 4. Docking Program --- ZDOCK The result page contains 4 links. The first three are download links the ZDOCK output file, ligand file and receptor file. To automatically create the predicted complex files, download these three files into the same directory. Then click the fourth link, "Launch Create Complexes!" which is a Java program that will create the complexes you. 117

118 Protein Structure - Quaternary Structure How to get quaternary structures? 4. Docking Program --- ZDOCK A new window will appear named "Create Complexes. To create a subset (e.g. top 10), you need to specify the start and end number and then click the "Create Subset" button. "Create All" would return all max predicted complexes, which may take a few minutes to complete. In the Create Complexes window, click the "ZDOCK Output File..." button to browse the ZDOCK output file you downloaded earlier. 118

119 Protein Structure - Quaternary Structure How to get quaternary structures? 4. Docking Program --- ZDOCK Now you can find the top 10 complexes in the identical directory, and you can now compare them. 119

120 Protein Structure - Quaternary Structure How to get quaternary structures? 4. Docking Program --- ZDOCK 120

121 Protein Structure - Quaternary Structure How to get quaternary structures? 4. Docking Program --- ZDOCK 121

122 Protein Structure - Quaternary Structure How to get quaternary structures? 4. Docking Program --- PDBePISA complex1.pdb

123 Protein Structure - Quaternary Structure How to get quaternary structures? 4. Docking Program --- PDBePISA 123

124 Protein Structure - Quaternary Structure How to get quaternary structures? 4. Docking Program --- PDBePISA 124

125 Protein Structure - Quaternary Structure How to get quaternary structures? 4. Docking Program --- PDBePISA

126 Protein Structure - Quaternary Structure How to get quaternary structures? 4. Docking Program --- PDBePISA

127 Protein Structure - Quaternary Structure How to get quaternary structures? 4. Docking Program --- PDBePISA 127

128 Protein Structure - Quaternary Structure How to get quaternary structures? 5. Virtual Screening Virtual screening (VS) is a computational technique used in drug discovery research. It involves the rapid in silico assessment of large libraries of chemical structures in order to identify those structures which are most likely to bind to a drug target, typically a protein receptor or enzyme. Library of chemical compounds VS focuses on questions like how can we filter down the enormous chemical space of numerous conceivable compounds to a manageable number that can be synthesized, purchased, and tested. Virtual screening 128

129 Protein Structure - Quaternary Structure How to get quaternary structures? 5. Virtual Screening Commercial Free! 129

130 Protein Structure - Quaternary Structure How to get quaternary structures? 5. Virtual Screening --- ZINC

131 Protein Structure - Quaternary Structure How to get quaternary structures? 5. Virtual Screening --- ZINC

132 Protein Structure - Quaternary Structure How to get quaternary structures? 5. Virtual Screening --- ZINC

133 Protein Structure - Quaternary Structure How to get quaternary structures? 5. Virtual Screening --- ZINC The simplified molecular input line entry specification (SMILES) is a specification in m of a line notation describing the structure of chemical molecules using short ASCII strings

134 Protein Structure - Quaternary Structure 134

135 Protein Structure - Quaternary Structure

136 Protein Structure - Quaternary Structure How to get quaternary structures? 5. Virtual Screening --- ZINC

137 Protein Structure - Quaternary Structure How to get quaternary structures? 5. Virtual Screening --- ZINC

138 Protein Structure - Quaternary Structure How to get quaternary structures? 5. Virtual Screening --- ZINC Benzoic Acid 192 similar compounds with identity > 90% 138

139 Protein Structure - Quaternary Structure How to get quaternary structures? 5. Virtual Screening --- Preparing Input Files Ligands benzoic.mol2 split transm Programming required *.mol2 x 192 *.pdbqt x

140 Protein Structure - Quaternary Structure How to get quaternary structures? 5. Virtual Screening --- Preparing Input Files Receptor receptor.pdb receptor.pdbqt

141 Protein Structure - Quaternary Structure How to get quaternary structures? 5. Virtual Screening --- Docking with AutoDock 192 compounds automatically, programming required X 192 docking receptor 8 cores : 1.2 hours Dell T , docking results 141

142 Protein Structure - Quaternary Structure How to get quaternary structures? 5. Virtual Screening --- Ranking the Docking Results Ranked by Free Energy 1.Rank Benzoic Acid: positive control 142

143 Protein Structure - Quaternary Structure How to get quaternary structures? 5. Virtual Screening --- Visualization of Complex 143

144

145 Protein Structure - Quaternary Structure How to get quaternary structures? 5. Virtual Screening --- Testing the Binding Affinity by Experiments 145

146 >exercise_seq AVLAGEFSDIQACSAAWKADGVCSTVAGSRPEN VRKNRYKDVLPYDQTRVILSLLQEEGHSDYING NFIRGVDGSLAYIATQGPLPHTLLDFWRLVWEF GVKVILMACREIENGRKRCERYWAQEQEPLQTG LFCITLIKEKWLNEDIMLRTLKVTFQKESRSVY QLQYMSWPDRGVPSSPDHMLAMVEEARRLQGSG PEPLCVHCSAGCGRTGVLCTVDYVRQLLLTQMI PPDFSLFDVVLKMRKQRPAAVQTEEQYRFLYHT VAQMFC exercise.fasta What kind of chemical structure is most likely to be active against this protein target? 146

147 >exercise_seq AVLAGEFSDIQACSAAWKADGVCSTVAGSRPEN VRKNRYKDVLPYDQTRVILSLLQEEGHSDYING NFIRGVDGSLAYIATQGPLPHTLLDFWRLVWEF GVKVILMACREIENGRKRCERYWAQEQEPLQTG LFCITLIKEKWLNEDIMLRTLKVTFQKESRSVY QLQYMSWPDRGVPSSPDHMLAMVEEARRLQGSG PEPLCVHCSAGCGRTGVLCTVDYVRQLLLTQMI PPDFSLFDVVLKMRKQRPAAVQTEEQYRFLYHT VAQMFC Know more about this protein: protein tyrosine phosphatase non-receptor type 18 is an enzyme that Download compounds Refine model: in the low quality regions < 24 hours Virtual screening Find homologue: 2QCT vs PTP_N18 Identity: > 48% MQAPs: ModFOLD, ProQ, QMEAN, etc. Construct 3D model: homology modeling 147

Introduction to Bioinformatics

Introduction to Bioinformatics Introduction to Bioinformatics http://1.51.212.243/bioinfo.html Dr. rer. nat. Jing Gong Cancer Research Center School of Medicine, Shandong University 2011.10.19 1 Chapter 4 Structure 2 Protein Structure

More information

CSE : Computational Issues in Molecular Biology. Lecture 19. Spring 2004

CSE : Computational Issues in Molecular Biology. Lecture 19. Spring 2004 CSE 397-497: Computational Issues in Molecular Biology Lecture 19 Spring 2004-1- Protein structure Primary structure of protein is determined by number and order of amino acids within polypeptide chain.

More information

CS612 - Algorithms in Bioinformatics

CS612 - Algorithms in Bioinformatics Spring 2012 Class 6 February 9, 2012 Backbone and Sidechain Rotational Degrees of Freedom Protein 3-D Structure The 3-dimensional fold of a protein is called a tertiary structure. Many proteins consist

More information

Protein Structure Prediction. christian studer , EPFL

Protein Structure Prediction. christian studer , EPFL Protein Structure Prediction christian studer 17.11.2004, EPFL Content Definition of the problem Possible approaches DSSP / PSI-BLAST Generalization Results Definition of the problem Massive amounts of

More information

Structural bioinformatics

Structural bioinformatics Structural bioinformatics Why structures? The representation of the molecules in 3D is more informative New properties of the molecules are revealed, which can not be detected by sequences Eran Eyal Plant

More information

Introduction to Proteins

Introduction to Proteins Introduction to Proteins Lecture 4 Module I: Molecular Structure & Metabolism Molecular Cell Biology Core Course (GSND5200) Matthew Neiditch - Room E450U ICPH matthew.neiditch@umdnj.edu What is a protein?

More information

Homology Modelling. Thomas Holberg Blicher NNF Center for Protein Research University of Copenhagen

Homology Modelling. Thomas Holberg Blicher NNF Center for Protein Research University of Copenhagen Homology Modelling Thomas Holberg Blicher NNF Center for Protein Research University of Copenhagen Why are Protein Structures so Interesting? They provide a detailed picture of interesting biological features,

More information

Homology Modeling of Mouse orphan G-protein coupled receptors Q99MX9 and G2A

Homology Modeling of Mouse orphan G-protein coupled receptors Q99MX9 and G2A Quality in Primary Care (2016) 24 (2): 49-57 2016 Insight Medical Publishing Group Research Article Homology Modeling of Mouse orphan G-protein Research Article Open Access coupled receptors Q99MX9 and

More information

Ligand docking and binding site analysis with pymol and autodock/vina

Ligand docking and binding site analysis with pymol and autodock/vina International Journal of Basic and Applied Sciences, 4 (2) (2015) 168-177 www.sciencepubco.com/index.php/ijbas Science Publishing Corporation doi: 10.14419/ijbas.v4i2.4123 Research Paper Ligand docking

More information

Protein Folding Problem I400: Introduction to Bioinformatics

Protein Folding Problem I400: Introduction to Bioinformatics Protein Folding Problem I400: Introduction to Bioinformatics November 29, 2004 Protein biomolecule, macromolecule more than 50% of the dry weight of cells is proteins polymer of amino acids connected into

More information

CFSSP: Chou and Fasman Secondary Structure Prediction server

CFSSP: Chou and Fasman Secondary Structure Prediction server Wide Spectrum, Vol. 1, No. 9, (2013) pp 15-19 CFSSP: Chou and Fasman Secondary Structure Prediction server T. Ashok Kumar Department of Bioinformatics, Noorul Islam College of Arts and Science, Kumaracoil

More information

Retrieving and Viewing Protein Structures from the Protein Data Base

Retrieving and Viewing Protein Structures from the Protein Data Base Retrieving and Viewing Protein Structures from the Protein Data Base 7.88J Protein Folding Prof. David Gossard September 15, 2003 1 PDB Acknowledgements The Protein Data Bank (PDB - http://www.pdb.org/)

More information

CMPS 6630: Introduction to Computational Biology and Bioinformatics. Secondary Structure Prediction

CMPS 6630: Introduction to Computational Biology and Bioinformatics. Secondary Structure Prediction CMPS 6630: Introduction to Computational Biology and Bioinformatics Secondary Structure Prediction Secondary Structure Annotation Given a macromolecular structure Identify the regions of secondary structure

More information

Introduction to molecular docking

Introduction to molecular docking Introduction to molecular docking Alvaro Cortes Cabrera, Javier Klett, Antonio Morreale Centro de Biología Molecular Severo Ochoa. UAM CSIC C/Nicolas Cabrera, 1. Campos Cantoblanco UAM. 28049, Madrid.

More information

Sequence Analysis '17 -- lecture Secondary structure 3. Sequence similarity and homology 2. Secondary structure prediction

Sequence Analysis '17 -- lecture Secondary structure 3. Sequence similarity and homology 2. Secondary structure prediction Sequence Analysis '17 -- lecture 16 1. Secondary structure 3. Sequence similarity and homology 2. Secondary structure prediction Alpha helix Right-handed helix. H-bond is from the oxygen at i to the nitrogen

More information

BIOINFORMATICS Introduction

BIOINFORMATICS Introduction BIOINFORMATICS Introduction Mark Gerstein, Yale University bioinfo.mbb.yale.edu/mbb452a 1 (c) Mark Gerstein, 1999, Yale, bioinfo.mbb.yale.edu What is Bioinformatics? (Molecular) Bio -informatics One idea

More information

Protein design. CS/CME/BioE/Biophys/BMI 279 Oct. 24, 2017 Ron Dror

Protein design. CS/CME/BioE/Biophys/BMI 279 Oct. 24, 2017 Ron Dror Protein design CS/CME/BioE/Biophys/BMI 279 Oct. 24, 2017 Ron Dror 1 Outline Why design proteins? Overall approach: Simplifying the protein design problem Protein design methodology Designing the backbone

More information

FACULTY OF BIOCHEMISTRY AND MOLECULAR MEDICINE

FACULTY OF BIOCHEMISTRY AND MOLECULAR MEDICINE FACULTY OF BIOCHEMISTRY AND MOLECULAR MEDICINE BIOMOLECULES COURSE: COMPUTER PRACTICAL 1 Author of the exercise: Prof. Lloyd Ruddock Edited by Dr. Leila Tajedin 2017-2018 Assistant: Leila Tajedin (leila.tajedin@oulu.fi)

More information

S Basics for biosystems of the Cell PATENTING OF PROTEIN STRUCTURES AND PROTEOMICS INVENTIONS IN THE EUROPEAN PATENT OFFICE

S Basics for biosystems of the Cell PATENTING OF PROTEIN STRUCTURES AND PROTEOMICS INVENTIONS IN THE EUROPEAN PATENT OFFICE S-114.500 Basics for biosystems of the Cell PATENTING OF PROTEIN STRUCTURES AND PROTEOMICS INVENTIONS IN THE EUROPEAN PATENT OFFICE Riku Rinta-Jouppi, 44448J Written Course Work Presentation given on 1

More information

Homology Modelling. Thomas Holberg Blicher NNF Center for Protein Research University of Copenhagen

Homology Modelling. Thomas Holberg Blicher NNF Center for Protein Research University of Copenhagen Homology Modelling Thomas Holberg Blicher NNF Center for Protein Research University of Copenhagen Why are Protein Structures so Interesting? They provide a detailed picture of interesting biological features,

More information

Nucleic Acids, Proteins, and Enzymes

Nucleic Acids, Proteins, and Enzymes 3 Nucleic Acids, Proteins, and Enzymes Chapter 3 Nucleic Acids, Proteins, and Enzymes Key Concepts 3.1 Nucleic Acids Are Informational Macromolecules 3.2 Proteins Are Polymers with Important Structural

More information

Molecular Dynamics of Proteins

Molecular Dynamics of Proteins Molecular Dynamics of Proteins Ubiquitin Myoglobin Equilibrium Properties of Proteins Ubiquitin Root Mean Squared Deviation: measure for equilibration and protein flexibility RMSD constant protein equilibrated

More information

Protein Structure Databases, cont. 11/09/05

Protein Structure Databases, cont. 11/09/05 11/9/05 Protein Structure Databases (continued) Prediction & Modeling Bioinformatics Seminars Nov 10 Thurs 3:40 Com S Seminar in 223 Atanasoff Computational Epidemiology Armin R. Mikler, Univ. North Texas

More information

Bi 8 Lecture 7. Ellen Rothenberg 26 January Reading: Ch. 3, pp ; panel 3-1

Bi 8 Lecture 7. Ellen Rothenberg 26 January Reading: Ch. 3, pp ; panel 3-1 Bi 8 Lecture 7 PROTEIN STRUCTURE, Functional analysis, and evolution Ellen Rothenberg 26 January 2016 Reading: Ch. 3, pp. 109-134; panel 3-1 (end with free amine) aromatic, hydrophobic small, hydrophilic

More information

Structure formation and association of biomolecules. Prof. Dr. Martin Zacharias Lehrstuhl für Molekulardynamik (T38) Technische Universität München

Structure formation and association of biomolecules. Prof. Dr. Martin Zacharias Lehrstuhl für Molekulardynamik (T38) Technische Universität München Structure formation and association of biomolecules Prof. Dr. Martin Zacharias Lehrstuhl für Molekulardynamik (T38) Technische Universität München Motivation Many biomolecules are chemically synthesized

More information

Proteins Higher Order Structures

Proteins Higher Order Structures Proteins Higher Order Structures Dr. Mohammad Alsenaidy Department of Pharmaceutics College of Pharmacy King Saud University Office: AA 101 msenaidy@ksu.edu.sa Previously on PHT 426!! Protein Structures

More information

Protein 3D Structure Prediction

Protein 3D Structure Prediction Protein 3D Structure Prediction Michael Tress CNIO ?? MREYKLVVLGSGGVGKSALTVQFVQGIFVDE YDPTIEDSYRKQVEVDCQQCMLEILDTAGTE QFTAMRDLYMKNGQGFALVYSITAQSTFNDL QDLREQILRVKDTEDVPMILVGNKCDLEDER VVGKEQGQNLARQWCNCAFLESSAKSKINVN

More information

All Rights Reserved. U.S. Patents 6,471,520B1; 5,498,190; 5,916, North Market Street, Suite CC130A, Milwaukee, WI 53202

All Rights Reserved. U.S. Patents 6,471,520B1; 5,498,190; 5,916, North Market Street, Suite CC130A, Milwaukee, WI 53202 Secondary Structure In the previous protein folding activity, you created a hypothetical 15-amino acid protein and learned that basic principles of chemistry determine how each protein spontaneously folds

More information

2/23/16. Protein-Protein Interactions. Protein Interactions. Protein-Protein Interactions: The Interactome

2/23/16. Protein-Protein Interactions. Protein Interactions. Protein-Protein Interactions: The Interactome Protein-Protein Interactions Protein Interactions A Protein may interact with: Other proteins Nucleic Acids Small molecules Protein-Protein Interactions: The Interactome Experimental methods: Mass Spec,

More information

Ab Initio SERVER PROTOTYPE FOR PREDICTION OF PHOSPHORYLATION SITES IN PROTEINS*

Ab Initio SERVER PROTOTYPE FOR PREDICTION OF PHOSPHORYLATION SITES IN PROTEINS* COMPUTATIONAL METHODS IN SCIENCE AND TECHNOLOGY 9(1-2) 93-100 (2003/2004) Ab Initio SERVER PROTOTYPE FOR PREDICTION OF PHOSPHORYLATION SITES IN PROTEINS* DARIUSZ PLEWCZYNSKI AND LESZEK RYCHLEWSKI BiolnfoBank

More information

PROTEINS & NUCLEIC ACIDS

PROTEINS & NUCLEIC ACIDS Chapter 3 Part 2 The Molecules of Cells PROTEINS & NUCLEIC ACIDS Lecture by Dr. Fernando Prince 3.11 Nucleic Acids are the blueprints of life Proteins are the machines of life We have already learned that

More information

BIRKBECK COLLEGE (University of London)

BIRKBECK COLLEGE (University of London) BIRKBECK COLLEGE (University of London) SCHOOL OF BIOLOGICAL SCIENCES M.Sc. EXAMINATION FOR INTERNAL STUDENTS ON: Postgraduate Certificate in Principles of Protein Structure MSc Structural Molecular Biology

More information

Protein Structure and Function: From Sequence to Consequence 1. From Sequence to Structure

Protein Structure and Function: From Sequence to Consequence 1. From Sequence to Structure Protein Structure and Function: From Sequence to Consequence Gregory A. Petsko and Dagmar Ringe, Brandeis University Published by New Science Press Ltd and distributed in the United States and Canada by

More information

Lecture 2: Central Dogma of Molecular Biology & Intro to Programming

Lecture 2: Central Dogma of Molecular Biology & Intro to Programming Lecture 2: Central Dogma of Molecular Biology & Intro to Programming Central Dogma of Molecular Biology Proteins: workhorse molecules of biological systems Proteins are synthesized from the genetic blueprints

More information

BCH222 - Greek Key β Barrels

BCH222 - Greek Key β Barrels BCH222 - Greek Key β Barrels Reading C.I. Branden and J. Tooze (1999) Introduction to Protein Structure, Second Edition, pp. 77-78 & 335-336 (look at the color figures) J.S. Richardson (1981) "The Anatomy

More information

Learning to Use PyMOL (includes instructions for PS #2)

Learning to Use PyMOL (includes instructions for PS #2) Learning to Use PyMOL (includes instructions for PS #2) To begin, download the saved PyMOL session file, 4kyz.pse from the Chem 391 Assignments web page: http://people.reed.edu/~glasfeld/chem391/assign.html

More information

Protein Bioinformatics PH Final Exam

Protein Bioinformatics PH Final Exam Name (please print) Protein Bioinformatics PH260.655 Final Exam => take-home questions => open-note => please use either * the Word-file to type your answers or * print out the PDF and hand-write your

More information

Overview. Secondary Structure. Tertiary Structure

Overview. Secondary Structure. Tertiary Structure Protein Structure Disclaimer: All information and images were taken from outside sources and the author claims no legal ownership of any material. Sources for images are linked on each slide and the information

More information

Chapter 8. One-Dimensional Structural Properties of Proteins in the Coarse-Grained CABS Model. Sebastian Kmiecik and Andrzej Kolinski.

Chapter 8. One-Dimensional Structural Properties of Proteins in the Coarse-Grained CABS Model. Sebastian Kmiecik and Andrzej Kolinski. Chapter 8 One-Dimensional Structural Properties of Proteins in the Coarse-Grained CABS Model Abstract Despite the significant increase in computational power, molecular modeling of protein structure using

More information

Polypharmacology. Giulio Rastelli Molecular Modelling and Drug Design Lab

Polypharmacology. Giulio Rastelli Molecular Modelling and Drug Design Lab Polypharmacology Giulio Rastelli Molecular Modelling and Drug Design Lab www.mmddlab.unimore.it Dipartimento di Scienze della Vita Università di Modena e Reggio Emilia giulio.rastelli@unimore.it Magic

More information

Genetic Algorithms For Protein Threading

Genetic Algorithms For Protein Threading From: ISMB-98 Proceedings. Copyright 1998, AAAI (www.aaai.org). All rights reserved. Genetic Algorithms For Protein Threading Jacqueline Yadgari #, Amihood Amir #, Ron Unger* # Department of Mathematics

More information

Sequence Databases and database scanning

Sequence Databases and database scanning Sequence Databases and database scanning Marjolein Thunnissen Lund, 2012 Types of databases: Primary sequence databases (proteins and nucleic acids). Composite protein sequence databases. Secondary databases.

More information

Nucleic Acids: DNA and RNA

Nucleic Acids: DNA and RNA Nucleic Acids: DNA and RNA Living organisms are complex systems. Hundreds of thousands of proteins exist inside each one of us to help carry out our daily functions. These proteins are produced locally,

More information

Chapter 3 Nucleic Acids, Proteins, and Enzymes

Chapter 3 Nucleic Acids, Proteins, and Enzymes 3 Nucleic Acids, Proteins, and Enzymes Chapter 3 Nucleic Acids, Proteins, and Enzymes Key Concepts 3.1 Nucleic Acids Are Informational Macromolecules 3.2 Proteins Are Polymers with Important Structural

More information

Product Applications for the Sequence Analysis Collection

Product Applications for the Sequence Analysis Collection Product Applications for the Sequence Analysis Collection Pipeline Pilot Contents Introduction... 1 Pipeline Pilot and Bioinformatics... 2 Sequence Searching with Profile HMM...2 Integrating Data in a

More information

Building an Antibody Homology Model

Building an Antibody Homology Model Application Guide Antibody Modeling: A quick reference guide to building Antibody homology models, and graphical identification and refinement of CDR loops in Discovery Studio. Francisco G. Hernandez-Guzman,

More information

Proteins the primary biological macromolecules of living organisms

Proteins the primary biological macromolecules of living organisms Proteins the primary biological macromolecules of living organisms Protein structure and folding Primary Secondary Tertiary Quaternary structure of proteins Structure of Proteins Protein molecules adopt

More information

STRUCTURAL BIOLOGY. α/β structures Closed barrels Open twisted sheets Horseshoe folds

STRUCTURAL BIOLOGY. α/β structures Closed barrels Open twisted sheets Horseshoe folds STRUCTURAL BIOLOGY α/β structures Closed barrels Open twisted sheets Horseshoe folds The α/β domains Most frequent domain structures are α/β domains: A central parallel or mixed β sheet Surrounded by α

More information

The Integrated Biomedical Sciences Graduate Program

The Integrated Biomedical Sciences Graduate Program The Integrated Biomedical Sciences Graduate Program at the university of notre dame Cutting-edge biomedical research and training that transcends traditional departmental and disciplinary boundaries to

More information

How Do You Clone a Gene?

How Do You Clone a Gene? S-20 Edvo-Kit #S-20 How Do You Clone a Gene? Experiment Objective: The objective of this experiment is to gain an understanding of the structure of DNA, a genetically engineered clone, and how genes are

More information

Protein Bioinformatics Part I: Access to information

Protein Bioinformatics Part I: Access to information Protein Bioinformatics Part I: Access to information 260.655 April 6, 2006 Jonathan Pevsner, Ph.D. pevsner@kennedykrieger.org Outline [1] Proteins at NCBI RefSeq accession numbers Cn3D to visualize structures

More information

Hmwk # 8 : DNA-Binding Proteins : Part II

Hmwk # 8 : DNA-Binding Proteins : Part II The purpose of this exercise is : Hmwk # 8 : DNA-Binding Proteins : Part II 1). to examine the case of a tandem head-to-tail homodimer binding to DNA 2). to view a Zn finger motif 3). to consider the case

More information

RNA does not adopt the classic B-DNA helix conformation when it forms a self-complementary double helix

RNA does not adopt the classic B-DNA helix conformation when it forms a self-complementary double helix Reason: RNA has ribose sugar ring, with a hydroxyl group (OH) If RNA in B-from conformation there would be unfavorable steric contact between the hydroxyl group, base, and phosphate backbone. RNA structure

More information

High Profile Publishing in Molecular Biology. Hélène Hodak, Marina Ostankovitch,

High Profile Publishing in Molecular Biology. Hélène Hodak, Marina Ostankovitch, High Profile Publishing in Molecular Biology Hélène Hodak, h.hodak@elsevier.com Marina Ostankovitch, m.ostankovitch@elsevier.com The field of Molecular Biology, inception to current trends Preparing a

More information

Università degli Studi di Roma La Sapienza

Università degli Studi di Roma La Sapienza Università degli Studi di Roma La Sapienza. Tutore Prof.ssa Anna Tramontano Coordinatore Prof. Marco Tripodi Dottorando Daniel Carbajo Pedrosa Dottorato di Ricerca in Scienze Pasteuriane XXIV CICLO Docente

More information

SMISS: A protein function prediction server by integrating multiple sources

SMISS: A protein function prediction server by integrating multiple sources SMISS 1 SMISS: A protein function prediction server by integrating multiple sources Renzhi Cao 1, Zhaolong Zhong 1 1, 2, 3, *, and Jianlin Cheng 1 Department of Computer Science, University of Missouri,

More information

The Discovery of the Molecular Structure of DNA - The Double Helix

The Discovery of the Molecular Structure of DNA - The Double Helix The Discovery of the Molecular Structure of DNA - The Double Helix A Scientific Breakthrough The sentence "This structure has novel features which are of considerable biological interest" may be one of

More information

Nanobiotechnology. Place: IOP 1 st Meeting Room Time: 9:30-12:00. Reference: Review Papers. Grade: 50% midterm, 50% final.

Nanobiotechnology. Place: IOP 1 st Meeting Room Time: 9:30-12:00. Reference: Review Papers. Grade: 50% midterm, 50% final. Nanobiotechnology Place: IOP 1 st Meeting Room Time: 9:30-12:00 Reference: Review Papers Grade: 50% midterm, 50% final Midterm: 5/15 History Atom Earth, Air, Water Fire SEM: 20-40 nm Silver 66.2% Gold

More information

Exploring Proteins and Nucleic Acids with Jmol

Exploring Proteins and Nucleic Acids with Jmol Exploring Proteins and Nucleic Acids with Jmol Revised December 30, 2017 Jeffrey A. Cohlberg Department of Chemistry and Biochemistry California State University, Long Beach Copyright 2010, Jeffrey A.

More information

Protein Sequence Analysis. BME 110: CompBio Tools Todd Lowe April 19, 2007 (Slide Presentation: Carol Rohl)

Protein Sequence Analysis. BME 110: CompBio Tools Todd Lowe April 19, 2007 (Slide Presentation: Carol Rohl) Protein Sequence Analysis BME 110: CompBio Tools Todd Lowe April 19, 2007 (Slide Presentation: Carol Rohl) Linear Sequence Analysis What can you learn from a (single) protein sequence? Calculate it s physical

More information

Across-proteome modeling of dimer structures for the bottom-up assembly of protein-protein interaction networks

Across-proteome modeling of dimer structures for the bottom-up assembly of protein-protein interaction networks Maheshwari and Brylinski BMC Bioinformatics (2017) 18:257 DOI 10.1186/s12859-017-1675-z RESEARCH ARTICLE Across-proteome modeling of dimer structures for the bottom-up assembly of protein-protein interaction

More information

BETA STRAND Prof. Alejandro Hochkoeppler Department of Pharmaceutical Sciences and Biotechnology University of Bologna

BETA STRAND Prof. Alejandro Hochkoeppler Department of Pharmaceutical Sciences and Biotechnology University of Bologna Prof. Alejandro Hochkoeppler Department of Pharmaceutical Sciences and Biotechnology University of Bologna E-mail: a.hochkoeppler@unibo.it C-ter NH and CO groups: right, left, right (plane of the slide)

More information

DNA and RNA. Chapter 12

DNA and RNA. Chapter 12 DNA and RNA Chapter 12 History of DNA Late 1800 s scientists discovered that DNA is in the nucleus of the cell 1902 Walter Sutton proposed that hereditary material resided in the chromosomes in the nucleus

More information

3D Structure Prediction with Fold Recognition/Threading. Michael Tress CNB-CSIC, Madrid

3D Structure Prediction with Fold Recognition/Threading. Michael Tress CNB-CSIC, Madrid 3D Structure Prediction with Fold Recognition/Threading Michael Tress CNB-CSIC, Madrid MREYKLVVLGSGGVGKSALTVQFVQGIFVDEYDPTIEDSY RKQVEVDCQQCMLEILDTAGTEQFTAMRDLYMKNGQGFAL VYSITAQSTFNDLQDLREQILRVKDTEDVPMILVGNKCDL

More information

Tutorial. In Silico Cloning. Sample to Insight. March 31, 2016

Tutorial. In Silico Cloning. Sample to Insight. March 31, 2016 In Silico Cloning March 31, 2016 Sample to Insight CLC bio, a QIAGEN Company Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.clcbio.com support-clcbio@qiagen.com In Silico Cloning

More information

Kinetics Review. Tonight at 7 PM Phys 204 We will do two problems on the board (additional ones than in the problem sets)

Kinetics Review. Tonight at 7 PM Phys 204 We will do two problems on the board (additional ones than in the problem sets) Quiz 1 Kinetics Review Tonight at 7 PM Phys 204 We will do two problems on the board (additional ones than in the problem sets) I will post the problems with solutions on Toolkit for those that can t make

More information

RNA Secondary Structure Prediction

RNA Secondary Structure Prediction RNA Secondary Structure Prediction Outline 1) Introduction: RNA structure basics 2) Dynamic programming for RNA secondary structure prediction The Central Dogma of Molecular Biology DNA CCTGAGCCAACTATTGATGAA

More information

Structure/function relationship in DNA-binding proteins

Structure/function relationship in DNA-binding proteins PHRM 836 September 22, 2015 Structure/function relationship in DNA-binding proteins Devlin Chapter 8.8-9 u General description of transcription factors (TFs) u Sequence-specific interactions between DNA

More information

Peptide libraries: applications, design options and considerations. Laura Geuss, PhD May 5, 2015, 2:00-3:00 pm EST

Peptide libraries: applications, design options and considerations. Laura Geuss, PhD May 5, 2015, 2:00-3:00 pm EST Peptide libraries: applications, design options and considerations Laura Geuss, PhD May 5, 2015, 2:00-3:00 pm EST Overview 1 2 3 4 5 Introduction Peptide library basics Peptide library design considerations

More information

Protein-Ligand Analysis with SID

Protein-Ligand Analysis with SID Protein-Ligand Analysis with SID A new post-simulation analysis tool Dmitry Lupyan, PhD Understanding molecular processes with MD Drug discovery and design Protein-protein interactions + Protein-DNA interactions

More information

Bio11 Announcements. Ch 21: DNA Biology and Technology. DNA Functions. DNA and RNA Structure. How do DNA and RNA differ? What are genes?

Bio11 Announcements. Ch 21: DNA Biology and Technology. DNA Functions. DNA and RNA Structure. How do DNA and RNA differ? What are genes? Bio11 Announcements TODAY Genetics (review) and quiz (CP #4) Structure and function of DNA Extra credit due today Next week in lab: Case study presentations Following week: Lab Quiz 2 Ch 21: DNA Biology

More information

Advanced Level Biology BRIDGING WORK

Advanced Level Biology BRIDGING WORK Advanced Level Biology BRIDGING WORK The bridging work MUST be completed for each of your Advanced Level subjects by the time you start your course. Your work will be assessed in September. Anyone not

More information

B CELL EPITOPES AND PREDICTIONS

B CELL EPITOPES AND PREDICTIONS B CELL EPITOPES AND PREDICTIONS OUTLINE What is a B-cell epitope? How can you predict B-cell epitopes? WHAT IS A B-CELL EPITOPE? B-cell epitopes: Accessible structural feature of a pathogen molecule. Antibodies

More information

Chapter 8 DNA Recognition in Prokaryotes by Helix-Turn-Helix Motifs

Chapter 8 DNA Recognition in Prokaryotes by Helix-Turn-Helix Motifs Chapter 8 DNA Recognition in Prokaryotes by Helix-Turn-Helix Motifs 1. Helix-turn-helix proteins 2. Zinc finger proteins 3. Leucine zipper proteins 4. Beta-scaffold factors 5. Others λ-repressor AND CRO

More information

Homology modeling and structural validation of tissue factor pathway inhibitor

Homology modeling and structural validation of tissue factor pathway inhibitor www.bioinformation.net Hypothesis Volume 9(16) Homology modeling and structural validation of tissue factor pathway inhibitor Piyush Agrawal, Zoozeal Thakur & Mahesh Kulharia* Centre for Bioinformatics,

More information

Molecular Forces in Antibody Maturation*

Molecular Forces in Antibody Maturation* Molecular Forces in Antibody Maturation* Melik Demirel 1,2 1 Allen Pearce Assistant Professor, College of Engineering, The Pennsylvania State University, University Park, PA, USA, E-mail: mcd18@psu.edu

More information

Amino Acids. Amino Acid Structure

Amino Acids. Amino Acid Structure Amino Acids Pratt & Cornely Chapter 4 Alpha carbon Sidechain Proteins peptides Amino Acid Structure 1 L amino acids Glycine R/S vs D/L L isoleucine racemization Stereochemisty Common Amino Acids 2 Which

More information

MATH 5610, Computational Biology

MATH 5610, Computational Biology MATH 5610, Computational Biology Lecture 2 Intro to Molecular Biology (cont) Stephen Billups University of Colorado at Denver MATH 5610, Computational Biology p.1/24 Announcements Error on syllabus Class

More information

Algorithms in Bioinformatics

Algorithms in Bioinformatics Algorithms in Bioinformatics Sami Khuri Department of Computer Science San José State University San José, California, USA khuri@cs.sjsu.edu www.cs.sjsu.edu/faculty/khuri Outline Central Dogma of Molecular

More information

Exploring genomes - A Treasure Hunt on the Internet

Exploring genomes - A Treasure Hunt on the Internet Exploring genomes - A Treasure Hunt on the Internet An activity developed by staff of the European Bioinformatics Institute (EMBL-EBI) and suitable for use with students aged from 16-18 years old. Introduction

More information

PROTEIN MODEL CHALLANGE

PROTEIN MODEL CHALLANGE PROTEIN MODEL CHALLANGE Lin Wozniewski lwoz@iun.edu Disclaimer i This presentation was prepared using draft rules. There may be some changes in the final copy of the rules. The rules which will be in your

More information

The common structure of a DNA nucleotide. Hewitt

The common structure of a DNA nucleotide. Hewitt GENETICS Unless otherwise noted* the artwork and photographs in this slide show are original and by Burt Carter. Permission is granted to use them for non-commercial, non-profit educational purposes provided

More information

CHAPTER 17 FROM GENE TO PROTEIN. Section C: The Synthesis of Protein

CHAPTER 17 FROM GENE TO PROTEIN. Section C: The Synthesis of Protein CHAPTER 17 FROM GENE TO PROTEIN Section C: The Synthesis of Protein 1. Translation is the RNA-directed synthesis of a polypeptide: a closer look 2. Signal peptides target some eukaryotic polypeptides to

More information

Teaching Principles of Enzyme Structure, Evolution, and Catalysis Using Bioinformatics

Teaching Principles of Enzyme Structure, Evolution, and Catalysis Using Bioinformatics KBM Journal of Science Education (2010) 1 (1): 7-12 doi: 10.5147/kbmjse/2010/0013 Teaching Principles of Enzyme Structure, Evolution, and Catalysis Using Bioinformatics Pablo Sobrado Department of Biochemistry,

More information

Exam MOL3007 Functional Genomics

Exam MOL3007 Functional Genomics Faculty of Medicine Department of Cancer Research and Molecular Medicine Exam MOL3007 Functional Genomics Tuesday May 29 th 9.00-13.00 ECTS credits: 7.5 Number of pages (included front-page): 5 Supporting

More information

Plant Molecular and Cellular Biology Lecture 9: Nuclear Genome Organization: Chromosome Structure, Chromatin, DNA Packaging, Mitosis Gary Peter

Plant Molecular and Cellular Biology Lecture 9: Nuclear Genome Organization: Chromosome Structure, Chromatin, DNA Packaging, Mitosis Gary Peter Plant Molecular and Cellular Biology Lecture 9: Nuclear Genome Organization: Chromosome Structure, Chromatin, DNA Packaging, Mitosis Gary Peter 9/16/2008 1 Learning Objectives 1. List and explain how DNA

More information

NUCLEIC ACIDS Genetic material of all known organisms DNA: deoxyribonucleic acid RNA: ribonucleic acid (e.g., some viruses)

NUCLEIC ACIDS Genetic material of all known organisms DNA: deoxyribonucleic acid RNA: ribonucleic acid (e.g., some viruses) NUCLEIC ACIDS Genetic material of all known organisms DNA: deoxyribonucleic acid RNA: ribonucleic acid (e.g., some viruses) Consist of chemically linked sequences of nucleotides Nitrogenous base Pentose-

More information

Cell Research at the Molecular Level

Cell Research at the Molecular Level Image Credits All images in this lesson were created by or for Alberta Education with the following noted exceptions: Page 29 Photodisc/Getty Images 30 Photodisc/Getty Images 31 2004 2005 www.clipart.com

More information

ONLINE BIOINFORMATICS RESOURCES

ONLINE BIOINFORMATICS RESOURCES Dedan Githae Email: d.githae@cgiar.org BecA-ILRI Hub; Nairobi, Kenya 16 May, 2014 ONLINE BIOINFORMATICS RESOURCES Introduction to Molecular Biology and Bioinformatics (IMBB) 2014 The larger picture.. Lower

More information

From Gene to Drug: A Proof of Concept for a Plausible Computational Pathway

From Gene to Drug: A Proof of Concept for a Plausible Computational Pathway From Gene to Drug: A Proof of Concept for a Plausible Computational Pathway Sandhya Shenoy, Jayaram, B, Latha, N, Pooja Narang, Tarun Jain, Kumkum Bhushan, Saher Afshan Shaikh, Surojit Bose, Pankaj Sharma,

More information

TUTORIAL: PCR ANALYSIS AND PRIMER DESIGN

TUTORIAL: PCR ANALYSIS AND PRIMER DESIGN C HAPTER 8 TUTORIAL: PCR ANALYSIS AND PRIMER DESIGN Introduction This chapter introduces you to tools for designing and analyzing PCR primers and procedures. At the end of this tutorial session, you will

More information

Understanding DNA Structure

Understanding DNA Structure Understanding DNA Structure I619 Structural Bioinformatics Molecular Biology Basics + Scale total length of DNA in a human cell is about 2m DNA is compacted in length by a factor of 10000 the compaction

More information

Ranking Beta Sheet Topologies of Proteins

Ranking Beta Sheet Topologies of Proteins , October 20-22, 2010, San Francisco, USA Ranking Beta Sheet Topologies of Proteins Rasmus Fonseca, Glennie Helles and Pawel Winter Abstract One of the challenges of protein structure prediction is to

More information

Distributions of Beta Sheets in Proteins with Application to Structure Prediction

Distributions of Beta Sheets in Proteins with Application to Structure Prediction Distributions of Beta Sheets in Proteins with Application to Structure Prediction Ingo Ruczinski Department of Biostatistics Johns Hopkins University Email: ingo@jhu.edu http://biostat.jhsph.edu/ iruczins

More information

Combing molecular dynamics and machine learning for advanced pose and free-energy prediction

Combing molecular dynamics and machine learning for advanced pose and free-energy prediction Combing molecular dynamics and machine learning for advanced pose and free-energy prediction Oleksandr Yakovenko (Alex), PhD Chief Technical Officer IFOWON.CO (a spinoff from BC Cancer Agency) Vancouver,

More information

Protein and RNA Review

Protein and RNA Review Protein and RNA Review Protein and RNA Review 1 Proteins: Introduction Proteins are the versatile building blocks and active molecules that form the basis of living systems. Function follows structure

More information

Virtual bond representation

Virtual bond representation Today s subjects: Virtual bond representation Coordination number Contact maps Sidechain packing: is it an instrumental way of selecting and consolidating a fold? ASA of proteins Interatomic distances

More information

ProtPlot A Tissue Molecular Anatomy Program Java-based Data Mining Tool: Screen Shots

ProtPlot A Tissue Molecular Anatomy Program Java-based Data Mining Tool: Screen Shots ProtPlot A Tissue Molecular Anatomy Program Java-based Data Mining Tool: Screen Shots **** DRAFT - undergoing revision **** Peter F. Lemkin, Ph.D. (1), Djamel Medjahed Ph.D. (2) 1 NCI-Frederick; 2 SAIC-Frederick,

More information

Protein Synthesis Notes

Protein Synthesis Notes Protein Synthesis Notes Protein Synthesis: Overview Transcription: synthesis of mrna under the direction of DNA. Translation: actual synthesis of a polypeptide under the direction of mrna. Transcription

More information