[4] SCHEMA-Guided Protein Recombination
|
|
- Chastity Baker
- 6 years ago
- Views:
Transcription
1 [4] SCHEMA-guided protein recombination 35 [4] SCHEMA-Guided Protein Recombination By Jonathan J. Silberg, Jeffrey B. Endelman, and Frances H. Arnold Introduction SCHEMA is a scoring function that predicts which elements in homologous proteins can be swapped without disturbing the integrity of the structure. 1 Using the structural coordinates of the parent proteins, SCHEMA identifies pairs of residues that are interacting and determines the number of interactions, E, that are broken when a chimeric protein inherits portions of its sequence from different parents. E appears to be a good metric for anticipating structural conservation when homologous proteins are recombined. Analysis of well-defined libraries of -lactamase chimeras revealed that chimeras with low E retained function with higher probability than chimeras with the same effective level of mutation but higher E or chosen at random. 2 Another study also showed that E is a useful measure for anticipating disruption in chimeras of a larger, cofactor-containing protein, cytochrome P Using SCHEMA, libraries of chimeras can be compared in silico to determine which one is expected to contain the highest fraction of folded (and potentially interesting) sequences for laboratory evolution studies. 2 4 These libraries can be synthesized in vitro using site-directed recombination methods (see Fig. 1), which allow for the simultaneous recombination of two or more parents at specified locations. 2,5 This approach can be used to make chimeric libraries from any parent sequences. In addition, the sequence diversity of folded and functional chimeras encoded in the library can be controlled, i.e., the number of possible unique sequences and the average level of mutation of chimeras predicted to retain structure, can be 1 C. A. Voigt, C. Martinez, Z. G. Wang, S. L. Mayo, and F. H. Arnold, Nature Struct. Biol. 9, 553 (2002). 2 M. M. Meyer, J. J. Silberg, C. A. Voigt, J. B. Endelman, S. L. Mayo, Z. G. Wang, and F. H. Arnold, Protein Sci. 12, 1686 (2003). 3 C. R. Otey, J. J. Silberg, C. A. Voigt, J. B. Endelman, G. Bandara, and F. H. Arnold, Chem. Biol. 11, 309 (2004). 4 D. A. Drummond, J. J. Silberg, J. B. Endelman, C. A. Wilke, and F. H. Arnold, submitted for publication. 5 K. Hiraga and F. H. Arnold, J. Mol. Biol. 330, 287 (2003). Copyright 2004, Elsevier Inc. All rights reserved. METHODS IN ENZYMOLOGY, VOL /04 $35.00
2 36 methodology [4] Fig. 1. Library synthesis by site-directed recombination. Sequence elements encoding structurally related polypeptides are swapped at defined locations (dashed lines) in two or more homologous proteins. This yields a library containing h y h unique chimeras, where h is the number of parents recombined and y is the number of sequence elements that are exchanged. used to guide the selection of crossover locations and crossover number. In contrast, annealing-based recombination or DNA shuffling techniques, such as Stemmer shuffling, 6,7 StEP, 8 and in vivo methods, 9 generate crossovers only in regions of sequence identity and therefore can not generate diverse libraries from more distant parent sequences. The sequenceindependent random recombination methods now available (SHIPREC, 10 ITCHY, 11 or SCRATCHY 12 ) do not make multiple crossovers efficiently and therefore create libraries of very limited diversity. This article outlines the procedure used for calculating E for a chimera and discusses ideas for optimizing the design of combinatorial libraries for directed evolution. 6 W. P. Stemmer, Proc. Natl. Acad. Sci. USA 91, (1994). 7 W. P. Stemmer, Nature 370, 389 (1994). 8 H. Zhao, L. Giver, Z. Shao, J. A. Affholter, and F. H. Arnold, Nature Biotechnol. 16, 258 (1998). 9 A. A. Volkov, Z. Shao, and F. H. Arnold, Nucleic Acids Res. 27, e18 (1999). 10 V. Sieber, A. Pluckthun, and F. X. Schmid, Nature Biotechnol. 16, 955 (1998). 11 M. Ostermeier, A. E. Nixon, and S. J. Benkovic, Bioorg. Med. Chem. 7, 2139 (1999). 12 S. Lutz, M. Ostermeier, G. L. Moore, C. D. Maranas, and S. J. Benkovic, Proc. Natl. Acad. Sci. USA 98, (2001).
3 [4] SCHEMA-guided protein recombination 37 Methods Calculating SCHEMA Disruption Based on the structure of the parent proteins, SCHEMA determines which residues are interacting, defined as those residues within a cutoff distance, and generates a contact matrix. 1 When recombining two parents, the contacts are scaled by the sequence identity of the parents being recombined, i.e., all contacts that cannot be broken by recombination are removed from the matrix. E is determined by counting the number of contacts broken when a chimeric protein inherits portions of its sequence from different parents. The SCHEMA disruption E of a chimeric sequence s, made by recombining sequence elements from h homologous proteins, is given by E ¼ XN X N C ij Pði; j; s i ; s j Þ; (1) i¼1 j¼iþ1 where N is the number of residues that have defined coordinates in the parental structure, C ij ¼ 1 if residues i and j are within the cutoff distance d c (otherwise C ij ¼ 0), and s i designates the parent incorporated at position i in the chimera (e.g., s i ¼ 1 if the sequence is derived from parent #1, s i ¼ 2if derived from parent #2, etc.). A(s i, k) is the identity of the residue in parent s i at position k within the parental amino acid sequence, and P(i, j, s i, s j ) ¼ 0 if any parent has residue A(s i, i) at position i and residue A(s j, j) at position j [otherwise P(i, j, s i, s j ) ¼ 1]. It is essential that structurally related residues in each parent are numbered identically, e.g., A(1, k) and A(2, k) should represent structurally related residues in each parent, to ensure that sequence identities among the parents are properly accounted for when calculating E. Typically, we use d c ¼ 4.5 Å, and hydrogen, backbone nitrogen, and backbone oxygen atoms are excluded from the calculation of E. Small deviations from this value of d c or the use of all atoms to calculate C ij does not significantly affect the relative E of chimeras being compared, although the magnitude of E changes. When cofactor-containing proteins are recombined, contacts between the cofactor and residues in the proteins are also excluded from calculation of E. 3 In this simple model, contacts between the cofactor and the protein cannot be broken by the recombination of related proteins. Ideally, PDB coordinates for all the parent sequences are available, and a structure-based alignment is performed. For parents whose sequences differ in length, this ensures that structurally related residues in each parent are numbered identically. This can be done using free software packages
4 38 methodology [4] Fig. 2. Treatment of gaps in sequence alignments. A hypothetical sequence alignment used in SCHEMA calculations is shown. PDB coordinates corresponding to the top sequence are being used by SCHEMA to calculate C ij, and the bottom sequence represents a homologous protein for which no structural information is available. At position 1, the atomic coordinates of glycine are defined in the PDB file, so the gap in the second parent is treated as a mutation relative to G when computing E and m. Because there are no coordinates for position 2, it is ignored when computing E and m. such as SwissProt or the combinatorial-extension algorithm. 13,14 If structural coordinates are available for different conformational states of the parents being recombined, it is best to assess E using the coordinates for each conformation to ensure that both states of the chimeras are likely to exhibit similar low disruption. When the structure of only one parent is available, sequence alignments can be performed using the BLAST algorithm. 15 Often alignment of the parents requires the insertion of gaps within the primary amino acid sequence of one or more of the parents (see Fig. 2). When gaps are introduced into the parent whose structural coordinates are being used to generate the contact matrix C ij, the residues found in the other parents are ignored when calculating E because there is no corresponding structural information. In contrast, when gaps occur in any parent other than the one used for structural information, they are treated like real residues that differ in identity from the residues in the other parents. From Disruption to Probabilities The fraction of chimeras retaining function has been found to decrease exponentially with E. 2 If we posit that any disrupted contact has a probability f d of yielding a nonfunctional chimera and each contact acts statistically independently of the others, the fraction of chimeras at each E predicted to retain function is given by P f ¼ (1 f d E/N) N. In this case, N equals the total number of contacts that could be broken by recombination. When N becomes large, as with proteins, this equation can be approximated by a simple exponential, P f ¼ e fde. This relationship between E and P f is likely to hold for the recombination of any homologous proteins. However, functional data from different chimeric libraries may yield a range of f d values. We have found that the sensitivity of the 13 N. Guex and M. C. Peitsch, Electrophoresis 18, 2714 (1997). 14 I. N. Shindyalov and P. E. Bourne, Protein Eng. 11, 739 (1998). 15 S. Henikoff and J. G. Henikoff, Proc. Natl. Acad. Sci. USA 89, (1992).
5 [4] SCHEMA-guided protein recombination 39 functional assay alone can significantly affect the P f at each E. A more sensitive functional screen for the conservation of lactamase function, for example, identified more functional chimeras and yielded a higher value of P f at each E (and lower f d ) than a functional selection. 2,5 We also expect f d to depend on the protein scaffold. The value of f d can be calibrated rapidly for any system by analyzing folding or function of a small population of chimeras with known E. 3 From approximately 10 to 20 chimeras that encompass a broad range of disruption, we have observed that those exhibiting the highest levels of E are mostly nonfunctional and those with the lowest E are almost all functional. From such a population of chimeras, the E where chimeras exhibit a P f ¼ 0.5 (designated E 1/2 ) can be estimated, and the f d can be calculated, f d ¼ lnð2þ=e 1=2. SCHEMA-Guided Library Design Because screening and selection strategies can evaluate a limited number of protein variants for altered functions, we would like to make libraries that are enriched in folded chimeras. SCHEMA can help identify such libraries by computing the fraction of chimeras F expected to retain the parental function (and fold) in different libraries arising from recombination of the same parents at different crossover locations. The fraction of folded chimeras in a library is given by F ¼ 1 n X n i¼1 e f de i ; (2) where n is the number of unique chimeras present. Libraries with the highest possible F are not necessarily preferred for directed evolution. An additional issue to consider when choosing a library is the sequence diversity D of the functional chimeras in that library. The D of a library describes the average mutation level m of folded chimeras in that library, where m is the amino acid Hamming distance of each chimera to its closest parent. D is calculated from D ¼ 1 1 X n m i e f de i : (3) F n i¼1 Unfortunately, we know little about the effect of m on the evolution of function. Studies examining the effect of m on the acquisition of novel functional properties in laboratory evolution studies suggest that variants with higher m are more likely to exhibit altered functional properties, whereas those with lower m tend to be more similar to the parents. 3,16,17
6 40 methodology [4] In the case of cytochrome P450 chimeras, chimeras exhibiting altered substrate specificity had an average m of 34, and chimeras that displayed a parent-like substrate specificity had an average m of Because E also depends on m, there is a trade-off between F and D. Chimeras with low E on average also have low m; therefore, libraries that simply maximize F have limited diversity. If one can enumerate all possible libraries, then the library design problem is easy: simply choose the one with the desired combination of D and F. Exhaustive enumeration, however, is not feasible with today s computers for average-sized proteins and the library sizes appropriate for directed evolution (thousands or more chimeras). Consider the design of a 10 crossover library between the -lactamases PSE-4 and TEM-1, which have similar overall structure, similar length (265 residues), and 40% sequence identity. There are possible libraries, each with 2 11 ¼ 2048 unique chimeric sequences too many for exhaustive enumeration. In this case, one can make progress by evaluating D and F for thousands, or even millions of random libraries. Figure 3 shows the F and D for 50,000 randomly chosen libraries. Here we observe libraries with diversities ranging from 7 to 41 (mutations per folded sequence) and F values from 0.5 to 7%. With SCHEMA we can identify the library with the highest F at a particular level of D. For the random sample in Fig. 3, this increases the number of folded chimeras up to fivefold compared to the average for libraries with the same diversity. The random enumeration in Fig. 3 sampled a limited region of the (F, D) plane. Alternate enumeration schemes, e.g., enforcing minimum and maximum fragment sizes, can rapidly explore different combinations of F and D. Choosing different parents will also affect which regions of the (F, D) plane are accessible, even if the parents exhibit similar levels of sequence identities. Decreasing the sequence identity of the parents will, in general, lead to libraries with higher D but lower F values. The same trend is expected if one increases the number of parents recombined. Practical Considerations in Library Design The best methods available for creating the libraries identified by SCHEMA as enriched in folded chimeras are sequence-independent sitedirected chimeragenesis and chemical synthesis. 2,5 These techniques can recombine parents with any level of sequence identity at multiple sites and easily create libraries encoding thousands of chimeras. However, these 16 M. Zaccolo and E. Gherardi, J. Mol. Biol. 285, 775 (1999). 17 P. S. Daugherty, G. Chen, B. L. Iverson, and G. Georgiou, Proc. Natl. Acad. Sci. USA 97, 2029 (2000).
7 [4] SCHEMA-guided protein recombination 41 Fig. 3. SCHEMA analysis of computed libraries. Randomly chosen libraries made by allowing 10 crossovers between -lactamases TEM-1 and PSE-4 were compared. The crossover locations in each library were enumerated by choosing 10 distinct, random integers between 1 and 153 (the length of the structural alignment minus the number of conserved residues minus 1). Each integer k represents a crossover between k th and (k þ 1) th nonidentical residues. This was repeated until 50,000 ten-integer sequences were generated. The structural coordinates of PSE-4 (PDB ¼ 1G68) 18 were used to compute E for every chimera present in each library. F and D (rounded to the nearest integer) were computed for each library according to Eqs. (2) and (3) with f d ¼ The average F at each value of D is shown as a solid line. methods are limited in where they can recombine distantly related proteins without introducing amino acid sequence changes not found in either parent. This happens because both techniques generate chimeras by ligating double-stranded gene modules together (encoding each swapped polypeptide), which requires that parents exhibit 2 to 4 bp of identity at the crossover boundaries. When the parents do not exhibit sufficient identity at the desired crossover locations, synonymous mutations can often be introduced to allow recombination at that site. If synonymous mutations are not sufficient, then other crossover positions should be chosen for the library. SCHEMA only calculates the disruption arising from recombination, not from mutation. 18 D. Lim, F. Sanschagrin, L. Passmore, L. De Castro, R. C. Levesque, and N. C. Strynadka, Biochemistry 40, 395 (2001).
8 42 methodology [5] Program Availability Software for running SCHEMA calculations is available at the Arnold group web site at the California Institute of Technology ( caltech.edu/groups/fha/). [5] Staggered Extension Process In Vitro DNA Recombination By Huimin Zhao Introduction In vitro DNA recombination is an extremely powerful approach for the directed evolution of proteins and nucleic acids. Unlike random mutagenesis methods in which point mutations are introduced randomly into a single parent sequence to produce a library of progeny sequences, DNA recombination methods entail the block-wise exchange of genetic variations among multiple parent sequences created in the laboratory or existing in nature to produce a library of chimeric progeny sequences. The key advantage of DNA recombination is its ability to accumulate beneficial mutations while simultaneously removing deleterious mutations, which may greatly accelerate the evolution of a protein or nucleic acid molecule of interest toward a specific function. It was demonstrated in computational simulation studies that DNA recombination plays a critical role in the evolution of biological systems. 1 In the past decade, in vitro DNA recombination has been used successfully to alter and engineer many types of protein function, such as stability, activity, affinity, selectivity, substrate specificity, and protein folding/solubility. 2,3 The first described in vitro DNA recombination method, or DNA shuffling, was developed by Stemmer in 1994, 4,5 in which DNA fragments generated by the random digestion of parent genes with DNase I are combined and reassembled into full-length chimeric progeny genes in a polymerase chain reaction (PCR)-like process. Since then, a number of in vitro DNA recombination methods have been described, 6 such as 1 S. Forrest, Science 261, 872 (1993). 2 O. Kuchner and F. H. Arnold, Trends Biotechnol. 15, 523 (1997). 3 C. Schmidt-Dannert, Biochemistry 40, (2001). 4 W. P. Stemmer, Proc. Natl. Acad. Sci. USA 91, (1994). 5 W. P. Stemmer, Nature 370, 389 (1994). Copyright 2004, Elsevier Inc. All rights reserved. METHODS IN ENZYMOLOGY, VOL /04 $35.00
On the conservative nature of intragenic recombination
On the conservative nature of intragenic recombination D. Allan Drummond*, Jonathan J. Silberg, Michelle M. Meyer, Claus O. Wilke*, and Frances H. Arnold ** *Program in Computation and Neural Systems,
More informationModule 2 overview SPRING BREAK
1 Module 2 overview lecture lab 1. Introduction to the module 1. Start-up protein eng. 2. Rational protein design 2. Site-directed mutagenesis 3. Fluorescence and sensors 3. DNA amplification 4. Protein
More informationChapter VI: Mutagenesis to Restore Chimera Function
99 Chapter VI: Mutagenesis to Restore Chimera Function Introduction Identifying characteristics of functional and nonfunctional chimeras is one way to address the underlying reasons for why chimeric proteins
More informationModule 2 overview SPRING BREAK
1 Module 2 overview lecture lab 1. Introduction to the module 1. Start-up protein eng. 2. Rational protein design 2. Site-directed mutagenesis 3. Fluorescence and sensors 3. DNA amplification 4. Protein
More informationIn Vitro DNA Recombination by Random Priming
DNA Recombination by Random Priming 99 13 In Vitro DNA Recombination by Random Priming Olga Esteban, Ryan D. Woodyer, and Huimin Zhao 1. Introduction Variation coupled to selection is the hallmark of natural
More informationCombinatorial Recombination of Gene Fragments to Construct a Library of Chimeras
Combinatorial Recombination of Gene Fragments to Construct a Library of Chimeras ABSTRACT Recombination of distantly related and nonrelated genes is difficult using traditional PCR-based techniques, and
More informationENZYME CATALYSTS FOR A BIOTECHNOLOGY-BASED CHEMICAL INDUSTRY
ENZYME CATALYSTS FOR A BIOTECHNOLOGY-BASED CHEMICAL INDUSTRY Professor Frances H. Arnold California Institute of Technology DOE Contract DE-FG02-93CH10578 Quarterly Progress Report, September 29 - December
More informationEvolving Affinity Enhanced Antibodies: High Mutational Rates for Discovery of Coupled Mutations
Evolving Affinity Enhanced Antibodies: High Mutational Rates for Discovery of Coupled Mutations Introduction The expanding market for antibody therapeutics demonstrates the importance of protein engineering.
More informationProtein engineering-why?
PROTEIN ENGINEERING Protein engineering-why? Enhance stability/function under new conditions temperature, ph, organic/aqueous solvent, [salt] Alter enzyme substrate specificity Enhance enzymatic rate Alter
More informationDirected evolution methods use the process of natural selection
Predicting crossover generation in DNA shuffling Gregory L. Moore*, Costas D. Maranas*, Stefan Lutz, and Stephen J. Benkovic *Department of Chemical Engineering, 112A Fenske Laboratory, and Department
More informationRecombination and chimeragenesis by in vitro heteroduplex formation and in vivo repair
1999 Oxford University Press Nucleic Acids Research, 1999, Vol. 27, No. 18 e18 Recombination and chimeragenesis by in vitro heteroduplex formation and in vivo repair Alexander A. Volkov, Zhixin Shao and
More informationSite-directed protein recombination as a shortest-path problem
Protein Engineering, Design & Selection vol. 17 no. 7 pp. 589 594, 2004 Published online August 25, 2004 doi:10.1093/protein/gzh067 Site-directed protein recombination as a shortest-path problem Jeffrey
More information[5] Staggered Extension Process In Vitro DNA Recombination
42 methodology [5] Program Availability Software for running SCHEMA calculations is available at the Arnold group web site at the California Institute of Technology (http://www.che. caltech.edu/groups/fha/).
More informationStructural Bioinformatics (C3210) Conformational Analysis Protein Folding Protein Structure Prediction
Structural Bioinformatics (C3210) Conformational Analysis Protein Folding Protein Structure Prediction Conformational Analysis 2 Conformational Analysis Properties of molecules depend on their three-dimensional
More informationChapter 17. Prediction, Engineering, and Design of Protein Structures
Chapter 17. Prediction, Engineering, and Design of Protein Structures Homologous proteins have similar structure and function Branden & Tooze (1998), Introduction to protein structure, 2 nd ed., p.349.
More informationDesign. Construction. Characterization
Design Construction Characterization DNA mrna (messenger) A C C transcription translation C A C protein His A T G C T A C G Plasmids replicon copy number incompatibility selection marker origin of replication
More informationModule 2 overview! SPRING BREAK!
1! Module 2 overview! lecture!!!!!lab! 1. Introduction to the module!!1. Start-up protein eng.! 2. Rational protein design!!2. Site-directed mutagenesis! 3. Fluorescence and sensors!!3. DNA amplification!
More informationCreation of a PAM matrix
Rationale for substitution matrices Substitution matrices are a way of keeping track of the structural, physical and chemical properties of the amino acids in proteins, in such a fashion that less detrimental
More informationAlgorithms for optimizing cross-overs in DNA shuffling
Algorithms for optimizing cross-overs in DNA shuffling Lu He Dept. of Computer cience Dartmouth College 6211 udikoff Laboratory Hanover, NH 03755, UA lu.he@dartmouth.edu Alan M. Friedman Dept. of Biological
More informationSequence homology-dependent methods for recombining
Creating multiple-crossover DNA libraries independent of sequence identity Stefan Lutz*, Marc Ostermeier, Gregory L. Moore, Costas D. Maranas, and Stephen J. Benkovic* Department of Chemical Engineering,
More informationRecombinant DNA recombinant DNA DNA cloning gene cloning
DNA Technology Recombinant DNA In recombinant DNA, DNA from two different sources, often two species, are combined into the same DNA molecule. DNA cloning permits production of multiple copies of a specific
More informationBi 8 Lecture 4. Ellen Rothenberg 14 January Reading: from Alberts Ch. 8
Bi 8 Lecture 4 DNA approaches: How we know what we know Ellen Rothenberg 14 January 2016 Reading: from Alberts Ch. 8 Central concept: DNA or RNA polymer length as an identifying feature RNA has intrinsically
More informationLecture Four. Molecular Approaches I: Nucleic Acids
Lecture Four. Molecular Approaches I: Nucleic Acids I. Recombinant DNA and Gene Cloning Recombinant DNA is DNA that has been created artificially. DNA from two or more sources is incorporated into a single
More informationProtein Structure Prediction
Homology Modeling Protein Structure Prediction Ingo Ruczinski M T S K G G G Y F F Y D E L Y G V V V V L I V L S D E S Department of Biostatistics, Johns Hopkins University Fold Recognition b Initio Structure
More informationGenetics Lecture 21 Recombinant DNA
Genetics Lecture 21 Recombinant DNA Recombinant DNA In 1971, a paper published by Kathleen Danna and Daniel Nathans marked the beginning of the recombinant DNA era. The paper described the isolation of
More informationApplicazioni biotecnologiche
Applicazioni biotecnologiche Analisi forense Sintesi di proteine ricombinanti Restriction Fragment Length Polymorphism (RFLP) Polymorphism (more fully genetic polymorphism) refers to the simultaneous occurrence
More informationBi 8 Lecture 5. Ellen Rothenberg 19 January 2016
Bi 8 Lecture 5 MORE ON HOW WE KNOW WHAT WE KNOW and intro to the protein code Ellen Rothenberg 19 January 2016 SIZE AND PURIFICATION BY SYNTHESIS: BASIS OF EARLY SEQUENCING complex mixture of aborted DNA
More informationChapter IV: Construction and Characterization of Site- Directed Recombination Libraries
50 Chapter IV: Construction and Characterization of Site- Directed Recombination Libraries Introduction There are several different methods for creating protein chimeras. By far the simplest technique
More informationX ray structure data Molecular dynamics simulation Sequence homology Educated guessing. Rational. Protein design
Biocatalyst Design Rational X ray structure data Molecular dynamics simulation Sequence homology Educated guessing Protein design Random Do enough mistakes, so many time you are bound to get it right some
More informationRandom Field Model Reveals Structure of the Protein Recombinational Landscape
Random Field Model Reveals Structure of the Protein Recombinational Landscape Philip A. Romero, Frances H. Arnold* Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena,
More informationStructure-guided SCHEMA recombination of distantly related b-lactamases
Protein Engineering, Design & Selection vol. 19 no. 12 pp. 563 570, 2006 Published online November 6, 2006 doi:10.1093/protein/gzl045 Structure-guided SCHEMA recombination of distantly related b-lactamases
More informationSequence Analysis Lab Protocol
Sequence Analysis Lab Protocol You will need this handout of instructions The sequence of your plasmid from the ABI The Accession number for Lambda DNA J02459 The Accession number for puc 18 is L09136
More informationTala Saleh. Tamer Barakat ... Anas Abu. Humaidan
7 Tala Saleh Tamer Barakat... Anas Abu. Humaidan Some Information in this lecture may not be mentioned by the Dr. as thoroughly as this sheet. But they cannot be overlooked for a better understanding,
More informationConcepts: What are RFLPs and how do they act like genetic marker loci?
Restriction Fragment Length Polymorphisms (RFLPs) -1 Readings: Griffiths et al: 7th Edition: Ch. 12 pp. 384-386; Ch.13 pp404-407 8th Edition: pp. 364-366 Assigned Problems: 8th Ch. 11: 32, 34, 38-39 7th
More informationRegulation of enzyme synthesis
Regulation of enzyme synthesis The lac operon is an example of an inducible operon - it is normally off, but when a molecule called an inducer is present, the operon turns on. The trp operon is an example
More informationRecitation CHAPTER 9 DNA Technologies
Recitation CHAPTER 9 DNA Technologies DNA Cloning: General Scheme A cloning vector and eukaryotic chromosomes are separately cleaved with the same restriction endonuclease. (A single chromosome is shown
More informationThis place covers: Methods or systems for genetic or protein-related data processing in computational molecular biology.
G16B BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY Methods or systems for genetic
More informationBiocatalysis. Amanda Garner September 14, Trypsin Aminotransferase. RNA Ligase
Biocatalysis TIFF (Uncompressed) decompressortiff (Uncompressed) decompressor TIFF (Uncompressed) decompressor RNA Ligase Trypsin Aminotransferase Amanda Garner September 14, 2007 All enzyme structures
More informationGenetic Engineering for Biofuels Production
Genetic Engineering for Biofuels Production WSE 573 Spring 2013 Greeley Beck INTRODUCTION Alternative transportation fuels are needed in the United States because of oil supply insecurity, oil price increases,
More informationGenetic Algorithm for Predicting Protein Folding in the 2D HP Model
Genetic Algorithm for Predicting Protein Folding in the 2D HP Model A Parameter Tuning Case Study Eyal Halm Leiden Institute of Advanced Computer Science, University of Leiden Niels Bohrweg 1 2333 CA Leiden,
More informationBuilding DNA Libraries to Explore the Combinatorial Design Space. Dr. Yifan Li Senior Scientist GenScript USA Inc.
Building DNA Libraries to Explore the Combinatorial Design Space Dr. Yifan Li Senior Scientist GenScript USA Inc. Outline Combinatorial Optimization for Metabolic Pathway Engineering Case Studies for Combinatorial
More informationThe GeneEditor TM in vitro Mutagenesis System: Site- Directed Mutagenesis Using Altered Beta-Lactamase Specificity
Promega Notes Magazine Number 62, 1997, p. 02 The GeneEditor TM in vitro Mutagenesis System: Site- Directed Mutagenesis Using Altered Beta-Lactamase Specificity By Christine Andrews and Scott Lesley Promega
More informationGenetics and Genomics in Medicine Chapter 3. Questions & Answers
Genetics and Genomics in Medicine Chapter 3 Multiple Choice Questions Questions & Answers Question 3.1 Which of the following statements, if any, is false? a) Amplifying DNA means making many identical
More informationPredicting Out-of-Sequence Reassembly in DNA Shuffling
J. theor. Biol. (2002) 219, 9 17 doi:10.1006/yjtbi.3102, available online at http://www.idealibrary.comon Predicting Out-of-Sequence Reassembly in DNA Shuffling Gregory L. Moorew and Costas D. Maranas
More informationProf. Pete Heinzelman University of Oklahoma. Engineering Cellulase Families for Biomass-to-Biofuel Conversion Processes
Prof. Pete Heinzelman University of Oklahoma Engineering Cellulase Families for Biomass-to-Biofuel Conversion Processes United States Fossil Fuel Dependence 2010 United States gasoline consumption - 140
More informationThe ability to predict a protein s tolerance to amino acid
Thermodynamic prediction of protein neutrality Jesse D. Bloom*, Jonathan J. Silberg, Claus O. Wilke, D. Allan Drummond, Christoph Adami, and Frances H. Arnold* *Division of Chemistry and Chemical Engineering
More informationAn overhang-based DNA block shuffling method for creating a customized random library
An overhang-based DNA block shuffling method for creating a customized random library Kosuke Fujishima, Chris Venter, Kendrick Wang, Raphael Ferreira and Lynn J. Rothschild Supplementary figures and text:
More informationBiotechnology. Chapter 20. Biology Eighth Edition Neil Campbell and Jane Reece. PowerPoint Lecture Presentations for
Chapter 20 Biotechnology PowerPoint Lecture Presentations for Biology Eighth Edition Neil Campbell and Jane Reece Lectures by Chris Romero, updated by Erin Barley with contributions from Joan Sharp Copyright
More informationThis Technical Note describes these characterization studies in more detail.
Technical Note Characterization of SOMAmer Reagents Binding Specificity in the SOMAscan 1.3k Assay Summary Slow Off-Rate Modified Aptamer (SOMAmer) reagents are identified via the SELEX process against
More informationOverview: The DNA Toolbox
Overview: The DNA Toolbox Sequencing of the genomes of more than 7,000 species was under way in 2010 DNA sequencing has depended on advances in technology, starting with making recombinant DNA In recombinant
More informationSynthetic Biology II. February 24, 2016
Synthetic Biology II February 24, 2016 RECAP: Building Synthetic Gene Networks and Systems Construction of small gene networks from well-characterized biological parts, guided by models Bistable Toggle
More informationProtein design. CS/CME/BioE/Biophys/BMI 279 Oct. 24, 2017 Ron Dror
Protein design CS/CME/BioE/Biophys/BMI 279 Oct. 24, 2017 Ron Dror 1 Outline Why design proteins? Overall approach: Simplifying the protein design problem < this step is really key! Protein design methodology
More informationBasic Local Alignment Search Tool
14.06.2010 Table of contents 1 History History 2 global local 3 Score functions Score matrices 4 5 Comparison to FASTA References of BLAST History the program was designed by Stephen W. Altschul, Warren
More informationProtein Synthesis: From Gene RNA Protein Trait
Protein Synthesis: From Gene RNA Protein Trait Human Genome The human genome contains about genes. Each gene is a of DNA (sequence of nitrogen bases) contained within each chromosome. Each chromosome contains
More informationProblem Set 8. Answer Key
MCB 102 University of California, Berkeley August 11, 2009 Isabelle Philipp Online Document Problem Set 8 Answer Key 1. The Genetic Code (a) Are all amino acids encoded by the same number of codons? no
More information7.1 Techniques for Producing and Analyzing DNA. SBI4U Ms. Ho-Lau
7.1 Techniques for Producing and Analyzing DNA SBI4U Ms. Ho-Lau What is Biotechnology? From Merriam-Webster: the manipulation of living organisms or their components to produce useful usually commercial
More informationMolecular Modeling Lecture 8. Local structure Database search Multiple alignment Automated homology modeling
Molecular Modeling 2018 -- Lecture 8 Local structure Database search Multiple alignment Automated homology modeling An exception to the no-insertions-in-helix rule Actual structures (myosin)! prolines
More informationCSE : Computational Issues in Molecular Biology. Lecture 19. Spring 2004
CSE 397-497: Computational Issues in Molecular Biology Lecture 19 Spring 2004-1- Protein structure Primary structure of protein is determined by number and order of amino acids within polypeptide chain.
More informationAppendix A DNA and PCR in detail DNA: A Detailed Look
Appendix A DNA and PCR in detail DNA: A Detailed Look A DNA molecule is a long polymer consisting of four different components called nucleotides. It is the various combinations of these four bases or
More informationAb Initio SERVER PROTOTYPE FOR PREDICTION OF PHOSPHORYLATION SITES IN PROTEINS*
COMPUTATIONAL METHODS IN SCIENCE AND TECHNOLOGY 9(1-2) 93-100 (2003/2004) Ab Initio SERVER PROTOTYPE FOR PREDICTION OF PHOSPHORYLATION SITES IN PROTEINS* DARIUSZ PLEWCZYNSKI AND LESZEK RYCHLEWSKI BiolnfoBank
More informationHonors Biology Reading Guide Chapter 10 v Fredrick Griffith Ø When he killed bacteria and then mixed the bacteria remains with living harmless
Honors Biology Reading Guide Chapter 10 v Fredrick Griffith Ø When he killed bacteria and then mixed the bacteria remains with living harmless bacteria some living bacteria cells converted to disease causing
More informationReports Phosphorothioate-based DNA recombination: an enzyme-free method for the combinatorial assembly of multiple DNA fragments
Reports Phosphorothioate-based DNA recombination: an enzyme-free method for the combinatorial assembly of multiple DNA fragments Jan Marienhagen, Alexander Dennig, and Ulrich Schwaneberg Department of
More informationProtein Structure Prediction. christian studer , EPFL
Protein Structure Prediction christian studer 17.11.2004, EPFL Content Definition of the problem Possible approaches DSSP / PSI-BLAST Generalization Results Definition of the problem Massive amounts of
More information4/10/2011. Rosetta software package. Rosetta.. Conformational sampling and scoring of models in Rosetta.
Rosetta.. Ph.D. Thomas M. Frimurer Novo Nordisk Foundation Center for Potein Reseach Center for Basic Metabilic Research Breif introduction to Rosetta Rosetta docking example Rosetta software package Breif
More informationEvolutionary Methods in B iotechnology
Evolutionary Methods in B iotechnology Clever Tricks for Directed Evolution Susanne Brakmann I Andreas Schwienhorst 1 Introduction Susanne Brakmann and Andreas Schwienhorst 1 References 3 2 Generation
More informationHow Proteins Adapt: Lessons from Directed Evolution
How Proteins Adapt: Lessons from Directed Evolution F.H. ARNOLD Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125 Correspondence: frances@cheme.caltech.edu
More informationproduces an RNA copy of the coding region of a gene
1. Transcription Gene Expression The expression of a gene into a protein occurs by: 1) Transcription of a gene into RNA produces an RNA copy of the coding region of a gene the RNA transcript may be the
More informationGenetic Algorithms For Protein Threading
From: ISMB-98 Proceedings. Copyright 1998, AAAI (www.aaai.org). All rights reserved. Genetic Algorithms For Protein Threading Jacqueline Yadgari #, Amihood Amir #, Ron Unger* # Department of Mathematics
More informationProtein-Protein Interactions II
Biochemistry 412 Protein-Protein Interactions II March 28, 2008 Delano (2002) Curr. Opin. Struct. Biol. 12, 14. Some ways that mutations can destabilize protein-protein interactions Delano (2002) Curr.
More informationCS 262 Lecture 14 Notes Human Genome Diversity, Coalescence and Haplotypes
CS 262 Lecture 14 Notes Human Genome Diversity, Coalescence and Haplotypes Coalescence Scribe: Alex Wells 2/18/16 Whenever you observe two sequences that are similar, there is actually a single individual
More information7 Gene Isolation and Analysis of Multiple
Genetic Techniques for Biological Research Corinne A. Michels Copyright q 2002 John Wiley & Sons, Ltd ISBNs: 0-471-89921-6 (Hardback); 0-470-84662-3 (Electronic) 7 Gene Isolation and Analysis of Multiple
More informationGene Mutation, DNA Repair, and Transposition
Gene Mutation, DNA Repair, and Transposition Mutations Are Classified in Various Ways Spontaneous mutations happen naturally and randomly and are usually linked to normal biological or chemical processes
More informationNon-coding Function & Variation, MPRAs II. Mike White Bio /5/18
Non-coding Function & Variation, MPRAs II Mike White Bio 5488 3/5/18 MPRA Review Problem 1: Where does your CRE DNA come from? DNA synthesis Genomic fragments Targeted regulome capture Problem 2: How do
More informationPhenotype analysis: biological-biochemical analysis. Genotype analysis: molecular and physical analysis
1 Genetic Analysis Phenotype analysis: biological-biochemical analysis Behaviour under specific environmental conditions Behaviour of specific genetic configurations Behaviour of progeny in crosses - Genotype
More informationSuper Models. Deoxyribonucleic Acid (DNA) Molecular Model Kit. Copyright 2015 Ryler Enterprises, Inc. Recommended for ages 10-adult
Super Models Deoxyribonucleic Acid (DNA) Molecular Model Kit Copyright 2015 Ryler Enterprises, Inc. Recommended for ages 10-adult! Caution: Atom centers and vinyl tubing are a choking hazard. Do not eat
More informationBiotechnology. Chapter 20. Biology Eighth Edition Neil Campbell and Jane Reece. PowerPoint Lecture Presentations for
Chapter 20 Biotechnology PowerPoint Lecture Presentations for Biology Eighth Edition Neil Campbell and Jane Reece Lectures by Chris Romero, updated by Erin Barley with contributions from Joan Sharp Copyright
More informationMatch the Hash Scores
Sort the hash scores of the database sequence February 22, 2001 1 Match the Hash Scores February 22, 2001 2 Lookup method for finding an alignment position 1 2 3 4 5 6 7 8 9 10 11 protein 1 n c s p t a.....
More informationBiochemistry 661 Your Name: Nucleic Acids, Module I Prof. Jason Kahn Exam I (100 points total) September 23, 2010
Biochemistry 661 Nucleic Acids, Module I Your Name: Prof. Jason Kahn Exam I (100 points total) September 23, 2010 You have 60 minutes for this exam. Exams written in pencil or erasable ink will not be
More informationHow Do You Clone a Gene?
S-20 Edvo-Kit #S-20 How Do You Clone a Gene? Experiment Objective: The objective of this experiment is to gain an understanding of the structure of DNA, a genetically engineered clone, and how genes are
More informationDNA Model Stations. For the following activity, you will use the following DNA sequence.
Name: DNA Model Stations DNA Replication In this lesson, you will learn how a copy of DNA is replicated for each cell. You will model a 2D representation of DNA replication using the foam nucleotide pieces.
More informationA Possible Mechanism of DNA to DNA Transcription in Eukaryotic Cells : Endonuclease Dependent Transcript Cutout
A Possible Mechanism of DNA to DNA Transcription in Eukaryotic Cells : Endonuclease Dependent Transcript Cutout Gao-De Li* * Corresponding author Chinese Acupuncture Clinic, Liverpool, UK Email: gaode_li@yahoo.co.uk
More informationProblem Set 2B Name and Lab Section:
Problem Set 2B 9-26-06 Name and Lab Section: 1. Define each of the following rearrangements (mutations) (use one phrase or sentence for each). Then describe what kind of chromosomal structure you might
More informationFactors affecting PCR
Lec. 11 Dr. Ahmed K. Ali Factors affecting PCR The sequences of the primers are critical to the success of the experiment, as are the precise temperatures used in the heating and cooling stages of the
More informationPhenotype analysis: biological-biochemical analysis. Genotype analysis: molecular and physical analysis
1 Genetic Analysis Phenotype analysis: biological-biochemical analysis Behaviour under specific environmental conditions Behaviour of specific genetic configurations Behaviour of progeny in crosses - Genotype
More informationDiscovery and Humanization of Novel High Affinity Neutralizing Monoclonal Antibodies to Human IL-17A
Discovery and Humanization of Novel High Affinity Neutralizing Monoclonal Antibodies to Human IL-17A Contacts: Marty Simonetti martysimonetti@gmail.com Kirby Alton kirby.alton@abeomecorp.com Rick Shimkets
More informationBio11 Announcements. Ch 21: DNA Biology and Technology. DNA Functions. DNA and RNA Structure. How do DNA and RNA differ? What are genes?
Bio11 Announcements TODAY Genetics (review) and quiz (CP #4) Structure and function of DNA Extra credit due today Next week in lab: Case study presentations Following week: Lab Quiz 2 Ch 21: DNA Biology
More informationProtein design. CS/CME/BioE/Biophys/BMI 279 Oct. 24, 2017 Ron Dror
Protein design CS/CME/BioE/Biophys/BMI 279 Oct. 24, 2017 Ron Dror 1 Outline Why design proteins? Overall approach: Simplifying the protein design problem Protein design methodology Designing the backbone
More informationAn Evolutionary Optimization for Multiple Sequence Alignment
195 An Evolutionary Optimization for Multiple Sequence Alignment 1 K. Lohitha Lakshmi, 2 P. Rajesh 1 M tech Scholar Department of Computer Science, VVIT Nambur, Guntur,A.P. 2 Assistant Prof Department
More informationA Possible Mechanism of DNA to DNA Transcription in Eukaryotic Cells : Endonuclease Dependent Transcript Cutout
A Possible Mechanism of DNA to DNA Transcription in Eukaryotic Cells : Endonuclease Dependent Transcript Cutout Gao-De Li* * Corresponding author Chinese Acupuncture Clinic, Liverpool, UK Email: gaode_li@yahoo.co.uk
More informationEncycloPCRAmplificationKit 1 Mint-2cDNA SynthesisKit 2 DuplexSpecificNuclease 4 Trimmer-2cDNA NormalizationKit 6 CombinedMint-2/Trimmer-2Package
Contents Page EncycloPCRAmplificationKit 1 Mint-2cDNA SynthesisKit 2 DuplexSpecificNuclease 4 Trimmer-2cDNA NormalizationKit 6 CombinedMint-2/Trimmer-2Package References 8 Schematic outline of Mint cdna
More informationNon-coding Function & Variation, MPRAs. Mike White Bio5488 3/5/18
Non-coding Function & Variation, MPRAs Mike White Bio5488 3/5/18 Outline MONDAY Non-coding function and variation The barcode Basic versions of MRPA technology WEDNESDAY More varieties of MRPAs Some key
More informationDNA Structure and Replication, and Virus Structure and Replication Test Review
DNA Structure and Replication, and Virus Structure and Replication Test Review What does DNA stand for? Deoxyribonucleic Acid DNA is what type of macromolecule? DNA is a nucleic acid The building blocks
More informationWhat is DNA??? DNA = Deoxyribonucleic acid IT is a molecule that contains the code for an organism s growth and function
Review DNA and RNA 1) DNA and RNA are important organic compounds found in cells, called nucleic acids 2) Both DNA and RNA molecules contain the following chemical elements: carbon, hydrogen, oxygen, nitrogen
More informationPV92 PCR Bio Informatics
Purpose of PCR Chromosome 16 PV92 PV92 PCR Bio Informatics Alu insert, PV92 locus, chromosome 16 Introduce the polymerase chain reaction (PCR) technique Apply PCR to population genetics Directly measure
More informationProtein design. CS/CME/Biophys/BMI 279 Oct. 20 and 22, 2015 Ron Dror
Protein design CS/CME/Biophys/BMI 279 Oct. 20 and 22, 2015 Ron Dror 1 Optional reading on course website From cs279.stanford.edu These reading materials are optional. They are intended to (1) help answer
More informationDNA. translation. base pairing rules for DNA Replication. thymine. cytosine. amino acids. The building blocks of proteins are?
2 strands, has the 5-carbon sugar deoxyribose, and has the nitrogen base Thymine. The actual process of assembling the proteins on the ribosome is called? DNA translation Adenine pairs with Thymine, Thymine
More informationNucleic Acids. By Sarah, Zach, Joanne, and Dean
Nucleic Acids By Sarah, Zach, Joanne, and Dean Basic Functions Carry genetic information (DNA storing it) Protein synthesis Helps in cell division (DNA replicates itself) RNA- numerous functions during
More informationTHE SYNTHESIS OF LIVING MATTER: ENERGY AND INFORMATION
THE SYNTHESIS OF LIVING MATTER: ENERGY AND INFORMATION Lectures 10/16, 10/21. Respiration Make a "black box" diagram showing the inputs and outputs of the three main reaction blocks in aerobic respiration;
More informationLieberman-Aiden et al. (2009) Comprehensive Mapping of Long-Range Interactions Reveals Folding Principles of the Human Genome. Science 326:
Lieberman-Aiden et al. (2009) Comprehensive Mapping of Long-Range Interactions Reveals Folding Principles of the Human Genome. Science 326: 289-293. : Understanding the 3D conformation of the genome can
More informationSingle Nucleotide Variant Analysis. H3ABioNet May 14, 2014
Single Nucleotide Variant Analysis H3ABioNet May 14, 2014 Outline What are SNPs and SNVs? How do we identify them? How do we call them? SAMTools GATK VCF File Format Let s call variants! Single Nucleotide
More information