Solution Structure of the DNA-binding Domain of GAL4 from Saccharomyces cerevisiae

Similar documents
Chapter 14 Regulation of Transcription

STRUCTURE, DYNAMICS AND INTERACTIONS OF PROTEINS BY NMR SPECTROSCOPY

All Rights Reserved. U.S. Patents 6,471,520B1; 5,498,190; 5,916, North Market Street, Suite CC130A, Milwaukee, WI 53202

Gene and DNA structure. Dr Saeb Aliwaini

Chapter 8 DNA Recognition in Prokaryotes by Helix-Turn-Helix Motifs

Enhancers. Activators and repressors of transcription

Not The Real Exam Just for Fun! Chemistry 391 Fall 2018

RNA does not adopt the classic B-DNA helix conformation when it forms a self-complementary double helix

The mechanism(s) of protein folding. What is meant by mechanism. Experimental approaches

Structure/function relationship in DNA-binding proteins

Structural Bioinformatics (C3210) DNA and RNA Structure

Drug DNA interaction. Modeling DNA ligand interaction of intercalating ligands

BIOLOGY 200 Molecular Biology Students registered for the 9:30AM lecture should NOT attend the 4:30PM lecture.

Hmwk # 8 : DNA-Binding Proteins : Part II

Chapter 1 -- Life. Chapter 2 -- Atoms, Molecules and Bonds. Chapter 3 -- Water

What Are the Chemical Structures and Functions of Nucleic Acids?

DNA & DNA : Protein Interactions BIBC 100

CSE : Computational Issues in Molecular Biology. Lecture 19. Spring 2004

G. DNA-BINDING PROTEINS & REGULATION

3.1 Storing Information. Polypeptides. Protein Structure. Helical Secondary Structure. Types of Protein Structure. Molecular Biology Fourth Edition

Modularity of eukaryotic transcription activators

Comparative Modeling Part 1. Jaroslaw Pillardy Computational Biology Service Unit Cornell Theory Center

BIRKBECK COLLEGE (University of London)

Transcription factors

Bi 8 Lecture 7. Ellen Rothenberg 26 January Reading: Ch. 3, pp ; panel 3-1

Understanding DNA Structure

Fig Ch 17: From Gene to Protein

Supplementary Materials for

Reading for lecture 2

MBioS 503: Section 1 Chromosome, Gene, Translation, & Transcription. Gene Organization. Genome. Objectives: Gene Organization

Bioinformatics. ONE Introduction to Biology. Sami Khuri Department of Computer Science San José State University Biology/CS 123A Fall 2012

RNA Expression of the information in a gene generally involves production of an RNA molecule transcribed from a DNA template. RNA differs from DNA

BIOCHEMISTRY REVIEW. Overview of Biomolecules. Chapter 10 Nucleic Acids

Quick Review of Protein Synthesis

Chapter 9. Weaver, 2/e. Mol Biol X107A.

NMR Structural Profiling of Transcriptional Intermediates Reveals Riboswitch Regulation by Metastable RNA Conformations.

PHF20 is an effector protein of p53 double lysine methylation

DNA Structures. Biochemistry 201 Molecular Biology January 5, 2000 Doug Brutlag. The Structural Conformations of DNA

Molecular Biology (1)

Biochemistry 674: Nucleic Acids Your Name:

Basic concepts of molecular biology

BIOLOGY 111. CHAPTER 6: DNA: The Molecule of Life

NUCLEIC ACID. Subtitle

Chapter 8. Microbial Genetics. Lectures prepared by Christine L. Case. Copyright 2010 Pearson Education, Inc.

Proteins Amides from Amino Acids

CELL BIOLOGY: DNA. Generalized nucleotide structure: NUCLEOTIDES: Each nucleotide monomer is made up of three linked molecules:

Calcium binding proteins in malaria and heart disease

What is necessary for life?

DNA Glycosylase Exercise

Chapter 25: Regulating Eukaryotic Transcription The Ligand Responsive Activators

Biochemistry 111. Carl Parker x A Braun

DNA Structure and Analysis. Chapter 4: Background

Sequence Determinants of a Conformational Switch in a Protein

Non-standard base pairs Non-standard base pairs play critical roles in the varied structures observed in DNA and RNA.

Plant Molecular and Cellular Biology Lecture 9: Nuclear Genome Organization: Chromosome Structure, Chromatin, DNA Packaging, Mitosis Gary Peter

Protein isotopic enrichment for NMR studies

The Structure of Proteins The Structure of Proteins. How Proteins are Made: Genetic Transcription, Translation, and Regulation

Dina Al-Tamimi. Faisal Nimri. Ma amoun Ahram. 1 P a g e

DNA and RNA are both made of nucleotides. Proteins are made of amino acids. Transcription can be reversed but translation cannot.

BETA STRAND Prof. Alejandro Hochkoeppler Department of Pharmaceutical Sciences and Biotechnology University of Bologna

Nucleic acids. How DNA works. DNA RNA Protein. DNA (deoxyribonucleic acid) RNA (ribonucleic acid) Central Dogma of Molecular Biology

(Due Sept 9 th ) Problem Set 2

Unit 1: DNA and the Genome. Sub-Topic (1.3) Gene Expression

Your Name: MID TERM ANSWER SHEET SIN: ( )

The nucleic acids are deoxyribonucleic acid (DNA) and ribonucleic acid (RNA).

BASIC MOLECULAR GENETIC MECHANISMS Introduction:

Chapter 17 Lecture. Concepts of Genetics. Tenth Edition. Regulation of Gene Expression in Eukaryotes

Find this material useful? You can help our team to keep this site up and bring you even more content consider donating via the link on our site.

Multiple choice questions (numbers in brackets indicate the number of correct answers)

Chapter Fundamental Molecular Genetic Mechanisms

Gene regulation II Biochemistry 302. Bob Kelm March 1, 2004

Introduction to Cellular Biology and Bioinformatics. Farzaneh Salari

MBMB451A Section1 Fall 2008 KEY These questions may have more than one correct answer

Notes: (Our Friend) DNA. DNA Structure DNA is composed of 2 chains of repeating. A nucleotide = + +

Chapter 16 The Molecular Basis of Inheritance

NMR Resonance Assignment of the Therapeutic RNA Aptamer Macugen in Complex with its in Vivo Target, the Heparin Binding Domain of VEGF165

Roadmap. The Cell. Introduction to Molecular Biology. DNA RNA Protein Central dogma Genetic code Gene structure Human Genome

DNA. Branden & Tooze, Ch. 7 Deoxyribose nucleic acids are made of three parts

Structure of nucleic acids II Biochemistry 302. January 20, 2006

Protein Folding Problem I400: Introduction to Bioinformatics

Gene expression DNA RNA. Protein DNA. Replication. Initiation Elongation Processing Export. DNA RNA Protein. Transcription. Degradation.

Chapter 17. From Gene to Protein. Slide 1. Slide 2. Slide 3. Gene Expression. Which of the following is the best example of gene expression? Why?

Structural Analysis of the EGR Family of Transcription Factors: Templates for Predicting Protein DNA Interactions

8/21/2014. From Gene to Protein

Wilkins Franklin s photo below proved model on left to be correct for DNA

Alpha-helices, beta-sheets and U-turns within a protein are stabilized by (hint: two words).

Computational Methods for Protein Structure Prediction and Fold Recognition... 1 I. Cymerman, M. Feder, M. PawŁowski, M.A. Kurowski, J.M.

Nanobiotechnology. Place: IOP 1 st Meeting Room Time: 9:30-12:00. Reference: Review Papers. Grade: 50% midterm, 50% final.

BIOCHEMISTRY Nucleic Acids

Figure A summary of spontaneous alterations likely to require DNA repair.

Macromolecule Review

YOUR NAME: KEY. PLEASE PRINT your name (IN INDELIBLE INK) on the line above (& on the top right hand corner of every page).

Key Area 1.3: Gene Expression

Regulation of gene expression. (Lehninger pg )

Chapter 5: Introduction to the Nucleic Acids

From mechanism to medicne

Molecular Genetics. The flow of genetic information from DNA. DNA Replication. Two kinds of nucleic acids in cells: DNA and RNA.

What is necessary for life?

Make the protein through the genetic dogma process.

Transcription is the first stage of gene expression

Transcription:

Vol. 14, No. 1-4 175 Solution Structure of the DNA-binding Domain of GAL4 from Saccharomyces cerevisiae James D. Baleja, V. Thanabal, Ted Mau, and Gerhard Wagner Department of Biological Chemistry and Molecular Pharmacology Harvard Medical School Boston, Massachusetts 02115 1 Introduction The GAL4 transcriptional activator protein has long been a favorite for the study of transcription in eukaryotic biology. Genetic studies reveal a modular architecture for the protein with different functions associated with each module [1]. A DNA-binding domain of the protein recognizes and binds to a sequence of DNA termed the Upstream Activating Sequence (UASQ). Other parts of the protein are relevant for activation. They interact with the transcriptional machinery including RNA polymerase to activate transcription. The U A S Q is near the genes that encode the proteins required for galactose utilization. Upon presentation of galactose to the yeast cell, this DNA site is specifically bound by GAL4, the transcription function of RNA polymerase is activated, and enzymes required for galactose utilization are produced [2]. As a dimer of 881 amino acids, GAL4 is too large for the determination of a high-resolution NMR structure and we have instead studied a fragment containing the N- terminal 65 amino acid residues including its DNA-binding domain [3]. In the absence of DNA, GAL4(65) is monomeric in solution, presumably because it does not have the amino acid residues necessary for dimerization [4]. GAL4(65) is dimeric when bound to any of four DNA sites present in the UAS [5]. Each of these binding sites is approximately twofold palindromic, and like many other dimeric DNA-binding proteins, each monomer of the GAL4 dimer interacts with a half-site of DNA. The structure and dynamics of the monomeric DNA-binding domain of GAL4 (residues 1-65; Figure 1), an investigation of the binding to a DNA half-site (Figure 2), and the structure of the resultant protein-dna complex are presented in this paper. 1 11 21 MKLLSSIEQA CDICRLKKLK CSKEKPKCAK 31 41 51 CLKNNWECRY SPKTKRSPLT RAHLTEVESR 61 LERLE Figure 1. Primary amino acid sequence for the DNA-binding domain of GAL4.

176 BuJietin of Magnetic Resonance 1 2 3 4 5 6 7 3 9 10 C C G G A G G A C T G G C C T C C T G A 20 19 18 17 16 15 14 13 12 11 Figure 2. Nucleotide sequence for the one-half DNA-binding site of GAL4. 2 1H, 15N, and n^cd NMR resonance assignments Assignment of specific nuclei to the observed NMR resonance frequencies is the first step in determining the structure of a protein using NMR techniques [6]. Assignments for the 1 H, ^$N, and 113 Cd NMR resonances of GAL4(65) were made using homonuclear and hetero-nuclear NMR experiments and by following standard protocol ([6], Baleja and Wagner, unpublished results). Zinc is required for the DNAbinding activity of GAL4 [4]. It can be replaced by NMR-active 113 Cadmium without loss of DNA-binding. The 113 C d NMR spectrum (Figure 3) shows that two metal ions are coordinated by a Cys - (X)2 - Cys - (X>6 - Cys - (X) 6 - Cys - (X) 2 - Cys - (X) 6 - Cys motif using six cysteines and forming a bimetal-thiolate cluster [7]. Hetero-nuclear correlation experiments between the l* 3 Cd and *H define the liganding of the two central metal ions (Figure 3). Having assigned resonance frequencies to specific nuclei, the correspondence of cross peaks to the protons can be made and local structural information was can then be determined. A section of the twodimensional NOESY spectrum is shown in Figure 4. The region shows the cross peaks among the amide protons of the protein backbone and protons of the aromatic sidechains. Figure 3. Heteronuclear ^ C d H correlation experiments. The standard reverse INEPT pulse sequence [8] was followed by a short MLEV-17 TOCSY transfer [9] and proton detection./ Indicating the presence of two a- helices, there are a series of close approaches between amide proton resonance frequencies for several sequential residues. Other NOEs arise between amino acid residues more distant in sequence and define the unique topological features for the protein. NOE intensities at 56, 150, and 250, and 500 millisecond mixing times were converted to the corresponding interproton distances using the usual inverse r 6 relationship [3]. In case of overlap in the homonuclear NOESY, spectrum analysis was also made using a three-dimensional ^ J S J. I J J. I H NOESY-HMQC spectrum recorded on a uniformly ^ N - l a b e l e d protein (Figure 5, [10]). From the NOESY data,

Vol. 14, No. 1-4 177 residues 9-40 are observed to form a well-defined, compact globular cluster. The terminal residues 1-8 and 41-66 show little persistent structure in solution. There are many interresidue NOE crosspeaks for the central core (30 per residue, on average), but only a few weak NOE cross peaks for protons in the disordered region. In addition, the amide protons of the flexible trails are in rapid exchange with solvent [10] and have narrow resonance lines, indicating that these residues have considerable conformational mobility in the absence of DNA. This unstructured character for parts of the GAL4 protein may be vital to its biological function. On the other hand, at least some aspects of this mobility may merely be a consequence of the truncation used for GAL4 in this study. Nonetheless, our picture of GAL4 is one in which two N-terminal recognition modules are connected by flexible linkers to a dimeric core [4], 9.0 8.5 8.0 7.5 7.0 6)2 (ppm) 6.5 6.0 Figure 4. NOESY spectrum of amide and aromatic protons of GAL4(65). Sample concentration was 1.5 mm in 0.2 M NaCl, 20 mm sodium phosphate, ph 7.0 at 25 C. Numbers indicate the residues for which sequential strong amide to amide cross peaks are observed which are indicative of a- helices. 9.0 9.5 - V o K23 o J28 R15 D1i! L1 O E37 OC11 S41 DS22 >C21 C38, K&O BN358 O Q9E K18 M35 T»K27, 113 C31< OC14 o i R39 10.0 9.5 9.0 8.5 1 8.0 7.5 (ppm) 0N356 N34.Q9 HI345 7.0 6.5 N346 6.0 120 15 N (ppm) 130 1 Figure 5. Heteronuclear single quantum coherence 5 N-*H spectra [11] of uniformly 15 N-labeled GAL4(65). The sample concentration was 1.5 mm in a -. buffer of 0.1 M NaCl, 20 mm phosphate, ph 7.18.

178 Bulletin of Magnetic Resonance 3 Structure of the GAL4 DNA-binding domain Sets of interproton distances and torsion angles formed the basis for structure determination. Distances were determined from the NOESY data [3]. torsion angles were interpreted from measured NH-oc coupling constants and x 1 angles were derived from ^N-j} and a-(5 coupling constants [3]. Sulfur-Cd liganding distances were imposed to be 2.35 to 2.45 A, Cys Cp-Cd distances to be less than 3.4 A, and sulphurs liganding the same Cd to be between 3.3 and 4.2 A. 614 distance and 41 torsion angle measurements were used with the distance geometry package DG-II to generate a set of structures (0.6 A backbone atom rmsd) in agreement with the NMR data [3]. A schematic of the structure (Figure 6) shows the two central metal ions coordinated by the six cysteines. The DNA-binding domain consists of an a-helix and an extended structure, then a sharp turn that contains a cis-proline bond, and then a 2nd a helix followed by an extended structure. If the central metal binding subdomain of GAL4 is split, and the two halves superimposed, a striking correspondence between each part is revealed. The conformation of the 13 residue segment from residues 10 to 22 is almost identical to that of residues 27 to 39, with an rmsd of 0.8 A for the backbone atoms. Although the structural integrity of the protein would be provided mainly by the way in which the two metal ions are liganded, some hydrophobic packing is observed with the side chains of W36 and Y40 [3]. Slowly exchanging amides of GAL4, in both the free and DNA-bound forms [10], can be attributed, in part, to hydrogen bonding to carbonyl oxygens within the a-helices, and to sulfurs of the cysteinyl side-chains [12]. Figure 6. Structure of the central core for the DNA-binding domain of GAL4. Cysteines that ligand the two central metal ions are numbered. 4 Formation of a GAL4-DNA complex Imino protons of one half of the consensus dimeric GAL4 DNAbinding were monitored in forming the complex between the protein and DNA (Figure 7). One imino proton is present for each base-pair of a 10 base-pair DNA duplex. _JV 13.8 13.6 13.4 13.2 13.0 12.8 12.6 ppm Figure 7. Titration of DNA with GAL4. Experimentally, eight imino proton resonances are observed, since the imino protons on the base-pairs at each end of the duplex exchange rapidly with the bulk solvent at room temperature. Each imino resonance has been assigned to a specific base-

Vol. 14, No. 1-4 179 pair. Resonances shift as the environment around each proton changes upon addition of GAL4. All resonances broaden as the more slowly protein-dna complex is formed. The equilibrium dissociation constant is 169± 13 J.M. Resonances are in fast exchange between the protein-dna complex and the free components indicating rapid equilibrium between free and bound forms. NOESY and TOCSY two-dimensional data for the protein-dna complex have been collected (Figure 8). These spectra are promising for full analysis since the resonance lines are not too broad for resonance assignment, chemical shift dispersion is good, and solubility of the protein-dna complex is adequate. The DNA resonances have been completely re-assigned and the protein resonances have been partially reassigned within the protein-dna complex. 14.0 12.0 10.0 8.0 6.0 4.0 2.0 F2 (ppm ) Figure 8. NOESY spectrum of the GAL4:DNA complex. The concentration of the complex was approximately 0.5 mm. Buffer conditions were 0.15 M NaCl, 20 mm PO 4, ph 7, 25 C. 5 NMR structure of GAL4-DNA complex the Several NOE contacts have been observed between protein and DNA (dashed lines, Figure 9) yielding a preliminary stucture for this GAL4- DNA complex. The recognition helix for the free form of the protein was docked onto a B-DNA. The amino acid residues responsible for recognizing DNA are part of the metal-binding cluster and interact with edges of 'base-pairs exposed in the major groove of the DNA. The structure of the DNA-binding domain of GAL4 (residues 1-65) bound to full site DNA containing two GAL4 binding sites has been determined crystallographically. In agreement with our observed intermolecular NOE cross peaks (Figure 8), the C- terminal end of the first a-helix of the metal binding cluster (residues 9-40) provides for sequence-specific re-cognition of DNA by GAL4. When bound to full site DNA, parts of GAL4(65), before unstructured in solution, adopt a regular conformation. Residues 41 to 49 form a linker region which interacts with

180 Bulletin of Magnetic Resonance the backbone of the DNA. In addition, residues 50-64 form a small dimerization domain using a coiled-coil type packing arrangement [4]. Our preliminary results on an intact dimerization element for GAL4 (residues 50-106) indicate that the a- helical character for the coiled-coil of the protein extends beyond residue 64 (Baleja, Marmorstein, and Wagner, unpublished results). The same conformation (1.1 A rmsd) is observed for the recognition module of GAL4(65) in solution using NMR techniques as for the central metalbinding cluster of GAL4(65) bound to DNA using crystallographic techniques. Thus the core of the DNAbinding domain changes little upon binding DNA. Figure 9. Preliminary structure for a GAL4-DNA complex. 6 Summary The DNA-binding domain of GAL4 has many interesting features. It is a novel DNA-binding motif with a globular two-metal cluster that has hydrogen-bonds to sulfur and twofold internal symmetry. GAL4 reads the sequence of DNA through the amino acids present at the C-terminal end of the first a-helix. The two DNAreading modules of intact GAL4 are tethered by flexible linkers to the central body of the protein. Once bound to DNA through the N- terminal recognition domains, the C- terminal portion of GAL4 is in the correct position to interact with the components of the transcriptional machinery to bring about transcriptional activation. 7 References 1. Keegan, L., Gill, G., and Ptashne, M., Science 231, 699 (1986). 2. Johnston, M., Micro. Rev. 51, 458 (1987). 3. Baleja, J. D., Marmorstein, R., Harrison, S. C, and Wagner G., Nature 356, 448 (1992). 4. Marmorstein, R., Carey, M., Ptashne, M., and Harrison, S. C, Nature 356, 408 (1992). 5. Carey, M., Kakidani, H., Leatherwood, J., Mostashari, F., and Ptashne, M., J. Mol. Biol. 209, 423 (1989). 6. Wuthrich, K. NMR of Proteins and Nucleic Acids (Wiley, New York, 1986). 7. Pan, T., and Coleman, J. E., Biochemistry 209, 3023 (1990). 8. Briihwiler, D., and Wagner, G., J. Magn. Reson. 69, 546 (1986). 9. Bax, A., and Davis, D. G., J. Magn. Reson. 65, 355 (1985). 10. Mau, T., Baleja, J. D., and Wagner, G., Protein Science (submitted). 11. Bodenhausen, G., and Ruben, D. J., Chem. Phys. Lett. 69, 185 (1980). 12. Kraulis, P., Raine, A. R. C Gadhavi, P. L., and Laue, E. D., Nature 356, 448 (1992).