Transcription Gene regulation

Similar documents
V10 Transcriptional Gene regulation networks

Complex Transcription Machinery

Molecular Biology (BIOL 4320) Exam #1 March 12, 2002

Enhancers. Activators and repressors of transcription

Chapter 14 Regulation of Transcription

Transcription is the first step of gene expression, in which a particular segment of DNA is copied into RNA by the enzyme, RNA polymerase.

Structure/function relationship in DNA-binding proteins

Regulation of gene expression. (Lehninger pg )

ChIP. November 21, 2017

Year III Pharm.D Dr. V. Chitra

DNA Function: Information Transmission

Announcement Structure Analysis

Synthetic cells: do bacteria need all its genes? No.

Biotechnology Unit 3: DNA to Proteins. From DNA to RNA

Genome-Wide Survey of MicroRNA - Transcription Factor Feed-Forward Regulatory Circuits in Human. Supporting Information

M1 - Biochemistry. Nucleic Acid Structure II/Transcription I

DNA Transcription. Dr Aliwaini

Transcription. DNA to RNA

Gene Regulation in Eukaryotes

Make the protein through the genetic dogma process.

Expression of the genome. Books: 1. Molecular biology of the gene: Watson et al 2. Genetics: Peter J. Russell

Lecture 11. Initiation of RNA Pol II transcription. Transcription Initiation Complex

Motivation From Protein to Gene

GENETICS - CLUTCH CH.10 TRANSCRIPTION.

I. Gene Expression Figure 1: Central Dogma of Molecular Biology

Selected Techniques Part I

Chapters 31-32: Ribonucleic Acid (RNA)

Chem 465 Biochemistry II

GENES AND CHROMOSOMES V. Lecture 7. Biology Department Concordia University. Dr. S. Azam BIOL 266/

Übung V. Einführung, Teil 1. Transktiptionelle Regulation TFBS

Multiple choice questions (numbers in brackets indicate the number of correct answers)

Eukaryotic & Prokaryotic Transcription. RNA polymerases

DNA Transcription. Visualizing Transcription. The Transcription Process

وراثة األحياء الدقيقة Microbial Genetics

Chapter 25: Regulating Eukaryotic Transcription The Ligand Responsive Activators

Transcription in Eukaryotes

Transcription factors

Chapter 17 Lecture. Concepts of Genetics. Tenth Edition. Regulation of Gene Expression in Eukaryotes

CHAPTERS , 17: Eukaryotic Genetics

TRANSCRIPTION COMPARISON OF DNA & RNA TRANSCRIPTION. Umm AL Qura University. Sugar Ribose Deoxyribose. Bases AUCG ATCG. Strand length Short Long

Transcription Eukaryotic Cells

DNA & DNA : Protein Interactions BIBC 100

Transcription is the first stage of gene expression

(c) 2014 Dr. Alice Heicklen & Dr. Deborah Mowshowitz, Columbia University, New York, NY. Last update 02/26/ :57 PM

Computational Biology I LSM5191 (2003/4)

TRANSCRIPTION AND PROCESSING OF RNA

Gene Expression Transcription/Translation Protein Synthesis

Gene Expression: Transcription, Translation, RNAs and the Genetic Code

Gene Identification in silico

BIO 311C Spring Lecture 36 Wednesday 28 Apr.

17.5 Eukaryotic Transcription Initiation Is Regulated by Transcription Factors That Bind to Cis-Acting Sites

Gene Expression Technology

CELL BIOLOGY - CLUTCH CH. 7 - GENE EXPRESSION.

Gene expression DNA RNA. Protein DNA. Replication. Initiation Elongation Processing Export. DNA RNA Protein. Transcription. Degradation.

BS 50 Genetics and Genomics Week of Oct 24

DNA. Is a molecule that encodes the genetic instructions used in the development and functioning of all known living organisms and many viruses.

Biology. Biology. Slide 1 of 39. End Show. Copyright Pearson Prentice Hall

Biology. Biology. Slide 1 of 39. End Show. Copyright Pearson Prentice Hall

UNIT 3 GENETICS LESSON #41: Transcription

Fermentation. Lesson Overview. Lesson Overview 13.1 RNA

6.047 / Computational Biology: Genomes, Networks, Evolution Fall 2008

Characterizing DNA binding sites high throughput approaches Biol4230 Tues, April 24, 2018 Bill Pearson Pinn 6-057

Reading for lecture 2

NOTES Gene Expression ACP Biology, NNHS

Bioinformatics: Microarray Technology. Assc.Prof. Chuchart Areejitranusorn AMS. KKU.

Introduction to Bioinformatics Online Course: IBT

The Genetic Code and Transcription. Chapter 12 Honors Genetics Ms. Susan Chabot

DNA Binding Domains: Structural Motifs. Effector Domain. Zinc Fingers. Zinc Fingers, continued. Zif268

Chapter 11. Transcription. The biochemistry and molecular biology department of CMU

RNA Expression of the information in a gene generally involves production of an RNA molecule transcribed from a DNA template. RNA differs from DNA

AP Biology Gene Expression/Biotechnology REVIEW

Transcription. The sugar molecule found in RNA is ribose, rather than the deoxyribose found in DNA.

Themes: RNA and RNA Processing. Messenger RNA (mrna) What is a gene? RNA is very versatile! RNA-RNA interactions are very important!

Chromatographic Separation of the three forms of RNA Polymerase II.

The Structure of Proteins The Structure of Proteins. How Proteins are Made: Genetic Transcription, Translation, and Regulation

Differential Gene Expression

Transcription in Prokaryotes. Jörg Bungert, PhD Phone:

nature methods A paired-end sequencing strategy to map the complex landscape of transcription initiation

Replication Review. 1. What is DNA Replication? 2. Where does DNA Replication take place in eukaryotic cells?

Section C: The Control of Gene Expression

Activation of a Floral Homeotic Gene in Arabidopsis

Differential Gene Expression

IB BIO I Replication/Transcription/Translation Van Roekel/Madden. Name Date Period. D. It separates DNA strands. (Total 1 mark)

The Little Things About the Little Things Inside of Us The Eukaryotic Genome and Its Expression

Fig Ch 17: From Gene to Protein

Differential Gene Expression

Eukaryotic Gene Expression Prof. P. N. Rangarajan Department of Biochemistry Indian Institute of Science, Bangalore

Chapter 8 Lecture Outline. Transcription, Translation, and Bioinformatics

RNA POLYMERASE FUNCTIONS E-BOOK

Genome-Scale Predictions of the Transcription Factor Binding Sites of Cys 2 His 2 Zinc Finger Proteins in Yeast June 17 th, 2005

Control of Eukaryotic Genes

The nucleic acids are deoxyribonucleic acid (DNA) and ribonucleic acid (RNA).

Control of Eukaryotic Genes. AP Biology

Introduction to Microarray Data Analysis and Gene Networks. Alvis Brazma European Bioinformatics Institute

Control of Eukaryotic Genes. AP Biology

Genomics and Gene Recognition Genes and Blue Genes

Practice Exam A. Briefly describe how IL-25 treatment might be able to help this responder subgroup of liver cancer patients.

Lecture Summary: Regulation of transcription. General mechanisms-what are the major regulatory points?

Chapter 3. DNA, RNA, and Protein Synthesis

BISHOPAgEd.Weebly.com. Weeks: Dates: 1/18-1/29 Unit: RNA &Protein Synthesis. Monday Tuesday Wednesday Thursday Friday. FFA Meeting 6pm 27 E

Transcription:

Transcription Gene regulation The machine that transcribes a gene is composed of perhaps 50 proteins, including RNA polymerase, the enzyme that converts DNA code into RNA code. A crew of transcription factors grabs hold of the DNA just above the gene at a site called the core promoter, while associated activators bind to enhancer regions farther upstream of the gene to rev up transcription. Working as a tightly knit machine, these proteins transcribe a single gene into messenger RNA. The messenger RNA winds its way out of the nucleus to the factories that produce proteins, where it serves as a blueprint for production of a specific protein. http://www.berkeley.edu/news/features/1999/12/09_nogales.html 3. Lecture WS 2004/05 Bioinformatics III 1 a

Transcription in E.coli and in Eucaryotes Procaryotes Eucaryotes Genes are grouped into operons Genes are not grouped in operons mrna may contain transcript of several genes (poly-cistronic) each mrna contains only transcript of a single gene (mono-cistronic) Transcription and translation are coupled. Transcript is translated already during transcription. Transcription and translation are NOT coupled. Transcription takes place in nucleus, translation in cytosol. Gene regulation takes place by modification of transcription rate Gene regulation via transcription rate AND by RNA-processing, RNA stability etc. 3. Lecture WS 2004/05 Bioinformatics III 2

Promoter prediction in E.coli To analyze E.coli promoters, one may align a set of promoter sequences by the position that marks the known transcription start site (TSS) and search for conserved regions in the sequences. E.coli promoters are found to contain 3 conserved sequence features - a region approximately 6 bp long with consensus TATAAT at position -10 - a region approximately 6 bp long with consensus TTGACA at position -35 - a distance between these 2 regions of ca. 17 bp that is relatively constant 3. Lecture WS 2004/05 Bioinformatics III 3 a

Gene regulatory promoter network In E.coli, 240 transcription factors have been verified that regulate 3000 genes. Binding site matrics are available for more than 55 E.coli TFs (Robison et al. 1998) In S. cerevisae, genome-wide binding analysis of 106 transcription factors indicates that more than one-third of the promoter regions that were bound by regulators were bound by 2 or more regulators. Highly connected network of transcriptional regulators. 3. Lecture WS 2004/05 Bioinformatics III 4

Feasibility of computational motif search? Computational identification of transcription factor binding sites is difficult because they consist of short, degenerate sequences that occur frequently by chance. The problem is not easy to define (therefore: it is complex ) because - the motif is of unknown size - the motif might not be well conserved between promoters - the sequences used to search for the motif do not necessarily represent the complete promoter - genes with promoters to be analyzed are in many cases grouped together by a clustering algorithm which has its own limitations. 3. Lecture WS 2004/05 Bioinformatics III 5

Strategy 1 Arrival of microarray gene-expression data. Group of genes with similar expression profile (e.g. those that are activated at the same time in the cell cycle) one may assume that this profile ist, at least partly, caused by and reflected in a similar structure of the regions involved in transcription regulation. Search for common motifs in < 1000 base upstream regions. Sofar used: detection of single motifs (representing transcription-factor binding sites) common to the promoter sequences of putatively co-regulated genes. Better: search for simultaneous occurrence of 2 or more sites at a given distance interval! Search becomes more sensitive. 3. Lecture WS 2004/05 Bioinformatics III 6

Motif identifaction A flowchart to illustrate the two different approaches for motif identification. We analyzed 800 bp upstream from the translation start sites of the five genes from the yeast gene family PHO by the publicly available systems MEME (alignment) and RSA (exhaustive search). MEME was run on both strands, one occurrence per sequence mode, and found the known motif ranked as second best. RSA Tools was run with oligo size 6 and noncoding regions as background, as set by the demo mode of the system. The wellconserved heptamer of the motifs used by MEME to build the weight matrix is printed in bold. Ohler, Niemann Trends Gen 17, 2 (2001) 3. Lecture WS 2004/05 Bioinformatics III 7

Strategy 2: Exhaustive motiv search in upstream regions Exploit the finding that relevant motifs are often repeated many times, possibly with small variations, in the upstream region for the regulatory action to be effective. Search upstream region for overrepresented motifs (1) Group genes based on the overrepresented motifs (2) Analyze sets of genes that share motifs for coregulation in microarray exp. (3) Consider overrepresented motifs labelling sets of co-regulated genes as candidate binding sites. Cora et al. BMC Bioinformatics 5, 57 (2004) 3. Lecture WS 2004/05 Bioinformatics III 8

Exhaustive motiv search in upstream regions Exploit Cora et al. BMC Bioinformatics 5, 57 (2004) 3. Lecture WS 2004/05 Bioinformatics III 9

Exhaustive motiv search in upstream regions Cora et al. BMC Bioinformatics 5, 57 (2004) 3. Lecture WS 2004/05 Bioinformatics III 10

Exhaustive motiv search in upstream regions Cora et al. BMC Bioinformatics 5, 57 (2004) 3. Lecture WS 2004/05 Bioinformatics III 11

Recently published tools for promoter finding Ohler, Niemann Trends Gen 17, 2 (2001) 3. Lecture WS 2004/05 Bioinformatics III 12

Position-specific weight matrix Popular approach when list of genes available that share TF binding motif; Good multiple sequence alignment available. Alignment matrix: lists # of occurrences of each letter at each position of an alignment Hertz, Stormo (1999) Bioinformatics 15, 563 3. Lecture WS 2004/05 Bioinformatics III 13

Position-specific weight matrix Examples of matrices used by YRSA http://forkhead.cgb.ki.se/yrsa/matrixlist.html 3. Lecture WS 2004/05 Bioinformatics III 14

Exp. Identification of TF binding site: DNase 1 Footprinting A protein bound to a specific DNA sequence will interfere with the digestion of that region by DNase I. An end-labelled DNA probe is incubated with a protein extract or a purified DNA-binding factor. The unprotected DNA is then partially digested with DNase I such that on average every DNA molecule is cut once. Digestion products are then resolved by electrophoresis. Comparison of the DNase I digestion pattern in the presence and absence of protein will allow the identification of a footprint (protected region) * * * * Denaturing PAGE Footprint 3. Lecture WS 2004/05 Bioinformatics III 15

Gel retardation assays Gel Shifts Electro Mobility Shift Assay (EMSA) Band Shift No protein add protein Incubating a purified protein, or a complex mixture of proteins e.g. nuclear or cell extract, with a 32 P end-labelled DNA fragment containing the putative protein binding site (from promoter region). * * Non-denaturing PAGE Reaction products are then analysed on a nondenaturing polyacrylamide gel. The specificity of the DNA-binding protein for the putative binding site is established by competition experiments using DNA fragments or oligonucleotides containing a binding site for the protein of interest, or other unrelated DNA sequences. Free DNA probe Retarded mobility due to protein binding 3. Lecture WS 2004/05 Bioinformatics III 16

3D structures of transcription factors 1A02.pdb 1AM9.pdb 1AU7.pdb TFs bind with very different binding modes. Some are sensitive for DNA conformation. 1CIT.pdb 1GD2.pdb 1H88.pdb 2 TFs bound! http://www.rcsb.org 3. Lecture WS 2004/05 Bioinformatics III 17

DNA conformation Canonical and mechanically distorted forms of helical DNA (from left to right: A-DNA, B-DNA, overstretched S-DNA,32 overtwisted P-DNA33). Conformational fluctuations of a B- DNA oligomer with an alternating GA sequence. The snapshots (100 ps intervals) from a simulation at 300 K using explicit solvent and counterions show axis and backbone fluctuations E. Giudice, R. Lavery (2002) Acc. Chem. Res. 35, 350-357. 3. Lecture WS 2004/05 Bioinformatics III 18

DNA conformation Induced base opening within B-DNA. Images show the conformational changes associated with moving thymine (bold) into the major groove of an oligomer with an alternating GA sequence. E. Giudice, R. Lavery (2002) Acc. Chem. Res. 35, 350-357. 3. Lecture WS 2004/05 Bioinformatics III 19

EM low-resolution structure of TF machinery Single particle images 3D reconstruction of TFIID Nogales et al. Science (1999) 3. Lecture WS 2004/05 Bioinformatics III 20

Identification of individual components Position of IIB and IIA on the TFIID structure and mapping of the TBP. The blue mesh corresponds to the holo-tfiid, with the A, B, and C lobes indicated. (A) The green mesh corresponds to the density difference between the holo-tfiid and the TFIID-IIB complex. (B) The magenta and green meshes show the density difference between the holo-tfiid and the trimeric complex TFIID-IIA-IIB. The density depicted in light green can be attributed to TFIIB by comparison with (A), and the magenta density therefore corresponds to IIA. (C) The yellow mesh shows the density difference between the holo-tfiid and TFIID that is bound to the TBP antibody. Nogales et al. Science (1999) 3. Lecture WS 2004/05 Bioinformatics III 21

database for eukaryotic transcription factors: TRANSFAC BIOBase / TU Braunschweig / GBF Relational database 6 flat files: FACTOR interaction of TFs SITE their DNA binding site GENE through which they regulate these target genes CELL factor source MATRIX TF nucleotide weight matrices CLASS classification scheme of TFs Wingender et al. (1998) J Mol Biol 284,241 3. Lecture WS 2004/05 Bioinformatics III 22

database for eukaryotic transcription factors: TRANSFAC BIOBase / TU Braunschweig / GBF Matys et al. (2003) Nucl Acid Res 31,374 3. Lecture WS 2004/05 Bioinformatics III 23

Match TM Search for putative TF binding sites in DNA sequences based on weight matrices. Use 2 values to score putative hits: Matrix similarity score: quality of a match between the sequence and the whole matrix [0,1] Core similarity score: quality of a match between the sequence and the core sequence of a matrix which consists of the five most conserved consecutive positions in a matrix [0,1] Profile: set of matrices and their cut-offs designed for function-driven searches Special profiles available for immune-cells, muscle cells, liver cells, and for cellcycle. Matys et al. (2003) Nucl Acid Res 31,374 3. Lecture WS 2004/05 Bioinformatics III 24

database for eukaryotic transcription factors: TRANSFAC BIOBase / TU Braunschweig / GBF Matys et al. (2003) Nucl Acid Res 31,374 3. Lecture WS 2004/05 Bioinformatics III 25

TRANSFAC classification 1 Superclass basic domains 3 Superclass: Helix-turn-helix 1.1 Leuzine zipper factors (bzip) 1.2 Helix-loop-helix factors (bhlh) 4 Superclass: beta-scaffold 1.3 bhlh-bzip Factors with Minor Groove 1.4 NF-1 Contacts 1.5 RF-X 1.6 bhsh 5 Superclass: others 2 Superclass: Zinc-coordinating DNA-binding domains 2.1 Cys4 zinc finger of nuclear receptor type 2.2 diverse Cys4 zinc fingers 2.3 Cys2His2 zinc finger domains 2.4 Cys6 cysteine-zinc cluster 2.5 Zinc fingers of alternating composition http://www.gene-regulation.com/pub/databases/transfac/cl.html 3. Lecture WS 2004/05 Bioinformatics III 26

Eintrag für 1.1 Leuzine-Zippers TRANSFAC classification http://www.gene-regulation.com 3. Lecture WS 2004/05 Bioinformatics III 27

TRANSFAC classification http://www.gene-regulation.com 3. Lecture WS 2004/05 Bioinformatics III 28

TRANSFAC classification http://www.gene-regulation.com 3. Lecture WS 2004/05 Bioinformatics III 29

Summary Large databases available (e.g. TRANSFAC) with information about promoter sites. Information verified experimentally. Microarray data allows searching for common motifs of coregulated genes. Also possible: common GO annotation etc. TF binding motifs are frequently overrepresented in 1000 bp upstream region. Clear function of this is unknown. (Same as in proline-rich recognition sequences.) Relatively few TFs regulate large number of genes. Complex regulatory network, Thursday lecture. http://www.gene-regulation.com 3. Lecture WS 2004/05 Bioinformatics III 30