Genomics and Gene Recognition Genes and Blue Genes

Similar documents
Interpretation of sequence results

ORFs and genes. Please sit in row K or forward

PRINCIPLES OF BIOINFORMATICS

Materials Protein synthesis kit. This kit consists of 24 amino acids, 24 transfer RNAs, four messenger RNAs and one ribosome (see below).

Lecture 10, 20/2/2002: The process of solution development - The CODEHOP strategy for automatic design of consensus-degenerate primers for PCR

SAY IT WITH DNA: Protein Synthesis Activity by Larry Flammer

Course Introduction. Bioinformatics: Issues and Algorithms. CSE Fall 2007 Lecture 1

DNA sentences. How are proteins coded for by DNA? Materials. Teacher instructions. Student instructions. Reflection

Lecture 19A. DNA computing

Arabidopsis actin depolymerizing factor AtADF4 mediates defense signal transduction triggered by the Pseudomonas syringae effector AvrPphB

Electronic Supplementary Information

CSE : Computational Issues in Molecular Biology. Lecture 1. Spring 2004

Supporting information for Biochemistry, 1995, 34(34), , DOI: /bi00034a013

PGRP negatively regulates NOD-mediated cytokine production in rainbow trout liver cells

Disease and selection in the human genome 3

Lecture 11: Gene Prediction

Supplemental Data Supplemental Figure 1.

Genes and Proteins. Objectives

NAME:... MODEL ANSWER... STUDENT NUMBER:... Maximum marks: 50. Internal Examiner: Hugh Murrell, Computer Science, UKZN

I. Gene Expression Figure 1: Central Dogma of Molecular Biology

From DNA to Protein: Genotype to Phenotype

Gene Expression Transcription/Translation Protein Synthesis

BIOLOGY - CLUTCH CH.17 - GENE EXPRESSION.

From DNA to Protein: Genotype to Phenotype

RPA-AB RPA-C Supplemental Figure S1: SDS-PAGE stained with Coomassie Blue after protein purification.

Supplementary. Table 1: Oligonucleotides and Plasmids. complementary to positions from 77 of the SRα '- GCT CTA GAG AAC TTG AAG TAC AGA CTG C

Expression of the genome. Books: 1. Molecular biology of the gene: Watson et al 2. Genetics: Peter J. Russell

Figure S1. Characterization of the irx9l-1 mutant. (A) Diagram of the Arabidopsis IRX9L gene drawn based on information from TAIR (the Arabidopsis

Supplementary Figure 1A A404 Cells +/- Retinoic Acid

TRANSCRIPTION. Renáta Schipp

PROTEIN SYNTHESIS Study Guide

Supplement 1: Sequences of Capture Probes. Capture probes were /5AmMC6/CTG TAG GTG CGG GTG GAC GTA GTC

Hes6. PPARα. PPARγ HNF4 CD36

Gene Expression: Transcription, Translation, RNAs and the Genetic Code

1. DNA, RNA structure. 2. DNA replication. 3. Transcription, translation

DNA Replication and Repair

RNA Expression of the information in a gene generally involves production of an RNA molecule transcribed from a DNA template. RNA differs from DNA

From DNA to Proteins

Supplementary Information. Construction of Lasso Peptide Fusion Proteins

3'A C G A C C A G T A A A 5'

PCR analysis was performed to show the presence and the integrity of the var1csa and var-

Chapter 3: Information Storage and Transfer in Life

Supplemental Data. mir156-regulated SPL Transcription. Factors Define an Endogenous Flowering. Pathway in Arabidopsis thaliana

Supporting Information

Answers to Module 1. An obligate aerobe is an organism that has an absolute requirement of oxygen for growth.

Multiplexing Genome-scale Engineering

Add 5µl of 3N NaOH to DNA sample (final concentration 0.3N NaOH).

Hello! Outline. Cell Biology: RNA and Protein synthesis. In all living cells, DNA molecules are the storehouses of information. 6.

Name Date Class. The Central Dogma of Biology

Make the protein through the genetic dogma process.

Transcription. The sugar molecule found in RNA is ribose, rather than the deoxyribose found in DNA.

Cat. # Product Size DS130 DynaExpress TA PCR Cloning Kit (ptakn-2) 20 reactions Box 1 (-20 ) ptakn-2 Vector, linearized 20 µl (50 ng/µl) 1

NUCLEIC ACIDS AND PROTEIN SYNTHESIS

7.03 Problem Set 3 Due before 5 PM on Wednesday, October 18 Hand in answers in recitation section or in the box outside of

DNA Transcription. Dr Aliwaini

Biotechnology Unit 3: DNA to Proteins. From DNA to RNA

BioInformatics and Computational Molecular Biology. Course Website

Replication, Transcription, and Translation

Chapter 8 Lecture Outline. Transcription, Translation, and Bioinformatics

Supplementary Materials for

strain devoid of the aox1 gene [1]. Thus, the identification of AOX1 in the intracellular

Biomolecules: lecture 6

Y-chromosomal haplogroup typing Using SBE reaction

G+C content. 1 Introduction. 2 Chromosomes Topology & Counts. 3 Genome size. 4 Replichores and gene orientation. 5 Chirochores.

PROTEIN SYNTHESIS Flow of Genetic Information The flow of genetic information can be symbolized as: DNA RNA Protein

Expression of Recombinant Proteins

Chapter 11. Transcription. The biochemistry and molecular biology department of CMU

Codon Bias with PRISM. 2IM24/25, Fall 2007

Chapter 4 DNA Structure & Gene Expression

The Making of the Fittest: Natural Selection and Adaptation

evaluated with UAS CLB eliciting UAS CIT -N Libraries increase in the

Fig Ch 17: From Gene to Protein

Molecular Level of Genetics

Gene function at the level of traits Gene function at the molecular level

Supporting Online Information

SCBC203 Gene Expression. Assoc. Prof. Rutaiwan Tohtong Department of Biochemistry Faculty of Science PR318

Genes and How They Work. Chapter 15

Supplemental material

DNA is the MASTER PLAN. RNA is the BLUEPRINT of the Master Plan

Human Gene,cs 06: Gene Expression. Diversity of cell types. How do cells become different? 9/19/11. neuron

Independent Study Guide The Blueprint of Life, from DNA to Protein (Chapter 7)

Nucleic acids deoxyribonucleic acid (DNA) ribonucleic acid (RNA) nucleotide

Det matematisk-naturvitenskapelige fakultet

Project 07/111 Final Report October 31, Project Title: Cloning and expression of porcine complement C3d for enhanced vaccines

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Chapter 10: Gene Expression and Regulation

Supplementary Figure 1. Localization of MST1 in RPE cells. Proliferating or ciliated HA- MST1 expressing RPE cells (see Fig. 5b for establishment of


Big Idea 3C Basic Review

NUCLEIC ACID METABOLISM. Omidiwura, B.R.O

Supplemental Data. Bennett et al. (2010). Plant Cell /tpc

Protein Synthesis: Transcription and Translation

UNIT I RNA AND TYPES R.KAVITHA,M.PHARM LECTURER DEPARTMENT OF PHARMACEUTICS SRM COLLEGE OF PHARMACY KATTANKULATUR

RNA, & PROTEIN SYNTHESIS. 7 th Grade, Week 4, Day 1 Monday, July 15, 2013

DNA Structure DNA Nucleotide 3 Parts: 1. Phosphate Group 2. Sugar 3. Nitrogen Base

Protein Structure Analysis

RNA and PROTEIN SYNTHESIS. Chapter 13

Regulation of bacterial gene expression

Assessment Schedule 2013 Biology: Demonstrate understanding of gene expression (91159)

ANCIENT BACTERIA? 250 million years later, scientists revive life forms

Transcription:

Genomics and Gene Recognition Genes and Blue Genes November 1, 2004

Prokaryotic Gene Structure prokaryotes are simplest free-living organisms studying prokaryotes can give us a sense what is the minimum number of genes for survival great opportunities because of genome sequencing about 300 genes is minimum (must contain genes for replication, and genes to obtain and store energy) currently, about 120 genomes have been finished central dogma of molecular biology

Transcription and Regulation of Gene Expression transcription is highly regulated in all cells in prokaryotes, only about 3% of the genes are undergoing transcription at any given time in eukaryotes, only 0.01% of the genes are transcribed at any given time (for differentiated cells, cells that perform their specific function) which gene is transcribed? it is determined by the growth status of the cell, metabolic condition etc.

Gene Expression the process by which a gene s information is converted into its product (say, a protein) consists of transcription and translation followed by: folding post-translational modification targeting the amount of protein that is expressed depends on the tissue, developmental stage of the cell and metabolic and physiologic state of the cell

Transcription in Prokaryotes all RNA is synthesized by a single species of DNAdependent RNA polymerase the RNA polymerase of E. Coli, so called RNA polymerase holoenzyme is a complex multimeric protein large enough to be visible in the electron microscope it consists of four subunits α 2 ββ σ, where β is the largest subunit σ can be any of the remaining subunits and it recognizes promoters that identify the location of transcription start site β and β contribute to formation of a catalytic site two α subunits are essential for assembly of an enzyme

The Steps in Prokaryotic Transcription 1. binding of RNA polymerase holoenzyme at promoter sites 2. initiation of polymerization 3. chain elongation 4. chain termination

Experimental Promoter Identification Promoters are identified in vitro by a technique called DNA footprinting: RNA polymerase holoenzyme is bound to a putative promoter sequence in a DNA duplex DNA:protein complex is treated with DNase I DNase I cleaves the DNA at sites not protected by bound protein the set of DNA fragments left after DNase I digestion reveals the promoter (by definition, promoter is the RNA polymerase holoenzyme binding site)

Prokaryotic Promoters +1 site is defined as the transcription start site (that base is the first base in the RNA transcript) +2 base is the second base in RNA transcript bases upstream from the initiation site are 1, 2 etc. There is no zero! RNA polymerase binding is typically spanning 40 to +20 the transcript site on the template side is almost always a pyrimidine, so the transcript almost always begins with a purine

Prokaryotic Promoters promoters vary in size from 20 to 200bp typically consist of a 40-bp region upstream from the transcription start within a promoter are two consensus sequence elements a consensus is defined as the bases that appear with highest frequency at each position when a series of sequences believed to have common function are compared the two sequences are: Prinbow box: near 10, whose consensus sequence is TATAAT the sequence in the 35 region containing the TTGACA the two elements are separated by about 17bp of nonconserved sequence the more closely 35 region resembles consensus the greater the efficiency of transcription rrna promoters have a third upstream element at about 55 (recognized by the α subunit)

Prokaryotic Promoters Nucleotide sequences of representative E. Coli promoters

Prokaryotic Gene Structure a gene consists of both promoters and coding parts!

Open Reading Frame (ORF) ribosomes translate triplets of nucleotides into amino acids start codon is usually AUG (which is encoded into methionine) stop codons are UAA, UAG, UGA it is important to determine from which nucleotide to start open reading frame is a sequence from the start till the first stop codon 5' 3' atgcccaagctgaatagcgtagaggggttttcatcatttgaggacgatgtataa 1 atg ccc aag ctg aat agc gta gag ggg ttt tca tca ttt gag gac gat gta taa M P K L N S V E G F S S F E D D V * 2 tgc cca agc tga ata gcg tag agg ggt ttt cat cat ttg agg acg atg tat C P S * I A * R G F H H L R T M Y 3 gcc caa gct gaa tag cgt aga ggg gtt ttc atc att tga gga cga tgt ata A Q A E * R R G V F I I * G R C I

Conceptual Translation 1960s and 1970s easy to determine amino acid sequence of a protein 1980s it becomes easier to sequence DNA many proteins are INFERRED from DNA sequences this is conceptual translation after conceptual translation, protein s structure and function are inferred (database search, structure prediction, function prediction, cell localization prediction, post-translational modification prediction and so on)

Genetic Code

Termination Sequences ~90% of prokaryotic operons contain intrinsic terminators Properties of intrinsic terminators inverted nucleotide sequences, e.g. (5 ) CGGATG CATCCG (3 ) run of ~6 uracils follows the repeat sequence RNA can adopt stable secondary structure (directly related to length of repeats) RNA secondary structure causes RNA polymerase to slow down transcription poly-u causes termination

Strength of Base Pairing G:C - stronger A:T - weaker

GC Content in Prokaryotes for every G in a double stranded DNA there must be a C it has been noticed that some bacteria have increased GC content some bacteria have 75% GC content, some have 25% relative ratios between G/C to A/T content is relatively uniform across bacterial genomes high GC or AT contents are consequences of the work of DNA polymerases and DNA repair mechanisms over time many bacteria have genes from other organisms through horizontal gene transfer new genes are different in nucleotide composition than old genes there is also codon usage bias

Prokaryotic Gene Density very high gene density in bacteria and archaea, >85% of DNA code for genes Example: E. Coli 4,288 genes average coding sequence: 950 bp separation between genes: 118 bp Characteristics: long open reading frames (60 or more codons) simple promoter sequences due to a small number of sigma factors recognizable transcriptional termination signals

DNA Polymerase duplicates genetic information the most accurate enzyme, makes less than one error in billion nucleotides it also proofreads the copied base, and if it is wrong, it repairs it DNA polymerase is used in forensics to help create DNA out of bad samples one cell has several DNA polymerases, some help replicate some repair DNA copies