Advanced Algorithms and Models for Computational Biology

Similar documents
Algorithms in Nature. (brief) introduction to biology

Computational Genomics

Videos. Lesson Overview. Fermentation

Videos. Bozeman Transcription and Translation: Drawing transcription and translation:

Genes and How They Work. Chapter 15

Unit 1: DNA and the Genome. Sub-Topic (1.3) Gene Expression

Biology. Biology. Slide 1 of 39. End Show. Copyright Pearson Prentice Hall

Biology. Biology. Slide 1 of 39. End Show. Copyright Pearson Prentice Hall

Chapter 12 Packet DNA 1. What did Griffith conclude from his experiment? 2. Describe the process of transformation.

The Structure of Proteins The Structure of Proteins. How Proteins are Made: Genetic Transcription, Translation, and Regulation

DNA & Protein Synthesis. The source and the process!

I. Gene Expression Figure 1: Central Dogma of Molecular Biology

Fermentation. Lesson Overview. Lesson Overview 13.1 RNA

Themes: RNA and RNA Processing. Messenger RNA (mrna) What is a gene? RNA is very versatile! RNA-RNA interactions are very important!

PROTEIN SYNTHESIS Flow of Genetic Information The flow of genetic information can be symbolized as: DNA RNA Protein

Lesson Overview. Fermentation 13.1 RNA

Fig Ch 17: From Gene to Protein

Roadmap. The Cell. Introduction to Molecular Biology. DNA RNA Protein Central dogma Genetic code Gene structure Human Genome

DNA Function: Information Transmission

Gene Expression Transcription/Translation Protein Synthesis

Review of Protein (one or more polypeptide) A polypeptide is a long chain of..

Biotechnology Unit 3: DNA to Proteins. From DNA to RNA

Transcription and Translation. DANILO V. ROGAYAN JR. Faculty, Department of Natural Sciences

The Nature of Genes. The Nature of Genes. Genes and How They Work. Chapter 15/16

Section 10.3 Outline 10.3 How Is the Base Sequence of a Messenger RNA Molecule Translated into Protein?

Introduction to Cellular Biology and Bioinformatics. Farzaneh Salari

Key Area 1.3: Gene Expression

PROTEIN SYNTHESIS. copyright cmassengale

Protein Synthesis & Gene Expression

Hello! Outline. Cell Biology: RNA and Protein synthesis. In all living cells, DNA molecules are the storehouses of information. 6.

Chapter 12. DNA TRANSCRIPTION and TRANSLATION

7.2 Protein Synthesis. From DNA to Protein Animation

Chapter 13. From DNA to Protein

From Gene to Protein

RNA, & PROTEIN SYNTHESIS. 7 th Grade, Week 4, Day 1 Monday, July 15, 2013

3'A C G A C C A G T A A A 5'

PROTEIN SYNTHESIS. Higher Level

Ch. 10 Notes DNA: Transcription and Translation

TRANSCRIPTION AND PROCESSING OF RNA

PROTEIN SYNTHESIS. copyright cmassengale

The Genetic Code and Transcription. Chapter 12 Honors Genetics Ms. Susan Chabot

Biology A: Chapter 9 Annotating Notes Protein Synthesis

RNA and Protein Synthesis

Expression of the genome. Books: 1. Molecular biology of the gene: Watson et al 2. Genetics: Peter J. Russell

Transcription is the first stage of gene expression

Biology 3201 Genetics Unit #5

Section 3: DNA Replication

Protein Synthesis. DNA to RNA to Protein

From DNA to Protein: Genotype to Phenotype

Protein Synthesis Notes

Computational gene finding

Make the protein through the genetic dogma process.

Prokaryotic Transcription

DNA RNA PROTEIN. Professor Andrea Garrison Biology 11 Illustrations 2010 Pearson Education, Inc. unless otherwise noted

Molecular Cell Biology - Problem Drill 08: Transcription, Translation and the Genetic Code

MBioS 503: Section 1 Chromosome, Gene, Translation, & Transcription. Gene Organization. Genome. Objectives: Gene Organization

Big Idea 3C Basic Review

Gene Expression: Transcription, Translation, RNAs and the Genetic Code

Multiple choice questions (numbers in brackets indicate the number of correct answers)

6.047 / Computational Biology: Genomes, Networks, Evolution Fall 2008

UNIT 3 GENETICS LESSON #41: Transcription

BIO 311C Spring Lecture 36 Wednesday 28 Apr.

Genomes DNA Genes to Proteins. The human genome is a multi-volume instruction manual

Bioinformatics: Sequence Analysis. COMP 571 Luay Nakhleh, Rice University

Gene Expression: Transcription

Regulation of bacterial gene expression

DNA RNA PROTEIN SYNTHESIS -NOTES-

The Nature of Genes. The Nature of Genes. The Nature of Genes. The Nature of Genes. The Nature of Genes. The Genetic Code. Genes and How They Work

Transcription: Synthesis of RNA

Bundle 6 Test Review

Introduction to Microarray Data Analysis and Gene Networks. Alvis Brazma European Bioinformatics Institute

6.C: Students will explain the purpose and process of transcription and translation using models of DNA and RNA

From DNA to Protein: Genotype to Phenotype

8/21/2014. From Gene to Protein

IB BIO I Replication/Transcription/Translation Van Roekel/Madden. Name Date Period. D. It separates DNA strands. (Total 1 mark)

Algorithms in Bioinformatics

Lecture for Wednesday. Dr. Prince BIOL 1408

DNA is the MASTER PLAN. RNA is the BLUEPRINT of the Master Plan

MATH 5610, Computational Biology

Molecular Genetics. The flow of genetic information from DNA. DNA Replication. Two kinds of nucleic acids in cells: DNA and RNA.

Chapter 8 From DNA to Proteins. Chapter 8 From DNA to Proteins

Key Concepts. Ø DNA Replication Ø Protein Synthesis Ø Transcription: Ø Translation: Ø messenger RNA (mrna)

Gene function at the level of traits Gene function at the molecular level

BIOLOGY - CLUTCH CH.17 - GENE EXPRESSION.

13.1 RNA Lesson Objectives Contrast RNA and DNA. Explain the process of transcription.

The Structure of RNA. The Central Dogma

Transcription. The sugar molecule found in RNA is ribose, rather than the deoxyribose found in DNA.

How to Use This Presentation

CHapter 14. From DNA to Protein

Transcription Eukaryotic Cells

Year III Pharm.D Dr. V. Chitra

CLEP Biology - Problem Drill 11: Transcription, Translation and The Genetic Code

Sections 12.3, 13.1, 13.2

CH 17 :From Gene to Protein

Protein Synthesis. OpenStax College

Year Morgan and fellow researchers found that chromosomes contained DNA, RNA, and protein.

Section 14.1 Structure of ribonucleic acid

If Dna Has The Instructions For Building Proteins Why Is Mrna Needed

Introduction to Genome Biology

Find this material useful? You can help our team to keep this site up and bring you even more content consider donating via the link on our site.

Transcription:

10-810 Advanced Algorithms and Models for Computational Biology Ziv Bar-Joseph zivbj@cs.cmu.edu WeH 4107 Eric Xing epxing@cs.cmu.edu WeH 4127 http://www.cs.cmu.edu/~epxing/class/10810-06/

Topics Introduction (1 Week) Sequence analysis (3 weeks) Gene expression (4 weeks) Genetics (3 weeks) Systems biology (3 weeks) Projects (1 week)

Introduction to Molecular Biology

The Eukaryotic Cell

Cells Type Eukaryots: - Plants, animals, humans - DNA resides in the nucleus - Contain also other compartments Prokaryots: -Bacteria - Do not contain compartments

Central Dogma Transcription factors DNA transcription mrna translation Proteins

The Genome

Genome A genome is an organism s complete set of DNA (including its genes). However, in humans less than 3% of the genome actually encodes for genes. A part of the rest of the genome serves as a control regions (though that s also a small part). The goal of the rest of the genome is unknown (a possible project ).

DNA Four letters alphabet: A, C, G and T. Complementary base pairing: A-T and G-C Double stranded. Encodes proteins in segments called genes. Control regions in front of genes. Similar in all cells in the body.

DNA Four letters alphabet: A, C, G and T. Complementary base pairing: A-T and G-C Double stranded. Encodes proteins in segments called genes. Control regions in front of genes. Similar in MOST cells in the body.

Comparison of Different Organisms Genome size Num. of genes E. coli.05*10 8 4,200 Yeast.15*10 8 6,000 Worm 1*10 8 18,400 Fly 1.8*10 8 13,600 Human 30*10 8 30,000 Plant 1.3*10 8 25,000

DNA is Divided Between Chromosomes Single molecule of DNA 46 chromosomes in human cells 22 pairs, and two sex chromosomes: X and Y For different diseases, many diseases related genes reside on the same chromosome

Genes

What is a gene? Promoter Protein coding sequence Terminator Genomic DNA

Example of a Gene: Gal4 DNA ATGAAGCTACTGTCTTCTATCGAACAAGCATGCGATATTTGCCGACTTAAAAAGCTCAAG TGCTCCAAAGAAAAACCGAAGTGCGCCAAGTGTCTGAAGAACAACTGGGAGTGTCGCTAC TCTCCCAAAACCAAAAGGTCTCCGCTGACTAGGGCACATCTGACAGAAGTGGAATCAAGG CTAGAAAGACTGGAACAGCTATTTCTACTGATTTTTCCTCGAGAAGACCTTGACATGATT TTGAAAATGGATTCTTTACAGGATATAAAAGCATTGTTAACAGGATTATTTGTACAAGAT AATGTGAATAAAGATGCCGTCACAGATAGATTGGCTTCAGTGGAGACTGATATGCCTCTA ACATTGAGACAGCATAGAATAAGTGCGACATCATCATCGGAAGAGAGTAGTAACAAAGGT CAAAGACAGTTGACTGTATCGATTGACTCGGCAGCTCATCATGATAACTCCACAATTCCG TTGGATTTTATGCCCAGGGATGCTCTTCATGGATTTGATTGGTCTGAAGAGGATGACATG TCGGATGGCTTGCCCTTCCTGAAAACGGACCCCAACAATAATGGGTTCTTTGGCGACGGT TCTCTCTTATGTATTCTTCGATCTATTGGCTTTAAACCGGAAAATTACACGAACTCTAAC GTTAACAGGCTCCCGACCATGATTACGGATAGATACACGTTGGCTTCTAGATCCACAACA TCCCGTTTACTTCAAAGTTATCTCAATAATTTTCACCCCTACTGCCCTATCGTGCACTCA CCGACGCTAATGATGTTGTATAATAACCAGATTGAAATCGCGTCGAAGGATCAATGGCAA ATCCTTTTTAACTGCATATTAGCCATTGGAGCCTGGTGTATAGAGGGGGAATCTACTGAT ATAGATGTTTTTTACTATCAAAATGCTAAATCTCATTTGACGAGCAAGGTCTTCGAGTCA

Genes Encode for Proteins

Example of a Gene: Gal4 AA MKLLSSIEQACDICRLKKLKCSKEKPKCAKCLKNNWECRYSPKTKRSPLTRAHLTEVESR LERLEQLFLLIFPREDLDMILKMDSLQDIKALLTGLFVQDNVNKDAVTDRLASVETDMPL TLRQHRISATSSSEESSNKGQRQLTVSIDSAAHHDNSTIPLDFMPRDALHGFDWSEEDDM SDGLPFLKTDPNNNGFFGDGSLLCILRSIGFKPENYTNSNVNRLPTMITDRYTLASRSTT SRLLQSYLNNFHPYCPIVHSPTLMMLYNNQIEIASKDQWQILFNCILAIGAWCIEGESTD IDVFYYQNAKSHLTSKVFESGSIILVTALHLLSRYTQWRQKTNTSYNFHSFSIRMAISLG LNRDLPSSFSDSSILEQRRRIWWSVYSWEIQLSLLYGRSIQLSQNTISFPSSVDDVQRTT TGPTIYHGIIETARLLQVFTKIYELDKTVTAEKSPICAKKCLMICNEIEEVSRQAPKFLQ MDISTTALTNLLKEHPWLSFTRFELKWKQLSLIIYVLRDFFTNFTQKKSQLEQDQNDHQS YEVKRCSIMLSDAAQRTVMSVSSYMDNHNVTPYFAWNCSYYLFNAVLVPIKTLLSNSKSN AENNETAQLLQQINTVLMLLKKLATFKIQTCEKYIQVLEEVCAPFLLSQCAIPLPHISYN NSNGSAIKNIVGSATIAQYPTLPEENVNNISVKYVSPGSVGPSPVPLKSGASFSDLVKLL SNRPPSRNSPVTIPRSTPSHRSVTPFLGQQQQLQSLVPLTPSALFGGANFNQSGNIADSS

Number of Genes in Public Databases

Structure of Genes in Mammalian Cells Within coding DNA genes there can be junk regions (Introns) Exons are segments of DNA that contain the gene s information coding for a protein Need to cut Introns out of RNA and splice together Exons before protein can be made Alternative splicing increases the potential number of different proteins, allowing the generation of millions of proteins from a small number of genes.

Identifying Genes in Sequence Data Predicting the start and end of genes as well as the introns and exons in each gene is one of the basic problems in computational biology. Gene prediction methods look for ORFs (Open Reading Frame). These are (relatively long) DNA segments that start with the start codon, end with one of the end codons, and do not contain any other end codon in between. Splice site prediction has received a lot of attention in the literature.

Comparative genomics

Regulatory Regions

Promoter The promoter is the place where RNA polymerase binds to start transcription. This is what determines which strand is the coding strand.

DNA Binding Motifs In order to recruit the transcriptional machinery, a transcription factor (TF) needs to bind the DNA in front of the gene. TFs bind in to short segments which are known as DNA binding motifs. Usually consists 6 8 letters, and in many cases these letters generate palindromes.

Example of Motifs

Central Dogma Transcription factors DNA transcription mrna translation Proteins

RNA Three major types (plus a recently discovered regulatory RNA). mrna messenger RNA trna Transfer RNA rrna ribosomal RNA RNAi RNA interference

Messenger RNA Basically, an intermediate product Transcribed from the genome and translated into protein Number of copies correlates well with number of proteins for the gene. Unlike DNA, the amount of messenger RNA (as well as the number of proteins) differs between different cell types and under different conditions.

Complementary base-pairing mrna is transcribed from the DNA mrna (like DNA, but unlike proteins) binds to its complement mrna TFIIH Transcription apparatus RNAPII Gene Activators mrna label hybridization AUGC UACG

Hybridization and Scanning Glass slide arrays - Prepare Cy3, Cy5- labeled ss cdna - Hybridize 600 ng of labeled ss cdna to glass slide array -Scan

The Ribosome Decoding machine. Input: mrna, output: protein Built from a large number of proteins and a number of RNAs. Several ribosomes can work on one mrna

The Ribosome

RNAi

Central Dogma Transcription factors DNA transcription mrna translation Proteins

Proteins Proteins are polypeptide chains of amino acids. Four levels of structure: - Primary Structure: The sequence of the protein - Secondary structure: Local structure in regions of the chain - Tertiary Structure: Three dimensional structure - Quaternary Structure: multiple subunits

Secondary Structure: Alpha Helix

Secondary Structure: Beta Sheet

Protein Structure

Domains of a Protein While predicting the structure from the sequence is still an open problem, we can identify several domains within the protein. Domains are compactly folded structures. In many cases these domains are associated with specific biological function.

Assigning Function to Proteins While almost 30000 genes have been identified in the human genome, relatively few have known functional annotation. Determining the function of the protein can be done in several ways. - Sequence similarity to other (known) proteins - Using domain information - Using three dimensional structure - Based on high throughput experiments (when does it functions and who it interacts with)

Protein Interaction In order to fulfill their function, proteins interact with other proteins in a number of ways including: Regulation Pathways, for example A -> B -> C Post translational modifications Forming protein complexes Determining how are these protein interact, and what are the resulting pathways and networks is one of the major focuses of this class.

What you should remember Higher level organization of the cell The central dogma Composition of the DNA: Genes, control regions Properties of mrna Structure of proteins