Introduction to Microarray Data Analysis and Gene Networks. Alvis Brazma European Bioinformatics Institute

Similar documents
I. Gene Expression Figure 1: Central Dogma of Molecular Biology

Name: Period: Date: This handout will guide you through the format as you preview section 13.1 RNA in your textbook.

Videos. Lesson Overview. Fermentation

Videos. Bozeman Transcription and Translation: Drawing transcription and translation:

Replication Transcription Translation

CHAPTER 11 DNA NOTES PT. 4: PROTEIN SYNTHESIS TRANSCRIPTION & TRANSLATION

Biology. Biology. Slide 1 of 39. End Show. Copyright Pearson Prentice Hall

Biology. Biology. Slide 1 of 39. End Show. Copyright Pearson Prentice Hall

A DNA molecule consists of two strands of mononucleotides. Each of these strands

Chapter 12. DNA TRANSCRIPTION and TRANSLATION

Lecture Overview. Overview of the Genetic Information. Marieb s Human Anatomy and Physiology. Chapter 3 DNA & RNA Protein Synthesis Lecture 6

Introduction to Cellular Biology and Bioinformatics. Farzaneh Salari

Computational gene finding

13.1 RNA Lesson Objectives Contrast RNA and DNA. Explain the process of transcription.

Gene Expression. Student:

The study of the structure, function, and interaction of cellular proteins is called. A) bioinformatics B) haplotypics C) genomics D) proteomics

Developmental Biology BY1101 P. Murphy

Transcription. DNA to RNA

Sections 12.3, 13.1, 13.2

PRINCIPLES OF BIOINFORMATICS

UNIT 3 GENETICS LESSON #41: Transcription

DNA Structure & the Genome. Bio160 General Biology

Lecture 2: Biology Basics Continued. Fall 2018 August 23, 2018

COMP 555 Bioalgorithms. Fall Lecture 1: Course Introduction

CS313 Exercise 1 Cover Page Fall 2017

Algorithms in Bioinformatics

DNA RNA PROTEIN. Professor Andrea Garrison Biology 11 Illustrations 2010 Pearson Education, Inc. unless otherwise noted

Chapter 12 Packet DNA 1. What did Griffith conclude from his experiment? 2. Describe the process of transformation.

The Genetic Code and Transcription. Chapter 12 Honors Genetics Ms. Susan Chabot

Chapter 12 Reading Questions

Higher Human Biology Unit 1: Human Cells Pupils Learning Outcomes

DNA and Biotechnology Form of DNA Form of DNA Form of DNA Form of DNA Replication of DNA Replication of DNA

Molecular Biology. IMBB 2017 RAB, Kigali - Rwanda May 02 13, Francesca Stomeo

O C. 5 th C. 3 rd C. the national health museum

Unit 1: DNA and the Genome. Sub-Topic (1.3) Gene Expression

DNA. translation. base pairing rules for DNA Replication. thymine. cytosine. amino acids. The building blocks of proteins are?

What Are the Chemical Structures and Functions of Nucleic Acids?

Computational gene finding

Computational Genomics. Irit Gat-Viks & Ron Shamir & Haim Wolfson Fall

Advanced Algorithms and Models for Computational Biology

Big Idea 3C Basic Review

DNA: The Molecule Of Life

Molecular Genetics. The flow of genetic information from DNA. DNA Replication. Two kinds of nucleic acids in cells: DNA and RNA.

Transcription and Translation

Lecture Overview. Overview of the Genetic Information. Chapter 3 DNA & RNA Lecture 6

Course Information. Introduction to Algorithms in Computational Biology Lecture 1. Relations to Some Other Courses

Chemistry 106: Drugs in Society Lecture 17: Where Do Macromolecular Targets Come From? 5/07/18

Motivation From Protein to Gene

Transcription: Synthesis of RNA

Bundle 5 Test Review

Functional Genomics Overview RORY STARK PRINCIPAL BIOINFORMATICS ANALYST CRUK CAMBRIDGE INSTITUTE 18 SEPTEMBER 2017

Lesson Overview. Fermentation 13.1 RNA

Nucleic acids. How DNA works. DNA RNA Protein. DNA (deoxyribonucleic acid) RNA (ribonucleic acid) Central Dogma of Molecular Biology

What happens after DNA Replication??? Transcription, translation, gene expression/protein synthesis!!!!

6.047 / Computational Biology: Genomes, Networks, Evolution Fall 2008

Lecture 2: Central Dogma of Molecular Biology & Intro to Programming

(deoxyribonucleic acid)

3.1.4 DNA Microarray Technology

Introduction to Algorithms in Computational Biology Lecture 1

Basic concepts of molecular biology

Methods of Biomaterials Testing Lesson 3-5. Biochemical Methods - Molecular Biology -

MBioS 503: Section 1 Chromosome, Gene, Translation, & Transcription. Gene Organization. Genome. Objectives: Gene Organization

Name Class Date. Practice Test

MATH 5610, Computational Biology

RNA & PROTEIN SYNTHESIS

translation The building blocks of proteins are? amino acids nitrogen containing bases like A, G, T, C, and U Complementary base pairing links

Key Area 1.3: Gene Expression

PROTEIN SYNTHESIS Flow of Genetic Information The flow of genetic information can be symbolized as: DNA RNA Protein

The Structure of Proteins The Structure of Proteins. How Proteins are Made: Genetic Transcription, Translation, and Regulation

DNA Function: Information Transmission

Bioinformatics: Microarray Technology. Assc.Prof. Chuchart Areejitranusorn AMS. KKU.

Algorithms in Bioinformatics ONE Transcription Translation

Introduction to Genome Biology

Bis2A 12.0 Transcription *

Fermentation. Lesson Overview. Lesson Overview 13.1 RNA

Resources. How to Use This Presentation. Chapter 10. Objectives. Table of Contents. Griffith s Discovery of Transformation. Griffith s Experiments

Replication Review. 1. What is DNA Replication? 2. Where does DNA Replication take place in eukaryotic cells?

DNA is the genetic material. DNA structure. Chapter 7: DNA Replication, Transcription & Translation; Mutations & Ames test

Principle 2. Overview of Central. 3. Nucleic Acid Structure 4. The Organization of

Unit 6 Molecular Genetics

CHapter 14. From DNA to Protein

Chapter 2. An Introduction to Genes and Genomes

From Gene to Protein

Ch Molecular Biology of the Gene

The common structure of a DNA nucleotide. Hewitt

Hello! Outline. Cell Biology: RNA and Protein synthesis. In all living cells, DNA molecules are the storehouses of information. 6.

We have. Learned the structure of DNA. Talked about DNA replication and all of the complicated vocabulary that goes with it

DNA. Using DNA to solve crimes

Nucleic acids and protein synthesis

Chapter 8: DNA and RNA

Gene Expression: Transcription, Translation, RNAs and the Genetic Code

Genes & Gene Finding

13 1 Rna And Protein Synthesis Answers

ENGR 213 Bioengineering Fundamentals April 25, A very coarse introduction to bioinformatics

Nucleic Acid Structure:

BIOLOGY 111. CHAPTER 6: DNA: The Molecule of Life

RNA and Protein Synthesis

Central Dogma. 1. Human genetic material is represented in the diagram below.

Introduction. Why is your hair the color that it is??? Copyright 2002 Pearson Education, Inc., publishing as Benjamin Cummings

Human Molecular Genetics Prof. S. Ganesh Department of Biological Sciences and Bioengineering Indian Institute of Technology, Kanpur

Transcription:

Introduction to Microarray Data Analysis and Gene Networks Alvis Brazma European Bioinformatics Institute

A brief outline of this course What is gene expression, why it s important Microarrays and how they measure expression Steps in microarray data analysis Try some basic analysis of real microarray data A bit of theory about microarray data analysis Gene networks, what are they Methods or describing gene networks How microarrays can help to understand them Some more fancy stuff about gene networks

What will be needed to complete this course Complete some coursework on real data analysis using tools we ll try in the lectures Details to be finalised later this week

1. All you need to know about biology about this course in 10 20 min http://www.ebi.ac.uk/microarray/biology_intro.html Genomes and genes

Central dogma of molecular biology transcription DNA translation RNA Protein

DNA Four different nucleotides : adenosine, guanine, cytosine and thymine. They are usually referred to as bases and denoted by their initial letters, A,C,G and T 5' C-G-A-T-T-G-C-A-A-C-G-A-T-G-C 3' 3' G-C-T-A-A-C-G-T-T-G-C-T-A-C-G 5'

DNA - Biology as and information science 5' C-G-A-T-T-G-C-A-A-C-G-A-T-G-C 3' 3' G-C-T-A-A-C-G-T-T-G-C-T-A-C-G 5' Thus, for many information related purposes, the molecule can be represented as CGATTCAACGATGC The maximal amount of information that can be encoded in such a molecule is therefore 2 bits times the length of the sequence. Noting that the distance between nucleotide pairs in a DNA is about 0.34 nm, we can calculate that the linear information storage density in DNA is about 6x10 8 bits/cm, which is approximately 75 GB or 12.5 CD-Roms per cm.

Genomes, chromosomes Genome is a set of DNA molecules. Each chromosome contains (long) DAN molecule per chromosome The 23 human chromosomes Organism Number or chromosomes Genome size in base pairs Bacteria 1 ~400,000 - ~10,000,000 Yeast 12 14,000,000 Worm 6 100,000,000 Fly 4 300,000,000 Weed 5 125,000,000 Human 23 3,000,000,000

Genes and gene products, proteins For purposes of this course a gene is a continuous stretch of a genomic DNA molecule, from which a complex molecular machinery can read information (encoded as a string of A, T, G, and C) and make a particular type of a protein or a few different proteins Organism The number of predicted genes E.Coli (bacteria) 5000 90% Yeast 6000 70% Worm 18,000 27% Fly 14,000 20% Weed 25,500 20% Human 25,000 < 5% Part of the genome that encodes proteins (exons)

Central dogma of molecular biology transcription DNA translation RNA Protein

RNA Like DNA, RNA consists of 4 nucleotides, but instead of the thymine (T), it has an alternative uracil (U) RNA is similar to a DNA, but it s chemical properties are such that it keeps itself single stranded RNA is complimentary to a single stranded DNA 5' C-G-A-T-T-G-C-A-A-C-G-A-T-G-C 3' DNA 3' G-C-U-A-A-C-G-U-U-G-C-U-A-C-G 5' RNA

Splicing, translation, proteins When as according to the central dogma genes are transcribed into RNA, there may be interruptions called introns Because of alternative splicing (e.g., exon skipping) and posttranslational modification there are more proteins than genes

Proteins, their function Proteins are chains of 20 different types of aminoacids, and they have complex structures determined by their sequence. The structures in turn determine their functions

What are gene products doing? Molecular Function elemental activity or task Biological Process broad objective or goal Cellular Component location or complex Gene ontology

Gene expression A human organism has over 250 different cell types (e.g., muscle, skin, bone, neuron), most of which have identical genomes, yet they look different and do different jobs It is believed that less than 20% of the genes are expressed (i.e., making RNA) in a typical cell type Apparently the differences in gene expression is what makes the cells different

Some questions for the golden age of genomics How gene expression differs in different cell types? How gene expression differs in a normal and diseased (e.g., cancerous) cell? How gene expression changes when a cell is treated by a drug? How gene expression changes when the organism develops and cells are differentiating? How gene expression is regulated which genes regulate which and how?

Genes are regulated (switched on or off) Gene regulation networks outrageously simplified DNA GENE 1 GENE 2 GENE 3 GENE 4 Specific proteins called transcription factors promoter coding DNA G1 G2 G3 G4

2. Microarrays a tool for finding which genes have their products being produced (expressed) Type 1 - single channel (expensive) Type 2 - dual channel (cheaper)

How do microarrays work They exploit the DNA- RNA complementarity principle A single stranded DNA complementary to each gene are attached on the slide in a know location

How do microarrays work condition 1 mrna cdna hybridise to microarray condition 2

A microarray experiment Normally it will be more than one array per experiment More than 2 conditions can be copared The same condition can be used on array many times (replicate experiments) to fin out what is the noise level or natural gene expression variability within the same experiment

Sample RNA RNA RNA RNA RNA extract labelled acid acid acid nucleic acid acid A microarray experiment hybridisation genes Array design array array array Microarray Protocol Protocol Protocol Protocol Protocol Protocol normalization integration Gene expression data matrix

Steps in microarray data processing Array scans Quantitations Samples Spots Genes A B D C