Introduction to Bioinformatics and Gene Expression Technology

Utah State University, Spring 2014
STAT 5570: Statistical Bioinformatics, Notes 1.1

Vocabulary

Gene: hereditary DNA sequence at a specific location on a chromosome (that "does something")
Genetics: study of heredity & variation in organisms
Genome: an organism's total genetic content (full DNA sequence)
Genomics: study of organisms in terms of their genome
Protein: sequence of amino acids that "does something"
Proteomics: study of all of the proteins that can come from an organism's genome
Phylogeny: the evolutionary or historical development of an organism (or its DNA sequence)
Phylogenetics: the study of an organism's phylogeny
Phenotype: the physical characteristic of interest in each individual (for example, plant height, disease status, or embryo type)
Bioinformatics: the collection, organization, & analysis of large-scale, complex biological data
Statistical Bioinformatics: the application of statistical approaches to bioinformatics, especially in identifying significant changes (in sequences, expression patterns, etc.) that are biologically relevant (especially in affecting the phenotype)

Central Dogma of Molecular Biology

A road map to bioinformatics

Central Dogma: Gene
  Type of Study or Analysis: Genome; Genomic Hypothesis; Genotype; QTL
  Technology: Sequencing
Central Dogma: mRNA transcript
  Type of Study or Analysis: Transcript Profiling; Transcriptome
  Technology: Microarrays or Next-Gen Sequencing
(Epigenetics / methylation)
Central Dogma: Protein
  Type of Study or Analysis: Protein quantification and function; Proteome
  Technology: Protein Microarrays or Proteomics
Central Dogma: Phenotype

(From introductory lecture by RW Doerge at 2013 Joint Statistical Meetings)

Alphabets

DNA sequences defined by nucleotides (4)
Protein sequences defined by amino acids (20)
DNA sequence -> mRNA sequence -> Protein sequence

General assumption of microarray technology

Use mRNA transcript abundance level as a measure of the level of expression for the corresponding gene
Proportional to degree of gene expression
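The step from DNA sequence to mRNA sequence to protein sequence can be made concrete with a short sketch. The code below is an illustration rather than part of the lecture notes: it assumes the input is the coding strand read 5' to 3', uses the standard genetic code, and the function names `transcribe` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order ("*" = stop).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def transcribe(dna):
    """Coding-strand DNA -> mRNA (assumption: input is the coding strand)."""
    return dna.upper().replace("T", "U")

def translate(mrna):
    """mRNA -> protein, reading from the first base and stopping at a stop codon."""
    protein = []
    for i in range(0, len(mrna) - 2, 3):
        codon = mrna[i:i + 3].replace("U", "T")
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

mrna = transcribe("ATGGCCTAA")   # made-up example sequence
protein = translate(mrna)
n_amino_acids = len(set(AMINO) - {"*"})
```

The table maps the 64 codons over a 4-letter alphabet onto the 20-letter amino acid alphabet plus a stop signal, which is the relationship the Alphabets slide summarizes.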

How to measure mRNA abundance?

Several different approaches with similar themes:
  Affymetrix GeneChip oligonucleotide arrays
  Nimblegen array
  Two-color cDNA array
  More, including next-gen sequencing (later)

Representation of genes on slide:
  Small portion of gene
  Larger sequence of gene

A short word on bioinformatic technologies

"Never marry a technology, because it will always leave you."
Scott Tingey, Director of Genetic Discovery at DuPont (shared in RW Doerge 2013 introductory overview lecture at 2013 JSM)

In this class, we will discuss several technologies, emphasizing their recurring statistical issues
These are perpetual (and compounding)

Affymetrix Probes / Affymetrix Approach

Gene represented by a set of G probe pairs
PM: perfect match
MM: mismatch (homomeric substitution)

(Images courtesy Affymetrix, www.affymetrix.com)
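A small sketch may help show what "homomeric substitution" means for a PM/MM probe pair. It assumes a 25-base probe whose middle (13th) base is replaced by its complement, which is the usual GeneChip expression design but is not stated in the notes; the probe sequence and the function name `mismatch_probe` are made up for illustration.

```python
# Homomeric substitution: swap the base for its complement (A<->T, C<->G).
COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}

def mismatch_probe(pm):
    """Return the MM probe for a PM probe by complementing the middle base."""
    mid = len(pm) // 2  # index 12, i.e. the 13th base of a 25-mer
    return pm[:mid] + COMPLEMENT[pm[mid]] + pm[mid + 1:]

pm = "ACGTACGTACGTACGTACGTACGTA"  # hypothetical 25-mer perfect-match probe
mm = mismatch_probe(pm)

# The pair differs at exactly one position.
n_differences = sum(a != b for a, b in zip(pm, mm))
```

The MM probe is intended to capture nonspecific hybridization, so its intensity can be compared against the PM intensity for the same target.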

Affymetrix Technology: GeneChip

Affymetrix Technology: Expression

Each gene is represented by a unique set of probe pairs (usually 12-20 probe pairs per probe set)
Each spot on the array represents a single probe (with millions of copies)
These probes are fixed to the array
A tissue sample is prepared so that its mRNA has fluorescent tags; wait for hybridization

Affymetrix GeneChip Cartoon Representations (originally from Affymetrix outreach)

Animation 1: GeneChip structure (1 min.)
Animation 2: Measuring gene expression (2.5 min)
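Because each gene is measured by a whole probe set, the probe-level intensities have to be combined into one expression value. The sketch below shows one very simple possibility, the average of the PM minus MM differences. This is an illustrative assumption, not the method used in this course, and the intensities and the function name `average_difference` are invented; real preprocessing methods are considerably more involved.

```python
from statistics import mean

def average_difference(pm, mm):
    """Summarize a probe set as the mean of (PM - MM) over its probe pairs."""
    if len(pm) != len(mm):
        raise ValueError("each PM probe needs a matching MM probe")
    return mean(p - m for p, m in zip(pm, mm))

# Hypothetical intensities for a probe set with four probe pairs.
pm_intensities = [120.0, 150.0, 90.0, 200.0]
mm_intensities = [40.0, 60.0, 50.0, 80.0]

expression = average_difference(pm_intensities, mm_intensities)
```

A plain average is sensitive to a single badly behaved probe pair, which is one reason the choice of summary is a statistical question in its own right.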

Affymetrix: Steps of the Expression Assay

Images
[Figures: Full Array Image; Close-up of Array Image]

What are the data?

Usually spot intensities from specific wavelengths
Sometimes pixel-level intensities
  but then getting a single measurement for the spot isn't always obvious
Identifying the spots is not always obvious
Quality control is an issue

How to analyze data meaningfully?

Consider:
  Data quality
  Data distribution
  Data format & organization
  Appropriateness of measurement methods (& variance)
  Sources of variability (and their types)
  Appropriate models to account for sources of variability and address question of interest
  Meaning of P-values and appropriate tests of significance
  Statistical significance vs. biological relevance
  Appropriate and useful representation of results

Many useful tools available from Bioconductor
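To see why a single measurement per spot is not obvious when the raw data are pixel-level intensities, consider the sketch below. The pixel values are invented, with one bright artifact added on purpose, and the choice between mean and median is only an example of the kind of decision involved, not a recommendation from the notes.

```python
from statistics import mean, median

def summarize_spot(pixels):
    """Return two candidate single-number summaries of a spot's pixels."""
    return {"mean": mean(pixels), "median": median(pixels)}

# Hypothetical pixel intensities for one spot; 4000 mimics a dust speck.
pixels = [510, 495, 502, 498, 505, 4000]

summary = summarize_spot(pixels)
```

Here the mean is pulled far above the typical pixel value by the single artifact while the median is not, so the two summaries would give quite different spot intensities for the same image.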

The Bioconductor Project

Bioconductor is an open source and open development software project for the analysis and comprehension of genomic data
Not just for microarray data
Like a living family of software packages, changing with needs
Core team mainly at Fred Hutchinson Cancer Research, plus many other U.S. and international institutions

Main Features of the Bioconductor Project

Use of R
Documentation and reproducible research
Statistical and graphical methods
Annotation
Short courses
Open source
Open development

Source: www.bioconductor.org

What will we do?

Learn basics of major Bioconductor tools
Focus on statistical issues
Discuss recent developments
Learn to discuss all of this