Gene expression: Microarray data analysis. Copyright notice. Outline: microarray data analysis. Schedule
|
|
- Margaret Poole
- 6 years ago
- Views:
Transcription
1 Gene expression: Microarray data analysis Copyright notice Many of the images in this powerpoint presentation are from Bioinformatics and Functional Genomics by Jonathan Pevsner (ISBN ). Copyright 3 by John Wiley & Sons, Inc. January 5, 9 Jonathan Pevsner, Ph.D. Bioinformatics pevsner@kennedykrieger.org Johns Hopkins School of Medicine (M.E:8.77) These images and materials may not be used without permission from the publisher. We welcome instructors to use these powerpoints for educational purposes, but please acknowledge the source. The book has a homepage at including hyperlinks to the book chapters. Schedule Outline: microarray data analysis Gene expression Today : microarray data analysis (this morning) Today at :3: computer lab (and course review) Tuesday, January 6: no proteomics quiz due Monday, January : final exam 9-:3 am Monday, January : microarray quiz due (noon) Microarrays Preprocessing normalization scatter plots t-test ANOVA Exploratory (descriptive) statistics distances clustering principal components analysis (PCA)
2 Compare gene expression in this cell type Gene expression is context-dependent, and is regulated in several basic ways after viral infection relative to a knockout in samples from patients by region (e.g. brain versus kidney) in development (e.g. fetal versus adult tissue) in dynamic response to environmental signals (e.g. immediate-early response genes) in disease states after drug treatment at a later developmental time in a different body region by gene activity Page 57 UniGene: unique genes via ESTs DNA RNA protein DNA RNA protein Find UniGene at NCBI: UniGene clusters contain many ESTs cdna UniGene SAGE microarray cdna UniGene data come from many cdna libraries. Thus, when you look up a gene in UniGene you get information on its abundance and its regional distribution. next-generation sequencing!!! Fig. 6. Page 59 Page 64
3 Outline: microarray data analysis Microarrays: tools for gene expression Gene expression Microarrays Preprocessing normalization scatter plots A microarray is a solid support (such as a membrane or glass microscope slide) on which DNA of known sequence is deposited in a grid-like array. t-test ANOVA Exploratory (descriptive) statistics distances clustering principal components analysis (PCA) Page 73 Microarrays: tools for gene expression Advantages of microarray experiments Fast Data on >, transcripts in ~ weeks The most common form of microarray is used to measure gene expression. RNA is isolated from matched samples of interest. The RNA is typically converted to cdna, labeled with fluorescence (or radioactivity), then hybridized to microarrays in order to measure the expression levels of thousands of genes. Page 73 Comprehensive Flexible Easy Entire yeast or mouse genome on a chip Custom arrays can be made to represent genes of interest Submit RNA samples to a core facility Cheap? Chip representing, genes for $3 Table 6-4 Page 75 Disadvantages of microarray experiments Cost Some researchers can t afford to do appropriate numbers of controls, replicates RNA The final product of gene expression is protein significance Pervasive transcription of the genome is poorly understood (ENCODE project) There are many noncoding RNAs not yet represented on microarrays Quality control Impossible to assess elements on array surface Artifacts with image analysis Artifacts with data analysis Not enough attention to experimental design Not enough collaboration with statisticians Sample acquisition Data acquisition Data analysis Data confirmation Table 6-5 Page 76 Biological insight Fig. 6.6 Page 76 3
4 Stage : Experimental design Stage : Experimental design Stage : RNA and probe preparation Stage 3: Hybridization to DNA arrays Stage 4: Image analysis Stage 5: Microarray data analysis Stage 6: Biological confirmation Stage 7: Microarray databases Fig. 6.6 Page 76 [] Biological samples: technical and biological replicates: determine the data analysis approach at the outset [] RNA extraction, conversion, labeling, hybridization: except for RNA isolation, routinely performed at core facilities [3] Arrangement of array elements on a surface: randomization can reduce spatially-based artifacts Page 77 One sample per array (e.g. Affymetrix or radioactivity-based platforms) Two samples per array (competitive hybridization) Samples, Samples,3 Samples,3 Sample Sample Sample 3 Sample, pool Sample, pool Samples,: switch dyes Fig. 6.7 Page 77 Fig. 6.7 Page 77 Stage : RNA preparation Stage 3: Hybridization to DNA arrays For Affymetrix chips, need total RNA (about 5 ug) Confirm purity by running agarose gel Measure a6/a8 to confirm purity, quantity One of the greatest sources of error in microarray experiments is artifacts associated with RNA isolation; be sure to create an appropriately balanced, randomized experimental design. Page 78 The array consists of cdna or oligonucleotides Oligonucleotides can be deposited by photolithography The sample is converted to crna or cdna (Note that the terms probe and target may refer to the element immobilized on the surface of the microarray, or to the labeled biological sample; for clarity, it may be simplest to avoid both terms.) Page
5 Microarrays: array surface Stage 4: Image analysis RNA transcript levels are quantitated Fluorescence intensity is measured with a scanner, or radioactivity with a phosphorimager Southern et al. (999) Nature Genetics, microarray supplement Fig. 6.8 Page 79 Page 8 Differential Gene Expression on a cdna Microarray Control Rett α B Crystallin is over-expressed in Rett Syndrome Fig. 6.9 Page 8 Fig. 6. Page 8 Fig. 6. Page 8 Fig. 6. Page 8 5
6 Stage 5: Microarray data analysis Hypothesis testing How can arrays be compared? Which RNA transcripts (genes) are regulated? Are differences authentic? What are the criteria for statistical significance? Clustering Are there meaningful patterns in the data (e.g. groups)? Classification Do RNA transcripts predict predefined groups, such as disease subtypes? Fig. 6. Page 8 Page 8 Stage 6: Biological confirmation Stage 7: Microarray databases Microarray experiments can be thought of as hypothesis-generating experiments. The differential up- or down-regulation of specific RNA transcripts can be measured using independent assays such as -- Northern blots -- polymerase chain reaction (RT-PCR) -- in situ hybridization There are two main repositories: Gene expression omnibus (GEO) at NCBI ArrayExpress at the European Bioinformatics Institute (EBI) Page 8 Page 8 6
7 Array Express at the European Bioinformatics Institute MIAME In an effort to standardize microarray data presentation and analysis, Alvis Brazma and colleagues at 7 institutions introduced Minimum Information About a Microarray Experiment (MIAME). The MIAME framework standardizes six areas of information: experimental design microarray design sample preparation hybridization procedures image analysis controls for normalization Visit Page 8 7
8 Outline: microarray data analysis Microarray data analysis Gene expression Microarrays Preprocessing normalization scatter plots t-test ANOVA Exploratory (descriptive) statistics distances clustering principal components analysis (PCA) begin with a data matrix (gene expression values versus samples) genes (RNA transcript levels) Fig. 7. Page 9 Microarray data analysis Microarray data analysis begin with a data matrix (gene expression values versus samples) begin with a data matrix (gene expression values versus samples) Typically, there are many genes (>>,) and few samples (~ ) Preprocessing Descriptive statistics Fig. 7. Page 9 Fig. 7. Page 9 Microarray data analysis: preprocessing Microarray data analysis: preprocessing Observed differences in gene expression could be due to transcriptional changes, or they could be caused by artifacts such as: different labeling efficiencies of Cy3, Cy5 uneven spotting of DNA onto an array surface variations in RNA purity or quantity variations in washing efficiency variations in scanning efficiency The main goal of data preprocessing is to remove the systematic bias in the data as completely as possible, while preserving the variation in gene expression that occurs because of biologically relevant changes in transcription. A basic assumption of most normalization procedures is that the average gene expression level does not change in an experiment. Page 9 Page 9 8
9 Data analysis: global normalization Global normalization is used to correct two or more data sets. In one common scenario, samples are labeled with Cy3 (green dye) or Cy5 (red dye) and hybridized to DNA elements on a microrarray. After washing, probes are excited with a laser and detected with a scanning confocal microscope. Data analysis: global normalization Global normalization is used to correct two or more data sets Example: total fluorescence in Cy3 channel = 4 million units Cy 5 channel = million units Then the uncorrected ratio for a gene could show, units versus, units. This would artifactually appear to show -fold regulation. Page 9 Page 9 Data analysis: global normalization Scatter plots Global normalization procedure Step : subtract background intensity values (use a blank region of the array) Step : globally normalize so that the average ratio = (apply this to -channel or -channel data sets) Useful to represent gene expression values from two microarray experiments (e.g. control, experimental) Each dot corresponds to a gene expression value Most dots fall along a line Outliers represent up-regulated or down-regulated genes Page 9 Page 93 Brain Differential Gene Expression in Different Tissue and Cell Types Astrocyte Fibroblast Astrocyte Expression level (sample ) low expression level high up Expression level (sample ) down Fig. 7. Page 93 9
10 Scatter plots Typically, data are plotted on log-log coordinates Visually, this spreads out the data and offers symmetry raw ratio log ratio time behavior value value t= basal.. Log-log transformation Fig. 7.3 Page 95 t=h no change.. t=h -fold up.. t=3h -fold down.5 -. Page 94, 97 You can make these plots in Excel low expression level high up Log ratio down Mean log intensity Fig. 7.4 Page 96 but for many bioinformatics applications use R. Visit to download it. M M A A After RMA (a normalization procedure), the median is near zero, and skewing is corrected. Visit to download it. See chapter 9 ( nd edition) for a tutorial on microarray data analysis. Scatterplots display the effects of normalization.
11 SNOMAD converts array data to scatter plots Log-log plot - 4. old -f - - residual.5 - CON CON EXP > CON..5 -fold fold Mean ( Log ( Intensity ) ) EXP < CON Log (Ratio ). robust local regression fit old -f EXP > CON.5 -fold. -fold EXP < CON ld -fo Corrected Log ( Ratio ) [residuals] Log ( Ratio ) EXP EXP -fo ld 3 Linear-linear plot SNOMAD corrects local variance artifacts Mean ( Log ( Intensity ) ) Mean ( Log ( Intensity ) ) Page Robust multi-array analysis (RMA) SNOMAD describes regulated genes in Z-scores Developed by Rafael Irizarry (Dept. of Biostatistics), Terry Speed, and others Available at as an R package Also available in various software packages (including Partek, and Iobion Gene Traffic) See Bolstad et al. (3) Bioinformatics 9; Irizarry et al. (3) Biostatistics 4 Local Log ( Ratio ) Z-Score 5 Corrected Log ( Ratio ) Locally estimated standard deviation of positive ratios -5 Z= -fold Z= - -fold Z= Mean ( Log ( Intensity ) ) Z= 5.5 Corrected Log ( Ratio ) -.5 There are three steps: Locally estimated standard deviation of negative ratios - -. Mean ( Log ( Intensity ) ) Z= Z= [] Background adjustment based on a normal plus exponential model (no mismatch data are used) [] Quantile normalization (nonparametric fitting of signal intensity data to normalize their distribution) [3] Fitting a log scale additive model robustly. The model is additive: probe effect + sample effect -fold -fold Z= - - Z= -5 Z= - Z= Mean ( Log ( Intensity ) ) log intensity 8 RMA adjusts for the effect of GC content log signal intensity Histograms of raw intensity values for 4 arrays (plotted in R) before and after RMA was applied. 4 array GC content 6 4 log signal intensity Page array
12 Robust multi-array analysis (RMA) precision accuracy precision with accuracy RMA offers a large increase in precision (relative to Affymetrix MAS 5. software). precision Good performance: reproducibility of the result Good quality of the result (relative to a gold standard) log expression SD MAS 5. RMA average log expression Robust multi-array analysis (RMA) RMA offers comparable accuracy to MAS 5.. Outline: microarray data analysis Gene expression observed log expression accuracy log nominal concentration Microarrays Preprocessing normalization scatter plots t-test ANOVA Exploratory (descriptive) statistics distances clustering principal components analysis (PCA) are used to make inferences about a population from a sample. Hypothesis testing is a common form of inferential statistics. A null hypothesis is stated, such as: There is no difference in signal intensity for the gene expression measurements in normal and diseased samples. The alternative hypothesis is that there is a difference. We use a test statistic to decide whether to accept or reject the null hypothesis. For many applications, we set the significance level α to p <.5. Page 99 A t-test is a commonly used test statistic to assess the difference in mean values between two groups. x x difference between mean values t = = SE variability (standard error of the difference) Questions Is the sample size (n) adequate? Are the data normally distributed? Is the variance of the data known? Is the variance the same in the two groups? Is it appropriate to set the significance level to p <.5? Page 99
13 A t-test is a commonly used test statistic to assess the difference in mean values between two groups. x x difference between mean values t = = SE variability (standard error of the difference) Notes t is a ratio (it thus has no units) We assume the two populations are Gaussian The two groups may be of different sizes Obtain a P value from t using a table For a two-sample t test, the degrees of freedom is N -. For any value of t, P gets smaller as df gets larger Analyzing expression data in Excel Question: for each of my, transcripts, decide whether it is significantly regulated in some disease. control disease [] Obtain a matrix of genes (rows) and expression values columns. Here there are, rows of genes of which the first six are shown. There are three control samples and three disease samples. You can also calculate the mean value for each gene (transcript) for the controls and the disease (experimental) samples. Analyzing expression data in Excel Analyzing expression data in Excel [] You can calculate the ratios of control versus disease. Note that you can use the formula =E5/I5 in this case to divide the mean control and disease values. Also note that some ratios, such as., appear to be dramatic while others are not. Some researchers set a cutoff for changes of interest such as two-fold. [3] Perform a t-test. When you enter =TTEST into the function box above, a dialog box appears. Enter the range of values for controls and for disease samples, and specify a - or -tailed test. Analyzing expression data in Excel Analyzing expression data in Excel [3] Perform a t-test (continued). For a one-tailed test, your prior hypothesis is that the transcript in the disease group is up (or down) relative to controls; the change is unidirectional. For example, in Down syndrome samples you might hypothesize that chromosome transcripts are significantly up-regulated because of the extra copy of chromosome. [3] Perform a t-test (continued). For a two-tailed test, you hypothesize that the two groups are different, but you do not know in which direction. 3
14 Analyzing expression data in Excel t-test to determine statistical significance disease vs normal [3] Note the results: you can have a small p value (<.5) with a big ratio difference a small p value (<.5) with a trivial ratio difference a large p value (>.5) with a big ratio difference a large p value (>.5) with a trivial ratio difference Only the first group is worth reporting! Why? Error t statistic = difference between mean of disease and normal variation due to error ANOVA partitions total data variability Before partitioning After partitioning Paradigm Parametric test Nonparametric Error disease vs normal Subject Error disease vs normal Tissue type Compare two unpaired groups Unpaired t-test Mann-Whitney test Compare two paired groups Paired t-test Wilcoxon test F ratio = variation between DS and normal variation due to error Compare 3 or more groups ANOVA Table 7- Page 98- Is it appropriate to set the significance level to p <.5? If you hypothesize that a specific gene is up-regulated, you can set the probability value to.5. You might measure the expression of, genes and hope that any of them are up- or down-regulated. But you can expect to see 5% (5 genes) regulated at the p <.5 level by chance alone. To account for the thousands of repeated measurements you are making, some researchers apply a Bonferroni correction. The level for statistical significance is divided by the number of measurements, e.g. the criterion becomes: p < (.5)/, or p < 5 x -6 The Bonferroni correction is generally considered to be too conservative. Page 99 : false discovery rate The false discovery rate (FDR) is a popular multiple corrections correction. A false positive (also called a type I error) is sometimes called a false discovery. The FDR equals the p value of the t-test times the number of genes measured (e.g. for, genes and a p value of., there are expected false positives). You can adjust the false discovery rate. For example: FDR # regulated transcripts # false discoveries Would you report regulated transcripts of which are likely to be false positives, or transcripts of which one is likely to be a false positive? 4
15 A volcano plot displays both p values and fold change p value (treated versus control) log fold change (treated/untreated) Outline: microarray data analysis Gene expression Microarrays Preprocessing normalization scatter plots t-test ANOVA Exploratory (descriptive) statistics distances clustering principal components analysis (PCA) Descriptive statistics Microarray data are highly dimensional: there are many thousands of measurements made from a small number of samples. Descriptive (exploratory) statistics help you to find meaningful patterns in the data. A first step is to arrange the data in a matrix. Next, use a distance metric to define the relatedness of the different data points. Two commonly used distance metrics are: -- Euclidean distance -- Pearson coefficient of correlation Page 3 5
16 What is a cluster? samples (time points) A cluster is a group that has homogeneity (internal cohesion) and separation (external isolation). The relationships between objects being studied are assessed by similarity or dissimilarity measures. genes Data matrix ( genes and 3 time points from Chu et al., 998) Software: S- PLUS package Fig. 7.8 Page 5 Descriptive statistics: clustering Clustering algorithms offer useful visual descriptions of microarray data. t=. t=.5 t= 3D plot (using S-PLUS software) Fig. 7.8 Page 5 Genes may be clustered, or samples, or both. We will next describe hierarchical clustering. This may be agglomerative (building up the branches of a tree, beginning with the two most closely related objects) or divisive (building the tree by finding the most dissimilar objects first). In each case, we end up with a tree having branches and nodes. Page 4 Agglomerative clustering Agglomerative clustering a b a,b a b a,b c c d e d e d,e Adapted from Kaufman and Rousseeuw (99) Fig. 7.9 Page 6 Fig. 7.9 Page 6 6
17 Agglomerative clustering Agglomerative clustering a b c d e a,b d,e c,d,e a b c d e a,b d,e c,d,e a,b,c,d,e tree is constructed Fig. 7.9 Page 6 Fig. 7.9 Page 6 Divisive clustering Divisive clustering a,b,c,d,e a,b,c,d,e c,d,e Fig. 7.9 Page 6 Fig. 7.9 Page 6 Divisive clustering c,d,e a,b,c,d,e a,b Divisive clustering c,d,e a,b,c,d,e d,e d,e Fig. 7.9 Page 6 Fig. 7.9 Page 6 7
18 agglomerative a b c d e a,b d,e Divisive clustering c,d,e a,b,c,d,e a b c d e a,b d,e 3 c,d,e 4 a,b,c,d,e 4 3 tree is constructed Fig. 7.9 Page Adapted from Kaufman and Rousseeuw (99) divisive Fig. 7.9 Page 6 Fig. 7.8 Page 5 Fig. 7. Page 7 Cluster and TreeView Agglomerative and divisive clustering sometimes give conflicting results, as shown here Fig. 7. Page 7 Fig. 7. Page 8 8
19 Cluster and TreeView Cluster and TreeView clustering K means SOM PCA Fig. 7. Page 8 Fig. 7. Page 8 Two-way clustering of genes (y-axis) and cell lines (x-axis) (Alizadeh et al., ) Principal components analysis (PCA) An exploratory technique used to reduce the dimensionality of the data set to D or 3D For a matrix of m genes x n samples, create a new covariance matrix of size n x n Thus transform some large number of variables into a smaller number of uncorrelated variables called principal components (PCs). Fig. 7.3 Page 9 Page Principal components analysis (PCA): objectives to reduce dimensionality to determine the linear combination of variables to choose the most useful variables (features) to visualize multidimensional data to identify groups of objects (e.g. genes/samples) to identify outliers Page Page 9
20 Page Page Page Fig. 7.6 Page Fig. 7.6 Page Fig. 7.6 Page
21 Outline: microarray data analysis Gene expression Microarrays Preprocessing normalization scatter plots t-test ANOVA Exploratory (descriptive) statistics distances clustering principal components analysis (PCA)
Recent technology allow production of microarrays composed of 70-mers (essentially a hybrid of the two techniques)
Microarrays and Transcript Profiling Gene expression patterns are traditionally studied using Northern blots (DNA-RNA hybridization assays). This approach involves separation of total or polya + RNA on
More informationOutline. Analysis of Microarray Data. Most important design question. General experimental issues
Outline Analysis of Microarray Data Lecture 1: Experimental Design and Data Normalization Introduction to microarrays Experimental design Data normalization Other data transformation Exercises George Bell,
More informationIntroduction to gene expression microarray data analysis
Introduction to gene expression microarray data analysis Outline Brief introduction: Technology and data. Statistical challenges in data analysis. Preprocessing data normalization and transformation. Useful
More informationEECS730: Introduction to Bioinformatics
EECS730: Introduction to Bioinformatics Lecture 14: Microarray Some slides were adapted from Dr. Luke Huan (University of Kansas), Dr. Shaojie Zhang (University of Central Florida), and Dr. Dong Xu and
More informationAnalysis of Microarray Data
Analysis of Microarray Data Lecture 1: Experimental Design and Data Normalization George Bell, Ph.D. Senior Bioinformatics Scientist Bioinformatics and Research Computing Whitehead Institute Outline Introduction
More informationAnalysis of Microarray Data
Analysis of Microarray Data Lecture 1: Experimental Design and Data Normalization George Bell, Ph.D. Senior Bioinformatics Scientist Bioinformatics and Research Computing Whitehead Institute Outline Introduction
More informationIntroduction to Bioinformatics. Fabian Hoti 6.10.
Introduction to Bioinformatics Fabian Hoti 6.10. Analysis of Microarray Data Introduction Different types of microarrays Experiment Design Data Normalization Feature selection/extraction Clustering Introduction
More informationAnalysis of Microarray Data
Analysis of Microarray Data Lecture 3: Visualization and Functional Analysis George Bell, Ph.D. Senior Bioinformatics Scientist Bioinformatics and Research Computing Whitehead Institute Outline Review
More informationMicroarray Technique. Some background. M. Nath
Microarray Technique Some background M. Nath Outline Introduction Spotting Array Technique GeneChip Technique Data analysis Applications Conclusion Now Blind Guess? Functional Pathway Microarray Technique
More informationEstoril Education Day
Estoril Education Day -Experimental design in Proteomics October 23rd, 2010 Peter James Note Taking All the Powerpoint slides from the Talks are available for download from: http://www.immun.lth.se/education/
More informationGene Expression Technology
Gene Expression Technology Bing Zhang Department of Biomedical Informatics Vanderbilt University bing.zhang@vanderbilt.edu Gene expression Gene expression is the process by which information from a gene
More information6. GENE EXPRESSION ANALYSIS MICROARRAYS
6. GENE EXPRESSION ANALYSIS MICROARRAYS BIOINFORMATICS COURSE MTAT.03.239 16.10.2013 GENE EXPRESSION ANALYSIS MICROARRAYS Slides adapted from Konstantin Tretyakov s 2011/2012 and Priit Adlers 2010/2011
More informationGene Expression Data Analysis (I)
Gene Expression Data Analysis (I) Bing Zhang Department of Biomedical Informatics Vanderbilt University bing.zhang@vanderbilt.edu Bioinformatics tasks Biological question Experiment design Microarray experiment
More informationLecture #1. Introduction to microarray technology
Lecture #1 Introduction to microarray technology Outline General purpose Microarray assay concept Basic microarray experimental process cdna/two channel arrays Oligonucleotide arrays Exon arrays Comparing
More informationSIMS2003. Instructors:Rus Yukhananov, Alex Loguinov BWH, Harvard Medical School. Introduction to Microarray Technology.
SIMS2003 Instructors:Rus Yukhananov, Alex Loguinov BWH, Harvard Medical School Introduction to Microarray Technology. Lecture 1 I. EXPERIMENTAL DETAILS II. ARRAY CONSTRUCTION III. IMAGE ANALYSIS Lecture
More informationBackground Correction and Normalization. Lecture 3 Computational and Statistical Aspects of Microarray Analysis June 21, 2005 Bressanone, Italy
Background Correction and Normalization Lecture 3 Computational and Statistical Aspects of Microarray Analysis June 21, 2005 Bressanone, Italy Feature Level Data Outline Affymetrix GeneChip arrays Two
More informationProbe-Level Data Normalisation: RMA and GC-RMA Sam Robson Images courtesy of Neil Ward, European Application Engineer, Agilent Technologies.
Probe-Level Data Normalisation: RMA and GC-RMA Sam Robson Images courtesy of Neil Ward, European Application Engineer, Agilent Technologies. References Summaries of Affymetrix Genechip Probe Level Data,
More informationExploration, Normalization, Summaries, and Software for Affymetrix Probe Level Data
Exploration, Normalization, Summaries, and Software for Affymetrix Probe Level Data Rafael A. Irizarry Department of Biostatistics, JHU March 12, 2003 Outline Review of technology Why study probe level
More informationFEATURE-LEVEL EXPLORATION OF THE CHOE ET AL. AFFYMETRIX GENECHIP CONTROL DATASET
Johns Hopkins University, Dept. of Biostatistics Working Papers 3-17-2006 FEATURE-LEVEL EXPLORATION OF THE CHOE ET AL. AFFYETRIX GENECHIP CONTROL DATASET Rafael A. Irizarry Johns Hopkins Bloomberg School
More information3.1.4 DNA Microarray Technology
3.1.4 DNA Microarray Technology Scientists have discovered that one of the differences between healthy and cancer is which genes are turned on in each. Scientists can compare the gene expression patterns
More informationExpression summarization
Expression Quantification: Affy Affymetrix Genechip is an oligonucleotide array consisting of a several perfect match (PM) and their corresponding mismatch (MM) probes that interrogate for a single gene.
More informationMicroarrays: since we use probes we obviously must know the sequences we are looking at!
These background are needed: 1. - Basic Molecular Biology & Genetics DNA replication Transcription Post-transcriptional RNA processing Translation Post-translational protein modification Gene expression
More informationHumboldt Universität zu Berlin. Grundlagen der Bioinformatik SS Microarrays. Lecture
Humboldt Universität zu Berlin Microarrays Grundlagen der Bioinformatik SS 2017 Lecture 6 09.06.2017 Agenda 1.mRNA: Genomic background 2.Overview: Microarray 3.Data-analysis: Quality control & normalization
More informationNew Statistical Algorithms for Monitoring Gene Expression on GeneChip Probe Arrays
GENE EXPRESSION MONITORING TECHNICAL NOTE New Statistical Algorithms for Monitoring Gene Expression on GeneChip Probe Arrays Introduction Affymetrix has designed new algorithms for monitoring GeneChip
More informationOligonucleotide microarray data are not normally distributed
Oligonucleotide microarray data are not normally distributed Johanna Hardin Jason Wilson John Kloke Abstract Novel techniques for analyzing microarray data are constantly being developed. Though many of
More informationDNA Microarray Data Oligonucleotide Arrays
DNA Microarray Data Oligonucleotide Arrays Sandrine Dudoit, Robert Gentleman, Rafael Irizarry, and Yee Hwa Yang Bioconductor Short Course 2003 Copyright 2002, all rights reserved Biological question Experimental
More informationExploration and Analysis of DNA Microarray Data
Exploration and Analysis of DNA Microarray Data Dhammika Amaratunga Senior Research Fellow in Nonclinical Biostatistics Johnson & Johnson Pharmaceutical Research & Development Javier Cabrera Associate
More informationDesigning Complex Omics Experiments
Designing Complex Omics Experiments Xiangqin Cui Section on Statistical Genetics Department of Biostatistics University of Alabama at Birmingham 6/15/2015 Some slides are from previous lectures given by
More informationDNA Arrays Affymetrix GeneChip System
DNA Arrays Affymetrix GeneChip System chip scanner Affymetrix Inc. hybridization Affymetrix Inc. data analysis Affymetrix Inc. mrna 5' 3' TGTGATGGTGGGAATTGGGTCAGAAGGACTGTGGGCGCTGCC... GGAATTGGGTCAGAAGGACTGTGGC
More informationIntroduction to Genome Wide Association Studies 2015 Sydney Brenner Institute for Molecular Bioscience Shaun Aron
Introduction to Genome Wide Association Studies 2015 Sydney Brenner Institute for Molecular Bioscience Shaun Aron Many sources of technical bias in a genotyping experiment DNA sample quality and handling
More informationFACTORS CONTRIBUTING TO VARIABILITY IN DNA MICROARRAY RESULTS: THE ABRF MICROARRAY RESEARCH GROUP 2002 STUDY
FACTORS CONTRIBUTING TO VARIABILITY IN DNA MICROARRAY RESULTS: THE ABRF MICROARRAY RESEARCH GROUP 2002 STUDY K. L. Knudtson 1, C. Griffin 2, A. I. Brooks 3, D. A. Iacobas 4, K. Johnson 5, G. Khitrov 6,
More informationIntroduction to Genome Wide Association Studies 2014 Sydney Brenner Institute for Molecular Bioscience/Wits Bioinformatics Shaun Aron
Introduction to Genome Wide Association Studies 2014 Sydney Brenner Institute for Molecular Bioscience/Wits Bioinformatics Shaun Aron Genotype calling Genotyping methods for Affymetrix arrays Genotyping
More informationAgilent GeneSpring GX 10: Beyond. Pam Tangvoranuntakul Product Manager, GeneSpring October 1, 2008
Agilent GeneSpring GX 10: Gene Expression and Beyond Pam Tangvoranuntakul Product Manager, GeneSpring October 1, 2008 GeneSpring GX 10 in the News Our Goals for GeneSpring GX 10 Goal 1: Bring back GeneSpring
More informationCS 5984: Application of Basic Clustering Algorithms to Find Expression Modules in Cancer
CS 5984: Application of Basic Clustering Algorithms to Find Expression Modules in Cancer T. M. Murali January 31, 2006 Innovative Application of Hierarchical Clustering A module map showing conditional
More informationGene-Level Analysis of Exon Array Data using Partek Genomics Suite 6.6
Gene-Level Analysis of Exon Array Data using Partek Genomics Suite 6.6 Overview This tutorial will demonstrate how to: Summarize core exon-level data to produce gene-level data Perform exploratory analysis
More informationBioinformatics III Structural Bioinformatics and Genome Analysis. PART II: Genome Analysis. Chapter 7. DNA Microarrays
Bioinformatics III Structural Bioinformatics and Genome Analysis PART II: Genome Analysis Chapter 7. DNA Microarrays 7.1 Motivation 7.2 DNA Microarray History and current states 7.3 DNA Microarray Techniques
More informationIntroduction to Bioinformatics and Gene Expression Technology
Vocabulary Introduction to Bioinformatics and Gene Expression Technology Utah State University Spring 2014 STAT 5570: Statistical Bioinformatics Notes 1.1 Gene: Genetics: Genome: Genomics: hereditary DNA
More informationQuality Control Assessment in Genotyping Console
Quality Control Assessment in Genotyping Console Introduction Prior to the release of Genotyping Console (GTC) 2.1, quality control (QC) assessment of the SNP Array 6.0 assay was performed using the Dynamic
More informationBackground and Normalization:
Background and Normalization: Investigating the effects of preprocessing on gene expression estimates Ben Bolstad Group in Biostatistics University of California, Berkeley bolstad@stat.berkeley.edu http://www.stat.berkeley.edu/~bolstad
More informationCalculation of Spot Reliability Evaluation Scores (SRED) for DNA Microarray Data
Protocol Calculation of Spot Reliability Evaluation Scores (SRED) for DNA Microarray Data Kazuro Shimokawa, Rimantas Kodzius, Yonehiro Matsumura, and Yoshihide Hayashizaki This protocol was adapted from
More informationPlease purchase PDFcamp Printer on to remove this watermark. DNA microarray
DNA microarray Example of an approximately 40,000 probe spotted oligo microarray with enlarged inset to show detail. A DNA microarray is a multiplex technology used in molecular biology. It consists of
More informationAnalysis of a Tiling Regulation Study in Partek Genomics Suite 6.6
Analysis of a Tiling Regulation Study in Partek Genomics Suite 6.6 The example data set used in this tutorial consists of 6 technical replicates from the same human cell line, 3 are SP1 treated, and 3
More informationMicroarray Gene Expression Analysis at CNIO
Microarray Gene Expression Analysis at CNIO Orlando Domínguez Genomics Unit Biotechnology Program, CNIO 8 May 2013 Workflow, from samples to Gene Expression data Experimental design user/gu/ubio Samples
More informationBioinformatics and Genomics: A New SP Frontier?
Bioinformatics and Genomics: A New SP Frontier? A. O. Hero University of Michigan - Ann Arbor http://www.eecs.umich.edu/ hero Collaborators: G. Fleury, ESE - Paris S. Yoshida, A. Swaroop UM - Ann Arbor
More informationCOMPUTATIONAL ANALYSIS OF MICROARRAY DATA
COMPUTATIONAL ANALYSIS OF MICROARRAY DATA John Quackenbush Microarray experiments are providing unprecedented quantities of genome-wide data on gene-expression patterns. Although this technique has been
More informationComparative Genomic Hybridization
Comparative Genomic Hybridization Srikesh G. Arunajadai Division of Biostatistics University of California Berkeley PH 296 Presentation Fall 2002 December 9 th 2002 OUTLINE CGH Introduction Methodology,
More informationTechnical Review. Real time PCR
Technical Review Real time PCR Normal PCR: Analyze with agarose gel Normal PCR vs Real time PCR Real-time PCR, also known as quantitative PCR (qpcr) or kinetic PCR Key feature: Used to amplify and simultaneously
More informationEnhancers mutations that make the original mutant phenotype more extreme. Suppressors mutations that make the original mutant phenotype less extreme
Interactomics and Proteomics 1. Interactomics The field of interactomics is concerned with interactions between genes or proteins. They can be genetic interactions, in which two genes are involved in the
More informationPreprocessing Affymetrix GeneChip Data. Affymetrix GeneChip Design. Terminology TGTGATGGTGGGGAATGGGTCAGAAGGCCTCCGATGCGCCGATTGAGAAT
Preprocessing Affymetrix GeneChip Data Credit for some of today s materials: Ben Bolstad, Leslie Cope, Laurent Gautier, Terry Speed and Zhijin Wu Affymetrix GeneChip Design 5 3 Reference sequence TGTGATGGTGGGGAATGGGTCAGAAGGCCTCCGATGCGCCGATTGAGAAT
More informationIntroduction to Bioinformatics and Gene Expression Technologies
Introduction to Bioinformatics and Gene Expression Technologies Utah State University Fall 2017 Statistical Bioinformatics (Biomedical Big Data) Notes 1 1 Vocabulary Gene: hereditary DNA sequence at a
More informationSoybean Microarrays. An Introduction. By Steve Clough. November Common Microarray platforms
Soybean Microarrays Microarray construction An Introduction By Steve Clough November 2005 Common Microarray platforms cdna: spotted collection of PCR products from different cdna clones, each representing
More informationAffymetrix GeneChip Arrays. Lecture 3 (continued) Computational and Statistical Aspects of Microarray Analysis June 21, 2005 Bressanone, Italy
Affymetrix GeneChip Arrays Lecture 3 (continued) Computational and Statistical Aspects of Microarray Analysis June 21, 2005 Bressanone, Italy Affymetrix GeneChip Design 5 3 Reference sequence TGTGATGGTGGGGAATGGGTCAGAAGGCCTCCGATGCGCCGATTGAGAAT
More informationQuality Measures for CytoChip Microarrays
Quality Measures for CytoChip Microarrays How to evaluate CytoChip Oligo data quality in BlueFuse Multi software. Data quality is one of the most important aspects of any microarray experiment. This technical
More informationDetection and Restoration of Hybridization Problems in Affymetrix GeneChip Data by Parametric Scanning
100 Genome Informatics 17(2): 100-109 (2006) Detection and Restoration of Hybridization Problems in Affymetrix GeneChip Data by Parametric Scanning Tomokazu Konishi konishi@akita-pu.ac.jp Faculty of Bioresource
More informationIntroduction to Microarray Analysis
Introduction to Microarray Analysis Methods Course: Gene Expression Data Analysis -Day One Rainer Spang Microarrays Highly parallel measurement devices for gene expression levels 1. How does the microarray
More informationCell Stem Cell, volume 9 Supplemental Information
Cell Stem Cell, volume 9 Supplemental Information Large-Scale Analysis Reveals Acquisition of Lineage-Specific Chromosomal Aberrations in Human Adult Stem Cells Uri Ben-David, Yoav Mayshar, and Nissim
More informationRNA-Seq analysis using R: Differential expression and transcriptome assembly
RNA-Seq analysis using R: Differential expression and transcriptome assembly Beibei Chen Ph.D BICF 12/7/2016 Agenda Brief about RNA-seq and experiment design Gene oriented analysis Gene quantification
More informationUsing Low Input of Poly (A) + RNA and Total RNA for Oligonucleotide Microarrays Application
Using Low Input of Poly (A) + RNA and Total RNA for Oligonucleotide Microarrays Application Gene Expression Author Michelle M. Chen Agilent Technologies, Inc. 3500 Deer Creek Road, MS 25U-7 Palo Alto,
More informationMixed effects model for assessing RNA degradation in Affymetrix GeneChip experiments
Mixed effects model for assessing RNA degradation in Affymetrix GeneChip experiments Kellie J. Archer, Ph.D. Suresh E. Joel Viswanathan Ramakrishnan,, Ph.D. Department of Biostatistics Virginia Commonwealth
More informationSTATC 141 Spring 2005, April 5 th Lecture notes on Affymetrix arrays. Materials are from
STATC 141 Spring 2005, April 5 th Lecture notes on Affymetrix arrays Materials are from http://www.ohsu.edu/gmsr/amc/amc_technology.html The GeneChip high-density oligonucleotide arrays are fabricated
More informationA GENOTYPE CALLING ALGORITHM FOR AFFYMETRIX SNP ARRAYS
Bioinformatics Advance Access published November 2, 2005 The Author (2005). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org
More informationBiotechnology and Genomics in Public Health. Sharon S. Krag, PhD Johns Hopkins University
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this
More informationGene Expression Analysis with Pathway-Centric DNA Microarrays
Gene Expression Analysis with Pathway-Centric DNA Microarrays SuperArray Bioscience Corporation George J. Quellhorst, Jr. Ph.D. Manager, Customer Education Topics to be Covered Introduction to DNA Microarrays
More informationWhole Transcriptome Analysis of Illumina RNA- Seq Data. Ryan Peters Field Application Specialist
Whole Transcriptome Analysis of Illumina RNA- Seq Data Ryan Peters Field Application Specialist Partek GS in your NGS Pipeline Your Start-to-Finish Solution for Analysis of Next Generation Sequencing Data
More informationaffy: Built-in Processing Methods
affy: Built-in Processing Methods Ben Bolstad October 30, 2017 Contents 1 Introduction 2 2 Background methods 2 2.1 none...................................... 2 2.2 rma/rma2...................................
More informationGene Signal Estimates from Exon Arrays
Gene Signal Estimates from Exon Arrays I. Introduction: With exon arrays like the GeneChip Human Exon 1.0 ST Array, researchers can examine the transcriptional profile of an entire gene (Figure 1). Being
More informationDNA Microarrays and Computational Analysis of DNA Microarray. Data in Cancer Research
DNA Microarrays and Computational Analysis of DNA Microarray Data in Cancer Research Mario Medvedovic, Jonathan Wiest Abstract 1. Introduction 2. Applications of microarrays 3. Analysis of gene expression
More informationNew Stringent Two-Color Gene Expression Workflow Enables More Accurate and Reproducible Microarray Data
Application Note GENOMICS INFORMATICS PROTEOMICS METABOLOMICS A T C T GATCCTTC T G AAC GGAAC T AATTTC AA G AATCTGATCCTTG AACTACCTTCCAAGGTG New Stringent Two-Color Gene Expression Workflow Enables More
More informationDNA Microarray Technology
CHAPTER 1 DNA Microarray Technology All living organisms are composed of cells. As a functional unit, each cell can make copies of itself, and this process depends on a proper replication of the genetic
More informationCHAPTER 8 T Tests. A number of t tests are available, including: The One-Sample T Test The Paired-Samples Test The Independent-Samples T Test
CHAPTER 8 T Tests A number of t tests are available, including: The One-Sample T Test The Paired-Samples Test The Independent-Samples T Test 8.1. One-Sample T Test The One-Sample T Test procedure: Tests
More informationMicroarray Analysis of Gene Expression in Huntington's Disease Peripheral Blood - a Platform Comparison. CodeLink compatible
Microarray Analysis of Gene Expression in Huntington's Disease Peripheral Blood - a Platform Comparison CodeLink compatible Microarray Analysis of Gene Expression in Huntington's Disease Peripheral Blood
More informationless sensitive than RNA-seq but more robust analysis pipelines expensive but quantitiatve standard but typically not high throughput
Chapter 11: Gene Expression The availability of an annotated genome sequence enables massively parallel analysis of gene expression. The expression of all genes in an organism can be measured in one experiment.
More informationA Distribution Free Summarization Method for Affymetrix GeneChip Arrays
A Distribution Free Summarization Method for Affymetrix GeneChip Arrays Zhongxue Chen 1,2, Monnie McGee 1,*, Qingzhong Liu 3, and Richard Scheuermann 2 1 Department of Statistical Science, Southern Methodist
More informationQuantitative Real Time PCR USING SYBR GREEN
Quantitative Real Time PCR USING SYBR GREEN SYBR Green SYBR Green is a cyanine dye that binds to double stranded DNA. When it is bound to D.S. DNA it has a much greater fluorescence than when bound to
More informationClass Information. Introduction to Genome Biology and Microarray Technology. Biostatistics Rafael A. Irizarry. Lecture 1
This work is licensed under a Creative Commons ttribution-noncommercial-sharelike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this
More informationBCB 444/544 Fall 07 Dobbs 1
BCB 444/544 Required Reading (before lecture) Lecture 37 Brief Review: Microarrays Clustering & Classification #37_Nov26 Mon Nov 26 - Lecture 37 Clustering & Classification Chp 18 Functional Genomics Wed
More informationData Mining For Genomics
Data Mining For Genomics Alfred O. Hero III The University of Michigan, Ann Arbor, MI ISTeC Seminar, CSU Feb. 22, 23 1. Biotechnology Overview 2. Gene Microarray Technology 3. Mining the genomic database
More informationIntroduction to BioMEMS & Medical Microdevices DNA Microarrays and Lab-on-a-Chip Methods
Introduction to BioMEMS & Medical Microdevices DNA Microarrays and Lab-on-a-Chip Methods Companion lecture to the textbook: Fundamentals of BioMEMS and Medical Microdevices, by Prof., http://saliterman.umn.edu/
More informationGene Regulation Solutions. Microarrays and Next-Generation Sequencing
Gene Regulation Solutions Microarrays and Next-Generation Sequencing Gene Regulation Solutions The Microarrays Advantage Microarrays Lead the Industry in: Comprehensive Content SurePrint G3 Human Gene
More informationEstimating Cell Cycle Phase Distribution of Yeast from Time Series Gene Expression Data
2011 International Conference on Information and Electronics Engineering IPCSIT vol.6 (2011) (2011) IACSIT Press, Singapore Estimating Cell Cycle Phase Distribution of Yeast from Time Series Gene Expression
More informationV3 differential gene expression analysis
differential gene expression analysis - What is measured by microarrays? - Microarray normalization - Differential gene expression (DE) analysis based on microarray data - Detection of outliers - RNAseq
More informationMethods of Biomaterials Testing Lesson 3-5. Biochemical Methods - Molecular Biology -
Methods of Biomaterials Testing Lesson 3-5 Biochemical Methods - Molecular Biology - Chromosomes in the Cell Nucleus DNA in the Chromosome Deoxyribonucleic Acid (DNA) DNA has double-helix structure The
More informationDNA Collection. Data Quality Control. Whole Genome Amplification. Whole Genome Amplification. Measure DNA concentrations. Pros
DNA Collection Data Quality Control Suzanne M. Leal Baylor College of Medicine sleal@bcm.edu Copyrighted S.M. Leal 2016 Blood samples For unlimited supply of DNA Transformed cell lines Buccal Swabs Small
More informationUsing 2-way ANOVA to dissect gene expression following myocardial infarction in mice
Using 2-way ANOVA to dissect gene expression following myocardial infarction in mice Thank you for waiting. The presentation will be starting in a few minutes at 9AM Pacific Daylight Time. During this
More informationEstimation problems in high throughput SNP platforms
Estimation problems in high throughput SNP platforms Rob Scharpf Department of Biostatistics Johns Hopkins Bloomberg School of Public Health November, 8 Outline Introduction Introduction What is a SNP?
More informationREAL TIME PCR USING SYBR GREEN
REAL TIME PCR USING SYBR GREEN 1 THE PROBLEM NEED TO QUANTITATE DIFFERENCES IN mrna EXPRESSION SMALL AMOUNTS OF mrna LASER CAPTURE SMALL AMOUNTS OF TISSUE PRIMARY CELLS PRECIOUS REAGENTS 2 THE PROBLEM
More informationAcaStat How To Guide. AcaStat. Software. Copyright 2016, AcaStat Software. All rights Reserved.
AcaStat How To Guide AcaStat Software Copyright 2016, AcaStat Software. All rights Reserved. http://www.acastat.com Table of Contents Frequencies... 3 List Variables... 4 Descriptives... 5 Explore Means...
More informationOutline. Array platform considerations: Comparison between the technologies available in microarrays
Microarray overview Outline Array platform considerations: Comparison between the technologies available in microarrays Differences in array fabrication Differences in array organization Applications of
More informationMolecular Cell Biology - Problem Drill 11: Recombinant DNA
Molecular Cell Biology - Problem Drill 11: Recombinant DNA Question No. 1 of 10 1. Which of the following statements about the sources of DNA used for molecular cloning is correct? Question #1 (A) cdna
More informationmeasuring gene expression December 5, 2017
measuring gene expression December 5, 2017 transcription a usually short-lived RNA copy of the DNA is created through transcription RNA is exported to the cytoplasm to encode proteins some types of RNA
More informationQUESTION 2 What conclusion is most correct about the Experimental Design shown here with the response in the far right column?
QUESTION 1 When a Belt Poka-Yoke's a defect out of the process entirely then she should track the activity with a robust SPC system on the characteristic of interest in the defect as an early warning system.
More informationSupporting Online Material for
www.sciencemag.org/cgi/content/full/313/5795/1929/dc1 Supporting Online Material for The Connectivity Map: Using Gene-Expression Signatures to Connect Small Molecules, Genes, and Disease Justin Lamb, *
More informationMicroarray Analysis of Gene Expression in Huntington's Disease Peripheral Blood - a Platform Comparison
Microarray Analysis of Gene Expression in Huntington's Disease Peripheral Blood - a Platform Comparison Thank you for waiting. The presentation will be starting in a few minutes at 9AM Pacific Daylight
More informationIdentifying Candidate Informative Genes for Biomarker Prediction of Liver Cancer
Identifying Candidate Informative Genes for Biomarker Prediction of Liver Cancer Nagwan M. Abdel Samee 1, Nahed H. Solouma 2, Mahmoud Elhefnawy 3, Abdalla S. Ahmed 4, Yasser M. Kadah 5 1 Computer Engineering
More informationSECTION 11 ACUTE TOXICITY DATA ANALYSIS
SECTION 11 ACUTE TOXICITY DATA ANALYSIS 11.1 INTRODUCTION 11.1.1 The objective of acute toxicity tests with effluents and receiving waters is to identify discharges of toxic effluents in acutely toxic
More informationExpression Array System
Integrated Science for Gene Expression Applied Biosystems Expression Array System Expression Array System SEE MORE GENES The most complete, most sensitive system for whole genome expression analysis. The
More informationInherent variation in the reactions, type of enzymes used. Depends on the type of labeling and procedures, as well as the age of the labels.
332 Experimental design, analysis of variance and slide quality assessment in gene expression arrays Sorin Draghici*, Alexander Kuklin, Bruce Hoff & Soheil Shams Address BioDiscovery Inc 11150 West Olympic
More informationOverview. Presenter: Bill Cheney. Audience: Clinical Laboratory Professionals. Field Guide To Statistics for Blood Bankers
Field Guide To Statistics for Blood Bankers A Basic Lesson in Understanding Data and P.A.C.E. Program: 605-022-09 Presenter: Bill Cheney Audience: Clinical Laboratory Professionals Overview Statistics
More informationCAP 5510: Introduction to Bioinformatics CGS 5166: Bioinformatics Tools. Giri Narasimhan
CAP 5510: Introduction to Bioinformatics CGS 5166: Bioinformatics Tools Giri Narasimhan ECS 254; Phone: x3748 giri@cis.fiu.edu www.cis.fiu.edu/~giri/teach/bioinfs15.html Gene Expression q Process of transcription
More informationearray 5.0 Create your own Custom Microarray Design
earray 5.0 Create your own Custom Microarray Design http://earray.chem.agilent.com earray 5.x Overview Session Summary Session Summary Agilent Genomics Microarray Solution earray Functional Overview Gene
More information