Microarray analysis of gene expression in male germ cell tumors

Size: px
Start display at page:

Download "Microarray analysis of gene expression in male germ cell tumors"

Transcription

1 Microarray analysis of gene expression in male germ cell tumors

2 Microarray analysis of gene expression in male germ cell tumors General microarry data analysis workflow From raw data to biological significance Comparison statistics and correction for multiple testing Clustering for visualization and partitioning GeneSifter Overview Microarray analysis of gene expression in male germ cell tumors Identification of biological themes Normal vs. seminoma Common to all tumors Tumor specific

3 Analysis Workflow Raw data Normalized, scaled data Differentially expressed genes Identify and partition expression patterns Gene Summaries Biological themes (Pathways, molecular function, etc.)

4 Analysis Workflow Raw data Normalized, scaled data Differentially expressed genes Identify and partition expression patterns Gene Summaries Data upload Comparison statistics, correction for multiple testing Up and down regulated, magnitude, clustering Annotation (UniGene, Entrez Gene, Gene Ontologies, etc.) Biological themes (Pathways, molecular function, etc.) Ontology report, pathway report, z-score

5 microarraysuccess.com Experiment Design Experimental design determines what can be inferred from the data as well as determining the confidence that can be assigned to those inferences. Careful experimental design and the presence of biological replicates are essential to the successful use of microarrays. Type of experiment Two groups Three or more groups Time series Dose response Multiple treatment The type of experiment and number of groups will affect the statistical methods used to detect differential expression Replicates The more the better, but at least 3 Biological better than technical Rigorous statistical inferences cannot be made with a sample size of one. The more replicates, the stronger the inference. Supporting material - Experimental Design and Other Issues in Microarray Studies - Kathleen Kerr -

6 microarraysuccess.com Differential Expression The fundamental goal of microarray experiments is to identify genes that are differentially expressed in the conditions being studied. Comparison statistics can be used to help identify differentially expressed genes and cluster analysis can be used to identify patterns of gene expression and to segregate a subset of genes based on these patterns. Statistical Significance Fold change Fold change does not address the reproducibility of the observed difference and cannot be used to determine the statistical significance. Comparison statistics 2 group t-test, Welch s t-test, Wilcoxon Rank Sum, 3 or more groups ANOVA, Kruskal-Wallis Comparison tests require replicates and use the variability within the replicates to assign a confidence level as to whether the gene is differentially expressed. Supporting material - Draghici S. (2002) Statistical intelligence: effective analysis of high-density microarray data. Drug Discov Today, 7(11 Suppl).: S55-63.

7 microarraysuccess.com Differential Expression Correction for multiple testing- Methods for adjusting the p-value from a comparison test based on the number of tests performed. These adjustments help to reduce the number of false positives in an experiment. FWER : Family Wise Error Rate (FWER) corrections adjust the p-value so that it reflects the chance of at least 1 false positive being found in the list. Bonferonni, Holm, W & Y MaxT FDR : False Discovery Rate corrections (FDR) adjust the p-value so that it reflects the frequency of false positives in the list. Benjamini and Hochberg, SAM The FWER is more conservative, but the FDR is usually acceptable for discovery experiments, i.e. where a small number of false positives is acceptable Dudoit, S., et al. (2003) Multiple hypothesis testing in microarray experiments. Statistical Science 18(1): Reiner, A., et al. (2003) Identifying differentially expressed genes using false discovery rate controlling procedures. Bioinformatics 19(3):

8 microarraysuccess.com Identification and partitioning of expression patterns Cluster Analysis - clustering methods are descriptive or exploratory tools that can be used to identify groups within complex datasets. Clustering methods can be used to identify patterns of gene expression in microarray datasets. Visualization Methods such as hierarchical clustering can be used to help identify patterns in a large dataset. Partitioning this type of cluster analysis can be used to separate data into discrete groups or clusters. - K-means -PAM (Partitioning around medoids) Cluster analysis is used to identify patterns of gene expression within large datasets and to segregate those genes based on these patterns. Quackenbush J. Computational analysis of microarray data. Nat Rev Genet Jun;2(6): Review. Kaufman L, Rousseeuw PJ: Finding Groups in Data: An Introduction to Cluster Analysis. New York: Wiley; 1990.

9 microarraysuccess.com Identification and partitioning of expression patterns Cluster analysis Used to identify groups, or clusters, of similar objects (gene expression profiles) on the basis of a set of feature vectors (expression measurements). Two general types - Hierarchical methods provide a hierarchy of clusters, from the smallest, where all objects (gene expression profiles) are in one cluster, through to the largest set, where each observation is in its own cluster. Partitioning methods partition the data (list of genes) into a pre-specified number (K) of mutually exclusive groups based on feature vector (expression profile).

10 microarraysuccess.com Identification and partitioning of expression patterns Hierarchical Partitioning Cluster analysis 1846 differentially expressed genes from FVB heart development time series.

11 Analysis Workflow 2 groups (Normal vs. Seminoma) MAS5 Raw data 3+ groups (All tumors) MAS5 Normalized, scaled data t-test ANOVA BH (FDR) BH (FDR) Differentially expressed genes Hierarchical clustering Upregulated genes Partitioning Identify and partition expression patterns One click gene summary Gene Summaries One click gene summary z-score report Biological themes (Pathways, molecular function, etc.) z-score report

12 GeneSifter Microarray Data Analysis Accessibility Web-based Secure Data management Data Annotation (MIAME) Multiple upload tools CodeLink Affymetrix Illumina Agilent Custom Differential Expression - Powerful, accessible tools for determining Statistical Significance R based statistics Bioconductor Comparison Tests t-test, Welch s t-test, Wilcoxon Rank sum test, ANOVA, Correction for Multiple Testing Bonferroni, Holm, Westfall and Young maxt, Benjamini and Hochberg Unsupervised Clustering PAM, CLARA, Hierarchical clustering Silhouettes

13 GeneSifter Microarray Data Analysis Integrated tools for determining Biological Significance One Click Gene Summary Ontology Report Pathway Report Search by ontology terms Search by KEGG terms or Chromosome

14 The GeneSifter Data Center Free resource Training Research Publishing 6 areas Cardiovascular Cancer Endocrinology Neuroscience Immunology Oral Biology Access to : Data Analysis summary Tutorials WebEx

15 The GeneSifter Data Center

16 Microarray analysis of gene expression in male germ cell tumors General microarry data analysis workflow From raw data to biological significance Comparison statistics and correction for multiple testing Clustering for visualization and partitioning GeneSifter Overview Microarray analysis of gene expression in male germ cell tumors Identification of biological themes Normal vs. seminoma Common to all tumors Tumor specific

17 Background Testicular Germ Cell Tumors Most common cancer in males Rates Increasing Seminoma undifferentiated germ cells Non seminoma Yolk Sac Tumor extraembryonic differentiation Teratoma multiple somatic lineages Embryonal Carcinoma least differentiated Choriocarcinoma - extraembryonic differentiation Mixed

18 The Data Affymetrix GeneChip Human Genome U133A Korkola JE, Houldsworth J, Dobrzynski D, Olshen AB, Reuter VE, Bosl GJ, Chaganti RS. Gene expression-based classification of nonseminomatous male germ cell tumors. Oncogene Jul 28;24(32):

19 Background - Data Gene expression-based classification of nonseminomatous male germ cell tumors Oncogene Jul 28;24(32): Korkola JE, Houldsworth J, Dobrzynski D, Olshen AB, Reuter VE, Bosl GJ, Chaganti RS.. Cell Biology Program, Memorial Sloan-Kettering Cancer Center, New York, NY 84 NSGCT (42 pure and 42 mixed) Affymetrix GeneChip human Genome U133A and U133B Identified 146 transcripts that classified histology (93% accuracy) Data available from GEO (GSE3218)

20 The Data Affymetrix GeneChip Human Genome U133A 5 Normal 10 Pure Yolk Sac Tumor 12 Pure Seminoma 15 Pure Teratoma 15 Pure Embryonal Carcinoma Korkola JE, Houldsworth J, Dobrzynski D, Olshen AB, Reuter VE, Bosl GJ, Chaganti RS. Gene expression-based classification of nonseminomatous male germ cell tumors. Oncogene Jul 28;24(32):

21 GeneSifter - Analysis Examples 2 groups (Normal testis vs. Seminoma Data Upload Affymetrix 3 + groups (All tumors.) Differential expression Fold change Quality t-test False discovery rate Differential expression Fold change Quality ANOVA False discovery rate Visualization Hierarchical clustering PCA Partitioning PAM Silhouettes Biological significance Gene Annotation Ontology report Pathway report

22 Pairwise Analysis Select group 1 5 Normal Testis Select group 2 12 Pure Seminoma

23 Pairwise Analysis Already normalized (MAS5) t-test Quality filter P (filters out A genes) Benjamini and Hochberg (FDR) Log transform data

24 Pairwise Analysis Gene List

25 Biological Significance Gene Annotation Sources UniGene - organizes GenBank sequences into a non-redundant set of gene-oriented clusters. Gene titles are assigned to the clusters and these titles are commonly used by researchers to refer to that particular gene. LocusLink (Entrez Gene) - provides a single query interface to curated sequence and descriptive information, including function, about genes. Gene Ontologies The Gene Ontology Consortium provides controlled vocabularies for the description of the molecular function, biological process and cellular component of gene products, that can be used by databases such as Entrez Gene. KEGG - Kyoto Encyclopedia of Genes and Genomes provides information about both regulatory and metabolic pathways for genes. Reference Sequences- The NCBI Reference Sequence project (RefSeq) provides reference sequences for both the mrna and protein products of included genes. GeneSifter maintains its own copies of these databases and updates them automatically.

26 One-Click Gene Summary

27 Pairwise Analysis Gene List

28 Ontology Report

29 Ontology Report : z-score R = total number of genes meeting selection criteria N = total number of genes measured r = number of genes meeting selection criteria with the specified GO term n = total number of genes measured with the specific GO term Reference: Scott W Doniger, Nathan Salomonis, Kam D Dahlquist, Karen Vranizan, Steven C Lawlor and Bruce R Conklin; MAPPFinder: usig Gene Ontology and GenMAPP to create a global gene-expression profile from microarray data, Genome Biology 2003, 4:R7

30 Z-score Report

31 Z-score Report

32 KEGG Report

33 Pairwise Analysis - Summary Normal Testis compared to Seminoma 5 Normal Testis 12 Pure Seminoma t-test, Benjamini and Hochberg (FDR) Pattern selection 2444 increased In Seminoma Z-scores Biological processes Protein biosysnthesis (155) Antigen presentation (19) RNA Splicing (30) Prostaglandin biosynthesis (5) KEGG Ribosome (76) ~21,000 transcripts 5990 genes 3546 decreased In Seminoma Biological processes Spermatogenesis (60) Protein ubiquitination (63) Protein amino acid dephosphorylation (39) KEGG Basal transcription factors (15) Ubiquitin mediated proteolysis (17) Circadian rhythm (7)

34 Microarray analysis of gene expression in male germ cell tumors General microarry data analysis workflow From raw data to biological significance Comparison statistics and correction for multiple testing Clustering for visualization and partitioning GeneSifter Overview Microarray analysis of gene expression in male germ cell tumors Identification of biological themes Normal vs. seminoma Common to all tumors Tumor specific

35 Pairwise Analysis Gene List Normal vs. Embryonal Carcinoma

36 Gene Lists Common and Unique Genes

37 Platform comparison Biological themes

38 Z-score Report

39 GeneSifter - Analysis Examples 2 groups (Normal testis vs. Seminoma Data Upload Affymetrix 3 + groups (All tumors.) Differential expression Fold change Quality t-test False discovery rate Differential expression Fold change Quality ANOVA False discovery rate Visualization Hierarchical clustering PCA Partitioning PAM Silhouettes Biological significance Gene Annotation Ontology report Pathway report

40 Projects User defined grouping of conditions Creates new set of normalized, averaged, transformed data Allows analysis across more than 2 groups Clustering, search by ontologies and other analysis options

41 Projects - Filtering ANOVA, 4 fold change cutoff, P quality, Benjamini and Hochberg (FDR)

42 Projects - Visualization Hierarchical clustering of 2573 genes

43 Projects - Partitioning

44 Projects Summary Cluster 4

45 Projects Summary Cluster 1 Cadmium ion binding

46 Projects - Visualization Hierarchical clustering of 2573 genes

47 Projects Pattern Search

48 Projects Increased in all tumors

49 Projects Increased in Teratoma

50 Projects Search by KEGG Pathway Wnt Signaling Pathway

51 Projects Search by KEGG Pathway Wnt Signaling Pathway

52 Projects Clustering of individual samples Hierarchical clustering of 57 samples

53 Projects Clustering of individual samples 2 Embryonal Carcinoma groups Group 2 Group 1

54 Pairwise Analysis Gene List Embryonal Carcinoma Group 1 vs. Group 2

55 Platform comparison Biological themes

56 Platform comparison Biological themes

57 Microarray analysis of gene expression in male germ cell tumors GSE3218 Affymetrix GeneChip Human Genome U133A 5 Normal 10 Pure Yolk Sac Tumor 12 Pure Seminoma 15 Pure Teratoma 15 Pure Embryonal Carcinoma 1000s of differentially expressed genes (normal vs tumor) Seminoma protein ubiquitination (dn), RNA splicing (up) 1000s of genes common to all tumors Spermatogenesis, carbohydrate metabolism (dn) Cell adhesion, regulation of actin cytoskeleton, immune response (up) 100s of genes common to 2 or more tumors Cadmium ion binding, cytokine activity (up) Small number of genes unique to each tumor 2 groups of embryonal carcinoma?

58 Microarray analysis of gene expression in male germ cell tumors Not examined: GSE3218 U133B Arrays Mixed tumors Metastatic vs primary Chemotherapy GSE1818 Agilent arrays 3 normal, 3 seminoma, 4 yolk sac tumor, 4 teratoma, 5 embryonal carcinoma Skotheim RI, Lind GE, Monni O, Nesland JM, Abeler VM, Fossa SD, Duale N, Brunborg G, Kallioniemi O, Andrews PW, Lothe RA. Differentiation of human embryonal carcinomas in vitro and in vivo reveals expression profiles relevant to normal development. Cancer Res Jul 1;65(13):

59 MicroarraySuccess.com Seven Keys to Successful Microarray Data Analysis Experiment Design Platform Selection Data Management System Access Differential Expression Biological Significance Data Publication Type of experiment Two groups Time series Dose Response Multiple treatments Replicates The more the better Technical vs. biological Platforms cdna Oligo One color Two color Feature Extraction Software File formats Databases Raw Data Storing Retrieving Experiment Annotation Samples Protocols Usability Intuitive Special training System Access Single user desktop Single user server Web-based Sharing data In the lab Collaboration Normalization Differential Expression Fold change Comparison statistics FWER/FDR Pattern Identification Clustering Visualization Partitioning Gene Annotation UniGene LocusLink Gene Ontology KEGG OMIM Single Genes Gene Summaries Gene Lists Ontology Report Pathway Report MIAME What is it? Publication Public databases GEO ArrayExpress SMD Using public data Meta analysis Academic partner University of Washington

60 The GeneSifter Data Center

61 Thank You Trial account, tutorials, sample data and Data Center Eric Olson

Microarray Analysis of Gene Expression in Huntington's Disease Peripheral Blood - a Platform Comparison. CodeLink compatible

Microarray Analysis of Gene Expression in Huntington's Disease Peripheral Blood - a Platform Comparison. CodeLink compatible Microarray Analysis of Gene Expression in Huntington's Disease Peripheral Blood - a Platform Comparison CodeLink compatible Microarray Analysis of Gene Expression in Huntington's Disease Peripheral Blood

More information

Seven Keys to Successful Microarray Data Analysis

Seven Keys to Successful Microarray Data Analysis Seven Keys to Successful Microarray Data Analysis Experiment Design Platform Selection Data Management System Access Differential Expression Biological Significance Data Publication Type of experiment

More information

Using 2-way ANOVA to dissect the immune response to hookworm infection in mouse lung

Using 2-way ANOVA to dissect the immune response to hookworm infection in mouse lung Using 2-way ANOVA to dissect the immune response to hookworm infection in mouse lung Using 2-way ANOVA to dissect the immune response to hookworm infection in mouse lung General microarry data analysis

More information

Using 2-way ANOVA to dissect gene expression following myocardial infarction in mice

Using 2-way ANOVA to dissect gene expression following myocardial infarction in mice Using 2-way ANOVA to dissect gene expression following myocardial infarction in mice Thank you for waiting. The presentation will be starting in a few minutes at 9AM Pacific Daylight Time. During this

More information

Microarray Analysis of Gene Expression in Huntington's Disease Peripheral Blood - a Platform Comparison

Microarray Analysis of Gene Expression in Huntington's Disease Peripheral Blood - a Platform Comparison Microarray Analysis of Gene Expression in Huntington's Disease Peripheral Blood - a Platform Comparison Thank you for waiting. The presentation will be starting in a few minutes at 9AM Pacific Daylight

More information

The microarray data analysis process - from raw data to biological significance. N. Eric Olson

The microarray data analysis process - from raw data to biological significance. N. Eric Olson The microarray data analysis process - from raw data to biological significance N. Eric Olson eric@genesifter.net Microarrays 60,000 56,000 Data Points per Experiment 50,000 40,000 30,000 20,000 10,000

More information

The microarray data analysis process - from raw data to biological significance

The microarray data analysis process - from raw data to biological significance The microarray data analysis process - from raw data to biological significance Thank you for waiting. The presentation will be starting in a few minutes at 6AM Pacific Daylight Time. During this webinar

More information

Identification of biological themes in microarray data from a mouse heart development time series using GeneSifter

Identification of biological themes in microarray data from a mouse heart development time series using GeneSifter Identification of biological themes in microarray data from a mouse heart development time series using GeneSifter VizX Labs, LLC Seattle, WA 98119 Abstract Oligonucleotide microarrays were used to study

More information

Introduction to Microarray Technique, Data Analysis, Databases Maryam Abedi PhD student of Medical Genetics

Introduction to Microarray Technique, Data Analysis, Databases Maryam Abedi PhD student of Medical Genetics Introduction to Microarray Technique, Data Analysis, Databases Maryam Abedi PhD student of Medical Genetics abedi777@ymail.com Outlines Technology Basic concepts Data analysis Printed Microarrays In Situ-Synthesized

More information

Gene Expression Data Analysis

Gene Expression Data Analysis Gene Expression Data Analysis Bing Zhang Department of Biomedical Informatics Vanderbilt University bing.zhang@vanderbilt.edu BMIF 310, Fall 2009 Gene expression technologies (summary) Hybridization-based

More information

Analysis of Microarray Data

Analysis of Microarray Data Analysis of Microarray Data Lecture 3: Visualization and Functional Analysis George Bell, Ph.D. Bioinformatics Scientist Bioinformatics and Research Computing Whitehead Institute Outline Review Visualizing

More information

Analysis of Microarray Data

Analysis of Microarray Data Analysis of Microarray Data Lecture 3: Visualization and Functional Analysis George Bell, Ph.D. Senior Bioinformatics Scientist Bioinformatics and Research Computing Whitehead Institute Outline Review

More information

Microarray Informatics

Microarray Informatics Microarray Informatics Donald Dunbar MSc Seminar 4 th February 2009 Aims To give a biologistʼs view of microarray experiments To explain the technologies involved To describe typical microarray experiments

More information

The Microarray Data Analysis Process: From Raw Data to Biological Significance

The Microarray Data Analysis Process: From Raw Data to Biological Significance NeuroRx : The Journal of the American Society for Experimental NeuroTherapeutics The Microarray Data Analysis Process: From Raw Data to Biological Significance N. Eric Olson VizX Labs, Seattle, Washington

More information

Annotation. (Chapter 8)

Annotation. (Chapter 8) Annotation (Chapter 8) Genome annotation Genome annotation is the process of attaching biological information to sequences: identify elements on the genome attach biological information to elements store

More information

Microarray Informatics

Microarray Informatics Microarray Informatics Donald Dunbar MSc Seminar 31 st January 2007 Aims To give a biologist s view of microarray experiments To explain the technologies involved To describe typical microarray experiments

More information

PATHWAY ANALYSIS. Susan LM Coort, PhD Department of Bioinformatics, Maastricht University. PET course: Toxicogenomics

PATHWAY ANALYSIS. Susan LM Coort, PhD Department of Bioinformatics, Maastricht University. PET course: Toxicogenomics PATHWAY ANALYSIS Susan LM Coort, PhD Department of Bioinformatics, Maastricht University 1 Data analysis overview Microarray scans Slide based on a slide from J. Pennings, RIVM, NL Image analysis Preprocessing

More information

CodeLink Human Whole Genome Bioarray

CodeLink Human Whole Genome Bioarray CodeLink Human Whole Genome Bioarray 55,000 human gene targets on a single bioarray The CodeLink Human Whole Genome Bioarray comprises one of the most comprehensive coverages of the human genome, as it

More information

Research Powered by Agilent s GeneSpring

Research Powered by Agilent s GeneSpring Research Powered by Agilent s GeneSpring Agilent Technologies, Inc. Carolina Livi, Bioinformatics Segment Manager Research Powered by GeneSpring Topics GeneSpring (GS) platform New features in GS 13 What

More information

Agilent GeneSpring GX 10: Beyond. Pam Tangvoranuntakul Product Manager, GeneSpring October 1, 2008

Agilent GeneSpring GX 10: Beyond. Pam Tangvoranuntakul Product Manager, GeneSpring October 1, 2008 Agilent GeneSpring GX 10: Gene Expression and Beyond Pam Tangvoranuntakul Product Manager, GeneSpring October 1, 2008 GeneSpring GX 10 in the News Our Goals for GeneSpring GX 10 Goal 1: Bring back GeneSpring

More information

Deakin Research Online

Deakin Research Online Deakin Research Online This is the published version: Church, Philip, Goscinski, Andrzej, Wong, Adam and Lefevre, Christophe 2011, Simplifying gene expression microarray comparative analysis., in BIOCOM

More information

Standard Data Analysis Report Agilent Gene Expression Service

Standard Data Analysis Report Agilent Gene Expression Service Standard Data Analysis Report Agilent Gene Expression Service Experiment: S534662 Date: 2011-01-01 Prepared for: Dr. Researcher Genomic Sciences Lab Prepared by S534662 Standard Data Analysis Report 2011-01-01

More information

Bioconductor. Course in Practical Microarray Analysis Berlin Slides 2002 Sandrine Dudoit, Robert Gentleman. Adapted by Wolfgang Huber.

Bioconductor. Course in Practical Microarray Analysis Berlin Slides 2002 Sandrine Dudoit, Robert Gentleman. Adapted by Wolfgang Huber. Bioconductor Course in Practical Microarray Analysis Berlin 27.-30.1.2003 Slides 2002 Sandrine Dudoit, Robert Gentleman. Adapted by Wolfgang Huber. Statistical computing Applications statistical design

More information

Understanding protein lists from proteomics studies. Bing Zhang Department of Biomedical Informatics Vanderbilt University

Understanding protein lists from proteomics studies. Bing Zhang Department of Biomedical Informatics Vanderbilt University Understanding protein lists from proteomics studies Bing Zhang Department of Biomedical Informatics Vanderbilt University bing.zhang@vanderbilt.edu A typical comparative shotgun proteomics study IPI00375843

More information

Microarrays & Gene Expression Analysis

Microarrays & Gene Expression Analysis Microarrays & Gene Expression Analysis Contents DNA microarray technique Why measure gene expression Clustering algorithms Relation to Cancer SAGE SBH Sequencing By Hybridization DNA Microarrays 1. Developed

More information

Outline. Analysis of Microarray Data. Most important design question. General experimental issues

Outline. Analysis of Microarray Data. Most important design question. General experimental issues Outline Analysis of Microarray Data Lecture 1: Experimental Design and Data Normalization Introduction to microarrays Experimental design Data normalization Other data transformation Exercises George Bell,

More information

AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE

AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE ACCELERATING PROGRESS IS IN OUR GENES AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE GENESPRING GENE EXPRESSION (GX) MASS PROFILER PROFESSIONAL (MPP) PATHWAY ARCHITECT (PA) See Deeper. Reach Further. BIOINFORMATICS

More information

Measuring and Understanding Gene Expression

Measuring and Understanding Gene Expression Measuring and Understanding Gene Expression Dr. Lars Eijssen Dept. Of Bioinformatics BiGCaT Sciences programme 2014 Why are genes interesting? TRANSCRIPTION Genome Genomics Transcriptome Transcriptomics

More information

Gene expression analysis: Introduction to microarrays

Gene expression analysis: Introduction to microarrays Gene expression analysis: Introduction to microarrays Adam Ameur The Linnaeus Centre for Bioinformatics, Uppsala University February 15, 2006 Overview Introduction Part I: How a microarray experiment is

More information

Integrative Genomics 1a. Introduction

Integrative Genomics 1a. Introduction 2016 Course Outline Integrative Genomics 1a. Introduction ggibson.gt@gmail.com http://www.cig.gatech.edu 1a. Experimental Design and Hypothesis Testing (GG) 1b. Normalization (GG) 2a. RNASeq (MI) 2b. Clustering

More information

Bioconductor. Course in Practical Microarray Analysis Heidelberg

Bioconductor. Course in Practical Microarray Analysis Heidelberg Bioconductor Course in Practical Microarray Analysis Heidelberg 23.-27.9.2002 Slides 2002 Sandrine Dudoit, Robert Gentleman. Adapted by Wolfgang Huber. Statistical computing Everywhere for statistical

More information

Introduction to Bioinformatics and Gene Expression Technology

Introduction to Bioinformatics and Gene Expression Technology Vocabulary Introduction to Bioinformatics and Gene Expression Technology Utah State University Spring 2014 STAT 5570: Statistical Bioinformatics Notes 1.1 Gene: Genetics: Genome: Genomics: hereditary DNA

More information

advanced analysis of gene expression microarray data aidong zhang World Scientific State University of New York at Buffalo, USA

advanced analysis of gene expression microarray data aidong zhang World Scientific State University of New York at Buffalo, USA advanced analysis of gene expression microarray data aidong zhang State University of New York at Buffalo, USA World Scientific NEW JERSEY LONDON SINGAPORE BEIJING SHANGHAI HONG KONG TAIPEI CHENNAI Contents

More information

GS Analysis of Microarray Data

GS Analysis of Microarray Data GS01 0163 Analysis of Microarray Data Keith Baggerly and Brad Broom Department of Bioinformatics and Computational Biology UT M. D. Anderson Cancer Center kabagg@mdanderson.org bmbroom@mdanderson.org 7

More information

Gene expression analysis. Biosciences 741: Genomics Fall, 2013 Week 5. Gene expression analysis

Gene expression analysis. Biosciences 741: Genomics Fall, 2013 Week 5. Gene expression analysis Gene expression analysis Biosciences 741: Genomics Fall, 2013 Week 5 Gene expression analysis From EST clusters to spotted cdna microarrays Long vs. short oligonucleotide microarrays vs. RT-PCR Methods

More information

GS Analysis of Microarray Data

GS Analysis of Microarray Data GS01 0163 Analysis of Microarray Data Keith Baggerly and Brad Broom Department of Bioinformatics and Computational Biology UT M. D. Anderson Cancer Center kabagg@mdanderson.org bmbroom@mdanderson.org 8

More information

Microarray Data Analysis in GeneSpring GX 11. Month ##, 200X

Microarray Data Analysis in GeneSpring GX 11. Month ##, 200X Microarray Data Analysis in GeneSpring GX 11 Month ##, 200X Agenda Genome Browser GO GSEA Pathway Analysis Network building Find significant pathways Extract relations via NLP Data Visualization Options

More information

ChIP-seq data analysis with Chipster. Eija Korpelainen CSC IT Center for Science, Finland

ChIP-seq data analysis with Chipster. Eija Korpelainen CSC IT Center for Science, Finland ChIP-seq data analysis with Chipster Eija Korpelainen CSC IT Center for Science, Finland chipster@csc.fi What will I learn? Short introduction to ChIP-seq Analyzing ChIP-seq data Central concepts Analysis

More information

Next-Generation Sequencing Gene Expression Analysis Using Agilent GeneSpring GX

Next-Generation Sequencing Gene Expression Analysis Using Agilent GeneSpring GX Next-Generation Sequencing Gene Expression Analysis Using Agilent GeneSpring GX Technical Overview Introduction RNA Sequencing (RNA-Seq) is one of the most commonly used next-generation sequencing (NGS)

More information

A WEB-BASED TOOL FOR GENOMIC FUNCTIONAL ANNOTATION, STATISTICAL ANALYSIS AND DATA MINING

A WEB-BASED TOOL FOR GENOMIC FUNCTIONAL ANNOTATION, STATISTICAL ANALYSIS AND DATA MINING A WEB-BASED TOOL FOR GENOMIC FUNCTIONAL ANNOTATION, STATISTICAL ANALYSIS AND DATA MINING D. Martucci a, F. Pinciroli a,b, M. Masseroli a a Dipartimento di Bioingegneria, Politecnico di Milano, Milano,

More information

Microarray Experiment Design

Microarray Experiment Design Microarray Experiment Design Samples used, extract preparation and labelling: AML blasts were isolated from bone marrow by centrifugation on a Ficoll- Hypaque gradient. Total RNA was extracted using TRIzol

More information

GS Analysis of Microarray Data

GS Analysis of Microarray Data GS01 0163 Analysis of Microarray Data Keith Baggerly and Kevin Coombes Department of Bioinformatics and Computational Biology UT M. D. Anderson Cancer Center kabagg@mdanderson.org kcoombes@mdanderson.org

More information

Gene expression: Microarray data analysis. Copyright notice. Outline: microarray data analysis. Schedule

Gene expression: Microarray data analysis. Copyright notice. Outline: microarray data analysis. Schedule Gene expression: Microarray data analysis Copyright notice Many of the images in this powerpoint presentation are from Bioinformatics and Functional Genomics by Jonathan Pevsner (ISBN -47-4-8). Copyright

More information

Biology 644: Bioinformatics

Biology 644: Bioinformatics Processes Activation Repression Initiation Elongation.... Processes Splicing Editing Degradation Translation.... Transcription Translation DNA Regulators DNA-Binding Transcription Factors Chromatin Remodelers....

More information

Introduction to Bioinformatics and Gene Expression Technologies

Introduction to Bioinformatics and Gene Expression Technologies Introduction to Bioinformatics and Gene Expression Technologies Utah State University Fall 2017 Statistical Bioinformatics (Biomedical Big Data) Notes 1 1 Vocabulary Gene: hereditary DNA sequence at a

More information

Introduction to Bioinformatics and Gene Expression Technologies

Introduction to Bioinformatics and Gene Expression Technologies Vocabulary Introduction to Bioinformatics and Gene Expression Technologies Utah State University Fall 2017 Statistical Bioinformatics (Biomedical Big Data) Notes 1 Gene: Genetics: Genome: Genomics: hereditary

More information

Gene-centered resources at NCBI

Gene-centered resources at NCBI COURSE OF BIOINFORMATICS a.a. 2014-2015 Gene-centered resources at NCBI We searched Accession Number: M60495 AT NCBI Nucleotide Gene has been implemented at NCBI to organize information about genes, serving

More information

Analyzing Gene Set Enrichment

Analyzing Gene Set Enrichment Analyzing Gene Set Enrichment BaRC Hot Topics June 20, 2016 Yanmei Huang Bioinformatics and Research Computing Whitehead Institute http://barc.wi.mit.edu/hot_topics/ Purpose of Gene Set Enrichment Analysis

More information

Analysis of Microarray Data

Analysis of Microarray Data Analysis of Microarray Data Lecture 1: Experimental Design and Data Normalization George Bell, Ph.D. Senior Bioinformatics Scientist Bioinformatics and Research Computing Whitehead Institute Outline Introduction

More information

Analysis of Gene Expression

Analysis of Gene Expression Analysis of Gene Expression An overview Setia Pramana Outline Biological background Central Dogma DNA Genes Genomics Microarrays Gene Expression data analysis pipeline What s next?? Central Dogma http://compbio.pbworks.com

More information

Experience with Weka by Predictive Classification on Gene-Expre

Experience with Weka by Predictive Classification on Gene-Expre Experience with Weka by Predictive Classification on Gene-Expression Data Czech Technical University in Prague Faculty of Electrical Engineering Department of Cybernetics Intelligent Data Analysis lab

More information

Bayesian Decomposition

Bayesian Decomposition Bayesian Decomposition Michael Ochs Making Proteins A Closer Look at Translation Post-Translational Modification RNA Splicing mirna Identifying Pathways A 1 2 3 B C D A B C D Bioinformatics www.promega.com

More information

Bioinformatics : Gene Expression Data Analysis

Bioinformatics : Gene Expression Data Analysis 05.12.03 Bioinformatics : Gene Expression Data Analysis Aidong Zhang Professor Computer Science and Engineering What is Bioinformatics Broad Definition The study of how information technologies are used

More information

Exploration and Analysis of DNA Microarray Data

Exploration and Analysis of DNA Microarray Data Exploration and Analysis of DNA Microarray Data Dhammika Amaratunga Senior Research Fellow in Nonclinical Biostatistics Johnson & Johnson Pharmaceutical Research & Development Javier Cabrera Associate

More information

Annotation and Function of Switch-like Genes in Health and Disease. A Thesis. Submitted to the Faculty. Drexel University. Adam M.

Annotation and Function of Switch-like Genes in Health and Disease. A Thesis. Submitted to the Faculty. Drexel University. Adam M. Annotation and Function of Switch-like Genes in Health and Disease A Thesis Submitted to the Faculty of Drexel University by Adam M. Ertel in partial fulfillment of the requirements for the degree of Doctor

More information

How to deal with the microarray results.

How to deal with the microarray results. How to deal with the microarray results. Britt Gabrielsson PhD RCEM, Div of metabolism and cardiovascular research Department of Medicine The Sahlgrenska Academy at Göteborg University and then we will

More information

Understanding protein lists from comparative proteomics studies

Understanding protein lists from comparative proteomics studies Understanding protein lists from comparative proteomics studies Bing Zhang, Ph.D. Department of Biomedical Informatics Vanderbilt University School of Medicine bing.zhang@vanderbilt.edu A typical comparative

More information

Lecture #1. Introduction to microarray technology

Lecture #1. Introduction to microarray technology Lecture #1 Introduction to microarray technology Outline General purpose Microarray assay concept Basic microarray experimental process cdna/two channel arrays Oligonucleotide arrays Exon arrays Comparing

More information

Ingenuity Pathway Analysis (IPA )

Ingenuity Pathway Analysis (IPA ) Ingenuity Pathway Analysis (IPA ) For the analysis and interpretation of omics data IPA is a web-based software application for the analysis, integration, and interpretation of data derived from omics

More information

Knowledge-Guided Analysis with KnowEnG Lab

Knowledge-Guided Analysis with KnowEnG Lab Han Sinha Song Weinshilboum Knowledge-Guided Analysis with KnowEnG Lab KnowEnG Center Powerpoint by Charles Blatti Knowledge-Guided Analysis KnowEnG Center 2017 1 Exercise In this exercise we will be doing

More information

Outline. Array platform considerations: Comparison between the technologies available in microarrays

Outline. Array platform considerations: Comparison between the technologies available in microarrays Microarray overview Outline Array platform considerations: Comparison between the technologies available in microarrays Differences in array fabrication Differences in array organization Applications of

More information

From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow

From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow Technical Overview Import VCF Introduction Next-generation sequencing (NGS) studies have created unanticipated challenges with

More information

Web-based tools for Bioinformatics; A (free) introduction to (freely available) NCBI, MUSC and World-wide.

Web-based tools for Bioinformatics; A (free) introduction to (freely available) NCBI, MUSC and World-wide. Page 1 of 24 Web-based tools for Bioinformatics; A (free) introduction to (freely available) NCBI, MUSC and World-wide. When and Where---Wednesdays at 1pm-2pmRoom 438 Library Admin Building Beginning September

More information

Types of Databases - By Scope

Types of Databases - By Scope Biological Databases Bioinformatics Workshop 2009 Chi-Cheng Lin, Ph.D. Department of Computer Science Winona State University clin@winona.edu Biological Databases Data Domains - By Scope - By Level of

More information

Product Applications for the Sequence Analysis Collection

Product Applications for the Sequence Analysis Collection Product Applications for the Sequence Analysis Collection Pipeline Pilot Contents Introduction... 1 Pipeline Pilot and Bioinformatics... 2 Sequence Searching with Profile HMM...2 Integrating Data in a

More information

Kyoto Encyclopedia of Genes and Genomes (KEGG)

Kyoto Encyclopedia of Genes and Genomes (KEGG) NPTEL Biotechnology -Systems Biology Kyoto Encyclopedia of Genes and Genomes (KEGG) Dr. M. Vijayalakshmi School of Chemical and Biotechnology SASTRA University Joint Initiative of IITs and IISc Funded

More information

Gene List Enrichment Analysis

Gene List Enrichment Analysis Outline Gene List Enrichment Analysis George Bell, Ph.D. BaRC Hot Topics March 16, 2010 Why do enrichment analysis? Main types Selecting or ranking genes Annotation sources Statistics Remaining issues

More information

Gene expression analysis. Gene expression analysis. Total RNA. Rare and abundant transcripts. Expression levels. Transcriptional output of the genome

Gene expression analysis. Gene expression analysis. Total RNA. Rare and abundant transcripts. Expression levels. Transcriptional output of the genome Gene expression analysis Gene expression analysis Biology of the transcriptome Observing the transcriptome Computational biology of gene expression sven.nelander@wlab.gu.se Recent examples Transcriptonal

More information

Gene Expression Data Analysis (I)

Gene Expression Data Analysis (I) Gene Expression Data Analysis (I) Bing Zhang Department of Biomedical Informatics Vanderbilt University bing.zhang@vanderbilt.edu Bioinformatics tasks Biological question Experiment design Microarray experiment

More information

Data Sheet. GeneChip Human Genome U133 Arrays

Data Sheet. GeneChip Human Genome U133 Arrays GeneChip Human Genome Arrays AFFYMETRIX PRODUCT FAMILY > ARRAYS > Data Sheet GeneChip Human Genome U133 Arrays The Most Comprehensive Coverage of the Human Genome in Two Flexible Formats: Single-array

More information

Microarray Data Analysis Workshop. Preprocessing and normalization A trailer show of the rest of the microarray world.

Microarray Data Analysis Workshop. Preprocessing and normalization A trailer show of the rest of the microarray world. Microarray Data Analysis Workshop MedVetNet Workshop, DTU 2008 Preprocessing and normalization A trailer show of the rest of the microarray world Carsten Friis Media glna tnra GlnA TnrA C2 glnr C3 C5 C6

More information

Figure S1. TT and MZ-CRC-1 cell viability and proliferation after Palbociclib treatment. Cell viability assays (A) and cell counting assays (B) were

Figure S1. TT and MZ-CRC-1 cell viability and proliferation after Palbociclib treatment. Cell viability assays (A) and cell counting assays (B) were Figure S1. TT and MZ-CRC-1 cell viability and proliferation after Palbociclib treatment. Cell viability assays (A) and cell counting assays (B) were performed with TT and MZ-CRC-1 cells treated with Palbociclib

More information

Bioinformatics for Proteomics. Ann Loraine

Bioinformatics for Proteomics. Ann Loraine Bioinformatics for Proteomics Ann Loraine aloraine@uab.edu What is bioinformatics? The science of collecting, processing, organizing, storing, analyzing, and mining biological information, especially data

More information

Microarray Technique. Some background. M. Nath

Microarray Technique. Some background. M. Nath Microarray Technique Some background M. Nath Outline Introduction Spotting Array Technique GeneChip Technique Data analysis Applications Conclusion Now Blind Guess? Functional Pathway Microarray Technique

More information

Defining an informativeness metric for clustering gene expression data

Defining an informativeness metric for clustering gene expression data Defining an informativeness metric for clustering gene expression data Author Mar, Jessica, Wells, Christine, Quackenbush, John Published 20 Journal Title Bioinformatics DOI https://doi.org/0.093/bioinformatics/btr074

More information

ELE4120 Bioinformatics. Tutorial 5

ELE4120 Bioinformatics. Tutorial 5 ELE4120 Bioinformatics Tutorial 5 1 1. Database Content GenBank RefSeq TPA UniProt 2. Database Searches 2 Databases A common situation for alignment is to search through a database to retrieve the similar

More information

Bioinformatics for Biologists

Bioinformatics for Biologists Bioinformatics for Biologists Microarray Data Analysis. Lecture 1. Fran Lewitter, Ph.D. Director Bioinformatics and Research Computing Whitehead Institute Outline Introduction Working with microarray data

More information

Array-Ready Oligo Set for the Rat Genome Version 3.0

Array-Ready Oligo Set for the Rat Genome Version 3.0 Array-Ready Oligo Set for the Rat Genome Version 3.0 We are pleased to announce Version 3.0 of the Rat Genome Oligo Set containing 26,962 longmer probes representing 22,012 genes and 27,044 gene transcripts.

More information

Agilent Genomics Software Future Directions

Agilent Genomics Software Future Directions Agilent Genomics Software Future Directions Michael Rosenberg, PhD Director, Genomics Software Agilent: A Focused Measurement Company Serving Diverse End Markets Electronic Measurement 2008 Revenue: $3.6

More information

Gene Regulation Solutions. Microarrays and Next-Generation Sequencing

Gene Regulation Solutions. Microarrays and Next-Generation Sequencing Gene Regulation Solutions Microarrays and Next-Generation Sequencing Gene Regulation Solutions The Microarrays Advantage Microarrays Lead the Industry in: Comprehensive Content SurePrint G3 Human Gene

More information

11/22/13. Proteomics, functional genomics, and systems biology. Biosciences 741: Genomics Fall, 2013 Week 11

11/22/13. Proteomics, functional genomics, and systems biology. Biosciences 741: Genomics Fall, 2013 Week 11 Proteomics, functional genomics, and systems biology Biosciences 741: Genomics Fall, 2013 Week 11 1 Figure 6.1 The future of genomics Functional Genomics The field of functional genomics represents the

More information

ArrayExpress and Gene Expression Atlas: Mining Functional Genomics data

ArrayExpress and Gene Expression Atlas: Mining Functional Genomics data ArrayExpress and Gene Expression Atlas: Mining Functional Genomics data Functional Genomics Team EBI-EMBL emma@ebi.ac.uk http://www.ebi.ac.uk/~emma/bcn_2012/ Talk structure Why do we need a database for

More information

SAS Microarray Solution for the Analysis of Microarray Data. Susanne Schwenke, Schering AG Dr. Richardus Vonk, Schering AG

SAS Microarray Solution for the Analysis of Microarray Data. Susanne Schwenke, Schering AG Dr. Richardus Vonk, Schering AG for the Analysis of Microarray Data Susanne Schwenke, Schering AG Dr. Richardus Vonk, Schering AG Overview Challenges in Microarray Data Analysis Software for Microarray Data Analysis SAS Scientific Discovery

More information

Final exam: Introduction to Bioinformatics and Genomics DUE: Friday June 29 th at 4:00 pm

Final exam: Introduction to Bioinformatics and Genomics DUE: Friday June 29 th at 4:00 pm Final exam: Introduction to Bioinformatics and Genomics DUE: Friday June 29 th at 4:00 pm Exam description: The purpose of this exam is for you to demonstrate your ability to use the different biomolecular

More information

March Product Release Information. About IPA. IPA Spring Release (2016): Release Notes. Table of Contents

March Product Release Information. About IPA. IPA Spring Release (2016): Release Notes. Table of Contents IPA Spring Release (2016): Release Notes Table of Contents IPA Spring Release (2016): Release Notes... 1 Product Release Information... 1 About IPA... 1 What s New in the IPA Spring Release (March 2016)...

More information

Analysis of Microarray Data

Analysis of Microarray Data Analysis of Microarray Data Lecture 1: Experimental Design and Data Normalization George Bell, Ph.D. Senior Bioinformatics Scientist Bioinformatics and Research Computing Whitehead Institute Outline Introduction

More information

2. Materials and Methods

2. Materials and Methods Identification of cancer-relevant Variations in a Novel Human Genome Sequence Robert Bruggner, Amir Ghazvinian 1, & Lekan Wang 1 CS229 Final Report, Fall 2009 1. Introduction Cancer affects people of all

More information

Identifying Candidate Informative Genes for Biomarker Prediction of Liver Cancer

Identifying Candidate Informative Genes for Biomarker Prediction of Liver Cancer Identifying Candidate Informative Genes for Biomarker Prediction of Liver Cancer Nagwan M. Abdel Samee 1, Nahed H. Solouma 2, Mahmoud Elhefnawy 3, Abdalla S. Ahmed 4, Yasser M. Kadah 5 1 Computer Engineering

More information

Supplementary Methods

Supplementary Methods Supplementary Methods Microarray Data Analysis Gene expression data were obtained by hybridising a total of 24 samples from 6 experimental groups (n=4 per group) to Illumina HumanHT-12 Expression BeadChips.

More information

Designing a Complex-Omics Experiments. Xiangqin Cui. Section on Statistical Genetics Department of Biostatistics University of Alabama at Birmingham

Designing a Complex-Omics Experiments. Xiangqin Cui. Section on Statistical Genetics Department of Biostatistics University of Alabama at Birmingham Designing a Complex-Omics Experiments Xiangqin Cui Section on Statistical Genetics Department of Biostatistics University of Alabama at Birmingham 1/7/2015 Some slides are from previous lectures of Grier

More information

EECS 730 Introduction to Bioinformatics Sequence Alignment. Luke Huan Electrical Engineering and Computer Science

EECS 730 Introduction to Bioinformatics Sequence Alignment. Luke Huan Electrical Engineering and Computer Science EECS 730 Introduction to Bioinformatics Sequence Alignment Luke Huan Electrical Engineering and Computer Science http://people.eecs.ku.edu/~jhuan/ Database What is database An organized set of data Can

More information

Gene-Level Analysis of Exon Array Data using Partek Genomics Suite 6.6

Gene-Level Analysis of Exon Array Data using Partek Genomics Suite 6.6 Gene-Level Analysis of Exon Array Data using Partek Genomics Suite 6.6 Overview This tutorial will demonstrate how to: Summarize core exon-level data to produce gene-level data Perform exploratory analysis

More information

Analysis of a Tiling Regulation Study in Partek Genomics Suite 6.6

Analysis of a Tiling Regulation Study in Partek Genomics Suite 6.6 Analysis of a Tiling Regulation Study in Partek Genomics Suite 6.6 The example data set used in this tutorial consists of 6 technical replicates from the same human cell line, 3 are SP1 treated, and 3

More information

Introduction to RNA-Seq in GeneSpring NGS Software

Introduction to RNA-Seq in GeneSpring NGS Software Introduction to RNA-Seq in GeneSpring NGS Software Dipa Roy Choudhury, Ph.D. Strand Scientific Intelligence and Agilent Technologies Learn more at www.genespring.com Introduction to RNA-Seq In a few years,

More information

EECS730: Introduction to Bioinformatics

EECS730: Introduction to Bioinformatics EECS730: Introduction to Bioinformatics Lecture 14: Microarray Some slides were adapted from Dr. Luke Huan (University of Kansas), Dr. Shaojie Zhang (University of Central Florida), and Dr. Dong Xu and

More information

What is Bioinformatics?

What is Bioinformatics? What is Bioinformatics? Bioinformatics is the field of science in which biology, computer science, and information technology merge to form a single discipline. - NCBI The ultimate goal of the field is

More information

6. GENE EXPRESSION ANALYSIS MICROARRAYS

6. GENE EXPRESSION ANALYSIS MICROARRAYS 6. GENE EXPRESSION ANALYSIS MICROARRAYS BIOINFORMATICS COURSE MTAT.03.239 16.10.2013 GENE EXPRESSION ANALYSIS MICROARRAYS Slides adapted from Konstantin Tretyakov s 2011/2012 and Priit Adlers 2010/2011

More information

Transcriptome Assembly, Functional Annotation (and a few other related thoughts)

Transcriptome Assembly, Functional Annotation (and a few other related thoughts) Transcriptome Assembly, Functional Annotation (and a few other related thoughts) Monica Britton, Ph.D. Sr. Bioinformatics Analyst June 23, 2017 Differential Gene Expression Generalized Workflow File Types

More information