The microarray data analysis process - from raw data to biological significance. N. Eric Olson
|
|
- Sharon Copeland
- 5 years ago
- Views:
Transcription
1 The microarray data analysis process - from raw data to biological significance N. Eric Olson eric@genesifter.net
2 Microarrays 60,000 56,000 Data Points per Experiment 50,000 40,000 30,000 20,000 10, c c Today (Northern Blot) (PCR) (Microarray) Microarrays Allow Measurement of Expression Profiles for Entire Genomes Rather than Individual Genes
3 The impact of microarrays in biological research Data Data Experiments Experiments Traditional Biological Research Microarrays in Biological Research
4 Microarray Challenges 80% 70% 60% 50% 40% 30% 20% 10% 0% Labs Responding Throughput Reliability Software Bioinformatics Funding Commitments
5 Microarray Analysis
6 Microarray Data Analysis Con CHP Exp CHP 2 group 3 replicates U133 Plus 2 (56K Genes) 2,000,000+ data points Normalization Comparison statistics Multiple Testing Correction Clustering? Differentially Expressed Genes 1000s of genes UniGene LocusLink Gene Ontologies Identification of gene families and pathways 10s to 100s of genes Individual genes of interest
7 The Microarray Data Analysis Process Experimental Design Number of groups, factors, replicates Data management Data, sample annotation, gene annotation, databases Differential Expression Comparison statistics, Correction for multiple testing, Clustering Biological significance Individual genes, Biological themes Platform Selection One-color, two-color, platform comparisons System access Ease of you, accessibility Making data public and using public data MIAME, Journals, GEO, meta-analysis
8 The Microarray Data Analysis Process Experimental Design Number of groups, factors, replicates Data management Data, sample annotation, gene annotation, databases Differential Expression Comparison statistics, Correction for multiple testing, Clustering Biological significance Individual genes, Biological themes Platform Selection One-color, two-color, platform comparisons System access Ease of you, accessibility Making data public and using public data MIAME, Journals, GEO, meta-analysis
9 Experiment Design Type of experiment Two groups Normal vs. cancer Control vs. treated Three or more groups, single factor Time series Dose response Multiple treatment Three or more groups, multiple factors Time series with control and treated cells The type of experiment and number of groups and factors will determine the statistical methods needed to detect differential expression Replicates The more the better, but at least 3 Biological better than technical Rigorous statistical inferences cannot be made with a sample size of one. The more replicates, the stronger the inference. Pavlidis P, Li Q, Noble WS. The effect of replication on gene expression microarray experiments. Bioinformatics Sep 1;19(13): Experimental Design and Other Issues in Microarray Studies - Kathleen Kerr -
10 Differential Expression The fundamental goal of microarray experiments is to identify genes that are differentially expressed in the conditions being studied. Comparison statistics can be used to help identify differentially expressed genes and cluster analysis can be used to identify patterns of gene expression and to segregate a subset of genes based on these patterns. Statistical Significance Fold change Fold change does not address the reproducibility of the observed difference and cannot be used to determine the statistical significance. Comparison statistics 2 group t-test, Welch s t-test, Wilcoxon Rank Sum, 3 or more groups, single factor One-way ANOVA, Kruskal-Wallis 4 or more groups, multiple factors Two-way ANOVA Comparison tests require replicates and use the variability within the replicates to assign a confidence level as to whether the gene is differentially expressed. Supporting material - Draghici S. (2002) Statistical intelligence: effective analysis of high-density microarray data. Drug Discov Today, 7(11 Suppl).: S55-63.
11 t-test for comparison of two groups Calculate t statistic t = difference between groups difference within groups = Mean grp 1 Mean grp 2 ((s 12 /n 1 ) + (s 22 /n 2 )) 1/2 s = variance n = size of sample Determine confidence level for t (probability that t could occur by chance) df = n 1 + n 2-2 The larger the difference between the groups and the lower the variance the bigger t will be and the lower p will be
12 Differential Expression 2 groups, 4 replicates each Mean, standard deviation, fold change and p-value calculated 8 Mean Signal Exp Con Gene 1 Fold Change = 5.3 p = 0.19 Mean Signal Exp Con Gene 2 Fold Change = 5.3 p = 0.03 Fold change vs. p value
13 Analysis of Variance (ANOVA) Like t-test, identifies genes with large differences between groups and small differences within groups For use with 3 or more groups One-way and two-way One-way examines effects of one factor on gene expression Two-factor can examine effects of two factors on gene expression as well as the interaction of the two factors Pavlidis P. Using ANOVA for gene selection from microarray studies of the nervous system. Methods Dec;31(4): Glantz S. Primer of Biostatistics. 5 th Edition. McGraw-Hill. Glantz S, Slinker B. Primer of Regression and Analysis of Variance. McGraw-Hill.
14 Two-way ANOVA Data: Sex differences in salivary glands (CodeLink Ms 10K Bioarray) M Sex F Strain effect Gland Parotid Sublingual Par F Par M Sub F Sub M Gene expression pattern Sex effect Interaction Strain and sex effect (no Interaction)
15 Two-way ANOVA compared to t-test Data: Sex differences in salivary glands (CodeLink Ms 10K Bioarray) M Sex F Gland Parotid Sublingual Two-way t-test Sex Differences Pavlidis P, Noble WS. Analysis of strain and regional variation in gene expression in mouse brain. Genome Biol. 2001;2(10):RESEARCH0042.
16 Differential Expression Correction for multiple testing- Methods for adjusting the p-value from a comparison test based on the number of tests performed. These adjustments help to reduce the number of false positives in an experiment. FWER : Family Wise Error Rate (FWER) corrections adjust the p-value so that it reflects the chance of at least 1 false positive being found in the list. Bonferonni, Holm, W & Y MaxT FDR : False Discovery Rate corrections (FDR) adjust the p-value so that it reflects the frequency of false positives in the list. Benjamini and Hochberg, SAM The FWER methods are more conservative, but the FDR methods are usually acceptable for discovery experiments, i.e. where a small number of false positives is acceptable Dudoit, S., et al. (2003) Multiple hypothesis testing in microarray experiments. Statistical Science 18(1): Reiner, A., et al. (2003) Identifying differentially expressed genes using false discovery rate controlling procedures. Bioinformatics 19(3):
17 Multiple Hypothesis Testing in Microarray Experiments Per comparison error rate (PCER) the probability of error for each comparison Family-wise error rate (FWER) the probability of at least one error for all comparisons False discovery rate (FWER) the expected proportion of errors among your results An error means a false positive Example : 1000 genes and 50 differentially expressed using cutoff of 5% PCER - using a 5% PCER means a 5% chance of error for each comparison, so perhaps 50 errors for 1000 comparisons. This is not acceptable, you don t have confidence than any of your results are real (not errors). FWER using 5% FWER means there is a 5% chance that you have at least 1 error. This is very good and would be a very conservative requirement, you are confident that all of your results are real. FDR using 5% FDR you would expect 2.5 errors (5% of 50). This is probably acceptable, you are confident that most of your results are real.
18 Correction Example CodeLink Ms 10K BioArray Lacrimal + Placebo Lacrimal + Androgen 3 biological replicates 9982 Comparisons t-test 5% PCER : 2458 genes (estimate 499 errors) 5% FWER: 19 genes (5% chance of 1 error) 5% FDR: 904 genes (estimate 45 errors)
19 Identification and partitioning of expression patterns Cluster Analysis - clustering methods are descriptive or exploratory tools that can be used to identify groups within complex datasets. Visualization Methods such as hierarchical clustering can be used to help identify patterns in a large dataset. Hierarchical methods provide a hierarchy of clusters, from the smallest, where all objects (gene expression profiles) are in one cluster, through to the largest set, where each observation is in its own cluster. Partitioning this type of cluster analysis can be used to separate data into discrete groups or clusters. Partitioning methods partition the data (list of genes) into a prespecified number (K) of mutually exclusive groups based on feature vector (expression profile). - K-means -PAM (Partitioning around medoids) Cluster analysis is used to identify patterns of gene expression within large datasets and to segregate those genes based on these patterns. Quackenbush J. Computational analysis of microarray data. Nat Rev Genet Jun;2(6): Review. Kaufman L, Rousseeuw PJ: Finding Groups in Data: An Introduction to Cluster Analysis. New York: Wiley; 1990.
20 Identification and partitioning of expression patterns Hierarchical Partitioning Cluster analysis 1846 differentially expressed genes from FVB heart development time series.
21 Differential Expression - Gene Lists
22 Biological Significance Gene Annotation Sources UniGene - organizes GenBank sequences into a non-redundant set of gene-oriented clusters. Gene titles are assigned to the clusters and these titles are commonly used by researchers to refer to that particular gene. LocusLink (Entrez Gene) - provides a single query interface to curated sequence and descriptive information, including function, about genes. Gene Ontologies The Gene Ontology Consortium provides controlled vocabularies for the description of the molecular function, biological process and cellular component of gene products, that can be used by databases such as Entrez Gene. KEGG - Kyoto Encyclopedia of Genes and Genomes provides information about both regulatory and metabolic pathways for genes. Reference Sequences- The NCBI Reference Sequence project (RefSeq) provides reference sequences for both the mrna and protein products of included genes.
23 Gene annotation for individual genes
24 Ontology reports identify biological themes
25 Ontology Report : z-score R = total number of genes meeting selection criteria N = total number of genes measured r = number of genes meeting selection criteria with the specified GO term n = total number of genes measured with the specific GO term Reference: Scott W Doniger, Nathan Salomonis, Kam D Dahlquist, Karen Vranizan, Steven C Lawlor and Bruce R Conklin; MAPPFinder: usig Gene Ontology and GenMAPP to create a global gene-expression profile from microarray data, Genome Biology 2003, 4:R7
26 Identify biological themes associated with gene clusters
27 Workflow Examples 2 groups (Androgen-dependent and androgenindependent prostate cancer) 5 groups, single factor (Drosophila Innate Immune Response Time Series) 2 groups, two factors (Parotid and sublingual gland, Male and Female) t-test BH (FDR) Up regulated Down regulated Gene Lists One-way ANOVA BH (FDR) Clustering Gene Lists Two-way ANOVA BH (FDR) Clustering Gene Lists Individual genes of interest Biological themes (Pathways, molecular functions, etc.)
28 The Microarray Data Analysis Process Experimental Design Number of groups, factors, replicates Data management Data, sample annotation, gene annotation, databases Differential Expression Comparison statistics, Correction for multiple testing, Clustering Biological significance Individual genes, Biological themes Platform Selection One-color, two-color, platform comparisons System access Ease of you, accessibility Making data public and using public data MIAME, Journals, GEO, meta-analysis
29 Platform Selection The microarray platform used will have a substantial effect on the data analysis process. Two color arrays have several options for how the experiment can be designed. The second channel can either be part of the experiment or it can be an external reference RNA that is strictly used for the creation of a ratio. For single channel arrays the sample is hybridized to one array and the signal intensity is used for measuring gene expression. Microarray Platforms Affymetrix GeneChip: One-color oligo arrays, 22 probes/gene GEHC CodeLink Bioarray: One-color oligo array, 1 probe/gene Agilent Arrays: Two-color oligo array, 1 probe/gene Illumina bead arrays: One-color oligo bead array, 1 probe/gene ABI Arrays: One-color oligo array, 1 probe/gene Custom cdna or Oligo: usually two-color, 1 probe/gene Experimental design and analysis is generally more straightforward with one-color microarrays.
30 Platform Comparisons Huntington s Blood vs. Healthy (GSE1767 & GSE1751) Same samples on Affymetrix and CodeLink Genes 65% overlap Biological themes 100% top 10 Normal Testis vs. Seminoma (GSE3218 & GSE1818) Independent samples, Affymetrix, Agilent Genes 55% overlap Biological themes - 100% top 10 Additional references: Barnes M, Freudenberg J, Thompson S, Aronow B, Pavlidis P. Experimental comparison and cross-validation of the Affymetrix and Illumina gene expression analysis platforms. Nucleic Acids Res Oct 19;33(18): Print Larkin JE, Frank BC, Gavras H, Sultana R, Quackenbush J. Independence and reproducibility across microarray platforms. Nat Methods May;2(5): Epub 2005 Apr 21. Irizarry et al. Multiple-laboratory comparison of microarray platforms. Nat Methods May;2(5): Jun;2(6):477.
31 System Access System access, defined both in terms of physical access and the usability of the microarray data analysis system, is critical for the microarray data analysis process; the people who need to work with the data must be able to access the system when needed and they must be able to make full use of it. Usability - Intuitive interface Special training The usability of the system determines the amount of time spent learning the system and who will be able to work with the data. Accessibility Single-user desktop Single-user server Web-based The type of system used will determine who can access it, how many people can access it and the locations from which it can be accessed.
32 System Access Web-based systems Thin client runs through any browser on any platform IE, FireFox, Netscape, Opera, etc. Mac, PC, Linux Secure access from any computer on the internet Any computer in the lab Access from home or traveling Facilitates sharing data and collaboration Flexible data loading load from one location, analyze from another Live presentations of data
33 The Microarray Data Analysis Process Experimental Design Number of groups, factors, replicates Data management Data, sample annotation, gene annotation, databases Differential Expression Comparison statistics, Correction for multiple testing, Clustering Biological significance Individual genes, Biological themes Platform Selection One-color, two-color, platform comparisons System access Ease of you, accessibility Making data public and using public data MIAME, Journals, GEO, meta-analysis
34 The Gene Expression Omnibus (GEO) Gene expression data repository (mostly microarrays) Over 3000 data sets All array platforms represented Searchable by Platform Species Experiment annotation Downloadable data Using the Gene Expression Omnibus (
35 The Gene Expression Omnibus (GEO) Search by experiment annotation (or species or platform).
36 The Gene Expression Omnibus (GEO) 26 results
37 Androgen-independent prostate cancer
38 Androgen-independent prostate cancer Sample information Author (submitter) information Export Data GEO format Raw data (CEL files) for many
39 GeneSifter - Analysis Examples 2 groups (ADPC vs AIPC) Data Upload Affymetrix 3 + groups Differential expression Fold change Quality Wilcoxon rank sum test False discovery rate Differential expression Fold change Quality ANOVA False discovery rate Visualization Hierarchical clustering PCA Partitioning PAM Silhouettes Biological significance Gene Annotation Ontology report Pathway report
40 GeneSifter Microarray Data Analysis Accessibility Web-based Secure Data management Data Annotation (MIAME) Multiple upload tools CodeLink Affymetrix Illumina Agilent Custom Differential Expression - Powerful, accessible tools for determining Statistical Significance R based statistics Bioconductor Comparison Tests t-test, Welch s t-test, Wilcoxon Rank sum test, ANOVA, Correction for Multiple Testing Bonferroni, Holm, Westfall and Young maxt, Benjamini and Hochberg Unsupervised Clustering PAM, CLARA, Hierarchical clustering Silhouettes
41 GeneSifter Microarray Data Analysis Integrated tools for determining Biological Significance One Click Gene Summary Ontology Report Pathway Report Search by ontology terms Search by KEGG terms or Chromosome
42 The GeneSifter Data Center Free resource Training Research Publishing 6 areas Cardiovascular Cancer Endocrinology Neuroscience Immunology Oral Biology Access to : Data Analysis summary Tutorials WebEx
43 The GeneSifter Data Center
44 The Data Affymetrix GeneChip Human Genome U133A Tumor tissue from 20 primary prostate cancer tumors 10 Androgen Dependent 10 Androgen Independent CEL Land CHP data available CEL files loaded into GeneSifter and processed with GC-RMA Best CJ, Gillespie JW, Yi Y, Chandramouli GV, Perlmutter MA, Gathright Y, Erickson HS, Georgevich L, Tangrea MA, Duray PH, Gonzalez S, Velasco A, Linehan, WM, Matusik RJ, Price DK, Figg WD, Emmert-Buck MR, Chuaqui RF. Molecular alterations in primary prostate cancer after androgen ablation therapy. Clin Cancer Res Oct 1;11(19 Pt 1):
45 Pairwise Analysis Select group 1 10 ADPC Select group 2 10 AIPC Already normalized (Quantile, GC-RMA) Wilcoxon rank sum test Benjamini and Hochberg (FDR) Data already log transformed (GC-RMA)
46 Pairwise Analysis Gene List
47 One-Click Gene Summary
48 Pairwise Analysis Gene List
49 Ontology Report
50 Z-score Report
51 Z-score Report
52 KEGG Report
53 Pairwise Analysis - Summary Androgen-Dependent Prostate Cancer compared to Androgen-Independent Prostate Cancer 10 Androgen Dependent 10 Androgen Independent Wilcoxon, Benjamini and Hochberg (FDR) ~21,000 transcripts 468 genes Pattern selection 222 increased In AIPC 246 decreased In AIPC Z-scores Biological processes Cell adhesion (15) Cell cycle (19) RNA processing (13) Cellular localization (12) KEGG Apoptosis (3) Biological processes Protein biosynthesis (25) Regulation of apoptosis (11) KEGG Ribosome (15) Oxidative phosphorylation (7)
54 GeneSifter - Analysis Examples 2 groups (ADPC vs AIPC) Data Upload Affymetrix 3 + groups Differential expression Fold change Quality Wilcoxon rank sum test False discovery rate Differential expression Fold change Quality ANOVA False discovery rate Visualization Hierarchical clustering PCA Partitioning PAM Silhouettes Biological significance Gene Annotation Ontology report Pathway report
55 Metastatic PC Comparison Affymetrix U133 Plus 2 4 Benign prostate 5 Localized tumors 4 Metastatic (hormone refractory)
56 Project Metastatic PC Comparison Clustering using 486 AIPC gene set
57 Project Metastatic PC Comparison Clustering using AIPC Ribosome genes
58 Project Metastatic PC Comparison Clustering using AIPC Cell adhesion genes
59 The Microarray Data Analysis Process Experimental Design Number of groups, factors, replicates Data management Data, sample annotation, gene annotation, databases Differential Expression Comparison statistics, Correction for multiple testing, Clustering Biological significance Individual genes, Biological themes Platform Selection One-color, two-color, platform comparisons System access Ease of you, accessibility Making data public and using public data MIAME, Journals, GEO, meta-analysis
60 Thank You Monthly Webinar Series Archived - Microarray analysis of gene expression in androgen-independent prostate cancer Archived - Microarray analysis of gene expression in male germ cell tumors Archived - Microarray Analysis of Gene Expression in Huntington's Disease Peripheral Blood - a Platform Comparison
Using 2-way ANOVA to dissect the immune response to hookworm infection in mouse lung
Using 2-way ANOVA to dissect the immune response to hookworm infection in mouse lung Using 2-way ANOVA to dissect the immune response to hookworm infection in mouse lung General microarry data analysis
More informationUsing 2-way ANOVA to dissect gene expression following myocardial infarction in mice
Using 2-way ANOVA to dissect gene expression following myocardial infarction in mice Thank you for waiting. The presentation will be starting in a few minutes at 9AM Pacific Daylight Time. During this
More informationSeven Keys to Successful Microarray Data Analysis
Seven Keys to Successful Microarray Data Analysis Experiment Design Platform Selection Data Management System Access Differential Expression Biological Significance Data Publication Type of experiment
More informationMicroarray Analysis of Gene Expression in Huntington's Disease Peripheral Blood - a Platform Comparison
Microarray Analysis of Gene Expression in Huntington's Disease Peripheral Blood - a Platform Comparison Thank you for waiting. The presentation will be starting in a few minutes at 9AM Pacific Daylight
More informationMicroarray Analysis of Gene Expression in Huntington's Disease Peripheral Blood - a Platform Comparison. CodeLink compatible
Microarray Analysis of Gene Expression in Huntington's Disease Peripheral Blood - a Platform Comparison CodeLink compatible Microarray Analysis of Gene Expression in Huntington's Disease Peripheral Blood
More informationMicroarray analysis of gene expression in male germ cell tumors
Microarray analysis of gene expression in male germ cell tumors Microarray analysis of gene expression in male germ cell tumors General microarry data analysis workflow From raw data to biological significance
More informationThe microarray data analysis process - from raw data to biological significance
The microarray data analysis process - from raw data to biological significance Thank you for waiting. The presentation will be starting in a few minutes at 6AM Pacific Daylight Time. During this webinar
More informationIdentification of biological themes in microarray data from a mouse heart development time series using GeneSifter
Identification of biological themes in microarray data from a mouse heart development time series using GeneSifter VizX Labs, LLC Seattle, WA 98119 Abstract Oligonucleotide microarrays were used to study
More informationIntroduction to Microarray Technique, Data Analysis, Databases Maryam Abedi PhD student of Medical Genetics
Introduction to Microarray Technique, Data Analysis, Databases Maryam Abedi PhD student of Medical Genetics abedi777@ymail.com Outlines Technology Basic concepts Data analysis Printed Microarrays In Situ-Synthesized
More informationThe Microarray Data Analysis Process: From Raw Data to Biological Significance
NeuroRx : The Journal of the American Society for Experimental NeuroTherapeutics The Microarray Data Analysis Process: From Raw Data to Biological Significance N. Eric Olson VizX Labs, Seattle, Washington
More informationMicroarray Informatics
Microarray Informatics Donald Dunbar MSc Seminar 4 th February 2009 Aims To give a biologistʼs view of microarray experiments To explain the technologies involved To describe typical microarray experiments
More informationMicroarray Informatics
Microarray Informatics Donald Dunbar MSc Seminar 31 st January 2007 Aims To give a biologist s view of microarray experiments To explain the technologies involved To describe typical microarray experiments
More informationGene Expression Data Analysis
Gene Expression Data Analysis Bing Zhang Department of Biomedical Informatics Vanderbilt University bing.zhang@vanderbilt.edu BMIF 310, Fall 2009 Gene expression technologies (summary) Hybridization-based
More informationIntegrative Genomics 1a. Introduction
2016 Course Outline Integrative Genomics 1a. Introduction ggibson.gt@gmail.com http://www.cig.gatech.edu 1a. Experimental Design and Hypothesis Testing (GG) 1b. Normalization (GG) 2a. RNASeq (MI) 2b. Clustering
More informationAnalysis of Microarray Data
Analysis of Microarray Data Lecture 3: Visualization and Functional Analysis George Bell, Ph.D. Bioinformatics Scientist Bioinformatics and Research Computing Whitehead Institute Outline Review Visualizing
More informationAnalysis of Microarray Data
Analysis of Microarray Data Lecture 3: Visualization and Functional Analysis George Bell, Ph.D. Senior Bioinformatics Scientist Bioinformatics and Research Computing Whitehead Institute Outline Review
More informationDeakin Research Online
Deakin Research Online This is the published version: Church, Philip, Goscinski, Andrzej, Wong, Adam and Lefevre, Christophe 2011, Simplifying gene expression microarray comparative analysis., in BIOCOM
More informationOutline. Analysis of Microarray Data. Most important design question. General experimental issues
Outline Analysis of Microarray Data Lecture 1: Experimental Design and Data Normalization Introduction to microarrays Experimental design Data normalization Other data transformation Exercises George Bell,
More informationPATHWAY ANALYSIS. Susan LM Coort, PhD Department of Bioinformatics, Maastricht University. PET course: Toxicogenomics
PATHWAY ANALYSIS Susan LM Coort, PhD Department of Bioinformatics, Maastricht University 1 Data analysis overview Microarray scans Slide based on a slide from J. Pennings, RIVM, NL Image analysis Preprocessing
More informationMeasuring and Understanding Gene Expression
Measuring and Understanding Gene Expression Dr. Lars Eijssen Dept. Of Bioinformatics BiGCaT Sciences programme 2014 Why are genes interesting? TRANSCRIPTION Genome Genomics Transcriptome Transcriptomics
More informationAgilent GeneSpring GX 10: Beyond. Pam Tangvoranuntakul Product Manager, GeneSpring October 1, 2008
Agilent GeneSpring GX 10: Gene Expression and Beyond Pam Tangvoranuntakul Product Manager, GeneSpring October 1, 2008 GeneSpring GX 10 in the News Our Goals for GeneSpring GX 10 Goal 1: Bring back GeneSpring
More informationCodeLink Human Whole Genome Bioarray
CodeLink Human Whole Genome Bioarray 55,000 human gene targets on a single bioarray The CodeLink Human Whole Genome Bioarray comprises one of the most comprehensive coverages of the human genome, as it
More informationGene expression: Microarray data analysis. Copyright notice. Outline: microarray data analysis. Schedule
Gene expression: Microarray data analysis Copyright notice Many of the images in this powerpoint presentation are from Bioinformatics and Functional Genomics by Jonathan Pevsner (ISBN -47-4-8). Copyright
More informationAnnotation. (Chapter 8)
Annotation (Chapter 8) Genome annotation Genome annotation is the process of attaching biological information to sequences: identify elements on the genome attach biological information to elements store
More informationWeb-based tools for Bioinformatics; A (free) introduction to (freely available) NCBI, MUSC and World-wide.
Page 1 of 24 Web-based tools for Bioinformatics; A (free) introduction to (freely available) NCBI, MUSC and World-wide. When and Where---Wednesdays at 1pm-2pmRoom 438 Library Admin Building Beginning September
More informationBioconductor. Course in Practical Microarray Analysis Berlin Slides 2002 Sandrine Dudoit, Robert Gentleman. Adapted by Wolfgang Huber.
Bioconductor Course in Practical Microarray Analysis Berlin 27.-30.1.2003 Slides 2002 Sandrine Dudoit, Robert Gentleman. Adapted by Wolfgang Huber. Statistical computing Applications statistical design
More informationMicroarray Data Analysis Workshop. Preprocessing and normalization A trailer show of the rest of the microarray world.
Microarray Data Analysis Workshop MedVetNet Workshop, DTU 2008 Preprocessing and normalization A trailer show of the rest of the microarray world Carsten Friis Media glna tnra GlnA TnrA C2 glnr C3 C5 C6
More informationGS Analysis of Microarray Data
GS01 0163 Analysis of Microarray Data Keith Baggerly and Brad Broom Department of Bioinformatics and Computational Biology UT M. D. Anderson Cancer Center kabagg@mdanderson.org bmbroom@mdanderson.org 7
More informationGene expression analysis. Biosciences 741: Genomics Fall, 2013 Week 5. Gene expression analysis
Gene expression analysis Biosciences 741: Genomics Fall, 2013 Week 5 Gene expression analysis From EST clusters to spotted cdna microarrays Long vs. short oligonucleotide microarrays vs. RT-PCR Methods
More informationAnalysis of Microarray Data
Analysis of Microarray Data Lecture 1: Experimental Design and Data Normalization George Bell, Ph.D. Senior Bioinformatics Scientist Bioinformatics and Research Computing Whitehead Institute Outline Introduction
More informationStandard Data Analysis Report Agilent Gene Expression Service
Standard Data Analysis Report Agilent Gene Expression Service Experiment: S534662 Date: 2011-01-01 Prepared for: Dr. Researcher Genomic Sciences Lab Prepared by S534662 Standard Data Analysis Report 2011-01-01
More informationLecture #1. Introduction to microarray technology
Lecture #1 Introduction to microarray technology Outline General purpose Microarray assay concept Basic microarray experimental process cdna/two channel arrays Oligonucleotide arrays Exon arrays Comparing
More informationGS Analysis of Microarray Data
GS01 0163 Analysis of Microarray Data Keith Baggerly and Brad Broom Department of Bioinformatics and Computational Biology UT M. D. Anderson Cancer Center kabagg@mdanderson.org bmbroom@mdanderson.org 8
More informationAnalysis of Microarray Data
Analysis of Microarray Data Lecture 1: Experimental Design and Data Normalization George Bell, Ph.D. Senior Bioinformatics Scientist Bioinformatics and Research Computing Whitehead Institute Outline Introduction
More informationThe essentials of microarray data analysis
The essentials of microarray data analysis (from a complete novice) Thanks to Rafael Irizarry for the slides! Outline Experimental design Take logs! Pre-processing: affy chips and 2-color arrays Clustering
More informationWhat is Bioinformatics?
What is Bioinformatics? Bioinformatics is the field of science in which biology, computer science, and information technology merge to form a single discipline. - NCBI The ultimate goal of the field is
More information6. GENE EXPRESSION ANALYSIS MICROARRAYS
6. GENE EXPRESSION ANALYSIS MICROARRAYS BIOINFORMATICS COURSE MTAT.03.239 16.10.2013 GENE EXPRESSION ANALYSIS MICROARRAYS Slides adapted from Konstantin Tretyakov s 2011/2012 and Priit Adlers 2010/2011
More informationGS Analysis of Microarray Data
GS01 0163 Analysis of Microarray Data Keith Baggerly and Kevin Coombes Department of Bioinformatics and Computational Biology UT M. D. Anderson Cancer Center kabagg@mdanderson.org kcoombes@mdanderson.org
More informationGene expression analysis: Introduction to microarrays
Gene expression analysis: Introduction to microarrays Adam Ameur The Linnaeus Centre for Bioinformatics, Uppsala University February 15, 2006 Overview Introduction Part I: How a microarray experiment is
More informationHow to deal with the microarray results.
How to deal with the microarray results. Britt Gabrielsson PhD RCEM, Div of metabolism and cardiovascular research Department of Medicine The Sahlgrenska Academy at Göteborg University and then we will
More informationAGILENT S BIOINFORMATICS ANALYSIS SOFTWARE
ACCELERATING PROGRESS IS IN OUR GENES AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE GENESPRING GENE EXPRESSION (GX) MASS PROFILER PROFESSIONAL (MPP) PATHWAY ARCHITECT (PA) See Deeper. Reach Further. BIOINFORMATICS
More informationIntroduction to Bioinformatics and Gene Expression Technology
Vocabulary Introduction to Bioinformatics and Gene Expression Technology Utah State University Spring 2014 STAT 5570: Statistical Bioinformatics Notes 1.1 Gene: Genetics: Genome: Genomics: hereditary DNA
More informationMicroarray Experiment Design
Microarray Experiment Design Samples used, extract preparation and labelling: AML blasts were isolated from bone marrow by centrifugation on a Ficoll- Hypaque gradient. Total RNA was extracted using TRIzol
More informationIntroduction to gene expression microarray data analysis
Introduction to gene expression microarray data analysis Outline Brief introduction: Technology and data. Statistical challenges in data analysis. Preprocessing data normalization and transformation. Useful
More informationUnderstanding protein lists from proteomics studies. Bing Zhang Department of Biomedical Informatics Vanderbilt University
Understanding protein lists from proteomics studies Bing Zhang Department of Biomedical Informatics Vanderbilt University bing.zhang@vanderbilt.edu A typical comparative shotgun proteomics study IPI00375843
More informationadvanced analysis of gene expression microarray data aidong zhang World Scientific State University of New York at Buffalo, USA
advanced analysis of gene expression microarray data aidong zhang State University of New York at Buffalo, USA World Scientific NEW JERSEY LONDON SINGAPORE BEIJING SHANGHAI HONG KONG TAIPEI CHENNAI Contents
More informationNormalization. Getting the numbers comparable. DNA Microarray Bioinformatics - #27612
Normalization Getting the numbers comparable The DNA Array Analysis Pipeline Question Experimental Design Array design Probe design Sample Preparation Hybridization Buy Chip/Array Image analysis Expression
More informationA WEB-BASED TOOL FOR GENOMIC FUNCTIONAL ANNOTATION, STATISTICAL ANALYSIS AND DATA MINING
A WEB-BASED TOOL FOR GENOMIC FUNCTIONAL ANNOTATION, STATISTICAL ANALYSIS AND DATA MINING D. Martucci a, F. Pinciroli a,b, M. Masseroli a a Dipartimento di Bioingegneria, Politecnico di Milano, Milano,
More informationTotal RNA was isolated using the TRIZOL reagent according to the manufacturer s
RNA extraction Total RNA was isolated using the TRIZOL reagent according to the manufacturer s instructions (Invitrogen, Carlsbad, CA). RNA integrity for each sample was confirmed with the Agilent 2100
More informationBioinformatics for Biologists
Bioinformatics for Biologists Microarray Data Analysis. Lecture 1. Fran Lewitter, Ph.D. Director Bioinformatics and Research Computing Whitehead Institute Outline Introduction Working with microarray data
More informationAnnotation and Function of Switch-like Genes in Health and Disease. A Thesis. Submitted to the Faculty. Drexel University. Adam M.
Annotation and Function of Switch-like Genes in Health and Disease A Thesis Submitted to the Faculty of Drexel University by Adam M. Ertel in partial fulfillment of the requirements for the degree of Doctor
More informationMicroarrays & Gene Expression Analysis
Microarrays & Gene Expression Analysis Contents DNA microarray technique Why measure gene expression Clustering algorithms Relation to Cancer SAGE SBH Sequencing By Hybridization DNA Microarrays 1. Developed
More informationComparison of Microarray Pre-Processing Methods
Comparison of Microarray Pre-Processing Methods K. Shakya, H. J. Ruskin, G. Kerr, M. Crane, J. Becker Dublin City University, Dublin 9, Ireland Abstract Data pre-processing in microarray technology is
More informationArrayExpress and Gene Expression Atlas: Mining Functional Genomics data
ArrayExpress and Gene Expression Atlas: Mining Functional Genomics data Functional Genomics Team EBI-EMBL emma@ebi.ac.uk http://www.ebi.ac.uk/~emma/bcn_2012/ Talk structure Why do we need a database for
More informationGene Signal Estimates from Exon Arrays
Gene Signal Estimates from Exon Arrays I. Introduction: With exon arrays like the GeneChip Human Exon 1.0 ST Array, researchers can examine the transcriptional profile of an entire gene (Figure 1). Being
More informationIntroduction to Bioinformatics and Gene Expression Technologies
Introduction to Bioinformatics and Gene Expression Technologies Utah State University Fall 2017 Statistical Bioinformatics (Biomedical Big Data) Notes 1 1 Vocabulary Gene: hereditary DNA sequence at a
More informationIntroduction to Bioinformatics and Gene Expression Technologies
Vocabulary Introduction to Bioinformatics and Gene Expression Technologies Utah State University Fall 2017 Statistical Bioinformatics (Biomedical Big Data) Notes 1 Gene: Genetics: Genome: Genomics: hereditary
More informationExploration and Analysis of DNA Microarray Data
Exploration and Analysis of DNA Microarray Data Dhammika Amaratunga Senior Research Fellow in Nonclinical Biostatistics Johnson & Johnson Pharmaceutical Research & Development Javier Cabrera Associate
More informationBioconductor. Course in Practical Microarray Analysis Heidelberg
Bioconductor Course in Practical Microarray Analysis Heidelberg 23.-27.9.2002 Slides 2002 Sandrine Dudoit, Robert Gentleman. Adapted by Wolfgang Huber. Statistical computing Everywhere for statistical
More informationELE4120 Bioinformatics. Tutorial 5
ELE4120 Bioinformatics Tutorial 5 1 1. Database Content GenBank RefSeq TPA UniProt 2. Database Searches 2 Databases A common situation for alignment is to search through a database to retrieve the similar
More informationGene Regulation Solutions. Microarrays and Next-Generation Sequencing
Gene Regulation Solutions Microarrays and Next-Generation Sequencing Gene Regulation Solutions The Microarrays Advantage Microarrays Lead the Industry in: Comprehensive Content SurePrint G3 Human Gene
More informationMicroarray Data Analysis in GeneSpring GX 11. Month ##, 200X
Microarray Data Analysis in GeneSpring GX 11 Month ##, 200X Agenda Genome Browser GO GSEA Pathway Analysis Network building Find significant pathways Extract relations via NLP Data Visualization Options
More informationChIP-seq data analysis with Chipster. Eija Korpelainen CSC IT Center for Science, Finland
ChIP-seq data analysis with Chipster Eija Korpelainen CSC IT Center for Science, Finland chipster@csc.fi What will I learn? Short introduction to ChIP-seq Analyzing ChIP-seq data Central concepts Analysis
More informationGene-Level Analysis of Exon Array Data using Partek Genomics Suite 6.6
Gene-Level Analysis of Exon Array Data using Partek Genomics Suite 6.6 Overview This tutorial will demonstrate how to: Summarize core exon-level data to produce gene-level data Perform exploratory analysis
More informationDesigning a Complex-Omics Experiments. Xiangqin Cui. Section on Statistical Genetics Department of Biostatistics University of Alabama at Birmingham
Designing a Complex-Omics Experiments Xiangqin Cui Section on Statistical Genetics Department of Biostatistics University of Alabama at Birmingham 1/7/2015 Some slides are from previous lectures of Grier
More informationExercise1 ArrayExpress Archive - High-throughput sequencing example
ArrayExpress and Atlas practical: querying and exporting gene expression data at the EBI Gabriella Rustici gabry@ebi.ac.uk This practical will introduce you to the data content and query functionality
More informationGene Expression Data Analysis (I)
Gene Expression Data Analysis (I) Bing Zhang Department of Biomedical Informatics Vanderbilt University bing.zhang@vanderbilt.edu Bioinformatics tasks Biological question Experiment design Microarray experiment
More informationNext-Generation Sequencing Gene Expression Analysis Using Agilent GeneSpring GX
Next-Generation Sequencing Gene Expression Analysis Using Agilent GeneSpring GX Technical Overview Introduction RNA Sequencing (RNA-Seq) is one of the most commonly used next-generation sequencing (NGS)
More informationBiological Interpretation of Metabolomics Data. Martina Kutmon Maastricht University
Biological Interpretation of Metabolomics Data Martina Kutmon Maastricht University Contents Background on pathway analysis WikiPathways Building Research Communities on Biological Pathways Data Analysis
More informationBioinformatics. Microarrays: designing chips, clustering methods. Fran Lewitter, Ph.D. Head, Biocomputing Whitehead Institute
Bioinformatics Microarrays: designing chips, clustering methods Fran Lewitter, Ph.D. Head, Biocomputing Whitehead Institute Course Syllabus Jan 7 Jan 14 Jan 21 Jan 28 Feb 4 Feb 11 Feb 18 Feb 25 Sequence
More informationMeasuring gene expression (Microarrays) Ulf Leser
Measuring gene expression (Microarrays) Ulf Leser This Lecture Gene expression Microarrays Idea Technologies Problems Quality control Normalization Analysis next week! 2 http://learn.genetics.utah.edu/content/molecules/transcribe/
More informationResearch Powered by Agilent s GeneSpring
Research Powered by Agilent s GeneSpring Agilent Technologies, Inc. Carolina Livi, Bioinformatics Segment Manager Research Powered by GeneSpring Topics GeneSpring (GS) platform New features in GS 13 What
More informationGene List Enrichment Analysis
Outline Gene List Enrichment Analysis George Bell, Ph.D. BaRC Hot Topics March 16, 2010 Why do enrichment analysis? Main types Selecting or ranking genes Annotation sources Statistics Remaining issues
More informationOutline. Array platform considerations: Comparison between the technologies available in microarrays
Microarray overview Outline Array platform considerations: Comparison between the technologies available in microarrays Differences in array fabrication Differences in array organization Applications of
More informationMICROARRAYS: CHIPPING AWAY AT THE MYSTERIES OF SCIENCE AND MEDICINE
MICROARRAYS: CHIPPING AWAY AT THE MYSTERIES OF SCIENCE AND MEDICINE National Center for Biotechnology Information With only a few exceptions, every
More informationBIOINF/BENG/BIMM/CHEM/CSE 184: Computational Molecular Biology. Lecture 2: Microarray analysis
BIOINF/BENG/BIMM/CHEM/CSE 184: Computational Molecular Biology Lecture 2: Microarray analysis Genome wide measurement of gene transcription using DNA microarray Bruce Alberts, et al., Molecular Biology
More informationDesigning Complex Omics Experiments
Designing Complex Omics Experiments Xiangqin Cui Section on Statistical Genetics Department of Biostatistics University of Alabama at Birmingham 6/15/2015 Some slides are from previous lectures given by
More informationIdentifying Candidate Informative Genes for Biomarker Prediction of Liver Cancer
Identifying Candidate Informative Genes for Biomarker Prediction of Liver Cancer Nagwan M. Abdel Samee 1, Nahed H. Solouma 2, Mahmoud Elhefnawy 3, Abdalla S. Ahmed 4, Yasser M. Kadah 5 1 Computer Engineering
More informationIntroduction to NGS analyses
Introduction to NGS analyses Giorgio L Papadopoulos Institute of Molecular Biology and Biotechnology Bioinformatics Support Group 04/12/2015 Papadopoulos GL (IMBB, FORTH) IMBB NGS Seminar 04/12/2015 1
More informationAnalysis of a Tiling Regulation Study in Partek Genomics Suite 6.6
Analysis of a Tiling Regulation Study in Partek Genomics Suite 6.6 The example data set used in this tutorial consists of 6 technical replicates from the same human cell line, 3 are SP1 treated, and 3
More informationSupporting Online Material for
www.sciencemag.org/cgi/content/full/313/5795/1929/dc1 Supporting Online Material for The Connectivity Map: Using Gene-Expression Signatures to Connect Small Molecules, Genes, and Disease Justin Lamb, *
More informationEECS730: Introduction to Bioinformatics
EECS730: Introduction to Bioinformatics Lecture 14: Microarray Some slides were adapted from Dr. Luke Huan (University of Kansas), Dr. Shaojie Zhang (University of Central Florida), and Dr. Dong Xu and
More informationExperience with Weka by Predictive Classification on Gene-Expre
Experience with Weka by Predictive Classification on Gene-Expression Data Czech Technical University in Prague Faculty of Electrical Engineering Department of Cybernetics Intelligent Data Analysis lab
More informationGeneQuery: A phenotype search tool based on gene co-expression clustering. Alexander Predeus 21-oct-2015
GeneQuery: A phenotype search tool based on gene co-expression clustering Alexander Predeus 21-oct-2015 About myself Graduated from Moscow state University (1998-2003) PhD: Michigan State University (2003-2009):
More informationGREG GIBSON SPENCER V. MUSE
A Primer of Genome Science ience THIRD EDITION TAGCACCTAGAATCATGGAGAGATAATTCGGTGAGAATTAAATGGAGAGTTGCATAGAGAACTGCGAACTG GREG GIBSON SPENCER V. MUSE North Carolina State University Sinauer Associates, Inc.
More informationRandom matrix analysis for gene co-expression experiments in cancer cells
Random matrix analysis for gene co-expression experiments in cancer cells OIST-iTHES-CTSR 2016 July 9 th, 2016 Ayumi KIKKAWA (MTPU, OIST) Introduction : What is co-expression of genes? There are 20~30k
More informationCAP BIOINFORMATICS Su-Shing Chen CISE. 10/5/2005 Su-Shing Chen, CISE 1
CAP 5510-9 BIOINFORMATICS Su-Shing Chen CISE 10/5/2005 Su-Shing Chen, CISE 1 Basic BioTech Processes Hybridization PCR Southern blotting (spot or stain) 10/5/2005 Su-Shing Chen, CISE 2 10/5/2005 Su-Shing
More informationIntroduction to RNA-Seq in GeneSpring NGS Software
Introduction to RNA-Seq in GeneSpring NGS Software Dipa Roy Choudhury, Ph.D. Strand Scientific Intelligence and Agilent Technologies Learn more at www.genespring.com Introduction to RNA-Seq In a few years,
More informationSTATISTICAL CHALLENGES IN GENE DISCOVERY
STATISTICAL CHALLENGES IN GENE DISCOVERY THROUGH MICROARRAY DATA ANALYSIS 1 Central Tuber Crops Research Institute,Kerala, India 2 Dept. of Statistics, St. Thomas College, Pala, Kerala, India email:sreejyothi
More informationFrom Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow
From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow Technical Overview Import VCF Introduction Next-generation sequencing (NGS) studies have created unanticipated challenges with
More informationMicroarray Technique. Some background. M. Nath
Microarray Technique Some background M. Nath Outline Introduction Spotting Array Technique GeneChip Technique Data analysis Applications Conclusion Now Blind Guess? Functional Pathway Microarray Technique
More informationBioinformatics : Gene Expression Data Analysis
05.12.03 Bioinformatics : Gene Expression Data Analysis Aidong Zhang Professor Computer Science and Engineering What is Bioinformatics Broad Definition The study of how information technologies are used
More informationDNA Microarray Data Oligonucleotide Arrays
DNA Microarray Data Oligonucleotide Arrays Sandrine Dudoit, Robert Gentleman, Rafael Irizarry, and Yee Hwa Yang Bioconductor Short Course 2003 Copyright 2002, all rights reserved Biological question Experimental
More informationIPA Advanced Training Course
IPA Advanced Training Course Academia Sinica 2015 Oct Gene( 陳冠文 ) Supervisor and IPA certified analyst 1 Review for Introductory Training course Searching Building a Pathway Editing a Pathway for Publication
More informationNima Hejazi. Division of Biostatistics University of California, Berkeley stat.berkeley.edu/~nhejazi. nimahejazi.org github/nhejazi
Data-Adaptive Estimation and Inference in the Analysis of Differential Methylation for the annual retreat of the Center for Computational Biology, given 18 November 2017 Nima Hejazi Division of Biostatistics
More informationBasic aspects of Microarray Data Analysis
Hospital Universitari Vall d Hebron Institut de Recerca - VHIR Institut d Investigació Sanitària de l Instituto de Salud Carlos III (ISCIII) Basic aspects of Microarray Data Analysis Expression Data Analysis
More informationGene-centered resources at NCBI
COURSE OF BIOINFORMATICS a.a. 2014-2015 Gene-centered resources at NCBI We searched Accession Number: M60495 AT NCBI Nucleotide Gene has been implemented at NCBI to organize information about genes, serving
More informationLecture 2: March 8, 2007
Analysis of DNA Chips and Gene Networks Spring Semester, 2007 Lecture 2: March 8, 2007 Lecturer: Rani Elkon Scribe: Yuri Solodkin and Andrey Stolyarenko 1 2.1 Low Level Analysis of Microarrays 2.1.1 Introduction
More informationComparative Analysis using the Illumina DASL assay with FFPE tissue. Wendell Jones, PhD Vice President, Statistics and Bioinformatics
TM Comparative Analysis using the Illumina DASL assay with FFPE tissue Wendell Jones, PhD Vice President, Statistics and Bioinformatics Background EA has examined several protocol assay possibilities for
More information