MFMS: Maximal Frequent Module Set mining from multiple human gene expression datasets
|
|
- Marcus Reynolds
- 6 years ago
- Views:
Transcription
1 MFMS: Maximal Frequent Module Set mining from multiple human gene expression datasets Saeed Salem North Dakota State University Cagri Ozcaglar Amazon 8/11/2013
2 Introduction Gene expression analysis Use gene profile similarity for functional annotation of unknown genes Reduced to frequent subgraph mining problem Limits: Some genes with similar profiles do not have the same function Similarity in profiles due to perturbation of multiple biological pathways Solution: Analysis of multiple gene expression datasets from multiple experiments Sets of genes with similar expression profiles in a significant number of experiments Simultaneous analysis of multiple gene expression datasets 8/11/2013 MFMS: BioKDD
3 Motivation: Motivation and Goal Some genes with similar profiles do not have the same function Similar profiles due to perturbation of multiple biological pathways Goal: Analysis of multiple gene expression datasets from multiple experiments Sets of genes with similar expression profiles in a significant number of experiments Simultaneous analysis of multiple gene expression datasets 8/11/2013 MFMS: BioKDD
4 Outline 1. Introduction and motivation 2. Background: Gene expression analysis 3. MFMS algorithm 4. Results 5. Conclusion 8/11/2013 MFMS: BioKDD
5 Basics: Gene expression data Background Coexpression graph: G=(V,E) V: Genes E: Link between two genes if their expression profiles are correlated Edges can be weighted or unweighted (edges with small weights are pruned) Multiple gene expression datasets 8/11/2013 MFMS: BioKDD
6 Background Mining Frequent and Dense Subgraphs MULE: [Koyuturk et al., Bioinformatics, 2004] Efficient enumeration approach for mining frequent subgraphs from a set of graphs representing PPI networks for several species. Crochet: [Pei et al., KDD, 2005] Mines cross-all-graphs quasi-cliques Subgraphs which meet a density constraint and appear in all graphs. Crochet+: [Jiang et al., ACM TKDD, 2009] Same as Crochet, except that the density constraint is relaxed. The dense subgraphs appear in a subset of the graphs. 8/11/2013 MFMS: BioKDD
7 Background Gene expression analysis algorithms [Lee et al., Genome Research, 2004]: Summary graph with edges that occur in at least a number of graphs After building the summary graph, uses module discovery method (MCODE) to extract modules. CLOSECUT, SPLAT: [Yan et al., KDD, 2005] Mines relation graphs with connectivity constraints Input is multiple gene expression datasets, and output is a summarized set of frequent patterns CODENSE: [Hu et al., Bioinformatics, 2005] First step: Finds highly connected subgraphs from aggregate graph. Second step: Cluster the edges in the extracted subgraphs. [Huang et al., Bioinformatics, 2007] Builds a summary graph and represent it as a binary matrix of the form: Gene link x Graphs Biclustering on this matrix returns candidate frequent edgesets Connected components of each bicluster are returned as frequent patterns 8/11/2013 MFMS: BioKDD
8 Algorithm: MFMS MFMS: Algorithm for mining maximal frequent collections of: K-cliques Percolated k-cliques from graph representations of multiple gene expression datasets Edge mining problem Two steps: 1) Build the summary graph and generate the edge-attibuted matrix 2) Find MFMSs from the biclusters obtained from edge-attributed matrix: 2.1) Bicluster Edge-attributed matrix using GenMax algorithm [Gouda and Zaki, IEEE TKDD 2005] 2.2) For each bicluster, find highly connected components: k-cliques or k-percolated cliques Next: MFMS basic definitions 8/11/2013 MFMS: BioKDD
9 MFMS: Preliminaries Basic definitions and preliminaries: 8/11/2013 MFMS: BioKDD
10 Basic definitions and preliminaries: MFMS: Preliminaries In the edge attribute matrix, edges correspond to items and graph layers correspond to transactions. 8/11/2013 MFMS: BioKDD
11 FMS: Frequent Module Set: MFMS: Definitions Frequent Module Sets 8/11/2013 MFMS: BioKDD
12 Definitions: MFMS: Problem statement MFMS: Maximal Frequent Module Set MFMS problem statement: 8/11/2013 MFMS: BioKDD
13 MFMS: Flow diagram 8/11/2013 MFMS: BioKDD
14 MFMS: Algorithm 8/11/2013 MFMS: BioKDD
15 Results Human Gene Expression dataset: 52 Affymetrix microarray datasets [Huang et al., Bioinformatics, 2007] Summary graph: 9874 nodes, 49,817,037 edges, each appears in at least 1 graph. Many of these edges and genes appear in one or two datasets. Pruned edges that occur in < 7 graphs. Edge-attributed graph: Summary graph: 9784 nodes, edges Edge-attributed matrix: edges * 52 graphs 8/11/2013 MFMS: BioKDD
16 Results: Structural Topology analysis k-cliques An example of a maximal frequent edgesets and its induced subgraph and the associated collection of modules. Collection of 3-cliques Edge-induced subgraph Maximal Frequent edgeset RE = E in the edge-induced subgraph / E in the summary graph = 8 / 13 KRE = E in the collection / E in the summary graph = 6 /13 8/11/2013 MFMS: BioKDD
17 Results: Structural Topology analysis k-cliques An example of a maximal frequent edgesets and its induced subgraph and the associated collection of modules. Edge-attributed matrix Edge-induced subgraph Collection of 4-cliques 8/11/2013 MFMS: BioKDD
18 Results: Structural Topology analysis k-cliques k-cliques connected 8/11/2013 MFMS: BioKDD
19 Results: Structural Topology analysis percolated k-cliques Percolated k-cliques connected 8/11/2013 MFMS: BioKDD
20 Results: GO enrichment analysis GO enrichment analysis of human gene expression datasets Go-Miner tool with FDR-corrected p-values of ER: Percentage of modules enriched by at least one GO term. Module sets with complete gene symbol mapping only. Ratio of GO-enriched collections increase as α inreases. 8/11/2013 MFMS: BioKDD
21 Results: GO enrichment analysis Largest of these collections 4-cliques with 30 genes enriched with 111 GO terms 8/11/2013 MFMS: BioKDD
22 Results: GO enrichment analysis Top 20 GO terms enriched by 30 genes Top 4 GO terms: Mitotic phase of cell cycle GO: (M phase) GO: (mitotic cell cycle) GO: (mitosis) GO: (M phase of mitosis) Others include: GO: (DNA replication) GO: (Negative regulation cc) GO: (Cell division) 8/11/2013 MFMS: BioKDD
23 MFMS: Conclusion Two-step algorithm for mining collections of highly connected subnetworks from multiple gene expression datasets The first step is to construct edge-attributed graph, the second step is to find k-cliques and k-percolated cliques among the bicluster found on edge-attributed matrix. Experiments on Human gene expression dataset: MFMS on multiple Human gene expression datasets: Resulting gene sets are GO-enriched collections of genes: Gene sets are biologically significant. Gene sets are more likely to be independent of the conditions of microarray experiments, due to the support threshold used in MFMS. 8/11/2013 MFMS: BioKDD
24 Future work MFMS on other sets of multiple gene expression datasets: Yeast cell cycle dataset Time-course gene expression datasets Compare MFMS experimentally with existing algorithms: Molecular Hypergraphs [Rahman et al., ACM BCB 2012] CODENSE [Hu et al., Bioinformatics, 2005] ExpressCoExpress: Gene expression analysis using AWS A platform for applying various biclustering algorithms on gene expression datasets in the cloud 8/11/2013 MFMS: BioKDD
25 Thank you 8/11/2013 MFMS: BioKDD
CS 5984: Application of Basic Clustering Algorithms to Find Expression Modules in Cancer
CS 5984: Application of Basic Clustering Algorithms to Find Expression Modules in Cancer T. M. Murali January 31, 2006 Innovative Application of Hierarchical Clustering A module map showing conditional
More informationProtein-Protein-Interaction Networks. Ulf Leser, Samira Jaeger
Protein-Protein-Interaction Networks Ulf Leser, Samira Jaeger SHK Stelle frei Ab 1.9.2015, 2 Jahre, 41h/Monat Verbundprojekt MaptTorNet: Pankreatische endokrine Tumore Insb. statistische Aufbereitung und
More informationAnalysis of Microarray Data
Analysis of Microarray Data Lecture 3: Visualization and Functional Analysis George Bell, Ph.D. Senior Bioinformatics Scientist Bioinformatics and Research Computing Whitehead Institute Outline Review
More informationGene List Enrichment Analysis - Statistics, Tools, Data Integration and Visualization
Gene List Enrichment Analysis - Statistics, Tools, Data Integration and Visualization Aik Choon Tan, Ph.D. Associate Professor of Bioinformatics Division of Medical Oncology Department of Medicine aikchoon.tan@ucdenver.edu
More informationUncovering the Small Community Structure in Large Networks: A Local Spectral Approach
Uncovering the Small Community Structure in Large Networks: A Local Spectral Approach Yixuan Li 1, Kun He 2, David Bindel 1 and John E. Hopcroft 1 1 Cornell University, USA 2 Huazhong University of Science
More informationNetwork System Inference
Network System Inference Francis J. Doyle III University of California, Santa Barbara Douglas Lauffenburger Massachusetts Institute of Technology WTEC Systems Biology Final Workshop March 11, 2005 What
More informationBIOINFORMATICS AND SYSTEM BIOLOGY (INTERNATIONAL PROGRAM)
BIOINFORMATICS AND SYSTEM BIOLOGY (INTERNATIONAL PROGRAM) PROGRAM TITLE DEGREE TITLE Master of Science Program in Bioinformatics and System Biology (International Program) Master of Science (Bioinformatics
More informationAGILENT S BIOINFORMATICS ANALYSIS SOFTWARE
ACCELERATING PROGRESS IS IN OUR GENES AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE GENESPRING GENE EXPRESSION (GX) MASS PROFILER PROFESSIONAL (MPP) PATHWAY ARCHITECT (PA) See Deeper. Reach Further. BIOINFORMATICS
More informationMissing Value Estimation In Microarray Data Using Fuzzy Clustering And Semantic Similarity
P a g e 18 Vol. 10 Issue 12 (Ver. 1.0) October 2010 Missing Value Estimation In Microarray Data Using Fuzzy Clustering And Semantic Similarity Mohammad Mehdi Pourhashem 1, Manouchehr Kelarestaghi 2, Mir
More informationData Mining and Applications in Genomics
Data Mining and Applications in Genomics Lecture Notes in Electrical Engineering Volume 25 For other titles published in this series, go to www.springer.com/series/7818 Sio-Iong Ao Data Mining and Applications
More informationEstimating Cell Cycle Phase Distribution of Yeast from Time Series Gene Expression Data
2011 International Conference on Information and Electronics Engineering IPCSIT vol.6 (2011) (2011) IACSIT Press, Singapore Estimating Cell Cycle Phase Distribution of Yeast from Time Series Gene Expression
More informationCS4220: Knowledge Discovery Methods for Bioinformatics Unit 6: Protein Complex Prediction. Limsoon Wong
CS4220: Knowledge Discovery Methods for Bioinformatics Unit 6: Protein Complex Prediction Limsoon Wong 2 Lecture Outline Overview of protein complex prediction A case study: MCL-CAw Impact of PPIN cleansing
More informationAssociation Analysis Techniques for Bioinformatics Problems
Association Analysis Techniques for Bioinformatics Problems Gowtham Atluri, Rohit Gupta, Gang Fang, Gaurav Pandey, Michael Steinbach, and Vipin Kumar Department of Computer Science and Engineering, University
More informationMining Association Rules among Gene Functions in Clusters of Similar Gene Expression Maps
Mining Association Rules among Gene Functions in Clusters of Similar Gene Expression Maps Li An 1 *, Zoran Obradovic 2, Desmond Smith 3, Olivier Bodenreider 4, and Vasileios Megalooikonomou 1 1 Data Engineering
More informationTime Series Motif Discovery
Time Series Motif Discovery Bachelor s Thesis Exposé eingereicht von: Jonas Spenger Gutachter: Dr. rer. nat. Patrick Schäfer Gutachter: Prof. Dr. Ulf Leser eingereicht am: 10.09.2017 Contents 1 Introduction
More informationA PROTEIN INTERACTION NETWORK OF THE MALARIA PARASITE PLASMODIUM FALCIPARUM
A PROTEIN INTERACTION NETWORK OF THE MALARIA PARASITE PLASMODIUM FALCIPARUM DOUGLAS J. LACOUNT, MARISSA VIGNALI, RAKESH CHETTIER, AMIT PHANSALKAR, RUSSELL BELL, JAY R. HESSELBERTH, LORI W. SCHOENFELD,
More informationTechnical University of Denmark
1 of 13 Technical University of Denmark Written exam, 15 December 2007 Course name: Introduction to Systems Biology Course no. 27041 Aids allowed: Open Book Exam Provide your answers and calculations on
More informationEnhanced Biclustering on Expression Data
Enhanced Biclustering on Expression Data Jiong Yang Haixun Wang Wei Wang Philip Yu UIUC IBM T. J. Watson UNC Chapel Hill IBM T. J. Watson jioyang@cs.uiuc.edu haixun@us.ibm.com weiwang@cs.unc.edu psyu@us.ibm.com
More informationBayesian Variable Selection and Data Integration for Biological Regulatory Networks
Bayesian Variable Selection and Data Integration for Biological Regulatory Networks Shane T. Jensen Department of Statistics The Wharton School, University of Pennsylvania stjensen@wharton.upenn.edu Gary
More informationExhaustive Search Method of Gene Expression Modules and Its Application to Human Tissue Data
Exhaustive Search Method of Gene Expression Modules and Its Application to Human Tissue Data Yoshifumi Okada, Kosaku Okubo, Paul Horton, and Wataru Fujibuchi Abstract Recently, several biclustering methods
More informationChIP-Seq Data Analysis. J Fass UCD Genome Center Bioinformatics Core Wednesday 15 June 2015
ChIP-Seq Data Analysis J Fass UCD Genome Center Bioinformatics Core Wednesday 15 June 2015 What s the Question? Where do Transcription Factors (TFs) bind genomic DNA 1? (Where do other things bind DNA
More informationMetabolic Networks. Ulf Leser and Michael Weidlich
Metabolic Networks Ulf Leser and Michael Weidlich This Lecture Introduction Systems biology & modelling Metabolism & metabolic networks Network reconstruction Strategy & workflow Mathematical representation
More informationBABELOMICS: Microarray Data Analysis
BABELOMICS: Microarray Data Analysis Madrid, 21 June 2010 Martina Marbà mmarba@cipf.es Bioinformatics and Genomics Department Centro de Investigación Príncipe Felipe (CIPF) (Valencia, Spain) DNA Microarrays
More informationFeature Selection of Gene Expression Data for Cancer Classification: A Review
Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 50 (2015 ) 52 57 2nd International Symposium on Big Data and Cloud Computing (ISBCC 15) Feature Selection of Gene Expression
More informationToday. Last time. Lecture 5: Discrimination (cont) Jane Fridlyand. Oct 13, 2005
Biological question Experimental design Microarray experiment Failed Lecture : Discrimination (cont) Quality Measurement Image analysis Preprocessing Jane Fridlyand Pass Normalization Sample/Condition
More informationA STUDY ON STATISTICAL BASED FEATURE SELECTION METHODS FOR CLASSIFICATION OF GENE MICROARRAY DATASET
A STUDY ON STATISTICAL BASED FEATURE SELECTION METHODS FOR CLASSIFICATION OF GENE MICROARRAY DATASET 1 J.JEYACHIDRA, M.PUNITHAVALLI, 1 Research Scholar, Department of Computer Science and Applications,
More informationECLT 5810 E-Commerce Data Mining Techniques - Introduction. Prof. Wai Lam
ECLT 5810 E-Commerce Data Mining Techniques - Introduction Prof. Wai Lam Data Opportunities Business infrastructure have improved the ability to collect data Virtually every aspect of business is now open
More informationMicroarray Technique. Some background. M. Nath
Microarray Technique Some background M. Nath Outline Introduction Spotting Array Technique GeneChip Technique Data analysis Applications Conclusion Now Blind Guess? Functional Pathway Microarray Technique
More informationBioinformatics and Genomics: A New SP Frontier?
Bioinformatics and Genomics: A New SP Frontier? A. O. Hero University of Michigan - Ann Arbor http://www.eecs.umich.edu/ hero Collaborators: G. Fleury, ESE - Paris S. Yoshida, A. Swaroop UM - Ann Arbor
More informationBioinformatics Analysis of Nano-based Omics Data
Bioinformatics Analysis of Nano-based Omics Data Penny Nymark, Pekka Kohonen, Vesa Hongisto and Roland Grafström Hands-on Workshop on Nano Safety Assessment, 29 th September, 2016, National Technical University
More informationA New Path Length Measure Based on GO for Gene Similarity with Evaluation Using SGD Pathways
A New Path Length Measure Based on GO for Gene Similarity with Evaluation Using SGD Pathways Anurag Nagar University of Houston-Clear Lake Houston, TX, 77058, USA Hisham Al-Mubaid University of Houston-Clear
More informationFinancial Market - A Network Perspective
Financial Market - A Network Perspective Jukka-Pekka Onnela 1, Jari Saramäki 1, Kimmo Kaski 1, and János Kertész 2 1 Laboratory Computational Engineering, Helsinki University of Technology, P.O.Box 9203,
More informationFunction Prediction of Proteins from their Sequences with BAR 3.0
Open Access Annals of Proteomics and Bioinformatics Short Communication Function Prediction of Proteins from their Sequences with BAR 3.0 Giuseppe Profiti 1,2, Pier Luigi Martelli 2 and Rita Casadio 2
More informationMethods for network visualization and gene enrichment analysis February 22, Jeremy Miller Postdoctoral Fellow - Computational Data Analysis
Methods for network visualization and gene enrichment analysis February 22, 2012 Jeremy Miller Postdoctoral Fellow - Computational Data Analysis Outline Visualizing networks using R Visualizing networks
More informationAdvisors: Prof. Louis T. Oliphant Computer Science Department, Hiram College.
Author: Sulochana Bramhacharya Affiliation: Hiram College, Hiram OH. Address: P.O.B 1257 Hiram, OH 44234 Email: bramhacharyas1@my.hiram.edu ACM number: 8983027 Category: Undergraduate research Advisors:
More informationBioinformatics and computational tools
Bioinformatics and computational tools Etienne P. de Villiers (PhD) International Livestock Research Institute Nairobi, Kenya International Livestock Research Institute Nairobi, Kenya ILRI works at the
More informationSmart India Hackathon
TM Persistent and Hackathons Smart India Hackathon 2017 i4c www.i4c.co.in Digital Transformation 25% of India between age of 16-25 Our country needs audacious digital transformation to reach its potential
More informationA Genetic Approach for Gene Selection on Microarray Expression Data
A Genetic Approach for Gene Selection on Microarray Expression Data Yong-Hyuk Kim 1, Su-Yeon Lee 2, and Byung-Ro Moon 1 1 School of Computer Science & Engineering, Seoul National University Shillim-dong,
More informationV 1 Introduction! Fri, Oct 24, 2014! Bioinformatics 3 Volkhard Helms!
V 1 Introduction! Fri, Oct 24, 2014! Bioinformatics 3 Volkhard Helms! How Does a Cell Work?! A cell is a crowded environment! => many different proteins,! metabolites, compartments,! On a microscopic level!
More informationCompendium of Immune Signatures Identifies Conserved and Species-Specific Biology in Response to Inflammation
Immunity Supplemental Information Compendium of Immune Signatures Identifies Conserved and Species-Specific Biology in Response to Inflammation Jernej Godec, Yan Tan, Arthur Liberzon, Pablo Tamayo, Sanchita
More informationUnderstanding protein lists from comparative proteomics studies
Understanding protein lists from comparative proteomics studies Bing Zhang, Ph.D. Department of Biomedical Informatics Vanderbilt University School of Medicine bing.zhang@vanderbilt.edu A typical comparative
More informationEnabling Reproducible Gene Expression Analysis
Enabling Reproducible Gene Expression Analysis Limsoon Wong 25 July 2011 (Joint work with Donny Soh, Difeng Dong, Yike Guo) 2 Plan An issue in gene expression analysis Comparing pathway sources: Comprehensiveness,
More informationKnowledge-Guided Analysis with KnowEnG Lab
Han Sinha Song Weinshilboum Knowledge-Guided Analysis with KnowEnG Lab KnowEnG Center Powerpoint by Charles Blatti Knowledge-Guided Analysis KnowEnG Center 2017 1 Exercise In this exercise we will be doing
More informationWhat is the Difference between a Human and a Chimp? T. M. Murali
What is the Difference between a Human and a Chimp? T. M. Murali murali@cs.vt.edu http://bioinformatics.cs.vt.edu/ murali Introduction to Computational Biology and Bioinformatics (CS 3824) March 25 and
More informationThe first and only fully-integrated microarray instrument for hands-free array processing
The first and only fully-integrated microarray instrument for hands-free array processing GeneTitan Instrument Transform your lab with a GeneTitan Instrument and experience the unparalleled power of streamlining
More informationIntroduction to ChIP Seq data analyses. Acknowledgement: slides taken from Dr. H
Introduction to ChIP Seq data analyses Acknowledgement: slides taken from Dr. H Wu @Emory ChIP seq: Chromatin ImmunoPrecipitation it ti + sequencing Same biological motivation as ChIP chip: measure specific
More information2/23/16. Protein-Protein Interactions. Protein Interactions. Protein-Protein Interactions: The Interactome
Protein-Protein Interactions Protein Interactions A Protein may interact with: Other proteins Nucleic Acids Small molecules Protein-Protein Interactions: The Interactome Experimental methods: Mass Spec,
More informationImproving the Response Time of an Isolated Service by using GSSN
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology ISSN 2320 088X IMPACT FACTOR: 5.258 IJCSMC,
More informationUnderstanding protein lists from proteomics studies. Bing Zhang Department of Biomedical Informatics Vanderbilt University
Understanding protein lists from proteomics studies Bing Zhang Department of Biomedical Informatics Vanderbilt University bing.zhang@vanderbilt.edu A typical comparative shotgun proteomics study IPI00375843
More informationMethods for comparing multiple microbial communities. james robert white, October 1 st, 2007
Methods for comparing multiple microbial communities. james robert white, whitej@umd.edu Advisor: Mihai Pop, mpop@umiacs.umd.edu October 1 st, 2007 Abstract We propose the development of new software to
More informationA Propagation-based Algorithm for Inferring Gene-Disease Associations
A Propagation-based Algorithm for Inferring Gene-Disease Associations Oron Vanunu Roded Sharan Abstract: A fundamental challenge in human health is the identification of diseasecausing genes. Recently,
More informationExploring protein interaction data using ppistats
Exploring protein interaction data using ppistats Denise Scholtens and Tony Chiang Introduction ppistats is a package that provides tools for the description and analysis of protein interaction data represented
More informationMulCom: a Multiple Comparison statistical test for microarray data in Bioconductor.
MulCom: a Multiple Comparison statistical test for microarray data in Bioconductor. Claudio Isella, Tommaso Renzulli, Davide Corà and Enzo Medico May 3, 2016 Abstract Many microarray experiments compare
More informationIntroduction to Bioinformatics
Introduction to Bioinformatics Dr. Taysir Hassan Abdel Hamid Lecturer, Information Systems Department Faculty of Computer and Information Assiut University taysirhs@aun.edu.eg taysir_soliman@hotmail.com
More informationProcedia - Social and Behavioral Sciences 97 ( 2013 )
Available online at www.sciencedirect.com ScienceDirect Procedia - Social and Behavioral Sciences 97 ( 2013 ) 602 611 Abstract The 9 th International Conference on Cognitive Science Filtering of background
More informationClassification and Learning Using Genetic Algorithms
Sanghamitra Bandyopadhyay Sankar K. Pal Classification and Learning Using Genetic Algorithms Applications in Bioinformatics and Web Intelligence With 87 Figures and 43 Tables 4y Spri rineer 1 Introduction
More informationGene Expression Data Analysis (I)
Gene Expression Data Analysis (I) Bing Zhang Department of Biomedical Informatics Vanderbilt University bing.zhang@vanderbilt.edu Bioinformatics tasks Biological question Experiment design Microarray experiment
More informationPersonalized Shopping App with Customer Feedback Mining and Secure Transaction
Personalized Shopping App with Customer Feedback Mining and Secure Transaction Swati H. Kalmegh, Nutan Dhande Mtech Student, Department of CSE, swatihkalmegh@gmail.com Contact No. 9404725175 Abstract Reputation-based
More informationGS Analysis of Microarray Data
GS01 0163 Analysis of Microarray Data Keith Baggerly and Kevin Coombes Department of Bioinformatics and Computational Biology UT M. D. Anderson Cancer Center kabagg@mdanderson.org kcoombes@mdanderson.org
More informationLees J.A., Vehkala M. et al., 2016 In Review
Sequence element enrichment analysis to determine the genetic basis of bacterial phenotypes Lees J.A., Vehkala M. et al., 2016 In Review Journal Club Triinu Kõressaar 16.03.2016 Introduction Bacterial
More informationGS Analysis of Microarray Data
GS01 0163 Analysis of Microarray Data Keith Baggerly and Brad Broom Department of Bioinformatics and Computational Biology UT M. D. Anderson Cancer Center kabagg@mdanderson.org bmbroom@mdanderson.org 7
More informationDeciphering gene sets annotations with ontology based visualization
Deciphering gene sets annotations with ontology based visualization A. Ayllón-Benítez 1,2, P. Thébault 1, J.T. Fernández-Breis 3, M. Quesada-Martínez 3, F. Mougin 1,2 and R. Bourqui 1 1 Univ. Bordeaux,
More informationPéter Antal Ádám Arany Bence Bolgár András Gézsi Gergely Hajós Gábor Hullám Péter Marx András Millinghoffer László Poppe Péter Sárközy BIOINFORMATICS
Péter Antal Ádám Arany Bence Bolgár András Gézsi Gergely Hajós Gábor Hullám Péter Marx András Millinghoffer László Poppe Péter Sárközy BIOINFORMATICS The Bioinformatics book covers new topics in the rapidly
More informationInternational Journal of Recent Trends in Electrical & Electronics Engg., Dec IJRTE ISSN:
Market Basket Analysis of Sports Store using Association Rules Harpreet Kaur 1 and Kawaljeet Singh 2 1 Department of Computer Science, Punjabi University, Patiala, India. 2 University Computer Centre,
More informationModelling gender-specific regulation of tau in Alzheimer s disease
Modelling gender-specific regulation of tau in Alzheimer s disease Motivation: Age-dependence of gender-differences in AD Sex-specific cumulative risk for a 65-year-old to develop AD by 95 years of age.
More informationInternational Journal of Computer Engineering and Applications, ICCSTAR-2016, Special Issue, May.16
A REVIEW OF DATA MINING TECHNIQUES FOR AGRICULTURAL CROP YIELD PREDICTION Akshitha K 1, Dr. Rajashree Shettar 2 1 M.Tech Student, Dept of CSE, RV College of Engineering,, Bengaluru, India 2 Prof: Dept.
More informationMining Residue Contacts in Proteins Using Local Structure Predictions
IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS PART B: CYBERNETICS, VOL. 33, NO. 5, OCTOBER 2003 789 Mining Residue Contacts in Proteins Using Local Structure Predictions Mohammed J. Zaki, Associate
More informationless sensitive than RNA-seq but more robust analysis pipelines expensive but quantitiatve standard but typically not high throughput
Chapter 11: Gene Expression The availability of an annotated genome sequence enables massively parallel analysis of gene expression. The expression of all genes in an organism can be measured in one experiment.
More informationLarge Collection of Diverse Gene Set Search Queries Recapitulate Known Protein-Protein Interactions and Gene-Gene Functional Associations
Large Collection of Diverse Gene Set Search Queries Recapitulate Known Protein-Protein Interactions and Gene-Gene Functional Associations arxiv:1601.01653v1 [q-bio.mn] 7 Jan 2016 Neil R. Clark 1,2,3, Avi
More informationMachine Learning in Computational Biology CSC 2431
Machine Learning in Computational Biology CSC 2431 Lecture 9: Combining biological datasets Instructor: Anna Goldenberg What kind of data integration is there? What kind of data integration is there? SNPs
More informationSimultaneous profiling of transcriptome and DNA methylome from a single cell
Additional file 1: Supplementary materials Simultaneous profiling of transcriptome and DNA methylome from a single cell Youjin Hu 1, 2, Kevin Huang 1, 3, Qin An 1, Guizhen Du 1, Ganlu Hu 2, Jinfeng Xue
More information/ Computational Genomics. Time series analysis
10-810 /02-710 Computational Genomics Time series analysis Expression Experiments Static: Snapshot of the activity in the cell Time series: Multiple arrays at various temporal intervals Time Series Examples:
More informationAxiom mydesign Custom Array design guide for human genotyping applications
TECHNICAL NOTE Axiom mydesign Custom Genotyping Arrays Axiom mydesign Custom Array design guide for human genotyping applications Overview In the past, custom genotyping arrays were expensive, required
More informationVisual comparison of metabolic pathways
Journal of Visual Languages and Computing 14 (2003) 327 340 Journal of Visual Languages & Computing www.elsevier.com/locate/jvlc Visual comparison of metabolic pathways Falk Schreiber* Bioinformatics Center
More informationA logistic regression model for Semantic Web service matchmaking
. BRIEF REPORT. SCIENCE CHINA Information Sciences July 2012 Vol. 55 No. 7: 1715 1720 doi: 10.1007/s11432-012-4591-x A logistic regression model for Semantic Web service matchmaking WEI DengPing 1*, WANG
More informationIntroduction to Bioinformatics. Ulf Leser
Introduction to Bioinformatics Ulf Leser Bioinformatics 25.4.2003 50. Jubiläum der Entdeckung der Doppelhelix durch Watson/Crick 14.4.2003 Humanes Genom zu 99% sequenziert mit 99.99% Genauigkeit 2008 Genom
More informationA CLOSE-TO OPTIMUM BI-CLUSTERING ALGORITHM FOR MICROARRAY GENE EXPRESSION DATA
A CLOSE-TO OPTIMUM BI-CLUSTERING ALGORITHM FOR MICROARRAY GENE EXPRESSION DATA Guojun Li 1*, 3, Qin Ma 1,3, Bingqiang Liu 1,3, Haibao Tang 2, Andrew H. Paterson 2, and Ying Xu 1 1 Computational Systems
More informationIntroduction to cycle package
Introduction to cycle package Matthias E. Futschik SysBioLab, University of Algarve, Faro, Portugal URL: http://cycle.sysbiolab.eu April 30, 2018 Contents 1 Overview 1 2 Installation requirements 2 3 Case
More informationWhole Transcriptome Analysis of Illumina RNA- Seq Data. Ryan Peters Field Application Specialist
Whole Transcriptome Analysis of Illumina RNA- Seq Data Ryan Peters Field Application Specialist Partek GS in your NGS Pipeline Your Start-to-Finish Solution for Analysis of Next Generation Sequencing Data
More informationNima Hejazi. Division of Biostatistics University of California, Berkeley stat.berkeley.edu/~nhejazi. nimahejazi.org github/nhejazi
Data-Adaptive Estimation and Inference in the Analysis of Differential Methylation for the annual retreat of the Center for Computational Biology, given 18 November 2017 Nima Hejazi Division of Biostatistics
More informationOutline. Analysis of Microarray Data. Most important design question. General experimental issues
Outline Analysis of Microarray Data Lecture 1: Experimental Design and Data Normalization Introduction to microarrays Experimental design Data normalization Other data transformation Exercises George Bell,
More informationClassification Model for Intent Mining in Personal Website Based on Support Vector Machine
, pp.145-152 http://dx.doi.org/10.14257/ijdta.2016.9.2.16 Classification Model for Intent Mining in Personal Website Based on Support Vector Machine Shuang Zhang, Nianbin Wang School of Computer Science
More informationDependency Graph and Metrics for Defects Prediction
The Research Bulletin of Jordan ACM, Volume II(III) P a g e 115 Dependency Graph and Metrics for Defects Prediction Hesham Abandah JUST University Jordan-Irbid heshama@just.edu.jo Izzat Alsmadi Yarmouk
More informationPackage DNABarcodes. March 1, 2018
Type Package Package DNABarcodes March 1, 2018 Title A tool for creating and analysing DNA barcodes used in Next Generation Sequencing multiplexing experiments Version 1.9.0 Date 2014-07-23 Author Tilo
More informationAdvanced Introduction to Machine Learning
Advanced Introduction to Machine Learning 10715, Fall 2014 Structured Sparsity, with application in Computational Genomics Eric Xing Lecture 3, September 15, 2014 Reading: Eric Xing @ CMU, 2014 1 Structured
More informationEngineering Genetic Circuits
Engineering Genetic Circuits I use the book and slides of Chris J. Myers Lecture 0: Preface Chris J. Myers (Lecture 0: Preface) Engineering Genetic Circuits 1 / 19 Samuel Florman Engineering is the art
More informationWhat is Bioinformatics? Bioinformatics is the application of computational techniques to the discovery of knowledge from biological databases.
What is Bioinformatics? Bioinformatics is the application of computational techniques to the discovery of knowledge from biological databases. Bioinformatics is the marriage of molecular biology with computer
More informationAn Integrative -omics Approach to Identify Functional Sub-Networks in Human Colorectal Cancer
An Integrative -omics Approach to Identify Functional Sub-Networks in Human Colorectal Cancer Rod K. Nibbe 1,2 *, Mehmet Koyutürk 1,3, Mark R. Chance 1,4 1 Center for Proteomics & Bioinformatics, Case
More informationAnalysing Clickstream Data: From Anomaly Detection to Visitor Profiling
Analysing Clickstream Data: From Anomaly Detection to Visitor Profiling Peter I. Hofgesang and Wojtek Kowalczyk Free University of Amsterdam, Department of Computer Science, Amsterdam, The Netherlands
More informationUsing Decision Tree to predict repeat customers
Using Decision Tree to predict repeat customers Jia En Nicholette Li Jing Rong Lim Abstract We focus on using feature engineering and decision trees to perform classification and feature selection on the
More informationA Random Forest proximity matrix as a new measure for gene annotation *
A Random Forest proximity matrix as a new measure for gene annotation * Jose A. Seoane 1, Ian N.M. Day 1, Juan P. Casas 2, Colin Campbell 3 and Tom R. Gaunt 1,4 1 Bristol Genetic Epidemiology Labs. School
More informationIntroduction to Bioinformatics and Gene Expression Technology
Vocabulary Introduction to Bioinformatics and Gene Expression Technology Utah State University Spring 2014 STAT 5570: Statistical Bioinformatics Notes 1.1 Gene: Genetics: Genome: Genomics: hereditary DNA
More informationPackage bc3net. November 28, 2016
Version 1.0.4 Date 2016-11-26 Package bc3net November 28, 2016 Title Gene Regulatory Network Inference with Bc3net Author Ricardo de Matos Simoes [aut, cre], Frank Emmert-Streib [aut] Maintainer Ricardo
More informationAnalysis of Development Factors for Asian Countries using DWM on Big Data
IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727, Volume 19, Issue 2, Ver. IV (Mar.-Apr. 2017), PP 78-82 www.iosrjournals.org Analysis of Development Factors for Asian
More informationNext-Generation Sequencing Gene Expression Analysis Using Agilent GeneSpring GX
Next-Generation Sequencing Gene Expression Analysis Using Agilent GeneSpring GX Technical Overview Introduction RNA Sequencing (RNA-Seq) is one of the most commonly used next-generation sequencing (NGS)
More informationAgilent GeneSpring GX 10: Beyond. Pam Tangvoranuntakul Product Manager, GeneSpring October 1, 2008
Agilent GeneSpring GX 10: Gene Expression and Beyond Pam Tangvoranuntakul Product Manager, GeneSpring October 1, 2008 GeneSpring GX 10 in the News Our Goals for GeneSpring GX 10 Goal 1: Bring back GeneSpring
More information( 1 ) Mining disease integrated ontology
( 1 ) Mining disease integrated ontology Taysir Hassan A. Soliman, Marwa Hussein, and Mohamed El-Sharkawi Ontology has become a very vital issue to solve important issues regarding human diseases through
More informationBIOINFORMATICS ORIGINAL PAPER doi: /bioinformatics/btm276
Vol. 23 no. 15 2007, pages 1927 1935 BIOINFORMATICS ORIGINAL PAPER doi:10.1093/bioinformatics/btm276 Gene expression Mining co-regulated gene profiles for the detection of functional associations in gene
More information7.1 Data infrastructure and data mining model of internal exposome
This project has received funding from the European Union s Seventh Programme for research, technological development and demonstration under grant agreement N 603946 FP7-ENV-2013-603946 http://www.heals-eu.eu/
More informationGET MORE VALUE OUT OF BIG DATA
GET MORE VALUE OUT OF BIG DATA Enterprise data is increasing at an alarming rate. An International Data Corporation (IDC) study estimates that data is growing at 50 percent a year and will grow by 50 times
More information