Introduction Reference System Requirements GIST (Gibbs sampler Infers Signal Transduction)... 4
|
|
- Sherman Barber
- 6 years ago
- Views:
Transcription
1 IMPACT user manual
2 Introduction... 3 Reference... 3 System Requirements... 3 GIST (Gibbs sampler Infers Signal Transduction)... 4 Framework... 4 Input Files... 5 Key Functions and Usage... 6 Output files SOUL (Structural Organization Uncovers pathway Landscape) Framework Input Files Key Functions and Usage Output files Examples Running GIST Running SOUL Re-running SOUL... 20
3 Introduction The IMPACT (Infer Modularization of PAthway CrossTalk) algorithm is a pipeline for pathway identification, which consists of two methods: GIST (Gibbs sampler Infer Signal Transduction) and SOUL (Structural Organization Uncovers pathway Landscape). GIST is a distribution learning method that builds pathways among given source and target proteins. By sampling pathway states according to pathway potential distributions, GIST extracts signal transduction pathways that are associated with phenotype and consent with knowledge. SOUL is a post-processing method that utilizes pathway samples from GIST to further investigate pathway modularization and crosstalk. IMPACT offers a novel perspective to identify aberrant signal transduction in cancer study by emphasizing on nodal proteins that crosslink multiple signaling pathways instead of studying individual pathways independently. The identified nodal proteins by IMPACT may be potential drug targets that oversee multiple disease related pathways. Reference Jinghua Gu, Jianhua Xuan, Ayesha N. Shajahan-Haq, Diane M. Demas, and Robert Clarke, Unraveling the intracellular signal transduction and pathway crosstalk by exploring pathway landscape, submitted to ** System Requirements The software was developed on Matlab R2012a and is compatible with both Windows and Linux enviroments. This package also includes Matlab BGL library for calculating betweenness centrality (./matlab_bgl/*). Please see LICENSE1.txt, LICENSE2.txt, and readme.txt for copy right information and disclaimer.
4 GIST (Gibbs sampler Infers Signal Transduction) Framework
5 Input Files 1. Gene_expression_data.txt Gene_expression_data.txt provides the gene expression of the genes across multiple samples with each row corresponding to one gene and each column corresponding to one sample. Sample names are listed in the first row and gene names are listed in the first column. Both Entrez_id and Gene_symbol can be used as the gene names with title Entrez_id and Gene_symbol, respectively. Note: Microarray gene expression data after logarithm is preferred for the sake of statistics calculated and used in the algorithm. 2. Phenotype_info.txt Phenotype_info.txt provides the phenotype information of the samples, where the sample names are listed in the first column and the corresponding phenotypes are listed in the second column. Note: The current version of the package is designed for comparison between samples from two different phenotypes. Differential study among multiple phenotypes will be extended in the future. 3. input_network.txt input_network.txt provides the interaction information of the genes with Entrez_id as the gene names. 4. Source_gene.txt, Sink_gene.txt Source_gene.txt and Sink_gene.txt provide the start and target genes of the pathways of interest. Official gene symbols are used as the gene names. 5. location_info.xls* location_info.xls provides the cellular location information of the genes. Users can use the one included in the package or provide their own information. 6. weight_parameters.txt*
6 weight_parameters.txt provides the weights used in the cost function. Users can modify the weight parameters for their specific case study. If not provided, the default values of the weight parameters are all one. *: If they are not provided, default settings are used. Key Functions and Usage Function: Main_GIST_func(L, ite, PATHNUM, INCONS, weights); Description: The main function of the GIST algorithm, which implements GIST algorithm on the given input files with predefined parameters which can be user defined or default values. The results are the selected top ranked samples of pathways as well as the estimated pathway landscape. Arguments: Table 1 Arguments of Main_GIST_func() Name Description Default value L Length of pathway 8 ite Number of samples of pathways PATHNUM Number of top ranked pathways saved for SOUL 100 INCONS Indicator of whether inconsistent pathways would be filtered weights Name of the weight parameter file All ones 1 Function: [G_entrez, G_loc, G_locn, G_symbol, G_p, G_nodez, G0, G1, G_fld, source, sink] = Get_input(); Description: Arguments: Calculate the values needed for the GIST algorithm from input files.
7 Table 2 Arguments of Get_input() Name Description Example Is mandatory G_entrez Entrez gene id 7157 NO G_loc Subcellular location Nucleus Yes** G_locn Subcellular location id 4 Yes G_symbol Official gene symbol TP53 Yes G_p Node p-value Yes** G_nodesz Node z-score Yes G0 Sparse binary network (undirected) 0 or 1 Yes G1 Sparse weighted network (undirected) Yes G_fld Log2 fold change Yes** source source gene symbols ESR1 Yes sink target gene symbols TP53 Yes **: data that is not directly required for the algorithm, but is only needed for complete summarization (annotation) of output results (e.g., fold change of genes). Function: F0 = bldflownet(g0,source,sink,l); Description: Build flow network between the given source nodes and sink nodes. Arguments: Name G0 source sink L F0 Table 3 Arguments of bldflownet() Description Sparse binary network (undirected) Source/start proteins Target/end proteins Length of pathway Flow network (unweighted)
8 Function: G = bldweightmatrix(f0,g0,g1,delta); Description: Build directed weighted matrix for given group. Arguments: Name F0 G0 G1 delta G Table 4 Arguments of bldweightmatrix() Description Flow network (unweighted) Sparse binary network (undirected) Sparse weighted network (undirected) Baseline score for pseudo-edges (delta = -5 as default) Sparse weighted network (directed) Function: Description: [S V W valid_path valid_edge] = rndinitial(g,g0,f0,h,l); Random initialization: a path is randomly generated as the starting state of the sampling procedure, where pseudo-edges may be included. Arguments: Name G G0 F0 H L S V W Table 5 Arguments of rndinitial() Description Sparse weighted network (directed) Sparse binary network (undirected) Flow network (unweighted) Node z-score Pathway length Initial pathway Pathway node potential Pathway edge potential
9 valid_path valid_edge If pathway contains pseudo-edges (binary indicator) (L-1) 1 vector indicating if an edge is a pseud-edge Function: [sampledpaths, pathfreq, pathscore] = GIST(G, G0, G_locn, F0, G_nodez, V, W, S, valid_edge, L, ite, T, weight); Description: Arguments: GIST algorithm: generate a sequence of samples from a sampling procedure. Name G G0 G_locn F0 G_nodez V W S valid_path valid_edge L ite T weight sampledpaths pathfreq pathscore Table 6 Arguments of GIST() Description Sparse weighted network (directed) Sparse binary network (undirected) 1:extracellular space; 2: plasma membrane; 3: cytoplasm; 4: nucleus Flow network (unweighted) Node z-score Pathway node potential of the initial pathway Pathway edge potential of the initial pathway Initial pathway If pathway contains pseudo-edges (binary indicator) (L-1) 1 vector indicating if an edge is a pseud-edge Pathway length Number of sampling iterations Temperature weight_parameters.txt Sampled pathways Frequency of sampled pathway Likelihood score of sampled pathway
10 Function: [gist_slist, gist_wlist, gist_symb, network, G_nscore, G_bc] = EST_edge(rankedSampledPaths1, rankedpathscore1, rankedpathsymb1, G, G_locn, G_symbol, PATHNUM,INCONS); Description: Arguments: Estimate the pathway landscape from the obtained samples of pathways. Name rankedsampledpaths1 rankedpathscore1 rankedpathsymb1 G G_locn G_symbol PATHNUM INCONS gist_slist gist_wlist gist_symb network G_nscore G_bc Table 7 Arguments of est_edge() Description Gene index of the samples of pathways ranked according to pathway potential score Pathway potential score Gene symbols of the ranked pathways Sparse weighted network (directed) 1:extracellular space; 2: plasma membrane; 3: cytoplasm; 4: nucleus Official gene symbol Number of top pathways used to estimate edge attributes Indicator of whether inconsistent pathways would be filtered Gene index of the selected top pathways Potential score of the selected top pathways Gene symbols of the selected top pathways Estimated pathway landscape with estimated edge score and estimated edge direction Estimated node score in the estimated pathway landscape Betweenness centrality of the nodes in the estimated pathway landscape Output files
11 The outputs are saved in the./outputs folder with both.mat and.xls formats. 1. output_gist_results.mat Selected top ranked samples of pathways are saved with.mat format for the following SOUL algorithm. Pathway landscape estimated from the top ranked samples in this case study is also saved in the file. 2. output_gist_network.xls; output_gist_node_attr.xls Estimated pathway landscape as well as the calculated statistics of the genes and the interactions is saved in these two files, which can be imported to Cytoscape for visualization. 3.output_GIST_SelectedPathScore.xls; output_gist_selectedpathsymb.xls These two files save top ranked samples of pathways for the following SOUL algorithm.
12 SOUL (Structural Organization Uncovers pathway Landscape) Framework
13 Input Files SOUL algorithm further analyzes the samples generated from GIST algorithm to reveal pathway landscape. Thus, the outputs of the GIST algorithm are the inputs of the SOUL algorithm. Users can pile up multiple sets of samples generated from GIST in different case studies as the input files for SOUL to identify cross talk of the pathways interested in different case studies. 1. output_gist_node_attr.xls Characteristics of the genes involved in the selected top ranked samples of pathways. 2.output_GIST_SelectedPathScore.xls; output_gist_selectedpathsymb.xls Selected top ranked samples of pathways generated by the GIST algorithm. Key Functions and Usage Function: main_soul_func(incluster, clusterid) Description: The main function of the SOUL algorithm, which implements the SOUL algorithm on the selected top ranked samples of pathways generated by the GIST algorithm. The results are the estimated pathway landscape. Arguments: Table 8 Arguments of Main_SOUL_func() Name Description Default value INCLUSTER clusterid* Indicator of whether further select samples using clustering method Name of the file of the index of pathways of interest 1 Not available
14 *When the SOUL algorithm is implemented for the first time, the clusterid file is not available. Then, the pathways automatically selected from hieratical clustering will be used for pathway landscape estimation, and the index of the further selected pathways corresponding to the clustering labels will be saved in cluster_id.xls. Based on the structural heatmap of the clustering result and the pathway scores, user may further select clusters of pathways of interest, and then modify the cluster_id.xls file to re-estimate pathway landscape using SOUL_part2(). Function: SOUL_part2() Description: Use the clustering results together with the index of samples of pathways as the inputs to estimate the pathway landscape. Both the inputs and the outputs are the.xls files. The output files are the output files of the main_soul_func(). Function: [network, G_nscore, G_bc] = est_edge_sub(gist_slist, gist_wlist, G_symbol) Description: Estimate the pathway landscape from the obtained samples of pathways. est_edge_sub(), which is the simplified version of EST_edge(), uses all of the samples of pathways in the inputs for pathway landscape estimation. Arguments: Name gist_slist gist_wlist G_symbol network G_nscore G_bc Table 9 Arguments of est_edge() Description Gene index of the selected top pathways Potential score of the selected top pathways Official gene symbol Estimated pathway landscape with estimated edge score and estimated edge direction Estimated node score in the estimated pathway landscape Betweenness centrality of the nodes in the estimated pathway landscape
15 Output files 1. output_soul_network.xls; output_soul_node_attr.xls The major results of the IMPACT algorithm, which can be imported into Cytoscape for visualization. Estimated pathway landscape as well as the calculated statistics of the genes and the interactions is saved in these two files. Table 10 Format of SOUL_network_xls Column Description Example 1 Protein 1* IRS1 2 Protein 2* BIRC5 3 Edge direction probability 1 4 Normalized edge score *: in the context of directed network, an edge always starts from protein1 to protein 2. Table 11 Format of SOUL_node_attr.xls Column Description Example 1 Official gene symbol BRCA1 2 Log2 fold change p-value Subcellular location Nucleus 5 Subcellular location id 4 6 Node score Betweenness centrality Log2(Betweenness centrality) Clustering result.fig; Pathway score.fig Clustering result.fig is the structural heatmap generated from hierarchical clustering. Pathway score.fig is the score the pathways with the same order as in the Clustering result.fig. 3*. Results from clustering output_soul_sorted_similarity_matrix.xls: Similarity matrix of the samples of the paths sorted by the hierarchical clustering result (the same order as in the Clustering result.fig ).
16 output_soul_clustered_paths_label.xls: Labels of the samples of pathways generated from hierarchical clustering. output_soul_score_sm_sorted.xls: Pathway scores in the same order as in Clustering result.fig. 4*. Sorted samples of pathways from the hierarchical clustering result output_soul_gist_symb_sorted.xls; output_soul_gist_slist_sorted.xls; output_soul_wlist_sorted.xls; output_soul_gene_symb.xls; *: Intermediate output files.
17 Examples Running GIST With input files in the folder./inputs, run Main_GIST_func(8,10000, 200,1,'weight_parameters.txt'), and the program will show the running status in Matlab command window: Fig. 1 Running window of GIST After GIST algorithm completes, five output files (output_gist_results.mat, output_gist_network.xls, output_gist_node_attr.xls, output_gist_selectedpathscore.xls, output_gist_selectedpathsymb.xls) will be generated and saved in the folder./outputs.
18 Running SOUL Run main_soul_func(1) in Matlab, and the current status of the program will be displayed in command line window: Fig. 2 Running window of SOUL After SOUL program completes, the major results will be saved in the two files: ouput_soul.networ.xls and output_soul_node_attr.xls. In addition, the algorithm will also generate figures of pathway landscape: structural heatmap and re-organized potential distribution (Fig. 3).
19 Structural heatmap Fig. 3 Pathway landscape Clusters of pathways can be selected considering the figures in Fig. 3 to reestimate the pathway landscape, since the automatic selection only considers the similarity of the samples.
20 Re-running SOUL Select three clusters from the two figures, and modify the cluster_id.xls. For example, three clusters of samples are selected as shown in Fig.4. The corresponding index of the samples are [9:22 66: :179]. Then, run SOUL_part2() to estimate pathway landscape from the selected clusters of samples. Structural heatmap Fig. 4 Selected clusters of samples
CS 5984: Application of Basic Clustering Algorithms to Find Expression Modules in Cancer
CS 5984: Application of Basic Clustering Algorithms to Find Expression Modules in Cancer T. M. Murali January 31, 2006 Innovative Application of Hierarchical Clustering A module map showing conditional
More informationPackage BAGS. September 21, 2014
Type Package Title A Bayesian Approach for Geneset Selection Version 2.4.0 Date 2013-06-12 Author Alejandro Quiroz-Zarate Package BAGS September 21, 2014 Maintainer Alejandro Quiroz-Zarate
More informationCRNET User Manual (V2.2)
CRNET User Manual (V2.2) CRNET is designed to use time-course RNA-seq data for the refinement of functional regulatory networks (FRNs) from initial candidate networks that can be constructed from ChIP-seq
More informationPREDICTING PREVENTABLE ADVERSE EVENTS USING INTEGRATED SYSTEMS PHARMACOLOGY
PREDICTING PREVENTABLE ADVERSE EVENTS USING INTEGRATED SYSTEMS PHARMACOLOGY GUY HASKIN FERNALD 1, DORNA KASHEF 2, NICHOLAS P. TATONETTI 1 Center for Biomedical Informatics Research 1, Department of Computer
More informationUnderstanding protein lists from comparative proteomics studies
Understanding protein lists from comparative proteomics studies Bing Zhang, Ph.D. Department of Biomedical Informatics Vanderbilt University School of Medicine bing.zhang@vanderbilt.edu A typical comparative
More informationIPA Advanced Training Course
IPA Advanced Training Course Academia Sinica 2015 Oct Gene( 陳冠文 ) Supervisor and IPA certified analyst 1 Review for Introductory Training course Searching Building a Pathway Editing a Pathway for Publication
More informationNetwork System Inference
Network System Inference Francis J. Doyle III University of California, Santa Barbara Douglas Lauffenburger Massachusetts Institute of Technology WTEC Systems Biology Final Workshop March 11, 2005 What
More informationAnalysis of Microarray Data
Analysis of Microarray Data Lecture 3: Visualization and Functional Analysis George Bell, Ph.D. Senior Bioinformatics Scientist Bioinformatics and Research Computing Whitehead Institute Outline Review
More informationPackage GeneExpressionSignature
Package GeneExpressionSignature December 30, 2017 Title Gene Expression Signature based Similarity Metric Version 1.25.0 Date 2012-10-24 Author Yang Cao Maintainer Yang Cao , Fei
More informationA Propagation-based Algorithm for Inferring Gene-Disease Associations
A Propagation-based Algorithm for Inferring Gene-Disease Associations Oron Vanunu Roded Sharan Abstract: A fundamental challenge in human health is the identification of diseasecausing genes. Recently,
More informationAnalysis of a Tiling Regulation Study in Partek Genomics Suite 6.6
Analysis of a Tiling Regulation Study in Partek Genomics Suite 6.6 The example data set used in this tutorial consists of 6 technical replicates from the same human cell line, 3 are SP1 treated, and 3
More informationLees J.A., Vehkala M. et al., 2016 In Review
Sequence element enrichment analysis to determine the genetic basis of bacterial phenotypes Lees J.A., Vehkala M. et al., 2016 In Review Journal Club Triinu Kõressaar 16.03.2016 Introduction Bacterial
More informationBayesian Variable Selection and Data Integration for Biological Regulatory Networks
Bayesian Variable Selection and Data Integration for Biological Regulatory Networks Shane T. Jensen Department of Statistics The Wharton School, University of Pennsylvania stjensen@wharton.upenn.edu Gary
More informationRisk Management User Guide
Risk Management User Guide Version 17 December 2017 Contents About This Guide... 5 Risk Overview... 5 Creating Projects for Risk Management... 5 Project Templates Overview... 5 Add a Project Template...
More informationSmall-Molecule Drug Target Identification/Deconvolution Technologies
Small-Molecule Drug Target Identification/Deconvolution Technologies Case-Studies Shantani Target ID Technology Tool Box Target Deconvolution is not Trivial = A single Tool / Technology May Not necessarily
More informationEstimating Cell Cycle Phase Distribution of Yeast from Time Series Gene Expression Data
2011 International Conference on Information and Electronics Engineering IPCSIT vol.6 (2011) (2011) IACSIT Press, Singapore Estimating Cell Cycle Phase Distribution of Yeast from Time Series Gene Expression
More informationGene Set Enrichment Analysis! Robert Gentleman!
Gene Set Enrichment Analysis! Robert Gentleman! Outline! Description of the experimental setting! Defining gene sets! Description of the original GSEA algorithm! proposed by Mootha et al (2003)! Our approach
More informationRuns of Homozygosity Analysis Tutorial
Runs of Homozygosity Analysis Tutorial Release 8.7.0 Golden Helix, Inc. March 22, 2017 Contents 1. Overview of the Project 2 2. Identify Runs of Homozygosity 6 Illustrative Example...............................................
More informationProtein-Protein-Interaction Networks. Ulf Leser, Samira Jaeger
Protein-Protein-Interaction Networks Ulf Leser, Samira Jaeger SHK Stelle frei Ab 1.9.2015, 2 Jahre, 41h/Monat Verbundprojekt MaptTorNet: Pankreatische endokrine Tumore Insb. statistische Aufbereitung und
More informationIntroduction to Next Generation Sequencing (NGS) Data Analysis and Pathway Analysis. Jenny Wu
Introduction to Next Generation Sequencing (NGS) Data Analysis and Pathway Analysis Jenny Wu Outline Introduction to NGS data analysis in Cancer Genomics NGS applications in cancer research Typical NGS
More informationIn silico prediction of novel therapeutic targets using gene disease association data
In silico prediction of novel therapeutic targets using gene disease association data, PhD, Associate GSK Fellow Scientific Leader, Computational Biology and Stats, Target Sciences GSK Big Data in Medicine
More informationNext-Generation Sequencing Gene Expression Analysis Using Agilent GeneSpring GX
Next-Generation Sequencing Gene Expression Analysis Using Agilent GeneSpring GX Technical Overview Introduction RNA Sequencing (RNA-Seq) is one of the most commonly used next-generation sequencing (NGS)
More information: Genomic Regions Enrichment of Annotations Tool
http://great.stanford.edu/ : Genomic Regions Enrichment of Annotations Tool Gill Bejerano Dept. of Developmental Biology & Dept. of Computer Science Stanford University 1 Human Gene Regulation 10 13 different
More informationGene Annotation and Gene Set Analysis
Gene Annotation and Gene Set Analysis After you obtain a short list of genes/clusters/classifiers what next? For each gene, you may ask What it is What is does What processes is it involved in Which chromosome
More informationmassir: MicroArray Sample Sex Identifier
massir: MicroArray Sample Sex Identifier Sam Buckberry October 30, 2017 Contents 1 The Problem 2 2 Importing data and beginning the analysis 2 3 Extracting the Y chromosome probe data 3 4 Predicting the
More informationPackage FSTpackage. June 27, 2017
Type Package Package FSTpackage June 27, 2017 Title Unified Sequence-Based Association Tests Allowing for Multiple Functional Annotation Scores Version 0.1 Date 2016-12-14 Author Zihuai He Maintainer Zihuai
More informationFunction Prediction of Proteins from their Sequences with BAR 3.0
Open Access Annals of Proteomics and Bioinformatics Short Communication Function Prediction of Proteins from their Sequences with BAR 3.0 Giuseppe Profiti 1,2, Pier Luigi Martelli 2 and Rita Casadio 2
More informationIdentifying Signaling Pathways. BMI/CS 776 Spring 2016 Anthony Gitter
Identifying Signaling Pathways BMI/CS 776 www.biostat.wisc.edu/bmi776/ Spring 2016 Anthony Gitter gitter@biostat.wisc.edu Goals for lecture Challenges of integrating high-throughput assays Connecting relevant
More informationBiomedical Text Analysis
Biomedical Text Analysis BMI/CS 776 www.biostat.wisc.edu/bmi776/ Mark Craven craven@biostat.wisc.edu Spring 2009 Some Important Text-Mining Problems! hypothesis generation Given: biomedical objects/classes
More informationTechnical University of Denmark
1 of 13 Technical University of Denmark Written exam, 15 December 2007 Course name: Introduction to Systems Biology Course no. 27041 Aids allowed: Open Book Exam Provide your answers and calculations on
More informationA STUDY ON STATISTICAL BASED FEATURE SELECTION METHODS FOR CLASSIFICATION OF GENE MICROARRAY DATASET
A STUDY ON STATISTICAL BASED FEATURE SELECTION METHODS FOR CLASSIFICATION OF GENE MICROARRAY DATASET 1 J.JEYACHIDRA, M.PUNITHAVALLI, 1 Research Scholar, Department of Computer Science and Applications,
More informationAnalyzing Affymetrix GeneChip SNP 6 Copy Number Data in Partek
Analyzing Affymetrix GeneChip SNP 6 Copy Number Data in Partek This example data set consists of 20 selected HapMap samples, representing 10 females and 10 males, drawn from a mixed ethnic population of
More informationFunctional Enrichment Analysis & Candidate Gene Ranking
Functional Enrichment Analysis & Candidate Gene Ranking Anil Jegga Biomedical Informatics Contact Information: Anil Jegga Biomedical Informatics Room # 232, S Building 10th Floor CCHMC Homepage: http://anil.cchmc.org
More informationMicroarray Analysis of Gene Expression in Huntington's Disease Peripheral Blood - a Platform Comparison. CodeLink compatible
Microarray Analysis of Gene Expression in Huntington's Disease Peripheral Blood - a Platform Comparison CodeLink compatible Microarray Analysis of Gene Expression in Huntington's Disease Peripheral Blood
More informationQuick reference guide
Quick reference guide Our Invitrogen GeneArt CRISPR Search and Design Tool allows you to search our database of >600,000 predesigned CRISPR guide RNA (grna) sequences or analyze your sequence of interest
More informationIntro. ANN & Fuzzy Systems. Lecture 36 GENETIC ALGORITHM (1)
Lecture 36 GENETIC ALGORITHM (1) Outline What is a Genetic Algorithm? An Example Components of a Genetic Algorithm Representation of gene Selection Criteria Reproduction Rules Cross-over Mutation Potential
More informationWelcome to the Eastwood Harris Pty Ltd Primavera P6 Versions 8.2 EPPM Web Tool 2 day training course Enterprise Portfolio Project Management
Welcome to the Eastwood Harris Pty Ltd Primavera P6 Versions 8.2 EPPM Web Tool 2 day training course Enterprise Portfolio Project Management Page 2 Copyright Eastwood Harris Pty Ltd IMPORTANT POINTS FOR
More informationBCHM 6280 Tutorial: Gene specific information using NCBI, Ensembl and genome viewers
BCHM 6280 Tutorial: Gene specific information using NCBI, Ensembl and genome viewers Web resources: NCBI database: http://www.ncbi.nlm.nih.gov/ Ensembl database: http://useast.ensembl.org/index.html UCSC
More informationUseful P6 EPPM Terms Security Configuration Process in P6 EPPM Defining Global Security Profiles in P6 EPPM... 19
P6 EPPM Application Administration Guide Version 17 October 2017 Contents Cloud or On-Premises Content... 7 About the P6 EPPM Application Administration Guide... 9 P6 Setup Tasks... 11 Users and Security
More informationA Random Forest proximity matrix as a new measure for gene annotation *
A Random Forest proximity matrix as a new measure for gene annotation * Jose A. Seoane 1, Ian N.M. Day 1, Juan P. Casas 2, Colin Campbell 3 and Tom R. Gaunt 1,4 1 Bristol Genetic Epidemiology Labs. School
More informationRNA-Seq analysis using R: Differential expression and transcriptome assembly
RNA-Seq analysis using R: Differential expression and transcriptome assembly Beibei Chen Ph.D BICF 12/7/2016 Agenda Brief about RNA-seq and experiment design Gene oriented analysis Gene quantification
More informationVersion Comparison. Getting Started Project Planning. Feature Key:
Version Comparison Version Comparison The table below introduces the new features of 2010 and Project. It also shows features initially included in previous versions that have been improved in Project
More informationMachine Learning in Computational Biology CSC 2431
Machine Learning in Computational Biology CSC 2431 Lecture 9: Combining biological datasets Instructor: Anna Goldenberg What kind of data integration is there? What kind of data integration is there? SNPs
More informationPredictive Modeling Using SAS Visual Statistics: Beyond the Prediction
Paper SAS1774-2015 Predictive Modeling Using SAS Visual Statistics: Beyond the Prediction ABSTRACT Xiangxiang Meng, Wayne Thompson, and Jennifer Ames, SAS Institute Inc. Predictions, including regressions
More informationPackage geno2proteo. December 12, 2017
Type Package Package geno2proteo December 12, 2017 Title Finding the DNA and Protein Sequences of Any Genomic or Proteomic Loci Version 0.0.1 Date 2017-12-12 Author Maintainer biocviews
More informationmetaarray package for meta-analysis of microarray data
metaarray package for meta-analysis of microarray data Debashis Ghosh and Hyungwon Choi October 30, 2017 Introduction metaarray is a collection of functions for large-scale meta-analysis of microarray
More informationCELL BIOLOGY BIOL3030, 3 credits Fall 2012, Aug 20, Dec 14, 2012 Tuesday/Thursday 8:00 am-9:15 am Bowman-Oddy Laboratories 1049
INSTRUCTOR: Dr. Song-Tao Liu WO3254B Tel: 419-530-7853 Email: Song-Tao.Liu@utoledo.edu OFFICE HOURS CELL BIOLOGY BIOL3030, 3 credits Fall 2012, Aug 20, 2012 - Dec 14, 2012 Tuesday/Thursday 8:00 am-9:15
More informationWhole Transcriptome Analysis of Illumina RNA- Seq Data. Ryan Peters Field Application Specialist
Whole Transcriptome Analysis of Illumina RNA- Seq Data Ryan Peters Field Application Specialist Partek GS in your NGS Pipeline Your Start-to-Finish Solution for Analysis of Next Generation Sequencing Data
More informationTSSpredator User Guide v 1.00
TSSpredator User Guide v 1.00 Alexander Herbig alexander.herbig@uni-tuebingen.de Kay Nieselt kay.nieselt@uni-tuebingen.de June 3, 2013 1 Getting Started TSSpredator is a tool for the comparative detection
More information3 Ways to Improve Your Targeted Marketing with Analytics
3 Ways to Improve Your Targeted Marketing with Analytics Introduction Targeted marketing is a simple concept, but a key element in a marketing strategy. The goal is to identify the potential customers
More informationSolving Transportation Logistics Problems Using Advanced Evolutionary Optimization
Solving Transportation Logistics Problems Using Advanced Evolutionary Optimization Transportation logistics problems and many analogous problems are usually too complicated and difficult for standard Linear
More informationAnalysis of Microarray Data
Analysis of Microarray Data Lecture 1: Experimental Design and Data Normalization George Bell, Ph.D. Senior Bioinformatics Scientist Bioinformatics and Research Computing Whitehead Institute Outline Introduction
More informationV 1 Introduction! Fri, Oct 24, 2014! Bioinformatics 3 Volkhard Helms!
V 1 Introduction! Fri, Oct 24, 2014! Bioinformatics 3 Volkhard Helms! How Does a Cell Work?! A cell is a crowded environment! => many different proteins,! metabolites, compartments,! On a microscopic level!
More informationExploring a fatal outbreak of Escherichia coli using PATRIC
Exploring a fatal outbreak of Escherichia coli using PATRIC On May 19, 2011, the Robert Koch Institute, Germany's national-level public health authority, was informed about a cluster of three cases of
More informationOncomine cfdna Assays Part III: Variant Analysis
Oncomine cfdna Assays Part III: Variant Analysis USER GUIDE for use with: Oncomine Lung cfdna Assay Oncomine Colon cfdna Assay Oncomine Breast cfdna Assay Catalog Numbers A31149, A31182, A31183 Publication
More informationTime Series Motif Discovery
Time Series Motif Discovery Bachelor s Thesis Exposé eingereicht von: Jonas Spenger Gutachter: Dr. rer. nat. Patrick Schäfer Gutachter: Prof. Dr. Ulf Leser eingereicht am: 10.09.2017 Contents 1 Introduction
More informationOptimization-Based Peptide Mass Fingerprinting for Protein Mixture Identification
Optimization-Based Peptide Mass Fingerprinting for Protein Mixture Identification Weichuan Yu, Ph.D. Department of Electronic and Computer Engineering The Hong Kong University of Science and Technology
More informationGene expression connectivity mapping and its application to Cat-App
Gene expression connectivity mapping and its application to Cat-App Shu-Dong Zhang Northern Ireland Centre for Stratified Medicine University of Ulster Outline TITLE OF THE PRESENTATION Gene expression
More informationAb Initio SERVER PROTOTYPE FOR PREDICTION OF PHOSPHORYLATION SITES IN PROTEINS*
COMPUTATIONAL METHODS IN SCIENCE AND TECHNOLOGY 9(1-2) 93-100 (2003/2004) Ab Initio SERVER PROTOTYPE FOR PREDICTION OF PHOSPHORYLATION SITES IN PROTEINS* DARIUSZ PLEWCZYNSKI AND LESZEK RYCHLEWSKI BiolnfoBank
More informationEECS730: Introduction to Bioinformatics
EECS730: Introduction to Bioinformatics Lecture 14: Microarray Some slides were adapted from Dr. Luke Huan (University of Kansas), Dr. Shaojie Zhang (University of Central Florida), and Dr. Dong Xu and
More informationMicrosoft Dynamics GP Business Portal. Project Time and Expense User s Guide Release 3.0
Microsoft Dynamics GP Business Portal Project Time and Expense User s Guide Release 3.0 Copyright Copyright 2005 Microsoft Corporation. All rights reserved. Complying with all applicable copyright laws
More informationEvolutionary Computation. James A. Foster Prof., Biological Sciences Initiative for Bioinformatics & Evol. Studies University of Idaho
Evolutionary Computation James A. Foster Prof., Biological Sciences Initiative for Bioinformatics & Evol. Studies University of Idaho Outline Evolution: process, not material Case study: symbolic discriminant
More informationP6 Instructors Sample Presentation
Welcome to the Eastwood Harris Pty Ltd Primavera P6 Versions 8.1 to 8.4 Professional and Optional Client 3 day training course PMI REP No 3001 Course Number PP6 Page 2 Page 4 Administration Evacuation
More informationGetting Started with HLM 5. For Windows
For Windows Updated: August 2012 Table of Contents Section 1: Overview... 3 1.1 About this Document... 3 1.2 Introduction to HLM... 3 1.3 Accessing HLM... 3 1.4 Getting Help with HLM... 3 Section 2: Accessing
More informationPredictive Modeling using SAS. Principles and Best Practices CAROLYN OLSEN & DANIEL FUHRMANN
Predictive Modeling using SAS Enterprise Miner and SAS/STAT : Principles and Best Practices CAROLYN OLSEN & DANIEL FUHRMANN 1 Overview This presentation will: Provide a brief introduction of how to set
More informationCompendium of Immune Signatures Identifies Conserved and Species-Specific Biology in Response to Inflammation
Immunity Supplemental Information Compendium of Immune Signatures Identifies Conserved and Species-Specific Biology in Response to Inflammation Jernej Godec, Yan Tan, Arthur Liberzon, Pablo Tamayo, Sanchita
More informationChina National Grid --- BioNode. Jun Wang Beijing Genomics Institute
China National Grid --- BioNode Jun Wang Beijing Genomics Institute Core of life science and bio-tech: Getting, Mining, Applying the basic life information Old China meets New China? Sequencing, sequencing,
More informationGene-centered resources at NCBI
COURSE OF BIOINFORMATICS a.a. 2014-2015 Gene-centered resources at NCBI We searched Accession Number: M60495 AT NCBI Nucleotide Gene has been implemented at NCBI to organize information about genes, serving
More informationInference of predictive gene interaction networks
Inference of predictive gene interaction networks Benjamin Haibe-Kains DFCI/HSPH December 15, 2011 The Computational Biology and Functional Genomics Laboratory at the Dana-Farber Cancer Institute and Harvard
More informationPLINK gplink Haploview
PLINK gplink Haploview Whole genome association software tutorial Shaun Purcell Center for Human Genetic Research, Massachusetts General Hospital, Boston, MA Broad Institute of Harvard & MIT, Cambridge,
More informationOracle Hyperion Planning for Interactive Users
Oracle University Contact Us: 1.800.529.0165 Oracle Hyperion Planning 11.1.2 for Interactive Users Duration: 0 Days What you will learn This course is designed to teach you how to use Planning. It includes
More informationAn automated image processing routine for segmentation of cell cytoplasms in high-resolution autofluorescence images
An automated image processing routine for segmentation of cell cytoplasms in high-resolution autofluorescence images Alex J. Walsh a, Melissa C. Skala *a a Department of Biomedical Engineering, Vanderbilt
More informationAdditional file 2. Figure 1: Receiver operating characteristic (ROC) curve using the top
Additional file 2 Figure Legends: Figure 1: Receiver operating characteristic (ROC) curve using the top discriminatory features between HIV-infected (n=32) and HIV-uninfected (n=15) individuals. The top
More informationMetabolic Networks. Ulf Leser and Michael Weidlich
Metabolic Networks Ulf Leser and Michael Weidlich This Lecture Introduction Systems biology & modelling Metabolism & metabolic networks Network reconstruction Strategy & workflow Mathematical representation
More informationLead Scoring CRM Integration
http://docs.oracle.com Lead Scoring CRM Integration Configuration Guide 2018 Oracle Corporation. All rights reserved 02-Mar-2018 Contents 1 Lead Scoring CRM Integration Setup 3 2 Creating two new internal
More informationRFM analysis for decision support in e-banking area
RFM analysis for decision support in e-banking area VASILIS AGGELIS WINBANK PIRAEUS BANK Athens GREECE AggelisV@winbank.gr DIMITRIS CHRISTODOULAKIS Computer Engineering and Informatics Department University
More information2/23/16. Protein-Protein Interactions. Protein Interactions. Protein-Protein Interactions: The Interactome
Protein-Protein Interactions Protein Interactions A Protein may interact with: Other proteins Nucleic Acids Small molecules Protein-Protein Interactions: The Interactome Experimental methods: Mass Spec,
More informationRESIDUAL FILES IN HLM OVERVIEW
HLML09_20120119 RESIDUAL FILES IN HLM Ralph B. Taylor All materials copyright (c) 1998-2012 by Ralph B. Taylor OVERVIEW Residual files in multilevel models where people are grouped into some type of cluster
More informationDesign Microarray Probes
Design Microarray Probes Erik S. Wright October 30, 2017 Contents 1 Introduction 1 2 Getting Started 1 2.1 Startup................................................ 1 2.2 Creating a Sequence Database....................................
More informationIntroduction to gene expression microarray data analysis
Introduction to gene expression microarray data analysis Outline Brief introduction: Technology and data. Statistical challenges in data analysis. Preprocessing data normalization and transformation. Useful
More informationAnalytical Capability Security Compute Ease Data Scale Price Users Traditional Statistics vs. Machine Learning In-Memory vs. Shared Infrastructure CRAN vs. Parallelization Desktop vs. Remote Explicit vs.
More informationPackage goseq. R topics documented: December 23, 2017
Package goseq December 23, 2017 Version 1.30.0 Date 2017/09/04 Title Gene Ontology analyser for RNA-seq and other length biased data Author Matthew Young Maintainer Nadia Davidson ,
More informationSocial Media Analytics
Social Media Analytics Outline Case Study : Twitter Analytics and Text Analytics Role of Social Media Analytics in Business Intelligence About AlgoAnalytics Page 2 Case Study : Twitter and Text Analytics
More informationIdentifying Candidate Informative Genes for Biomarker Prediction of Liver Cancer
Identifying Candidate Informative Genes for Biomarker Prediction of Liver Cancer Nagwan M. Abdel Samee 1, Nahed H. Solouma 2, Mahmoud Elhefnawy 3, Abdalla S. Ahmed 4, Yasser M. Kadah 5 1 Computer Engineering
More informationSIMS2003. Instructors:Rus Yukhananov, Alex Loguinov BWH, Harvard Medical School. Introduction to Microarray Technology.
SIMS2003 Instructors:Rus Yukhananov, Alex Loguinov BWH, Harvard Medical School Introduction to Microarray Technology. Lecture 1 I. EXPERIMENTAL DETAILS II. ARRAY CONSTRUCTION III. IMAGE ANALYSIS Lecture
More informationcomputational analysis of cell-to-cell heterogeneity in single-cell rna-sequencing data reveals hidden subpopulations of cells
computational analysis of cell-to-cell heterogeneity in single-cell rna-sequencing data reveals hidden subpopulations of cells Buettner et al., (2015) Nature Biotechnology, 1 32. doi:10.1038/nbt.3102 Saket
More informationStrategic Pricing. version 12.17
version 12.17 Disclaimer This document is for informational purposes only and is subject to change without notice. This document and its contents, including the viewpoints, dates and functional content
More informationPost-assembly Data Analysis
Assembled transcriptome Post-assembly Data Analysis Quantification: get expression for each gene in each sample Genes differentially expressed between samples Clustering/network analysis Identifying over-represented
More informationStefano Monti. Workshop Format
Gad Getz Stefano Monti Michael Reich {gadgetz,smonti,mreich}@broad.mit.edu http://www.broad.mit.edu/~smonti/aws Broad Institute of MIT & Harvard October 18-20, 2006 Cambridge, MA Workshop Format Morning
More informationToday. Last time. Lecture 5: Discrimination (cont) Jane Fridlyand. Oct 13, 2005
Biological question Experimental design Microarray experiment Failed Lecture : Discrimination (cont) Quality Measurement Image analysis Preprocessing Jane Fridlyand Pass Normalization Sample/Condition
More informationSupplier Guide for Plex Online. Supplier Web Access, Review Releases, Submit ASN & Label Printing
Supplier Guide for Plex Online Supplier Web Access, Review Releases, Submit ASN & Label Printing July 2017 Supplier Web Access Access to PLEX A Supplier must have a User ID and Password to access PLEX
More informationExploring Similarities of Conserved Domains/Motifs
Exploring Similarities of Conserved Domains/Motifs Sotiria Palioura Abstract Traditionally, proteins are represented as amino acid sequences. There are, though, other (potentially more exciting) representations;
More informationConcur Expense Integrator
Microsoft Dynamics GP Concur Expense Integrator This documentation describes how to use Concur Expense Integrator. The integration allows you to use Concur Expense to create, submit, and approve expense
More informationCreate a Planned Run. Using the Ion AmpliSeq Pharmacogenomics Research Panel Plugin USER BULLETIN. Publication Number MAN Revision A.
USER BULLETIN Create a Planned Run Using the Ion AmpliSeq Pharmacogenomics Research Panel Plugin Publication Number MAN0013730 Revision A.0 For Research Use Only. Not for use in diagnostic procedures.
More informationBSS ORACLE Toolbox User Guide
BSS ORACLE Toolbox User Guide Emmanuel Vincent Rémi Gribonval 29th August 2005, V ersion 1.0 2 Contents 1 Getting started 5 1.1 License - no warranty................................. 5 1.2 Cite this as:.......................................
More informationTIMETABLING EXPERIMENTS USING GENETIC ALGORITHMS. Liviu Lalescu, Costin Badica
TIMETABLING EXPERIMENTS USING GENETIC ALGORITHMS Liviu Lalescu, Costin Badica University of Craiova, Faculty of Control, Computers and Electronics Software Engineering Department, str.tehnicii, 5, Craiova,
More informationSVMerge Output File Format Specification Sheet
SVMerge Output File Format Specification Sheet Document Number: 30165 Document Revision: C For Research Use Only. Not for use in diagnostic procedures. Copyright 2017 Bionano Genomics, Inc. All Rights
More informationWelcome to the Eastwood Harris Pty Ltd Primavera P6 Version 7 and Earlier Versions 3 day training course
Welcome to the Eastwood Harris Pty Ltd Primavera P6 Version 7 and Earlier Versions 3 day training course PMI REP No 3001 Course Number PP6 Administration Evacuation Timings and meals Facilities Mobile
More informationCorporate Profile
www.datamine.gr Corporate Profile 1 www.datamine.gr 2 Contents About Datamine 4 Innovative Products for Demanding Business Scenarios 5 CAS for Telecoms 6 CAS for Retailers 7 Segment Designer 8 Corporate
More informationChromosome-scale scaffolding of de novo genome assemblies based on chromatin interactions. Supplementary Material
Chromosome-scale scaffolding of de novo genome assemblies based on chromatin interactions Joshua N. Burton 1, Andrew Adey 1, Rupali P. Patwardhan 1, Ruolan Qiu 1, Jacob O. Kitzman 1, Jay Shendure 1 1 Department
More information