Data Intensive Scientific Discovery Vijay Chandru

Size: px
Start display at page:

Download "Data Intensive Scientific Discovery Vijay Chandru"

Transcription

1 Data Intensive Scientific Discovery Vijay Chandru Hon. Professor, NIAS Chairman, Strand Life Sciences

2 The Promise Peta (10 15 )and Exa (10 18 ) scale Computing Astrophysics (Large Synoptic Survey Telescope) Materials Science (Nanoscale Chemistry & Physics) Earth Science (data assimilation for ocean, carbon cycle, etc.) Energy Assurance (combustion, power grids, fusion physics) Fundamental Science (Accelerator Physics, RHIC LHC) Biology & Medicine (1000 Genomes, Real-time Biology) National Security (Cybersecurity, Weapons Simulations) Engineering Design (Communication Networks)

3 The Challenge There is a crisis in all sciences these days. We are drowning in a sea of data, and yet we are thirsty. - Sydney Brenner, at IISc, 2008 The IT Challenge Storage, Computing The Computer Science Challenge Algorithm design and implementation The Mathematics Challenge Statistical Analysis, Systems Theory The Multi-Disciplinarity Challenge Contextual problem solving

4 The Mathematics Challenges Visualization Statistics and Optimization Uncertainty Quantification Mumford - persistence Models Statistical Ab Initio Simulation

5 The Algorithms Challenges Visualization Scalability Machine Learning Network and Graph Analysis Analysis of Streaming Data Text Mining Distributed Data Architectures Data and Dimension Reduction

6 Cultural Challenges Mathematicians and Applications Research Communities Computer Scientists as Intellectual Partners not Technicians Problem driven, directed funding forcing multi-disciplinary collaborations.

7 At the end of the last millennium Today, the most successful craft industries are concerned with software and biotechnology Freeman Dyson, The Sun, The Genome, The Internet: Tools of scientific revolutions, 1999 Biology should keep Computer Scientists busy for at least 50 years Donald Knuth, Vision for the 21 st Century, 1999 In 50 years people will assume that computers and computing were actually developed for biology Buzz at Yorktown Heights,

8 There is a crisis in all sciences these days. We are drowning in a sea of data, and yet we are thirsty. - Sydney Brenner, at IISc One NGS run generates 3x the sequence data generated during the Human Genome Project over 13 years. Current by 2010 Size of data from 1 run 1 TB 5 TB Data from these centers need to be acquired, analyzed, interpreted, viewed, managed, stored, compared and shared effectively & securely. 8

9 Genomics Data Deluge Growth in number of bases deposited in EMBL ( ) The size in data volume and nucleotide numbers on EMBL, trace archive & SRA The Genomes OnLine Database Instrument currently using: One human genome (30x cov) raw data: ~90Gb; 1 Billion bp raw reads; intermediate data: Gb; tertiary data: ~10Gb

10 Next Gen Sequencing analysis

11 Reads up close

12 NGS challenges One sample could have a billion reads Align them against the reference (a few days) Analyse for SNP patterns Do analysis for multiple several disease and normal samples Statistically determine which SNPs are correlated with the disease

13 Central Dogma of Biology Transcription factors are proteins that bind to the DNA and trigger this sequence

14 Control of a gene Copyright 2002, Bruce Alberts, Alexander Johnson, Julian Lewis, Martin Raff, Keith Roberts, and Peter Walter; Copyright 1983, 1989, 1994, Bruce Alberts, Dennis Bray, Julian Lewis, Martin Raff, Keith Roberts, and James D. Watson

15 Self-protection

16 Heat protection Pockley, G. (2001) Heat shock proteins in health and disease, Expert Reviews in Molecular Medicine. Cambridge University Press;

17

18 Gene expression at various stages A gene regulatory network armature for T lymphocyte specification, PNAS December 23, 2008 vol. 105 no

19 Next Gen Sequencing (NGS) ChIP-Seq: Each experiment is for one regulatory protein, x Analysis output is the list of DNA regions to which the protein x binds Can hypothesize that the genes in these regions are regulated in some manner by x. RNA-Seq: Determines the expression levels of all genes in the sample.

20 Interpreting ChIP-Seq and RNA-Seq together Heat on Heat off ChIP-Seq HSTFs bind near heat shock genes CHBF binds near heat shock genes RNA-Seq Heat shock protein levels are up. Heat shock protein levels are down. When heat is on, HSTF upregulate expression of heat shock proteins When heat is off, CHBF suppresses expression of heat shock proteins Need 4 ChIP-Seq experiments (num conditions X num Tfs) and 2 RNA-Seq experiments (num conditions) to reach this conclusion

21 Grand Idea For a particular condition ChIP-Seq experiment for TF X, tells us which all genes could X effect Conduct ChIP-Seq experiments for all Tfs to know exactly which combination of Tfs are binding ahead of which gene Conduct an RNA-Seq expriment to determine the expression levels of each gene. Repeat for all conditions Now we know under condition C, protein X,Y,Z were bound upstream of gene G with expression E Solving all these equations will give an idea of the regulatory network across the range of conditions

22 Biomarker Collaboration with IISc- Breast cancer Goal: Breast cancer marker discovery program Kidwai Memorial Institute of Oncology Indian Institute of Science Strand Life Sciences Patient samples & Histopathology RNA preps and Microarrays Data analysis (Putative markers) Pathway based analysis of known cancer targets revels consistent up-regulation of therapy targets across multiple datasets, in a rare subclass of triple negative breast cancer. Results have been confirmed in a 80 breast cancer patients of the Indian cohort. Ongoing: testing hypothesis about pathway combination therapies to inactivate a pathway, instead of individual targets.

23 ERBBs Triple Negative vs Rest * * A PLCx PLCxx D Cross-talk? JAK1 JAK2 E C * F STAT3 STAT5 G I * Receptor degradation Transformation Differentiation Apoptosis Proliferation Differentiation Tumor survival Cell proliferation Oncogenesis

24 A global In Vivo Drosophila RNAi Screen Identifies NOT3 as a Conserved Regulator of Heart Function, Cell, April 2010 Drosophila RNAi screen data Human Ortholog analysis Mouse Gene Ontology KEGG GSEA Find first degree neighbors and build connected network Heart Systems Map

25 Systems map of Cardiac function Find first degree neighbors and build connected network

26 Introducing Scientific Intelligence Business Intelligence Put results in business context Scientific Context Put results in scientific context Scientific Intelligence Scientific Visualization Analyze & visualize vast amounts of data Systems Modeling Create mathematical models Application of data integration, analysis and visualization, scientific context and modeling to effectively mine large amounts of data, from varied sources, and convert it to usable knowledge, insight and decisions 26

27 The Power of Scientific Intelligence in Genomics Proteomics Next Generation Sequencing Tox/ADME Clinical Decisions Microscopy 27

28 AVADIS The Scientific Intelligence Platform The AVADIS platform is rich development platform for the management, analysis and visualization of complex scientific data Written in JAVA with JYTHON scripting capabilities Produces rich, interactive environments for data exploration Optimized for tackling life science-specific problems The AVADIS Platform 28

IPA : Maximizing the Biological Interpretation of Gene, Transcript & Protein Expression Data with IPA

IPA : Maximizing the Biological Interpretation of Gene, Transcript & Protein Expression Data with IPA IPA : Maximizing the Biological Interpretation of Gene, Transcript & Protein Expression Data with IPA Marisa Chen Account Manager Qiagen Advanced Genomics Marisa.Chen@qiagen.com (203) 500-1237 Dev Mistry,

More information

Whole Transcriptome Analysis of Illumina RNA- Seq Data. Ryan Peters Field Application Specialist

Whole Transcriptome Analysis of Illumina RNA- Seq Data. Ryan Peters Field Application Specialist Whole Transcriptome Analysis of Illumina RNA- Seq Data Ryan Peters Field Application Specialist Partek GS in your NGS Pipeline Your Start-to-Finish Solution for Analysis of Next Generation Sequencing Data

More information

Agilent Genomics Software Future Directions

Agilent Genomics Software Future Directions Agilent Genomics Software Future Directions Michael Rosenberg, PhD Director, Genomics Software Agilent: A Focused Measurement Company Serving Diverse End Markets Electronic Measurement 2008 Revenue: $3.6

More information

Introduction to Bioinformatics and Gene Expression Technologies

Introduction to Bioinformatics and Gene Expression Technologies Introduction to Bioinformatics and Gene Expression Technologies Utah State University Fall 2017 Statistical Bioinformatics (Biomedical Big Data) Notes 1 1 Vocabulary Gene: hereditary DNA sequence at a

More information

Introduction to Bioinformatics

Introduction to Bioinformatics Introduction to Bioinformatics Alla L Lapidus, Ph.D. SPbSU St. Petersburg Term Bioinformatics Term Bioinformatics was invented by Paulien Hogeweg (Полина Хогевег) and Ben Hesper in 1970 as "the study of

More information

Bioinformatics. Ingo Ruczinski. Some selected examples... and a bit of an overview

Bioinformatics. Ingo Ruczinski. Some selected examples... and a bit of an overview Bioinformatics Some selected examples... and a bit of an overview Department of Biostatistics Johns Hopkins Bloomberg School of Public Health July 19, 2007 @ EnviroHealth Connections Bioinformatics and

More information

Ion S5 and Ion S5 XL Systems

Ion S5 and Ion S5 XL Systems Ion S5 and Ion S5 XL Systems Targeted sequencing has never been simpler Explore the Ion S5 and Ion S5 XL Systems Adopting next-generation sequencing (NGS) in your lab is now simpler than ever The Ion S5

More information

China National Grid --- BioNode. Jun Wang Beijing Genomics Institute

China National Grid --- BioNode. Jun Wang Beijing Genomics Institute China National Grid --- BioNode Jun Wang Beijing Genomics Institute Core of life science and bio-tech: Getting, Mining, Applying the basic life information Old China meets New China? Sequencing, sequencing,

More information

Course Presentation. Ignacio Medina Presentation

Course Presentation. Ignacio Medina Presentation Course Index Introduction Agenda Analysis pipeline Some considerations Introduction Who we are Teachers: Marta Bleda: Computational Biologist and Data Analyst at Department of Medicine, Addenbrooke's Hospital

More information

QIAGEN s NGS Solutions for Biomarkers NGS & Bioinformatics team QIAGEN (Suzhou) Translational Medicine Co.,Ltd

QIAGEN s NGS Solutions for Biomarkers NGS & Bioinformatics team QIAGEN (Suzhou) Translational Medicine Co.,Ltd QIAGEN s NGS Solutions for Biomarkers NGS & Bioinformatics team QIAGEN (Suzhou) Translational Medicine Co.,Ltd 1 Our current NGS & Bioinformatics Platform 2 Our NGS workflow and applications 3 QIAGEN s

More information

Top 5 Lessons Learned From MAQC III/SEQC

Top 5 Lessons Learned From MAQC III/SEQC Top 5 Lessons Learned From MAQC III/SEQC Weida Tong, Ph.D Division of Bioinformatics and Biostatistics, NCTR/FDA Weida.tong@fda.hhs.gov; 870 543 7142 1 MicroArray Quality Control (MAQC) An FDA led community

More information

Introduction to Bioinformatics

Introduction to Bioinformatics Introduction to Bioinformatics Richard Corbett Canada s Michael Smith Genome Sciences Centre Vancouver, British Columbia June 28, 2017 Our mandate is to advance knowledge about cancer and other diseases

More information

Overview of Health Informatics. ITI BMI-Dept

Overview of Health Informatics. ITI BMI-Dept Overview of Health Informatics ITI BMI-Dept Fellowship Week 5 Overview of Health Informatics ITI, BMI-Dept Day 10 7/5/2010 2 Agenda 1-Bioinformatics Definitions 2-System Biology 3-Bioinformatics vs Computational

More information

Gene Regulation Solutions. Microarrays and Next-Generation Sequencing

Gene Regulation Solutions. Microarrays and Next-Generation Sequencing Gene Regulation Solutions Microarrays and Next-Generation Sequencing Gene Regulation Solutions The Microarrays Advantage Microarrays Lead the Industry in: Comprehensive Content SurePrint G3 Human Gene

More information

Bridging the Gap Between Basic and Clinical Research. Julio E. Celis Danish Cancer Society

Bridging the Gap Between Basic and Clinical Research. Julio E. Celis Danish Cancer Society Bridging the Gap Between Basic and Clinical Research Julio E. Celis Danish Cancer Society Barriers and Oportunities in Translational Research Promise of the new technologies What is Europe doing? Challenges

More information

ACCELERATING GENOMIC ANALYSIS ON THE CLOUD. Enabling the PanCancer Analysis of Whole Genomes (PCAWG) consortia to analyze thousands of genomes

ACCELERATING GENOMIC ANALYSIS ON THE CLOUD. Enabling the PanCancer Analysis of Whole Genomes (PCAWG) consortia to analyze thousands of genomes ACCELERATING GENOMIC ANALYSIS ON THE CLOUD Enabling the PanCancer Analysis of Whole Genomes (PCAWG) consortia to analyze thousands of genomes Enabling the PanCancer Analysis of Whole Genomes (PCAWG) consortia

More information

Network System Inference

Network System Inference Network System Inference Francis J. Doyle III University of California, Santa Barbara Douglas Lauffenburger Massachusetts Institute of Technology WTEC Systems Biology Final Workshop March 11, 2005 What

More information

Gene expression connectivity mapping and its application to Cat-App

Gene expression connectivity mapping and its application to Cat-App Gene expression connectivity mapping and its application to Cat-App Shu-Dong Zhang Northern Ireland Centre for Stratified Medicine University of Ulster Outline TITLE OF THE PRESENTATION Gene expression

More information

SYMPOSIUM March 22-23, 2018

SYMPOSIUM March 22-23, 2018 Bigger and Better Data Lessons from Frontlines of Precision Medicine Getting Your Transformation Right Frank Lee PhD IBM Global Industry Leader for Systems Group SYMPOSIUM March 22-23, 2018 5th Annual

More information

Microbial Metabolism Systems Microbiology

Microbial Metabolism Systems Microbiology 1 Microbial Metabolism Systems Microbiology Ching-Tsan Huang ( 黃慶璨 ) Office: Agronomy Hall, Room 111 Tel: (02) 33664454 E-mail: cthuang@ntu.edu.tw MIT OCW Systems Microbiology aims to integrate basic biological

More information

Personalized Medicine

Personalized Medicine Personalized Medicine Dr. Pablo Mentzinis, Director Government Relations, SAP SE Courtesy by Dominik Bertram, Marc von der Linden, Péter Adorján, SAP June 2016 1. Traditional medicine vs. personalized

More information

E2ES to Accelerate Next-Generation Genome Analysis in Clinical Research

E2ES to Accelerate Next-Generation Genome Analysis in Clinical Research www.hcltech.com E2ES to Accelerate Next-Generation Genome Analysis in Clinical Research whitepaper April 2015 TABLE OF CONTENTS Introduction 3 Challenges associated with NGS data analysis 3 HCL s NGS Solution

More information

Leonardo Mariño-Ramírez, PhD NCBI / NLM / NIH. BIOL 7210 A Computational Genomics 2/18/2015

Leonardo Mariño-Ramírez, PhD NCBI / NLM / NIH. BIOL 7210 A Computational Genomics 2/18/2015 Leonardo Mariño-Ramírez, PhD NCBI / NLM / NIH BIOL 7210 A Computational Genomics 2/18/2015 The $1,000 genome is here! http://www.illumina.com/systems/hiseq-x-sequencing-system.ilmn Bioinformatics bottleneck

More information

Year III Pharm.D Dr. V. Chitra

Year III Pharm.D Dr. V. Chitra Year III Pharm.D Dr. V. Chitra 1 Genome entire genetic material of an individual Transcriptome set of transcribed sequences Proteome set of proteins encoded by the genome 2 Only one strand of DNA serves

More information

From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow

From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow Technical Overview Import VCF Introduction Next-generation sequencing (NGS) studies have created unanticipated challenges with

More information

Corporate Medical Policy

Corporate Medical Policy Corporate Medical Policy Proteogenomic Testing for Patients with Cancer (GPS Cancer Test) File Name: Origination: Last CAP Review: Next CAP Review: Last Review: proteogenomic_testing_for_patients_with_cancer_gps_cancer_test

More information

Welcome to the NGS webinar series

Welcome to the NGS webinar series Welcome to the NGS webinar series Webinar 1 NGS: Introduction to technology, and applications NGS Technology Webinar 2 Targeted NGS for Cancer Research NGS in cancer Webinar 3 NGS: Data analysis for genetic

More information

MAYO CLINIC CENTER FOR BIOMEDICAL DISCOVERY EXCEPTIONAL RESEARCH LEADS TO EXCEPTIONAL PATIENT CARE

MAYO CLINIC CENTER FOR BIOMEDICAL DISCOVERY EXCEPTIONAL RESEARCH LEADS TO EXCEPTIONAL PATIENT CARE MAYO CLINIC CENTER FOR BIOMEDICAL DISCOVERY EXCEPTIONAL RESEARCH LEADS TO EXCEPTIONAL PATIENT CARE THE RESEARCH WE DO TODAY WILL DETERMINE THE TYPE OF MEDICAL AND SURGICAL PRACTICE WE CARRY ON AT THE CLINIC

More information

TECHNOLOGIES, PRODUCTS & SERVICES for MOLECULAR DIAGNOSTICS, MDx ABA 298

TECHNOLOGIES, PRODUCTS & SERVICES for MOLECULAR DIAGNOSTICS, MDx ABA 298 DIAGNOSTICS BUSINESS ANALYSIS SERIES: TECHNOLOGIES, PRODUCTS & SERVICES for MOLECULAR DIAGNOSTICS, MDx ABA 298 By ADAMS BUSINESS ASSOCIATES March 2017. March 2017 ABA 298 1 Technologies, Products & Services

More information

Next-Generation Sequencing Gene Expression Analysis Using Agilent GeneSpring GX

Next-Generation Sequencing Gene Expression Analysis Using Agilent GeneSpring GX Next-Generation Sequencing Gene Expression Analysis Using Agilent GeneSpring GX Technical Overview Introduction RNA Sequencing (RNA-Seq) is one of the most commonly used next-generation sequencing (NGS)

More information

Introduction to Bioinformatics and Gene Expression Technology

Introduction to Bioinformatics and Gene Expression Technology Vocabulary Introduction to Bioinformatics and Gene Expression Technology Utah State University Spring 2014 STAT 5570: Statistical Bioinformatics Notes 1.1 Gene: Genetics: Genome: Genomics: hereditary DNA

More information

Introducing a Highly Integrated Approach to Translational Research: Biomarker Data Management, Data Integration, and Collaboration

Introducing a Highly Integrated Approach to Translational Research: Biomarker Data Management, Data Integration, and Collaboration Introducing a Highly Integrated Approach to Translational Research: Biomarker Data Management, Data Integration, and Collaboration 1 2 Translational Informatics Overview Precision s suite of services is

More information

DNA Transcription. Visualizing Transcription. The Transcription Process

DNA Transcription. Visualizing Transcription. The Transcription Process DNA Transcription By: Suzanne Clancy, Ph.D. 2008 Nature Education Citation: Clancy, S. (2008) DNA transcription. Nature Education 1(1) If DNA is a book, then how is it read? Learn more about the DNA transcription

More information

Introduction to the UCSC genome browser

Introduction to the UCSC genome browser Introduction to the UCSC genome browser Dominik Beck NHMRC Peter Doherty and CINSW ECR Fellow, Senior Lecturer Lowy Cancer Research Centre, UNSW and Centre for Health Technology, UTS SYDNEY NSW AUSTRALIA

More information

Introduction to Bioinformatics

Introduction to Bioinformatics Introduction to Bioinformatics Dortmund, 16.-20.07.2007 Lectures: Sven Rahmann Exercises: Udo Feldkamp, Michael Wurst 1 Goals of this course Learn about Software tools Databases Methods (Algorithms) in

More information

Péter Antal Ádám Arany Bence Bolgár András Gézsi Gergely Hajós Gábor Hullám Péter Marx András Millinghoffer László Poppe Péter Sárközy BIOINFORMATICS

Péter Antal Ádám Arany Bence Bolgár András Gézsi Gergely Hajós Gábor Hullám Péter Marx András Millinghoffer László Poppe Péter Sárközy BIOINFORMATICS Péter Antal Ádám Arany Bence Bolgár András Gézsi Gergely Hajós Gábor Hullám Péter Marx András Millinghoffer László Poppe Péter Sárközy BIOINFORMATICS The Bioinformatics book covers new topics in the rapidly

More information

The Integrated Biomedical Sciences Graduate Program

The Integrated Biomedical Sciences Graduate Program The Integrated Biomedical Sciences Graduate Program at the university of notre dame Cutting-edge biomedical research and training that transcends traditional departmental and disciplinary boundaries to

More information

Cancer ImmunoTherapy Accelerator (CITA) Dr Shalini Jadeja

Cancer ImmunoTherapy Accelerator (CITA) Dr Shalini Jadeja Cancer ImmunoTherapy Accelerator (CITA) Dr Shalini Jadeja The Vision To develop an integrative London initiative that emerges as the UK hub for immunotherapy of cancer focusing on: Safe and effective

More information

B I O I N F O R M A T I C S

B I O I N F O R M A T I C S B I O I N F O R M A T I C S Kristel Van Steen, PhD 2 Montefiore Institute - Systems and Modeling GIGA - Bioinformatics ULg kristel.vansteen@ulg.ac.be SUPPLEMENTARY CHAPTER: DATA BASES AND MINING 1 What

More information

BIOINFORMATICS AND SYSTEM BIOLOGY (INTERNATIONAL PROGRAM)

BIOINFORMATICS AND SYSTEM BIOLOGY (INTERNATIONAL PROGRAM) BIOINFORMATICS AND SYSTEM BIOLOGY (INTERNATIONAL PROGRAM) PROGRAM TITLE DEGREE TITLE Master of Science Program in Bioinformatics and System Biology (International Program) Master of Science (Bioinformatics

More information

Gene Identification in silico

Gene Identification in silico Gene Identification in silico Nita Parekh, IIIT Hyderabad Presented at National Seminar on Bioinformatics and Functional Genomics, at Bioinformatics centre, Pondicherry University, Feb 15 17, 2006. Introduction

More information

3.1.4 DNA Microarray Technology

3.1.4 DNA Microarray Technology 3.1.4 DNA Microarray Technology Scientists have discovered that one of the differences between healthy and cancer is which genes are turned on in each. Scientists can compare the gene expression patterns

More information

TOTAL CANCER CARE: CREATING PARTNERSHIPS TO ADDRESS PATIENT NEEDS

TOTAL CANCER CARE: CREATING PARTNERSHIPS TO ADDRESS PATIENT NEEDS TOTAL CANCER CARE: CREATING PARTNERSHIPS TO ADDRESS PATIENT NEEDS William S. Dalton, PhD, MD CEO, M2Gen & Director, Personalized Medicine Institute, Moffitt Cancer Center JULY 15, 2013 MOFFITT CANCER CENTER

More information

Introduction to ChIP Seq data analyses. Acknowledgement: slides taken from Dr. H

Introduction to ChIP Seq data analyses. Acknowledgement: slides taken from Dr. H Introduction to ChIP Seq data analyses Acknowledgement: slides taken from Dr. H Wu @Emory ChIP seq: Chromatin ImmunoPrecipitation it ti + sequencing Same biological motivation as ChIP chip: measure specific

More information

Introduction to Bioinformatics

Introduction to Bioinformatics Introduction to Bioinformatics IMBB 2017 RAB, Kigali - Rwanda May 02 13, 2017 Joyce Nzioki Plan for the Week Introduction to Bioinformatics Raw sanger sequence data Introduction to CLC Bio Quality Control

More information

Neural Networks and Applications in Bioinformatics. Yuzhen Ye School of Informatics and Computing, Indiana University

Neural Networks and Applications in Bioinformatics. Yuzhen Ye School of Informatics and Computing, Indiana University Neural Networks and Applications in Bioinformatics Yuzhen Ye School of Informatics and Computing, Indiana University Contents Biological problem: promoter modeling Basics of neural networks Perceptrons

More information

The Pathways to Understanding Diseases

The Pathways to Understanding Diseases FOR PHARMA & LIFE SCIENCES White paper The Pathways to Understanding Diseases Deciphering complex biological processes Executive Summary Understanding complex biological processes is critical whether doing

More information

Introduction to BIOINFORMATICS

Introduction to BIOINFORMATICS Introduction to BIOINFORMATICS Antonella Lisa CABGen Centro di Analisi Bioinformatica per la Genomica Tel. 0382-546361 E-mail: lisa@igm.cnr.it http://www.igm.cnr.it/pagine-personali/lisa-antonella/ What

More information

Cancer Genetics Solutions

Cancer Genetics Solutions Cancer Genetics Solutions Cancer Genetics Solutions Pushing the Boundaries in Cancer Genetics Cancer is a formidable foe that presents significant challenges. The complexity of this disease can be daunting

More information

RNA-Seq with the Tuxedo Suite

RNA-Seq with the Tuxedo Suite RNA-Seq with the Tuxedo Suite Monica Britton, Ph.D. Sr. Bioinformatics Analyst September 2015 Workshop The Basic Tuxedo Suite References Trapnell C, et al. 2009 TopHat: discovering splice junctions with

More information

PCR Arrays. An Advanced Real-time PCR Technology to Empower Your Pathway Analysis

PCR Arrays. An Advanced Real-time PCR Technology to Empower Your Pathway Analysis PCR Arrays An Advanced Real-time PCR Technology to Empower Your Pathway Analysis 1 Table of Contents 1. Introduction to the PCR Arrays 2. How PCR Arrays Work 3. Performance Data from PCR Arrays 4. Research

More information

6. GENE EXPRESSION ANALYSIS MICROARRAYS

6. GENE EXPRESSION ANALYSIS MICROARRAYS 6. GENE EXPRESSION ANALYSIS MICROARRAYS BIOINFORMATICS COURSE MTAT.03.239 16.10.2013 GENE EXPRESSION ANALYSIS MICROARRAYS Slides adapted from Konstantin Tretyakov s 2011/2012 and Priit Adlers 2010/2011

More information

BIOINFORMATICS Introduction

BIOINFORMATICS Introduction BIOINFORMATICS Introduction Mark Gerstein, Yale University bioinfo.mbb.yale.edu/mbb452a 1 (c) Mark Gerstein, 1999, Yale, bioinfo.mbb.yale.edu What is Bioinformatics? (Molecular) Bio -informatics One idea

More information

Lecture #1. Introduction to microarray technology

Lecture #1. Introduction to microarray technology Lecture #1 Introduction to microarray technology Outline General purpose Microarray assay concept Basic microarray experimental process cdna/two channel arrays Oligonucleotide arrays Exon arrays Comparing

More information

MOLECULAR BIOLOGY OF EUKARYOTES 2016 SYLLABUS

MOLECULAR BIOLOGY OF EUKARYOTES 2016 SYLLABUS 03-442 Lectures: MWF 9:30-10:20 a.m. Doherty Hall 2105 03-742 Advanced Discussion Section: Time and place to be announced Probably Mon 4-6 p.m. or 6-8p.m.? Once we establish who is taking the advanced

More information

Optimization of RNAi Targets on the Human Transcriptome Ahmet Arslan Kurdoglu Computational Biosciences Program Arizona State University

Optimization of RNAi Targets on the Human Transcriptome Ahmet Arslan Kurdoglu Computational Biosciences Program Arizona State University Optimization of RNAi Targets on the Human Transcriptome Ahmet Arslan Kurdoglu Computational Biosciences Program Arizona State University my background Undergraduate Degree computer systems engineer (ASU

More information

MICROARRAYS+SEQUENCING

MICROARRAYS+SEQUENCING MICROARRAYS+SEQUENCING The most efficient way to advance genomics research Down to a Science. www.affymetrix.com/downtoascience Affymetrix GeneChip Expression Technology Complementing your Next-Generation

More information

Applications of Big Data in Evidence-Based Medicine

Applications of Big Data in Evidence-Based Medicine Applications of Big Data in Evidence-Based Medicine Carolyn Compton, MD, PhD Professor Life Sciences, Arizona State University Professor Laboratory Medicine and Pathology, Mayo Clinic Adjunct Professor

More information

Introduction to Next Generation Sequencing (NGS) Data Analysis and Pathway Analysis. Jenny Wu

Introduction to Next Generation Sequencing (NGS) Data Analysis and Pathway Analysis. Jenny Wu Introduction to Next Generation Sequencing (NGS) Data Analysis and Pathway Analysis Jenny Wu Outline Introduction to NGS data analysis in Cancer Genomics NGS applications in cancer research Typical NGS

More information

Transcription. DNA to RNA

Transcription. DNA to RNA Transcription from DNA to RNA The Central Dogma of Molecular Biology replication DNA RNA Protein transcription translation Why call it transcription and translation? transcription is such a direct copy

More information

Developing an Accurate and Precise Companion Diagnostic Assay for Targeted Therapies in DLBCL

Developing an Accurate and Precise Companion Diagnostic Assay for Targeted Therapies in DLBCL Developing an Accurate and Precise Companion Diagnostic Assay for Targeted Therapies in DLBCL James Storhoff, Ph.D. Senior Manager, Diagnostic Test Development World Cdx, Boston, Sep. 10th Molecules That

More information

Algorithms in Bioinformatics

Algorithms in Bioinformatics Algorithms in Bioinformatics Sami Khuri Department of Computer Science San José State University San José, California, USA khuri@cs.sjsu.edu www.cs.sjsu.edu/faculty/khuri Outline Central Dogma of Molecular

More information

Cytomics in Action: Cytokine Network Cytometry

Cytomics in Action: Cytokine Network Cytometry Cytomics in Action: Cytokine Network Cytometry Jonni S. Moore, Ph.D. Director, Clinical and Research Flow Cytometry and PathBioResource Associate Professor of Pathology & Laboratory Medicine University

More information

Proteomics And Cancer Biomarker Discovery. Dr. Zahid Khan Institute of chemical Sciences (ICS) University of Peshawar. Overview. Cancer.

Proteomics And Cancer Biomarker Discovery. Dr. Zahid Khan Institute of chemical Sciences (ICS) University of Peshawar. Overview. Cancer. Proteomics And Cancer Biomarker Discovery Dr. Zahid Khan Institute of chemical Sciences (ICS) University of Peshawar Overview Proteomics Cancer Aims Tools Data Base search Challenges Summary 1 Overview

More information

Sanger vs Next-Gen Sequencing

Sanger vs Next-Gen Sequencing Tools and Algorithms in Bioinformatics GCBA815/MCGB815/BMI815, Fall 2017 Week-8: Next-Gen Sequencing RNA-seq Data Analysis Babu Guda, Ph.D. Professor, Genetics, Cell Biology & Anatomy Director, Bioinformatics

More information

Bioinformatics, in general, deals with the following important biological data:

Bioinformatics, in general, deals with the following important biological data: Pocket K No. 23 Bioinformatics for Plant Biotechnology Introduction As of July 30, 2006, scientists around the world are pursuing a total of 2,126 genome projects. There are 405 published complete genomes,

More information

Opportunities and Impacts

Opportunities and Impacts Nanotechnology: Opportunities and Impacts of a Scientific Revolution National Conference of State Legislatures Nashville, Tennessee August 16, 2006 James B. Roberto Deputy Director for Science and Technology

More information

Introduction to Bioinformatics CPSC 265. What is bioinformatics? Textbooks

Introduction to Bioinformatics CPSC 265. What is bioinformatics? Textbooks Introduction to Bioinformatics CPSC 265 Thanks to Jonathan Pevsner, Ph.D. Textbooks Johnathan Pevsner, who I stole most of these slides from (thanks!) has written a textbook, Bioinformatics and Functional

More information

disaccharides = two mono-s linked together e.g. lactose = glucose + galactose sucrose = glucose + fructose

disaccharides = two mono-s linked together e.g. lactose = glucose + galactose sucrose = glucose + fructose involved in the degradation of molecules found in animal cells membrane limited varies in shape and size contains acid hydrolases (phosphatase, nucleases, proteases, etc.), enzymes that work only at acid

More information

IPA Advanced Training Course

IPA Advanced Training Course IPA Advanced Training Course Academia Sinica 2015 Oct Gene( 陳冠文 ) Supervisor and IPA certified analyst 1 Review for Introductory Training course Searching Building a Pathway Editing a Pathway for Publication

More information

NCBI web resources I: databases and Entrez

NCBI web resources I: databases and Entrez NCBI web resources I: databases and Entrez Yanbin Yin Most materials are downloaded from ftp://ftp.ncbi.nih.gov/pub/education/ 1 Homework assignment 1 Two parts: Extract the gene IDs reported in table

More information

Functional Genomics Overview RORY STARK PRINCIPAL BIOINFORMATICS ANALYST CRUK CAMBRIDGE INSTITUTE 18 SEPTEMBER 2017

Functional Genomics Overview RORY STARK PRINCIPAL BIOINFORMATICS ANALYST CRUK CAMBRIDGE INSTITUTE 18 SEPTEMBER 2017 Functional Genomics Overview RORY STARK PRINCIPAL BIOINFORMATICS ANALYST CRUK CAMBRIDGE INSTITUTE 18 SEPTEMBER 2017 Agenda What is Functional Genomics? RNA Transcription/Gene Expression Measuring Gene

More information

Introduction to Bioinformatics

Introduction to Bioinformatics Introduction to Bioinformatics Contents Cell biology Organisms and cells Building blocks of cells How genes encode proteins? Bioinformatics What is bioinformatics? Practical applications Tools and databases

More information

Introduction to BioMEMS & Medical Microdevices DNA Microarrays and Lab-on-a-Chip Methods

Introduction to BioMEMS & Medical Microdevices DNA Microarrays and Lab-on-a-Chip Methods Introduction to BioMEMS & Medical Microdevices DNA Microarrays and Lab-on-a-Chip Methods Companion lecture to the textbook: Fundamentals of BioMEMS and Medical Microdevices, by Prof., http://saliterman.umn.edu/

More information

WELCOME. Norma J. Nowak, PhD Executive Director, NY State Center of Excellence in Bioinformatics and Life Sciences (CBLS)

WELCOME. Norma J. Nowak, PhD Executive Director, NY State Center of Excellence in Bioinformatics and Life Sciences (CBLS) WELCOME Norma J. Nowak, PhD Executive Director, NY State Center of Excellence in Bioinformatics and Life Sciences (CBLS) Director, UB Genomics and Bioinformatics Core (GBC) o o o o o o o o o o o o Grow

More information

Bioinformatics and computational tools

Bioinformatics and computational tools Bioinformatics and computational tools Etienne P. de Villiers (PhD) International Livestock Research Institute Nairobi, Kenya International Livestock Research Institute Nairobi, Kenya ILRI works at the

More information

ChIP-Seq Data Analysis. J Fass UCD Genome Center Bioinformatics Core Wednesday 15 June 2015

ChIP-Seq Data Analysis. J Fass UCD Genome Center Bioinformatics Core Wednesday 15 June 2015 ChIP-Seq Data Analysis J Fass UCD Genome Center Bioinformatics Core Wednesday 15 June 2015 What s the Question? Where do Transcription Factors (TFs) bind genomic DNA 1? (Where do other things bind DNA

More information

DEPEI QIAN. HPC Development in China: A Brief Review and Prospect

DEPEI QIAN. HPC Development in China: A Brief Review and Prospect DEPEI QIAN Qian Depei, Professor at Sun Yat-sen university and Beihang University, Dean of the School of Data and Computer Science of Sun Yat-sen University. Since 1996 he has been the member of the expert

More information

Introduction to Microarray Analysis

Introduction to Microarray Analysis Introduction to Microarray Analysis Methods Course: Gene Expression Data Analysis -Day One Rainer Spang Microarrays Highly parallel measurement devices for gene expression levels 1. How does the microarray

More information

General Education Learning Outcomes

General Education Learning Outcomes BOROUGH OF MANHATTAN COMMUNITY COLLEGE City University of New York Department of Science Title of Course: Cell Biology Class hours 3 BIO Section: 260 Lab hours 3 Semester Spring 2018 Credits 4 Schedule:

More information

Product Applications for the Sequence Analysis Collection

Product Applications for the Sequence Analysis Collection Product Applications for the Sequence Analysis Collection Pipeline Pilot Contents Introduction... 1 Pipeline Pilot and Bioinformatics... 2 Sequence Searching with Profile HMM...2 Integrating Data in a

More information

DNA is normally found in pairs, held together by hydrogen bonds between the bases

DNA is normally found in pairs, held together by hydrogen bonds between the bases Bioinformatics Biology Review The genetic code is stored in DNA Deoxyribonucleic acid. DNA molecules are chains of four nucleotide bases Guanine, Thymine, Cytosine, Adenine DNA is normally found in pairs,

More information

Analysing genomes and transcriptomes using Illumina sequencing

Analysing genomes and transcriptomes using Illumina sequencing Analysing genomes and transcriptomes using Illumina uencing Dr. Heinz Himmelbauer Centre for Genomic Regulation (CRG) Ultrauencing Unit Barcelona The Sequencing Revolution High-Throughput Sequencing 2000

More information

Introduction to Bioinformatics

Introduction to Bioinformatics Introduction to Bioinformatics Dr. Taysir Hassan Abdel Hamid Lecturer, Information Systems Department Faculty of Computer and Information Assiut University taysirhs@aun.edu.eg taysir_soliman@hotmail.com

More information

Goals of pharmacogenomics

Goals of pharmacogenomics Goals of pharmacogenomics Use drugs better and use better drugs! People inherit/exhibit differences in drug: Absorption Metabolism and degradation of the drug Transport of drug to the target molecule Excretion

More information

Gene Expression on the Fluidigm BioMark HD

Gene Expression on the Fluidigm BioMark HD Gene Expression on the Fluidigm BioMark HD Overview Introduction to Fluidigm James Miller Advantages of the technology Running a Fluidigm gene expression project Paul Lacaze Assay design, chemistry, experimental

More information

Intro to Microarray Analysis. Courtesy of Professor Dan Nettleton Iowa State University (with some edits)

Intro to Microarray Analysis. Courtesy of Professor Dan Nettleton Iowa State University (with some edits) Intro to Microarray Analysis Courtesy of Professor Dan Nettleton Iowa State University (with some edits) Some Basic Biology Genes are DNA sequences that code for proteins. (e.g. gene lengths perhaps 1000

More information

Feature Selection of Gene Expression Data for Cancer Classification: A Review

Feature Selection of Gene Expression Data for Cancer Classification: A Review Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 50 (2015 ) 52 57 2nd International Symposium on Big Data and Cloud Computing (ISBCC 15) Feature Selection of Gene Expression

More information

Molecular Diagnostics

Molecular Diagnostics Molecular Diagnostics Part II: Regulations, Markets & Companies By Prof. K. K. Jain MD, FRACS, FFPM Jain PharmaBiotech Basel, Switzerland May 2018 A Jain PharmaBiotech Report A U T H O R ' S B I O G R

More information

Outline and learning objectives. From Proteomics to Systems Biology. Integration of omics - information

Outline and learning objectives. From Proteomics to Systems Biology. Integration of omics - information From to Systems Biology Outline and learning objectives Omics science provides global analysis tools to study entire systems How to obtain omics - What can we learn Limitations Integration of omics - In-class

More information

Dana-Farber Cancer Institute Speeds Medical Research with Advanced Data Warehouse

Dana-Farber Cancer Institute Speeds Medical Research with Advanced Data Warehouse Dana-Farber Cancer Institute Speeds Medical Research with Advanced Data Warehouse Dana-Farber Cancer Institute Boston, MA www.dana-farber.org Industry: Healthcare Annual Revenue: US$665.7 million Employees:

More information

Ontologies - Useful tools in Life Sciences and Forensics

Ontologies - Useful tools in Life Sciences and Forensics Ontologies - Useful tools in Life Sciences and Forensics How today's Life Science Technologies can shape the Crime Sciences of tomorrow 04.07.2015 Dirk Labudde Mittweida Mittweida 2 Watson vs Watson Dr.

More information

Algorithms in Nature. (brief) introduction to biology

Algorithms in Nature. (brief) introduction to biology Algorithms in Nature (brief) introduction to biology Organism, Organ, Cell Organism 2 Types of Cells Eukaryots: - Plants, animals, humans - DNA resides in the nucleus - Contain also other compartments

More information

Review of Biomedical Image Processing

Review of Biomedical Image Processing BOOK REVIEW Open Access Review of Biomedical Image Processing Edward J Ciaccio Correspondence: ciaccio@columbia. edu Department of Medicine, Columbia University, New York, USA Abstract This article is

More information

Ion S5 and Ion S5 XL Systems

Ion S5 and Ion S5 XL Systems Ion S5 and Ion S5 XL Systems Targeted sequencing has never been simpler Introducing the Ion S5 and Ion S5 XL systems Now, adopting next-generation sequencing in your lab is simpler than ever. The Ion S5

More information

Biomedical Big Data and Precision Medicine

Biomedical Big Data and Precision Medicine Biomedical Big Data and Precision Medicine Jie Yang Department of Mathematics, Statistics, and Computer Science University of Illinois at Chicago October 8, 2015 1 Explosion of Biomedical Data 2 Types

More information

Introduction to Bioinformatics. Fabian Hoti 6.10.

Introduction to Bioinformatics. Fabian Hoti 6.10. Introduction to Bioinformatics Fabian Hoti 6.10. Analysis of Microarray Data Introduction Different types of microarrays Experiment Design Data Normalization Feature selection/extraction Clustering Introduction

More information