Genome-wide association studies. Gene regulation and the genomics of complex traits. Relating Variation to Phenotype. Relating Variation to Phenotype

Size: px
Start display at page:

Download "Genome-wide association studies. Gene regulation and the genomics of complex traits. Relating Variation to Phenotype. Relating Variation to Phenotype"

Transcription

1 Genome-wide association studies Gene regulation and the genomics of complex traits Eric R. Gamazon Vanderbilt University; Academic Medical Center, University of Amsterdam Nancy J. Cox Vanderbilt University June 27, 2016 January 27, 2016 Thousands of loci have been reproducibly associated with common diseases through GWAS Together they account for only a small proportion of the heritability to such traits Importantly, they have not always led to better understanding of the mechanism underlying the traits Relating Variation to Phenotype Genotyping Assay known variants in high-throughput (beads, chips) Focused largely on common variation (which will be largely non-coding) Inexpensive ~$65 / 2.1M variants Soon will have $30-40 (all in) Sequencing Assay all variation in exomes (coding sequences) ~$400 + or whole genomes ~$1200 +, costs continue to decline Relating Variation to Phenotype Genome wide association studies Genotyping 1M variants allows good interrogation (through imputation) of most common variation Usually conducted in unrelated cases with disease and controls without disease (or unphenotyped individuals) Predicated on assumption that common diseases will be driven by common variation Using Correlation Structure of Variation in Human Genome

2 The Real Problem? Missing Biology Two Primary Themes We need functional information to understand our signals; transcriptomes are a great place to start We need to integrate the functional information with the association information to get maximum utility Integration of Function Testing whether there is enrichment of functional variants among top signals Testing whether functional variants concentrate heritability Testing whether imputed transcript levels are associated with phenotypes Genome Variation: Regulatory Genome Variation: Splicing

3 Natural Variation in Transcriptional Activity eqtl analysis investigates variation in expression due to genetic regulation, and downstream phenotypic effects An eqtl is a locus (position on genome) associated with variation in RNA expression expression ~millions of markers genotypes - A GWAS for each gene (~10 10 tests) - Association of a SNP nearby the gene is a cis-acting eqtl - Association of a SNP far from the gene is a trans-acting eqtl expression genotype Gene regulation and GWAS Expression QTL and GWAS SNP co-localization may provide insights into the mechanistic basis for the observed associations Annotation with eqtl may improve our ability to distinguish associations likely to be replicated Are there many more common variants associated with disease and likely to be eqtls that can be identified through GWAS? GTEx: Genotype Tissue Expression Subjects at autopsy, tissue / organ donation Whole genome sequencing of subjects; RNAseq on ~50 tissues The Genotype-Tissue Expression (GTEx) project. Nat. Gen eqtl studies within and across tissues and a variety of applications The Genotype-Tissue Expression (GTEx) project. Nat. Gen. 2013

4 Eligibility Donors of any racial ethnic group and sex of age in whom biospecimen collection can start within 24 hours of death are eligible Medical exclusion criteria HIV infection or high risk behaviors Viral hepatitis Metastatic cancer Chemotherapy or radiation therapy in past 2 years Blood transfusion in past 48 hours BMI index >= 35 or <=18.5 GTEx Blood samples are collected for SNP and CNV genotyping and to establish LCLs RNA quantification by massively parallel sequencing eqtl analyses, both single-tissue and multitissue, are conducted The Genotype-Tissue Expression (GTEx) project. Nat. Gen Sample clustering based on gene expression and exon splicing profiles. The GTEx Consortium Science 2015;348: Published by AAAS PEER factors and influence on detection Published by AAAS GTEx

5 Expression QTLs in blood GTEx Hypertension and Adipose eqtls MAGIC: HOMA-IR (all SNPs) MAGIC: HOMA-IR Definition of splicing events GTEx

6 Splicing quantitative trait locus (sqtl) Splice-junction QTLs in blood Splice-junction QTL: a genetic variant associated with changes in exon junction abundance (calculated using Altrans [Roderic s group]) Splicing-isoform ratio QTL: a genetic variant associated with changes in the relative abundances of gene transcript isoforms (detected using sqtlseeker [Roderic s group]) Transcript QTL: a genetic variant associated with the absolute abundance of a single isoform of a gene Splicing-isoform ratio QTLs in blood Splice-junction QTL GTEx. Science Splicing-isoform ratio QTL eqtl Discovery is a Bunch of GWAS Measured transcript levels for each gene in a given tissue from each person is just a quantitative trait like any other We can test the association of each SNP with the transcript level of each gene. If we have a matrix of genome variation and a matrix of transcriptome variation GTEx. Science 2015.

7 Matrix eqtl 2-3 orders of magnitude faster than conventional association analysis on transcriptome data Special preprocessing allows the most computationally intensive parts of analysis to be done as large matrix operations Supports additive linear and ANOVA models, covariates, correlated error Calculates false discovery rates seperately for cis- and trans-eqtls One MAMMOTH Matrix Multiplication GS T, broken into 10K x 10K blocks for computational feasibility With covariate Gene Expression Matrix: Each row is expression measurement for a gene across columns of individuals Columns are identical sets of individuals Genotype Matrix: Each SNP is a row across columns of individuals

8 Huge Advantages to Fast Computation The analysis becomes an experiment that you can perform over and over to understand how different covariates affect results, and how different types of models perform Past studies often focused on only local SNPs as eqtls because using the whole genome was prohibitively computationally burdensome Writing p-values takes time (when you have 10 billion); they tally counts (for FDR, QQ plots, but save only values meeting a user-specified threshold Huge Advantages to Fast Computation The analysis becomes an experiment that you can perform over and over to understand how different covariates affect results, and how different types of models perform Past studies often focused on only local SNPs as eqtls because using the whole genome was prohibitively computationally burdensome Writing p-values takes time (when you have 10 billion); they tally counts (for FDR, QQ plots, but save only values meeting a user-specified threshold Example Application RNA from HapMap/1000 Genomes cell lines measured using RNA-Seq in the GEUVADIS study HEADS UP Data processing before these analyses is also very important: SNPs downloaded from 1000 Genomes Project data Chromosome 22 only Test different thresholds for reporting eqtls Test different models for cis- and trans- PEER factors, surrogate variable analysis to reduce batch effects If you choose PEERS to optimize cis-eqtl discovery, you will not be optimized for trans-eqtl discovery (networks of co-ordinately regulated genes picked up in higher order PEERS) If you want to focus on sex or tissue effects, how you choose to normalize data in advance of analysis will matter Polygenic Modeling Heritability is a population parameter that may elucidate the genetic architecture of complex traits Methods for obtaining estimates from pedigree data are well established Estimates may be derived from tag SNPs (e.g., on platforms) chip heritability Polygenic Modeling Relates phenotypic variation to many genetic variants Differs from traditional single-variant tests of association Estimating eqtl-derived heritability The proportion of variance captured by eqtl SNPs Predicting disease risk or therapeutic response on the basis of genotypes

9 Crohn s Disease Y = Xb + T g T + C + e Crohn s Disease Type 1 Diabetes Type 1 Diabetes Concentration of Heritability We are able to capture 20 to 30% of the heritability from genome-wide marker SNPs (~200,000 SNPs) with our cis eqtls far fewer SNPs. Tissue-specificity of eqtl contribution to heritability We can concentrate heritability with the use of cis eqtls.

10 PrediXcan Develop prediction models for expression Predict full transcriptome based on genetic data Quantify the association between genetically predicted levels of gene expression and phenotype (disease or QT) Transcriptome PrediXcan is More than Genetics GReX Genetically regulated expression Other factors Traitaltered component Figure 1: Regulatory Mechanism Tested by PrediXcan Rich data for capturing Trait consequences of environmental exposures Gamazon ER et al. Nature Genetics Gene Expression Decomposition Use of Transcriptome in Prediction Transcriptome captures important genome variation (regulating gene expression) enriched in disease Transcriptome captures consequences of environmental exposures on gene expression Example use: serial transcriptomes in watchful waiting compared with imaging Analogous to Imputation Learn relationship of genome variation to transcriptome in reference sample (GTEx) Store weights from prediction equations Apply to any dataset with genome interrogation Reduced multiple-testing burden Advantages of Framework Informative priors and groupings of functional units (on the basis of known pathways, for example) No actual transcriptome data are required, as the predicted expression levels are a function of genetic variation alone. Apply to any existing data set with large-scale genome interrogation, such as those dbgap or other repositories. Reverse causality is not a major concern; disease status or drug treatment does not alter germline genomic variation. Meta-analysis of gene-based results is simplified, as less stringent harmonization between studies is required. The approach can be applied to common or rare variants. In general, larger sample sizes for the training set will be needed to achieve good prediction models with rare variants. Advantages of Framework We iteratively use more and more of what we do know to figure out what we most want to learn for new discovery Signals come at the levels of the gene, improving the ability to do pathway/network analyses Sets up a natural framework for the unified analysis of whole genome sequence data

11 WTCCC RA outside HLA Resources for EMR-based research at Vanderbilt The Synthetic Derivative A de-identified and continuously-updated image of the EMR: 2,500,000 subjects BioVU Subjects with DNA: >214,000 Dense (GWAS-level) genotypes: ~20,000 Exome chip data: 42,000 Resources for EMR-based research at Vanderbilt 2017 The Synthetic Derivative A de-identified and continuously-updated image of the EMR: 3,000,000 subjects BioVU Subjects with DNA: >225,000 Dense genotypes: >100,000 Whole genome or exome sequencing: ~1000 s The genome-wide association study Target phenotype association P value chromosomal location The phenome-wide association study Target genotype association P value BioVU X PrediXcan: Gene-based PheWAS BioVU An in silico Discovery Engine diagnosis code PheWAS requirement: A large cohort of patients with genotype data and many diagnoses The First Gene X Medical Phenome Catalog

12 cat a log ˈkadlˌôɡ/ noun noun: catalogue; plural noun: catalogues; noun: catalog; plural noun: catalogs 1. a complete list of items, typically one in alphabetical or other systematic order, in particular. The Burden of Medical Disease vs. Deviation in the Transcriptome Across 13K BioVU subjects, the number of PheWAS codes is significantly (p = 8.5 x 10-4 ) correlated with the number of genes with predicted transcript levels +/- 5 SD from the mean Consistently observed across tissues Significant even at +/- 3 SD What s the Best Measure of Burden of Disease? What s the Best Measure of Transcriptome Deviance? Capture number of codes and severity? Other variables? Raw numbers of genes, or weight more connected genes more highly? More heritable genes more highly? Reduced Predicted Expression GRIK5 An Eye Super Gene? Zebrafish studies conducted in the Zebrafish Aquatic Facility by Ela Knapik, and students Daniel Levin, Gokhan Unlu, and Jessica Brown

13 GRIK5 expression in the zebrafish eye OK, GRIK5 is involved in normal eye development, but that does not fully explain how it can contribute to so many different eye phenotypes that were not previously thought to be related 3 dpf Lens RPE GCL PCL OPL INL Optic Nerve RPE Retinal Pigment Epithelium PCL Photoreceptor Cell Layer OPL Outer Plexiform Layer INL Inner Nuclear Layer GCL Ganglion Cell Layer Nuclei (DAPI) GRIK5 Results on ~500 genes in 5000 individuals Mega Project Team Results on all genes in 13,000 Results on all genes in 25,000 Results in 100,000+, 200,000+, Jim Sutcliffe Cara Sutcliffe Sarah Collier Lana Olson Janey Wang Vanderbilt Zebrafish Aquatic Facility Ela Knapik Gokhan Unlu Jess Brown Daniel Levic VICTR Vanderbilt Institute for Clinical and Translational Research Gordon Bernard 79

14 Our GTEx Team at University of Chicago CDR PRC Dan Nicolae Lin Chen Hae Kyung (Haky) Im Barbara Stranger Virginia Beach Roanoke Richmond Kaanan Shah Jason Torres Keston Aquino-Michaels Data analysis egtex BSS OPO CBR CDR Brain bank Inactive site LDACC ELSI NIH

Whole Transcriptome Analysis of Illumina RNA- Seq Data. Ryan Peters Field Application Specialist

Whole Transcriptome Analysis of Illumina RNA- Seq Data. Ryan Peters Field Application Specialist Whole Transcriptome Analysis of Illumina RNA- Seq Data Ryan Peters Field Application Specialist Partek GS in your NGS Pipeline Your Start-to-Finish Solution for Analysis of Next Generation Sequencing Data

More information

Linking Genetic Variation to Important Phenotypes

Linking Genetic Variation to Important Phenotypes Linking Genetic Variation to Important Phenotypes BMI/CS 776 www.biostat.wisc.edu/bmi776/ Spring 2018 Anthony Gitter gitter@biostat.wisc.edu These slides, excluding third-party material, are licensed under

More information

Genomic Research: Issues to Consider. IRB Brown Bag August 28, 2014 Sharon Aufox, MS, LGC

Genomic Research: Issues to Consider. IRB Brown Bag August 28, 2014 Sharon Aufox, MS, LGC Genomic Research: Issues to Consider IRB Brown Bag August 28, 2014 Sharon Aufox, MS, LGC Outline Key genomic terms and concepts Issues in genomic research Consent models Types of findings Returning results

More information

S SG. Metabolomics meets Genomics. Hemant K. Tiwari, Ph.D. Professor and Head. Metabolomics: Bench to Bedside. ection ON tatistical.

S SG. Metabolomics meets Genomics. Hemant K. Tiwari, Ph.D. Professor and Head. Metabolomics: Bench to Bedside. ection ON tatistical. S SG ection ON tatistical enetics Metabolomics meets Genomics Hemant K. Tiwari, Ph.D. Professor and Head Section on Statistical Genetics Department of Biostatistics School of Public Health Metabolomics:

More information

By the end of this lecture you should be able to explain: Some of the principles underlying the statistical analysis of QTLs

By the end of this lecture you should be able to explain: Some of the principles underlying the statistical analysis of QTLs (3) QTL and GWAS methods By the end of this lecture you should be able to explain: Some of the principles underlying the statistical analysis of QTLs Under what conditions particular methods are suitable

More information

CS273B: Deep Learning in Genomics and Biomedicine. Recitation 1 30/9/2016

CS273B: Deep Learning in Genomics and Biomedicine. Recitation 1 30/9/2016 CS273B: Deep Learning in Genomics and Biomedicine. Recitation 1 30/9/2016 Topics Genetic variation Population structure Linkage disequilibrium Natural disease variants Genome Wide Association Studies Gene

More information

UK Biobank Axiom Array

UK Biobank Axiom Array DATA SHEET Advancing human health studies with powerful genotyping technology Array highlights The Applied Biosystems UK Biobank Axiom Array is a powerful array for translational research. Designed using

More information

Personal Genomics Platform White Paper Last Updated November 15, Executive Summary

Personal Genomics Platform White Paper Last Updated November 15, Executive Summary Executive Summary Helix is a personal genomics platform company with a simple but powerful mission: to empower every person to improve their life through DNA. Our platform includes saliva sample collection,

More information

Machine learning applications in genomics: practical issues & challenges. Yuzhen Ye School of Informatics and Computing, Indiana University

Machine learning applications in genomics: practical issues & challenges. Yuzhen Ye School of Informatics and Computing, Indiana University Machine learning applications in genomics: practical issues & challenges Yuzhen Ye School of Informatics and Computing, Indiana University Reference Machine learning applications in genetics and genomics

More information

Supplementary Note: Detecting population structure in rare variant data

Supplementary Note: Detecting population structure in rare variant data Supplementary Note: Detecting population structure in rare variant data Inferring ancestry from genetic data is a common problem in both population and medical genetic studies, and many methods exist to

More information

Top 5 Lessons Learned From MAQC III/SEQC

Top 5 Lessons Learned From MAQC III/SEQC Top 5 Lessons Learned From MAQC III/SEQC Weida Tong, Ph.D Division of Bioinformatics and Biostatistics, NCTR/FDA Weida.tong@fda.hhs.gov; 870 543 7142 1 MicroArray Quality Control (MAQC) An FDA led community

More information

Biomedical Big Data and Precision Medicine

Biomedical Big Data and Precision Medicine Biomedical Big Data and Precision Medicine Jie Yang Department of Mathematics, Statistics, and Computer Science University of Illinois at Chicago October 8, 2015 1 Explosion of Biomedical Data 2 Types

More information

Course Presentation. Ignacio Medina Presentation

Course Presentation. Ignacio Medina Presentation Course Index Introduction Agenda Analysis pipeline Some considerations Introduction Who we are Teachers: Marta Bleda: Computational Biologist and Data Analyst at Department of Medicine, Addenbrooke's Hospital

More information

less sensitive than RNA-seq but more robust analysis pipelines expensive but quantitiatve standard but typically not high throughput

less sensitive than RNA-seq but more robust analysis pipelines expensive but quantitiatve standard but typically not high throughput Chapter 11: Gene Expression The availability of an annotated genome sequence enables massively parallel analysis of gene expression. The expression of all genes in an organism can be measured in one experiment.

More information

A Statistical Framework for Joint eqtl Analysis in Multiple Tissues

A Statistical Framework for Joint eqtl Analysis in Multiple Tissues A Statistical Framework for Joint eqtl Analysis in Multiple Tissues Timothée Flutre 1,2., Xiaoquan Wen 3., Jonathan Pritchard 1,4, Matthew Stephens 1,5 * 1 Department of Human Genetics, University of Chicago,

More information

emerge-ii site report Vanderbilt

emerge-ii site report Vanderbilt emerge-ii site report Vanderbilt 29 June 2015 Vanderbilt activities emerge II PGx implementation locally and emerge-pgx SCN5A/KCNH2 project provider attitudes Phenotype contributions Methods development

More information

Genome-Wide Association Studies (GWAS): Computational Them

Genome-Wide Association Studies (GWAS): Computational Them Genome-Wide Association Studies (GWAS): Computational Themes and Caveats October 14, 2014 Many issues in Genomewide Association Studies We show that even for the simplest analysis, there is little consensus

More information

Target Enrichment Strategies for Next Generation Sequencing

Target Enrichment Strategies for Next Generation Sequencing Target Enrichment Strategies for Next Generation Sequencing Anuj Gupta, PhD Agilent Technologies, New Delhi Genotypic Conference, Sept 2014 NGS Timeline Information burst Nearly 30,000 human genomes sequenced

More information

BBMRI.NL A story of Sharing Eline Slagboom Molecular Epidemiology, Leiden University Medical Center Brussel 20 Juni 2017

BBMRI.NL A story of Sharing Eline Slagboom Molecular Epidemiology, Leiden University Medical Center Brussel 20 Juni 2017 BBMRI.NL A story of Sharing Eline Slagboom Molecular Epidemiology, Leiden University Medical Center Brussel 20 Juni 2017 BBMRI-NL TOWARDS A NATIONAL BIOBANKING INFRASTRUCTURE Founded in 2009 as the Dutch

More information

Association Mapping in Plants PLSC 731 Plant Molecular Genetics Phil McClean April, 2010

Association Mapping in Plants PLSC 731 Plant Molecular Genetics Phil McClean April, 2010 Association Mapping in Plants PLSC 731 Plant Molecular Genetics Phil McClean April, 2010 Traditional QTL approach Uses standard bi-parental mapping populations o F2 or RI These have a limited number of

More information

Runs of Homozygosity Analysis Tutorial

Runs of Homozygosity Analysis Tutorial Runs of Homozygosity Analysis Tutorial Release 8.7.0 Golden Helix, Inc. March 22, 2017 Contents 1. Overview of the Project 2 2. Identify Runs of Homozygosity 6 Illustrative Example...............................................

More information

The 150+ Tomato Genome (re-)sequence Project; Lessons Learned and Potential

The 150+ Tomato Genome (re-)sequence Project; Lessons Learned and Potential The 150+ Tomato Genome (re-)sequence Project; Lessons Learned and Potential Applications Richard Finkers Researcher Plant Breeding, Wageningen UR Plant Breeding, P.O. Box 16, 6700 AA, Wageningen, The Netherlands,

More information

QIAGEN s NGS Solutions for Biomarkers NGS & Bioinformatics team QIAGEN (Suzhou) Translational Medicine Co.,Ltd

QIAGEN s NGS Solutions for Biomarkers NGS & Bioinformatics team QIAGEN (Suzhou) Translational Medicine Co.,Ltd QIAGEN s NGS Solutions for Biomarkers NGS & Bioinformatics team QIAGEN (Suzhou) Translational Medicine Co.,Ltd 1 Our current NGS & Bioinformatics Platform 2 Our NGS workflow and applications 3 QIAGEN s

More information

Stefano Monti. Workshop Format

Stefano Monti. Workshop Format Gad Getz Stefano Monti Michael Reich {gadgetz,smonti,mreich}@broad.mit.edu http://www.broad.mit.edu/~smonti/aws Broad Institute of MIT & Harvard October 18-20, 2006 Cambridge, MA Workshop Format Morning

More information

ACCELERATING GENOMIC ANALYSIS ON THE CLOUD. Enabling the PanCancer Analysis of Whole Genomes (PCAWG) consortia to analyze thousands of genomes

ACCELERATING GENOMIC ANALYSIS ON THE CLOUD. Enabling the PanCancer Analysis of Whole Genomes (PCAWG) consortia to analyze thousands of genomes ACCELERATING GENOMIC ANALYSIS ON THE CLOUD Enabling the PanCancer Analysis of Whole Genomes (PCAWG) consortia to analyze thousands of genomes Enabling the PanCancer Analysis of Whole Genomes (PCAWG) consortia

More information

E2ES to Accelerate Next-Generation Genome Analysis in Clinical Research

E2ES to Accelerate Next-Generation Genome Analysis in Clinical Research www.hcltech.com E2ES to Accelerate Next-Generation Genome Analysis in Clinical Research whitepaper April 2015 TABLE OF CONTENTS Introduction 3 Challenges associated with NGS data analysis 3 HCL s NGS Solution

More information

Reading Between the Genes: Computational Models to Discover Function from Noncoding DNA

Reading Between the Genes: Computational Models to Discover Function from Noncoding DNA Reading Between the Genes: Computational Models to Discover Function from Noncoding DNA Yves A. Lussier, Joanne Berghout, Francesca Vitali, Kenneth S. Ramos Center for Biomedical Informatics and Biostatistics,

More information

Gene Expression Technology

Gene Expression Technology Gene Expression Technology Bing Zhang Department of Biomedical Informatics Vanderbilt University bing.zhang@vanderbilt.edu Gene expression Gene expression is the process by which information from a gene

More information

Targeted resequencing

Targeted resequencing Targeted resequencing Sarah Calvo, Ph.D. Computational Biologist Vamsi Mootha laboratory Snapshots of Genome Wide Analysis in Human Disease (MPG), 4/20/2010 Vamsi Mootha, PI How can I assess a small genomic

More information

Illumina s GWAS Roadmap: next-generation genotyping studies in the post-1kgp era

Illumina s GWAS Roadmap: next-generation genotyping studies in the post-1kgp era Illumina s GWAS Roadmap: next-generation genotyping studies in the post-1kgp era Anthony Green Sr. Genotyping Sales Specialist North America 2010 Illumina, Inc. All rights reserved. Illumina, illuminadx,

More information

Bioinformatics Advice on Experimental Design

Bioinformatics Advice on Experimental Design Bioinformatics Advice on Experimental Design Where do I start? Please refer to the following guide to better plan your experiments for good statistical analysis, best suited for your research needs. Statistics

More information

TOTAL CANCER CARE: CREATING PARTNERSHIPS TO ADDRESS PATIENT NEEDS

TOTAL CANCER CARE: CREATING PARTNERSHIPS TO ADDRESS PATIENT NEEDS TOTAL CANCER CARE: CREATING PARTNERSHIPS TO ADDRESS PATIENT NEEDS William S. Dalton, PhD, MD CEO, M2Gen & Director, Personalized Medicine Institute, Moffitt Cancer Center JULY 15, 2013 MOFFITT CANCER CENTER

More information

Beyond single genes or proteins

Beyond single genes or proteins Beyond single genes or proteins Marylyn D Ritchie, PhD Professor, Biochemistry and Molecular Biology Director, Center for Systems Genomics The Pennsylvania State University Traditional Approach Genome-wide

More information

Gene Regulation Solutions. Microarrays and Next-Generation Sequencing

Gene Regulation Solutions. Microarrays and Next-Generation Sequencing Gene Regulation Solutions Microarrays and Next-Generation Sequencing Gene Regulation Solutions The Microarrays Advantage Microarrays Lead the Industry in: Comprehensive Content SurePrint G3 Human Gene

More information

RNA-Seq analysis using R: Differential expression and transcriptome assembly

RNA-Seq analysis using R: Differential expression and transcriptome assembly RNA-Seq analysis using R: Differential expression and transcriptome assembly Beibei Chen Ph.D BICF 12/7/2016 Agenda Brief about RNA-seq and experiment design Gene oriented analysis Gene quantification

More information

Ion S5 and Ion S5 XL Systems

Ion S5 and Ion S5 XL Systems Ion S5 and Ion S5 XL Systems Targeted sequencing has never been simpler Explore the Ion S5 and Ion S5 XL Systems Adopting next-generation sequencing (NGS) in your lab is now simpler than ever The Ion S5

More information

Reproducible RNA-seq analysis with. Leonardo #bioc2017

Reproducible RNA-seq analysis with. Leonardo #bioc2017 Reproducible RNA-seq analysis with Leonardo Collado-Torres @fellgernon #bioc2017 11 Reads Reference genome GTEx TCGA slide adapted from Shannon Ellis SRA Slide adapted from Ben Langmead http://rail.bio/

More information

Microarray Gene Expression Analysis at CNIO

Microarray Gene Expression Analysis at CNIO Microarray Gene Expression Analysis at CNIO Orlando Domínguez Genomics Unit Biotechnology Program, CNIO 8 May 2013 Workflow, from samples to Gene Expression data Experimental design user/gu/ubio Samples

More information

Lecture #1. Introduction to microarray technology

Lecture #1. Introduction to microarray technology Lecture #1 Introduction to microarray technology Outline General purpose Microarray assay concept Basic microarray experimental process cdna/two channel arrays Oligonucleotide arrays Exon arrays Comparing

More information

From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow

From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow Technical Overview Import VCF Introduction Next-generation sequencing (NGS) studies have created unanticipated challenges with

More information

Outline. Analysis of Microarray Data. Most important design question. General experimental issues

Outline. Analysis of Microarray Data. Most important design question. General experimental issues Outline Analysis of Microarray Data Lecture 1: Experimental Design and Data Normalization Introduction to microarrays Experimental design Data normalization Other data transformation Exercises George Bell,

More information

Derrek Paul Hibar

Derrek Paul Hibar Derrek Paul Hibar derrek.hibar@ini.usc.edu Obtain the ADNI Genetic Data Quality Control Procedures Missingness Testing for relatedness Minor allele frequency (MAF) Hardy-Weinberg Equilibrium (HWE) Testing

More information

ACCEPTED. Victoria J. Wright Corresponding author.

ACCEPTED. Victoria J. Wright Corresponding author. The Pediatric Infectious Disease Journal Publish Ahead of Print DOI: 10.1097/INF.0000000000001183 Genome-wide association studies in infectious diseases Eleanor G. Seaby 1, Victoria J. Wright 1, Michael

More information

Age-Adjusted Death Rates for Coronary Heart Disease, U.S.,

Age-Adjusted Death Rates for Coronary Heart Disease, U.S., Age-Adjusted Death Rates for Coronary Heart Disease, U.S., 1950-2004 Deaths/100,000 Population 600 500 400 300 200 100 Risk Factors U.S. Actual U.S. "Could Be" (Based on Japan Actual) 0 1950 1960 1970

More information

Whole genome sequencing in drug discovery research: a one fits all solution?

Whole genome sequencing in drug discovery research: a one fits all solution? Whole genome sequencing in drug discovery research: a one fits all solution? Marc Sultan, September 24th, 2015 Biomarker Development, Translational Medicine, Novartis On behalf of the BMD WGS pilot team:

More information

Cancer Genetics Solutions

Cancer Genetics Solutions Cancer Genetics Solutions Cancer Genetics Solutions Pushing the Boundaries in Cancer Genetics Cancer is a formidable foe that presents significant challenges. The complexity of this disease can be daunting

More information

Practical integration of genomic selection in dairy cattle breeding schemes

Practical integration of genomic selection in dairy cattle breeding schemes 64 th EAAP Meeting Nantes, 2013 Practical integration of genomic selection in dairy cattle breeding schemes A. BOUQUET & J. JUGA 1 Introduction Genomic selection : a revolution for animal breeders Big

More information

Quantitative Genetics

Quantitative Genetics Quantitative Genetics Polygenic traits Quantitative Genetics 1. Controlled by several to many genes 2. Continuous variation more variation not as easily characterized into classes; individuals fall into

More information

Analysis of Microarray Data

Analysis of Microarray Data Analysis of Microarray Data Lecture 1: Experimental Design and Data Normalization George Bell, Ph.D. Senior Bioinformatics Scientist Bioinformatics and Research Computing Whitehead Institute Outline Introduction

More information

Proteomics And Cancer Biomarker Discovery. Dr. Zahid Khan Institute of chemical Sciences (ICS) University of Peshawar. Overview. Cancer.

Proteomics And Cancer Biomarker Discovery. Dr. Zahid Khan Institute of chemical Sciences (ICS) University of Peshawar. Overview. Cancer. Proteomics And Cancer Biomarker Discovery Dr. Zahid Khan Institute of chemical Sciences (ICS) University of Peshawar Overview Proteomics Cancer Aims Tools Data Base search Challenges Summary 1 Overview

More information

Bayesian Networks as framework for data integration

Bayesian Networks as framework for data integration Bayesian Networks as framework for data integration Jun Zhu, Ph. D. Department of Genomics and Genetic Sciences Icahn Institute of Genomics and Multiscale Biology Icahn Medical School at Mount Sinai New

More information

Long and short/small RNA-seq data analysis

Long and short/small RNA-seq data analysis Long and short/small RNA-seq data analysis GEF5, 4.9.2015 Sami Heikkinen, PhD, Dos. Topics 1. RNA-seq in a nutshell 2. Long vs short/small RNA-seq 3. Bioinformatic analysis work flows GEF5 / Heikkinen

More information

Goals of pharmacogenomics

Goals of pharmacogenomics Goals of pharmacogenomics Use drugs better and use better drugs! People inherit/exhibit differences in drug: Absorption Metabolism and degradation of the drug Transport of drug to the target molecule Excretion

More information

Fundamentals of Clinical Genomics

Fundamentals of Clinical Genomics Fundamentals of Clinical Genomics Wellcome Genome Campus Hinxton, Cambridge, UK 17-19 January 2018 Lectures and Workshops to be held in the Rosalind Franklin Pavilion Lunch and Dinner to be held in the

More information

Introduction to statistics for Genome- Wide Association Studies (GWAS) Day 2 Section 8

Introduction to statistics for Genome- Wide Association Studies (GWAS) Day 2 Section 8 Introduction to statistics for Genome- Wide Association Studies (GWAS) 1 Outline Background on GWAS Presentation of GenABEL Data checking with GenABEL Data analysis with GenABEL Display of results 2 R

More information

Complementary Technologies for Precision Genetic Analysis

Complementary Technologies for Precision Genetic Analysis Complementary NGS, CGH and Workflow Featured Publication Zhu, J. et al. Duplication of C7orf58, WNT16 and FAM3C in an obese female with a t(7;22)(q32.1;q11.2) chromosomal translocation and clinical features

More information

Population Genetics & Drug Discovery

Population Genetics & Drug Discovery Population Genetics & Drug Discovery examples from Finland Mark J. Daly Chief, Analytic and Translational Genetics Unit Massachusetts General Hospital Co-director, Medical and Population Genetics Broad

More information

Mapping and Mapping Populations

Mapping and Mapping Populations Mapping and Mapping Populations Types of mapping populations F 2 o Two F 1 individuals are intermated Backcross o Cross of a recurrent parent to a F 1 Recombinant Inbred Lines (RILs; F 2 -derived lines)

More information

General aspects of genome-wide association studies

General aspects of genome-wide association studies General aspects of genome-wide association studies Abstract number 20201 Session 04 Correctly reporting statistical genetics results in the genomic era Pekka Uimari University of Helsinki Dept. of Agricultural

More information

Introduction to Bioinformatics

Introduction to Bioinformatics Introduction to Bioinformatics Richard Corbett Canada s Michael Smith Genome Sciences Centre Vancouver, British Columbia June 28, 2017 Our mandate is to advance knowledge about cancer and other diseases

More information

Welcome to the NGS webinar series

Welcome to the NGS webinar series Welcome to the NGS webinar series Webinar 1 NGS: Introduction to technology, and applications NGS Technology Webinar 2 Targeted NGS for Cancer Research NGS in cancer Webinar 3 NGS: Data analysis for genetic

More information

Opportunities for industry/smes in EU-funded health research

Opportunities for industry/smes in EU-funded health research Opportunities for industry/smes in EU-funded health research Stéphane Hogan, M.Sc, MBA Head of Unit Applied Genomics and Biotechnology for Health DG Research - European Commission 1 Paris, 31 May 2005

More information

High-throughput scale. Desktop simplicity.

High-throughput scale. Desktop simplicity. High-throughput scale. Desktop simplicity. NextSeq 500 System. Flexible power. Speed and simplicity for whole-genome, exome, and transcriptome sequencing. Harness the power of next-generation sequencing.

More information

PHYSICIAN RESOURCES MAPPED TO GENOMICS COMPETENCIES AND GAPS IDENTIFIED WITH CURRENT EDUCATIONAL RESOURCES AVAILABLE 06/04/14

PHYSICIAN RESOURCES MAPPED TO GENOMICS COMPETENCIES AND GAPS IDENTIFIED WITH CURRENT EDUCATIONAL RESOURCES AVAILABLE 06/04/14 PHYSICIAN RESOURCES MAPPED TO GENOMICS COMPETENCIES AND GAPS IDENTIFIED WITH CURRENT EDUCATIONAL RESOURCES AVAILABLE 06/04/14 Submitted resources from physician organization representatives were mapped

More information

AGRO/ANSC/BIO/GENE/HORT 305 Fall, 2016 Overview of Genetics Lecture outline (Chpt 1, Genetics by Brooker) #1

AGRO/ANSC/BIO/GENE/HORT 305 Fall, 2016 Overview of Genetics Lecture outline (Chpt 1, Genetics by Brooker) #1 AGRO/ANSC/BIO/GENE/HORT 305 Fall, 2016 Overview of Genetics Lecture outline (Chpt 1, Genetics by Brooker) #1 - Genetics: Progress from Mendel to DNA: Gregor Mendel, in the mid 19 th century provided the

More information

Data Sources and Biobanks in the Asia-Pacific Region. Wei Zhou, MD, Ph.D. Department of Epidemiology, Merck Research Laboratories October 23, 2014

Data Sources and Biobanks in the Asia-Pacific Region. Wei Zhou, MD, Ph.D. Department of Epidemiology, Merck Research Laboratories October 23, 2014 Data Sources and Biobanks in the Asia-Pacific Region Wei Zhou, MD, Ph.D. Department of Epidemiology, Merck Research Laboratories October 23, 2014 1 Disclosures Wei Zhou is currently an employee of Merck

More information

MedSavant: An open source platform for personal genome interpretation

MedSavant: An open source platform for personal genome interpretation MedSavant: An open source platform for personal genome interpretation Marc Fiume 1, James Vlasblom 2, Ron Ammar 3, Orion Buske 1, Eric Smith 1, Andrew Brook 1, Sergiu Dumitriu 2, Christian R. Marshall

More information

USER MANUAL for the use of the human Genome Clinical Annotation Tool (h-gcat) uthors: Klaas J. Wierenga, MD & Zhijie Jiang, P PhD

USER MANUAL for the use of the human Genome Clinical Annotation Tool (h-gcat) uthors: Klaas J. Wierenga, MD & Zhijie Jiang, P PhD USER MANUAL for the use of the human Genome Clinical Annotation Tool (h-gcat)) Authors: Klaas J. Wierenga, MD & Zhijie Jiang, PhD First edition, May 2013 0 Introduction The Human Genome Clinical Annotation

More information

Advanced Plant Technology Program Vocabulary

Advanced Plant Technology Program Vocabulary Advanced Plant Technology Program Vocabulary A Below you ll find a list of words (and their simplified definitions) that our researchers use on a daily basis. Abiotic stress (noun): Stress brought on by

More information

Technical Review. Real time PCR

Technical Review. Real time PCR Technical Review Real time PCR Normal PCR: Analyze with agarose gel Normal PCR vs Real time PCR Real-time PCR, also known as quantitative PCR (qpcr) or kinetic PCR Key feature: Used to amplify and simultaneously

More information

Introduction to Next Generation Sequencing (NGS) Andrew Parrish Exeter, 2 nd November 2017

Introduction to Next Generation Sequencing (NGS) Andrew Parrish Exeter, 2 nd November 2017 Introduction to Next Generation Sequencing (NGS) Andrew Parrish Exeter, 2 nd November 2017 Topics to cover today What is Next Generation Sequencing (NGS)? Why do we need NGS? Common approaches to NGS NGS

More information

MICROARRAYS+SEQUENCING

MICROARRAYS+SEQUENCING MICROARRAYS+SEQUENCING The most efficient way to advance genomics research Down to a Science. www.affymetrix.com/downtoascience Affymetrix GeneChip Expression Technology Complementing your Next-Generation

More information

Supplementary Figures Montero et al._supplementary Figure 1

Supplementary Figures Montero et al._supplementary Figure 1 Montero et al_suppl. Info 1 Supplementary Figures Montero et al._supplementary Figure 1 Montero et al_suppl. Info 2 Supplementary Figure 1. Transcripts arising from the structurally conserved subtelomeres

More information

Feature Selection of Gene Expression Data for Cancer Classification: A Review

Feature Selection of Gene Expression Data for Cancer Classification: A Review Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 50 (2015 ) 52 57 2nd International Symposium on Big Data and Cloud Computing (ISBCC 15) Feature Selection of Gene Expression

More information

Serial Analysis of Gene Expression

Serial Analysis of Gene Expression Serial Analysis of Gene Expression Cloning of Tissue-Specific Genes Using SAGE and a Novel Computational Substraction Approach. Genomic (2001) Hung-Jui Shih Outline of Presentation SAGE EST Article TPE

More information

IPA Advanced Training Course

IPA Advanced Training Course IPA Advanced Training Course Academia Sinica 2015 Oct Gene( 陳冠文 ) Supervisor and IPA certified analyst 1 Review for Introductory Training course Searching Building a Pathway Editing a Pathway for Publication

More information

University of Groningen. The value of haplotypes Vries, Anne René de

University of Groningen. The value of haplotypes Vries, Anne René de University of Groningen The value of haplotypes Vries, Anne René de IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's PDF) if you wish to cite from it. Please check the document

More information

QuantStudio 3D Digital PCR System

QuantStudio 3D Digital PCR System PRODUCT BULLETIN QuantStudio 3D Digital PCR System QuantStudio 3D Digital PCR System Absolutely attainable digital PCR Simple chip-based workflow no emulsion PCR Affordable low total cost of ownership

More information

http://genemapping.org/ Epistasis in Association Studies David Evans Law of Independent Assortment Biological Epistasis Bateson (99) a masking effect whereby a variant or allele at one locus prevents

More information

The Five Key Elements of a Successful Metabolomics Study

The Five Key Elements of a Successful Metabolomics Study The Five Key Elements of a Successful Metabolomics Study Metabolomics: Completing the Biological Picture Metabolomics is offering new insights into systems biology, empowering biomarker discovery, and

More information

NHS ENGLAND BOARD PAPER

NHS ENGLAND BOARD PAPER NHS ENGLAND BOARD PAPER Paper: PB.30.03.2017/06 Title: Creating a genomic medicine service to lay the foundations to deliver personalised interventions and treatments Lead Director: Professor Sir Bruce

More information

ICH Topic E16 Genomic Biomarkers Related to Drug Response: Context, Structure and Format of Qualification Submissions. Step 3

ICH Topic E16 Genomic Biomarkers Related to Drug Response: Context, Structure and Format of Qualification Submissions. Step 3 European Medicines Agency June 2009 EMEA/CHMP/ICH/380636/2009 ICH Topic E16 Genomic Biomarkers Related to Drug Response: Context, Structure and Format of Qualification Submissions Step 3 NOTE FOR GUIDANCE

More information

Exploring the Genetic Basis of Congenital Heart Defects

Exploring the Genetic Basis of Congenital Heart Defects Exploring the Genetic Basis of Congenital Heart Defects Sanjay Siddhanti Jordan Hannel Vineeth Gangaram szsiddh@stanford.edu jfhannel@stanford.edu vineethg@stanford.edu 1 Introduction The Human Genome

More information

SNP GENOTYPING WITH iplex REAGENTS AND THE MASSARRAY SYSTEM

SNP GENOTYPING WITH iplex REAGENTS AND THE MASSARRAY SYSTEM SNP GENOTYPING Accurate, sensitive, flexible MassARRAY System SNP GENOTYPING WITH iplex REAGENTS AND THE MASSARRAY SYSTEM Biomarker validation Routine genetic testing Somatic mutation profiling Up to 400

More information

Axiom mydesign Custom Array design guide for human genotyping applications

Axiom mydesign Custom Array design guide for human genotyping applications TECHNICAL NOTE Axiom mydesign Custom Genotyping Arrays Axiom mydesign Custom Array design guide for human genotyping applications Overview In the past, custom genotyping arrays were expensive, required

More information

Functional Genomics Overview RORY STARK PRINCIPAL BIOINFORMATICS ANALYST CRUK CAMBRIDGE INSTITUTE 18 SEPTEMBER 2017

Functional Genomics Overview RORY STARK PRINCIPAL BIOINFORMATICS ANALYST CRUK CAMBRIDGE INSTITUTE 18 SEPTEMBER 2017 Functional Genomics Overview RORY STARK PRINCIPAL BIOINFORMATICS ANALYST CRUK CAMBRIDGE INSTITUTE 18 SEPTEMBER 2017 Agenda What is Functional Genomics? RNA Transcription/Gene Expression Measuring Gene

More information

Gene expression connectivity mapping and its application to Cat-App

Gene expression connectivity mapping and its application to Cat-App Gene expression connectivity mapping and its application to Cat-App Shu-Dong Zhang Northern Ireland Centre for Stratified Medicine University of Ulster Outline TITLE OF THE PRESENTATION Gene expression

More information

Legislation and Facilitation of Health Data in Denmark Short update. Mads Melbye, MD, DMSc Statens Serum Institut, Copenhagen, Denmark

Legislation and Facilitation of Health Data in Denmark Short update. Mads Melbye, MD, DMSc Statens Serum Institut, Copenhagen, Denmark Legislation and Facilitation of Health Data in Denmark Short update Mads Melbye, MD, DMSc Statens Serum Institut, Copenhagen, Denmark Legislation short facts Register research: exempted from informed consent

More information

Mapping strategies for sequence reads

Mapping strategies for sequence reads Mapping strategies for sequence reads Ernest Turro University of Cambridge 21 Oct 2013 Quantification A basic aim in genomics is working out the contents of a biological sample. 1. What distinct elements

More information

Data and Metadata Models Recommendations Version 1.2 Developed by the IHEC Metadata Standards Workgroup

Data and Metadata Models Recommendations Version 1.2 Developed by the IHEC Metadata Standards Workgroup Data and Metadata Models Recommendations Version 1.2 Developed by the IHEC Metadata Standards Workgroup 1. Introduction The data produced by IHEC is illustrated in Figure 1. Figure 1. The space of epigenomic

More information

Multi-omics in biology: integration of omics techniques

Multi-omics in biology: integration of omics techniques 31/07/17 Летняя школа по биоинформатике 2017 Multi-omics in biology: integration of omics techniques Konstantin Okonechnikov Division of Pediatric Neurooncology German Cancer Research Center (DKFZ) 2 Short

More information

Association Mapping in Wheat: Issues and Trends

Association Mapping in Wheat: Issues and Trends Association Mapping in Wheat: Issues and Trends Dr. Pawan L. Kulwal Mahatma Phule Agricultural University, Rahuri-413 722 (MS), India Contents Status of AM studies in wheat Comparison with other important

More information

Exam MOL3007 Functional Genomics

Exam MOL3007 Functional Genomics Faculty of Medicine Department of Cancer Research and Molecular Medicine Exam MOL3007 Functional Genomics Tuesday May 29 th 9.00-13.00 ECTS credits: 7.5 Number of pages (included front-page): 5 Supporting

More information

Molecular Markers CRITFC Genetics Workshop December 9, 2014

Molecular Markers CRITFC Genetics Workshop December 9, 2014 Molecular Markers CRITFC Genetics Workshop December 9, 2014 Molecular Markers Tools that allow us to collect information about an individual, a population, or a species Application in fisheries mating

More information

Genetics of human gene expression: mapping DNA variants that influence gene expression

Genetics of human gene expression: mapping DNA variants that influence gene expression Nature Reviews Genetics AOP, published online 28 July 2009; doi:10.1038/nrg2630 REVIEWS Genetics of human gene expression: mapping DNA variants that influence gene expression Vivian G. Cheung* and Richard

More information