What Can the Epigenome Teach Us About Cellular States and Diseases?

Similar documents
NGS Approaches to Epigenomics

What is Epigenetics? Watch the video

The ENCODE Encyclopedia. & Variant Annotation Using RegulomeDB and HaploReg

2/10/17. Contents. Applications of HMMs in Epigenomics

DNA. bioinformatics. epigenetics methylation structural variation. custom. assembly. gene. tumor-normal. mendelian. BS-seq. prediction.

Finding Enrichments of Functional Annotations for Disease- Associated Single-Nucleotide Polymorphisms

Lecture 14. ENCODE. Michael Schatz. March 15, 2017 JHU : Applied Comparative Genomics

Why DNA Isn t Your Destiny

Lecture 9. Eukaryotic gene regulation: DNA METHYLATION

Integrative Genomics 3b. Systems Biology and Epigenetics

Escape from epigenetic silencing of lactase gene expression triggered by a single nucleotide change

Chapter 13 - Regulation of Gene Expression

Multi-omics in biology: integration of omics techniques

CS273B: Deep Learning in Genomics and Biomedicine. Recitation 1 30/9/2016

2/19/13. Contents. Applications of HMMs in Epigenomics

Supplementary Material

Applications of HMMs in Epigenomics

Huijuan Feng, Shining Ma,Chao Ye & Zhixing Feng

E i p g i e g n e e n t e i t c i s

GATCGTGCACGATCTCGGCAATTCGGGATGCCGGCTCGTCACCGGTCGCT

Introduction to human genomics and genome informatics

Introduction to the UCSC genome browser

Introduction to RNA-Seq. David Wood Winter School in Mathematics and Computational Biology July 1, 2013

Reading Between the Genes: Computational Models to Discover Function from Noncoding DNA

Epigenetics and DNase-Seq

Finding Enrichments of Functional Annotations for Disease-Associated Single- Nucleotide Polymorphisms

Frumkin, 2e Part 1: Methods and Paradigms. Chapter 6: Genetics and Environmental Health

MODULE TSS1: TRANSCRIPTION START SITES INTRODUCTION (BASIC)

Reads to Discovery. Visualize Annotate Discover. Small DNA-Seq ChIP-Seq Methyl-Seq. MeDIP-Seq. RNA-Seq. RNA-Seq.

Understanding transcriptional regulation by integrative analysis of transcription factor binding data

Integrating diverse datasets improves developmental enhancer prediction

Workshop during the Pacific Symposium of Biocomputing, Jan 3-7, 2019: Reading between the genes: interpreting non-coding DNA in high-throughput

Nature Genetics: doi: /ng Supplementary Figure 1. H3K27ac HiChIP enriches enhancer promoter-associated chromatin contacts.

Computational Systems Biology Deep Learning in the Life Sciences

Non-coding Function & Variation, MPRAs. Mike White Bio5488 3/5/18

Linking Genetic Variation to Important Phenotypes: SNPs, CNVs, GWAS, and eqtls

Exome Sequencing Exome sequencing is a technique that is used to examine all of the protein-coding regions of the genome.

Linking Genetic Variation to Important Phenotypes: SNPs, CNVs, GWAS, and eqtls

ChroMoS Guide (version 1.2)

Complete Sample to Analysis Solutions for DNA Methylation Discovery using Next Generation Sequencing

Personal and population genomics of human regulatory variation

Modern Epigenomics. Histone Code

2. Outline the levels of DNA packing in the eukaryotic nucleus below next to the diagram provided.

Course Competencies Template - Form 112

Go to Bottom Left click WashU Epigenome Browser. Click

Chapter 11: Regulation of Gene Expression

Chapter 18. Regulation of Gene Expression

SNPs - GWAS - eqtls. Sebastian Schmeier

INTRODUCTION TO MOLECULAR GENETICS. Andrew McQuillin Molecular Psychiatry Laboratory UCL Division of Psychiatry 22 Sept 2017

V23: the histone code

RNA-SEQUENCING ANALYSIS

AGRO/ANSC/BIO/GENE/HORT 305 Fall, 2016 Overview of Genetics Lecture outline (Chpt 1, Genetics by Brooker) #1

Discovering similar (epi)genomics feature patterns in multiple genome browser tracks

Large-scale imputation of epigenomic datasets for systematic annotation of diverse human tissues

Chapter 17 Lecture. Concepts of Genetics. Tenth Edition. Regulation of Gene Expression in Eukaryotes

AP Biology. The BIG Questions. Chapter 19. Prokaryote vs. eukaryote genome. Prokaryote vs. eukaryote genome. Why turn genes on & off?

DOUBLE-STRAND DNA BREAK PREDICTION USING EPIGENOME MARKS AT KILOBASE RESOLUTION

Human Genetic Variation. Ricardo Lebrón Dpto. Genética UGR

CS273B: Deep learning for Genomics and Biomedicine

Advanced Cell Biology. Lecture 17

Functional Genomics Overview RORY STARK PRINCIPAL BIOINFORMATICS ANALYST CRUK CAMBRIDGE INSTITUTE 18 SEPTEMBER 2017

ChIP-Seq, mrna-seq, & Resequencing via the Genboree Workench

Advanced Cell Biology. Lecture 18

Our website:

Great Ideas of Biology

About Strand NGS. Strand Genomics, Inc All rights reserved.

Genetics BOE approved April 15, 2010

Chapter 14: Genes in Action

DNA and Biotechnology Form of DNA Form of DNA Form of DNA Form of DNA Replication of DNA Replication of DNA

Regulation of Gene Expression

Epigenetics. Medical studies in English, Lecture # 12,

Supplemental Figure 1.

Method development for DNA methylation on a single cell level

Nutrigenomics and nutrigenetics are they the keys for healthy nutrition?

Ensembl Funcgen: A Database and API for Epigenomics and Gene Regulation Data.

Unit 8: Genomics Guided Reading Questions (150 pts total)

elin grundberg JULY 2018 JK - PRESENTATION- 11 MINS [MIXED RESPONDENTS] [Other comments:]

Comparative eqtl analyses within and between seven tissue types suggest mechanisms underlying cell type specificity of eqtls

Decoding Chromatin States with Epigenome Data Advanced Topics in Computa8onal Genomics

7. For What Molecule Do Genes Contain The

Satellite Education Workshop (SW4): Epigenomics: Design, Implementation and Analysis for RNA-seq and Methyl-seq Experiments

Next-generation sequencing technologies

Genome 541! Unit 4, lecture 3! Genomics assays

Genome 541 Gene regulation and epigenomics Lecture 3 Integrative analysis of genomics assays

Applied Bioinformatics

CELL BIOLOGY - CLUTCH CH. 7 - GENE EXPRESSION.

Integrating Diverse Datasets Improves Developmental Enhancer Prediction

Chapter 5 DNA and Chromosomes

GENETICS - CLUTCH CH.15 GENOMES AND GENOMICS.

Bioinformatics of Transcriptional Regulation

Epigenetics, Environment and Human Health

Lecture 5: Regulation

Next Generation Genetics: Using deep sequencing to connect phenotype to genotype

Deep learning sequence-based ab initio prediction of variant effects on expression and disease risk

COURSE OUTLINE Biology 103 Molecular Biology and Genetics

res2)=)colocalisation(eqtl_file="icoslg_eqtl.txt",) gwas_file="uc_gwas_icoslg_locus.txt",)p1)=)1e+4,)p2)=)1e+4,)p12)=)1e+5,) region)=)200000)$

DNA & Protein Synthesis. The source and the process!

Bio 311 Learning Objectives

Gene expression analysis. Biosciences 741: Genomics Fall, 2013 Week 5. Gene expression analysis

BIOLOGY. Chapter 16 GenesExpression

Transcription:

What Can the Epigenome Teach Us About Cellular States and Diseases? (a computer scientist s view) Luca Pinello

Outline Epigenetic: the code over the code What can we learn from epigenomic data? Resources for epigenomic data & analysis

Human Genome Project We are what we are thanks to our genes Genes determine: Cellular state Disease Can be used to make diagnosis and design therapies First draft of our genome became available in 2001

Genes are not sufficient to explain the complexity of an organism

We need to learn to read the non-coding part of the genome adapted from: http://www.opont.hu/hirek3.php?k_hirazn=27234&k_hirfl=201112&k_hirkat=7 Gene Regulatory region?

Transcription Factors Transcription factors are proteins that control which genes are turned on or off in the genome Their activity determines how cells function and respond to cellular environments We have many TFs (>1000)

OK, how can I find my spot? Single and multiple alignments, motif search tools

DNA: a protein parking lot organized by sequences? A fundamental question is: is there a natural order dictated by the sequence, or are the binding locations of a protein dictated by other factors?

The Epigenetic revolution

Interest in Epigenetic is still raising

Epigenetics Epigenetic can be used to describe anything other than DNA sequence that influences the development of an organism.

Same genotype different phenotypes inheritance of phenotype is presumably based on epigenetic modifications of the IAP that may include DNA methylation or chromatin packaging Different diets in otherwise identical mice can determine glucose intolerance and obesity risk in offspring

I told you! Lamarck s theory of inheritance of acquired characters Darwin s theory of Natural Selection You can inherit something beyond the DNA sequence!

Epigenetic and gene regulation

Epigenetic and chromatin structure All the cells (almost) of our body share the same genome but have very different gene expression programs. Adapted from: http://jpkc.scu.edu.cn/ywwy/zbsw(e)/edetail12.htm

Chromatin Structure: it s not static! Adapted from: http://alexnabaum.blogspot.com/

The code over the code The chromatin structure and the accessibility are mainly controlled by: 1. Nucleosome positioning 2. DNA methylation 3. Histone modifications Adapted from: The Cell Biology of Stem Cells (2010)

Histone Modifications Specific histone modifications or combinations of modifications confer unique biological functions to the region of the genome associated with them: Gene Enhancer Adapted from Turner, Cell 2002 Heterochromatin

Histone Modifications are not static

Epigenetic (part of) the control logic of the software

Sequence them all! ChIP-seq Transcription Factors Histone modifications, nucleosomes Chromatin remodelers Bisulfite-seq DNA Methylation DNASE-seq Open Chromatin RNA-seq Gene Expression

Epigenetic and diseases? De-regulation of chromatin regulators/modifiers and chromatin structure Use epigenetic information to annotate genetic variants involved in disease

The problem: What are the functional mechanisms underlying genetic variants and epigenetic alterations associated with complex traits and diseases? Genetic Variation Chromosome Rearrangements Insertion Deletions Mutations SNP Non Coding RNA Disease Chromatin Regulators Nucleosome Positioning Dysregulation Histone Modifications Dysregulation DNA Methylation Dysregulation Epigenetic Variation

Epigenetic and Disease Deregulation of chromatin remodelers, modifiers and aberrant pattern of histone modifications Chromatin instability: although epigenetic changes do not alter the sequence of DNA, an altered chromatin can facilitate mutations and erroneous recombination Hyper-methylation of tumor suppressor genes, hypomethylation or hypo-acethylation of oncogenes Loss of Imprinting: activation of the normally silenced allele of an imprinted gene.

Genetic and Epigenetic variation in Cancer Transcription factors Chromatin regulators Suvà et al. Science 2013

GWAS: you can find variants associated with different diseases but

We need to look into the junk Although 88% of trait/disease-associated SNPs (TASs) were intronic (45%) or intergenic (43%), TASs were not overrepresented in introns and were significantly depleted in intergenic regions

Where should we look? adapted from: Paul et al. Bioessays 2014 Correlations between close variants make it difficult to pinpoint the causal one/s.

Where should we look? We can use external functional annotations: Conservation Enhancers Open Chromatin... adapted from: Paul et al. Bioessays 2014 Which one should we use? In which cell type?

Exploit epigenetic variability to highlight functional regions and regulators

Where to focus? 31

Exploit the cross cell-type variability to find interesting regions Cell type 2: What s that? Cell type 1: Boring Cell type 3: Boring

DHS and genetic variants Tissue-selective enrichment of diseaseassociated variants within DHSs Disease-associated variants systematically perturb transcription factor recognition sequences

DHS and partitioning heritability of regulatory variants Across the 11 diseases DNaseI hypersensitivity sites (DHSs) from 217 cell types spanned 16% of imputed SNPs (and 24% of genotyped SNPs) but explained an average of 79% (SE = 8%) of from imputed SNPs (5.1 enrichment; p = 3.7 10 17 )

Exploit the cross cell-type variability to find interesting regions GOAL Find non-redundant cell type specific regulators and functional regions We used 19 ChIP-seq datasets for H3K27me3 from the ENCODE project and validated a novel TF in blood development.

Haystack Pipeline Using this pipeline we predicted and experimentally validated regions and novel transcription factors important in blood development

Haystack integrative analysis H3k27ac and gene expression of lymphoblastoid lines from 19 individuals Data from Kasowski et al, Science 2014

Use Haystack on your data! Exploiting the variability to find non-redundant cell type/sample specific regulators and functional regions A Python package called HAYSTACK implements our pipeline github.com/lucapinello/haystack hub.docker.com/r/lucapinello/haystack_bio/

We have many histone modifications Idea: We need a way to summarize the combinatorial patterns of multiple histone marks

http://compbio.mit.edu/chromhmm/ Scaling up: Chromatin States Chromatin states are defined based on different combinations of histone modifications and correspond to different functional regions The goal is to segment the genome into biologically meaningful units.

How can we learn the combinatorial code? ChromHMM quantifies the presence or absence of each mark in bins of fixed size 1 1 1 1 0 1 0 1 1 1 0 1 0 1 0 1.... 1 1 0 1 Genomic sequence

ChromHMM and segmentation

Chromatin states and diseases Intersection of strong enhancer states with disease-associated SNPs from GWASs shows significant enrichment in relevant cell types

Roadmap Epigenomics

Roadmap Epigenomics

Roadmap Epigenomics

Roadmap Epigenomics

Roadmap Epigenomics

Resources

Roadmap Epigenomics portal http://www.roadmapepigenomics.org

Roadmap Epigenomics portal http://www.roadmapepigenomics.org

ENCODE portal https://www.encodeproject.org/

Blueprint epigenome http://www.blueprint-epigenome.eu BLUEPRINT focuses on haematopoietic cells from healthy and diseased individuals

International Human Epigenome Consortium IHEC makes available comprehensive sets of reference epigenomes relevant to health and disease

Haploreg www.broadinstitute.org/mammals/haploreg/haploreg.php

Regulome DB http://www.regulomedb.org/

http://screen.umassmed.edu/

Interesting directions Single cell epigenomics Genome editing to uncover other uncharacterized/unmarked regulatory elements? (see for example Rajagopa et al Nat Biotech 2016) Epigenetic Wide Association Studies (EWAS)

pinellolab.org Computational positions available* * Boats not included