NGS Approaches to Epigenomics

Similar documents
Applications of HMMs in Epigenomics

2/19/13. Contents. Applications of HMMs in Epigenomics

2/10/17. Contents. Applications of HMMs in Epigenomics

DNA. bioinformatics. epigenetics methylation structural variation. custom. assembly. gene. tumor-normal. mendelian. BS-seq. prediction.

What Can the Epigenome Teach Us About Cellular States and Diseases?

MODULE TSS1: TRANSCRIPTION START SITES INTRODUCTION (BASIC)

Lecture 9. Eukaryotic gene regulation: DNA METHYLATION

Complete Sample to Analysis Solutions for DNA Methylation Discovery using Next Generation Sequencing

DNA METHYLATION NH 2 H 3. Solutions using bisulfite conversion and immunocapture Ideal for NGS, Sanger sequencing, Pyrosequencing, and qpcr

ChIP-seq/Functional Genomics/Epigenomics. CBSU/3CPG/CVG Next-Gen Sequencing Workshop. Josh Waterfall. March 31, 2010

E i p g i e g n e e n t e i t c i s

Bioinformatics of Transcriptional Regulation

Epigenetics and DNase-Seq

Gene expression analysis. Biosciences 741: Genomics Fall, 2013 Week 5. Gene expression analysis

Functional Genomics Overview RORY STARK PRINCIPAL BIOINFORMATICS ANALYST CRUK CAMBRIDGE INSTITUTE 18 SEPTEMBER 2017

Impact of gdna Integrity on the Outcome of DNA Methylation Studies

What we ll do today. Types of stem cells. Do engineered ips and ES cells have. What genes are special in stem cells?

Do engineered ips and ES cells have similar molecular signatures?

Discovering gene regulatory control using ChIP-chip and ChIP-seq. An introduction to gene regulatory control, concepts and methodologies

Experimental validation of candidates of tissuespecific and CpG-island-mediated alternative polyadenylation in mouse

Introduction to the UCSC genome browser

Why DNA Isn t Your Destiny

What is Bioinformatics?

ChIP-Seq Tools. J Fass UCD Genome Center Bioinformatics Core Wednesday September 16, 2015

ChIP-Seq Data Analysis. J Fass UCD Genome Center Bioinformatics Core Wednesday 15 June 2015

ChIP-Seq Data Analysis. J Fass UCD Genome Center Bioinformatics Core Wednesday December 17, 2014

Discovering gene regulatory control using ChIP-chip and ChIP-seq. Part 1. An introduction to gene regulatory control, concepts and methodologies

Epigenetics. Medical studies in English, Lecture # 12,

Introduction to genome biology

Method development for DNA methylation on a single cell level

NEXT GENERATION SEQUENCING. Farhat Habib

Characterizing DNA binding sites high throughput approaches Biol4230 Tues, April 24, 2018 Bill Pearson Pinn 6-057

Next-generation sequencing technologies

Introduction to human genomics and genome informatics

The ENCODE Encyclopedia. & Variant Annotation Using RegulomeDB and HaploReg

NimbleGen Arrays and LightCycler 480 System: A Complete Workflow for DNA Methylation Biomarker Discovery and Validation.

Chapter 18. Regulation of Gene Expression

Data and Metadata Models Recommendations Version 1.2 Developed by the IHEC Metadata Standards Workgroup

O C. 5 th C. 3 rd C. the national health museum

Transcriptomics. Marta Puig Institut de Biotecnologia i Biomedicina Universitat Autònoma de Barcelona

Deep Sequencing technologies

8.1 Nucleosomes 8.2 Covalent modification of histones 8.3 The histone code 8.4 Decoding proteins 8.5 DNA CpG methylation

Chapter 17 Lecture. Concepts of Genetics. Tenth Edition. Regulation of Gene Expression in Eukaryotes

Non-coding Function & Variation, MPRAs. Mike White Bio5488 3/5/18

Differential Gene Expression

Assay Standards Working Group Nov 2012 Assay Standards Working Group Recommendations, November 2012

What is Epigenetics? Watch the video

elin grundberg JULY 2018 JK - PRESENTATION- 11 MINS [MIXED RESPONDENTS] [Other comments:]

Introduction to Bioinformatics

Unit IX Problem 3 Genetics: Basic Concepts in Molecular Biology

Decoding Chromatin States with Epigenome Data Advanced Topics in Computa8onal Genomics

Whole Transcriptome Analysis of Illumina RNA- Seq Data. Ryan Peters Field Application Specialist

Control of Eukaryotic Gene Expression (Learning Objectives)

Introduction to NGS analyses

DNA METHYLATION RESEARCH TOOLS

CMPS 6630: Introduction to Computational Biology and Bioinformatics. High-Throughput Sequencing and Applications

DNA:CHROMATIN INTERACTIONS

Green Center Computational Core ChIP- Seq Pipeline, Just a Click Away

resequencing storage SNP ncrna metagenomics private trio de novo exome ncrna RNA DNA bioinformatics RNA-seq comparative genomics

ENCODE and modencode Guidelines For Experiments Generating Chromatin and Transcription Factor Data Version 1.0, April 09, 2010

Plant Molecular and Cellular Biology Lecture 9: Nuclear Genome Organization: Chromosome Structure, Chromatin, DNA Packaging, Mitosis Gary Peter

Applications of ChIP. November 05, David Grotsky, PhD Scientific Support Specialist - Epigenetics

The Plant Cell, March 2013, American Society of Plant Biologists. All rights reserved.

02 Agenda Item 03 Agenda Item

Introduction to BIOINFORMATICS

Unit II Problem 3 Genetics: Summary of Basic Concepts in Molecular Biology

Multi-omics in biology: integration of omics techniques

ChIP-seq analysis 2/28/2018

Exploring Genome-wide Organization of Chromatin Structure by ChIP Alon Goren, Ph.D.

Lecture 21: Epigenetics Nurture or Nature? Chromatin DNA methylation Histone Code Twin study X-chromosome inactivation Environemnt and epigenetics

Chromatin. Structure and modification of chromatin. Chromatin domains

Genome 373: High- Throughput DNA Sequencing. Doug Fowler

About Strand NGS. Strand Genomics, Inc All rights reserved.

Unit 7. Genetic Regulation, Development, and Biotechnology. AP Biology

DIAMANTINA INSTITUTE for Cancer, Immunology and Metabolic Medicine

Biology 644: Bioinformatics

Bioinformatics Advice on Experimental Design

Differential Gene Expression

Gene Regulation 10/19/05

GATCGTGCACGATCTCGGCAATTCGGGATGCCGGCTCGTCACCGGTCGCT

Suppl. Table S1. Characteristics of DHS regions analyzed by bisulfite sequencing. No. CpGs analyzed in the amplicon. Genomic location specificity

Introduction to genome biology

Recombinant DNA: Basics and Advanced Applications. Code: ECTS Credits: 9. Degree Type Year Semester

Next- genera*on Sequencing. Lecture 13


Announcement Structure Analysis

11/22/13. Proteomics, functional genomics, and systems biology. Biosciences 741: Genomics Fall, 2013 Week 11

Go to Bottom Left click WashU Epigenome Browser. Click

Integrative Genomics 3b. Systems Biology and Epigenetics

Chapter 18: Regulation of Gene Expression

CELL BIOLOGY - CLUTCH CH. 7 - GENE EXPRESSION.

7. For What Molecule Do Genes Contain The

Control of Eukaryotic Genes. AP Biology

Biologia Cellulare Molecolare Avanzata 2.2. Functional & regulatory genomics module

Shaping Epigenetics Discovery

Chemical and/or enzymatic deamination across various sequencing methods to localize modified cytosines.

SNPs - GWAS - eqtls. Sebastian Schmeier

The ChIP-Seq project. Giovanna Ambrosini, Philipp Bucher. April 19, 2010 Lausanne. EPFL-SV Bucher Group

Value Correct Answer Feedback. Student Response. A. Dicer enzyme. complex. C. the Dicer-RISC complex D. none of the above

V23: the histone code

Transcription:

I519 Introduction to Bioinformatics, 2013 NGS Approaches to Epigenomics Yuzhen Ye (yye@indiana.edu) School of Informatics & Computing, IUB

Contents Background: chromatin structure & DNA methylation Epigenomic projects ENCODE modencode Human epigenome atlas Techniques ChIP-Seq MeDIP (MeDIP-chip & MeDIP-seq) Data analysis Peak detection Decoding problem (I529)

Views of genomes beyond strings of ATCG Genomic DNA is packaged into a complex molecular structure known as chromatin. This structure mediates the interaction between the genome and all types of regulatory and transcriptional molecules. In vertebrate genomes, methylation at position 5 of the cytosine in CpG dinucleotides is a heritable epigenetic mark that has been connected with both transcriptional silencing and imprinting Ref: DNA methylation patterns and epigenetic memory (Genes & Dev. 2002. 16: 6-21 )

A few terminologies Epigenetics the branch of biology which studies the causal interactions between genes and their products which bring the phenotype into being. (Conrad Waddington, 1940s) the study of changes in gene function that are mitotically and/or meiotically heritable and that do not entail a change in DNA sequence. (Epigenetics: Definition, Mechanisms and Clinical Perspective, 2010) An epigenetic trait is a stably heritable phenotype resulting from changes in a chromosome without alterations in the DNA sequence. (An operational definition of epigenetics, 2009) Epigenetic modifications histone variants, posttranslational modifications of amino acids on the amino-terminal tail of histones, and covalent modifications of DNA bases Epigenome refers to the different chromatin states at the whole genome level Epigenomics studies of epigenomes

The epigenetic pathway. Berger S L et al. Genes Dev. 2009;23:781-783 Copyright 2009 by Cold Spring Harbor Laboratory Press

ENCODE Encyclopedia of DNA Elements The ENCODE Consortium is integrating multiple technologies and approaches in a collective effort to discover and define the functional elements encoded in the human genome, including genes, transcripts, and transcriptional regulatory regions, together with their attendant chromatin states and DNA methylation patterns. Ref: A User's Guide to the Encyclopedia of DNA Elements (ENCODE) (PLoS Biology, 2011) Initial phase launched in 2003 1% of the human genome Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project (Nature, June 13, 2007)

Figure 1. The Organization of the ENCODE Consortium. The ENCODE Project Consortium (2011) A User's Guide to the Encyclopedia of DNA Elements (ENCODE). PLoS Biol 9(4): e1001046. doi:10.1371/journal.pbio.1001046 http://www.plosbiology.org/article/info:doi/10.1371/journal.pbio.1001046

Table 1. Experimental assays used by the ENCODE Consortium. The ENCODE Project Consortium (2011) A User's Guide to the Encyclopedia of DNA Elements (ENCODE). PLoS Biol 9(4): e1001046. doi:10.1371/journal.pbio.1001046 http://www.plosbiology.org/article/info:doi/10.1371/journal.pbio.1001046

ENCODE data UCSC Genome Browser on Human Feb. 2009 (GRCh37/hg19) Assembly

Figure 4. ENCODE chromatin annotations in the HLA locus. The ENCODE Project Consortium (2011) A User's Guide to the Encyclopedia of DNA Elements (ENCODE). PLoS Biol 9(4): e1001046. doi:10.1371/journal.pbio.1001046 http://www.plosbiology.org/article/info:doi/10.1371/journal.pbio.1001046

Figure 5. Occupancy of transcription factors and RNA polymerase 2 on human chromosome 6p as determined by ChIP-seq. The ENCODE Project Consortium (2011) A User's Guide to the Encyclopedia of DNA Elements (ENCODE). PLoS Biol 9(4): e1001046. doi:10.1371/journal.pbio.1001046 http://www.plosbiology.org/article/info:doi/10.1371/journal.pbio.1001046

http://www.modencode.org/ modencode The modencode Project will try to identify all of the sequence-based functional elements in the Caenorhabditis elegans and Drosophila melanogaster genomes.

Human epigenome atlas Successive releases of the Atlas will provide progressively more detailed insights into locusspecific epigenomic states, including histone marks and DNA methylation marks across specific tissues and cell types, developmental stages, physiological conditions, genotypes, and disease states.

ChIP-Seq Chromatin immunoprecipitation (ChIP) followed by high-throughput DNA sequencing (ChIP-seq) has become a valuable and widely used approach for mapping the genomic location of transcriptionfactor binding and histone modifications in living cells. Genome-Wide Mapping of in Vivo Protein-DNA Interactions (Science, 2007); 1946 binding sites of the Neuron-restrictive silencer factor (NRSF) were mapped at ~50bp resolution There are considerable differences in how these experiments are conducted, how the results are scored and evaluated for quality, and how the data and metadata are archived for public use. Genome Res. 2012 Sep;22(9):1813-31

Barcoded ChIP-Seq Efficient yeast ChIP-Seq using multiplex short-read DNA sequencing (BMC Genomics 2009, 10:37 )

DNase-seq DNase digestion followed by sequencing. DNase I hypersensitive sites (DHS), short regions of chromatin that are highly sensitive to cleavage by DNase I, typically occur in nucleosome free (nucleosome-depleted) regions as a result of transcription factor binding. DNA sequence motif analysis on DHS data was proposed as a method for discovering the binding sites of multiple transcription factors in a single experiment. DNase-seq profile resemble to some extent the data from ChIP-seq, with important differences (Front. Genet., 31 October 2012)

Genome-wide DNA methylation profiling Restriction enzyme-based methods Use one or more enzymes that will restrict DNA only if it is unmethylated (e.g. HpaII or NotI), or methylated (e.g. McrBC). Limited to the analysis of CpG sites located within the enzyme recognition site(s). Bisulfite-conversion based approaches Unmethylated cytosines are converted to uracil; offer single CpG resolution; the gold standard for DNA methylation analysis Con: reduction of sequence complexity following bisulfite conversion (Bi-chip) & Bi-seq approach is expensive. Align BS-treated reads to a reference genome Immunoprecipitation-based methods Use either 5-methylcytosine-specific antibodies (MeDIP) or methyl-binding domain proteins, to enrich for the methylated (or unmethylated) fraction of the genome.

Methylation analysis by DNA immunoprecipitation (MeDIP) MeDIP can be coupled with microarray or high-throughput sequencing MeDIP-chip MeDIP-seq Image from wikipedia

Long-range chromatin interaction Long-range Chromatin interactions: Chromosome Conformation Capture Carbon Copy (5C) Dostie J et al. Genome Res. 2006;16:1299-1309 Copyright 2006, Cold Spring Harbor Laboratory Press

CHIP-Seq: peak detection

Chromatin-state decoding Automated mapping of large-scale chromatin structure in ENCODE Bioinformatics (2008) 24 (17): 1911-1916. ChromHMM: automating chromatin-state discovery and characterization Nature Methods 9, 215 216 (2012)

NCBI Epigenomics: What s new for 2013