Researching Cancer with the minion: Methylation and Structural Variation

Size: px
Start display at page:

Download "Researching Cancer with the minion: Methylation and Structural Variation"

Transcription

1 Agilent Webinar November 2017 Researching Cancer with the minion: Methylation and Structural Variation Winston Timp Department of Biomedical Engineering Johns Hopkins University

2 Nanopore: Single Molecule Sequencing Oxford Nanopore Technologies, CsgG biological pore No theoretical upper limit to sequencing read length, practical limit only in delivering DNA to the pore intact Palm sized sequencer Predicted sequencing output 3-6Gb ATCGATCGATAGTAT TAGATACGACTAGC GATCAG Oxford Nanopore Google Hangout March 2016 Deamer et al 2016, Nature Biotech Disclosure: Timp has two patents (US 2011/ A1; US2012/ A1) licensed to ONT 2

3 Nanopore Sequencing Workflow Four steps to generating usable data with nanopore sequencing Base-calling : the process of converting raw signal into nucleotide sequences Nanopolish : uses alignment and current signal to improve base-calls 3

4 Sequencing Operation Oxford Nanopore Technologies Protein nanopores on a synthetic polymer Multiple base-pairs at a time ( k-mers ) Characteristic current signature is converted to nucleotide sequences 4

5 Nanopore Library Prep Library prep is very similar to methods for short-read sequencing For DNA shearing we used Covaris gtubes or Diagenode Megaruptor After end-repair and A-tailing, leader adapter with motor protein is ligated MinION arrays 512 channels (with 4 pores possible per channel) (shown bottom left from running software); dark green pores are sequencing, light green available, other colors inactive. 5

6 Problems with Nanopore basecalling Multiple bases influence the current passing through the pore. Through simulation with Brownian Dynamics, we calculated the contribution from triplets of DNA in a solid-state nanopore - 64 current levels. Not all of these different currents are distinguishable Comer and Aksimentiev J. of Phys Chem C 116(5) (2012) 6

7 Hidden Markov Model Markov chains represent a series of states occurring in sequence, with defined transition probabilities. In a hidden Markov model the state is only observed indirectly. As an example, consider inferring the weather (without going outside) by observing whether people coming in from outside are wearing gloves or not 7

8 Prior Information for Decoding With no prior information, a given current value may not be called correctly (333pA would be called as GGG) If we know the previous triplet, the next triplet is well defined, leaving only four possibilities, resulting in the correct call of TCG Timp, et al. Biophys J. 30, (2012) 8

9 Nanopore HMM basecalling By using a sequence of observables and maximizing the total joint probability given below, we find the sequence of states. This is done using the Viterbi algorithm which grows, finding the most likely path for each step, saving the probabilities, to avoid recalculation. 1 st generation basecallers from Oxford used a HMM for basecalling similar to the one detailed in our Biophysical paper Transistion probablility matrix for oxford seems to allow for a 0, 1 (most common), 2, or 5 (reset) move. We think that Oxford trained its basecalling model on unmethylated lambda d k Õ t P(I(t) k t ) T (t-1)(t) 9

10 Basecalling shifting to RNN Recently (~6 months) there has been a shift to neural network based basecalling A recurrent neural network is still one with memory, that has a dependence on past computations Specifically two layers of Bidirectional Long Short Term Memory (BLSTM) These still require the same training data to learn what current distributions correspond to which k-mers and the results are still k-mer based, as multiple bases still influence the current. Oxford Nanopore 10

11 Assemblies SPAdes (Illumina only) Canu (Nanopore only) For Research Use Only. Not for use in diagnostic procedures. Long reads really help in getting complete assemblies using twice as many Illumina reads, the N50 is still ~0.3 Mb for Illumina only, while it s full length chromosome from nanopore only 11

12 Pilon Correction Though ONT error rate is improving, still not sufficient from raw basecalled for SNPs Application of Illumina data via pilon allows for correction Multiple rounds of correction needed initially Illumina reads won t align correctly to error prone scaffold Next we are trying to apply nanopolish which returns to the electrical signal to correct nanopore only assemblies 12

13 Nanopolish tools Consensus Calling github.com/jts/nanopolish Reference-based SNP Calling Methylation Detection Read Phasing TAGAAGATATCATGTATAGTACGAT TAGAAGATATCATG TAGCAGATATCATGTATATTACGAT CATGTATATTACGAT 13

14 Nanopolish SNP Calling and Genotyping input: pairs of haplotypes ACTACGATCGAC ACTACGATCGAC ACTACGATCGAC ACTACCATCGAC ACTACCATCGAC ACTACCATCGAC hidden Markov model /1 output: genotype call 14

15 Human Genotyping Results Genotype accuracy at all sites: 99.2% Genotype accuracy at variable sites: 94.8% Jain et al biorxiv

16 Structural Variation Defined as an abnormality in large region (50b-3mb) of a chromosome Pervasive in cancer 50% of pancreatic ductal adenocarcinoma (PDAC) have SVs Common in tumor suppressor genes such as CDKN2A and SMAD4 Short-read sequencing has difficulty resolving SVs, but nanopore sequencing long reads can stretch across them. High coverage needed to detect, but yield from nanopore sequencing still relatively low per flowcell (~3-5Gb) Baker, Monya Nature Methods 16

17 Solution-phase Hybridization Capture Use of Agilent SureSelectXT Targeted Sequencing System in cancer research ~90 bps biotinylated RNA probes complementary to target sequence Biotin-streptavidin interaction to enrich for the targeted region Optimization for long-reads : > 2 kb 17

18 Targeted Capture Optimization Trial 1 Probe tiling, No empty spaces between probes Target region CDKN2A : 1.5 Mbp Low stringency to allow mismatches Result: 2.28 % on-target Trial 2 No tiling, average 400 bp space between probes Target regions CDKN2A : 1.5 Mbps SMAD4 : 850 Kbps High stringency to limit off-target capture Consideration of known SV breakpoints PDAC SVs from James Eshleman lab Result: 30 % on-target Collaboration with Josh Wang from Agilent 18

19 Targeted Sequencing Performance Control : NA12878 lymphoblast Sample : PDAC from Eshleman lab Illumina short-read targeted sequencing for comparison > 300-fold enrichment > 20X average coverage Agilent App Note: Total yield (reads) On-target On-target percentage Fold enrichment Coverage Illumina NA m 3.7m 85% 641X 113X Nanopore NA k 32k 30% 353X 27X Nanopore PDAC 56k 20k 26% 332X 20X 19

20 Nanopore Structural Variation Detection in Cancer Research NA12878 (ENCODE Human lymphoblast cell line) SVs detected with Sniffles (Schatz lab) chr9:21,038,354-21,038,506; 152 bps duplication Validated with PacBio data from Genome in a Bottle (Mt. Sinai School of Medicine) w/josh Wang (Agilent) & Jim Eshleman (JHMI) 20

21 Nanopore Structural Variation Detection in Cancer Research PDAC cell line (Eshleman): Novel, putative SVs detected from PDAC chr18: 51,198,535 51,199,143; 600 bps deletion Possibly allele-specific SV w/josh Wang (Agilent) & Jim Eshleman (JHMI) 21

22 Nanopore Structural Variation Detection in Cancer Research For Research Use Only. Not for use in diagnostic procedures. Large window of coverage; Absence in CDKN2A region chr9:21,950,000-22,436,000; 486 kbps SV Homozygous Deletion previously identified with Illumina data Sniffles did not detect this SV; No reads covering either breakpoint, unlucky coincidence of probes w/josh Wang (Agilent) & Jim Eshleman (JHMI) 22

23 Single Nucleotide Variation Detection in Cancer Research Phased SNV analysis is possible with coverage from targeted sequencing Illumina Pre-polish Post-polish Avg. Coverage Correct Total Precision 94% 60% 93% Sensitivity 32% 69% 26% Number of True SNVs: 3587(Eberle,et al. biorxiv, 2016) 23

24 Epigenetics: Historical The historical definition of epigenetics, by Waddington, is how genotype interacts with the environment to create phenotype. The analogy is the rolling ball in the image to the right the genotype has set up the hills, environment nudges the ball into one valley or another, then the ball is, to mix metaphor, lineage committed. Which valley the ball rolls into is not predetermined, but a stochastic behavior. Waddington, 1940; Raser et al. Science (2005) 24

25 Epigenetics: Modern For Research Use Only. Not for use in diagnostic procedures. Modern Definition of epigenetics involves heritable changes other than genetic sequence, e.g., positive feedback, high order structure, chromatin organization, histone modifications, DNA methylation. An analogy to a computer system: DNA Sequence = Hardware User input = Environment Systems Biology = Running programs Epigenetics = RAM 25

26 Single Read Methylation: Distribution We performed hybridization capture, then Illumina bisulfite sequencing on 6 paired colon cancer and normal samples. We then examined methylation patterns *within reads* and looked at the distribution in normal vs. cancer samples. Colors in the stacked bar graph represent different sequenced samples. Areas are clusters in regions which show significantly different methylation levels (ttest). Number of reads per pattern Number of reads per pattern 26

27 Single Read Methylation: Distribution We performed hybridization capture, then Illumina bisulfite sequencing on 6 paired colon cancer and normal samples. We then examined methylation patterns *within reads* and looked at the distribution in normal vs. cancer samples. Colors in the stacked bar graph represent different sequenced samples. Areas are clusters in regions which show significantly different methylation levels (ttest). Number of reads per pattern Number of reads per pattern 27

28 Single Read Methylation: Distribution We performed hybridization capture, then Illumina bisulfite sequencing on 6 paired colon cancer and normal samples. We then examined methylation patterns *within reads* and looked at the distribution in normal vs. cancer samples. Colors in the stacked bar graph represent different sequenced samples. Areas are clusters in regions which show significantly different methylation levels (ttest). Number of reads per pattern Number of reads per pattern 28

29 Nanopore: Methylation Cytosine methylation increases DNA stiffness, decreasing stretching force This may allow for filtering of methylated versus unmethylated DNA using a pore Mirsaidov, et al. Biophys J. (2009) 29

30 Nanopore: Methylation Differences between methylated and unmethylated cytosine have been detected using nanopores. Methylation state can be called with 90% accuracy. We have implemented a classifier for mc on using Oxford Nanopore signals. Laszlo, et al. PNAS (2013) Schreiber, et al. PNAS. (2013) 30

31 Generation of methylated Samples To generated methylated samples, we treat unmethylated DNA (lambda, dam-/dcm- E. Coli, PCR product) with M. SssI methyltransferase We confirm the CG specific methylation using Illumina bisulfite sequencing of the sample pictured right is methylation in different contexts E. Coli dataset treated with M. SssI (red) versus untreated (green) 31

32 Nanopore: Methylated Error Simpson, Workman, et al. Nature Methods (2017) We sequenced samples with either SssI or no treatment. Notably, mismatch error rate is higher on methylated samples than unmethylated, though indel rate seems mostly unchanged At CG locations, there is not a clear alternative basecalling, though error is higher; Ts are not substituted for methylated Cs 32

33 Emission Probabilities We measured distributions of current for k-mers from E. Coli M.SssI treated (methylated; green) and untreated (unmethylated; red) samples on two different sets of pores - R7.3 and R9 flowcells. Boxplots of AGGTCG and TCGAGT k- mers which both contain CGs show significant differences in current in some cases (AGGTCG R7.3) and little to none in others (TCGAGT R7.3) R9 current distribution seem wider in both cases, but gives better discrimination in TCGAGT. Simpson, Workman, et al. Nature Methods (2017) 33

34 Distance of methylation effect We looked at the difference in current levels dependent on the position of the methylated base plotted are the current differences for R7.3(blue) and R9 pores(orange). Signal seems again stronger but more variable for R9 pores than R7.3 Methylation can either reduce current or increase it. Some positions are more sensitive to methylation than others. Simpson, Workman, Nature Methods (2017) 34

35 Nanopore: nanopolish methyltrain Multiple bases influence the current passing through the pore. Current basecallers use a neuralnetwork based methodology to call bases. We currently use a HMM based classifier to call methylation With nanopolish we can call the probability: Where S m is the probability methylated for a given observable D and S r the probability unmethylated We then take the log of this likelihood ratio. d Õ k P(I(t) k t t ) T (t-1)(t) 35

36 Simpson, Workman, Nature Methods (2017) NA12878 Methylation R7.3 R9 For Research Use Only. Not for use in diagnostic procedures. NA12878 (lymphoblast) gdna: Illumina WGBS on X-axis (24X coverage) (SRA: GSM ) vs. R7.3 (0.02X) or R9 (0.13X) nanopore sequencing. Correlation of 0.83 (R7.3) and 0.84 (R9) most gene promoters unmethylated 36

37 Nanopolish Methylation Methylation comparison between bisulfite and nanopore calling In whole genome Jain et al biorxiv

38 Simpson, Workman, Nature Methods (2017) Binned Methylation vs. Transcription Start Sites On human genomic samples: Binning methylation levels vs. distance to TSS sites, compared to bisulfite data (NA12878). We also generated completely methylated (M.SssI treated; ~95% meth) and unmethylated used to generate the ROC curve (right) R7.3 91% accurate at 68% of sites R9 94% accurate at 77% of sites 38

39 Cancer-Normal Comparison Reduced representation method:12.5mb of the genome (3.5-6kb size selection) We sequenced this fraction on nanopore and bisulfite Illumina seq Long reads measure phased methylation For Research Use Only. Not for use in diagnostic procedures. Simpson, Workman, Nature Methods (2017) 39

40 Haplotype-Phased Methylation nanopolish has experimental support for phasing methylation patterns this haplotype is highly methylated 40

41 Haplotype-Phased Methylation nanopolish has experimental support for phasing methylation patterns this haplotype isn t 41

42 Summary Nanopore technology is full of potential for sequencing, but always choose the right tool for the right job. Often multiple approaches with complementary data yield the best results. Multiple bases affect the electrical signal from nanopores; rather than a problem, this can be an advantage, as each base is interrogated multiple times. Using a hidden Markov model and Viterbi algorithm, we can decode the electrical signal to a DNA base sequence. Long-read sequencing is great at detection of structural variants in cancer samples used in clinical research when coupled with hybridization capture this can be a powerful approach. Modifications to the primary DNA sequence (e.g. cytosine methylation) can be detected directly using nanopores, as compared to the gold standard of chemical modification (bisulfite treatment). 42

43 Acknowledgments Timp Lab Johns Hopkins University Winston Timp, PhD Rachael Workman, MS Stephanie Hao, MS Yunfan Fan Isac Lee Amy Vandiver, MD, PhD Eshleman Lab Johns Hopkins School of Medicine James Eshleman, MD, PhD Alexis Norris, PhD Simner Lab Johns Hopkins School of Medicine Patricia (Trish) Simner, PhD Belita Opene Annie Antar Yehudit Bergman Ontario Institute for Cancer Research Jared Simpson, PhD P.C. Zuzarte, PhD Matei David, PhD L. J. Dursi, PhD Agilent Technologies Josh Wang, PhD Jonathan Levine, PhD 1R01HG A1 43

44 Other mod detection 44

Applications of modification detection in nanopore sequencing

Applications of modification detection in nanopore sequencing Biology of Genomes Oxford Nanopore Workshop Cold Spring Harbor Lab; May 2018 Applications of modification detection in nanopore sequencing Winston Timp Department of Biomedical Engineering Johns Hopkins

More information

Detecting Base Modifications Using Nanopore Sequencing

Detecting Base Modifications Using Nanopore Sequencing Long-Read Sequencing Workshop Jackson Laboratory for Genomic Medicine Farmington, CT; April 2018 Detecting Base Modifications Using Nanopore Sequencing Winston Timp Department of Biomedical Engineering

More information

Bacterial Sequencing and Assembly for Analysis of Antibiotic Resistance Genes and Mutations

Bacterial Sequencing and Assembly for Analysis of Antibiotic Resistance Genes and Mutations Bacterial Sequencing and Assembly for Analysis of Antibiotic Resistance Genes and Mutations Sequencing, Finishing and Analysis in the Future May 24, 2018 Yunfan Fan Department of Biomedical Engineering

More information

Nature Methods: doi: /nmeth Supplementary Figure 1. Position dependence of methylation parameter shift.

Nature Methods: doi: /nmeth Supplementary Figure 1. Position dependence of methylation parameter shift. Supplementary Figure 1 Position dependence of methylation parameter shift. A histogram of the difference between the trained mean for the methylated R7.3 (left column; PCR+M.SssI-R7.3-timp-021216) and

More information

Machine Learning. HMM applications in computational biology

Machine Learning. HMM applications in computational biology 10-601 Machine Learning HMM applications in computational biology Central dogma DNA CCTGAGCCAACTATTGATGAA transcription mrna CCUGAGCCAACUAUUGAUGAA translation Protein PEPTIDE 2 Biological data is rapidly

More information

Nature Biotechnology: doi: /nbt Supplementary Figure 1. Read Complexity

Nature Biotechnology: doi: /nbt Supplementary Figure 1. Read Complexity Supplementary Figure 1 Read Complexity A) Density plot showing the percentage of read length masked by the dust program, which identifies low-complexity sequence (simple repeats). Scrappie outputs a significantly

More information

DNA METHYLATION RESEARCH TOOLS

DNA METHYLATION RESEARCH TOOLS SeqCap Epi Enrichment System Revolutionize your epigenomic research DNA METHYLATION RESEARCH TOOLS Methylated DNA The SeqCap Epi System is a set of target enrichment tools for DNA methylation assessment

More information

TruSPAdes: analysis of variations using TruSeq Synthetic Long Reads (TSLR)

TruSPAdes: analysis of variations using TruSeq Synthetic Long Reads (TSLR) tru TruSPAdes: analysis of variations using TruSeq Synthetic Long Reads (TSLR) Anton Bankevich Center for Algorithmic Biotechnology, SPbSU Sequencing costs 1. Sequencing costs do not follow Moore s law

More information

AUDREY FARBOS JEREMIE POSCHMANN PAUL O NEILL KONRAD PASZKIEWICZ KAREN MOORE

AUDREY FARBOS JEREMIE POSCHMANN PAUL O NEILL KONRAD PASZKIEWICZ KAREN MOORE We provide: AUDREY FARBOS JEREMIE POSCHMANN PAUL O NEILL KONRAD PASZKIEWICZ KAREN MOORE State of the art genomics and bioinformatics analysis Training in experimental techniques, analysis and modelling

More information

Get to Know Your DNA. Every Single Fragment.

Get to Know Your DNA. Every Single Fragment. HaloPlex HS NGS Target Enrichment System Get to Know Your DNA. Every Single Fragment. High sensitivity detection of rare variants using molecular barcodes How Does Molecular Barcoding Work? HaloPlex HS

More information

New Frontiers of Genetic Profiling Achieve Higher Sensitivity and Greater Insights with Molecular Barcodes, Long Read Capture and Optimized Exomes

New Frontiers of Genetic Profiling Achieve Higher Sensitivity and Greater Insights with Molecular Barcodes, Long Read Capture and Optimized Exomes New Frontiers of Genetic Profiling Achieve Higher Sensitivity and Greater Insights with Molecular Barcodes, Long Read Capture and Optimized Exomes Jennifer Jones, PhD Senior Field Application Scientist

More information

APPLICATION NOTE

APPLICATION NOTE APPLICATION NOTE www.swiftbiosci.com NGS Library Preparation Produces Balanced, Comprehensive Methylome Coverage from Low Input Quantities ABSTRACT Next-generation sequencing (NGS) of bisulfite-converted

More information

Complete Sample to Analysis Solutions for DNA Methylation Discovery using Next Generation Sequencing

Complete Sample to Analysis Solutions for DNA Methylation Discovery using Next Generation Sequencing Complete Sample to Analysis Solutions for DNA Methylation Discovery using Next Generation Sequencing SureSelect Human/Mouse Methyl-Seq Kyeong Jeong PhD February 5, 2013 CAG EMEAI DGG/GSD/GFO Agilent Restricted

More information

Impact of gdna Integrity on the Outcome of DNA Methylation Studies

Impact of gdna Integrity on the Outcome of DNA Methylation Studies Impact of gdna Integrity on the Outcome of DNA Methylation Studies Application Note Nucleic Acid Analysis Authors Emily Putnam, Keith Booher, and Xueguang Sun Zymo Research Corporation, Irvine, CA, USA

More information

Supplementary Figures

Supplementary Figures Supplementary Figures A B Supplementary Figure 1. Examples of discrepancies in predicted and validated breakpoint coordinates. A) Most frequently, predicted breakpoints were shifted relative to those derived

More information

DNA. bioinformatics. epigenetics methylation structural variation. custom. assembly. gene. tumor-normal. mendelian. BS-seq. prediction.

DNA. bioinformatics. epigenetics methylation structural variation. custom. assembly. gene. tumor-normal. mendelian. BS-seq. prediction. Epigenomics T TM activation SNP target ncrna validation metagenomics genetics private RRBS-seq de novo trio RIP-seq exome mendelian comparative genomics DNA NGS ChIP-seq bioinformatics assembly tumor-normal

More information

Next-Generation Sequencing. Technologies

Next-Generation Sequencing. Technologies Next-Generation Next-Generation Sequencing Technologies Sequencing Technologies Nicholas E. Navin, Ph.D. MD Anderson Cancer Center Dept. Genetics Dept. Bioinformatics Introduction to Bioinformatics GS011062

More information

2/19/13. Contents. Applications of HMMs in Epigenomics

2/19/13. Contents. Applications of HMMs in Epigenomics 2/19/13 I529: Machine Learning in Bioinformatics (Spring 2013) Contents Applications of HMMs in Epigenomics Yuzhen Ye School of Informatics and Computing Indiana University, Bloomington Spring 2013 Background:

More information

Introduction to Bioinformatics

Introduction to Bioinformatics Introduction to Bioinformatics Richard Corbett Canada s Michael Smith Genome Sciences Centre Vancouver, British Columbia June 28, 2017 Our mandate is to advance knowledge about cancer and other diseases

More information

Jared Simpson Ontario Institute for Cancer Research & Department of Computer Science University of Toronto

Jared Simpson Ontario Institute for Cancer Research & Department of Computer Science University of Toronto Portable DNA Sequencing: Analysis Methods and Applications Jared Simpson Ontario Institute for Cancer Research & Department of Computer Science University of Toronto Computing for Medicine November 22nd,

More information

Nanopore Sequencing in the Redwood Genome Project: DNA Extraction to Sequencing Optimization Rachael Workman

Nanopore Sequencing in the Redwood Genome Project: DNA Extraction to Sequencing Optimization Rachael Workman Nanopore Sequencing in the Redwood Genome Project: DNA Extraction to Sequencing Optimization Rachael Workman Timp Lab, Johns Hopkins University, Department of Biomedical Engineering Photo: Beatrix M. Varga

More information

Applications of HMMs in Epigenomics

Applications of HMMs in Epigenomics I529: Machine Learning in Bioinformatics (Spring 2013) Applications of HMMs in Epigenomics Yuzhen Ye School of Informatics and Computing Indiana University, Bloomington Spring 2013 Contents Background:

More information

HaloPlex HS. Get to Know Your DNA. Every Single Fragment. Kevin Poon, Ph.D.

HaloPlex HS. Get to Know Your DNA. Every Single Fragment. Kevin Poon, Ph.D. HaloPlex HS Get to Know Your DNA. Every Single Fragment. Kevin Poon, Ph.D. Sr. Global Product Manager Diagnostics & Genomics Group Agilent Technologies For Research Use Only. Not for Use in Diagnostic

More information

MinION, GridION, how does Nanopore technology meet the needs of our users?

MinION, GridION, how does Nanopore technology meet the needs of our users? MinION, GridION, how does Nanopore technology meet the needs of our users? Journée Long Reads GeT 28 Novembre 2017 Catherine Zanchetta & Maxime Manno get@genotoul.fr @get_genotoul 1 Wet Lab Nanopore technology

More information

Deep Sequencing technologies

Deep Sequencing technologies Deep Sequencing technologies Gabriela Salinas 30 October 2017 Transcriptome and Genome Analysis Laboratory http://www.uni-bc.gwdg.de/index.php?id=709 Microarray and Deep-Sequencing Core Facility University

More information

DNA concentration and purity were initially measured by NanoDrop 2000 and verified on Qubit 2.0 Fluorometer.

DNA concentration and purity were initially measured by NanoDrop 2000 and verified on Qubit 2.0 Fluorometer. DNA Preparation and QC Extraction DNA was extracted from whole blood or flash frozen post-mortem tissue using a DNA mini kit (QIAmp #51104 and QIAmp#51404, respectively) following the manufacturer s recommendations.

More information

NGS Approaches to Epigenomics

NGS Approaches to Epigenomics I519 Introduction to Bioinformatics, 2013 NGS Approaches to Epigenomics Yuzhen Ye (yye@indiana.edu) School of Informatics & Computing, IUB Contents Background: chromatin structure & DNA methylation Epigenomic

More information

DNA Sequencing and Assembly

DNA Sequencing and Assembly DNA Sequencing and Assembly CS 262 Lecture Notes, Winter 2016 February 2nd, 2016 Scribe: Mark Berger Abstract In this lecture, we survey a variety of different sequencing technologies, including their

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION Contents De novo assembly... 2 Assembly statistics for all 150 individuals... 2 HHV6b integration... 2 Comparison of assemblers... 4 Variant calling and genotyping... 4 Protein truncating variants (PTV)...

More information

Genomic resources. for non-model systems

Genomic resources. for non-model systems Genomic resources for non-model systems 1 Genomic resources Whole genome sequencing reference genome sequence comparisons across species identify signatures of natural selection population-level resequencing

More information

Supplementary Information for:

Supplementary Information for: Supplementary Information for: A streamlined and high-throughput targeting approach for human germline and cancer genomes using Oligonucleotide-Selective Sequencing Samuel Myllykangas 1, Jason D. Buenrostro

More information

The Diploid Genome Sequence of an Individual Human

The Diploid Genome Sequence of an Individual Human The Diploid Genome Sequence of an Individual Human Maido Remm Journal Club 12.02.2008 Outline Background (history, assembling strategies) Who was sequenced in previous projects Genome variations in J.

More information

High Throughput Sequencing Technologies. UCD Genome Center Bioinformatics Core Monday 15 June 2015

High Throughput Sequencing Technologies. UCD Genome Center Bioinformatics Core Monday 15 June 2015 High Throughput Sequencing Technologies UCD Genome Center Bioinformatics Core Monday 15 June 2015 Sequencing Explosion www.genome.gov/sequencingcosts http://t.co/ka5cvghdqo Sequencing Explosion 2011 PacBio

More information

Estimation problems in high throughput SNP platforms

Estimation problems in high throughput SNP platforms Estimation problems in high throughput SNP platforms Rob Scharpf Department of Biostatistics Johns Hopkins Bloomberg School of Public Health November, 8 Outline Introduction Introduction What is a SNP?

More information

Genome Resequencing. Rearrangements. SNPs, Indels CNVs. De novo genome Sequencing. Metagenomics. Exome Sequencing. RNA-seq Gene Expression

Genome Resequencing. Rearrangements. SNPs, Indels CNVs. De novo genome Sequencing. Metagenomics. Exome Sequencing. RNA-seq Gene Expression Genome Resequencing De novo genome Sequencing SNPs, Indels CNVs Rearrangements Metagenomics RNA-seq Gene Expression Splice Isoform Abundance High Throughput Short Read Sequencing: Illumina Exome Sequencing

More information

Direct RNA sequencing of human transcripts using the Oxford Nanopore sequencing platform. Rachael Workman The Nanopore RNA Consortium AGBT 2018

Direct RNA sequencing of human transcripts using the Oxford Nanopore sequencing platform. Rachael Workman The Nanopore RNA Consortium AGBT 2018 Direct RNA sequencing of human transcripts using the Oxford Nanopore sequencing platform Rachael Workman The Nanopore RNA Consortium AGBT 2018 Direct RNA sequencing opens new frontiers for transcriptome

More information

02 Agenda Item 03 Agenda Item

02 Agenda Item 03 Agenda Item 01 Agenda Item 02 Agenda Item 03 Agenda Item SOLiD 3 System: Applications Overview April 12th, 2010 Jennifer Stover Field Application Specialist - SOLiD Applications Workflow for SOLiD Application Application

More information

Welcome to the NGS webinar series

Welcome to the NGS webinar series Welcome to the NGS webinar series Webinar 1 NGS: Introduction to technology, and applications NGS Technology Webinar 2 Targeted NGS for Cancer Research NGS in cancer Webinar 3 NGS: Data analysis for genetic

More information

Nature Biotechnology: doi: /nbt Supplementary Figure 1. Number and length distributions of the inferred fosmids.

Nature Biotechnology: doi: /nbt Supplementary Figure 1. Number and length distributions of the inferred fosmids. Supplementary Figure 1 Number and length distributions of the inferred fosmids. Fosmid were inferred by mapping each pool s sequence reads to hg19. We retained only those reads that mapped to within a

More information

Parts of a standard FastQC report

Parts of a standard FastQC report FastQC FastQC, written by Simon Andrews of Babraham Bioinformatics, is a very popular tool used to provide an overview of basic quality control metrics for raw next generation sequencing data. There are

More information

DNA Methylation Sequencing (Bisulfite-seq) Kunde Ramamoorthy Govindarajan (Govind)

DNA Methylation Sequencing (Bisulfite-seq) Kunde Ramamoorthy Govindarajan (Govind) DNA Methylation Sequencing (Bisulfite-seq) Kunde Ramamoorthy Govindarajan (Govind) Overview Importance of DNA methylation Bisulfite-seq Technology Mapping challenges QC & analysis workflow Case study:

More information

Outline General NGS background and terms 11/14/2016 CONFLICT OF INTEREST. HLA region targeted enrichment. NGS library preparation methodologies

Outline General NGS background and terms 11/14/2016 CONFLICT OF INTEREST. HLA region targeted enrichment. NGS library preparation methodologies Eric T. Weimer, PhD, D(ABMLI) Assistant Professor, Pathology & Laboratory Medicine, UNC School of Medicine Director, Molecular Immunology Associate Director, Clinical Flow Cytometry, HLA, and Immunology

More information

Sequencing technologies. Jose Blanca COMAV institute bioinf.comav.upv.es

Sequencing technologies. Jose Blanca COMAV institute bioinf.comav.upv.es Sequencing technologies Jose Blanca COMAV institute bioinf.comav.upv.es Outline Sequencing technologies: Sanger 2nd generation sequencing: 3er generation sequencing: 454 Illumina SOLiD Ion Torrent PacBio

More information

Revolutionize Genomics with SMRT Sequencing. Single Molecule, Real-Time Technology

Revolutionize Genomics with SMRT Sequencing. Single Molecule, Real-Time Technology Revolutionize Genomics with SMRT Sequencing Single Molecule, Real-Time Technology Resolve to Master Complexity Despite large investments in population studies, the heritability of the majority of Mendelian

More information

NUCLEOTIDE RESOLUTION STRUCTURAL VARIATION DETECTION USING NEXT- GENERATION WHOLE GENOME RESEQUENCING

NUCLEOTIDE RESOLUTION STRUCTURAL VARIATION DETECTION USING NEXT- GENERATION WHOLE GENOME RESEQUENCING NUCLEOTIDE RESOLUTION STRUCTURAL VARIATION DETECTION USING NEXT- GENERATION WHOLE GENOME RESEQUENCING Ken Chen, Ph.D. kchen@genome.wustl.edu The Genome Center, Washington University in St. Louis The path

More information

End of the road for short read sequencing. Prof. Graham Taylor PhD FRCPath Guy s, St Thomas s & King s College Hospital

End of the road for short read sequencing. Prof. Graham Taylor PhD FRCPath Guy s, St Thomas s & King s College Hospital End of the road for short read sequencing Prof. Graham Taylor PhD FRCPath Viapath @ Guy s, St Thomas s & King s College Hospital This talk 1. Short Read Sequencing and its Limitations 2. Applications of

More information

Sequencing technologies. Jose Blanca COMAV institute bioinf.comav.upv.es

Sequencing technologies. Jose Blanca COMAV institute bioinf.comav.upv.es Sequencing technologies Jose Blanca COMAV institute bioinf.comav.upv.es Outline Sequencing technologies: Sanger 2nd generation sequencing: 3er generation sequencing: 454 Illumina SOLiD Ion Torrent PacBio

More information

De novo whole genome assembly

De novo whole genome assembly De novo whole genome assembly Qi Sun Bioinformatics Facility Cornell University Sequencing platforms Short reads: o Illumina (150 bp, up to 300 bp) Long reads (>10kb): o PacBio SMRT; o Oxford Nanopore

More information

An Extreme Metabolism: Iso-Seq analysis of the Ruby-Throated Hummingbird

An Extreme Metabolism: Iso-Seq analysis of the Ruby-Throated Hummingbird An Extreme Metabolism: Iso-Seq analysis of the Ruby-Throated Hummingbird Rachael Workman 1, Kenneth Welch 2, G. William Wong 3, Winston Timp 1 1 Department of Biomedical Engineering, Johns Hopkins University

More information

Gene expression analysis. Biosciences 741: Genomics Fall, 2013 Week 5. Gene expression analysis

Gene expression analysis. Biosciences 741: Genomics Fall, 2013 Week 5. Gene expression analysis Gene expression analysis Biosciences 741: Genomics Fall, 2013 Week 5 Gene expression analysis From EST clusters to spotted cdna microarrays Long vs. short oligonucleotide microarrays vs. RT-PCR Methods

More information

ACCEL-NGS 2S DNA LIBRARY KITS

ACCEL-NGS 2S DNA LIBRARY KITS ACCEL-NGS 2S DNA LIBRARY KITS Accel-NGS 2S DNA Library Kits produce high quality libraries with an all-inclusive, easy-to-use format. The kits contain all reagents necessary to build high complexity libraries

More information

SEQUENCING. M Ataei, PhD. Feb 2016

SEQUENCING. M Ataei, PhD. Feb 2016 CLINICAL NEXT GENERATION SEQUENCING M Ataei, PhD Tehran Medical Genetics Laboratory Feb 2016 Overview 2 Background NGS in non-invasive prenatal diagnosis (NIPD) 3 Background Background 4 In the 1970s,

More information

Analytics Behind Genomic Testing

Analytics Behind Genomic Testing A Quick Guide to the Analytics Behind Genomic Testing Elaine Gee, PhD Director, Bioinformatics ARUP Laboratories 1 Learning Objectives Catalogue various types of bioinformatics analyses that support clinical

More information

Bio5488 Practice Midterm (2018) 1. Next-gen sequencing

Bio5488 Practice Midterm (2018) 1. Next-gen sequencing 1. Next-gen sequencing 1. You have found a new strain of yeast that makes fantastic wine. You d like to sequence this strain to ascertain the differences from S. cerevisiae. To accurately call a base pair,

More information

SureSelect XT HS. Target Enrichment

SureSelect XT HS. Target Enrichment SureSelect XT HS Target Enrichment What Is It? SureSelect XT HS joins the SureSelect library preparation reagent family as Agilent s highest sensitivity hybrid capture-based library prep and target enrichment

More information

2/10/17. Contents. Applications of HMMs in Epigenomics

2/10/17. Contents. Applications of HMMs in Epigenomics 2/10/17 I529: Machine Learning in Bioinformatics (Spring 2017) Contents Applications of HMMs in Epigenomics Yuzhen Ye School of Informatics and Computing Indiana University, Bloomington Spring 2017 Background:

More information

Optimizing High Molecular Weight gdna Extraction for Single Molecule Sequencing in the Redwood Genome Project

Optimizing High Molecular Weight gdna Extraction for Single Molecule Sequencing in the Redwood Genome Project Optimizing High Molecular Weight gdna Extraction for Single Molecule Sequencing in the Redwood Genome Project Rachael Workman Timp Lab, Johns Hopkins University, Department of Biomedical Engineering Photo:

More information

Sequencing technologies. Jose Blanca COMAV institute bioinf.comav.upv.es

Sequencing technologies. Jose Blanca COMAV institute bioinf.comav.upv.es Sequencing technologies Jose Blanca COMAV institute bioinf.comav.upv.es Outline Sequencing technologies: Sanger 2nd generation sequencing: 3er generation sequencing: 454 Illumina SOLiD Ion Torrent PacBio

More information

Introduction to RNA-Seq. David Wood Winter School in Mathematics and Computational Biology July 1, 2013

Introduction to RNA-Seq. David Wood Winter School in Mathematics and Computational Biology July 1, 2013 Introduction to RNA-Seq David Wood Winter School in Mathematics and Computational Biology July 1, 2013 Abundance RNA is... Diverse Dynamic Central DNA rrna Epigenetics trna RNA mrna Time Protein Abundance

More information

Transcriptomics analysis with RNA seq: an overview Frederik Coppens

Transcriptomics analysis with RNA seq: an overview Frederik Coppens Transcriptomics analysis with RNA seq: an overview Frederik Coppens Platforms Applications Analysis Quantification RNA content Platforms Platforms Short (few hundred bases) Long reads (multiple kilobases)

More information

High Throughput Sequencing the Multi-Tool of Life Sciences. Lutz Froenicke DNA Technologies and Expression Analysis Cores UCD Genome Center

High Throughput Sequencing the Multi-Tool of Life Sciences. Lutz Froenicke DNA Technologies and Expression Analysis Cores UCD Genome Center High Throughput Sequencing the Multi-Tool of Life Sciences Lutz Froenicke DNA Technologies and Expression Analysis Cores UCD Genome Center Complementary Approaches Illumina Still-imaging of clusters (~1000

More information

Cancer Genetics Solutions

Cancer Genetics Solutions Cancer Genetics Solutions Cancer Genetics Solutions Pushing the Boundaries in Cancer Genetics Cancer is a formidable foe that presents significant challenges. The complexity of this disease can be daunting

More information

Next Gen Sequencing. Expansion of sequencing technology. Contents

Next Gen Sequencing. Expansion of sequencing technology. Contents Next Gen Sequencing Contents 1 Expansion of sequencing technology 2 The Next Generation of Sequencing: High-Throughput Technologies 3 High Throughput Sequencing Applied to Genome Sequencing (TEDed CC BY-NC-ND

More information

Services Presentation Genomics Experts

Services Presentation Genomics Experts Services Presentation Genomics Experts Illumina Seminar Marriott May 11th IntegraGen at a glance Autism Oncology Genomics Services Serves the researcher s most complex needs in genomics The n 1 privately-owned

More information

Current'Advances'in'Sequencing' Technology' James'Gurtowski' Schatz'Lab'

Current'Advances'in'Sequencing' Technology' James'Gurtowski' Schatz'Lab' Current'Advances'in'Sequencing' Technology' James'Gurtowski' Schatz'Lab' Outline' 1. Assembly'Review' 2. Pacbio' Technology'Overview' Data'CharacterisFcs' Algorithms' Results' 'Assemblies' 3. Oxford'Nanopore'

More information

RADSeq Data Analysis. Through STACKS on Galaxy. Yvan Le Bras Anthony Bretaudeau Cyril Monjeaud Gildas Le Corguillé

RADSeq Data Analysis. Through STACKS on Galaxy. Yvan Le Bras Anthony Bretaudeau Cyril Monjeaud Gildas Le Corguillé RADSeq Data Analysis Through STACKS on Galaxy Yvan Le Bras Anthony Bretaudeau Cyril Monjeaud Gildas Le Corguillé RAD sequencing: next-generation tools for an old problem INTRODUCTION source: Karim Gharbi

More information

Sequence Assembly and Alignment. Jim Noonan Department of Genetics

Sequence Assembly and Alignment. Jim Noonan Department of Genetics Sequence Assembly and Alignment Jim Noonan Department of Genetics james.noonan@yale.edu www.yale.edu/noonanlab The assembly problem >>10 9 sequencing reads 36 bp - 1 kb 3 Gb Outline Basic concepts in genome

More information

Structural variation analysis using NGS sequencing

Structural variation analysis using NGS sequencing Structural variation analysis using NGS sequencing Victor Guryev NBIC NGS taskforce meeting April 15th, 2011 Scale of genomic variants Scale 1 bp 10 bp 100 bp 1 kb 10 kb 100 kb 1 Mb Variants SNPs Short

More information

Informatic Issues in Genomics

Informatic Issues in Genomics Informatic Issues in Genomics DEISE PRACE Symposium Barcelona, 10 12 May 2010 Ivo Glynne Gut, PhD Centro Nacional de Analisis Genomico Barcelona Our Objectives Improve the quality of life Understand the

More information

Overview of Next Generation Sequencing technologies. Céline Keime

Overview of Next Generation Sequencing technologies. Céline Keime Overview of Next Generation Sequencing technologies Céline Keime keime@igbmc.fr Next Generation Sequencing < Second generation sequencing < General principle < Sequencing by synthesis - Illumina < Sequencing

More information

Variant Callers. J Fass 24 August 2017

Variant Callers. J Fass 24 August 2017 Variant Callers J Fass 24 August 2017 Variant Types Caller Consistency Pabinger (2014) Briefings Bioinformatics 15:256 Freebayes Bayesian haplotype caller that can call SNPs, short CNVs / duplications,

More information

Chromosome-scale scaffolding of de novo genome assemblies based on chromatin interactions. Supplementary Material

Chromosome-scale scaffolding of de novo genome assemblies based on chromatin interactions. Supplementary Material Chromosome-scale scaffolding of de novo genome assemblies based on chromatin interactions Joshua N. Burton 1, Andrew Adey 1, Rupali P. Patwardhan 1, Ruolan Qiu 1, Jacob O. Kitzman 1, Jay Shendure 1 1 Department

More information

RNA-SEQUENCING ANALYSIS

RNA-SEQUENCING ANALYSIS RNA-SEQUENCING ANALYSIS Joseph Powell SISG- 2018 CONTENTS Introduction to RNA sequencing Data structure Analyses Transcript counting Alternative splicing Allele specific expression Discovery APPLICATIONS

More information

The New Genome Analyzer IIx Delivering more data, faster, and easier than ever before. Jeremy Preston, PhD Marketing Manager, Sequencing

The New Genome Analyzer IIx Delivering more data, faster, and easier than ever before. Jeremy Preston, PhD Marketing Manager, Sequencing The New Genome Analyzer IIx Delivering more data, faster, and easier than ever before Jeremy Preston, PhD Marketing Manager, Sequencing Illumina Genome Analyzer: a Paradigm Shift 2000x gain in efficiency

More information

Integrated NGS Sample Preparation Solutions for Limiting Amounts of RNA and DNA. March 2, Steven R. Kain, Ph.D. ABRF 2013

Integrated NGS Sample Preparation Solutions for Limiting Amounts of RNA and DNA. March 2, Steven R. Kain, Ph.D. ABRF 2013 Integrated NGS Sample Preparation Solutions for Limiting Amounts of RNA and DNA March 2, 2013 Steven R. Kain, Ph.D. ABRF 2013 NuGEN s Core Technologies Selective Sequence Priming Nucleic Acid Amplification

More information

Mate-pair library data improves genome assembly

Mate-pair library data improves genome assembly De Novo Sequencing on the Ion Torrent PGM APPLICATION NOTE Mate-pair library data improves genome assembly Highly accurate PGM data allows for de Novo Sequencing and Assembly For a draft assembly, generate

More information

Agilent NGS Solutions : Addressing Today s Challenges

Agilent NGS Solutions : Addressing Today s Challenges Agilent NGS Solutions : Addressing Today s Challenges Charmian Cher, Ph.D Director, Global Marketing Programs 1 10 years of Next-Gen Sequencing 2003 Completion of the Human Genome Project 2004 Pyrosequencing

More information

RNA sequencing with the MinION at Genoscope

RNA sequencing with the MinION at Genoscope RNA sequencing with the MinION at Genoscope Jean-Marc Aury jmaury@genoscope.cns.fr @J_M_Aury December 13, 2017 RNA workshop, Genoscope Overview Genoscope Overview MinION sequencing at Genoscope RNA-Seq

More information

CS273B: Deep learning for Genomics and Biomedicine

CS273B: Deep learning for Genomics and Biomedicine CS273B: Deep learning for Genomics and Biomedicine Lecture 2: Convolutional neural networks and applications to functional genomics 09/28/2016 Anshul Kundaje, James Zou, Serafim Batzoglou Outline Anatomy

More information

Wet-lab Considerations for Illumina data analysis

Wet-lab Considerations for Illumina data analysis Wet-lab Considerations for Illumina data analysis Based on a presentation by Henriette O Geen Lutz Froenicke DNA Technologies and Expression Analysis Cores UCD Genome Center Complementary Approaches Illumina

More information

Supplemental Figure 1.

Supplemental Figure 1. Supplemental Data. Charron et al. Dynamic landscapes of four histone modifications during de-etiolation in Arabidopsis. Plant Cell (2009). 10.1105/tpc.109.066845 Supplemental Figure 1. Immunodetection

More information

Next-generation sequencing technologies

Next-generation sequencing technologies Next-generation sequencing technologies NGS applications Illumina sequencing workflow Overview Sequencing by ligation Short-read NGS Sequencing by synthesis Illumina NGS Single-molecule approach Long-read

More information

De novo meta-assembly of ultra-deep sequencing data

De novo meta-assembly of ultra-deep sequencing data De novo meta-assembly of ultra-deep sequencing data Hamid Mirebrahim 1, Timothy J. Close 2 and Stefano Lonardi 1 1 Department of Computer Science and Engineering 2 Department of Botany and Plant Sciences

More information

The Genome Analysis Centre. Building Excellence in Genomics and Computa5onal Bioscience

The Genome Analysis Centre. Building Excellence in Genomics and Computa5onal Bioscience Building Excellence in Genomics and Computa5onal Bioscience Resequencing approaches Sarah Ayling Crop Genomics and Diversity sarah.ayling@tgac.ac.uk Why re- sequence plants? To iden

More information

100 Genomes in 100 Days: The Structural Variant Landscape of Tomato Genomes

100 Genomes in 100 Days: The Structural Variant Landscape of Tomato Genomes 100 Genomes in 100 Days: The Structural Variant Landscape of Tomato Genomes Michael Schatz January 15, 2019 PAG2019 Bioinformatics Workshop @mike_schatz Tomato Domestication & Agriculture Tomatoes are

More information

Whole Human Genome Sequencing Report This is a technical summary report for PG DNA

Whole Human Genome Sequencing Report This is a technical summary report for PG DNA Whole Human Genome Sequencing Report This is a technical summary report for PG0002601-DNA Physician and Patient Information Physician name: Vinodh Naraynan Address: Suite 406 222 West Thomas Road Phoenix

More information

Illumina s Suite of Targeted Resequencing Solutions

Illumina s Suite of Targeted Resequencing Solutions Illumina s Suite of Targeted Resequencing Solutions Colin Baron Sr. Product Manager Sequencing Applications 2011 Illumina, Inc. All rights reserved. Illumina, illuminadx, Solexa, Making Sense Out of Life,

More information

Runs of Homozygosity Analysis Tutorial

Runs of Homozygosity Analysis Tutorial Runs of Homozygosity Analysis Tutorial Release 8.7.0 Golden Helix, Inc. March 22, 2017 Contents 1. Overview of the Project 2 2. Identify Runs of Homozygosity 6 Illustrative Example...............................................

More information

Analysis of structural variation. Alistair Ward USTAR Center for Genetic Discovery University of Utah

Analysis of structural variation. Alistair Ward USTAR Center for Genetic Discovery University of Utah Analysis of structural variation Alistair Ward USTAR Center for Genetic Discovery University of Utah What is structural variation? What differentiates SV from short variants? What are the major SV types?

More information

High Throughput Sequencing the Multi-Tool of Life Sciences. Lutz Froenicke DNA Technologies and Expression Analysis Cores UCD Genome Center

High Throughput Sequencing the Multi-Tool of Life Sciences. Lutz Froenicke DNA Technologies and Expression Analysis Cores UCD Genome Center High Throughput Sequencing the Multi-Tool of Life Sciences Lutz Froenicke DNA Technologies and Expression Analysis Cores UCD Genome Center DNA Technologies & Expression Analysis Cores HT Sequencing (Illumina

More information

Data Basics. Josef K Vogt Slides by: Simon Rasmussen Next Generation Sequencing Analysis

Data Basics. Josef K Vogt Slides by: Simon Rasmussen Next Generation Sequencing Analysis Data Basics Josef K Vogt Slides by: Simon Rasmussen 2017 Generalized NGS analysis Sample prep & Sequencing Data size Main data reductive steps SNPs, genes, regions Application Assembly: Compare Raw Pre-

More information

Fast, Accurate and Sensitive DNA Variant Detection from Sanger Sequencing:

Fast, Accurate and Sensitive DNA Variant Detection from Sanger Sequencing: Fast, Accurate and Sensitive DNA Variant Detection from Sanger Sequencing: Patented, Anti-Correlation Technology Provides 99.5% Accuracy & Sensitivity to 5% Variant Knowledge Base and External Annotation

More information

Nature Genetics: doi: /ng Supplementary Figure 1. H3K27ac HiChIP enriches enhancer promoter-associated chromatin contacts.

Nature Genetics: doi: /ng Supplementary Figure 1. H3K27ac HiChIP enriches enhancer promoter-associated chromatin contacts. Supplementary Figure 1 H3K27ac HiChIP enriches enhancer promoter-associated chromatin contacts. (a) Schematic of chromatin contacts captured in H3K27ac HiChIP. (b) Loop call overlap for cohesin HiChIP

More information

Simultaneous profiling of transcriptome and DNA methylome from a single cell

Simultaneous profiling of transcriptome and DNA methylome from a single cell Additional file 1: Supplementary materials Simultaneous profiling of transcriptome and DNA methylome from a single cell Youjin Hu 1, 2, Kevin Huang 1, 3, Qin An 1, Guizhen Du 1, Ganlu Hu 2, Jinfeng Xue

More information

Analysing genomes and transcriptomes using Illumina sequencing

Analysing genomes and transcriptomes using Illumina sequencing Analysing genomes and transcriptomes using Illumina uencing Dr. Heinz Himmelbauer Centre for Genomic Regulation (CRG) Ultrauencing Unit Barcelona The Sequencing Revolution High-Throughput Sequencing 2000

More information

DNA METHYLATION NH 2 H 3. Solutions using bisulfite conversion and immunocapture Ideal for NGS, Sanger sequencing, Pyrosequencing, and qpcr

DNA METHYLATION NH 2 H 3. Solutions using bisulfite conversion and immunocapture Ideal for NGS, Sanger sequencing, Pyrosequencing, and qpcr NH 2 DNA METHYLATION H 3 C Solutions using bisulfite conversion and immunocapture Ideal for NGS, Sanger sequencing, Pyrosequencing, and qpcr NH 2 N N H PAGE 3 Understanding DNA Methylation DNA methylation

More information

Jenny Gu, PhD Strategic Business Development Manager, PacBio

Jenny Gu, PhD Strategic Business Development Manager, PacBio IDT and PacBio joint presentation Characterizing Alzheimer s Disease candidate genes and transcripts with targeted, long-read, single-molecule sequencing Jenny Gu, PhD Strategic Business Development Manager,

More information

Introduction to human genomics and genome informatics

Introduction to human genomics and genome informatics Introduction to human genomics and genome informatics Session 1 Prince of Wales Clinical School Dr Jason Wong ARC Future Fellow Head, Bioinformatics & Integrative Genomics Adult Cancer Program, Lowy Cancer

More information

G E N OM I C S S E RV I C ES

G E N OM I C S S E RV I C ES GENOMICS SERVICES ABOUT T H E N E W YOR K G E NOM E C E N T E R NYGC is an independent non-profit implementing advanced genomic research to improve diagnosis and treatment of serious diseases. Through

More information

The Journey of DNA Sequencing. Chromosomes. What is a genome? Genome size. H. Sunny Sun

The Journey of DNA Sequencing. Chromosomes. What is a genome? Genome size. H. Sunny Sun The Journey of DNA Sequencing H. Sunny Sun What is a genome? Genome is the total genetic complement of a living organism. The nuclear genome comprises approximately 3.2 * 10 9 nucleotides of DNA, divided

More information