The SMRTer Way: Single Genes to Complex Genomes

Size: px
Start display at page:

Download "The SMRTer Way: Single Genes to Complex Genomes"

Transcription

1 The SMRTer Way: Single Genes to Complex Genomes Ulf Gyllensten, Professor Department of Immunology, Genetics and Pathology, Science for Life Laboratory, Uppsala University, Uppsala, Sweden

2 Topics National Genomics Infrastructure (NGI). PacBio from single genes to complex genomes.

3 National Genomics Infrastructure (NGI) Among the five largest European sequencing centers. Core facility open to Swedish research groups. MPS sequencing, Sanger sequencing and genotyping. Funded as a National Research Infrastructure by SciLifeLab, Swedish Research Council (VR-RFI) and KAW Foundation.

4 MPS technologies at NGI Short-read MPS Long-read MPS

5 Analysis cluster and storage of MPS data From reads to. assembled genomes ~3 M cpuh/month on a dedicated cluster ~7 PB storage. Long-term storage in archive. CPU with extra large memory (2TB)

6 PacBio sequencing at NGI/Uppsala Two Pacific Biosciences RSII systems June 2013 August 2014

7 PacBio Data production in Uppsala

8 Assembly projects BACs, YACs, fosmids, plasmids, Gram positive and negativembacteria Archaea Parasitic protists Fungi (yeasts, mushrooms) Algae Mosses Higher plants Worms Butterflies, Insects Birds Lizards Fish Mammals

9 Applications on PacBio Non-clinical applications Complete genomes BACs/YACs/plasmids 16S rrna Gap filling Whole transcriptome sequencing Isoform discovery Amplicon sequencing Mutation detection Haplotype phasing Target re-sequencing Metagenomics Procaryotic methylation Clinical applications Chronic Myeloid Leukemia Acute Myeloid Leukemia HLA sequencing Repeat expansions Infection screening

10 PacBio applications A. Small genome assembly B. De novo complex genome assembly C. Targeted sequencing

11 Small genome assembly - PacBio the method of choice for small genomes. - Sample quality is crucial. Good quality an (almost) complete genome, poor quality partial or no genome. Example:

12 Complex genomes: De Novo Assembly of Rabbit Genome Two for one genome : Assembly of an F1-hybrid between two subspecies of rabbit. PI: Professor Leif Andersson, Uppsala Aims: Create a New reference assembl/y/ies In depth characterization of loci exhibiting strong allele frequency shifts around hybrid zone between O. c. coniculus and O. c. algirus in Spain. 2 % of genome shows dramatic reduction in ability to spread to other side, rest of genome leaks into other side.

13 The order Lagomorpha consists of two families: Leporidae (hares and rabbits) and the Ochotonidae (pikas) Likely radiated from common ancestor in Asia 60 million years ago European rabbit (Oryctulagus coniculus) and the closest extant species, the hispid hare (Caprolagus hispidus)in South Asia diverged approximately 7-10 million years ago, like most of the Leporidae Lagomorpha Evolutionary History of Lagomorphs in Response to Global Environmental Change, PLoS One, April 2013 Volume 8 Issue 4

14 Origin and domestication of the European rabbit (O. cuniculus) O. c. coniculus Dispersal to southern France O. c. algirus

15 Strategy and challenges 300 SMRT cells (around 200Gb) run in Uppsala O.c.c x O.c.a hybrid 6 BioNano runs (by BioNano) Parents of F1-hybrid sequenced to 30x using PCR free Illumina libraries. BAC-ends and phosmids from Sanger assembly (250k & 2 million respectively) Sanger assembly OryCun2 (2.74 Gb) Falcon diploid assembly attempted Very high heterozygosity!

16 De Novo Assembly of Rabbit using BioNano 6 runs conducted with 400 Gb of molecules >150kb Raw Data (molecules > 150 kb) Initial Assembly High Depth Assembly Data input 184 Gb 367 Gb 367 Gb Stringent Assembly Number of genome maps Assembly size 2.57 Gb 3.76 Gb 4.44 Gb Genome map N Mb 1.4 Mb 1.07 Mb Longest genome map 4.5 Mb 6.4 Mb 6.3 Mb

17 Heterozygous Genome Maps are Produced Ref GM

18 SciLifeLab Whole Human Genome Initiative - WGS of patient cohorts (n=10,000 ind /year). - Establish a Genetic Variant Database for the Swedish Population (n = 1,000).

19 Population genomics projects The 1000 Genomes Project - genomes of 2500 unidentified people from 25 populations Genomics England: 100,000 whole genomes from patients by 2017.

20 The Swedish Genetic Variant Project A. Identify a cohort that reflects the genetic structure of the Swedish population. B. Generate WGS data using short- and long-read MPS technologies. C. Establish a user-friendly database to make information available to the research community (association analyses) and clinical genetics laboratories.

21 The Swedish Twin registry Inclusion based on twinning and distribution like population density. General population-prevalence of any disease. 10,000 individuals have been analysed with SNP arrays. Identify 1,000 individuals based on genetic structure and diversity across Sweden.

22 Principal components of European samples from 1,000 genomes project and 10,000 Swedish samples Finland Northern Sweden Southern- Central Sweden England and Scotland Italy Spain

23 European Individuals samples selected for from WGS 1,000 and 1000 genomes G EUR project and 1,000 selected Swedish samples Main genetic differentiation between Southern - Central and Sweden Northern Northern Sweden Southern Central Sweden

24 Step 1: WGS of Swedish control cohort Short-read Illumina X-Ten data to 30X coverage of the 1,000 individuals. Standard pipeline (GATK) for variant calling (SNP and indels). Construct user-friendly database for the community to make use of the data. Status: Identification of a control cohort Q Short-read MPS Q Data base implementation Q

25 Database for genetic variants CanvasDB (CANdidate Variant Analysis System and Data Base) Stores genetic variants with annotations, such as prediction of the functional consequence. At present the 3.1 billion genetic variants in the 1000 Genomes project. Search time not proportional to database size. Filter tools for analyses of monogenic and complex genetic disease analyses.

26 The Present Human Reference is Not Complete Some regions have been recalcitrant to closure with short-read MPS technology. Structural variation makes it difficult to assemble a truly representative genome. Long-read whole human genome sequencing provide the information.

27 Genome reference standards Platinum genome sequence A contiguous, haplotype resolved representation of the entire genome. Gold genome sequence A high-quality, highly contiguous representation. Silver genome sequence Standards TBD. Non-trio, PB/BN, no Bac library.

28 Gold Genome Sequencing Approach

29 The Human Reference Genomes Project Gold Reference Genomes Platinum Reference Genomes HG00733 HG00514 NA12878 CHM1 CHM13 NA19240 NA19434

30 New Reference Human Genome Sequences Platinum Genomes CHM1 An integrated assembly of Illumina, PacBio, BAC and BioNano data. CHM13 PacBio data assembly + BioNano data. Gold Genomes NA19240 Yoruba trio child; assembly completed. HG Puerto Rican trio child; sequencing in progress. HG00514 Han Chinese trio child, Q NA19434 Luhya (Kenya) trio child, Q

31 Step 2: WGS of Swedish cohort Establish Swedish reference genome sequences by de novo assembly of long-read Pacific Biosciences data (+BN). Ref genome individuals

32 First Swedish PacBio WGS First PacBio Assembly 20 kb library 157 SMRT cells 140 Gb data (~45X) FALCON assembly # of contigs (>=0 bp) 7708 # of contigs (>=1000 bp) 7653 Total length (>=0 bp) 2844 Mb Total length (>=1000 bp) 2844 Mb No of contigs 7692 Largest contig Total contig length N50 N Mb 2844 Mb 4.35 Mb 1.97 Mb

33 Step 3: WGS of Swedish control cohort Targeted long-read sequencing of regions of high medical importance (HLA, Trinucleotide expansion repeats). Resolve structural variation and repeats. Phase variation in repetitive regions and individual alleles. Study the methylation pattern in native DNA.

34 Methods for Targeted PacBio sequencing Long-range PCR. Target enrichment by hybridisation using DNA or RNA probe arrays. Amplification-free targeted enrichment.

35 Long-range PCR: HLA sequencing

36 HLA sequencing workflow 1. LR-PCR Amplification 4. PB Long Amplicon Analysis 2. SMRTbell prep 3. SMRT Sequencing 5. Allele identification (GenDx)

37 Long-range PCR: FADS FADS region has been under selection in human evolution Regulates the production of Omega-3/6 fatty acids (PUFA) Region is associated to many traits and diseases Two main haplotypes in humans: Ancestral and Derived

38 FADS project - functional variant at rs Functional variant for FADS1 expression identified! But is it linked to the Ancestral or Derived haplotype? Pan et al (submitted)

39 PacBio sequencing of FADS region Hybridization capture and pooled sequencing of FADS region: rs Results: AluYe5 rs > 1.2 kb rs Derived haplotype increases FADS1 activity Ancestral haplotype, reduces FADS1 activity

40 Targeted enrichment using DNA probe arrays

41 Targeted enrichment using RNA probes Modified version of PacBio+Agilent protocol

42 Capture of a ~2 kb library Reads mapped back to human genome

43 Off-target capture of gene not in probe design region MIC-B gene is captured because of high similarity to MIC-A! MIC-B MIC_A

44 De novo assembly of captured region A method to resolve structural variations and repeats Repeat length in example: bp Difficult or impossible to resolve with short reads

45 Amplification-free targeted enrichment Using Cas9 for targeting. Sequence native DNA. Compatible with multiple targets: HTT, FMR1, ALS & SCA10 in one reaction. Under development Input DNA SMRTbell library CAS9 targeting Sequencing

46 Technology Waves in Human Genome Analysis Genome Wide Association Studies Exome (Re-) Sequencing Short-read Genome (Re-) Sequencing Comprehensive Short-read Genome (Re-) Sequencing Whole-Genome De Novo Sequencing using long-reads. Jim Lupski: The Goal Is De Novo Assembly in the Clinic

47 What we sequence at NGI /

48 Who does the sequencing? Ulf Gyllensten Platform director Inger Jonasson Facility manager Olga Vinnere Pettersson Project coordinator Adam Ameur Bioinformatician, NGS Ignas Bunikis Bioinformatician, NGS Christian Tellgren-Roth Bioinformatician, NGS Susana Häggqvist Research engineer NGS Ida Höijer Research engineer NGS Cecilia Lindau Research engineer NGS Maria Schenström Research engineer NGS Magdalena Andersson Research engineer NGS Ulrika Broström Research engineer NGS Nina Williams Research engineer NGS Carolina Ilbäck Research engineer NGS Anna Petri Research engineer Sequencing Service Anne-Christine Lindström Research engineer Sequencing Service

49 What we sequence at NGI / THANK YOU

Next Generation Sequencing and Bioinformatics Analysis Pipelines. Adam Ameur National Genomics Infrastructure SciLifeLab Uppsala

Next Generation Sequencing and Bioinformatics Analysis Pipelines. Adam Ameur National Genomics Infrastructure SciLifeLab Uppsala GA N AT ION ALCTAC ATCA G ENOMI C SGT INF R A S T RU CTURE Next Generation Sequencing and Bioinformatics Analysis Pipelines Adam Ameur National Genomics Infrastructure SciLifeLab Uppsala adam.ameur@igp.uu.se

More information

Jenny Gu, PhD Strategic Business Development Manager, PacBio

Jenny Gu, PhD Strategic Business Development Manager, PacBio IDT and PacBio joint presentation Characterizing Alzheimer s Disease candidate genes and transcripts with targeted, long-read, single-molecule sequencing Jenny Gu, PhD Strategic Business Development Manager,

More information

Welcome to the NGS webinar series

Welcome to the NGS webinar series Welcome to the NGS webinar series Webinar 1 NGS: Introduction to technology, and applications NGS Technology Webinar 2 Targeted NGS for Cancer Research NGS in cancer Webinar 3 NGS: Data analysis for genetic

More information

Comprehensive Views of Genetic Diversity with Single Molecule, Real-Time (SMRT) Sequencing

Comprehensive Views of Genetic Diversity with Single Molecule, Real-Time (SMRT) Sequencing Comprehensive Views of Genetic Diversity with Single Molecule, Real-Time (SMRT) Sequencing Alix Kieu Cruse November 2015 For Research Use Only. Not for use in diagnostics procedures. Copyright 2015 by

More information

Next-Generation Sequencing Services à la carte

Next-Generation Sequencing Services à la carte Next-Generation Sequencing Services à la carte www.seqme.eu ngs@seqme.eu SEQme 2017 All rights reserved The trademarks and names of other companies and products mentioned in this brochure are the property

More information

Next Generation Sequencing. Target Enrichment

Next Generation Sequencing. Target Enrichment Next Generation Sequencing Target Enrichment Next Generation Sequencing Your Partner in Every Step from Sample to Data NGS: Revolutionizing Genetic Analysis with Single-Molecule Resolution Next generation

More information

Ion S5 and Ion S5 XL Systems

Ion S5 and Ion S5 XL Systems Ion S5 and Ion S5 XL Systems Targeted sequencing has never been simpler Explore the Ion S5 and Ion S5 XL Systems Adopting next-generation sequencing (NGS) in your lab is now simpler than ever The Ion S5

More information

Sequencing technologies. Jose Blanca COMAV institute bioinf.comav.upv.es

Sequencing technologies. Jose Blanca COMAV institute bioinf.comav.upv.es Sequencing technologies Jose Blanca COMAV institute bioinf.comav.upv.es Outline Sequencing technologies: Sanger 2nd generation sequencing: 3er generation sequencing: 454 Illumina SOLiD Ion Torrent PacBio

More information

Research school methods seminar Genomics and Transcriptomics

Research school methods seminar Genomics and Transcriptomics Research school methods seminar Genomics and Transcriptomics Stephan Klee 19.11.2014 2 3 4 5 Genetics, Genomics what are we talking about? Genetics and Genomics Study of genes Role of genes in inheritence

More information

Targeted Sequencing in the NBS Laboratory

Targeted Sequencing in the NBS Laboratory Targeted Sequencing in the NBS Laboratory Christopher Greene, PhD Newborn Screening and Molecular Biology Branch Division of Laboratory Sciences Gene Sequencing in Public Health Newborn Screening February

More information

Sequencing technologies. Jose Blanca COMAV institute bioinf.comav.upv.es

Sequencing technologies. Jose Blanca COMAV institute bioinf.comav.upv.es Sequencing technologies Jose Blanca COMAV institute bioinf.comav.upv.es Outline Sequencing technologies: Sanger 2nd generation sequencing: 3er generation sequencing: 454 Illumina SOLiD Ion Torrent PacBio

More information

Ion S5 and Ion S5 XL Systems

Ion S5 and Ion S5 XL Systems Ion S5 and Ion S5 XL Systems Targeted sequencing has never been simpler Introducing the Ion S5 and Ion S5 XL systems Now, adopting next-generation sequencing in your lab is simpler than ever. The Ion S5

More information

Applications of PacBio Single Molecule, Real- Time (SMRT) DNA Sequencing

Applications of PacBio Single Molecule, Real- Time (SMRT) DNA Sequencing Applications of PacBio Single Molecule, Real- Time (SMRT) DNA Sequencing Stephen Turner November 5, 2014 FIND MEANING IN COMPLEXITY For Research Use Only. Not for use in diagnostic procedures. Pacific

More information

Next-Generation Sequencing. Technologies

Next-Generation Sequencing. Technologies Next-Generation Next-Generation Sequencing Technologies Sequencing Technologies Nicholas E. Navin, Ph.D. MD Anderson Cancer Center Dept. Genetics Dept. Bioinformatics Introduction to Bioinformatics GS011062

More information

RADSeq Data Analysis. Through STACKS on Galaxy. Yvan Le Bras Anthony Bretaudeau Cyril Monjeaud Gildas Le Corguillé

RADSeq Data Analysis. Through STACKS on Galaxy. Yvan Le Bras Anthony Bretaudeau Cyril Monjeaud Gildas Le Corguillé RADSeq Data Analysis Through STACKS on Galaxy Yvan Le Bras Anthony Bretaudeau Cyril Monjeaud Gildas Le Corguillé RAD sequencing: next-generation tools for an old problem INTRODUCTION source: Karim Gharbi

More information

NGS, a suitable approach for TP53 screening in CLL?

NGS, a suitable approach for TP53 screening in CLL? NGS, a suitable approach for TP53 screening in CLL? Ferran Nadeu 2nd ERIC WORKSHOP ON TP53 ANALYSIS IN CHRONIC LYMPHOCYTIC LEUKEMIA 7-8 November 2017, Stresa (Italy) The Sanger sequencing bottleneck 1

More information

Next Generation Sequencing. Jeroen Van Houdt - Leuven 13/10/2017

Next Generation Sequencing. Jeroen Van Houdt - Leuven 13/10/2017 Next Generation Sequencing Jeroen Van Houdt - Leuven 13/10/2017 Landmarks in DNA sequencing 1953 Discovery of DNA double helix structure 1977 A Maxam and W Gilbert "DNA seq by chemical degradation" F Sanger"DNA

More information

HLA and Next Generation Sequencing it s all about the Data

HLA and Next Generation Sequencing it s all about the Data HLA and Next Generation Sequencing it s all about the Data John Ord, NHSBT Colindale and University of Cambridge BSHI Annual Conference Manchester September 2014 Introduction In 2003 the first full public

More information

Gap Filling for a Human MHC Haplotype Sequence

Gap Filling for a Human MHC Haplotype Sequence American Journal of Life Sciences 2016; 4(6): 146-151 http://www.sciencepublishinggroup.com/j/ajls doi: 10.11648/j.ajls.20160406.12 ISSN: 2328-5702 (Print); ISSN: 2328-5737 (Online) Gap Filling for a Human

More information

High Cross-Platform Genotyping Concordance of Axiom High-Density Microarrays and Eureka Low-Density Targeted NGS Assays

High Cross-Platform Genotyping Concordance of Axiom High-Density Microarrays and Eureka Low-Density Targeted NGS Assays High Cross-Platform Genotyping Concordance of Axiom High-Density Microarrays and Eureka Low-Density Targeted NGS Assays Ali Pirani and Mohini A Patil ISAG July 2017 The world leader in serving science

More information

Gene Regulation Solutions. Microarrays and Next-Generation Sequencing

Gene Regulation Solutions. Microarrays and Next-Generation Sequencing Gene Regulation Solutions Microarrays and Next-Generation Sequencing Gene Regulation Solutions The Microarrays Advantage Microarrays Lead the Industry in: Comprehensive Content SurePrint G3 Human Gene

More information

Course Overview: Mutation Detection Using Massively Parallel Sequencing

Course Overview: Mutation Detection Using Massively Parallel Sequencing Course Overview: Mutation Detection Using Massively Parallel Sequencing From Data Generation to Variant Annotation Eliot Shearer The Iowa Initiative in Human Genetics Bioinformatics Short Course 2012 August

More information

Next Gen Sequencing. Expansion of sequencing technology. Contents

Next Gen Sequencing. Expansion of sequencing technology. Contents Next Gen Sequencing Contents 1 Expansion of sequencing technology 2 The Next Generation of Sequencing: High-Throughput Technologies 3 High Throughput Sequencing Applied to Genome Sequencing (TEDed CC BY-NC-ND

More information

IMGM Laboratories GmbH. Sales Manager

IMGM Laboratories GmbH. Sales Manager IMGM Laboratories GmbH Dr. Jennifer K. Kuhn Sales Manager About IMGM Laboratories IMGM Laboratories was founded in 2001 IMGM operates as professional provider of advanced genomic services from research

More information

ACCEL-NGS 2S DNA LIBRARY KITS

ACCEL-NGS 2S DNA LIBRARY KITS ACCEL-NGS 2S DNA LIBRARY KITS Accel-NGS 2S DNA Library Kits produce high quality libraries with an all-inclusive, easy-to-use format. The kits contain all reagents necessary to build high complexity libraries

More information

Cancer Genetics Solutions

Cancer Genetics Solutions Cancer Genetics Solutions Cancer Genetics Solutions Pushing the Boundaries in Cancer Genetics Cancer is a formidable foe that presents significant challenges. The complexity of this disease can be daunting

More information

Introduction to Next Generation Sequencing (NGS)

Introduction to Next Generation Sequencing (NGS) Introduction to Next eneration Sequencing (NS) Simon Rasmussen Assistant Professor enter for Biological Sequence analysis Technical University of Denmark 2012 Today 9.00-9.45: Introduction to NS, How it

More information

NGI Seminar Series- Epigenetics

NGI Seminar Series- Epigenetics GI Seminar Series- Epigenetics Introduction to GS and genotyping techniques Jessica ordlund, PhD Head of Research and Development SP&SEQ Technology Platform ational Genomics Infrastructure Uppsala University

More information

Bioinformatics Advice on Experimental Design

Bioinformatics Advice on Experimental Design Bioinformatics Advice on Experimental Design Where do I start? Please refer to the following guide to better plan your experiments for good statistical analysis, best suited for your research needs. Statistics

More information

Mate-pair library data improves genome assembly

Mate-pair library data improves genome assembly De Novo Sequencing on the Ion Torrent PGM APPLICATION NOTE Mate-pair library data improves genome assembly Highly accurate PGM data allows for de Novo Sequencing and Assembly For a draft assembly, generate

More information

Introductie en Toepassingen van Next-Generation Sequencing in de Klinische Virologie. Sander van Boheemen Medical Microbiology

Introductie en Toepassingen van Next-Generation Sequencing in de Klinische Virologie. Sander van Boheemen Medical Microbiology Introductie en Toepassingen van Next-Generation Sequencing in de Klinische Virologie Sander van Boheemen Medical Microbiology Next-generation sequencing Next-generation sequencing (NGS), also known as

More information

Nature Biotechnology: doi: /nbt Supplementary Figure 1. Number and length distributions of the inferred fosmids.

Nature Biotechnology: doi: /nbt Supplementary Figure 1. Number and length distributions of the inferred fosmids. Supplementary Figure 1 Number and length distributions of the inferred fosmids. Fosmid were inferred by mapping each pool s sequence reads to hg19. We retained only those reads that mapped to within a

More information

Third Generation Sequencing

Third Generation Sequencing Third Generation Sequencing By Mohammad Hasan Samiee Aref Medical Genetics Laboratory of Dr. Zeinali History of DNA sequencing 1953 : Discovery of DNA structure by Watson and Crick 1973 : First sequence

More information

Sequencing the genomes of Nicotiana sylvestris and Nicotiana tomentosiformis Nicolas Sierro

Sequencing the genomes of Nicotiana sylvestris and Nicotiana tomentosiformis Nicolas Sierro Sequencing the genomes of Nicotiana sylvestris and Nicotiana tomentosiformis Nicolas Sierro Philip Morris International R&D, Philip Morris Products S.A., Neuchatel, Switzerland Introduction Nicotiana sylvestris

More information

Sequence assembly. Jose Blanca COMAV institute bioinf.comav.upv.es

Sequence assembly. Jose Blanca COMAV institute bioinf.comav.upv.es Sequence assembly Jose Blanca COMAV institute bioinf.comav.upv.es Sequencing project Unknown sequence { experimental evidence result read 1 read 4 read 2 read 5 read 3 read 6 read 7 Computational requirements

More information

Target Enrichment Strategies for Next Generation Sequencing

Target Enrichment Strategies for Next Generation Sequencing Target Enrichment Strategies for Next Generation Sequencing Anuj Gupta, PhD Agilent Technologies, New Delhi Genotypic Conference, Sept 2014 NGS Timeline Information burst Nearly 30,000 human genomes sequenced

More information

Sequence Assembly and Alignment. Jim Noonan Department of Genetics

Sequence Assembly and Alignment. Jim Noonan Department of Genetics Sequence Assembly and Alignment Jim Noonan Department of Genetics james.noonan@yale.edu www.yale.edu/noonanlab The assembly problem >>10 9 sequencing reads 36 bp - 1 kb 3 Gb Outline Basic concepts in genome

More information

Introduction to Bioinformatics

Introduction to Bioinformatics Introduction to Bioinformatics Richard Corbett Canada s Michael Smith Genome Sciences Centre Vancouver, British Columbia June 28, 2017 Our mandate is to advance knowledge about cancer and other diseases

More information

DNA. bioinformatics. epigenetics methylation structural variation. custom. assembly. gene. tumor-normal. mendelian. BS-seq. prediction.

DNA. bioinformatics. epigenetics methylation structural variation. custom. assembly. gene. tumor-normal. mendelian. BS-seq. prediction. Epigenomics T TM activation SNP target ncrna validation metagenomics genetics private RRBS-seq de novo trio RIP-seq exome mendelian comparative genomics DNA NGS ChIP-seq bioinformatics assembly tumor-normal

More information

CS273B: Deep Learning in Genomics and Biomedicine. Recitation 1 30/9/2016

CS273B: Deep Learning in Genomics and Biomedicine. Recitation 1 30/9/2016 CS273B: Deep Learning in Genomics and Biomedicine. Recitation 1 30/9/2016 Topics Genetic variation Population structure Linkage disequilibrium Natural disease variants Genome Wide Association Studies Gene

More information

Plasmodium vivax. (Guerra, 2006) (Winzeler, 2008)

Plasmodium vivax. (Guerra, 2006) (Winzeler, 2008) Plasmodium vivax Major cause of malaria outside Africa 25 40% of clinical cases worldwide Not amenable to in vitro culture Interesting biology Hypnozoites: dormant liver stage responsible for relapses

More information

NGS technologies approaches, applications and challenges!

NGS technologies approaches, applications and challenges! www.supagro.fr NGS technologies approaches, applications and challenges! Jean-François Martin Centre de Biologie pour la Gestion des Populations Centre international d études supérieures en sciences agronomiques

More information

A Roadmap to the De-novo Assembly of the Banana Slug Genome

A Roadmap to the De-novo Assembly of the Banana Slug Genome A Roadmap to the De-novo Assembly of the Banana Slug Genome Stefan Prost 1 1 Department of Integrative Biology, University of California, Berkeley, United States of America April 6th-10th, 2015 Outline

More information

March 20-23, 2010 Sacramento, CA

March 20-23, 2010 Sacramento, CA Comparison of Commercially Available Target Enrichment Methods for Next Generation Sequencing with the Illumina Platform March 20-23, 2010 Sacramento, CA Anoja Perera, Scottie Adams, David Bintzler, Kip

More information

Whole genome sequencing in drug discovery research: a one fits all solution?

Whole genome sequencing in drug discovery research: a one fits all solution? Whole genome sequencing in drug discovery research: a one fits all solution? Marc Sultan, September 24th, 2015 Biomarker Development, Translational Medicine, Novartis On behalf of the BMD WGS pilot team:

More information

CM581A2: NEXT GENERATION SEQUENCING PLATFORMS AND LIBRARY GENERATION

CM581A2: NEXT GENERATION SEQUENCING PLATFORMS AND LIBRARY GENERATION CM581A2: NEXT GENERATION SEQUENCING PLATFORMS AND LIBRARY GENERATION Fall 2015 Instructors: Coordinator: Carol Wilusz, Associate Professor MIP, CMB Instructor: Dan Sloan, Assistant Professor, Biology,

More information

Outline. General principles of clonal sequencing Analysis principles Applications CNV analysis Genome architecture

Outline. General principles of clonal sequencing Analysis principles Applications CNV analysis Genome architecture The use of new sequencing technologies for genome analysis Chris Mattocks National Genetics Reference Laboratory (Wessex) NGRL (Wessex) 2008 Outline General principles of clonal sequencing Analysis principles

More information

Complementary Technologies for Precision Genetic Analysis

Complementary Technologies for Precision Genetic Analysis Complementary NGS, CGH and Workflow Featured Publication Zhu, J. et al. Duplication of C7orf58, WNT16 and FAM3C in an obese female with a t(7;22)(q32.1;q11.2) chromosomal translocation and clinical features

More information

Variant detection analysis in the BRCA1/2 genes from Ion torrent PGM data

Variant detection analysis in the BRCA1/2 genes from Ion torrent PGM data Variant detection analysis in the BRCA1/2 genes from Ion torrent PGM data Bruno Zeitouni Bionformatics department of the Institut Curie Inserm U900 Mines ParisTech Ion Torrent User Meeting 2012, October

More information

Personal Genomics Platform White Paper Last Updated November 15, Executive Summary

Personal Genomics Platform White Paper Last Updated November 15, Executive Summary Executive Summary Helix is a personal genomics platform company with a simple but powerful mission: to empower every person to improve their life through DNA. Our platform includes saliva sample collection,

More information

Harnessing the power of RADseq for ecological and evolutionary genomics

Harnessing the power of RADseq for ecological and evolutionary genomics STUDY DESIGNS Harnessing the power of RADseq for ecological and evolutionary genomics Kimberly R. Andrews 1, Jeffrey M. Good 2, Michael R. Miller 3, Gordon Luikart 4 and Paul A. Hohenlohe 5 Abstract High-throughput

More information

SNP calling and VCF format

SNP calling and VCF format SNP calling and VCF format Laurent Falquet, Oct 12 SNP? What is this? A type of genetic variation, among others: Family of Single Nucleotide Aberrations Single Nucleotide Polymorphisms (SNPs) Single Nucleotide

More information

Data Analysis with CASAVA v1.8 and the MiSeq Reporter

Data Analysis with CASAVA v1.8 and the MiSeq Reporter Data Analysis with CASAVA v1.8 and the MiSeq Reporter Eric Smith, PhD Bioinformatics Scientist September 15 th, 2011 2010 Illumina, Inc. All rights reserved. Illumina, illuminadx, Solexa, Making Sense

More information

SeqStudio Genetic Analyzer

SeqStudio Genetic Analyzer SeqStudio Genetic Analyzer Optimized for Sanger sequencing and fragment analysis Easy to use for all levels of experience From a leader in genetic analysis instrumentation, introducing the new Applied

More information

High Throughput Sequencing Technologies. UCD Genome Center Bioinformatics Core Monday 15 June 2015

High Throughput Sequencing Technologies. UCD Genome Center Bioinformatics Core Monday 15 June 2015 High Throughput Sequencing Technologies UCD Genome Center Bioinformatics Core Monday 15 June 2015 Sequencing Explosion www.genome.gov/sequencingcosts http://t.co/ka5cvghdqo Sequencing Explosion 2011 PacBio

More information

De novo Genome Assembly

De novo Genome Assembly De novo Genome Assembly A/Prof Torsten Seemann Winter School in Mathematical & Computational Biology - Brisbane, AU - 3 July 2017 Introduction The human genome has 47 pieces MT (or XY) The shortest piece

More information

What is genetic variation?

What is genetic variation? enetic Variation Applied Computational enomics, Lecture 05 https://github.com/quinlan-lab/applied-computational-genomics Aaron Quinlan Departments of Human enetics and Biomedical Informatics USTAR Center

More information

Biotechnology. DNA Cloning Finding Needles in Haystacks. DNA Sequencing. Genetic Engineering. Gene Therapy

Biotechnology. DNA Cloning Finding Needles in Haystacks. DNA Sequencing. Genetic Engineering. Gene Therapy Biotechnology DNA Cloning Finding Needles in Haystacks DNA Sequencing Genetic Engineering Gene Therapy What is DNA Cloning? Set of methods that uses live cells to make many identical copies of a DNA fragment

More information

Bioinformatics and computational tools

Bioinformatics and computational tools Bioinformatics and computational tools Etienne P. de Villiers (PhD) International Livestock Research Institute Nairobi, Kenya International Livestock Research Institute Nairobi, Kenya ILRI works at the

More information

Modern Epigenomics. Histone Code

Modern Epigenomics. Histone Code Modern Epigenomics Histone Code Ting Wang Department of Genetics Center for Genome Sciences and Systems Biology Washington University Dragon Star 2012 Changchun, China July 2, 2012 DNA methylation + Histone

More information

HLA-Typing Strategies

HLA-Typing Strategies HLA-Typing Strategies Cologne, 13.5.2017 Joannis Mytilineos MD, PhD Department of Transplantation Immunology Institute for Clinical Transfusion Medicine and Immunogenetics German Red Cross Blood Transfusion

More information

Corporate Overview. March 2017

Corporate Overview. March 2017 Corporate Overview March 2017 Bionano Genomics Overview Commercial-stage company developing and selling instruments & consumables for whole genome analysis Addressing the needs for: A better understanding

More information

Analysing genomes and transcriptomes using Illumina sequencing

Analysing genomes and transcriptomes using Illumina sequencing Analysing genomes and transcriptomes using Illumina uencing Dr. Heinz Himmelbauer Centre for Genomic Regulation (CRG) Ultrauencing Unit Barcelona The Sequencing Revolution High-Throughput Sequencing 2000

More information

SMRT Analysis Barcoding Overview

SMRT Analysis Barcoding Overview SMRT Analysis Barcoding Overview Introduction This document is for users with Sequel Systems using SMRT Link v5.0.0 or v5.0.1. This document covers: Barcoding designs, strategies and modes for preparing

More information

Standard Products Nucleic Acid Sample Submission Guideline

Standard Products Nucleic Acid Sample Submission Guideline Standard Products Nucleic Acid Sample Submission Guideline Document NO.: Version NO.: SOP-SMM-028 A1 Effective Date: 2018-1-3 Document NO.:SOP-SMM-028 Version NO.:A1 Page 1 of 19 CONTETS ABOUT THIS GUIDELINE...

More information

High Throughput Sequencing Technologies. J Fass UCD Genome Center Bioinformatics Core Monday June 16, 2014

High Throughput Sequencing Technologies. J Fass UCD Genome Center Bioinformatics Core Monday June 16, 2014 High Throughput Sequencing Technologies J Fass UCD Genome Center Bioinformatics Core Monday June 16, 2014 Sequencing Explosion www.genome.gov/sequencingcosts http://t.co/ka5cvghdqo Sequencing Explosion

More information

High Throughput Sequencing Technologies. J Fass UCD Genome Center Bioinformatics Core Tuesday December 16, 2014

High Throughput Sequencing Technologies. J Fass UCD Genome Center Bioinformatics Core Tuesday December 16, 2014 High Throughput Sequencing Technologies J Fass UCD Genome Center Bioinformatics Core Tuesday December 16, 2014 Sequencing Explosion www.genome.gov/sequencingcosts http://t.co/ka5cvghdqo Sequencing Explosion

More information

The Agilent Technologies SureSelect Platform for Target Enrichment

The Agilent Technologies SureSelect Platform for Target Enrichment The Agilent Technologies SureSelect Platform for Target Enrichment Focus your next-gen sequencing on DNA that matters Kimberly Troutman Field Applications Scientist January 27 th, 2011 Agenda 1 Introduction:

More information

MHC Region. MHC expression: Class I: All nucleated cells and platelets Class II: Antigen presenting cells

MHC Region. MHC expression: Class I: All nucleated cells and platelets Class II: Antigen presenting cells DNA based HLA typing methods By: Yadollah Shakiba, MD, PhD MHC Region MHC expression: Class I: All nucleated cells and platelets Class II: Antigen presenting cells Nomenclature of HLA Alleles Assigned

More information

BENG 183 Trey Ideker. Genome Assembly and Physical Mapping

BENG 183 Trey Ideker. Genome Assembly and Physical Mapping BENG 183 Trey Ideker Genome Assembly and Physical Mapping Reasons for sequencing Complete genome sequencing!!! Resequencing (Confirmatory) E.g., short regions containing single nucleotide polymorphisms

More information

High Throughput Sequencing Technologies. J Fass UCD Genome Center Bioinformatics Core Monday September 15, 2014

High Throughput Sequencing Technologies. J Fass UCD Genome Center Bioinformatics Core Monday September 15, 2014 High Throughput Sequencing Technologies J Fass UCD Genome Center Bioinformatics Core Monday September 15, 2014 Sequencing Explosion www.genome.gov/sequencingcosts http://t.co/ka5cvghdqo Sequencing Explosion

More information

Examples of founding and evolving leading LifeScience companies. November 2016

Examples of founding and evolving leading LifeScience companies. November 2016 Examples of founding and evolving leading LifeScience companies November 2016 Peter Pohl Born: in Salzburg Love: my wife and my two children (age 9&7) Education: Business Adminstration Serial entrepreneur

More information

Human SNP haplotypes. Statistics 246, Spring 2002 Week 15, Lecture 1

Human SNP haplotypes. Statistics 246, Spring 2002 Week 15, Lecture 1 Human SNP haplotypes Statistics 246, Spring 2002 Week 15, Lecture 1 Human single nucleotide polymorphisms The majority of human sequence variation is due to substitutions that have occurred once in the

More information

Genome Assembly. J Fass UCD Genome Center Bioinformatics Core Friday September, 2015

Genome Assembly. J Fass UCD Genome Center Bioinformatics Core Friday September, 2015 Genome Assembly J Fass UCD Genome Center Bioinformatics Core Friday September, 2015 From reads to molecules What s the Problem? How to get the best assemblies for the smallest expense (sequencing) and

More information

de novo sequencing of the sunflower genome

de novo sequencing of the sunflower genome de novo sequencing of the sunflower genome Stéphane Muños LIPM INRA Toulouse stephane.munos@toulouse.inra.fr @stephane_munos @SUNRISE_France 39 Sunflower, an important cropfor Europe Million tons of seed

More information

Practical quality control for whole genome sequencing in clinical microbiology

Practical quality control for whole genome sequencing in clinical microbiology Practical quality control for whole genome sequencing in clinical microbiology John WA Rossen, PhD, MMM Department of Medical Microbiology, University of Groningen, UMCG, Groningen, The Netherlands Disclosure

More information

1.1 Post Run QC Analysis

1.1 Post Run QC Analysis Post Run QC Analysis 100 339 200 01 1. Post Run QC Analysis 1.1 Post Run QC Analysis Welcome to Pacific Biosciences' Post Run QC Analysis Overview. This training module will describe the workflow to assess

More information

SNP GENOTYPING WITH iplex REAGENTS AND THE MASSARRAY SYSTEM

SNP GENOTYPING WITH iplex REAGENTS AND THE MASSARRAY SYSTEM SNP GENOTYPING Accurate, sensitive, flexible MassARRAY System SNP GENOTYPING WITH iplex REAGENTS AND THE MASSARRAY SYSTEM Biomarker validation Routine genetic testing Somatic mutation profiling Up to 400

More information

FGCZ NEWSLETTER FALL Next Generation Sequencing at the Functional Genomics Center Zurich

FGCZ NEWSLETTER FALL Next Generation Sequencing at the Functional Genomics Center Zurich FGCZ NEWSLETTER FALL 2011 newsletter Technologies, Applications, and Access to Support Next Generation Sequencing at the Functional Genomics Center Zurich OVERVIEW 1 NGS AT THE FGCZ Technologies and organization

More information

DNA Collection. Data Quality Control. Whole Genome Amplification. Whole Genome Amplification. Measure DNA concentrations. Pros

DNA Collection. Data Quality Control. Whole Genome Amplification. Whole Genome Amplification. Measure DNA concentrations. Pros DNA Collection Data Quality Control Suzanne M. Leal Baylor College of Medicine sleal@bcm.edu Copyrighted S.M. Leal 2016 Blood samples For unlimited supply of DNA Transformed cell lines Buccal Swabs Small

More information

Digital DNA/RNA sequencing enables highly accurate and sensitive biomarker detection and quantification

Digital DNA/RNA sequencing enables highly accurate and sensitive biomarker detection and quantification Digital DNA/RNA sequencing enables highly accurate and sensitive biomarker detection and quantification Erwin Chen ( 陳立德 ) Technical Product Specialist QIAGEN Taiwan Precision medicine: Right drug, right

More information

Course Presentation. Ignacio Medina Presentation

Course Presentation. Ignacio Medina Presentation Course Index Introduction Agenda Analysis pipeline Some considerations Introduction Who we are Teachers: Marta Bleda: Computational Biologist and Data Analyst at Department of Medicine, Addenbrooke's Hospital

More information

Tracing Your Matrilineal Ancestry: Mitochondrial DNA PCR and Sequencing

Tracing Your Matrilineal Ancestry: Mitochondrial DNA PCR and Sequencing Tracing Your Matrilineal Ancestry: Mitochondrial DNA PCR and Sequencing BABEC s Curriculum Rewrite Curriculum to Align with NGSS Standards NGSS work group Your ideas for incorporating NGSS Feedback from

More information

Péter Antal Ádám Arany Bence Bolgár András Gézsi Gergely Hajós Gábor Hullám Péter Marx András Millinghoffer László Poppe Péter Sárközy BIOINFORMATICS

Péter Antal Ádám Arany Bence Bolgár András Gézsi Gergely Hajós Gábor Hullám Péter Marx András Millinghoffer László Poppe Péter Sárközy BIOINFORMATICS Péter Antal Ádám Arany Bence Bolgár András Gézsi Gergely Hajós Gábor Hullám Péter Marx András Millinghoffer László Poppe Péter Sárközy BIOINFORMATICS The Bioinformatics book covers new topics in the rapidly

More information

Title: High-quality genome assembly of channel catfish, Ictalurus punctatus

Title: High-quality genome assembly of channel catfish, Ictalurus punctatus Author s response to reviews Title: High-quality genome assembly of channel catfish, Ictalurus punctatus Authors: Qiong Shi (shiqiong@genomics.cn) Xiaohui Chen (xhchenffri@hotmail.com) Liqiang Zhong (lqzhongffri@hotmail.com)

More information

Incorporating Molecular ID Technology. Accel-NGS 2S MID Indexing Kits

Incorporating Molecular ID Technology. Accel-NGS 2S MID Indexing Kits Incorporating Molecular ID Technology Accel-NGS 2S MID Indexing Kits Molecular Identifiers (MIDs) MIDs are indices used to label unique library molecules MIDs can assess duplicate molecules in sequencing

More information

The first and only fully-integrated microarray instrument for hands-free array processing

The first and only fully-integrated microarray instrument for hands-free array processing The first and only fully-integrated microarray instrument for hands-free array processing GeneTitan Instrument Transform your lab with a GeneTitan Instrument and experience the unparalleled power of streamlining

More information

Mapping strategies for sequence reads

Mapping strategies for sequence reads Mapping strategies for sequence reads Ernest Turro University of Cambridge 21 Oct 2013 Quantification A basic aim in genomics is working out the contents of a biological sample. 1. What distinct elements

More information

Corporate Overview of BioNano Genomics, Inc. September 2016

Corporate Overview of BioNano Genomics, Inc. September 2016 Corporate Overview of BioNano Genomics, Inc. September 2016 BioNano Is the Key to Unlocking the $100+ Billion Potential of the Genomics Market Market Size Growth Catalyst Key Driver Bottleneck $40B- $110B

More information

INTRODUCCIÓ A LES TECNOLOGIES DE 'NEXT GENERATION SEQUENCING'

INTRODUCCIÓ A LES TECNOLOGIES DE 'NEXT GENERATION SEQUENCING' INTRODUCCIÓ A LES TECNOLOGIES DE 'NEXT GENERATION SEQUENCING' Bioinformàtica per a la Recerca Biomèdica Ricardo Gonzalo Sanz ricardo.gonzalo@vhir.org 14/12/2016 1. Introduction to NGS 2. First Generation

More information

Targeted Sequencing Using Droplet-Based Microfluidics. Keith Brown Director, Sales

Targeted Sequencing Using Droplet-Based Microfluidics. Keith Brown Director, Sales Targeted Sequencing Using Droplet-Based Microfluidics Keith Brown Director, Sales brownk@raindancetech.com Who we are: is a Provider of Microdroplet-based Solutions The Company s RainStorm TM Technology

More information

Technical note: Molecular Index counting adjustment methods

Technical note: Molecular Index counting adjustment methods Technical note: Molecular Index counting adjustment methods By Jue Fan, Jennifer Tsai, Eleen Shum Introduction. Overview of BD Precise assays BD Precise assays are fast, high-throughput, next-generation

More information

Variant calling in NGS experiments

Variant calling in NGS experiments Variant calling in NGS experiments Jorge Jiménez jjimeneza@cipf.es BIER CIBERER Genomics Department Centro de Investigacion Principe Felipe (CIPF) (Valencia, Spain) 1 Index 1. NGS workflow 2. Variant calling

More information

Fundamentals of Next-Generation Sequencing: Technologies and Applications

Fundamentals of Next-Generation Sequencing: Technologies and Applications Fundamentals of Next-Generation Sequencing: Technologies and Applications Society for Hematopathology European Association for Haematopathology 2017 Workshop Eric Duncavage, MD Washington University in

More information

CNV and variant detection for human genome resequencing data - for biomedical researchers (II)

CNV and variant detection for human genome resequencing data - for biomedical researchers (II) CNV and variant detection for human genome resequencing data - for biomedical researchers (II) Chuan-Kun Liu 劉傳崑 Senior Maneger National Center for Genome Medican bioit@ncgm.sinica.edu.tw Abstract Common

More information

Structural variation. Marta Puig Institut de Biotecnologia i Biomedicina Universitat Autònoma de Barcelona

Structural variation. Marta Puig Institut de Biotecnologia i Biomedicina Universitat Autònoma de Barcelona Structural variation Marta Puig Institut de Biotecnologia i Biomedicina Universitat Autònoma de Barcelona Genetic variation How much genetic variation is there between individuals? What type of variants

More information

Workshop on Genomics. Český Krumlov Monday, January 7, 13

Workshop on Genomics. Český Krumlov Monday, January 7, 13 Workshop on Genomics Český Krumlov 2013 objectives of this presentation provide some information about the setting and logistics provide some background information about the Workshop help you establish

More information

A near perfect de novo assembly of a eukaryotic genome using sequence reads of greater than 10 kilobases generated by the Pacific Biosciences RS II

A near perfect de novo assembly of a eukaryotic genome using sequence reads of greater than 10 kilobases generated by the Pacific Biosciences RS II A near perfect de novo assembly of a eukaryotic genome using sequence reads of greater than 10 kilobases generated by the Pacific Biosciences RS II W. Richard McCombie Disclosures Introduction to the challenge

More information

De novo whole genome assembly

De novo whole genome assembly De novo whole genome assembly Lecture 1 Qi Sun Minghui Wang Bioinformatics Facility Cornell University DNA Sequencing Platforms Illumina sequencing (100 to 300 bp reads) Overlapping reads ~180bp fragment

More information