Nature Methods: doi: /nmeth Supplementary Figure 1. Ideograms showing scaffold boundaries and segmental duplication locations.

Size: px
Start display at page:

Download "Nature Methods: doi: /nmeth Supplementary Figure 1. Ideograms showing scaffold boundaries and segmental duplication locations."

Transcription

1 Supplementary Figure 1 Ideograms showing scaffold boundaries and segmental duplication locations. Blue lines mark the boundaries of scaffolds. Black marks show the locations of segmental duplications. Magenta regions mark unassembled regions around the centromeres and telomeres. Ideogram were generated using The Genome Decoration Page, NCBI

2 Supplementary Figure 2 Architecture of complex regions at the MHC and Amylase loci. (a) MHC region (chr6: Mb). Upper panel: green bar = reference, blue bars = hybrid (bottom). Bottom panel: green phase blocks separated by SNVs in the hybrid in the middle. (b) Amylase region (chr1: Mb). Top panel: green bar = reference, blue bars = assemby. Assembly in the red box expanded to show nicking pattern in 450 kb region (bottom panel). (c) Haplotypes in increasing resolution to show alleles on the same phase block (green line = allele 1, grey line = allele 2).

3 Supplementary Figure 3 Assembly and phasing of Inversion. Inversion at Chr10q Mb with partial haplotyping.

4 Supplementary Table 1. Phasing comparisons. 10XG reads vs * 10XG reads vs hg19 Ref 21 vs Ref 21 vs hg19 Phased SNVs 2,783,119 2,615,312 2,421,740 2,221,354 Values removed 294, , , ,785 Phase block N Mb 16.7 Mb 145 kb 229 kb * Phasing data files are available at

5 Supplementary Table 2. Statistics for 10X-linked and hybrid assemblies built using different fragscaff parameters. In the third fragscaff processing stage, the score cutoff p was set to a range between 19 (the default auto-determined cutoff) and 150. The resulting assemblies were subsequently processed identically, as described in Materials and Methods. The with score cutoff of 36 (automatically determined using parameters j 1 and u 3) was chosen in this study as representing a middle ground with respect to accuracy, longest scaffold size, and overall contiguity. Score cutoff p19 (default) p36 (j1 u3) p50 p75 p100 p150 fragscaff Total contigs scaffolded Total scaffolds created Placement accuracy (%) Orientation accuracy (%) Bases properly placed (%) N50 (Mb) Assembly length (Gb) Longest scaffold (Mb) Number of scaffolds

6 Supplementary Table 3. Software tools used in this study and in related studies. Software/tool Task Settings Availability Reference / PMID SOAPdenovo Illumina short reads /soapdenovo.html Long Ranger (10X Genomics) BWA-MEM FragScaff Lastz Align to ref genome: evaluate accuracy and data visualization. Default Stage1: default Stage 2: default, -C = 10 Stage 3: Various combinations of j, u, and p (final: j=1, u=3, p=default) Accuracy evaluation: --nogapped -- notransition -- exact= identity=95 -- seed=match15 - -twins= ambiguous=n -- match=1,5 -- masking=3 Final alignment: RepeatMasker soft masking, -- format=sam -- nogapped -- notransition -- exact=500 maxwordcount= 90% -- identity=95 -- seed=match15 - -twins= s.com/downloads 10X barcode analysis and filtering, phasing Mapping 10X libraries to shortread-based 10X-linked scaffolding cts/fragscaff/files/ r_lab/dist/readme.lastz /README.lastz a.html n.github.io/ab yss-scaffoldpaper/

7 fragscaff_check Ordering.pl Knickers (BioNano Genomics) Irys Solve software (BioNano Genomics) Samtools Bedtools PhenoGram 10X Genomics Loupe evaluate fragscaff accuracy In silico nicking BioNano de-novo. Hybrid scaffold and genome maps alignment to reference maps File format conversion File format conversion Alignment visualization on ideograms Phasing, SNPs and SVs visualization ambiguous=n -- match=1,5 cts/fragscaff/files/ nickers/knickers.htm aining/docs/irysview_softw are_installation_guide_23. pdf net/ dtools2/releases ware/phenogramdownloads s.com/downloads

Building a platinum human genome assembly from single haplotype human genomes. Karyn Meltz Steinberg PacBio UGM December,

Building a platinum human genome assembly from single haplotype human genomes. Karyn Meltz Steinberg PacBio UGM December, Building a platinum human genome assembly from single haplotype human genomes Karyn Meltz Steinberg PacBio UGM December, 2015 @KMS_Meltzy Single haplotype from hydatidiform mole Enucleated egg (no maternal

More information

Nature Biotechnology: doi: /nbt Supplementary Figure 1. Number and length distributions of the inferred fosmids.

Nature Biotechnology: doi: /nbt Supplementary Figure 1. Number and length distributions of the inferred fosmids. Supplementary Figure 1 Number and length distributions of the inferred fosmids. Fosmid were inferred by mapping each pool s sequence reads to hg19. We retained only those reads that mapped to within a

More information

Bionano Access v1.2 Release Notes

Bionano Access v1.2 Release Notes Bionano Access v1.2 Release Notes Document Number: 30220 Document Revision: A For Research Use Only. Not for use in diagnostic procedures. Copyright 2018 Bionano Genomics, Inc. All Rights Reserved. Table

More information

Chromosome-scale scaffolding of de novo genome assemblies based on chromatin interactions. Supplementary Material

Chromosome-scale scaffolding of de novo genome assemblies based on chromatin interactions. Supplementary Material Chromosome-scale scaffolding of de novo genome assemblies based on chromatin interactions Joshua N. Burton 1, Andrew Adey 1, Rupali P. Patwardhan 1, Ruolan Qiu 1, Jacob O. Kitzman 1, Jay Shendure 1 1 Department

More information

Assemblytics: a web analytics tool for the detection of assembly-based variants Maria Nattestad and Michael C. Schatz

Assemblytics: a web analytics tool for the detection of assembly-based variants Maria Nattestad and Michael C. Schatz Assemblytics: a web analytics tool for the detection of assembly-based variants Maria Nattestad and Michael C. Schatz Table of Contents Supplementary Note 1: Unique Anchor Filtering Supplementary Figure

More information

Bionano Access : Assembly Report Guidelines

Bionano Access : Assembly Report Guidelines Bionano Access : Assembly Report Guidelines Document Number: 30255 Document Revision: A For Research Use Only. Not for use in diagnostic procedures. Copyright 2018 Bionano Genomics Inc. All Rights Reserved

More information

De novo human genome assemblies reveal spectrum of alternative haplotypes in diverse

De novo human genome assemblies reveal spectrum of alternative haplotypes in diverse SUPPLEMENTARY INFORMATION De novo human genome assemblies reveal spectrum of alternative haplotypes in diverse populations Wong et al. The Supplementary Information contains 4 Supplementary Figures, 3

More information

Chang Xu Mohammad R Nezami Ranjbar Zhong Wu John DiCarlo Yexun Wang

Chang Xu Mohammad R Nezami Ranjbar Zhong Wu John DiCarlo Yexun Wang Supplementary Materials for: Detecting very low allele fraction variants using targeted DNA sequencing and a novel molecular barcode-aware variant caller Chang Xu Mohammad R Nezami Ranjbar Zhong Wu John

More information

Implementation and Evaluation of 10X Genomics Chromium technology

Implementation and Evaluation of 10X Genomics Chromium technology Implementation and Evaluation of 10X Genomics Chromium technology Claire Kuchly & Olivier Bouchez 28/11/2017 get@genotoul.fr @get_genotoul 1 Chromium evaluation: pilot phase Platform installed in november

More information

Gap Filling for a Human MHC Haplotype Sequence

Gap Filling for a Human MHC Haplotype Sequence American Journal of Life Sciences 2016; 4(6): 146-151 http://www.sciencepublishinggroup.com/j/ajls doi: 10.11648/j.ajls.20160406.12 ISSN: 2328-5702 (Print); ISSN: 2328-5737 (Online) Gap Filling for a Human

More information

Bionano Solve Theory of Operation: Hybrid Scaffold

Bionano Solve Theory of Operation: Hybrid Scaffold Bionano Solve Theory of Operation: Hybrid Scaffold Document Number: 30073 Document Revision: D1 For Research Use Only. Not for use in diagnostic procedures. Copyright 2018 Bionano Genomics, Inc. All Rights

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION Contents De novo assembly... 2 Assembly statistics for all 150 individuals... 2 HHV6b integration... 2 Comparison of assemblers... 4 Variant calling and genotyping... 4 Protein truncating variants (PTV)...

More information

Bionano Solve Theory of Operation: Variant Annotation Pipeline

Bionano Solve Theory of Operation: Variant Annotation Pipeline Bionano Solve Theory of Operation: Variant Annotation Pipeline Document Number: 30190 Document Revision: B For Research Use Only. Not for use in diagnostic procedures. Copyright 2018 Bionano Genomics,

More information

The Diploid Genome Sequence of an Individual Human

The Diploid Genome Sequence of an Individual Human The Diploid Genome Sequence of an Individual Human Maido Remm Journal Club 12.02.2008 Outline Background (history, assembling strategies) Who was sequenced in previous projects Genome variations in J.

More information

Bionano Solve v3.0.1 Release Notes

Bionano Solve v3.0.1 Release Notes Bionano Solve v3.0.1 Release Notes Document Number: 30157 Document Revision: B For Research Use Only. Not for use in diagnostic procedures. Copyright 2017 Bionano Genomics, Inc. All Rights Reserved. Table

More information

Nature Biotechnology: doi: /nbt Supplementary Figure 1. Read Complexity

Nature Biotechnology: doi: /nbt Supplementary Figure 1. Read Complexity Supplementary Figure 1 Read Complexity A) Density plot showing the percentage of read length masked by the dust program, which identifies low-complexity sequence (simple repeats). Scrappie outputs a significantly

More information

Bionano Access 1.1 Software User Guide

Bionano Access 1.1 Software User Guide Bionano Access 1.1 Software User Guide Document Number: 30142 Document Revision: B For Research Use Only. Not for use in diagnostic procedures. Copyright 2017 Bionano Genomics, Inc. All Rights Reserved.

More information

Supplementary Figure 1 Genotyping by Sequencing (GBS) pipeline used in this study to genotype maize inbred lines. The 14,129 maize inbred lines were

Supplementary Figure 1 Genotyping by Sequencing (GBS) pipeline used in this study to genotype maize inbred lines. The 14,129 maize inbred lines were Supplementary Figure 1 Genotyping by Sequencing (GBS) pipeline used in this study to genotype maize inbred lines. The 14,129 maize inbred lines were processed following GBS experimental design 1 and bioinformatics

More information

Bionano Access 1.0 Software User Guide

Bionano Access 1.0 Software User Guide Bionano Access 1.0 Software User Guide Document Number: 30142 Document Revision: A For Research Use Only. Not for use in diagnostic procedures. Copyright 2017 Bionano Genomics, Inc. All Rights Reserved.

More information

TruSPAdes: analysis of variations using TruSeq Synthetic Long Reads (TSLR)

TruSPAdes: analysis of variations using TruSeq Synthetic Long Reads (TSLR) tru TruSPAdes: analysis of variations using TruSeq Synthetic Long Reads (TSLR) Anton Bankevich Center for Algorithmic Biotechnology, SPbSU Sequencing costs 1. Sequencing costs do not follow Moore s law

More information

Bionano Access v1.1 Release Notes

Bionano Access v1.1 Release Notes Bionano Access v1.1 Release Notes Document Number: 30188 Document Revision: C For Research Use Only. Not for use in diagnostic procedures. Copyright 2017 Bionano Genomics, Inc. All Rights Reserved. Table

More information

SNP calling and VCF format

SNP calling and VCF format SNP calling and VCF format Laurent Falquet, Oct 12 SNP? What is this? A type of genetic variation, among others: Family of Single Nucleotide Aberrations Single Nucleotide Polymorphisms (SNPs) Single Nucleotide

More information

The Human Genome and its upcoming Dynamics

The Human Genome and its upcoming Dynamics The Human Genome and its upcoming Dynamics Matthias Platzer Genome Analysis Leibniz Institute for Age Research - Fritz-Lipmann Institute (FLI) Sequencing of the Human Genome Publications 2004 2001 2001

More information

MI615 Syllabus Illustrated Topics in Advanced Molecular Genetics Provisional Schedule Spring 2010: MN402 TR 9:30-10:50

MI615 Syllabus Illustrated Topics in Advanced Molecular Genetics Provisional Schedule Spring 2010: MN402 TR 9:30-10:50 MI615 Syllabus Illustrated Topics in Advanced Molecular Genetics Provisional Schedule Spring 2010: MN402 TR 9:30-10:50 DATE TITLE LECTURER Thu Jan 14 Introduction, Genomic low copy repeats Pierce Tue Jan

More information

Mate-pair library data improves genome assembly

Mate-pair library data improves genome assembly De Novo Sequencing on the Ion Torrent PGM APPLICATION NOTE Mate-pair library data improves genome assembly Highly accurate PGM data allows for de Novo Sequencing and Assembly For a draft assembly, generate

More information

Nanopore long read sequencing for detection of point mutations and structural variants

Nanopore long read sequencing for detection of point mutations and structural variants Nanopore long read sequencing for detection of point mutations and structural variants Viapath Genetics, Guy s Hospital, London Presented by: Kezia Brown Nanopore for long read sequencing Why do we need

More information

Supplementary Information: Genome sequencing of two closely related non-human primate animal models, cynomolgus and the Chinese rhesus macaque

Supplementary Information: Genome sequencing of two closely related non-human primate animal models, cynomolgus and the Chinese rhesus macaque Supplementary Information: Genome sequencing of two closely related non-human primate animal models, cynomolgus and the Chinese rhesus macaque -1 1 Genome sequencing and assembly... 3 1.1 Genome sequencing...

More information

DE NOVO WHOLE GENOME ASSEMBLY AND SEQUENCING OF THE SUPERB FAIRYWREN. (Malurus cyaneus) JOSHUA PEÑALBA LEO JOSEPH CRAIG MORITZ ANDREW COCKBURN

DE NOVO WHOLE GENOME ASSEMBLY AND SEQUENCING OF THE SUPERB FAIRYWREN. (Malurus cyaneus) JOSHUA PEÑALBA LEO JOSEPH CRAIG MORITZ ANDREW COCKBURN DE NOVO WHOLE GENOME ASSEMBLY AND SEQUENCING OF THE SUPERB FAIRYWREN (Malurus cyaneus) JOSHUA PEÑALBA LEO JOSEPH CRAIG MORITZ ANDREW COCKBURN ... 2014 2015 2016 2017 ... 2014 2015 2016 2017 Synthetic

More information

Mapping Next Generation Sequence Reads. Bingbing Yuan Dec. 2, 2010

Mapping Next Generation Sequence Reads. Bingbing Yuan Dec. 2, 2010 Mapping Next Generation Sequence Reads Bingbing Yuan Dec. 2, 2010 1 What happen if reads are not mapped properly? Some data won t be used, thus fewer reads would be aligned. Reads are mapped to the wrong

More information

Supplement to: The Genomic Sequence of the Chinese Hamster Ovary (CHO)-K1 cell line

Supplement to: The Genomic Sequence of the Chinese Hamster Ovary (CHO)-K1 cell line Supplement to: The Genomic Sequence of the Chinese Hamster Ovary (CHO)-K1 cell line Table of Contents SUPPLEMENTARY TEXT:... 2 FILTERING OF RAW READS PRIOR TO ASSEMBLY:... 2 COMPARATIVE ANALYSIS... 2 IMMUNOGENIC

More information

The Irys System. Rapid Genome Wide Mapping for de novo Assembly and Structural Variation Analysis. Jack Peart, Ph.D. Director of Sales EMEA

The Irys System. Rapid Genome Wide Mapping for de novo Assembly and Structural Variation Analysis. Jack Peart, Ph.D. Director of Sales EMEA The Irys System Rapid Genome Wide Mapping for de novo Assembly and Structural Variation Analysis Jack Peart, Ph.D. Director of Sales EMEA BioNano Snapshot Developed & Commercialized the Irys System for

More information

Sequence assembly. Jose Blanca COMAV institute bioinf.comav.upv.es

Sequence assembly. Jose Blanca COMAV institute bioinf.comav.upv.es Sequence assembly Jose Blanca COMAV institute bioinf.comav.upv.es Sequencing project Unknown sequence { experimental evidence result read 1 read 4 read 2 read 5 read 3 read 6 read 7 Computational requirements

More information

Supplementary Figures

Supplementary Figures Supplementary Figures A B Supplementary Figure 1. Examples of discrepancies in predicted and validated breakpoint coordinates. A) Most frequently, predicted breakpoints were shifted relative to those derived

More information

De novo whole genome assembly

De novo whole genome assembly De novo whole genome assembly Qi Sun Bioinformatics Facility Cornell University Sequencing platforms Short reads: o Illumina (150 bp, up to 300 bp) Long reads (>10kb): o PacBio SMRT; o Oxford Nanopore

More information

Direct determination of diploid genome sequences. Supplemental material: contents

Direct determination of diploid genome sequences. Supplemental material: contents Direct determination of diploid genome sequences Neil I. Weisenfeld, Vijay Kumar, Preyas Shah, Deanna M. Church, David B. Jaffe Supplemental material: contents Supplemental Note 1. Comparison of performance

More information

The Resurgence of Reference Quality Genome Sequence

The Resurgence of Reference Quality Genome Sequence The Resurgence of Reference Quality Genome Sequence Michael Schatz Jan 12, 2016 PAG IV @mike_schatz / #PAGIV Genomics Arsenal in the year 2015 Sample Preparation Sequencing Chromosome Mapping Summary &

More information

T G T A. artificial chimera

T G T A. artificial chimera False mutation detection caused by WA artifacts original genome A C A artifacts in amplified DNA C C A A false detection of local variants true ssnv mistaken as error artificial chimera false LOH early

More information

02 Agenda Item 03 Agenda Item

02 Agenda Item 03 Agenda Item 01 Agenda Item 02 Agenda Item 03 Agenda Item SOLiD 3 System: Applications Overview April 12th, 2010 Jennifer Stover Field Application Specialist - SOLiD Applications Workflow for SOLiD Application Application

More information

Genome Assembly Software for Different Technology Platforms. PacBio Canu Falcon. Illumina Soap Denovo Discovar Platinus MaSuRCA.

Genome Assembly Software for Different Technology Platforms. PacBio Canu Falcon. Illumina Soap Denovo Discovar Platinus MaSuRCA. Genome Assembly Software for Different Technology Platforms PacBio Canu Falcon 10x SuperNova Illumina Soap Denovo Discovar Platinus MaSuRCA Experimental design using Illumina Platform Estimate genome size:

More information

High-throughput genome scaffolding from in vivo DNA interaction frequency

High-throughput genome scaffolding from in vivo DNA interaction frequency correction notice Nat. Biotechnol. 31, 1143 1147 (213) High-throughput genome scaffolding from in vivo DNA interaction frequency Noam Kaplan & Job Dekker In the version of this supplementary file originally

More information

De novo meta-assembly of ultra-deep sequencing data

De novo meta-assembly of ultra-deep sequencing data De novo meta-assembly of ultra-deep sequencing data Hamid Mirebrahim 1, Timothy J. Close 2 and Stefano Lonardi 1 1 Department of Computer Science and Engineering 2 Department of Botany and Plant Sciences

More information

C3BI. VARIANTS CALLING November Pierre Lechat Stéphane Descorps-Declère

C3BI. VARIANTS CALLING November Pierre Lechat Stéphane Descorps-Declère C3BI VARIANTS CALLING November 2016 Pierre Lechat Stéphane Descorps-Declère General Workflow (GATK) software websites software bwa picard samtools GATK IGV tablet vcftools website http://bio-bwa.sourceforge.net/

More information

RNA sequencing with the MinION at Genoscope

RNA sequencing with the MinION at Genoscope RNA sequencing with the MinION at Genoscope Jean-Marc Aury jmaury@genoscope.cns.fr @J_M_Aury December 13, 2017 RNA workshop, Genoscope Overview Genoscope Overview MinION sequencing at Genoscope RNA-Seq

More information

Workflow of de novo assembly

Workflow of de novo assembly Workflow of de novo assembly Experimental Design Clean sequencing data (trim adapter and low quality sequences) Run assembly software for contiging and scaffolding Evaluation of assembly Several iterations:

More information

De novo assembly of human genomes with massively parallel short read sequencing. Mikk Eelmets Journal Club

De novo assembly of human genomes with massively parallel short read sequencing. Mikk Eelmets Journal Club De novo assembly of human genomes with massively parallel short read sequencing Mikk Eelmets Journal Club 06.04.2010 Problem DNA sequencing technologies: Sanger sequencing (500-1000 bp) Next-generation

More information

De novo whole genome assembly

De novo whole genome assembly De novo whole genome assembly Lecture 1 Qi Sun Minghui Wang Bioinformatics Facility Cornell University DNA Sequencing Platforms Illumina sequencing (100 to 300 bp reads) Overlapping reads ~180bp fragment

More information

Genome Projects. Part III. Assembly and sequencing of human genomes

Genome Projects. Part III. Assembly and sequencing of human genomes Genome Projects Part III Assembly and sequencing of human genomes All current genome sequencing strategies are clone-based. 1. ordered clone sequencing e.g., C. elegans well suited for repetitive sequences

More information

Comparison and Evaluation of Cotton SNPs Developed by Transcriptome, Genome Reduction on Restriction Site Conservation and RAD-based Sequencing

Comparison and Evaluation of Cotton SNPs Developed by Transcriptome, Genome Reduction on Restriction Site Conservation and RAD-based Sequencing Comparison and Evaluation of Cotton SNPs Developed by Transcriptome, Genome Reduction on Restriction Site Conservation and RAD-based Sequencing Hamid Ashrafi Amanda M. Hulse, Kevin Hoegenauer, Fei Wang,

More information

MinION, GridION, how does Nanopore technology meet the needs of our users?

MinION, GridION, how does Nanopore technology meet the needs of our users? MinION, GridION, how does Nanopore technology meet the needs of our users? Journée Long Reads GeT 28 Novembre 2017 Catherine Zanchetta & Maxime Manno get@genotoul.fr @get_genotoul 1 Wet Lab Nanopore technology

More information

Bioinformatics in next generation sequencing projects

Bioinformatics in next generation sequencing projects Bioinformatics in next generation sequencing projects Rickard Sandberg Assistant Professor Department of Cell and Molecular Biology Karolinska Institutet May 2013 Standard sequence library generation Illumina

More information

CREST maps somatic structural variation in cancer genomes with base-pair resolution

CREST maps somatic structural variation in cancer genomes with base-pair resolution Nature Methods CREST maps somatic structural variation in cancer genomes with base-pair resolution Jianmin Wang, Charles G Mullighan, John Easton, Stefan Roberts, Jing Ma, Michael C Rusch, Ken Chen, Christopher

More information

De Novo and Hybrid Assembly

De Novo and Hybrid Assembly On the PacBio RS Introduction The PacBio RS utilizes SMRT technology to generate both Continuous Long Read ( CLR ) and Circular Consensus Read ( CCS ) data. In this document, we describe sequencing the

More information

SNP calling. Jose Blanca COMAV institute bioinf.comav.upv.es

SNP calling. Jose Blanca COMAV institute bioinf.comav.upv.es SNP calling Jose Blanca COMAV institute bioinf.comav.upv.es SNP calling Genotype matrix Genotype matrix: Samples x SNPs SNPs and errors A change in a read may due to: Sample contamination Cloning or PCR

More information

N50 must die!? Genome assembly workshop, Santa Cruz, 3/15/11

N50 must die!? Genome assembly workshop, Santa Cruz, 3/15/11 N50 must die!? Genome assembly workshop, Santa Cruz, 3/15/11 twitter: @assemblathon web: assemblathon.org Should N50 die in its role as a frequently used measure of genome assembly quality? Are there other

More information

Compute- and Data-Intensive Analyses in Bioinformatics"

Compute- and Data-Intensive Analyses in Bioinformatics Compute- and Data-Intensive Analyses in Bioinformatics" Wayne Pfeiffer SDSC/UCSD August 8, 2012 Questions for today" How big is the flood of data from high-throughput DNA sequencers? What bioinformatics

More information

Bionano Access Software User Guide

Bionano Access Software User Guide Bionano Access Software User Guide Document Number: 30142 Document Revision: C For Research Use Only. Not for use in diagnostic procedures. Copyright 2018 Bionano Genomics, Inc. All Rights Reserved. Table

More information

The genome of Leishmania panamensis: insights into genomics of the L. (Viannia) subgenus.

The genome of Leishmania panamensis: insights into genomics of the L. (Viannia) subgenus. SUPPLEMENTARY INFORMATION The genome of Leishmania panamensis: insights into genomics of the L. (Viannia) subgenus. Alejandro Llanes, Carlos Mario Restrepo, Gina Del Vecchio, Franklin José Anguizola, Ricardo

More information

BIOINFORMATICS ORIGINAL PAPER

BIOINFORMATICS ORIGINAL PAPER BIOINFORMATICS ORIGINAL PAPER Vol. 27 no. 21 2011, pages 2957 2963 doi:10.1093/bioinformatics/btr507 Genome analysis Advance Access publication September 7, 2011 : fast length adjustment of short reads

More information

Functional genomics to improve wheat disease resistance. Dina Raats Postdoctoral Scientist, Krasileva Group

Functional genomics to improve wheat disease resistance. Dina Raats Postdoctoral Scientist, Krasileva Group Functional genomics to improve wheat disease resistance Dina Raats Postdoctoral Scientist, Krasileva Group Talk plan Goal: to contribute to the crop improvement by isolating YR resistance genes from cultivated

More information

Structural(varia+on!

Structural(varia+on! Structural(varia+on! Programming)for)Biology)! CSH,!October!2012!!!! Tomas!Marques7Bonet! ICREA!Research!Professor! InsAtut!de!Biologia!EvoluAva! Nucleotide Forms!of!geneAc!variaAon.!! Single)base8pair)changes)

More information

Experimental Design. Sequencing. Data Quality Control. Read mapping. Differential Expression analysis

Experimental Design. Sequencing. Data Quality Control. Read mapping. Differential Expression analysis -Seq Analysis Quality Control checks Reproducibility Reliability -seq vs Microarray Higher sensitivity and dynamic range Lower technical variation Available for all species Novel transcript identification

More information

GENOME ASSEMBLY FINAL PIPELINE AND RESULTS

GENOME ASSEMBLY FINAL PIPELINE AND RESULTS GENOME ASSEMBLY FINAL PIPELINE AND RESULTS Faction 1 Yanxi Chen Carl Dyson Sean Lucking Chris Monaco Shashwat Deepali Nagar Jessica Rowell Ankit Srivastava Camila Medrano Trochez Venna Wang Seyed Alireza

More information

The SMRTer Way: Single Genes to Complex Genomes

The SMRTer Way: Single Genes to Complex Genomes The SMRTer Way: Single Genes to Complex Genomes Ulf Gyllensten, Professor Department of Immunology, Genetics and Pathology, Science for Life Laboratory, Uppsala University, Uppsala, Sweden Topics National

More information

Revolutionize Genomics with SMRT Sequencing. Single Molecule, Real-Time Technology

Revolutionize Genomics with SMRT Sequencing. Single Molecule, Real-Time Technology Revolutionize Genomics with SMRT Sequencing Single Molecule, Real-Time Technology Resolve to Master Complexity Despite large investments in population studies, the heritability of the majority of Mendelian

More information

Toward a better understanding of plant genomes structure : Combining NGS, optical mapping technology and CRISPR-CATCH approach

Toward a better understanding of plant genomes structure : Combining NGS, optical mapping technology and CRISPR-CATCH approach Toward a better understanding of plant genomes structure : Combining NGS, optical mapping technology and CRISPR-CATCH approach Hélène BERGES Director of the Plant Genomic Center Global warming effects,

More information

De Novo Assembly of High-throughput Short Read Sequences

De Novo Assembly of High-throughput Short Read Sequences De Novo Assembly of High-throughput Short Read Sequences Chuming Chen Center for Bioinformatics and Computational Biology (CBCB) University of Delaware NECC Third Skate Genome Annotation Workshop May 23,

More information

Next Generation Genetics: Using deep sequencing to connect phenotype to genotype

Next Generation Genetics: Using deep sequencing to connect phenotype to genotype Next Generation Genetics: Using deep sequencing to connect phenotype to genotype http://1001genomes.org Korbinian Schneeberger Connecting Genotype and Phenotype Genotyping SNPs small Resequencing SVs*

More information

Long-read sequencing and de novo assembly of a Chinese genome

Long-read sequencing and de novo assembly of a Chinese genome Received 23 Nov 2015 Accepted 26 May 2016 Published 30 Jun 2016 DOI: 10.1038/ncomms12065 Long-read sequencing and de novo assembly of a Chinese genome OPEN Lingling Shi 1,2,3, *, Yunfei Guo 4, *, Chengliang

More information

Structural variation analysis using NGS sequencing

Structural variation analysis using NGS sequencing Structural variation analysis using NGS sequencing Victor Guryev NBIC NGS taskforce meeting April 15th, 2011 Scale of genomic variants Scale 1 bp 10 bp 100 bp 1 kb 10 kb 100 kb 1 Mb Variants SNPs Short

More information

Variant Detection in Next Generation Sequencing Data. John Osborne Sept 14, 2012

Variant Detection in Next Generation Sequencing Data. John Osborne Sept 14, 2012 + Variant Detection in Next Generation Sequencing Data John Osborne Sept 14, 2012 + Overview My Bias Talk slanted towards analyzing whole genomes using Illumina paired end reads with open source tools

More information

Supplementary Table 1. Summary of whole genome shotgun sequence used for genome assembly

Supplementary Table 1. Summary of whole genome shotgun sequence used for genome assembly Supplementary Tables Supplementary Table 1. Summary of whole genome shotgun sequence used for genome assembly Library Read length Raw data Filtered data insert size (bp) * Total Sequence depth Total Sequence

More information

Guidelines for Interpreting the Bionano Molecule Quality Report

Guidelines for Interpreting the Bionano Molecule Quality Report Guidelines for Interpreting the Bionano Molecule Quality Report Important: The guidelines described herein are based on internal experiences at Bionano Genomics and are provided as-is. The purpose of this

More information

Identifying copy number alterations and genotype with Control-FREEC

Identifying copy number alterations and genotype with Control-FREEC Identifying copy number alterations and genotype with Control-FREEC Valentina Boeva contact: freec@curie.fr Most approaches for predicting copy number alterations (CNAs) require you to have whole exomesequencing

More information

Sequence Assembly and Alignment. Jim Noonan Department of Genetics

Sequence Assembly and Alignment. Jim Noonan Department of Genetics Sequence Assembly and Alignment Jim Noonan Department of Genetics james.noonan@yale.edu www.yale.edu/noonanlab The assembly problem >>10 9 sequencing reads 36 bp - 1 kb 3 Gb Outline Basic concepts in genome

More information

Why can GBS be complicated? Tools for filtering, error correction and imputation.

Why can GBS be complicated? Tools for filtering, error correction and imputation. Why can GBS be complicated? Tools for filtering, error correction and imputation. Edward Buckler USDA-ARS Cornell University http://www.maizegenetics.net Many Organisms Are Diverse Humans are at the lower

More information

Sequencing and assembly of the sheep genome reference sequence

Sequencing and assembly of the sheep genome reference sequence Sequencing and assembly of the sheep genome reference sequence Yu Jiang Kunming Institute of Zoology, CAS, China the International Sheep Genomics Consortium (ISGC) ISGC Presentations Yu Jiang, Kunming

More information

John Hammond Targeted genomic enrichment and SMRT sequencing of immune-related gene complexes

John Hammond Targeted genomic enrichment and SMRT sequencing of immune-related gene complexes John Hammond (john.hammond@pirbright.ac.uk) Targeted genomic enrichment and SMRT sequencing of immune-related gene complexes Innate immune gene variation: germ-line encoded NK cell receptors and MHC class

More information

Title: High-quality genome assembly of channel catfish, Ictalurus punctatus

Title: High-quality genome assembly of channel catfish, Ictalurus punctatus Author s response to reviews Title: High-quality genome assembly of channel catfish, Ictalurus punctatus Authors: Qiong Shi (shiqiong@genomics.cn) Xiaohui Chen (xhchenffri@hotmail.com) Liqiang Zhong (lqzhongffri@hotmail.com)

More information

Validation of synthetic long reads for use in constructing variant graphs for dairy cattle breeding

Validation of synthetic long reads for use in constructing variant graphs for dairy cattle breeding Validation of synthetic long reads for use in constructing variant graphs for dairy cattle breeding M Keehan and C Couldrey Research and Development, Livestock Improvment Corporation, Hamilton, New Zealand

More information

Annotation Practice Activity [Based on materials from the GEP Summer 2010 Workshop] Special thanks to Chris Shaffer for document review Parts A-G

Annotation Practice Activity [Based on materials from the GEP Summer 2010 Workshop] Special thanks to Chris Shaffer for document review Parts A-G Annotation Practice Activity [Based on materials from the GEP Summer 2010 Workshop] Special thanks to Chris Shaffer for document review Parts A-G Introduction: A genome is the total genetic content of

More information

Lecture 23: Causes and Consequences of Linkage Disequilibrium. November 16, 2012

Lecture 23: Causes and Consequences of Linkage Disequilibrium. November 16, 2012 Lecture 23: Causes and Consequences of Linkage Disequilibrium November 16, 2012 Last Time Signatures of selection based on synonymous and nonsynonymous substitutions Multiple loci and independent segregation

More information

RADSeq Data Analysis. Through STACKS on Galaxy. Yvan Le Bras Anthony Bretaudeau Cyril Monjeaud Gildas Le Corguillé

RADSeq Data Analysis. Through STACKS on Galaxy. Yvan Le Bras Anthony Bretaudeau Cyril Monjeaud Gildas Le Corguillé RADSeq Data Analysis Through STACKS on Galaxy Yvan Le Bras Anthony Bretaudeau Cyril Monjeaud Gildas Le Corguillé RAD sequencing: next-generation tools for an old problem INTRODUCTION source: Karim Gharbi

More information

Wheat Genome Structural Annotation Using a Modular and Evidence-combined Annotation Pipeline

Wheat Genome Structural Annotation Using a Modular and Evidence-combined Annotation Pipeline Wheat Genome Structural Annotation Using a Modular and Evidence-combined Annotation Pipeline Xi Wang Bioinformatics Scientist Computational Life Science Page 1 Bayer 4:3 Template 2010 March 2016 17/01/2017

More information

NUCLEOTIDE RESOLUTION STRUCTURAL VARIATION DETECTION USING NEXT- GENERATION WHOLE GENOME RESEQUENCING

NUCLEOTIDE RESOLUTION STRUCTURAL VARIATION DETECTION USING NEXT- GENERATION WHOLE GENOME RESEQUENCING NUCLEOTIDE RESOLUTION STRUCTURAL VARIATION DETECTION USING NEXT- GENERATION WHOLE GENOME RESEQUENCING Ken Chen, Ph.D. kchen@genome.wustl.edu The Genome Center, Washington University in St. Louis The path

More information

Sequencing technologies. Jose Blanca COMAV institute bioinf.comav.upv.es

Sequencing technologies. Jose Blanca COMAV institute bioinf.comav.upv.es Sequencing technologies Jose Blanca COMAV institute bioinf.comav.upv.es Outline Sequencing technologies: Sanger 2nd generation sequencing: 3er generation sequencing: 454 Illumina SOLiD Ion Torrent PacBio

More information

Supplemental Figures Supplemental Figure 1.

Supplemental Figures Supplemental Figure 1. Supplemental Material: Annu. Rev. Genom. Hum. Genet. 2017. 18:321-356 https://doi.org/10.1146/annurev-genom-091416-035526 A Robust Framework for Microbial Archaeology Warinner et al. Supplemental Figures

More information

De novo whole genome assembly

De novo whole genome assembly De novo whole genome assembly Lecture 1 Qi Sun Bioinformatics Facility Cornell University Data generation Sequencing Platforms Short reads: Illumina Long reads: PacBio; Oxford Nanopore Contiging/Scaffolding

More information

Data Collection Guidelines

Data Collection Guidelines Data Collection Guidelines Document Number: 30173 Document Revision: B 30173 Rev B, Data Collection Guidelines Page 1 of 5 Legal Notice For Research Use Only. Not for use in diagnostic procedures. This

More information

Introduction to metagenome assembly. Bas E. Dutilh Metagenomic Methods for Microbial Ecologists, NIOO September 18 th 2014

Introduction to metagenome assembly. Bas E. Dutilh Metagenomic Methods for Microbial Ecologists, NIOO September 18 th 2014 Introduction to metagenome assembly Bas E. Dutilh Metagenomic Methods for Microbial Ecologists, NIOO September 18 th 2014 Sequencing specs* Method Read length Accuracy Million reads Time Cost per M 454

More information

Toward a better understanding of plant genomes structure: combining NGS and optical mapping technology to improve the sunflower assembly

Toward a better understanding of plant genomes structure: combining NGS and optical mapping technology to improve the sunflower assembly Toward a better understanding of plant genomes structure: combining NGS and optical mapping technology to improve the sunflower assembly Céline CHANTRY-DARMON 1 CNRGV The French Plant Genomic Center Created

More information

Sequencing technologies. Jose Blanca COMAV institute bioinf.comav.upv.es

Sequencing technologies. Jose Blanca COMAV institute bioinf.comav.upv.es Sequencing technologies Jose Blanca COMAV institute bioinf.comav.upv.es Outline Sequencing technologies: Sanger 2nd generation sequencing: 3er generation sequencing: 454 Illumina SOLiD Ion Torrent PacBio

More information

High quality reference genome of the domestic sheep (Ovis aries) Yu Jiang and Brian P. Dalrymple

High quality reference genome of the domestic sheep (Ovis aries) Yu Jiang and Brian P. Dalrymple High quality reference genome of the domestic sheep (Ovis aries) Yu Jiang and Brian P. Dalrymple CSIRO Livestock Industries on behalf of the International Sheep Genomics Consortium Outline of presentation

More information

Whole Human Genome Sequencing Report This is a technical summary report for PG DNA

Whole Human Genome Sequencing Report This is a technical summary report for PG DNA Whole Human Genome Sequencing Report This is a technical summary report for PG0002601-DNA Physician and Patient Information Physician name: Vinodh Naraynan Address: Suite 406 222 West Thomas Road Phoenix

More information

Sequencing, Assembling, and Correcting Draft Genomes Using Recombinant Populations

Sequencing, Assembling, and Correcting Draft Genomes Using Recombinant Populations INVESTIGATION Sequencing, Assembling, and Correcting Draft Genomes Using Recombinant Populations Matthew W. Hahn,*,,1 Simo V. Zhang, and Leonie C. Moyle* *Department of Biology and School of Informatics

More information

De Novo Repeats Construction: Methods and Applications

De Novo Repeats Construction: Methods and Applications University of Connecticut DigitalCommons@UConn Doctoral Dissertations University of Connecticut Graduate School 5-5-2017 De Novo Repeats Construction: Methods and Applications Chong Chu chong.chu@engr.uconn.edu

More information

Next Genera*on Sequencing II: Personal Genomics. Jim Noonan Department of Gene*cs

Next Genera*on Sequencing II: Personal Genomics. Jim Noonan Department of Gene*cs Next Genera*on Sequencing II: Personal Genomics Jim Noonan Department of Gene*cs Personal genome sequencing Iden*fying the gene*c basis of phenotypic diversity among humans Gene*c risk factors for disease

More information

Genomic resources. for non-model systems

Genomic resources. for non-model systems Genomic resources for non-model systems 1 Genomic resources Whole genome sequencing reference genome sequence comparisons across species identify signatures of natural selection population-level resequencing

More information

Data Basics. Josef K Vogt Slides by: Simon Rasmussen Next Generation Sequencing Analysis

Data Basics. Josef K Vogt Slides by: Simon Rasmussen Next Generation Sequencing Analysis Data Basics Josef K Vogt Slides by: Simon Rasmussen 2017 Generalized NGS analysis Sample prep & Sequencing Data size Main data reductive steps SNPs, genes, regions Application Assembly: Compare Raw Pre-

More information

SMAP File Format Specification Sheet

SMAP File Format Specification Sheet SMAP File Format Specification Sheet Document Number: 30041 Document Revision: E For Research Use Only. Not for use in diagnostic procedures. Copyright 2018 Bionano Genomics Inc. All Rights Reserved Table

More information

Haploid Assembly of Diploid Genomes

Haploid Assembly of Diploid Genomes Haploid Assembly of Diploid Genomes Challenges, Trials, Tribulations 13 October 2011 İnanç Birol Assembly By Short Sequencing IEEE InfoVis 2009 2 3 in Literature ~40 citations on tool comparisons ~20 citations

More information