Sequence variation in the short tandem repeat system SE33 discovered by next generation sequencing

Similar documents
NEW TECHNOLOGIES - AN INTRODUCTION

Our motivation for a NGS/MPS SNP panel

Implementation of MPS into a Specialist Casework Laboratory. David Ballard

Genetic Characteristics of 22 Y-STR loci in Koreans

FORENSIC GENETICS. DNA in the cell FORENSIC GENETICS PERSONAL IDENTIFICATION KINSHIP ANALYSIS FORENSIC GENETICS. Sources of biological evidence

Genetic Identity. Steve Harris SPASH - Biotechnology

Report of Analyzing Short Tandem Repeats for Parentage Testing

First Look: Applied Biosystems NGM Detect PCR Amplification Kit

Genetic characteristics of 22 Y-STRs in Koreans:

Y chromosomal STRs in forensics

Performance Characteristics drmid Dx for Illumina NGS systems

AmpF STR NGM PCR Amplification Kit - Overview

What is DNA? Deoxyribonucleic Acid The inherited genetic material that makes us what we are

DNA Typing Strategy for the Identification of Old Skeletal Remains

Pharmacogenetics: A SNPshot of the Future. Ani Khondkaryan Genomics, Bioinformatics, and Medicine Spring 2001

Data Quality Worth Sharing

Uniparental disomy (UPD) analysis of chromosome 15

Shaare Zedek Medical Center (SZMC) Gaucher Clinic. Peripheral blood samples were collected from each

Enhanced Resolution and Statistical Power Through SNP Distributions Within the Short Tandem Repeats

The Polymerase Chain Reaction. Chapter 6: Background

AmpFlSTR Identifiler PCR Amplification Kit

The Real CSI: Using DNA to Identify Criminals and Missing Persons

Evaluation of NGS technology for identification, ancestry prediction and phenotypic characters prediction with the Ion PGM System

Microsatellite markers

Overview of the Ibis STR Assay

Mutations during meiosis and germ line division lead to genetic variation between individuals

DNA Typing for the Identification of Korean War Victims

Incorporating Molecular ID Technology. Accel-NGS 2S MID Indexing Kits

SUPPLEMENTARY INFORMATION

Transcriptomics analysis with RNA seq: an overview Frederik Coppens

GenPlex HID Training Class I

Major histocompatibility complex class I haplotype diversity in Chinese rhesus macaques

GENOTYPING-BY-SEQUENCING USING CUSTOM ION AMPLISEQ TECHNOLOGY AS A TOOL FOR GENOMIC SELECTION IN ATLANTIC SALMON

Performance of the Newly Developed Non-Invasive Prenatal Multi- Gene Sequencing Screen

dbcamplicons pipeline Amplicons

DNBseq TM SERVICE OVERVIEW Plant and Animal Whole Genome Re-Sequencing

Variant calling workflow for the Oncomine Comprehensive Assay using Ion Reporter Software v4.4

Product Catalog 2012 AmplGen Ltd.

CAPTURE-BASED APPROACH FOR COMPREHENSIVE DETECTION OF IMPORTANT ALTERATIONS

Maximizing your NGS sequencing with IDT. Adam Chernick, PhD Field Applications Manager, Functional Genomics

Overview. Background ~30 min. Lab activity ~50 min. DNA profiling Polymerase Chain Reaction (PCR) Gel Electrophoresis PCR

Welcome to the NGS webinar series

Targeted Sequencing in the NBS Laboratory

DNA concentration and purity were initially measured by NanoDrop 2000 and verified on Qubit 2.0 Fluorometer.

Next Generation Sequencing. Jeroen Van Houdt - Leuven 13/10/2017

Single Nucleotide Variant Analysis. H3ABioNet May 14, 2014

STR DNA Data Analysis. Genetic Analysis

Short Tandam Repeat (D1S58) Detection Kit (for Academic Instructions) Product # 54600

Certificate of Analysis of the Holotype HLA 24/7 Configuration A1 & CE

DNA Analysis Students will learn:

Expand your forensics workflow with the Precision ID NGS System for human identification with mtdna, SNP, and STR analysis

Development and characterization of a high throughput targeted genotypingby-sequencing solution for agricultural genetic applications

dbcamplicons pipeline Amplicons

The injection of the 28 and 29 PCR cycle data was performed at 3kV for 10 and 5 seconds.

LightScanner Hi-Res Melting Comparison of Six Master Mixes for Scanning and Small Amplicon and LunaProbes Genotyping

!"!"!"!"!"!"!"!"!"!"!"!"!"!"!"!"!"!"!"!"!"!"!"!"!"!"!"

STR Profiling Matching Criteria: Establishment and Importance of a Cell Line Database

Implementing ACMG guidelines on sequence variant interpretation: software-assisted variant curation and filtering

A multiplex panel of short-amplicon insertion-deletion DNA polymorphisms for forensic analysis

Expand your forensics workflow with the Precision ID NGS System for human identification

THE WHOLE GENE: APPLICATION AND IMPLEMENTATION OF WHOLE GENE SEQUENCING IN THE CLINICAL LABORATORY.

The Polymerase Chain Reaction. Chapter 6: Background

Forensic Applications of CE and MPS

The Iso-Seq Method: Transcriptome Sequencing Using Long Reads

Single Cell Genomics

Outline General NGS background and terms 11/14/2016 CONFLICT OF INTEREST. HLA region targeted enrichment. NGS library preparation methodologies

KAPA hgdna QUANTIFICATION AND QC KIT:

Advances and New Technologies in Forensic DNA

Get to Know Your DNA. Every Single Fragment.

Quality assurance in NGS (diagnostics)

LATE-PCR. Linear-After-The-Exponential

Sequence structure of 12 novel Y chromosome microsatellites and PCR amplification strategies

VALIDATION OF PROMEGA S POWERPLEX 16 FOR PATERNITY TESTING

Incorporating SeqStudio Genetic Analyzer and Sanger sequencing into genome editing workflows

Personal Genomics Platform White Paper Last Updated November 15, Executive Summary

Twisted Intercalating Nucleic Acid (TINA) a novel group of molecules with improved performance in PCR and qpcr applications

Next generation sequencing techniques" Toma Tebaldi Centre for Integrative Biology University of Trento

HLA-Typing Strategies

Short Tandem Repeat (STR) Analysis

Introducing NGS and Holotype HLA EAP User Experiences

Targeted Sequencing of Leukemia-Associated Genes Using 454 Sequencing Systems

Next-Generation Sequencing. Technologies

..C C C T C A T T C A T T C A T T C A T T C A..

LightScanner Hi-Res Melting Comparison of Six Master Mixes for Scanning and Small Amplicon and LunaProbes Genotyping

Implementing ACMG guidelines on sequence variant interpretation: software-assisted variant curation and filtering

Next Generation Sequencing Lecture Saarbrücken, 19. March Sequencing Platforms

Latest Tools for Human Identification

An Efficient Method of Extracting DNA from Bone Remains from the Spanish Civil War A Comparative Study of Two Methods: PrepFiler BTA and DNAzol.

Overview of techniques in Molecular Diagnostics ELKE BOONE 23 MAART 2017

Axiom mydesign Custom Array design guide for human genotyping applications

Generating Forensic DNA Profiles

TWO INTERESTING CASES OF MOLECULAR DIAGNOSIS FOR HHT:

QIAGEN Whole Genome Amplification REPLI-g Eliminating Sample Limitations, Potential Use for Reference Material

Alissa Interpret The next evolution of Cartagenia Bench

High Cross-Platform Genotyping Concordance of Axiom High-Density Microarrays and Eureka Low-Density Targeted NGS Assays

Next Generation Oncology Sequencing in your Laboratory. Built by pioneers in cancer genomics and liquid biopsy approaches PROGENEUS

Chapter 5. Structural Genomics

Lab methods: Exome / Genome. Ewart de Bruijn

2. Pyrosequencing Assay Design

Jennifer D. Churchill, Maiko Takahashi, Christina Strobl, Dixie Peters, Christina Capt, Walther Parson, Bruce Budowle

Transcription:

Sequence variation in the short tandem repeat system SE33 discovered by next generation sequencing Eszter Rockenbauer, MSc, PhD and Line Møller, MSc Forensic Geneticist Section of Forensic Genetics Department of Forensic Medicine Faculty of Health and Medical Sciences University of Copenhagen Denmark

Aims Sequence SE33 with an NGS assay Use Roche Junior Platform Search for sequence variation Identify new allele variants Find population specific alleles Test usefulness of higher allele diversity in paternity cases Dias 2

SE33 Core locus in the European Standard Set of STRs Included in several commercial STR typing kits o E.g. NGM Select, ESI17 Complex and highly variable sequence Consisting of four nucleotide AAAG units interrupted by AA and AG units Wide span of allele length variation (3 to 39 four repeat units) Dias 3

Challenges with SE33 No commercially available NGS kits to sequence SE33 Several polya stretches difficult to sequence with presently used NGS techniques Alleles can be up to 343 bp most NGS machines produce 300 bp reads Dias 4

GS Junior 454 Pyrosequencing technology 200 cycles of nucleotide addition (A, T, C and G in fixed order) Read length 800 bases Accuracy of 99% for reads of 400bp Several successful STR sequencing studies* *Fordyce et al. 2011, Van Neste et al. 2012, Dalsgaard et al. 2014; Rockenbauer et al. 2014; Gelardi et al. 2014; Scheible et al. 2014 Dias 5

Samples Samples 203 samples from Danes(144), Somalis(81) and Greenlanders(58) 188 from unrelated individuals 5 trios with an unsolved genetic inconsistency in SE33 between parent and child Control ISO17025 accredited STR typing (AmpFLSTR NGMSelect PCR Amplification Kit) Dias 6

STR sequencing Amplicon sequencing with Multiplex IDentifiers (MIDs) in the PCR primer sequence Quantification and pooling amplicons into a library empcr and sequencing Dias 7

Data analysis Generally low sequencing quality Standard filters were switched off to get more reads in output file Filters_off_amplicons -pipeline*: o Sorting by MIDs o Filtering by flanking sequences Both flanking sequence must be present (optional) Alignment in BioEdit (Ibis Biosciences) Primer Flanking Repeat Flanking Primer *in-house Python-based algorithm Dias 8

Results of allele calling Total sample coverage 4,643 (before) and 318 (after filtering) 10-20% forward 80-90% reverse strand reads 57 samples were sequenced twice Low-quality samples were reanalysed with only one flanking sequence 295 correct allele calls (from 394 expected alleles) ~20% called with an insecurity of +/-1bp Most correctly called alleles had 30 repeats (~300bp incl. flanks) Dias 9

Results of allele calling Uncertain calls: o Only few reads for long alleles (>30 repeats) o Uncertain number of A s ( T in reverse strand read) Undetermined alleles: o lack of sequencing reads o inconsistent reads o reads deviating from PCR-CE STR repeat sequence Dias 10

Results of allele calling High increase in allele variation 27 novel alleles not reported in STRbase Raised the power of discrimination by 282% compared to CE Identified the mutated allele in 3 out of 5 family trios Dias 11

Novel alleles Bold black bases are missing in some reads. Bold red bases are appear duplicated in some reads. Dias 12

Frequency 12 12.2 13 13.2 14 15 16 17 17.3 18 19 19.2 20 20.2 21 21.2 22 22.2 23 23.2 24.2 25.2 26.2 27.2 28.2 29.2 30.2 31.2 Allele distribution 0,5 0,45 CE Allele Distribution 0,4 0,35 0,3 Number of called alleles: 0,25 0,2 0,15 0,1 DK SOM GRL 144 81 58 0,05 0 Repeat number PCR-CE /NGS Danes 12 43 27 61 Somali 1 10 14 14 15 28 Greenlanders 0 4 9 9 8 7 9 15 Dias 13

Mutation studies Rolle NGS Allele sequence No usable sequence information obtained missing CE length (31.2) No usable sequence information obtained missing (20) M: mother, F: father, C: child. ( ) Alleles with no sequence information were not sequenced correctly in comparison to CE results and were left out of analysis. Dias 14

Conclusions SE33 is a promising candidate locus for NGS based genotyping Large sequence variation is observed Difficulties owing to poly-a stretches and long alleles The Roche Junior platform does not perform optimally Similar difficulties expected on other available NGS platforms Alternative sequencing technique/s and platform/s are needed Dias 15

Acknowledgements Professor and Director of the department: Niels Morling Forensic geneticists: Claus Børsting Bioinformatics: Carina G. Jønck Thank you for your attention Lab: Nadia Jochumsen