EFI 2016 DEBATE: WHOLE GENE VERSUS EXONIC SEQUENCING. Dr Katy Latham Stance: Whole gene sequencing should be the norm for HLA typing

Similar documents
HLA and Next Generation Sequencing it s all about the Data

HLA-Typing Strategies

MHC Region. MHC expression: Class I: All nucleated cells and platelets Class II: Antigen presenting cells

SeCore SBT Sequence Based Typing

A*01:02, 68:02 B*15:10, 58:02 DRB1*03:01, 12:01

Instructions for Confirmation of HLA Typing (Form 2005 Revision 5)

Report. B2. dbmhc. Introduction

HLA Nomenclature. Steven GE Marsh. Anthony Nolan Research Institute London

Certificate of Analysis of the Holotype HLA 24/7 Configuration A1 & CE

A high resolution HLA class I and class II matching method for bone marrow donor selection

PCR-SSP primer mixes for KIR3DL3 non-synonymous polymorphism, and SNP linkage (L) reactions.

Gap Filling for a Human MHC Haplotype Sequence

The HLA Community s Success in Combining Clinical & Genomic Data

Nature Biotechnology: doi: /nbt Supplementary Figure 1. Number and length distributions of the inferred fosmids.

The HLA system. The Application of NGS to HLA Typing. Challenges in Data Interpretation

QIAGEN - Evolution from Tool to System Provider in MDx 3 Levels of QIAGEN Product Offering for Different Customer Types

In The Name of GOD. HLA Typing. By: M. Farzanehkhah NoAvaran MAYA Teb Co.

Whole genome sequencing in drug discovery research: a one fits all solution?

Dominant Sequences of Human Major Histocompatibility Complex Conserved Extended Haplotypes from HLA- DQA2 to DAXX

Introduction to RNA-Seq. David Wood Winter School in Mathematics and Computational Biology July 1, 2013

Methods for Gene Editing Measurement and Off-Target Discovery

Illumina TruSight HLA Sequencing Panel Automated on the Biomek FX P HLA SP Liquid Handler

TITLE: Subgrouping Chronic Fatigue Syndrome Patients By Genetic And Immune Profiling

Next Generation Sequencing. Target Enrichment

Fast and reproducible amplicon library preparation for HLA typing with the 454 sequencing system

Sequence Variations. Baxevanis and Ouellette, Chapter 7 - Sequence Polymorphisms. NCBI SNP Primer:

SNPs - GWAS - eqtls. Sebastian Schmeier

Jenny Gu, PhD Strategic Business Development Manager, PacBio

Evaluation of the Autogen FlexSTAR automated DNA extractor, for suitability in routine diagnostic use

HLA Typing from 1000 Genomes Whole Genome and Whole Exome Illumina Data

High-resolution DNA melting curve analysis to establish HLA genotypic identity

Admera Health RUO Services

Mutation entries in SMA databases Guidelines for national curators

Gene Identification in silico

Do Donors With Non-Deletional Blood Group O Allels Boost Anti-A and Anti-B Titers in Blood Group O Recipients?

Global response plan for pfhrp2/3 deletions

Surely Better Target Enrichment from Sample to Sequencer and Analysis

SNP GENOTYPING WITH iplex REAGENTS AND THE MASSARRAY SYSTEM

Genome annotation. Erwin Datema (2011) Sandra Smit (2012, 2013)

High Cross-Platform Genotyping Concordance of Axiom High-Density Microarrays and Eureka Low-Density Targeted NGS Assays

Ion S5 and Ion S5 XL Systems

Applications of PacBio Single Molecule, Real- Time (SMRT) DNA Sequencing

Standards for Accredited Laboratories

Target Enrichment Strategies for Next Generation Sequencing

The Agilent Technologies SureSelect Platform for Target Enrichment

This is a closed book, closed note exam. No calculators, phones or any electronic device are allowed.

SureSelect Clinical Research Exome V2 Definitive Answers Where it Matters Most

Rapid Cycle PCR, Real Time Analysis, and Hi-Res Melting

Single Nucleotide Variant Analysis. H3ABioNet May 14, 2014

Next Generation Sequencing and Bioinformatics Analysis Pipelines. Adam Ameur National Genomics Infrastructure SciLifeLab Uppsala

HLA Genotyping and Antibody Characterization Using the Luminex Multiplex Technology

resequencing storage SNP ncrna metagenomics private trio de novo exome ncrna RNA DNA bioinformatics RNA-seq comparative genomics

AmplideX PCR/CE TOMM40 Kit (RUO)

Sequencing technologies. Jose Blanca COMAV institute bioinf.comav.upv.es

Complementary Technologies for Precision Genetic Analysis

Targeted Sequencing in the NBS Laboratory

Emma Huxley. Principal Clinical Scientist West Midlands Regional Genetics Laboratory

AMERICAN SOCIETY FOR HISTOCOMPATIBILITY AND IMMUNOGENETICS PROFICIENCY TESTING PROGRAM OPERATIONS MANUAL

Structural variation. Marta Puig Institut de Biotecnologia i Biomedicina Universitat Autònoma de Barcelona

Cancer Genetics Solutions

5/18/2017. Genotypic, phenotypic or allelic frequencies each sum to 1. Changes in allele frequencies determine gene pool composition over generations

Release Notes for Genomes Processed Using Complete Genomics Software

RNA-Sequencing analysis

A Thesis: entitled. Improving a Method for High-Resolution HLA-Typing In Transplantation. Afnan Sami H Malebari

Immunogenicity of Stem Cells in Therapeutic Applications. MDPB-Registry meeting nov 25th Stem cell immunogenicity in therapeutic applications

TIGR THE INSTITUTE FOR GENOMIC RESEARCH

Supplementary Note: Detecting population structure in rare variant data

PAXgene Blood DNA PAXgene Blood DNA Tube (IVD) for clinical use PAXgene Blood DNA System (RUO) for research use. Explore more at

Personal Genomics Platform White Paper Last Updated November 15, Executive Summary

The SMRTer Way: Single Genes to Complex Genomes

CS273B: Deep Learning in Genomics and Biomedicine. Recitation 1 30/9/2016

Sequencing technologies. Jose Blanca COMAV institute bioinf.comav.upv.es

Understanding Genes & Mutations. John A Phillips III May 16, 2005

Genome Sequencing: Applications and Newborn Screening. June 2, 2014

CONTRACTING ORGANIZATION: The Leland Stanford Junior University, Menlo Park, CA 94025

Sequencing Theory. Brett E. Pickett, Ph.D. J. Craig Venter Institute

Newborn screening for spinal muscular atrophy

with drmid Dx for Illumina NGS systems

132 Grundlagen der Bioinformatik, SoSe 14, D. Huson, June 22, This exposition is based on the following source, which is recommended reading:

Grundlagen der Bioinformatik, SoSe 11, D. Huson, July 4, This exposition is based on the following source, which is recommended reading:

BRACAnalysis CDx Technical Information

BENG 183 Trey Ideker. Genome Assembly and Physical Mapping

Getting high-quality cytogenetic data is a SNP.

Applications and Uses. (adapted from Roche RealTime PCR Application Manual)

Haploid Assembly of Diploid Genomes

Comprehensive Views of Genetic Diversity with Single Molecule, Real-Time (SMRT) Sequencing

Custom TaqMan Assays DESIGN AND ORDERING GUIDE. For SNP Genotyping and Gene Expression Assays. Publication Number Revision G

19/02/2018. Transplant Webinar Series: Ep. 3 Donor Selection For Solid Organ Transplants Is A Virtual Crossmatch Better Than Real?

HLA-DR Typing by Polymerase Chain Reaction with Sequence- Specific Primers Compared to Serological typing

TaqMan SNP Genotyping

Dedicated to Molecular Diagnostics

Hands-On Four Investigating Inherited Diseases

Introduction to Next Generation Sequencing (NGS) Andrew Parrish Exeter, 2 nd November 2017

Mutations during meiosis and germ line division lead to genetic variation between individuals

Redefining NIPT as we know it.

Walk Through The Y Project. FTDNA Conference 2010 Houston TX Dipl.- Ing. Thomas Krahn

The first and only fully-integrated microarray instrument for hands-free array processing

Validation Study of FUJIFILM QuickGene System for Affymetrix GeneChip

Transcription:

EFI 2016 DEBATE: WHOLE GENE VERSUS EXONIC SEQUENCING Dr Katy Latham Stance: Whole gene sequencing should be the norm for HLA typing

Why we should be utilising whole gene sequencing Ambiguity generated by HLA The importance of sequencing the whole genes The technical considerations Examples- when full length is better Cost

The structure of HLA E1 E2 E3 E4 E5 E6 E7 E8 5 I3 I5 3 I1 I2 I4 I6 I7 200bp

HLA-B 2000

HLA-B 2015

GENERATION OF DIVERSITY HLA-A68 HLA-A2 HLA-A69 Holmes N, Parham P, EMBO J 1985 Nov;4(11):2849-54

AMBIGUITY IN THE INTRONS E1 E2 E3 E4 E5 E6 E7 E8 A*01:01:01:01 vs A*02:01:01:01 145 bps different A*03:01:01:01 vs A*11:01:01 40 bps different I1 I2 I4 I6 I7 200bp

VARIATION OUTSIDE OF EXONS HLA Loci Location Effect B*15:01:01:02N Intron 1 Deletion affecting the splice site for exon 2 A*24:02:01:02L Intron 2 Point mutation affecting the splice site for exon 3 A*02:01:01:02L Promotor Point mutation affecting the enhancer B region

IMPACT OF VARIATION ACROSS THE GENE Variation in introns and promotor regions can effect expression levels Expression levels can impact disease pathogenesis Expression levels can impact transplant outcome Could help with understanding the new links between pharmalogical agents and adverse effects

REFERENCE SEQUENCES- A HURDLE? Loci Number of full length genomic Number of alleles In IMGT/HLA A 221 3399 B 345 4242 C 304 2950 DRB1 40 1883 DRB3/4/5 7 127 DQB1 28 911 DQA1 45 69 DPB1 12 644 DPA1 45 69

IMGT/HLA Full length sequences are preferable though not essential; the minimum requirements are exons 2 and 3 for an HLA class I sequence and exon 2 for an HLA class II sequence Where a novel sequence differs only within an intron or other non-coding part of the gene, a full length sequence must be obtained, which covers all coding and non-coding regions In the absence of a full length genomic sequence from the most closely related allele, it may be required that this also be sequenced and submitted before a name can be assigned to the novel sequence

FULL LENGTH REFERENCE SEQUENCES Routine use of whole gene HLA sequencing is beginning to generate data Cereb N, Yang SY, OR14 Lang K, et al, P108 Lang K, et al, P118 ASHI, 2015 Significant amount of variation seen across the whole gene, even in loci and alleles considered to be common (Mayor N et al, PLoS One 2015 May 27;10(5)) Lack of reference sequences are cited as an issue when analysing data (Hosomichi K et al J Hum Genet 2015 Nov;60(11):665-73) Especially problematic when utilising the IMGT/HLA library as a scaffold for estimation or imputation

WHEN COMMON IS NOT SO COMMON Allele strings with common alleles HLA-C*07:01/06/18/28/52/103/104N/106/110/111/112/113/116/118/ 119/120/128/131/150Q/153/156/162/164N/166 SMRT result Sample-031: C*07:06 SMRT result Sample-062: C*07:18

EXON SEQUENCING APPROACH HLA-A gene 5 UTR 1 2 3 4 5 6 7 8 3 UTR HLA-A amplicon ~35 kb Amplify the whole gene Typically sequence the exons during cycle sequencing Phasing requires additional rounds of sequencing Total no bases: >200,000,000 CDS accuracy: 100%

WHOLE GENE APPROACH HLA-A gene 5 UTR 1 2 3 4 5 6 7 8 3 UTR HLA-A amplicon ~35 kb

HLA-DPB1

HLA-DPB1 DPB1*04:01 DPB1*105:01 DPB1*04:02 DPB1*126:01 Errors have been identified when using a phased approach spanning exons 2 and 3 compared to exons in isolation

CHALLENGES OF PHASING SNPS WITH SHORT-READ ASSEMBLY SNP rich region SNP poor region SNP rich region Allele 1??? Unknown Allele 2???

REFERENCE DATA R ea d D ep t h IMGT/HLA reference sequence High confidence in consensus sequencing allele calling

REFERENCE DATA IMGT/HLA reference sequence R ea d D ep t h

REFERENCE DATA IMGT/HLA reference sequence R ea d D ep t h Lower read depth, but compared to a reference sequence Higher read depth, but no reference

REFERENCE DATA IMGT/HLA reference sequence R ea d D ep t h Lower read depth, but compared to a reference sequence Higher read depth, but no reference Errors can be masked by false confidence due to limited reference sequence

HLA-DQB1 E3 E4 E6 3 I4 HLA - DQB1*02:03, *03:02 SBT of exons 2 and 3 There is no sequence for exon 3 in IMGT/HLA for the DQB1*02:03

HLA-DQB1 E3 E4 E6 HLA - DQB1* DQB1*03:XX, *02:01 3 I4 SMRT sequencing allowing phase across exon 2, intron 3 and exon 3 The novel allele DQB1*03:03:02 438 G>T has been confirmed

HLA-DQB1 Exon 2 Exon 3 02:03 A? 03:02 C G 02:01 C G 03:xx A T Lack of a reference for HLA-DQB1*02:03 exon 3 Lack of phasing to distinguish the cis/ trans ambiguity False confidence in the exon 2 result

HLA-DPB1- A NOVEL CASE DPB1*04:01:01:01 DPB1*105:xx Sample sequenced using SMRT sequencing A novel mutation (443A>G) within exon 3

HLA-DPB1- A NOVEL CASE DPB1*23:01:01 DPB1*51:01 SBT of exons 2 and 3 DPB1*23:01:01, *51:01 No reference in IMGT/HLA for exon 3 of HLA-DPB1*51:01

HLA-DPB1- A NOVEL CASE DPB1*04:01:01:01 DPB1*105xx An approach sequencing through the exons identified a different HLA type

HLA-DPB1- A NOVEL CASE Exon 2 Exon 3 04:01:01:01 CCC A 105:01 TAA A 105:XX TAA G 23:01:01 TCC A 51:01 CAA? Lack of a reference for HLA-DPB1*51:01 exon 3 Lack of phasing to distinguish the cis/ trans ambiguity False confidence in the exon 2 result

REDUCE THE TIME TO TRANSPLANT The availability of unambiguous allelic level HLA typing data at the donor selection stage removes the need for additional typing to confirm the HLA match between donor and patient and can reduce the matching process from six to an estimated three weeks * Various studies have confirmed that reducing the time to transplant has a beneficial effect on HSCT outcome Clinical studies are required to demonstrate the impact N u m b e r Min 0 Med 6 Max 215 Time *Source: BBMR, UK, press release

THE TIME TO ACHIEVE A RESULT DNA extraction Library Prep Sequencing step Analysis and reporting Day 1 6 DNA extraction Core sequencing Group specific Analysis sequencing Analysis Reporting Influenced by Cost Batch size, throughput of Laboratory Urgency Automation versus manual Merlo D et al, P104

THE IMPACT Homozygous loci are not homozygous HLA-C*07:RHXS, *07:RXHT HLA-C*07:02/27/50/66/74/100/102/105/114/117/121Q/126/130/135/136/137/143/144/145/146/149/152N/ 154/157/158/159/160/167/169/171/174/175/178/185/187/193/194/195/198N/202/208/209/211/213/217/220, HLA-C*07:02/50/66/74/100/102/105/114/117/121Q/126/130/135/136/137/143/144/145/146/149/152N/ 154/157/158/159/160/167/169/171/174/175/178/185/187/193/194/195/198N/202/208/209/211/213/217/220 SMRT sequencing result: C*07:02:01:01, *07:02:01:03

THE IMPACT Petersdorf et al demonstrated that MHC haplotype match between patient and donor influenced GvHD Common associations revealed by whole gene sequencing utilising SMRT technology HLA-B*07:02:01 HLA-C*07:02:01:03 HLA-B*40:01:02 HLA-C*03:04:01:02 Petersdorf E et al, PLoS Medicine, 2007

Wetterstrand K, wwwgenomegov/sequencingcosts accessed 7/05/2016

THE ARGUMENTS AGAINST? Solid organ transplantation Verification samples Use an exonic approach in conjunction with a whole gene approach?

SUMMARY With the advent of NGS the ability to sequence the whole HLA gene is technically achievable in a cost effective way The ability to produce a HLA type unambiguously, in phase, reduces the need for secondary testing The whole gene can be sequenced in a time frame that is clinically relevant HLA is entering the era of personalised medicine

Thanks for coming

Registry Clinical Submission Exon Whole Gene Accuracy 1 4 5 Good Good Turn around time 1 3 1 3+ days 3+ days Resolution 2 3 5 PBD + Good Ease of use 1 3 1 Easy Depends CE IVD 0 3 0 No Depends Costs of reagents per batch 1 3 1 >1000 >500 Robustness 3 2 1 High Medium Costs of instrumentation 1 2 1 Medium Depends Costs of reagents per sample 5 1 1 Low High Throughput 5 1 1 High Medium