Microbiome Analysis in Kawasaki Disease. Kristine M. Wylie, Susan C. Baker, George M. Weinstock, Stanford T. Shulman, and Anne H.

Similar documents
Using New ThiNGS on Small Things. Shane Byrne

Introducing QIAseq. Accelerate your NGS performance through Sample to Insight solutions. Sample to Insight

Applications of Next Generation Sequencing in Metagenomics Studies

Impact of gdna Integrity on the Outcome of DNA Methylation Studies

Introductie en Toepassingen van Next-Generation Sequencing in de Klinische Virologie. Sander van Boheemen Medical Microbiology

Introduction to metagenome assembly. Bas E. Dutilh Metagenomic Methods for Microbial Ecologists, NIOO September 18 th 2014

Carl Woese. Used 16S rrna to developed a method to Identify any bacterium, and discovered a novel domain of life

QIAGEN - Evolution from Tool to System Provider in MDx 3 Levels of QIAGEN Product Offering for Different Customer Types

Understanding the science and technology of whole genome sequencing

Multiple choice questions (numbers in brackets indicate the number of correct answers)

Carl Woese. Used 16S rrna to develop a method to Identify any bacterium, and discovered a novel domain of life

TECH NOTE Stranded NGS libraries from FFPE samples

Reading Lecture 8: Lecture 9: Lecture 8. DNA Libraries. Definition Types Construction

TREE CODE PRODUCT BROCHURE

CHAPTER 20 DNA TECHNOLOGY AND GENOMICS. Section A: DNA Cloning

FRAUNHOFER INSTITUTE FOR INTERFACIAL ENGINEERING AND BIOTECHNOLOGY IGB NEXT-GENERATION SEQUENCING. From wet lab to dry lab complete sample analysis

Genome Sequence Assembly

ZIKA Virus (ZIKAV) Real Time RT-PCR Kit

Infectious Disease Omics

Parts of a standard FastQC report

Experimental Design. Sequencing. Data Quality Control. Read mapping. Differential Expression analysis

Virus- infectious particle consisting of nucleic acid packaged in a protein coat.

CBC Data Therapy. Metatranscriptomics Discussion

Exome Sequencing Exome sequencing is a technique that is used to examine all of the protein-coding regions of the genome.

TECH NOTE Pushing the Limit: A Complete Solution for Generating Stranded RNA Seq Libraries from Picogram Inputs of Total Mammalian RNA

NextGen Sequencing and Target Enrichment

Next G eneration Generation Microbial Microbial Genomics : The H uman Human Microbiome P roject Project George Weinstock

FUTURE PROSPECTS IN MOLECULAR INFECTIOUS DISEASES DIAGNOSIS

QPCR ASSAYS FOR MIRNA EXPRESSION PROFILING

Shantelle Claassen-Weitz Division of Medical Microbiology Department of Pathology

Genetic tests are available for hundreds of disorders. DNA testing can pinpoint the exact genetic basis of a disorder.

Single Cell Genomics

Incorporating Molecular ID Technology. Accel-NGS 2S MID Indexing Kits

3.1.4 DNA Microarray Technology

SUPPLEMENTARY MATERIAL AND METHODS

FY10 INVITATION PROPOSAL SUMMARY PAGE

Changing Concept of FMD diagnostics: from Central to Local. Aniket Sanyal Project Directorate on FMD Mukteswar, India

Ion S5 and Ion S5 XL Systems

Molecular Biology: DNA sequencing

DNA concentration and purity were initially measured by NanoDrop 2000 and verified on Qubit 2.0 Fluorometer.

Cell Free DNA and Other Immune Monitoring Techniques in Lung Transplantation Are We There Yet?

Experimental Design Microbial Sequencing

Microbiome: Metagenomics 4/4/2018

Motivation From Protein to Gene

Applicazioni biotecnologiche

Methods that do not require growth in laboratory PCR

G E N OM I C S S E RV I C ES

Outline. Array platform considerations: Comparison between the technologies available in microarrays

TANBead Reagent Kits. Total Solution Provider for Nucleic Acid Extraction FACILITATING YOUR RESEARCH WITH MAGNETIC BEADS

Course outlines for senior Microbiology and Virology units of study within the Bachelor of Medical Science Last updated 18 November 2006

Introduction to Bioinformatics

Bi 8 Lecture 5. Ellen Rothenberg 19 January 2016

Current molecular diagnostic system

RNA-Sequencing analysis

resequencing storage SNP ncrna metagenomics private trio de novo exome ncrna RNA DNA bioinformatics RNA-seq comparative genomics

Pharmacogenetics: A SNPshot of the Future. Ani Khondkaryan Genomics, Bioinformatics, and Medicine Spring 2001

Ion S5 and Ion S5 XL Systems

Metagenomics of the Human Intestinal Tract

Gene Expression Profiling and Validation Using Agilent SurePrint G3 Gene Expression Arrays

Polymerase chain reaction: advantages and drawbacks

More than Meets the Eye. Next Generation RNA Extraction

Course summary. Today. PCR Polymerase chain reaction. Obtaining molecular data. Sequencing. DNA sequencing. Genome Projects.

CHAPTER 24. Immunology

Authors: Vivek Sharma and Ram Kunwar

QIAGEN s NGS Solutions for Biomarkers NGS & Bioinformatics team QIAGEN (Suzhou) Translational Medicine Co.,Ltd

SmartChip Real-Time PCR System

Contact us for more information and a quotation

Day 3. Examine gels from PCR. Learn about more molecular methods in microbial ecology

Tech Note. Using the ncounter Analysis System with FFPE Samples for Gene Expression Analysis. ncounter Gene Expression. Molecules That Count

Building a deep immuno-oncology portfolio in less than a year. David Johnson, PhD PEGS Spring 2018

UltraFast Molecular Diagnostic System

FUNCTIONAL BIOINFORMATICS

West Nile Virus Real Time RT-PCR Kit

Experimental Design. Dr. Matthew L. Settles. Genome Center University of California, Davis

Finding Genes with Genomics Technologies

Introduction to the MiSeq

Development of quantitative targeted RNA-seq methodology for use in differential gene expression

INTELLIGENT ANTIBODY DISCOVERY FROM HUMANS AND OTHER ANIMALS. Guy Cavet

QIAseq mirna Library Kit The next-generation in mirna sequencing products

APPLICATION NOTE. MagMAX mirvana Total RNA Isolation Kit TaqMan Advanced mirna Assays

Sample to Insight. Dr. Bhagyashree S. Birla NGS Field Application Scientist

A Novel Pegivirus in Indian River Bottlenose Dolphin

Genomic resources. for non-model systems

DNA/RNA MICROARRAYS NOTE: USE THIS KIT WITHIN 6 MONTHS OF RECEIPT.

Sample Flexibility for Gene Expression Analysis with the ncounter Analysis System

GENETICS EXAM 3 FALL a) is a technique that allows you to separate nucleic acids (DNA or RNA) by size.

GENETICS - CLUTCH CH.15 GENOMES AND GENOMICS.

Infectious Diseases Institute Faculty Hires See below for research interests, areas of expertise and contact information

De novo assembly in RNA-seq analysis.

Novel methods for RNA and DNA- Seq analysis using SMART Technology. Andrew Farmer, D. Phil. Vice President, R&D Clontech Laboratories, Inc.

FFPE in your NGS Study

Sequencing technologies. Jose Blanca COMAV institute bioinf.comav.upv.es

Comparative genomics of clinical isolates of Pseudomonas fluorescens, including the discovery of a novel disease-associated subclade.

Production and sales demonstration of STH-PAS, genetic testing kit in Vietnam

PLNT2530 (2018) Unit 6b Sequence Libraries

Fundamentals of Next-Generation Sequencing: Technologies and Applications

Introduction to Microbial Sequencing

Growing Needs for Practical Molecular Diagnostics: Indonesia s Preparedness for Current Trend

Detection of Rare Variants in Degraded FFPE Samples Using the HaloPlex Target Enrichment System

GENOMERA TM MRSA/SA PRODUCTS A NEW ERA IN DIRECT MRSA DNA TESTING. Bringing genetic MRSA results in less than 1 hour!

Transcription:

Microbiome Analysis in Kawasaki Disease Kristine M. Wylie, Susan C. Baker, George M. Weinstock, Stanford T. Shulman, and Anne H. Rowley

Disclosures Kristine M. Wylie Microbiome Analysis in Kawasaki Disease No relevant financial relationships exist

Evidence for a viral etiology for Kawasaki Disease Epidemiologic Epidemics with geographic spread Seasonal predominance in non-temperate climates Clinical History of preceding respiratory symptoms Failure of antibiotic therapy Ultrastructural Intracytoplasmic inclusion bodies and virus-like particles detected in the bronchial epithelium of KD tissues but not controls (oligoclonal IgA) Gene expression Up-regulated cytotoxic T-cell and type I IFN pathways in coronary arteries from KD patients compared with controls (Presented at this meeting by Anne Rowley)

High-throughput metagenomic shotgun sequencing (MSS) A powerful approach for identifying pathogens associated with diseases of unknown etiologies No a priori knowledge of the pathogen required No culturing needed Insensitive to minor sequence variation that can confound molecular tests

MSS overview Sample collection and processing Frozen and FFPE tissues used Nucleic acid extracted (can use DNA/RNA/TNA) We focused on RNA due to previous data that suggested the etiologic agent is an RNA virus Library construction and sequencing cdna made from RNA Sheared to uniform size Sequencing library adapters ligated to ends Data generated on Illumina HiSeq platform 2x100 paired-end sequencing: 100 bp sequenced from each end of the fragment Sequence analysis Described on next slide Interpretation and additional follow up

MSS analysis overview Initial processing 100 base sequences Trim low quality, mask low complexity, etc. Known sequences Related to known sequences Unclassified sequences Assembly of overlapping sequences Alignment against reference genomes Alignments o Nucleotide o Translated amino acid o HMM Short sequence reads Contig (consensus sequence) Classification: human, bacterial, fungal, viral, other Unclassified sequences

Identify unclassified sequences that are shared among samples

Application of MSS to identifying viral pathogens in diseases of unknown etiologies Subject was 20-month-old boy with fever and petechial rash, no other pathogenic agents detected Plasma sequencing yielded sequences with remote similarity to astrovirus MLB-1 While this was being investigated, we learned astrovirus MLB-2 had been discovered and was being characterized by Dave Wang s group at WU. We obtained the complete genome sequence of MLB-2 from them. 238 reads were identified with > 80% identity to MLB-2, confirmed by specific RT-PCR MLB-2 was not found in 188 other samples screened by consensus astrovirus PCR First detection of an astrovirus in blood unexpected pathogen Holtz, L. R. et al. Astrovirus MLB2 viremia in febrile child. Emerging Infect. Dis. 17, 2050 2052 (2011). Wylie, K. M., et al. Sequence analysis of the human virome in febrile and afebrile children. PLoS ONE 7, e27735 (2012).

Application to Kawasaki Disease samples sequenced Cases number of sample Controls number of sample Tissue type Coronary artery (CA) 7 7 FFPE Serum 11 10 Frozen Throat 10 4 Frozen Lung 8 0 FFPE Heart 3 1 FFPE Spleen 1 0 Frozen 4.9 billion sequence reads generated Minimum 32 million Maximum 500 million Non-human sequences ranged from <1% (serum) to 99% (throat)

Virus discovery in sequence data Color Key 0 40 80 Value Case Control Metastats does not find any differentially represented viral candidates Human viruses do not associate with KD Reagent or environmental contaminants Parvo-like hybrid (Naccache, et al, PNAS) Many plant viruses Also likely contaminants Human.rhinovirus Rotavirus.A Anellovirus Human.parainfluenza.virus.1 Human.parechovirus.6 Human.parvovirus.B19 Paramecium.bursaria.Chlorella Parvo.like.hybrid.virus Acute.bee.paralysis Aphid.lethal.paralysis.virus Circovirus.like.genome No.genus..species..Brome.mosaic.virus Cherry.leaf.roll Cucumber.green.mottle Cucumber.mosaic.virus Lettuce.big.vein.associated Mirafiori.lettuce.big.vein Mirafiori.lettuce.virus Papaya.ringspot.virus Peanut.stunt.virus Pepper.mild.mottle.virus Tobacco.mosaic.virus Tomato.chlorotic.spot Tomato.mosaic.virus Tomato.spotted.wilt.virus Zucchini.yellow.mosaic.virus 9374X1.CASE_HEART 9374X3.CASE_HEART 9374X8.CASE_HEART 9374X6.CTRL_HEART D 10229X4.CASE_LUNG 10535X1.CASE_LUNG 10535X4.CASE_LUNG 9374X2.CASE_LUNG 9374X4.CASE_LUNG 9374X5.CASE_LUNG 9374X7.CASE_LUNG KD1.CASE_LUNG KD3.CASE_LUNG C 10535X2.CASE_THROAT KD2.CASE_THROAT 10535X3.CTRL_THROAT B 8898X6.CASE_SPLEEN A 62UW4AAXX_8.CASE_SERUM 630WBAAXX_2.CASE_SERUM 70EY4AAXX_2.CASE_SERUM 70EY4AAXX_3.CASE_SERUM 70EY4AAXX_4.CASE_SERUM 70EY4AAXX_5.CASE_SERUM 70EY4AAXX_6.CASE_SERUM 70EY4AAXX_7.CASE_SERUM 9639X1.CASE_SERUM 9639X2.CASE_SERUM 9785X1.CASE_SERUM 62VDFAAXX_5.CTRL_SERUM 630WBAAXX_1.CTRL_SERUM 630WBAAXX_3.CTRL_SERUM 630WBAAXX_4.CTRL_SERUM 630WBAAXX_5.CTRL_SERUM 630WBAAXX_6.CTRL_SERUM 630WBAAXX_7.CTRL_SERUM 630WBAAXX_8.CTRL_SERUM 70EY4AAXX_1.CTRL_SERUM 70EY4AAXX_8.CTRL_SERUM E 10229X1.CASE_CA 10229X2.CASE_CA 10229X3.CASE_CA 8502X3.CASE_CA 8502X4.CASE_CA 8898X1.CASE_CA 8898X2.CASE_CA 8502X1.CTRL_CA 8502X2.CTRL_CA 8898X3.CTRL_CA 8898X4.CTRL_CA 8898X5.CTRL_CA 8898X7.CTRL_CA 8898X8.CTRL_CA Heart Lung Throat Spleen Serum Coronary artery

Bacterial/fungal discovery in sequence data Color Key Metastats does not find any differentially represented bacterial or fungal candidates Many low level background bacteria/fungi (a smaller representative set shown here) Some known pathogens, but not associated with KD Many organisms that look like environmental or kit contaminants, but not associated with KD 0 40 Value Stenotrophomos.maltophilia.K279a Methanococcus.sp. Ruminococcus.sp. Streptobacillus.moniliformis.DSM.12112 Mollicutes.bacterium.D7 Prochlorococcus.marinus Acinetobacter.sp. Candida.albicans Staphylococcus.sp. Lactobacillus.gasseri.224.1 Fusobacterium.sp. Mycoplasma.sp. Clostridium.sp. Propionibacterium.acnes Streptococcus.sp. Neisseria.sp. Heamophilus.sp. Lindgomyces.breviappendiculatus Lentithecium.aquaticum Boeremia.exigua Teratosphaeria.fibrillosa Leptosphaeruli.australis Endosporium.populi.tremuloidis Westerdykella.cylindrica Zygosaccharomyces.fermentati Dissoconium.commune Hysterobrevium.mori Mytilinidion.resinicola Phoma.betae Aliquandostipite.khaoyaiensis Mycosphaerella.graminicola Dothidea.insculpta Case Control 9374X1.CASE_HEART 9374X3.CASE_HEART 9374X8.CASE_HEART 9374X6.CTRL_HEART D 10229X4.CASE_LUNG 10535X1.CASE_LUNG 10535X4.CASE_LUNG 9374X2.CASE_LUNG 9374X4.CASE_LUNG 9374X5.CASE_LUNG 9374X7.CASE_LUNG KD1.CASE_LUNG KD3.CASE_LUNG C KD2.CASE_THROAT 10535X3.CTRL_THROAT B 8898X6.CASE_SPLEEN A 62UW4AAXX_8.CASE_SERUM 630WBAAXX_2.CASE_SERUM 70EY4AAXX_2.CASE_SERUM 70EY4AAXX_3.CASE_SERUM 70EY4AAXX_4.CASE_SERUM 70EY4AAXX_5.CASE_SERUM 70EY4AAXX_6.CASE_SERUM 70EY4AAXX_7.CASE_SERUM 9639X1.CASE_SERUM 9639X2.CASE_SERUM 9785X1.CASE_SERUM 62VDFAAXX_5.CTRL_SERUM 630WBAAXX_1.CTRL_SERUM 630WBAAXX_3.CTRL_SERUM 630WBAAXX_4.CTRL_SERUM 630WBAAXX_5.CTRL_SERUM 630WBAAXX_6.CTRL_SERUM 630WBAAXX_7.CTRL_SERUM 630WBAAXX_8.CTRL_SERUM 70EY4AAXX_1.CTRL_SERUM 70EY4AAXX_8.CTRL_SERUM E 10229X1.CASE_CA 10229X3.CASE_CA 8502X3.CASE_CA 8502X4.CASE_CA 8898X1.CASE_CA 8898X2.CASE_CA 8502X1.CTRL_CA 8502X2.CTRL_CA 8898X3.CTRL_CA 8898X4.CTRL_CA 8898X5.CTRL_CA 8898X7.CTRL_CA 8898X8.CTRL_CA F Heart Lung Throat Spleen Serum Coronary artery

Discovery in unclassified contig data Metastats used to identify contigs more common in KD than controls Subset only in KD and Prioritizing for follow up not in controls shown here Contigs Cases Controls 10535X1.CASE.L 9374X5.CASE_L 9374X5.CASE_L 9374X5.CASE_L 9374X5.CASE_L KD1.CASE_LUN KD1.CASE_LUN KD1.CASE_LUN KD1.CASE_LUN KD1.CASE_LUN KD1.CASE_LUN KD1.CASE_LUN KD1.CASE_LUN

Current candidates Best candidates are sequences predominantly in patient samples but not controls Could be highly divergent from known viruses Could be very low abundance in samples Current work with existing data: Continue to try to classify sequences with updated databases Validate candidates with real-time RT-PCR assays on a larger set of cases controls Proceed cautiously due to confounding factors associated with pathogen discovery in MSS data

Confounding factors associated with MSS analysis Can find microbes (known or novel) that are contaminants from reagents, water, enzymes, etc. Contaminants can sometimes associate with a clinical phenotype by chance Example: If control samples are typically higher biomass (signal), case samples will have a higher representation of contaminants (noise) While we aim for relatively unbiased representation of the metagenome, biases are introduced during sample preparation Classifying sequences based on similarity to reference genomes is limited by the quality of the databases Missing entries Mis-labeled entries

Summary and conclusions Most comprehensive ultra-deep sequencing study to identify a causative agent of KD to date No etiologic agent of KD identified yet, but we are pursuing candidates We suspect that sequences of the agent are presently unclassified Prioritizing ORFs associated with KD samples for further study Possible challenges the agent is highly divergent from sequenced reference genomes causes a very early viremia is highly cell-associated is present in very low quantity in patient tissues may also be present in some controls. Our work highlights the challenges of identifying a new viral agent

Support for this work National Institutes of Health HL63771, HL109955, AI106030 (to AHR), the Max Goldenberg Foundation, and the Center for Kawasaki Disease at the Ann & Robert H. Lurie Children's Hospital of Chicago.