Applications of Next Generation Sequencing in Metagenomics Studies

Similar documents
How much sequencing do I need? Emily Crisovan Genomics Core

CM581A2: NEXT GENERATION SEQUENCING PLATFORMS AND LIBRARY GENERATION

Single Cell Genomics

DNA. bioinformatics. epigenetics methylation structural variation. custom. assembly. gene. tumor-normal. mendelian. BS-seq. prediction.

Welcome to the NGS webinar series

Research school methods seminar Genomics and Transcriptomics

Considerations for Illumina library preparation. Henriette O Geen June 20, 2014 UCD Genome Center

IMGM Laboratories GmbH. Sales Manager

Carl Woese. Used 16S rrna to developed a method to Identify any bacterium, and discovered a novel domain of life

Infectious Disease Omics

Introduction to taxonomic analysis of metagenomic amplicon and shotgun data with QIIME. Peter Sterk EBI Metagenomics Course 2014

Introductory Next Gen Workshop

Introduction Bioo Scientific

MICROBIOMICS Current and future tools of the trade

Incorporating Molecular ID Technology. Accel-NGS 2S MID Indexing Kits

Sequencing technologies. Jose Blanca COMAV institute bioinf.comav.upv.es

Genome Sequencing. I: Methods. MMG 835, SPRING 2016 Eukaryotic Molecular Genetics. George I. Mias

Next-Generation Sequencing Services à la carte

Index. E Electrophoretic Mobility Shift Assay (EMSA), 262 ENCODE project, 223, 224 European Nucleotide Archive (ENA), 34

Introductie en Toepassingen van Next-Generation Sequencing in de Klinische Virologie. Sander van Boheemen Medical Microbiology

WELCOME. Norma J. Nowak, PhD Executive Director, NY State Center of Excellence in Bioinformatics and Life Sciences (CBLS)

Bioinformatics Advice on Experimental Design

High Throughput Sequencing Technologies. J Fass UCD Genome Center Bioinformatics Core Monday September 15, 2014

Development of NGS metabarcoding. characterization of aerobiological samples. Lucia Muggia

Culture-independent approaches to assess microbial diversity in biofilms

Microbiome Analysis. Research Day 2012 Ranjit Kumar

Data Analysis with CASAVA v1.8 and the MiSeq Reporter

Analysing genomes and transcriptomes using Illumina sequencing

QIAGEN s NGS Solutions for Biomarkers NGS & Bioinformatics team QIAGEN (Suzhou) Translational Medicine Co.,Ltd

Microbial Diversity and Assessment (III) Spring, 2007 Guangyi Wang, Ph.D. POST103B

Third Generation Sequencing

Microbially Mediated Plant Salt Tolerance and Microbiome based Solutions for Saline Agriculture

Sequencing technologies. Jose Blanca COMAV institute bioinf.comav.upv.es

Contents. 1 Basic Molecular Microbiology of Bacteria... 1 Exp. 1.1 Isolation of Genomic DNA Introduction Principle...

Next Gen Sequencing. Expansion of sequencing technology. Contents

INTRODUCCIÓ A LES TECNOLOGIES DE 'NEXT GENERATION SEQUENCING'

Standard Products Nucleic Acid Sample Submission Guideline

Introduction to Bioinformatics and Gene Expression Technologies

High Throughput Sequencing Technologies. J Fass UCD Genome Center Bioinformatics Core Tuesday December 16, 2014

Molecular Cell Biology - Problem Drill 11: Recombinant DNA

Multiple choice questions (numbers in brackets indicate the number of correct answers)

About Strand NGS. Strand Genomics, Inc All rights reserved.

AP Biology Gene Expression/Biotechnology REVIEW

Ion S5 and Ion S5 XL Systems

High Throughput Sequencing Technologies. J Fass UCD Genome Center Bioinformatics Core Monday June 16, 2014

Microarrays: since we use probes we obviously must know the sequences we are looking at!

Taxonomy. Classification of microorganisms 3/12/2017. Is the study of classification. Chapter 10 BIO 220

SMARTer Ultra Low RNA Kit for Illumina Sequencing Two powerful technologies combine to enable sequencing with ultra-low levels of RNA

DNA-Sequencing. Technologies & Devices. Matthias Platzer. Genome Analysis Leibniz Institute on Aging - Fritz Lipmann Institute (FLI)

ACCEL-NGS 2S DNA LIBRARY KITS

DNA-Sequencing. Technologies & Devices. Matthias Platzer. Genome Analysis Leibniz Institute on Aging - Fritz Lipmann Institute (FLI)

A Genomics (R)evolution: Harnessing the Power of Single Cells

RIPTIDE HIGH THROUGHPUT RAPID LIBRARY PREP (HT-RLP)

Course Overview: Mutation Detection Using Massively Parallel Sequencing

Data and Metadata Models Recommendations Version 1.2 Developed by the IHEC Metadata Standards Workgroup

Microbial Metabolism Systems Microbiology

Leonardo Mariño-Ramírez, PhD NCBI / NLM / NIH. BIOL 7210 A Computational Genomics 2/18/2015

High Throughput Sequencing Technologies. UCD Genome Center Bioinformatics Core Monday 15 June 2015

Impact of gdna Integrity on the Outcome of DNA Methylation Studies

Whole Transcriptome Analysis of Illumina RNA- Seq Data. Ryan Peters Field Application Specialist

Introduction to Bioinformatics

Next Generation Sequencing. Jeroen Van Houdt - Leuven 13/10/2017

RNA-Seq analysis workshop

MicroSEQ Rapid Microbial Identification System

Product selection guide Ion S5 and Ion S5 XL Systems

Genomics and Transcriptomics of Spirodela polyrhiza

Exploring new frontiers with next-generation sequencing

INTRODUCTION TO REVERSE TRANSCRIPTION PCR (RT-PCR) ABCF 2016 BecA-ILRI Hub, Nairobi 21 st September 2016 Roger Pelle Principal Scientist

Next-Generation Sequencing. Technologies

NEBNext. for Ion Torrent LIBRARY PREPARATION KITS

Introduction to Bioinformatics

The Polymerase Chain Reaction. Chapter 6: Background

Long and short/small RNA-seq data analysis

MHC Region. MHC expression: Class I: All nucleated cells and platelets Class II: Antigen presenting cells

Introduction to Next Generation Sequencing (NGS)

Ion S5 and Ion S5 XL Systems

Gene Expression Technology

Microbiome Analysis in Kawasaki Disease. Kristine M. Wylie, Susan C. Baker, George M. Weinstock, Stanford T. Shulman, and Anne H.

Product selection guide Ion GeneStudio S5 Series

Introduction to BioMEMS & Medical Microdevices DNA Microarrays and Lab-on-a-Chip Methods

Supplementary Figure 1

CHAPTER 20 DNA TECHNOLOGY AND GENOMICS. Section A: DNA Cloning

RADSeq Data Analysis. Through STACKS on Galaxy. Yvan Le Bras Anthony Bretaudeau Cyril Monjeaud Gildas Le Corguillé

Method development for DNA methylation on a single cell level

Molecular methods for detection of Antibiotic Resistance in environmental matrices: limits, prospects and challenges.

Introduction to Bioinformatics

High-throughput scale. Desktop simplicity.

DNA Microarray Technology

Digital DNA/RNA sequencing enables highly accurate and sensitive biomarker detection and quantification

M I C R O B I O L O G Y WITH DISEASES BY TAXONOMY, THIRD EDITION

High-yield, Scalable Library Preparation with the NEBNext Ultra II FS DNA Library Prep Kit

Sanger vs Next-Gen Sequencing

Next Generation Sequencing: An Overview

Unit 6: Molecular Genetics & DNA Technology Guided Reading Questions (100 pts total)

Computational Biology I LSM5191

Functional Genomics Overview RORY STARK PRINCIPAL BIOINFORMATICS ANALYST CRUK CAMBRIDGE INSTITUTE 18 SEPTEMBER 2017

P HENIX. PHENIX PCR Enzyme Guide Tools For Life Science Discovery RESEARCH PRODUCTS

CSC Assignment1SequencingReview- 1109_Su N_NEXT_GENERATION_SEQUENCING.docx By Anonymous. Similarity Index

Assay Standards Working Group Nov 2012 Assay Standards Working Group Recommendations, November 2012

Transcription:

Applications of Next Generation Sequencing in Metagenomics Studies Francesca Rizzo, PhD Genomix4life Laboratory of Molecular Medicine and Genomics Department of Medicine and Surgery University of Salerno francesca.rizzo@genomix4life.com frizzo@unisa.it

Genome = Parts list of a single genome Introduction Human genome: 3 Gbp, 24,000 ORFs Bacterial Genome: 3-5 Mbp, 200-4000 ORFs Viral Genome: 0.1-1Mbp, 10-100 ORFs e.g.: Garden soil 50 archeal species: 1.5x10 8 bp 1000 bacterial species: 5.0x10 9 bp 200 fungal species: 1.0x10 11 bp 50 invertebrate species: 5.0x10 10 bp 5000 viral genomes: 5.0x10 7 bp...functional analysis of the collective genomes of soil microflora, which we term the metagenome of the soil. - J. Handelsman et al., 1998 Metagenome = Parts list of the community 160 Gb DNA

Introduction "the application of modern genomics techniques to the study of communities of microbial organisms directly in their natural environments, bypassing the need for isolation and lab cultivation of individual species Chen, K.; Pachter, L. (2005) Comput. Biol. PubMed: metagenom*[title/abstract]

Introduction "the application of modern genomics techniques to the study of communities of microbial organisms directly in their natural environments, bypassing the need for isolation and lab cultivation of individual species Chen, K.; Pachter, L. (2005) Comput. Biol. PubMed: metagenom*[title/abstract]

Next Generation 3 rd Generation First Generation

Metagenome timeline

Metagenomics Who is there? What are they doing?

NGS methods Marker Genes (Amplicon sequencing) Extract DNA Amplify with targeted primers Diversity analysis Matagenomics (Shotgun sequencing) Extract DNA Sequence random fragments Diversity, function analysis Metatranscriptomics (RNA-sequencing) Extract RNA Sequence cdna Gene expression, function

NGS methods Marker Genes (Amplicon sequencing) Extract DNA Amplify with targeted primers Diversity analysis Matagenomics (Shotgun sequencing) Extract DNA Sequence random fragments Diversity, function analysis Metatranscriptomics (RNA-sequencing) Extract RNA Sequence cdna Gene expression, function

Marker Genes (Amplicon sequencing) 16S rrna Genes Bacteria Internal transcribed spacer (ITS) Fungi V4 V3 V6 V5 V7 V8 V1 V2 V9 Claesson, Marcus J., et al. Nucleic acids research 38.22 (2010) D. Lee Taylor et al. Appl. Environ. Microbiol. 2016;82:7217-7226 Alternative gene markers reca: Response to DNA Stress in Bacteria 18S rrna: 18S ribosomal RNA rpob: RNA polymerase ß subunit hsp70: heat shock protein 70

Typical workflow Marker Genes (Amplicon sequencing) Extract DNA Library preparation Environmental sample DNA

Typical workflow Marker Genes (Amplicon sequencing) Extract DNA Library preparation Environmental sample DNA

Typical workflow Marker Genes (Amplicon sequencing) Extract DNA Library preparation Environmental sample DNA Sequencing PLATFORM: Illumina MiSeq (2X150 bp, 2X250 bp, 2X300 bp) Read 1: 300 bp Index 2 Analysis Index 1 Read 2: 300 bp DEPTH OF SAMPLING: -50.000-100.000 reads/sample for simple communities -200.000-500.000 reads/sample for complex metagenomes

Typical workflow Marker Genes (Amplicon sequencing) Who is there? Barcodes are used to assign each sequence to the sample it come from, dropping low quality reads. Adapter are trimmed and multiple sequence alignment based on reference sequences is built. Phylogenetic tree is built using representative OTU. Quantitative assessment of the community 30 % 15 % 5% 10 % 40 % Use community clustering techniques to relates samples to one another Related sequences are grouped into OTUs and taxonomy is assigned OTU (Operational Taxonomic Unit)

Marker Genes (Amplicon sequencing) ADVANTAGE: -Fast survey of large community -Reduced cost -Mature set of tools and statistics for analysis -Good for first round surveys DISADVANTAGE: -Lack of tools for processing ITS/fungal microbiome data set -Amplification bias affect accuracy and replication -16S or ITS may not differentiate between similar strains -Different target regions in 16S gene and ITS region can give different results/resolution -Chloroplast 16S sequences can get amplified in plant metagenomes

NGS methods Marker Genes (Amplicon sequencing) Extract DNA Amplify with targeted primers Diversity analysis Matagenomics (Shotgun sequencing) Extract DNA Sequence random fragments Diversity, function analysis Metatranscriptomics (RNA-sequencing) Extract RNA Sequence cdna Gene expression, function

Typical workflow Matagenomics (Shotgun sequencing) Extract DNA DNA fragmentation Environmental sample DNA Library preparation End repair 3 End adenylation Adapter ligation PCR

Typical workflow Matagenomics (Shotgun sequencing) Extract DNA DNA fragmentation Environmental sample DNA Library preparation End repair Sequencing 3 End adenylation Adapter ligation PCR PLATFORM: Illumina NextSeq (2X150 bp), Hiseq X (2X125bp) Read 1: 150 bp Index 1 Analysis Index 2 DEPTH OF SAMPLING: 50.000.000-500.000.000 reads/sample Read 2: 150 bp Number of specie? Relative abundance of the species? Known o unknown genomes?

Typical workflow Matagenomics (Shotgun sequencing) Genome A Genome B Genome C Who is there? What are they doing? What can they do? Barcodes are used to assign each sequence to the sample it come from, dropping low quality reads. Taxonomic classification Genome D Unknown Metagemomes alignment to reference genome and/or de novo assembling. Function analysis Gene prediction and/or annotation Protein family Gene/genome structure Metabolic pathways Gene ontology

NGS methods Marker Genes (Amplicon sequencing) Extract DNA Amplify with targeted primers Diversity analysis Matagenomics (Shotgun sequencing) Extract DNA Sequence random fragments Diversity, function analysis Metatranscriptomics (RNA-sequencing) Extract RNA Sequence cdna Gene expression, function

Introduction Genome = Parts list of a single genome Metagenome = Parts list of the community Human genome: 3 Gbp, 24,000 ORFs Bacterial Genome: 3-5 Mbp, 200-4000 ORFs Viral Genome: 0.1-1Mbp, 10-100 ORFs e.g.: Garden soil 50 archeal species: 1.5x10 8 bp 1000 bacterial species: 5.0x10 9 bp 200 fungal species: 1.0x10 11 bp 50 invertebrate species: 5.0x10 10 bp 5000 viral genomes: 5.0x10 7 bp 160 Gb DNA 8 Gb RNA

Typical workflow Metatranscriptomics (RNA-sequencing) Extract RNA rrna depletion Environmental sample (Optional) Library preparation

Typical workflow Metatranscriptomics (RNA-sequencing) Extract RNA rrna depletion Environmental sample (Optional) RNA fragmented and primed Library preparation First strand cdna synthesis Second strand cdna synthesis 3 End adenylation Adapter ligation PCR

Typical workflow Metatranscriptomics (RNA-sequencing) Extract RNA rrna depletion Environmental sample Sequencing (Optional) Library preparation RNA fragmented and primed First strand cdna synthesis Second strand cdna synthesis 3 End adenylation PLATFORM: Illumina NextSeq (2X150 bp), Hiseq X (2X125bp) Read 1: 150 bp Index 1 Analysis Adapter ligation PCR Index 2 Read 2: 150 bp DEPTH OF SAMPLING: 20.000.000-500.000.000 reads/sample Number of specie? Relative abundance of the species? Known o unknown genomes? Levels of gene expression?

Typical workflow Metatranscriptomics (RNA-sequencing) Gene 1 Gene 2 Gene 3 Gene 1 Gene 2 Gene 3 Gene 1 Gene 2 Gene 3 Specie A Specie B Specie C Who is there? What are they doing? Protein family profile Barcodes are used to assign each sequence to the sample it come from, dropping low quality reads. Function analysis Gene 1 Gene 2 Gene 3 Alignment to reference transcriptome and/or de novo assembling. Specie D Unknown Taxonomic classification New molecules identification Gene ontology abundance Metabolic pathways reconstruction Pathways inference and enrichment analysis

Who is there What are they doing?

WHY? Metagenomics Discovery of: the diversity of life what is a genome/species the interplay between human and microbes how do microbial communities work and how stable are novel metabolic pathways novel natural products new antibiotic new molecules with new functions new enzymes and bioactive molecules

Our Team Our Services Contacts: www.genomics4life.com info@genomix4life.com Biomarkers Next Generation Sequencing Prof. Alessandro Dott.ssa Giovanna Weisz Marchese Dott. Giorgio Giurato Dott.ssa Rizzo Francesca Prof. Roberto Parente Dott.ssa Roberta Tarallo Fondazione dell Università degli Studi di Salerno Dott.ssa Teresa Rocco Dott.ssa Maria Ravo Dott.ssa Angela Cordella Dott. Giovanni Nassa Microarray QYAsynphony Liquid handling Ion PGM Bioinformatics Available Technologies NextSeq 500 MiSeq Research Service Server 40 Tb HiSeq 2500 Server 400 Tb NEXT GENERATION SEQUENCING DNA Whole genome & Exome sequencing targeted resequencing - de novo sequencing RNA RNA-Seq smallrna-seq - RIP-Seq CAGE Epigenomics and Gene regulation ChIP-Seq - BS-Seq targeted BS-Seq - methyl- Seq Metagenomics and metatrascriptomics Amplicon-Seq Shotgun-Seq Single cell sequencing MICROARRAY Genotyping Infinium HD - Vera Code Golden Gate Genotyping Epigenetics Human MethylationEPIC BeadChip Nutrigenomics

Applications of Next Generation Sequencing in Metagenomics Studies Francesca Rizzo, PhD Genomix4life Laboratory of Molecular Medicine and Genomics Department of Medicine and Surgery University of Salerno francesca.rizzo@genomix4life.com frizzo@unisa.it