Introduction to Microbial Community Analysis. Tommi Vatanen CS-E Statistical Genetics and Personalised Medicine


1 Introduction to Microbial Community Analysis Tommi Vatanen CS-E Statistical Genetics and Personalised Medicine

2 Structure of the lecture
Motivation: human microbiome
Terminology
Data types, analysis
Examples from the Human Microbiome Project (HMP) and the DIABIMMUNE project

3 Our microbial selves: microbes are in, on & around us

4 Our microbial selves: microbes are in, on & around us
More microbial cells than human cells in the human body (1-2 kg, mostly in the gut)
1000s of species, each containing 1000s of genes (outnumbering human genes 100:1)
Under ideal conditions
  Aid in digestion, make nutrients (vitamin K), keep bad guys out, train the immune system
Under non-ideal conditions
  Predispose to, exacerbate, or directly cause deviations from health

5 What is metagenomics?
Total collection of microorganisms within a community
  Also microbial community or microbiota
Total genomic potential of a microbial community
Study of uncultured microorganisms from the environment, which can include humans or other living hosts
Total biomolecular repertoire of a microbial community

6 Sequencing techniques
Massively parallel DNA sequencing revolutionized the study of microbial communities
No need to isolate bacteria in the lab
Purify DNA and sequence
Golden age of microbial community studies

7 What to do with your metagenome?
Basic science
  Reservoir of gene and protein functional information
  Comprehensive snapshot of microbial ecology and evolution
Translational science
  Public health tool monitoring population health and epidemiology
  Diagnostic or prognostic biomarker for host disease

8 Examples of metagenomic studies: Global ocean sampling 2003/ ongoing

9 The NIH Human Microbiome Project (HMP): A comprehensive microbial survey
What is a normal human microbiome?
300 healthy human subjects
Multiple body sites: 15 in males, 18 in females
Multiple visits
Clinical metadata

10 DIABIMMUNE study on the infant gut microbiome
Follow the developing infant gut microbiome in Finland, Estonia and Russian Karelia
222 infants at risk for autoimmune diseases by genotype
Monthly stool samples from birth until 3 years
Clinical metadata: diet, antibiotics, mode of birth, vaccinations

11 Talking about microbes: Phylogenies OTU = operational taxonomic unit

12 Talking about microbes: Relative abundance
Absolute abundance is always masked in data obtained by the techniques discussed here
Information is measured in relative abundances, e.g. "30% of the bacteria are XXX"

13 Talking about microbes: Abundance vs. prevalence
Abundant but not prevalent
Prevalent but not abundant
Abundant and prevalent
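
The distinction between relative abundance and prevalence can be illustrated with a minimal sketch. It assumes each sample is a dictionary of read counts per taxon; the function names and the toy counts are hypothetical and not taken from the lecture.

```python
from typing import Dict, List


def relative_abundance(counts: Dict[str, int]) -> Dict[str, float]:
    """Convert raw read counts of one sample into fractions that sum to 1."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sample has no reads")
    return {taxon: n / total for taxon, n in counts.items()}


def prevalence(samples: List[Dict[str, int]], taxon: str) -> float:
    """Fraction of samples in which the taxon is observed at all."""
    present = sum(1 for sample in samples if sample.get(taxon, 0) > 0)
    return present / len(samples)


def mean_relative_abundance(samples: List[Dict[str, int]], taxon: str) -> float:
    """Average relative abundance of the taxon across all samples."""
    values = [relative_abundance(s).get(taxon, 0.0) for s in samples]
    return sum(values) / len(values)


# Made-up counts: taxon_A dominates one sample only (abundant, not prevalent);
# taxon_B is found everywhere at low levels (prevalent, not abundant).
samples = [
    {"taxon_A": 90, "taxon_B": 10},
    {"taxon_B": 5, "taxon_C": 95},
    {"taxon_B": 8, "taxon_C": 92},
    {"taxon_B": 10, "taxon_C": 90},
]

for taxon in ("taxon_A", "taxon_B", "taxon_C"):
    print(taxon, prevalence(samples, taxon), mean_relative_abundance(samples, taxon))
```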

14 Talking about microbes: diversity
Diversity: broadly, a community's number and distribution of organisms
  Also community composition or structure
Alpha-diversity refers to the diversity within a single community (sample)
Beta-diversity refers to the dissimilarity between two communities

15 Talking about microbes: Alpha-diversity (1-sample) scenarios
[Figure: example communities with the labels: not diverse; qualitatively diverse, taxonomically diverse; phylogenetically diverse; quantitatively diverse, taxonomically diverse]

16 Talking about microbes: measures for alpha-diversity
Richness: number of unique taxa
Richness estimates (how many unobserved taxa?)
  Chao1: $S_{\mathrm{Chao1}} = S_{\mathrm{obs}} + \frac{f_1^2}{2 f_2}$, where $S_{\mathrm{obs}}$ is the observed richness, $f_1$ is the number of singleton taxa (observed only once, one read) and $f_2$ is the number of doubleton taxa
Diversity as considered in information theory: entropy
  Shannon's diversity index: $H' = -\sum_i p_i \ln p_i$, where $p_i$ is the relative abundance of taxon $i$
Many other measures: Simpson, McIntosh, Berger-Parker, ...
vegan::diversity() in R
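
The alpha-diversity measures named on this slide can be sketched in a few lines of Python. This is an illustrative implementation of the classic Chao1 estimator and the Shannon index with natural logarithms, not the code behind vegan::diversity(). The fallback used when there are no doubletons is the bias-corrected form and is an assumption of this sketch.

```python
import math
from typing import Sequence


def richness(counts: Sequence[int]) -> int:
    """Number of taxa observed at least once."""
    return sum(1 for c in counts if c > 0)


def chao1(counts: Sequence[int]) -> float:
    """Chao1 richness estimate: S_obs + f1^2 / (2 * f2)."""
    s_obs = richness(counts)
    f1 = sum(1 for c in counts if c == 1)  # singletons
    f2 = sum(1 for c in counts if c == 2)  # doubletons
    if f2 == 0:
        # Assumption: fall back to the bias-corrected form to avoid dividing by zero.
        return s_obs + f1 * (f1 - 1) / 2.0
    return s_obs + f1 * f1 / (2.0 * f2)


def shannon(counts: Sequence[int]) -> float:
    """Shannon diversity index H' = -sum(p_i * ln(p_i)) over observed taxa."""
    total = sum(counts)
    h = 0.0
    for c in counts:
        if c > 0:
            p = c / total
            h -= p * math.log(p)
    return h


# Made-up read counts for eight taxa in one sample.
counts = [10, 5, 2, 2, 1, 1, 1, 0]
print(richness(counts))  # 7 observed taxa
print(chao1(counts))     # 7 + 3^2 / (2 * 2) = 9.25
print(shannon(counts))
```

A perfectly even community of four taxa gives a Shannon index of ln 4, the maximum for that richness.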

17 Alpha-diversity of the gut microbiome increases during first years of life
[Figure: microbiome complexity & stability over the lifespan: birth, 3 yrs, adult, elderly]
Kostic, A. D., Xavier, R. J., & Gevers, D. (2014). The microbiome in inflammatory bowel disease: current status and the future ahead. Gastroenterology, 146(6).

18 Increasing diversity in DIABIMMUNE Increase in diversity during first three years of life New microbes colonize the gut with increasing complexity of diet, environmental exposures, etc.

19 Talking about microbes: Beta-diversity (2-sample) scenarios
[Figure: pairs of communities (Sample 1 vs. Sample 2) with the labels: qualitatively diverse, taxonomically diverse; quantitatively diverse, taxonomically diverse; quantitatively diverse, phylogenetically diverse]

20 Talking about microbes: measures for beta-diversity
Jaccard index: proportion of shared taxa, $J(A, B) = \frac{|A \cap B|}{|A \cup B|}$
Bray-Curtis dissimilarity: $BC_{jk} = 1 - \frac{2 C_{jk}}{S_j + S_k}$, where $C_{jk}$ is the sum of the lesser values for only those species in common between both samples, and $S_j$ and $S_k$ are the total counts (summed over all species) in each sample
vegan::vegdist in R
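
As a rough illustration of the two beta-diversity measures on this slide, the sketch below computes the Jaccard index on presence/absence and the Bray-Curtis dissimilarity on counts. Samples are assumed to be dictionaries of counts per taxon; the names and numbers are made up for the example and this is not the vegan::vegdist implementation.

```python
from typing import Dict


def jaccard_index(a: Dict[str, int], b: Dict[str, int]) -> float:
    """Proportion of shared taxa: |A intersect B| / |A union B| on presence/absence."""
    taxa_a = {t for t, n in a.items() if n > 0}
    taxa_b = {t for t, n in b.items() if n > 0}
    union = taxa_a | taxa_b
    if not union:
        raise ValueError("both samples are empty")
    return len(taxa_a & taxa_b) / len(union)


def bray_curtis(a: Dict[str, int], b: Dict[str, int]) -> float:
    """Bray-Curtis dissimilarity: 1 - 2C / (S_a + S_b)."""
    shared = set(a) & set(b)
    c = sum(min(a[t], b[t]) for t in shared)  # sum of the lesser counts
    total = sum(a.values()) + sum(b.values())
    if total == 0:
        raise ValueError("both samples are empty")
    return 1.0 - 2.0 * c / total


sample_1 = {"x": 6, "y": 7, "z": 4}
sample_2 = {"x": 10, "y": 0, "z": 6}

print(jaccard_index(sample_1, sample_2))  # 2 shared taxa out of 3 observed
print(bray_curtis(sample_1, sample_2))    # 1 - 2 * 10 / (17 + 16)
```

Note that the Jaccard index is a similarity (1 means identical taxon sets), whereas Bray-Curtis is a dissimilarity (0 means identical counts).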

21 UniFrac beta-diversity accounts for the phylogeny
Raw weighted UniFrac metric: $u = \sum_{i=1}^{n} b_i \left| \frac{A_i}{A_T} - \frac{B_i}{B_T} \right|$
where $n$ is the total number of branches in the tree, $b_i$ is the length of branch $i$, $A_i$ and $B_i$ are the number of descendants of branch $i$ from communities A and B respectively, and $A_T$ and $B_T$ are the total number of sequences from communities A and B respectively
Lozupone, C.; Knight, R. (2005). "UniFrac: A New Phylogenetic Method for Comparing Microbial Communities". Applied and Environmental Microbiology 71.
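
The raw weighted UniFrac sum can be traced on a toy tree. In this sketch the tree is a dictionary mapping each node to its parent and the length of the branch leading to it; that representation, the function name and the example tree are assumptions made for illustration, not the format used by any UniFrac software.

```python
from typing import Dict, Optional, Tuple

# node -> (parent, length of the branch connecting the node to its parent)
Tree = Dict[str, Tuple[Optional[str], float]]


def weighted_unifrac(tree: Tree, counts_a: Dict[str, int], counts_b: Dict[str, int]) -> float:
    """Raw weighted UniFrac: sum over branches of b_i * |A_i / A_T - B_i / B_T|."""
    a_total = sum(counts_a.values())
    b_total = sum(counts_b.values())
    desc_a = {node: 0 for node in tree}
    desc_b = {node: 0 for node in tree}
    # Propagate each leaf count up to the root so every branch knows how many
    # sequences from each community descend from it.
    for counts, desc in ((counts_a, desc_a), (counts_b, desc_b)):
        for leaf, n in counts.items():
            node: Optional[str] = leaf
            while node is not None:
                desc[node] += n
                node = tree[node][0]
    return sum(
        length * abs(desc_a[node] / a_total - desc_b[node] / b_total)
        for node, (parent, length) in tree.items()
        if parent is not None  # the root has no branch above it
    )


# Toy tree: ((t1:0.5, t2:0.5)n1:1.0, t3:2.0)root
tree: Tree = {
    "root": (None, 0.0),
    "n1": ("root", 1.0),
    "t1": ("n1", 0.5),
    "t2": ("n1", 0.5),
    "t3": ("root", 2.0),
}

community_a = {"t1": 10}
community_b = {"t3": 10}
print(weighted_unifrac(tree, community_a, community_b))  # 0.5 + 1.0 + 2.0 = 3.5
```

Two communities that share no branches accumulate the full length of both lineages, while identical communities score 0.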

22 Talking about microbes: ordination
Ordination is a constrained projection of high-dimensional data into fewer dimensions
Principal component analysis (PCA) chooses new orthogonal dimensions that successively maximize the variance explained
Principal coordinates analysis (PCoA) denotes ordination based on a (dis)similarity matrix
Nonmetric multidimensional scaling (NMDS) based on UniFrac beta-diversity is widely used in microbial community analysis
Hamady, 2009
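
To make the idea of ordination from a dissimilarity matrix concrete, here is a minimal sketch of the first axis of a classical PCoA: the squared distances are double-centred and the leading eigenvector is found by power iteration. Power iteration is a stand-in chosen to keep the example dependency-free; real analyses would use a full eigendecomposition from a numerical library. The function name is hypothetical and a symmetric distance matrix is assumed.

```python
import math
from typing import List


def pcoa_first_axis(dist: List[List[float]], iterations: int = 200) -> List[float]:
    """Coordinates of each sample on the first principal coordinate."""
    n = len(dist)
    sq = [[dist[i][j] ** 2 for j in range(n)] for i in range(n)]
    row_mean = [sum(row) / n for row in sq]
    grand_mean = sum(row_mean) / n
    # Double centring (Gower): B = -1/2 * J * D^2 * J
    b = [
        [-0.5 * (sq[i][j] - row_mean[i] - row_mean[j] + grand_mean) for j in range(n)]
        for i in range(n)
    ]

    def multiply(vec: List[float]) -> List[float]:
        return [sum(b[i][j] * vec[j] for j in range(n)) for i in range(n)]

    # Power iteration converges to the eigenvector whose eigenvalue has the
    # largest magnitude; for Euclidean-like distances that is the first axis.
    v = [float(i + 1) for i in range(n)]
    for _ in range(iterations):
        w = multiply(v)
        norm = math.sqrt(sum(x * x for x in w))
        if norm < 1e-12:
            return [0.0] * n  # all samples coincide
        v = [x / norm for x in w]
    eigenvalue = sum(vi * wi for vi, wi in zip(v, multiply(v)))
    return [math.sqrt(max(eigenvalue, 0.0)) * x for x in v]


# Three samples that lie on a line at positions 0, 1 and 3.
distances = [
    [0.0, 1.0, 3.0],
    [1.0, 0.0, 2.0],
    [3.0, 2.0, 0.0],
]
axis = pcoa_first_axis(distances)
print(axis)
```

Because the toy distances are exactly one-dimensional, the single axis reproduces every pairwise distance; the sign of the axis is arbitrary.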

23 t-distributed stochastic neighbor embedding (t-SNE)
Modern, distance / similarity matrix based technique for visualizing (high-dimensional) data
Finds a mapping / visualization which is faithful to the original local neighborhoods in the data
Data points similar in the input data tend to be close in the visualization
Rtsne::Rtsne in R

24 What aspects of a human host most influence microbial community composition? Rob Knight ~5,200 microbial communities profiled by 16S sequencing (closer = more similar)

25 How about infant gut microbiome?
Variation in the infant gut microbiome is dominated by age
In DIABIMMUNE, Russians seem to have a distinct microbiota compared to Finns and Estonians

26 Two big questions of microbial community analysis Who is there? What are they doing?

27 How to obtain data on microbes?
Cultivate single strains of bacteria
  Traditional microbiology + sequencing
Sequencing-based methods for studying microbial communities
  Purify all DNA and sequence
  Amplicon-based methods target specific regions/genes of interest
  Shotgun sequencing for all DNA material
Differences between sequencing methods
  Short vs. long reads
  Errors are more problematic than in e.g. human genome analysis

28 Sequencing as a tool for microbial community analysis (amplicon vs. shotgun)
Lyse cells, extract & fragment DNA, sequence short DNA reads
16S (18S, ITS) rRNA gene
  Conserved across bacteria (allows PCR amplification)
  Some regions are variable (permits genus-level ID)
Map reads to reference genomes
Assemble into contigs
Result: features x samples table of relative abundances
[Figure: example reads AGCTAGA, CCGATCG, TTAGCAC, ACTAGCA; overlapping reads AGCTACAGC, ACAGCACGGCAT, GGCATCATC assembled into the contig AGCTACAGCACGGCATCATC]

29 Typical microbiome community analysis tasks
[Figure: workflow diagram; recoverable labels: Metagenomic data, 16S data, Stats]

30 Two big questions of microbial community analysis Who is there? What are they doing?

31 Metagenomic methods: 16S rRNA gene
Structural component of the prokaryotic ribosome
Used as a molecular clock to identify phylogeny: large, good scale for mutations
Portions are constant, allowing amplification
Relatively cheap
Woese, 1987; Pace, 1997; Ley, 2006
[Figure: 16S rRNA structure with variable regions V2 and V6 marked; credit: George Rice, Montana State University]

32 Microbiome composition analysis: phylotypes and binning
Binning: nontrivial assignment of reads to phylotypes or OTUs (= clustering / classification)
Phylotype or operational taxonomic unit (OTU): organisms clonal to within some tolerance (e.g. 97% sequence identity); approximately species

33 Microbiome composition analysis: operational taxonomic unit (OTU) binning
Open reference: clustering
Closed reference: classification
[Figure: toy reads (AAA, AAG, AAT, TGA, TTT, TGG) and unique sequences (>Uniq1 AAA, >Uniq2 TGA, >Uniq3 TTT)]
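
The difference between classification against references and clustering of reads can be sketched with a toy identity measure. The sketch assumes equal-length, pre-aligned sequences and a simple greedy centroid clustering; the function names, the 10-base toy reads and the 0.9 threshold are hypothetical and chosen only for illustration, not taken from QIIME or any other tool.

```python
from typing import Dict, List, Optional


def identity(a: str, b: str) -> float:
    """Fraction of matching positions between two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("this toy example assumes equal-length sequences")
    return sum(x == y for x, y in zip(a, b)) / len(a)


def closed_reference(
    reads: List[str], references: Dict[str, str], threshold: float
) -> Dict[str, Optional[str]]:
    """Classification: assign each read to its best reference, or None if below threshold."""
    assignments: Dict[str, Optional[str]] = {}
    for read in reads:
        best_id, best_score = None, 0.0
        for ref_id, ref_seq in references.items():
            score = identity(read, ref_seq)
            if score > best_score:
                best_id, best_score = ref_id, score
        assignments[read] = best_id if best_score >= threshold else None
    return assignments


def de_novo_cluster(reads: List[str], threshold: float) -> Dict[str, List[str]]:
    """Clustering: greedily group reads around the first read seen for each cluster."""
    clusters: Dict[str, List[str]] = {}
    for read in reads:
        for centroid in clusters:
            if identity(read, centroid) >= threshold:
                clusters[centroid].append(read)
                break
        else:
            clusters[read] = [read]  # read founds a new OTU
    return clusters


reads = ["AAAAAAAAAA", "AAAAAAAAAT", "TTTTTTTTTT"]
references = {"ref1": "AAAAAAAAAA"}

print(closed_reference(reads, references, threshold=0.9))
print(de_novo_cluster(reads, threshold=0.9))
```

In the closed-reference run the third read is discarded because no reference resembles it, whereas clustering keeps it as its own OTU. An open-reference strategy would, roughly, classify first and then cluster whatever remains unassigned.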

34 QIIME for analysing amplicon sequencing data
QIIME (pronounced "chime") is a modular open-source bioinformatics pipeline for analysing microbial amplicon sequencing data
Homepage qiime.org contains documentation, tutorials and other resource material
Huge collection of scripts for many different analysis tasks

35 QIIME for analysing amplicon sequencing data

36 Profiling microbial communities by metagenomic shotgun sequencing
[Figure: reference genomes A, B and C (with regions marked X and Y) and short reads]

37 Indexing microbial pangenomes
NCBI isolate genomes: Archaea 300, Bacteria 12,926, Viruses 4,646, Eukaryota 2,177
Bags of protein-coding genes: 49.0 million total genes
Species pangenomes: 7,677, containing 18.6 million gene clusters
Core genes
Marker genes
Tools: RepoPhlAn, ChocoPhlAn
[Figure: gene families labelled I-V distributed across genomes and pangenomes]

38 MetaPhlAn: Metagenomic Phylogenetic Analysis
[Figure: reference genomes A, B and C (with regions marked X and Y) and short reads]

39 MetaPhlAn data, species x samples

40 Other software for taxonomic profiling
mOTU (metagenomic OTU)
MEGAN
Kraken

41 Two big questions of microbial community analysis Who is there? What are they doing?

42 Metagenomic analysis: molecular functions in biological roles
[Figure: phylum abundance and pathway abundance per subject across body sites: nares, skin, oral (BM), oral (SupP), oral (TD), gut, vaginal]

43 Metagenomic analysis: molecular functions in biological roles
Orthology: grouping genes by conserved sequence features
  COG, KO, FIGfam
Structure: grouping genes by similar protein domains
  Pfam, TIGRfam, SMART, EC
Biological roles: grouping genes by pathway and process involvement
  GO, KEGG, MetaCyc, SEED
Warnecke, 2007; Turnbaugh, 2009; DeLong, 2006

44 From reads to genes (HUMAnN2)
INPUT: quality-controlled metagenome (or metatranscriptome)
Rapidly identify species in the community with MetaPhlAn2
Nucleotide search: reads vs. pangenomes of identified species
Translated search: unclassified reads vs. non-redundant protein db
Isolate novel reads for external assembly

45 From reads to genes (HUMAnN2)
[Figure: HUMAnN2 workflow diagram; key: data input, analysis module, data product]
Quality-controlled RNA or DNA seq reads
Taxonomic profiling (MetaPhlAn 2): list of abundant organisms
Nucleotide-level pangenome mapping (Bowtie 2) against functionally annotated species pangenomes (ChocoPhlAn): organism-specific hits
Unmapped reads: organism-agnostic translated search (DIAMOND) against a universal protein reference database (UniRef): hits to protein families
HUMAnN core algorithms with pathway collection (MetaCyc)

46 Body site-specific signature pathways in the human microbiome
Note typically large abundance relative to other body sites
Note relatively small % of pathway copies unclassified
L-rhamnose degradation (RHAMCAT-PWY) emerged as a signature of the human gut microbiome across >900 first-visit HMP1-II metagenomes analyzed

47 Body site-specific signature pathways in the human microbiome
Max area: 2% relative abundance (other areas square-root scaled)
Signature for area $i$: $Q_1(\text{area}_i) > Q_3(\text{area}_j)$ for all $j \neq i$; very stringent!
50 total signature pathways across 4 major body areas
Values plotted = median (Q2) abundance for samples from that area
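
The signature criterion above (first quartile in one area exceeds the third quartile in every other area) is easy to express in code. The sketch below is an illustration with made-up abundances; the quartile convention (Python's "inclusive" method) and the function names are assumptions, since the slide does not say how quartiles were computed.

```python
import statistics
from typing import Dict, List, Tuple


def quartiles(values: List[float]) -> Tuple[float, float, float]:
    """Return (Q1, Q2, Q3) using the inclusive quantile method."""
    q1, q2, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return q1, q2, q3


def is_signature(abundance_by_area: Dict[str, List[float]], area: str) -> bool:
    """True if Q1 of `area` is greater than Q3 of every other area."""
    q1_target = quartiles(abundance_by_area[area])[0]
    return all(
        q1_target > quartiles(values)[2]
        for other, values in abundance_by_area.items()
        if other != area
    )


# Hypothetical relative abundances of one pathway in samples from three areas.
pathway_abundance = {
    "gut": [8.0, 9.0, 10.0, 11.0, 12.0],
    "oral": [1.0, 2.0, 3.0, 4.0, 5.0],
    "skin": [0.0, 1.0, 1.0, 2.0, 2.0],
}

for area in pathway_abundance:
    print(area, is_signature(pathway_abundance, area))
```

Requiring the whole interquartile range of one area to sit above those of all others is what makes the criterion stringent: overlapping distributions fail even when the medians differ.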

48 Which functions of microbiome are disrupted in IBD?
Over six times as many microbial metabolic processes disrupted in IBD as microbes
  If there's a transit strike, everyone driving a bus in Helsinki is disrupted, not everyone named Virtanen or Doe
Phylogenetic distribution of function is consistent but diffuse
During IBD, microbes...
  Stop
    Creating most amino acids
    Degrading complex carbs
    Producing short-chain fatty acids
  Start
    Taking up more host products
    Dodging the immune system
    Adhering to and invading host cells

49 Confounding effects in real world data
Biology is complicated, everything affects everything
Scientists cannot control everything; in observational cohorts they are not even trying to
Observed associations may be explained by confounding factors

50 Confounding effects in psychology
Classical example: drowning incidents and ice cream sales are highly positively correlated
Explanations
  Possibility #1: People drowning causes other people to purchase ice cream
  Possibility #2: Purchasing ice cream causes people to drown
  Possibility #3: There is a third variable (confounding variable) that causes the increase in both ice cream sales and drowning incidents
The weather confounds the relationship between ice cream sales and drowning incidents
Confounding variables are common in microbiome studies
  Lots of environmental factors affect the gut microbiome

51 Solution #1: post hoc checking of results
Consumption of vegetables is correlated with species X
Check if any other collected metadata (information about the study subjects) is correlated or associated with the consumption of vegetables
  No: you did not see any confounding factors, but there still might be some
  Yes: can you stratify your analyses to further confirm the finding?
    E.g.: females consume more vegetables and have more species X
    Does the correlation hold within females only and within males only?
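
The stratification check described above can be sketched as computing the correlation once over all subjects and once within each stratum. The records below are fabricated purely to illustrate the mechanics (they are not DIABIMMUNE or HMP data), and the field and function names are hypothetical.

```python
import math
from typing import Dict, List


def pearson(x: List[float], y: List[float]) -> float:
    """Pearson correlation coefficient of two equal-length lists."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def stratified_correlation(
    records: List[dict], x_key: str, y_key: str, stratum_key: str
) -> Dict[str, float]:
    """Pearson correlation of x and y computed separately within each stratum."""
    strata: Dict[str, List[dict]] = {}
    for record in records:
        strata.setdefault(record[stratum_key], []).append(record)
    return {
        stratum: pearson([r[x_key] for r in rows], [r[y_key] for r in rows])
        for stratum, rows in strata.items()
    }


# Toy data: females eat more vegetables AND carry more species X,
# but within each sex vegetables and species X are unrelated.
records = [
    {"sex": "F", "vegetables": 4, "species_x": 0.30},
    {"sex": "F", "vegetables": 5, "species_x": 0.31},
    {"sex": "F", "vegetables": 6, "species_x": 0.30},
    {"sex": "M", "vegetables": 1, "species_x": 0.10},
    {"sex": "M", "vegetables": 2, "species_x": 0.11},
    {"sex": "M", "vegetables": 3, "species_x": 0.10},
]

overall = pearson(
    [r["vegetables"] for r in records], [r["species_x"] for r in records]
)
within = stratified_correlation(records, "vegetables", "species_x", "sex")
print(overall)  # strongly positive
print(within)   # close to zero within each sex
```

Here the pooled correlation is driven entirely by the difference between the sexes, so it disappears once the analysis is stratified.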

52 Solution #2: Design and conduct a controlled experiment
Consumption of vegetables is correlated with species X
Design an experiment where subjects are randomly assigned to consume 1) a lot of vegetables, or 2) no vegetables
Control known confounders
  E.g. both groups contain the same number of males and females

53 Solution #3: Statistical modeling
Test if the correlation / association holds after correcting for the confounding effects statistically
Linear models are easy to understand and computationally cheap
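
A minimal sketch of such a correction is an ordinary least squares fit with and without the suspected confounder as a covariate. The solver below uses the normal equations with Gaussian elimination so that the example has no dependencies; in practice one would use a statistics package. The data are made up so that species X depends on sex only, and all names are hypothetical.

```python
from typing import List


def solve(a: List[List[float]], b: List[float]) -> List[float]:
    """Solve the linear system a * x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("design matrix is singular (collinear predictors)")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (m[i][n] - sum(m[i][j] * x[j] for j in range(i + 1, n))) / m[i][i]
    return x


def ols(predictors: List[List[float]], y: List[float]) -> List[float]:
    """Least-squares coefficients [intercept, b1, b2, ...] via the normal equations."""
    design = [[1.0] + list(row) for row in predictors]
    k = len(design[0])
    xtx = [[sum(d[i] * d[j] for d in design) for j in range(k)] for i in range(k)]
    xty = [sum(d[i] * yi for d, yi in zip(design, y)) for i in range(k)]
    return solve(xtx, xty)


# Toy data: species X abundance is 0.1 for males (sex = 0) and 0.3 for females (sex = 1);
# vegetable consumption is higher in females but has no effect of its own.
vegetables = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
sex = [0.0, 0.0, 0.0, 1.0, 1.0, 1.0]
species_x = [0.1, 0.1, 0.1, 0.3, 0.3, 0.3]

unadjusted = ols([[v] for v in vegetables], species_x)
adjusted = ols([[v, s] for v, s in zip(vegetables, sex)], species_x)
print(unadjusted)  # [intercept, vegetables]: positive vegetable slope
print(adjusted)    # [intercept, vegetables, sex]: vegetable slope near zero
```

Once sex is included in the model, the coefficient for vegetables shrinks to about zero and the effect is attributed to sex instead.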

54 Lipid A biosynthesis in DIABIMMUNE infants

55 Typical microbiome community analysis tasks
[Figure: workflow diagram; recoverable labels: Metagenomic data, 16S data, Stats]


Microbiomics I August 24th, Introduction. Robert Kraaij, PhD Erasmus MC, Internal Medicine

Microbiomics I August 24th, Introduction. Robert Kraaij, PhD Erasmus MC, Internal Medicine Microbiomics I August 24th, 2017 Introduction Robert Kraaij, PhD Erasmus MC, Internal Medicine r.kraaij@erasmusmc.nl Welcome to Microbiomics I Infection & Immunity MSc students Only first day no practicals

More information

Introduction to taxonomic analysis of metagenomic amplicon and shotgun data with QIIME. Peter Sterk EBI Metagenomics Course 2014

Introduction to taxonomic analysis of metagenomic amplicon and shotgun data with QIIME. Peter Sterk EBI Metagenomics Course 2014 Introduction to taxonomic analysis of metagenomic amplicon and shotgun data with QIIME Peter Sterk EBI Metagenomics Course 2014 1 Taxonomic analysis using next-generation sequencing Objective we want to

More information

Microbiome: Metagenomics 4/4/2018

Microbiome: Metagenomics 4/4/2018 Microbiome: Metagenomics 4/4/2018 metagenomics is an extension of many things you have already learned! Genomics used to be computationally difficult, and now that s metagenomics! Still developing tools/algorithms

More information

Carl Woese. Used 16S rrna to develop a method to Identify any bacterium, and discovered a novel domain of life

Carl Woese. Used 16S rrna to develop a method to Identify any bacterium, and discovered a novel domain of life METAGENOMICS Carl Woese Used 16S rrna to develop a method to Identify any bacterium, and discovered a novel domain of life His amazing discovery, coupled with his solitary behaviour, made many contemporary

More information

Carl Woese. Used 16S rrna to developed a method to Identify any bacterium, and discovered a novel domain of life

Carl Woese. Used 16S rrna to developed a method to Identify any bacterium, and discovered a novel domain of life METAGENOMICS Carl Woese Used 16S rrna to developed a method to Identify any bacterium, and discovered a novel domain of life His amazing discovery, coupled with his solitary behaviour, made many contemporary

More information

Functional profiling with HUMAnN2

Functional profiling with HUMAnN2 Eric Franzosa Jason Lloyd-Price Functional profiling with HUMAnN2 Curtis Huttenhower (chuttenh@hsph.harvard.edu) Galeb Abu-Ali (gabuali@hsph.harvard.edu) Ali Rahnavard (rah@broadinstitute.org) Harvard

More information

Microbiomes and metabolomes

Microbiomes and metabolomes Microbiomes and metabolomes Michael Inouye Baker Heart and Diabetes Institute Univ of Melbourne / Monash Univ Summer Institute in Statistical Genetics 2017 Integrative Genomics Module Seattle @minouye271

More information

Introduction to metagenomic analysis

Introduction to metagenomic analysis Introduction to metagenomic analysis Eric A. Franzosa, Ph.D. Galeb Abu-Ali, Ph.D. Harvard University CFAR Workshop on Metagenomics and Transcriptomics 16 September 2014 Huttenhower Research Group Harvard

More information

Metagenomics Computational Genomics

Metagenomics Computational Genomics Metagenomics 02-710 Computational Genomics Metagenomics Investigation of the microbes that inhabit oceans, soils, and the human body, etc. with sequencing technologies Cooperative interactions between

More information

Human Microbiome Project: First Map of the World Within Us. Hsin-Jung Joyce Wu "Microbiota and man: the story about us

Human Microbiome Project: First Map of the World Within Us. Hsin-Jung Joyce Wu Microbiota and man: the story about us Human Microbiome Project: First Map of the World Within Us Immune disorders: The new epidemic Gut microbiota: health and disease Disease Health Human Microbiome Project: The concept of superorganism :

More information

What is metagenomics?

What is metagenomics? Metagenomics What is metagenomics? Term first used in 1998 by Jo Handelsman "the application of modern genomics techniques to the study of communities of microbial organisms directly in their natural environments,

More information

Nature Biotechnology: doi: /nbt Supplementary Figure 1. MBQC base beta diversity, major protocol variables, and taxonomic profiles.

Nature Biotechnology: doi: /nbt Supplementary Figure 1. MBQC base beta diversity, major protocol variables, and taxonomic profiles. Supplementary Figure 1 MBQC base beta diversity, major protocol variables, and taxonomic profiles. A) Multidimensional scaling of MBQC sample Bray-Curtis dissimilarities (see Fig. 1). Labels indicate centroids

More information

CBC Data Therapy. Metagenomics Discussion

CBC Data Therapy. Metagenomics Discussion CBC Data Therapy Metagenomics Discussion General Workflow Microbial sample Generate Metaomic data Process data (QC, etc.) Analysis Marker Genes Extract DNA Amplify with targeted primers Filter errors,

More information

TECHNIQUES FOR STUDYING METAGENOME DATASETS METAGENOMES TO SYSTEMS.

TECHNIQUES FOR STUDYING METAGENOME DATASETS METAGENOMES TO SYSTEMS. TECHNIQUES FOR STUDYING METAGENOME DATASETS METAGENOMES TO SYSTEMS. Ian Jeffery I.Jeffery@ucc.ie What is metagenomics Metagenomics is the study of genetic material recovered directly from environmental

More information

Bioinformatics for Microbial Biology

Bioinformatics for Microbial Biology Bioinformatics for Microbial Biology Chaochun Wei ( 韦朝春 ) ccwei@sjtu.edu.cn http://cbb.sjtu.edu.cn/~ccwei Fall 2013 1 Outline Part I: Visualization tools for microbial genomes Tools: Gbrowser Part II:

More information

Jianguo (Jeff) Xia, Assistant Professor McGill University, Quebec Canada June 26, 2017

Jianguo (Jeff) Xia, Assistant Professor McGill University, Quebec Canada   June 26, 2017 Jianguo (Jeff) Xia, Assistant Professor McGill University, Quebec Canada jeff.xia@mcgill.ca www.xialab.ca June 26, 2017 Metabolomics http://metaboanalyst.ca Systems transcriptomics http://networkanalyst.ca

More information

Microbiota and What the Clinical Gastroenterologist Needs to Know

Microbiota and What the Clinical Gastroenterologist Needs to Know Microbiota and What the Clinical Gastroenterologist Needs to Know Co-Speakers: Premysl Bercik and Michael Surette, Farncombe Family Digestive Health Research Institute McMaster University Small Group Session:

More information

ngs metagenomics target variation amplicon bioinformatics diagnostics dna trio indel high-throughput gene structural variation ChIP-seq mendelian

ngs metagenomics target variation amplicon bioinformatics diagnostics dna trio indel high-throughput gene structural variation ChIP-seq mendelian Metagenomics T TM storage genetics assembly ncrna custom genotyping RNA-seq de novo mendelian ChIP-seq exome genomics indel ngs trio prediction metagenomics SNP resequencing bioinformatics diagnostics

More information

OMNIgene GUT stabilizes the microbiome profile at ambient temperature for 60 days and during transport

OMNIgene GUT stabilizes the microbiome profile at ambient temperature for 60 days and during transport OMNIgene GUT stabilizes the microbiome profile at ambient temperature for 60 days and during transport Evgueni Doukhanine, Anne Bouevitch, Ashlee Brown, Jessica Gage LaVecchia, Carlos Merino and Lindsay

More information

NGS part 2: applications. Tobias Österlund

NGS part 2: applications. Tobias Österlund NGS part 2: applications Tobias Österlund tobiaso@chalmers.se NGS part of the course Week 4 Friday 13/2 15.15-17.00 NGS lecture 1: Introduction to NGS, alignment, assembly Week 6 Thursday 26/2 08.00-09.45

More information

An introduction into 16S rrna gene sequencing analysis. Stefan Boers

An introduction into 16S rrna gene sequencing analysis. Stefan Boers An introduction into 16S rrna gene sequencing analysis Stefan Boers Microbiome, microbiota or metagenomics? Microbiome The entire habitat, including the microorganisms, their genomes (i.e., genes) and

More information

THE HUMAN MICROBIOME: RECENT DISCOVERIES AND APPLICATIONS TO MEDICINE

THE HUMAN MICROBIOME: RECENT DISCOVERIES AND APPLICATIONS TO MEDICINE THE HUMAN MICROBIOME: RECENT DISCOVERIES AND APPLICATIONS TO MEDICINE American Society for Clinical Laboratory Science April 21, 2017 Richard A. Van Enk, Ph.D., CIC FSHEA Director, Infection Prevention

More information

Infectious Disease Omics

Infectious Disease Omics Infectious Disease Omics Metagenomics Ernest Diez Benavente LSHTM ernest.diezbenavente@lshtm.ac.uk Course outline What is metagenomics? In situ, culture-free genomic characterization of the taxonomic and

More information

HMP Data Set Documentation

HMP Data Set Documentation HMP Data Set Documentation Introduction This document provides detail about files available via the DACC website. The goal of the HMP consortium is to make the metagenomics sequence data generated by the

More information

Applications of Next Generation Sequencing in Metagenomics Studies

Applications of Next Generation Sequencing in Metagenomics Studies Applications of Next Generation Sequencing in Metagenomics Studies Francesca Rizzo, PhD Genomix4life Laboratory of Molecular Medicine and Genomics Department of Medicine and Surgery University of Salerno

More information

Experimental Design Microbial Sequencing

Experimental Design Microbial Sequencing Experimental Design Microbial Sequencing Matthew L. Settles Genome Center Bioinformatics Core University of California, Davis settles@ucdavis.edu; bioinformatics.core@ucdavis.edu General rules for preparing

More information

MUSiCC: a marker genes based framework for metagenomic normalization and accurate profiling of gene abundances in the microbiome

MUSiCC: a marker genes based framework for metagenomic normalization and accurate profiling of gene abundances in the microbiome Manor and Borenstein Genome Biology (2015) 16:53 DOI 10.1186/s13059-015-0610-8 RESEARCH Open Access MUSiCC: a marker genes based framework for metagenomic normalization and accurate profiling of gene abundances

More information

Functional profiling of metagenomic short reads: How complex are complex microbial communities?

Functional profiling of metagenomic short reads: How complex are complex microbial communities? Functional profiling of metagenomic short reads: How complex are complex microbial communities? Rohita Sinha Senior Scientist (Bioinformatics), Viracor-Eurofins, Lee s summit, MO Understanding reality,

More information

At the age of big data sequencing, what's new about the naughty and efficient microbes within the WWTPs

At the age of big data sequencing, what's new about the naughty and efficient microbes within the WWTPs At the age of big data sequencing, what's new about the naughty and efficient microbes within the WWTPs Jean-Jacques Godon INRA, UR0050, Laboratoire de Biotechnologie de l Environnement, Narbonne, F-11100

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION SUPPLEMENTARY INFORMATION ARTICLE NUMBER: 16088 DOI: 10.1038/NMICROBIOL.2016.88 Species-function relationships shape ecological properties of the human gut microbiome Sara Vieira-Silva 1,2*, Gwen Falony

More information

Shantelle Claassen-Weitz Division of Medical Microbiology Department of Pathology

Shantelle Claassen-Weitz Division of Medical Microbiology Department of Pathology How important is sample collection and DNA/RNA extraction when profiling microbial communities Shantelle Claassen-Weitz Division of Medical Microbiology Department of Pathology tellafiela@gmail.com The

More information

dbcamplicons pipeline Amplicons

dbcamplicons pipeline Amplicons dbcamplicons pipeline Amplicons Matthew L. Settles Genome Center Bioinformatics Core University of California, Davis settles@ucdavis.edu; bioinformatics.core@ucdavis.edu Microbial community analysis Goal:

More information

Lecture 01: Overview of Metagenomics

Lecture 01: Overview of Metagenomics Lecture 01: Overview of Metagenomics 1 Culture Independent Techniques: Metagenomics Universal Gene census Shotgun Metagenome Sequencing Transcriptomics (shotgun mrna) Proteomics (protein fragments) Metabolomics

More information

Recent urbanization in China is correlated with a Westernized microbiome encoding increased virulence and antibiotic resistance genes

Recent urbanization in China is correlated with a Westernized microbiome encoding increased virulence and antibiotic resistance genes Winglee et al. Microbiome (2017) 5:121 DOI 10.1186/s40168-017-0338-7 RESEARCH Open Access Recent urbanization in China is correlated with a Westernized microbiome encoding increased virulence and antibiotic

More information

METAGENOMICS. Aina Maria Mas Calafell Genomics

METAGENOMICS. Aina Maria Mas Calafell Genomics METAGENOMICS Aina Maria Mas Calafell Genomics Introduction Microbial communities Primary role in biogeochemical systems Study of microbial communities 1.- Culture-based methodologies Only isolated microbes

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION SUPPLEMENTARY INFORMATION doi:10.1038/nature12212 Supplementary Discussion Contamination Assessment We evaluated the amount of human contamination in our viral DNA preparations by identifying sequences

More information

Mini-Symposium MICROBIOTA. Free. Meet the speaker. 14. November :30 19:00 Bohnenkamp Haus. Everyone welcome. Sponsored by:

Mini-Symposium MICROBIOTA. Free. Meet the speaker. 14. November :30 19:00 Bohnenkamp Haus. Everyone welcome. Sponsored by: Mini-Symposium MICROBIOTA Free Meet the speaker Everyone welcome Sponsored by: 14. November 2017 13:30 19:00 Bohnenkamp Haus PROGRAM 13:30 14:30 Julia Vorholt The leaf microbiota: disassembling and rebuilding

More information

Human-microbe mutualism: stability and resilience in health and disease

Human-microbe mutualism: stability and resilience in health and disease Human-microbe mutualism: stability and resilience in health and disease David A. Relman, Stanford University IOM Forum on Microbial Threats March 7, 2012 Our extended self : human-microbe mutualism (Based

More information

Conducting Microbiome study, a How to guide

Conducting Microbiome study, a How to guide Conducting Microbiome study, a How to guide Sam Zhu Supervisor: Professor Margaret IP Joint Graduate Seminar Department of Microbiology 15 December 2015 Why study Microbiome? ü Essential component, e.g.

More information

Measuring the human gut microbiome: new tools and non alcoholic fatty liver disease

Measuring the human gut microbiome: new tools and non alcoholic fatty liver disease Western University Scholarship@Western Electronic Thesis and Dissertation Repository July 2016 Measuring the human gut microbiome: new tools and non alcoholic fatty liver disease Ruth G. Wong The University

More information

Functional annotation of metagenomes

Functional annotation of metagenomes Functional annotation of metagenomes Jeroen F. J. Laros Leiden Genome Technology Center Department of Human Genetics Center for Human and Clinical Genetics Introduction Functional analysis Objectives:

More information

Supplementary Figures

Supplementary Figures Supplementary Figures Supplementary Fig. S1 - Nationwide contributions of the most abundant genera. The figure shows log 10 of the relative percentage of genera, forming 80% of total abundance. (Russian

More information

CBC Data Therapy. Metatranscriptomics Discussion

CBC Data Therapy. Metatranscriptomics Discussion CBC Data Therapy Metatranscriptomics Discussion Metatranscriptomics Extract RNA, subtract rrna Sequence cdna QC Gene expression, function Institute for Systems Genomics: Computational Biology Core bioinformatics.uconn.edu

More information

Day 3. Examine gels from PCR. Learn about more molecular methods in microbial ecology

Day 3. Examine gels from PCR. Learn about more molecular methods in microbial ecology Day 3 Examine gels from PCR Learn about more molecular methods in microbial ecology Genes We Targeted 1: dsrab 1800bp 2: mcra 750bp 3: Bacteria 1450bp 4: Archaea 950bp 5: Archaea + 950bp 6: Negative control

More information

Next G eneration Generation Microbial Microbial Genomics : The H uman Human Microbiome P roject Project George Weinstock

Next G eneration Generation Microbial Microbial Genomics : The H uman Human Microbiome P roject Project George Weinstock Next Generation Microbial Genomics: The Human Microbiome Project George Weinstock San Rocco: Protector from Infectious Diseases Large genome centers All have metagenomics programs Baylor College of Medicine

More information

MB 668 Microbial Bioinformatics and Genome Evolution. 4 credits Spring, 2017

MB 668 Microbial Bioinformatics and Genome Evolution. 4 credits Spring, 2017 MB 668 Microbial Bioinformatics and Genome Evolution 4 credits Spring, 2017 Instructors: T. Sharpton, R. Mueller and S. Giovannoni Thomas Sharpton (Microbiology): thomas.sharpton@oregonstate.edu Office

More information

Name: Ally Bonney. Date: January 29, 2015 February 24, Purpose

Name: Ally Bonney. Date: January 29, 2015 February 24, Purpose Name: Ally Bonney Title: Genome sequencing and annotation of Pseudomonas veronii isolated from Oregon State University soil and 16S rrna characterization of Corvallis, OR soil microbial populations Date:

More information

dbcamplicons pipeline Amplicons

dbcamplicons pipeline Amplicons dbcamplicons pipeline Amplicons Matthew L. Settles Genome Center Bioinformatics Core University of California, Davis settles@ucdavis.edu; bioinformatics.core@ucdavis.edu Microbial community analysis Goal:

More information

Sequencing Errors, Diversity Estimates, and the Rare Biosphere

Sequencing Errors, Diversity Estimates, and the Rare Biosphere Sequencing Errors, Diversity Estimates, and the Rare Biosphere or Living in the shadow of Errares Susan Huse Marine Biological Laboratory June 13, 2012 Consistent Community Profile across samples and environments

More information

I AM NOT A METAGENOMIC EXPERT. I am merely the MESSENGER. Blaise T.F. Alako, PhD EBI Ambassador

blaise@ebi.ac.uk Hubert Denise Alex Mitchell Peter Sterk Sarah Hunter http://www.ebi.ac.uk/metagenomics Blaise

ST 591: Introduction to Quantitative Genomics Syllabus

General Information Instructor: Thomas Sharpton Email: thomas.sharpton@oregonstate.edu Office: 530 Nash Hall Phone: (541) 737-8623 Office Hours: TBD Teaching Assistant: TBD Course credits: 3 Class meetings:

Chapter 12: Human Microbiome Analysis

Education Xochitl C. Morgan 1, Curtis Huttenhower 1,2 * 1 Department of Biostatistics, Harvard School of Public Health, Boston, Massachusetts, United States of America,

Integrating Evolutionary, Ecological and Statistical Approaches to Metagenomics. A proposal to the Gordon and Betty Moore Foundation

Jonathan A. Eisen University of California, Davis U. C. Davis Genome

Supplementary Information for

Microbial community dynamics and stability during an ammonia-induced shift to syntrophic acetate oxidation Jeffrey J. Werner 1,2, Marcelo L. Garcia 3, Sarah D. Perkins 3,

Customized Phage Therapies To Eradicate Harmful Bacteria In Chronic Diseases. Europe Microbiome Congress London, 14 Nov., 2018

Biomx At A Glance We are a microbiome drug discovery company developing customized

Chapter 7. Motif finding (week 11) Chapter 8. Sequence binning (week 11)

Course organization Introduction (Week 1) Part I: Algorithms for Sequence Analysis (Week 1-11) Chapter 1-3, Models and theories » Probability theory and Statistics (Week 2) » Algorithm complexity analysis

Computing for Metagenome Analysis

New Horizons of Computational Science with Heterogeneous Many-Core Processors National Institute of Genetics Hiroshi Mori & Ken Kurokawa Contents Metagenome Sequence similarity

Comparative genomics of clinical isolates of Pseudomonas fluorescens, including the discovery of a novel disease-associated subclade.

by Brittan Starr Scales A dissertation submitted in partial fulfillment

Practical Bioinformatics for Life Scientists. Week 14, Lecture 27. István Albert Bioinformatics Consulting Center Penn State

No homework this week Project to be given out next Thursday (Dec 1st) Due following

Advisors: Prof. Louis T. Oliphant Computer Science Department, Hiram College.

Author: Sulochana Bramhacharya Affiliation: Hiram College, Hiram OH. Address: P.O.B 1257 Hiram, OH 44234 Email: bramhacharyas1@my.hiram.edu ACM number: 8983027 Category: Undergraduate research Advisors:

Introduction to Bioinformatics

Alla L Lapidus, Ph.D. SPbSU St. Petersburg Term Bioinformatics was invented by Paulien Hogeweg (Полина Хогевег) and Ben Hesper in 1970 as "the study of

Microbiome Analysis. Research Day 2012 Ranjit Kumar

Human Microbiome Microorganisms Bad or good? Human colon contains up to 100 trillion bacteria. Human microbiome - The community of bacteria that live

Microbially Mediated Plant Salt Tolerance and Microbiome based Solutions for Saline Agriculture

Contents Introduction Abiotic Tolerance Approaches Reasons for failure Roots, microorganisms and soil-interaction

Lecture 8: Predicting and analyzing metagenomic composition from 16S survey data

What can we tell about the taxonomic and functional stability of microbiota? Why? Nature. 2012; 486(7402): 207-214. doi:10.1038/nature11234

Metagenomic Analysis in Human-Associated Projects

Daniel H. Huson Singapore Center for Environmental Life Science Engineering (SCELSE) ZBIT Center for Bioinformatics

Robert Edgar. Independent scientist

robert@drive5.com www.drive5.com Reads FASTQ format Millions of reads Many Gb USEARCH commands "UPARSE pipeline" OTU sequences FASTA format >Otu1 GATTAGCTCATTCGTA >Otu2

Metagenomic species profiling using universal phylogenetic marker genes

Shinichi Sunagawa, Daniel R. Mende, Georg Zeller, Fernando Izquierdo-Carrasco, Simon A. Berger, Jens Roat Kultima, Luis Pedro Coelho,

Metagenomics of the Human Intestinal Tract

http://www.metahit.eu This presentation is licensed under the Creative Commons Attribution 3.0 Unported License available at http://creativecommons.org/licenses/by/3.0/

Lecture 8: Predicting metagenomic composition from 16S survey data

Taxonomic and functional stability of microbiota Nature. 2012; 486(7402): 207-214. doi:10.1038/nature11234 A model of functional

A proposal to the Gordon and Betty Moore Foundation

INTEGRATING EVOLUTIONARY, ECOLOGICAL AND STATISTICAL APPROACHES TO METAGENOMICS Jonathan A. Eisen University of California, Davis U. C. Davis Genome

Molecular Evolution and Ecology. Martin Polz

mpolz@mit.edu Overview I. Molecular evolution 1. History of life on Earth 2. Genes as chronometers 3. Tree of life II. Molecular ecology 1. Prokaryotic abundance

Strategies for Analysis of Microbial Population

Finding Biology in the Human Microbiome. George Weinstock

What's next for the Human Microbiome? George Weinstock Metagenomics Unfolds You are here Setting Up Descriptive Phase Hypothesis Testing Metagenomics

IMPACT OF DIET ON MICROBIOTA COMPOSITION AND FUNCTION IN THE SMALL INTESTINE Microbiome Drug Development Summit

Microbiome Drug Development Summit 2017, 29-06-2017 Els van Hoffen Senior scientist and project manager Nutrition & Health els.vanhoffen@nizo.com

Introduction to QIIME on the IPython Notebook

Strategies and Techniques for Analyzing Microbial Population Structures Rob Knight Adam Robbins-Pianka Will Van Treuren Yoshiki Vázquez-Baeza (@yosmark)

Following text taken from Suresh Kumar. Bioinformatics Web - Comprehensive educational resource on Bioinformatics. 6th May 2005

Bioinformatics is the recording, annotation, storage, analysis, and searching/retrieval of nucleic acid sequence (genes and RNAs), protein sequence and structural information. This includes databases of

Methods for comparing multiple microbial communities. James Robert White, October 1st, 2007

James Robert White, whitej@umd.edu Advisor: Mihai Pop, mpop@umiacs.umd.edu October 1st, 2007 Abstract We propose the development of new software to

CONSIDERING THE MICROBIOME AS PART OF FUTURE MEDICINE AND NUTRITION STRATEGIES: Challenges and proposed answers

Bruxelles Workshop The Microbiome, Diet and Health: Assessing Gaps in Science and Innovation

MICROBIOMICS Current and future tools of the trade

Ingeborg Klymiuk Core Facility Molecular Biology ZMF - CENTER FOR MEDICAL RESEARCH Medical University Graz MICROBIOMICS DEFINITION OF OMIC TECHNOLOGIES

BIOINFORMATICS TO ANALYZE AND COMPARE GENOMES

We sequenced and assembled a genome, but this is only a long stretch of ATCG. What should we do now? 1. find genes What are the starting and end points for

Genetics and Bioinformatics

Kristel Van Steen, PhD 2 Montefiore Institute - Systems and Modeling GIGA - Bioinformatics ULg kristel.vansteen@ulg.ac.be Lecture 1: Setting the pace 1 Bioinformatics what's

Nagahama Institute of Bio-Science and Technology. National Institute of Genetics and SOKENDAI Nagahama Institute of Bio-Science and Technology

A Large-scale Batch-learning Self-organizing Map for Function Prediction of Poorly-characterized Proteins Progressively Accumulating in Sequence Databases Project Representative Toshimichi Ikemura Authors

Supplementary Note 1. Description of the main MetaPhlAn2 additions compared to MetaPhlAn1

MetaPhlAn2 for enhanced metagenomic taxonomic profiling Duy Tin Truong 1, Eric Franzosa 2,3, Timothy L. Tickle 2,3, Matthias Scholz 1, George Weingart 2, Edoardo Pasolli 1, Adrian Tett 1, Curtis Huttenhower

Microbial Community Assembly and Dynamics: from AMD biofilms to colonization of the premature infant gut

Jill Banfield Talk Overview i) AMD microbial biofilms: an example of reproducible community assembly

Next Generation Sequencing. Tobias Österlund

tobiaso@chalmers.se NGS part of the course Week 4 Friday 13/2 15.15-17.00 NGS lecture 1: Introduction to NGS, alignment, assembly Week 6 Thursday 26/2 08.00-09.45

Introduction to Microbiome Omics Technologies

BICF Education Monthly Topics in Bioinformatics and Genomics https://portal.biohpc.swmed.edu/content/training/ BICF Astrocyte Workflows in Sequence Variation, RNASeq, ChipSeq, CRISPR BICF Data Resources

Human Microbiome Project: A Community Resource. Lita M. Proctor, Ph.D. Coordinator, Human Microbiome Project NHGRI/NIH

2016 HIV Microbiome Workshop November 17, 2016 Human Microbiome Project: 2007 to

Parts of a standard FastQC report

FastQC, written by Simon Andrews of Babraham Bioinformatics, is a very popular tool used to provide an overview of basic quality control metrics for raw next generation sequencing data. There are

Day 3. Examine gels from PCR. Learn about more molecular methods in microbial ecology

1: dsrAB 1800 bp 2: mcrA 750 bp 3: Bacteria 1450 bp 4: Archaea 950 bp 5: Archaea + 950 bp 6: Negative control Genes We Targeted

The human microbiome and cancer: New opportunities for population studies

Emily Vogtmann, PhD, MPH Research Fellow Metabolic Epidemiology Branch Division of Cancer Epidemiology & Genetics National Cancer

Bioinformatic tools for metagenomic data analysis

MEGAN - BLAST-based tool for exploring taxonomic content MG-RAST (SEED, FIG) - rapid annotation of metagenomic data, phylogenetic classification and metabolic

Leonardo Mariño-Ramírez, PhD NCBI / NLM / NIH. BIOL 7210 A Computational Genomics 2/18/2015

The $1,000 genome is here! http://www.illumina.com/systems/hiseq-x-sequencing-system.ilmn Bioinformatics bottleneck

Introduction to OTU Clustering. Susan Huse August 4, 2016

What is an OTU? Operational Taxonomic Units a.k.a. phylotypes a.k.a. clusters aggregations of reads based only on sequence similarity, independent

Analysis of milk microbial profiles using 16S rRNA gene sequencing in milk somatic cells and fat

Juan F. Medrano Anna Cuzco* Alma Islas-Trejo Armand Sanchez* Olga Francino* Dept. of Animal Science University

Day 3. Examine gels from PCR. Learn about more molecular methods in microbial ecology. Tour the Bay Paul Center Keck Sequencing Facility

1: dsrAB 1800 bp 2: mcrA 750 bp 3: Bacteria 1450 bp 4: Archaea 950 bp 5:

SUPPLEMENTARY INFORMATION

doi:10.1038/nature09944 Supplementary Figure 1. Establishing DNA sequence similarity thresholds for phylum and genus levels Sequence similarity distributions of pairwise alignments of 40 universal single

Functional analysis using EBI Metagenomics

Contents: Tutorial information; Tutorial learning objectives; An introduction to functional analysis using EMG; What are protein signatures?; Assigning

The virome of the human gut: metagenomic analysis of changes associated with diet

James Lewis Gary Wu Frederic Bushman Diet, Genetic Factors, and the Gut Microbiome in Crohn's Disease University of Pennsylvania

CS262 Lecture 12 Notes Single Cell Sequencing Jan. 11, 2016

Background A typical human cell consists of ~6 billion base pairs of DNA and ~600 million bases of mRNA. It is time-consuming and expensive to

DNA. bioinformatics. genomics. personalized. variation NGS. trio. custom. assembly gene. tumor-normal. de novo. structural variation indel.

DNA Sequencing variation DNA amplicon mendelian trio genomics NGS bioinformatics tumor-normal custom SNP resequencing target validation de novo prediction personalized comparative genomics exome private