Introduc)on to QIIME on the IPython Notebook

Size: px
Start display at page:

Download "Introduc)on to QIIME on the IPython Notebook"

Transcription

1 Strategies and Techniques for Analyzing Microbial Population Structures Introduc)on to QIIME on the IPython Notebook Rob Knight Adam Robbins- Pianka Will Van Treuren Yoshiki Vázquez- Baeza ) Luke Ursell

2 A microbe dominated world The universal nature of biochemistry. Pace NR. Proc Natl Acad Sci U S A Jan 30;98(3):805-8.

3 Vast microbial diversity in every question: ecosystem, how including human our are own we? Human: 10 trillion human cells 20,000 human genes Microbiota: 100 trillion microbial cells Microbiota: 2-20 million microbial genes 99.9% of our genomes the same, but our microbes...?

4 How do we assay this diversity?

5 Sequencing output (454, Illumina, Sanger) fastq, fasta, qual, or sff/trace files Metadata mapping file Pre-processing e.g., remove primer(s), demultiplex, quality filter OTU (or other sample by observation) table Phylogenetic Tree Evolutionary relationship between OTUs Denoise 454 Data Database Submission α-diversity and rarefaction β-diversity and rarefaction PyroNoise, Denoiser (In development) e.g., Phylogenetic Diversity, Chao1, Observed Species e.g., Weighted and unweighted UniFrac, Bray- Curtis, Jaccard Pick OTUs and representative sequences Reference based BLAST, UCLUST, USEARCH De novo e.g., UCLUST, CD-HIT, MOTHUR, USEARCH Interactive visualizations e.g., PCoA plots, distance histograms, taxonomy charts, rarefaction plots, network visualization, jackknifed hierarchical clustering. Assign taxonomy BLAST, RDP Classifier Align sequences e.g., PyNAST, INFERNAL, MUSCLE, MAFFT Legend Currently supported for marker-gene data only Currently supported for general sample by observation data Build 'OTU table' i.e., sample by observation matrix Build phylogenetic tree e.g., FastTree, RAxML, ClearCut (i.e., 'upstream' step) Required step or input (i.e., 'downstream' step) Optional step or input

6 Samples to sequences Sequencing output (454, Illumina, Sanger) fastq, fasta, qual, or sff/trace files Metadata mapping file Pre-processing e.g., remove primer(s), demultiplex, quality filter Denoise 454 Data PyroNoise, Denoiser Database Submission (In development)

7 Error- correczng codes allow mulzplex sequencing >GCACCTGAGGACAGGCATGAGGAA >GCACCTGAGGACAGGGGAGGAGGA >TCACATGAACCTAGGCAGGACGAA >CTACCGGAGGACAGGCATGAGGAT >TCACATGAACCTAGGCAGGAGGAA >GCACCTGAGGACACGCAGGACGAC >CTACCGGAGGACAGGCAGGAGGAA >CTACCGGAGGACACACAGGAGGAA >GAACCTTCACATAGGCAGGAGGAT >TCACATGAACCTAGGGGCAAGGAA >GCACCTGAGGACAGGCAGGAGGAA >PC.634_1 FLP3FBN01ELBSX CTGGGCCGTGTCTCAGTCCCAATGTGGCCGTTTACCCTCTCAGGCCGGCTAC GCATCATCGCCTTGGTGGGCCGTTACCTCACCAACTAGCTAATGCGCCGCAG GTCCATCCATGTTCACGCCTTGATGGGCGCTTTAATATACTGAGCATGCGCT CTGTATACCTATCCGGTTTTAGCTACCGTTTCCAGCAGTTATCCCGGACACA TGGGCTAGG! >PC.354_3 FLP3FBN01EEWKD! TTGGACCGTGTCTCAGTTCCAATGTGGGGGCCTTCCTCTCAGAACCCCTATC CATCGAAGGCTTGGTGGGCCGTTACCCCGCCAACAACCTAATGGAACGCATC CCCATCGATGACCGAAGTTCTTTAATAGTTCTACCATGCGGAAGAACTATGC CATCGGGTATTAATCTTTCTTTCGAAAGGCTATCCCCGAGTCATCGGCAGGT TGGATACGTGTTACTCACCCGTGCGCCGGT! Micah Hamady, et al., Nature Methods, Error- correczng barcodes for pyrosequencing hundreds of samples in mulzplex.

8 Sequences to OTUs and Phylogeny Pick OTUs and representative sequences Reference based BLAST, UCLUST, USEARCH Assign taxonomy BLAST, RDP Classifier De novo e.g., UCLUST, CD-HIT, MOTHUR, USEARCH Align sequences e.g., PyNAST, INFERNAL, MUSCLE, MAFFT e.g. p Build 'OTU table' i.e., sample by observation matrix Build phylogenetic tree e.g., FastTree, RAxML, ClearCut

9 OTU Picking de- novo Clustering Algorithm Clustered Sequences TTGGAAGATGTCTCAGTTCCAG! TTGGAAGATGTCTCAGTTCCAG! TTGGAAGATGTCTCAGTTCCAG! TTGGAAGATGTCTCAGTTCCAG! TTGGGCCGTATGTCAGTCCCTA! TTGGAAGATGTCTCAGTTCCAG! TTGGGCCGTATGTCAGTCCCTA Experimental Sequences OTU1! OTUS OTU2! OTU3!

10 OTU Picking Closed Reference Reference! Sequences TTGGAAGATGTCTCAGTTCCAG! TTGGGCCGTATGTCAGTCCCTA! TTGGAAGATGTCTCAGTTCCAG! TTGGGCCGTATGTCAGTCCCTA Sequences that hit a reference TTGGAAGATGTCTCAGTTCCAG! TTGGAAGATGTCTCAGTTCCAG! TTGGAAGATGTCTCAGTTCCAG! Sequences that failed to hit TTGGAAGATGTCTCAGTTCCAG! TTGGGCCGTATGTCAGTCCCTA! TTGGAAGATGTCTCAGTTCCAG! TTGGGCCGTATGTCAGTCCCTA Experimental Sequences OTUS OTU1! OTU1! OTU1!

11 OTU Picking Open Reference Reference! Sequences TTGGAAGATGTCTCAGTTCCAG! TTGGGCCGTATGTCAGTCCCTA! TTGGAAGATGTCTCAGTTCCAG! TTGGGCCGTATGTCAGTCCCTA Sequences that hit a reference TTGGAAGATGTCTCAGTTCCAG! TTGGAAGATGTCTCAGTTCCAG! TTGGAAGATGTCTCAGTTCCAG! Sequences that failed to hit TTGGAAGATGTCTCAGTTCCAG! TTGGGCCGTATGTCAGTCCCTA! TTGGAAGATGTCTCAGTTCCAG! TTGGGCCGTATGTCAGTCCCTA Experimental Sequences Clustering Algorithm OTU4! OTU5! OTU6! OTU1! OTUS OTU2! OTU3!

12 CompuZng alpha and beta diversity OTU (or other sample by observation) table Phylogenetic Tree Evolutionary relationship between OTUs α-diversity and rarefaction e.g., Phylogenetic Diversity, Chao1, Observed Species β-diversity and rarefaction e.g., Weighted and unweighted UniFrac, Bray- Curtis, Jaccard

13 Comparing microbial communizes Who s there? How many are are there? α (i.e., within sample) diversity How similar are any two samples? Treatments? β (i.e., between sample) diversity

14 PhylogeneZc Diversity (PD): a qualitazve, phylogenezc α- diversity metric Sum of branch length covered by a sample Faith DP (1992) ConservaZon evaluazon and phylogenezc diversity. Biological ConservaZon. 61:1-10.

15 Unweighted UniFrac: a qualitazve, phylogenezc β- diversity metric IdenZcal communizes D = 0.0 Related communizes D ~ 0.5 Unrelated communizes D = 1.0 Percent of observed branch length that is unique to either sample Lozupone and Knight, 2005, Appl Environ Microbiol 71:8228

16 Clustering by UniFrac distance

17 Extract DNA and amplify marker gene with barcoded primers Pool amplicons and sequence >GCACCTGAGGACAGGCATGAGGAA >GCACCTGAGGACAGGGGAGGAGGA >TCACATGAACCTAGGCAGGACGAA >CTACCGGAGGACAGGCATGAGGAT >TCACATGAACCTAGGCAGGAGGAA >GCACCTGAGGACACGCAGGACGAC >CTACCGGAGGACAGGCAGGAGGAA >CTACCGGAGGACACACAGGAGGAA >GAACCTTCACATAGGCAGGAGGAT >TCACATGAACCTAGGGGCAAGGAA >GCACCTGAGGACAGGCAGGAGGAA Assign reads to samples RefSeq 1 RefSeq 2 RefSeq 3 RefSeq 4 RefSeq 5 RefSeq 6 RefSeq 7 RefSeq 8 RefSeq 9 RefSeq 10 Assign millions of sequences from thousands of samples to OTUs Compute UniFrac distances and compare samples

18 Key QIIME files Mapping file: per sample meta- data, user- defined OTU table: sample x OTU matrix, central to downstream analyses [now in biom format] Parameters file: defines analyses, for use with the workflow scripts (opzonal)

19 Parameters Can Be Set In a Few Ways qiime_config files Environment Variable $QIIME_CONFIG_FP User s home directory Parameter files Command line

20 Mapping file

21 Mapping file: always run check_id_map.py! = required field

22 OTU table (classic format) sample x OTU matrix

23 OTU table (classic format) sample x OTU matrix OTU idenzfiers

24 OTU table (classic format) sample x OTU matrix Sample idenzfiers

25 OTU table (classic format) sample x OTU matrix OpZonal per OTU taxonomic informazon

26 OTU tables are now in biological observazon matrix (.biom) format (QIIME dev and later) Google: biom format hsp://biom- format.org See convert_biom.py for translazng between classic and biom otu tables

27 sample x observa/on con/ngency matrix OTUs Samples Observa/on counts

28 sample x observa/on con/ngency matrix Functions Metagenomes Observa/on counts

29 sample x observa/on con/ngency matrix Samples Genomes Samples OTUs Marker gene (e.g., 16S) surveys Ortholog groups ComparaZve genomics Taxa Marker gene (e.g., 16S) surveys Functions Metagenomes Metagenomics Metabolites Samples Metabolomics... Metatranscriptomics

30 The Biological ObservaZon Matrix (BIOM) Format or: How I Learned To Stop Worrying and Love the Ome- ome JSON- based format for represenzng arbitrary sample x observazon conzngency tables with opzonal metadata McDonald et al., GigaScience (2012). hsp:// format.org

31 Running QIIME NaZve installazon on Mac (OS X) or Linux From laptops to 16,000+ core compute cluster qiime- deploy Ubuntu Virtual Box Cloud- based installazons hsp://ncar.janus.rc.colorado.edu/

32 Amazon ElasZc Compute Cloud (EC2)

33 Moving Pictures of the Human Microbiome Two subjects sampled daily, one for six months, one for 18 months Four body sites: tongue, palm of le{ hand, palm of right hand, and gut (via fecal swabs). Caporaso JG et al. (2011) Moving pictures of the human microbiome. Genome biology 12: R50.

34 Moving Pictures of the Human Microbiome InvesZgate the relazve temporal variability of body sites. Is there a temporal core microbiome? Technical points: do we observe the same conclusions on 454 and Illumina data?

35 Moving Pictures of the Human Microbiome: QIIME tutorial A small subset of the full data set to facilitate short run Zme: ~0.1% of the full sequence colleczon. Sequenced across six Illumina GAIIx lanes, with a subset of the samples also sequenced on 454.

36 Tutorial Click on the link in the wiki. Find your user name in the notebook. It will look something like: wvtreuren_stamps_2013.ipynb Click this link. It will open in a new window. Don t do anything else un)l we complete the next 4 slides.

37

38 IPython reference IPython acts like a hybrid python/bash environment. The way we interact with the IPython notebook is through the cells

39 IPython reference Commands prefixed by a '!' character are issued to the shell (just like what your terminal runs). Commands not prefixed with '!' are issued to python, and behave as they normally would in python. Each 'cell' of the notebook is executable. ShiR+Enter (or the play buton) is the way you execute (or re- execute) the commands in a given cell. You must click in the cell to gain focus in that cell, and then type ShiR+Enter or hit the play buton

40 IPython reference Each executable has a prefix that shows you its status (if it has been run, if it hasn t been run, or if its szll running) Hasn t been run Has been run SZll running

41 Tree Building Experimental Sequences TTGGAAGATGTCTCAGTTCCAGA! TTGGGCCGTATGTCAGTCCCTAAGGAG! CTGGGCCGTGTCTCAGTCCCAATCA! TTGGAAGATGTCTCAGTTCCAGGGGCTATAA! TTGGGCCGTATGTCAGTCCCTACGTAACA Phylogeny! CTG-CGCCGTGTCTCAGT CCTC--AA! TTGGAAGATGTCTCAGT----TCCAGA! TTGGGCCGTATGTCAGTCCCTAAGGAG! CTG-GGCG--TGTCTCAGTCCCAATCA! TTGGAAGATGT--CTCAGT-GCTATAA! TTGG---ATGTCAGTCCCTACGTAACA Aligned! Sequences CTG-CGCCGTGTCTCAGT CCTC--AA! CG! C! TTGGAAGATGTCTCAGT----TCCAGA! AA! A! TTGGGCCGTATGTCAGTCCCTAAGGAG! GC! A! CTG-GGCG--TGTCTCAGTCCCAATCA! GG! G! TTGGAAGATGT--CTCAGT-GCTATAA! AA! A! TTGG---ATGTCAGTCCCTACGTAACA - Masked and aligned! sequences

42 In the ancient times of We used KiNG for viewing 3D plots in QIIME.

43 It's 2013! Emperor

44 Description 3D visualizazon tool Cross- pla orm Integrates with QIIME and it's workflows Use case- driven Easy to use In aczve development hsp:// hsp://

45 hsp://24.media.tumblr.com/tumblr_m6q4dgigkw1qzjxifo1_1280.jpg

46

47

48 Issues, suggestions, feature requests? Contact us: o Or contact the QIIME Forum o hsp://groups.google.com/group/qiime- forum

49 Now try the Taxa Summary Plots and OTU Category Significance seczons on your own

Microbiome Analysis. Research Day 2012 Ranjit Kumar

Microbiome Analysis. Research Day 2012 Ranjit Kumar Microbiome Analysis Research Day 2012 Ranjit Kumar Human Microbiome Microorganisms Bad or good? Human colon contains up to 100 trillion bacteria. Human microbiome - The community of bacteria that live

More information

Introduction to taxonomic analysis of metagenomic amplicon and shotgun data with QIIME. Peter Sterk EBI Metagenomics Course 2014

Introduction to taxonomic analysis of metagenomic amplicon and shotgun data with QIIME. Peter Sterk EBI Metagenomics Course 2014 Introduction to taxonomic analysis of metagenomic amplicon and shotgun data with QIIME Peter Sterk EBI Metagenomics Course 2014 1 Taxonomic analysis using next-generation sequencing Objective we want to

More information

Contents 16S rrna SEQUENCING DATA ANALYSIS TUTORIAL WITH QIIME... 5

Contents 16S rrna SEQUENCING DATA ANALYSIS TUTORIAL WITH QIIME... 5 QIIME Analysis 1 Contents 16S rrna SEQUENCING DATA ANALYSIS TUTORIAL WITH QIIME... 5 Report Overview... 5 How to Obtain Microbiome Data... 6 How to Setup QIIME... 7 Essential files for QIIME... 7 Sequence

More information

Carl Woese. Used 16S rrna to developed a method to Identify any bacterium, and discovered a novel domain of life

Carl Woese. Used 16S rrna to developed a method to Identify any bacterium, and discovered a novel domain of life METAGENOMICS Carl Woese Used 16S rrna to developed a method to Identify any bacterium, and discovered a novel domain of life His amazing discovery, coupled with his solitary behaviour, made many contemporary

More information

A FRAMEWORK FOR ANALYSIS OF METAGENOMIC SEQUENCING DATA

A FRAMEWORK FOR ANALYSIS OF METAGENOMIC SEQUENCING DATA A FRAMEWORK FOR ANALYSIS OF METAGENOMIC SEQUENCING DATA A. MURAT EREN Department of Computer Science, University of New Orleans, 2000 Lakeshore Drive, New Orleans, LA 70148, USA Email: aeren@uno.edu MICHAEL

More information

Development of NGS metabarcoding. characterization of aerobiological samples. Lucia Muggia

Development of NGS metabarcoding. characterization of aerobiological samples. Lucia Muggia Development of NGS metabarcoding for the characterization of aerobiological samples Lucia Muggia Alberto Pallavicini, Elisa Banchi, Claudio G. Ametrano, David Stankovic, Silvia Ongaro, Enrico Tordoni,

More information

Introduction to Microbial Community Analysis. Tommi Vatanen CS-E Statistical Genetics and Personalised Medicine

Introduction to Microbial Community Analysis. Tommi Vatanen CS-E Statistical Genetics and Personalised Medicine Introduction to Microbial Community Analysis Tommi Vatanen CS-E5890 - Statistical Genetics and Personalised Medicine Structure of the lecture Motivation: human microbiome Terminology Data types, analysis

More information

An introduction into 16S rrna gene sequencing analysis. Stefan Boers

An introduction into 16S rrna gene sequencing analysis. Stefan Boers An introduction into 16S rrna gene sequencing analysis Stefan Boers Microbiome, microbiota or metagenomics? Microbiome The entire habitat, including the microorganisms, their genomes (i.e., genes) and

More information

Functional analysis using EBI Metagenomics

Functional analysis using EBI Metagenomics Functional analysis using EBI Metagenomics Contents Tutorial information... 2 Tutorial learning objectives... 2 An introduction to functional analysis using EMG... 3 What are protein signatures?... 3 Assigning

More information

Metagenome Analysis With MG- RAST

Metagenome Analysis With MG- RAST Metagenome Analysis With MG- RAST Folker Meyer, PhD Argonne National Laboratory and University of Chicago http://metagenomics.anl.gov Palm Springs, March 2013 Acknowledgements Team: Dion Antonopoulos Daniela

More information

Phylogenetic methods for taxonomic profiling

Phylogenetic methods for taxonomic profiling Phylogenetic methods for taxonomic profiling Siavash Mirarab University of California at San Diego (UCSD) Joint work with Tandy Warnow, Nam-Phuong Nguyen, Mike Nute, Mihai Pop, and Bo Liu Phylogeny reconstruction

More information

OMNIgene GUT stabilizes the microbiome profile at ambient temperature for 60 days and during transport

OMNIgene GUT stabilizes the microbiome profile at ambient temperature for 60 days and during transport OMNIgene GUT stabilizes the microbiome profile at ambient temperature for 60 days and during transport Evgueni Doukhanine, Anne Bouevitch, Ashlee Brown, Jessica Gage LaVecchia, Carlos Merino and Lindsay

More information

Infectious Disease Omics

Infectious Disease Omics Infectious Disease Omics Metagenomics Ernest Diez Benavente LSHTM ernest.diezbenavente@lshtm.ac.uk Course outline What is metagenomics? In situ, culture-free genomic characterization of the taxonomic and

More information

David Jacob Meltzer m. Supervisor: Dr. Umer Zeeshan Ijaz

David Jacob Meltzer m. Supervisor: Dr. Umer Zeeshan Ijaz AMPLIpyth: A Python Pipeline for Amplicon Processing David Jacob Meltzer 0803837m MSc Bioinformatics, Polyomics and Systems Biology Supervisor: Dr. Umer Zeeshan Ijaz A report submitted in partial fulfillment

More information

A comparison of sequencing platforms and bioinformatics pipelines for compositional analysis of the gut microbiome

A comparison of sequencing platforms and bioinformatics pipelines for compositional analysis of the gut microbiome Allali et al. BMC Microbiology (2017) 17:194 DOI 10.1186/s12866-017-1101-8 RESEARCH ARTICLE Open Access A comparison of sequencing platforms and bioinformatics pipelines for compositional analysis of the

More information

Introduction to OTU Clustering. Susan Huse August 4, 2016

Introduction to OTU Clustering. Susan Huse August 4, 2016 Introduction to OTU Clustering Susan Huse August 4, 2016 What is an OTU? Operational Taxonomic Units a.k.a. phylotypes a.k.a. clusters aggregations of reads based only on sequence similarity, independent

More information

Supplementary Figure and Table Legends

Supplementary Figure and Table Legends 1 Supplementary Figure and Table Legends Figure S1: Whole-animal metabolic analysis. 12 week old WT and Dvl1 / were singly housed in CLAMS cages (Comprehensive Laboratory Animals Monitoring System) for

More information

Enabling reproducible data analysis for metagenomics. eresearch Africa Conference 2017 Gerrit Botha CBIO H3ABioNet 3 May 2017

Enabling reproducible data analysis for metagenomics. eresearch Africa Conference 2017 Gerrit Botha CBIO H3ABioNet 3 May 2017 Enabling reproducible data analysis for metagenomics eresearch Africa Conference 2017 Gerrit Botha CBIO H3ABioNet 3 May 2017 Outline 16S rrna analysis Current CBIO 16S rrna analysis setup H3ABioNet hackathon

More information

CDC s Advanced Molecular Detection (AMD) Sequence Data Analysis and Management

CDC s Advanced Molecular Detection (AMD) Sequence Data Analysis and Management CDC s Advanced Molecular Detection (AMD) Sequence Data Analysis and Management Scott Sammons Technology Officer Office of Advanced Molecular Detection National Center for Emerging and Zoonotic Infectious

More information

Leonardo Mariño-Ramírez, PhD NCBI / NLM / NIH. BIOL 7210 A Computational Genomics 2/18/2015

Leonardo Mariño-Ramírez, PhD NCBI / NLM / NIH. BIOL 7210 A Computational Genomics 2/18/2015 Leonardo Mariño-Ramírez, PhD NCBI / NLM / NIH BIOL 7210 A Computational Genomics 2/18/2015 The $1,000 genome is here! http://www.illumina.com/systems/hiseq-x-sequencing-system.ilmn Bioinformatics bottleneck

More information

Bioinformatic Suggestions on MiSeq-Based Microbial Community S

Bioinformatic Suggestions on MiSeq-Based Microbial Community S J. Microbiol. Biotechnol. (2015), 25(6), 765 770 http://dx.doi.org/10.4014/jmb.1409.09057 Review Research Article jmb Bioinformatic Suggestions on MiSeq-Based Microbial Community S Analysis Tatsuya Unno*

More information

Bioinformatic tools for metagenomic data analysis

Bioinformatic tools for metagenomic data analysis Bioinformatic tools for metagenomic data analysis MEGAN - blast-based tool for exploring taxonomic content MG-RAST (SEED, FIG) - rapid annotation of metagenomic data, phylogenetic classification and metabolic

More information

Evaluating the accuracy of amplicon-based microbiome computational pipelines on simulated human gut microbial communities

Evaluating the accuracy of amplicon-based microbiome computational pipelines on simulated human gut microbial communities Golob et al. BMC Bioinformatics (2017) 18:283 DOI 10.1186/s12859-017-1690-0 RESEARCH ARTICLE Evaluating the accuracy of amplicon-based microbiome computational pipelines on simulated human gut microbial

More information

METAGENOMICS. Aina Maria Mas Calafell Genomics

METAGENOMICS. Aina Maria Mas Calafell Genomics METAGENOMICS Aina Maria Mas Calafell Genomics Introduction Microbial communities Primary role in biogeochemical systems Study of microbial communities 1.- Culture-based methodologies Only isolated microbes

More information

Sanger vs Next-Gen Sequencing

Sanger vs Next-Gen Sequencing Tools and Algorithms in Bioinformatics GCBA815/MCGB815/BMI815, Fall 2017 Week-8: Next-Gen Sequencing RNA-seq Data Analysis Babu Guda, Ph.D. Professor, Genetics, Cell Biology & Anatomy Director, Bioinformatics

More information

MICROBIOME SOFTWARE: END OF BEGINNING.

MICROBIOME SOFTWARE: END OF BEGINNING. MICROBIOME SOFTWARE: END OF BEGINNING. DR. CHARLES ROBERTSON DIVISION OF INFECTIOUS DISEASES, UNIVERSITY OF COLORADO SCHOOL OF MEDICINE DR. DANIEL N. FRANK, DIVISION OF INFECTIOUS DISEASES, SCHOOL OF MEDICINE

More information

Microbially Mediated Plant Salt Tolerance and Microbiome based Solutions for Saline Agriculture

Microbially Mediated Plant Salt Tolerance and Microbiome based Solutions for Saline Agriculture Microbially Mediated Plant Salt Tolerance and Microbiome based Solutions for Saline Agriculture Contents Introduction Abiotic Tolerance Approaches Reasons for failure Roots, microorganisms and soil-interaction

More information

Using Rule Induction to Elucidate Co-Occurrence Patterns in Microbial Data. K. Kumar Thurimella. A thesis submitted to the

Using Rule Induction to Elucidate Co-Occurrence Patterns in Microbial Data. K. Kumar Thurimella. A thesis submitted to the Using Rule Induction to Elucidate Co-Occurrence Patterns in Microbial Data by K. Kumar Thurimella A thesis submitted to the University of Colorado in partial fulfillment of the requirements for the degree

More information

Welcome to the NGS webinar series

Welcome to the NGS webinar series Welcome to the NGS webinar series Webinar 1 NGS: Introduction to technology, and applications NGS Technology Webinar 2 Targeted NGS for Cancer Research NGS in cancer Webinar 3 NGS: Data analysis for genetic

More information

Microbial Diversity and Assessment (III) Spring, 2007 Guangyi Wang, Ph.D. POST103B

Microbial Diversity and Assessment (III) Spring, 2007 Guangyi Wang, Ph.D. POST103B Microbial Diversity and Assessment (III) Spring, 2007 Guangyi Wang, Ph.D. POST103B guangyi@hawaii.edu http://www.soest.hawaii.edu/marinefungi/ocn403webpage.htm Overview of Last Lecture Taxonomy (three

More information

Biochemistry 412. New Strategies, Technologies, & Applications For DNA Sequencing. 12 February 2008

Biochemistry 412. New Strategies, Technologies, & Applications For DNA Sequencing. 12 February 2008 Biochemistry 412 New Strategies, Technologies, & Applications For DNA Sequencing 12 February 2008 Note: Scale is wrong!! (at least for sequences) 10 6 In 1980, the sequencing cost per finished bp $1.00

More information

Genomics and High Performance Computing. Folker Meyer Argonne National Laboratory and University of Chicago

Genomics and High Performance Computing. Folker Meyer Argonne National Laboratory and University of Chicago Genomics and High Performance Computing Folker Meyer and University of Chicago Brief intro: I am a computer scientist turned computational biologist My CS friends tell me I am a biologist My BIO friends

More information

Chapter 12: Human Microbiome Analysis

Chapter 12: Human Microbiome Analysis Education Chapter 12: Human Microbiome Analysis Xochitl C. Morgan 1, Curtis Huttenhower 1,2 * 1 Department of Biostatistics, Harvard School of Public Health, Boston, Massachusetts, United States of America,

More information

Data Analysis with CASAVA v1.8 and the MiSeq Reporter

Data Analysis with CASAVA v1.8 and the MiSeq Reporter Data Analysis with CASAVA v1.8 and the MiSeq Reporter Eric Smith, PhD Bioinformatics Scientist September 15 th, 2011 2010 Illumina, Inc. All rights reserved. Illumina, illuminadx, Solexa, Making Sense

More information

Human-microbe mutualism: stability and resilience in health and disease

Human-microbe mutualism: stability and resilience in health and disease Human-microbe mutualism: stability and resilience in health and disease David A. Relman, Stanford University IOM Forum on Microbial Threats March 7, 2012 Our extended self : human-microbe mutualism (Based

More information

Quality assessment and control of sequence data. Naiara Rodríguez-Ezpeleta

Quality assessment and control of sequence data. Naiara Rodríguez-Ezpeleta Quality assessment and control of sequence data Naiara Rodríguez-Ezpeleta Workshop on Genomics 2014 Quality control is important Some of the artefacts/problems that can be detected with QC Sequencing Sequence

More information

RIPTIDE HIGH THROUGHPUT RAPID LIBRARY PREP (HT-RLP)

RIPTIDE HIGH THROUGHPUT RAPID LIBRARY PREP (HT-RLP) Application Note: RIPTIDE HIGH THROUGHPUT RAPID LIBRARY PREP (HT-RLP) Introduction: Innovations in DNA sequencing during the 21st century have revolutionized our ability to obtain nucleotide information

More information

Prokaryotic Diversity of the Wastewater Outfalls, Reefs, and Inlets of Broward County

Prokaryotic Diversity of the Wastewater Outfalls, Reefs, and Inlets of Broward County Nova Southeastern University NSUWorks Theses and Dissertations HCNSO Student Work 5-1-2014 Prokaryotic Diversity of the Wastewater Outfalls, Reefs, and Inlets of Broward County Alexandra Mandina Campbell

More information

From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow

From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow Technical Overview Import VCF Introduction Next-generation sequencing (NGS) studies have created unanticipated challenges with

More information

SHAMAN : SHiny Application for Metagenomic ANalysis

SHAMAN : SHiny Application for Metagenomic ANalysis SHAMAN : SHiny Application for Metagenomic ANalysis Stevenn Volant, Amine Ghozlane Hub Bioinformatique et Biostatistique C3BI, USR 3756 IP CNRS Biomics CITECH Ribosome ITS (1) : located between 18S and

More information

16S rrna gene pyrosequencing of reference and clinical samples and investigation of the temperature stability of microbiome profiles

16S rrna gene pyrosequencing of reference and clinical samples and investigation of the temperature stability of microbiome profiles Hang et al. Microbiome 2014, 2:31 METHODOLOGY Open Access 16S rrna gene pyrosequencing of reference and clinical samples and investigation of the temperature stability of microbiome profiles Jun Hang 1*,

More information

ELE4120 Bioinformatics. Tutorial 5

ELE4120 Bioinformatics. Tutorial 5 ELE4120 Bioinformatics Tutorial 5 1 1. Database Content GenBank RefSeq TPA UniProt 2. Database Searches 2 Databases A common situation for alignment is to search through a database to retrieve the similar

More information

Why learn sequence database searching? Searching Molecular Databases with BLAST

Why learn sequence database searching? Searching Molecular Databases with BLAST Why learn sequence database searching? Searching Molecular Databases with BLAST What have I cloned? Is this really!my gene"? Basic Local Alignment Search Tool How BLAST works Interpreting search results

More information

Improved taxonomic assignment of human intestinal 16S rrna sequences by a dedicated reference database

Improved taxonomic assignment of human intestinal 16S rrna sequences by a dedicated reference database Ritari et al. BMC Genomics (2015) 16:1056 DOI 10.1186/s12864-015-2265-y RESEARCH ARTICLE Open Access Improved taxonomic assignment of human intestinal 16S rrna sequences by a dedicated reference database

More information

DNA extraction protocols cause differences in 16S rrna amplicon sequencing efficiency but not in community profile composition or structure

DNA extraction protocols cause differences in 16S rrna amplicon sequencing efficiency but not in community profile composition or structure DNA extraction protocols cause differences in 16S rrna amplicon sequencing efficiency but not in community profile composition or structure The Harvard community has made this article openly available.

More information

Next-Generation Sequencing. Technologies

Next-Generation Sequencing. Technologies Next-Generation Next-Generation Sequencing Technologies Sequencing Technologies Nicholas E. Navin, Ph.D. MD Anderson Cancer Center Dept. Genetics Dept. Bioinformatics Introduction to Bioinformatics GS011062

More information

Introduction. Jullien M. Flynn 1, Emily A. Brown 1,2,Frederic J. J. Chain 1, Hugh J. MacIsaac 2 & Melania E. Cristescu 1. Abstract

Introduction. Jullien M. Flynn 1, Emily A. Brown 1,2,Frederic J. J. Chain 1, Hugh J. MacIsaac 2 & Melania E. Cristescu 1. Abstract Toward accurate molecular identification of species in complex environmental samples: testing the performance of sequence filtering and clustering methods Jullien M. Flynn 1, Emily A. Brown 1,2,Frederic

More information

M1D2: Diagnostic Primer Design 2/10/15

M1D2: Diagnostic Primer Design 2/10/15 M1D2: Diagnostic Primer Design 2/10/15 Announcements 1. Expanded office hours for this week: Wednesday, 3-5pm in 16-319 Friday, 3-5pm in 16-319 Sunday, 3-5pm in 16-319 2. Weekly office hours (starting

More information

scgem Workflow Experimental Design Single cell DNA methylation primer design

scgem Workflow Experimental Design Single cell DNA methylation primer design scgem Workflow Experimental Design Single cell DNA methylation primer design The scgem DNA methylation assay uses qpcr to measure digestion of target loci by the methylation sensitive restriction endonuclease

More information

Integrating Evolutionary, Ecological and Statistical Approaches to Metagenomics. A proposal to the Gordon and Betty Moore Foundation

Integrating Evolutionary, Ecological and Statistical Approaches to Metagenomics. A proposal to the Gordon and Betty Moore Foundation Integrating Evolutionary, Ecological and Statistical Approaches to Metagenomics A proposal to the Gordon and Betty Moore Foundation Jonathan A. Eisen University of California, Davis U. C. Davis Genome

More information

Targeted Sequencing Using Droplet-Based Microfluidics. Keith Brown Director, Sales

Targeted Sequencing Using Droplet-Based Microfluidics. Keith Brown Director, Sales Targeted Sequencing Using Droplet-Based Microfluidics Keith Brown Director, Sales brownk@raindancetech.com Who we are: is a Provider of Microdroplet-based Solutions The Company s RainStorm TM Technology

More information

Kristin Tweel, PhD, MBA

Kristin Tweel, PhD, MBA Kristin Tweel, PhD, MBA Company Overview: Not-for-profit founded in 2000, enabled over $70M to date Identify areas where genomics and other omics can help Connect industry and academia Help identify and

More information

RESEARCH INSTITUTION: : BASELINE AND OIL SPILL IMPACTED MARINE SPONGE MICROBIAL COMMUNITIES AND GENE EXPRESSION ANALYSIS WITH METAGENOMICS

RESEARCH INSTITUTION: : BASELINE AND OIL SPILL IMPACTED MARINE SPONGE MICROBIAL COMMUNITIES AND GENE EXPRESSION ANALYSIS WITH METAGENOMICS RESEARCH INSTITUTION: : BASELINE AND OIL SPILL IMPACTED MARINE SPONGE MICROBIAL COMMUNITIES AND GENE EXPRESSION ANALYSIS WITH METAGENOMICS Jose V Lopez 1, Rebecca Vega Thurber, Peter McCarthy, Patricia

More information

RNA-seq Data Analysis

RNA-seq Data Analysis Lecture 3. Clustering; Function/Pathway Enrichment analysis RNA-seq Data Analysis Qi Sun Bioinformatics Facility Biotechnology Resource Center Cornell University Lecture 1. Map RNA-seq read to genome Lecture

More information

Microbiome analysis of skin undergoing acne treatments

Microbiome analysis of skin undergoing acne treatments Microbiome analysis of skin undergoing acne treatments Groups Sample size Time points Head Site Code Healthy, No treatment Acne, Receiving Spironolactone 4 0 2 0,1 Forehead Cheek Nose Chin Fh Ck No Ch

More information

Introns early. Introns late

Introns early. Introns late Introns early Introns late Self splicing RNA are an example for catalytic RNA that could have been present in RNA world. There is little reason to assume that the RNA world was not plagued by self-splicing

More information

Korilog. high-performance sequence similarity search tool & integration with KNIME platform. Patrick Durand, PhD, CEO. BIOINFORMATICS Solutions

Korilog. high-performance sequence similarity search tool & integration with KNIME platform. Patrick Durand, PhD, CEO. BIOINFORMATICS Solutions KLAST high-performance sequence similarity search tool & integration with KNIME platform Patrick Durand, PhD, CEO Sequence analysis big challenge DNA sequence... Context 1. Modern sequencers produce huge

More information

Supplementary Information

Supplementary Information Supplementary Information Title: Fat binding capacity and modulation of the gut microbiota both determine the effect of wheat bran fractions on adiposity Francesco Suriano 1,*, Laure B. Bindels 1,*, Joran

More information

arxiv: v1 [q-bio.gn] 25 Nov 2015

arxiv: v1 [q-bio.gn] 25 Nov 2015 MetaScope - Fast and accurate identification of microbes in metagenomic sequencing data Benjamin Buchfink 1, Daniel H. Huson 1,2 & Chao Xie 2,3 arxiv:1511.08753v1 [q-bio.gn] 25 Nov 2015 1 Department of

More information

Finding Biology in the Human Microbiome. George Weinstock

Finding Biology in the Human Microbiome. George Weinstock Finding Biology in the Human Microbiome George Weinstock What s next for the Human Microbiome? George Weinstock Metagenomics Unfolds You are here Setting Up Descriptive Phase Hypothesis Testing Metagenomics

More information

Recent urbanization in China is correlated with a Westernized microbiome encoding increased virulence and antibiotic resistance genes

Recent urbanization in China is correlated with a Westernized microbiome encoding increased virulence and antibiotic resistance genes Winglee et al. Microbiome (2017) 5:121 DOI 10.1186/s40168-017-0338-7 RESEARCH Open Access Recent urbanization in China is correlated with a Westernized microbiome encoding increased virulence and antibiotic

More information

European Union Reference Laboratory for Genetically Modified Food and Feed (EURL GMFF)

European Union Reference Laboratory for Genetically Modified Food and Feed (EURL GMFF) Guideline for the submission of DNA sequences derived from genetically modified organisms and associated annotations within the framework of Directive 2001/18/EC and Regulation (EC) No 1829/2003 European

More information

Assigning Sequences to Taxa CMSC828G

Assigning Sequences to Taxa CMSC828G Assigning Sequences to Taxa CMSC828G Outline Objective (1 slide) MEGAN (17 slides) SAP (33 slides) Conclusion (1 slide) Objective Given an unknown, environmental DNA sequence: Make a taxonomic assignment

More information

Predictive functional profiling of microbial communities using 16S rrna marker gene sequences

Predictive functional profiling of microbial communities using 16S rrna marker gene sequences Predictive functional profiling of microbial communities using 16S rrna marker gene sequences The Harvard community has made this article openly available. Please share how this access benefits you. Your

More information

Plan, Deploy and Configure Microsoft InTune

Plan, Deploy and Configure Microsoft InTune Plan, Deploy and Configure Microsoft InTune 5 Day Course AUDIENCE IT Pros that have experience with Windows 10 use, deployment and management Experience with any optional ios or Android devices. FORMAT

More information

How much sequencing do I need? Emily Crisovan Genomics Core

How much sequencing do I need? Emily Crisovan Genomics Core How much sequencing do I need? Emily Crisovan Genomics Core How much sequencing? Three questions: 1. How much sequence is required for good experimental design? 2. What type of sequencing run is best?

More information

Turning Customers into Marketers Kim Johnston, VP of Marketing, Parallels Emily Johnson, Account Director, Banyan Branch

Turning Customers into Marketers Kim Johnston, VP of Marketing, Parallels Emily Johnson, Account Director, Banyan Branch Turning Customers into Marketers Kim Johnston, VP of Marketing, Parallels Emily Johnson, Account Director, Banyan Branch 3 Key Messages 1. Your enthusiastic customers (aka Advocates ) are your best marketers

More information

De Novo Assembly of High-throughput Short Read Sequences

De Novo Assembly of High-throughput Short Read Sequences De Novo Assembly of High-throughput Short Read Sequences Chuming Chen Center for Bioinformatics and Computational Biology (CBCB) University of Delaware NECC Third Skate Genome Annotation Workshop May 23,

More information

(SHOTGUN) METAGENOMICS. Hélène Touzet, CNRS, CRIStAL

(SHOTGUN) METAGENOMICS. Hélène Touzet, CNRS, CRIStAL (SHOTGUN) METAGENOMICS Hélène Touzet, CNRS, CRIStAL helene.touzet@univ-lille.fr Shotgun sequencing for community samples Metagenomics potentially sequences all fragmented DNA in a community includes all

More information

Metagenomic Analysis in Human- Associated Projects

Metagenomic Analysis in Human- Associated Projects Metagenomic Analysis in Human- Associated Projects Wikimedia Commons Wikimedia Commons Daniel H. Huson Singapore Center for Environmental Life Science Engineering (SCELSE) ZBIT Center for Bioinformatics

More information

Exploring Microbial Diversity and Taxonomy Using SSU rrna Hypervariable Tag Sequencing

Exploring Microbial Diversity and Taxonomy Using SSU rrna Hypervariable Tag Sequencing Exploring Microbial Diversity and Taxonomy Using SSU rrna Hypervariable Tag Sequencing Susan M. Huse 1, Les Dethlefsen 2, Julie A. Huber 1, David Mark Welch 1, David A. Relman 2,3,4, Mitchell L. Sogin

More information

Introduction to NGS Analysis Tools

Introduction to NGS Analysis Tools National Center for Emerging and Zoonotic Infectious Diseases Introduction to NGS Analysis Tools Heather Carleton, PhD, MPH Team Lead, Enteric Diseases Bioinformatics, Enteric Diseases Laboratory Branch,

More information

Microbial Biogeography of Public Restroom Surfaces

Microbial Biogeography of Public Restroom Surfaces Microbial Biogeography of Public Restroom Surfaces Gilberto E. Flores 1, Scott T. Bates 1, Dan Knights 2, Christian L. Lauber 1, Jesse Stombaugh 3, Rob Knight 3,4, Noah Fierer 1,5 * 1 Cooperative Institute

More information

Forest soil bacterial community analysis using high-throughput amplicon sequencing

Forest soil bacterial community analysis using high-throughput amplicon sequencing DISSERTATIONES TECHNOLOGIAE CIRCUMIECTORIUM UNIVERSITATIS TARTUENSIS 27 JENS-KONRAD PREEM Forest soil bacterial community analysis using high-throughput amplicon sequencing 1 DISSERTATIONES TECHNOLOGIAE

More information

Water Quality and Waller Creek Dr. Kinney & UTBIOME Collaborators. What is in Waller Creek? A Wide Variety of Biota!

Water Quality and Waller Creek Dr. Kinney & UTBIOME Collaborators. What is in Waller Creek? A Wide Variety of Biota! Water Quality and Waller Creek Dr. Kinney & UTBIOME Collaborators The Visible & The Invisible What is in Waller Creek? A Wide Variety of Biota! Yellow crowned Night Heron at 24th Street Bridge June 2003

More information

NEXT-GENERATION SEQUENCING AND BIOINFORMATICS

NEXT-GENERATION SEQUENCING AND BIOINFORMATICS NEXT-GENERATION SEQUENCING AND BIOINFORMATICS Moore's law: the number of transistors in a dense integrated circuit doubles every two years Moore's law calculates and predicts the pace of improvement of

More information

6 Keys to SharePoint User Adoption.

6 Keys to SharePoint User Adoption. 6 Keys to SharePoint User Adoption http://www.dmcinfo.com The key to SharePoint success has nothing to do with workflows or customizations. The most critical aspect of implementing this powerful tool is

More information

Files for this Tutorial: All files needed for this tutorial are compressed into a single archive: [BLAST_Intro.tar.gz]

Files for this Tutorial: All files needed for this tutorial are compressed into a single archive: [BLAST_Intro.tar.gz] BLAST Exercise: Detecting and Interpreting Genetic Homology Adapted by W. Leung and SCR Elgin from Detecting and Interpreting Genetic Homology by Dr. J. Buhler Prequisites: None Resources: The BLAST web

More information

AP BIOLOGY. Investigation #2 Mathematical Modeling: Hardy-Weinberg. Slide 1 / 35. Slide 2 / 35. Slide 3 / 35. Investigation #2: Mathematical Modeling

AP BIOLOGY. Investigation #2 Mathematical Modeling: Hardy-Weinberg. Slide 1 / 35. Slide 2 / 35. Slide 3 / 35. Investigation #2: Mathematical Modeling New Jersey Center for Teaching and Learning Slide 1 / 35 Progressive Science Initiative This material is made freely available at www.njctl.org and is intended for the non-commercial use of students and

More information

Outline. Evolution. Adaptive convergence. Common similarity problems. Chapter 7: Similarity searches on sequence databases

Outline. Evolution. Adaptive convergence. Common similarity problems. Chapter 7: Similarity searches on sequence databases Chapter 7: Similarity searches on sequence databases All science is either physics or stamp collection. Ernest Rutherford Outline Why is similarity important BLAST Protein and DNA Interpreting BLAST Individualizing

More information

Synthetic spike-in standards for high-throughput 16S rrna gene amplicon sequencing

Synthetic spike-in standards for high-throughput 16S rrna gene amplicon sequencing Published online 15 December 2016 Nucleic Acids Research, 2017, Vol. 45, No. 4 e23 doi: 10.1093/nar/gkw984 Synthetic spike-in standards for high-throughput 16S rrna gene amplicon sequencing Dieter M. Tourlousse,

More information

Theory and Application of Multiple Sequence Alignments

Theory and Application of Multiple Sequence Alignments Theory and Application of Multiple Sequence Alignments a.k.a What is a Multiple Sequence Alignment, How to Make One, and What to Do With It Brett Pickett, PhD History Structure of DNA discovered (1953)

More information

AP BIOLOGY. Investigation #3 Comparing DNA Sequences to Understand Evolutionary Relationships with BLAST. Slide 1 / 32. Slide 2 / 32.

AP BIOLOGY. Investigation #3 Comparing DNA Sequences to Understand Evolutionary Relationships with BLAST. Slide 1 / 32. Slide 2 / 32. New Jersey Center for Teaching and Learning Slide 1 / 32 Progressive Science Initiative This material is made freely available at www.njctl.org and is intended for the non-commercial use of students and

More information

Basic Bioinformatics: Homology, Sequence Alignment,

Basic Bioinformatics: Homology, Sequence Alignment, Basic Bioinformatics: Homology, Sequence Alignment, and BLAST William S. Sanders Institute for Genomics, Biocomputing, and Biotechnology (IGBB) High Performance Computing Collaboratory (HPC 2 ) Mississippi

More information

A proposal to the Gordon and Betty Moore Foundation

A proposal to the Gordon and Betty Moore Foundation INTEGRATING EVOLUTIONARY, ECOLOGICAL AND STATISTICAL APPROACHES TO METAGENOMICS A proposal to the Gordon and Betty Moore Foundation Jonathan A. Eisen University of California, Davis U. C. Davis Genome

More information

T he diverse microbial communities that dwell in the human body are linked intimately with aspects of host

T he diverse microbial communities that dwell in the human body are linked intimately with aspects of host SUBJECT AREAS: BIOINFORMATICS COMPUTATIONAL BIOLOGY ENVIRONMENTAL MICROBIOLOGY BIODIVERSITY Received 14 June 2011 Accepted 7 November 2011 Published 25 November 2011 Correspondence and requests for materials

More information

Novel bacterial taxa in the human microbiome

Novel bacterial taxa in the human microbiome Washington University School of Medicine Digital Commons@Becker Open Access Publications 2012 Novel bacterial taxa in the human microbiome Kristine M. Wylie Washington University School of Medicine in

More information

Supplemental Information. Temperature-Phased Conversion of Acid. Whey Waste Into Medium-Chain Carboxylic. Acids via Lactic Acid: No External e-donor

Supplemental Information. Temperature-Phased Conversion of Acid. Whey Waste Into Medium-Chain Carboxylic. Acids via Lactic Acid: No External e-donor JOUL, Volume 2 Supplemental Information Temperature-Phased Conversion of Acid Whey Waste Into Medium-Chain Carboxylic Acids via Lactic Acid: No External e-donor Jiajie Xu, Jiuxiao Hao, Juan J.L. Guzman,

More information

ALGORITHMS IN BIO INFORMATICS. Chapman & Hall/CRC Mathematical and Computational Biology Series A PRACTICAL INTRODUCTION. CRC Press WING-KIN SUNG

ALGORITHMS IN BIO INFORMATICS. Chapman & Hall/CRC Mathematical and Computational Biology Series A PRACTICAL INTRODUCTION. CRC Press WING-KIN SUNG Chapman & Hall/CRC Mathematical and Computational Biology Series ALGORITHMS IN BIO INFORMATICS A PRACTICAL INTRODUCTION WING-KIN SUNG CRC Press Taylor & Francis Group Boca Raton London New York CRC Press

More information

CloudLCA: finding the lowest common ancestor in metagenome analysis using cloud computing

CloudLCA: finding the lowest common ancestor in metagenome analysis using cloud computing Protein Cell 2012, 3(2): 148 152 DOI 10.1007/s13238-012-2015-8 RESEARCH ARTICLE CloudLCA: finding the lowest common ancestor in metagenome analysis using cloud computing Guoguang Zhao 1,4*, Dechao Bu 1,4*,

More information

Introduction to Bioinformatics and Gene Expression Technologies

Introduction to Bioinformatics and Gene Expression Technologies Introduction to Bioinformatics and Gene Expression Technologies Utah State University Fall 2017 Statistical Bioinformatics (Biomedical Big Data) Notes 1 1 Vocabulary Gene: hereditary DNA sequence at a

More information

Barcoded primers used in multiplex amplicon pyrosequencing bias amplification

Barcoded primers used in multiplex amplicon pyrosequencing bias amplification AEM Accepts, published online ahead of print on 2 September 2011 Appl. Environ. Microbiol. doi:10.1128/aem.05220-11 Copyright 2011, American Society for Microbiology and/or the Listed Authors/Institutions.

More information

Bioinformatics and computational tools

Bioinformatics and computational tools Bioinformatics and computational tools Etienne P. de Villiers (PhD) International Livestock Research Institute Nairobi, Kenya International Livestock Research Institute Nairobi, Kenya ILRI works at the

More information

Assessing and Improving Methods Used in Operational Taxonomic Unit-Based Approaches for 16S rrna Gene Sequence Analysis

Assessing and Improving Methods Used in Operational Taxonomic Unit-Based Approaches for 16S rrna Gene Sequence Analysis APPLIED AND ENVIRONMENTAL MICROBIOLOGY, May 2011, p. 3219 3226 Vol. 77, No. 10 0099-2240/11/$12.00 doi:10.1128/aem.02810-10 Copyright 2011, American Society for Microbiology. All Rights Reserved. Assessing

More information

HLA and Next Generation Sequencing it s all about the Data

HLA and Next Generation Sequencing it s all about the Data HLA and Next Generation Sequencing it s all about the Data John Ord, NHSBT Colindale and University of Cambridge BSHI Annual Conference Manchester September 2014 Introduction In 2003 the first full public

More information

Last Update: 12/31/2017. Recommended Background Tutorial: An Introduction to NCBI BLAST

Last Update: 12/31/2017. Recommended Background Tutorial: An Introduction to NCBI BLAST BLAST Exercise: Detecting and Interpreting Genetic Homology Adapted by T. Cordonnier, C. Shaffer, W. Leung and SCR Elgin from Detecting and Interpreting Genetic Homology by Dr. J. Buhler Recommended Background

More information

Experimental design and quantitative analysis of microbial community multiomics

Experimental design and quantitative analysis of microbial community multiomics Mallick et al. Genome Biology (2017) 18:228 DOI 10.1186/s13059-017-1359-z REVIEW Experimental design and quantitative analysis of microbial community multiomics Himel Mallick 1,2, Siyuan Ma 1,2, Eric A.

More information

Student Learning Outcomes (SLOS)

Student Learning Outcomes (SLOS) Student Learning Outcomes (SLOS) KNOWLEDGE AND LEARNING SKILLS USE OF KNOWLEDGE AND LEARNING SKILLS - how to use Annhyb to save and manage sequences - how to use BLAST to compare sequences - how to get

More information

Product presentation. Fujitsu HPC Gateway SC 16. November Copyright 2016 FUJITSU

Product presentation. Fujitsu HPC Gateway SC 16. November Copyright 2016 FUJITSU Product presentation Fujitsu HPC Gateway SC 16 November 2016 0 Copyright 2016 FUJITSU In Brief: HPC Gateway Highlights 1 Copyright 2016 FUJITSU Convergent Stakeholder Needs HPC GATEWAY Intelligent Application

More information