Introduc)on to QIIME on the IPython Notebook
|
|
- Donald Preston
- 6 years ago
- Views:
Transcription
1 Strategies and Techniques for Analyzing Microbial Population Structures Introduc)on to QIIME on the IPython Notebook Rob Knight Adam Robbins- Pianka Will Van Treuren Yoshiki Vázquez- Baeza ) Luke Ursell
2 A microbe dominated world The universal nature of biochemistry. Pace NR. Proc Natl Acad Sci U S A Jan 30;98(3):805-8.
3 Vast microbial diversity in every question: ecosystem, how including human our are own we? Human: 10 trillion human cells 20,000 human genes Microbiota: 100 trillion microbial cells Microbiota: 2-20 million microbial genes 99.9% of our genomes the same, but our microbes...?
4 How do we assay this diversity?
5 Sequencing output (454, Illumina, Sanger) fastq, fasta, qual, or sff/trace files Metadata mapping file Pre-processing e.g., remove primer(s), demultiplex, quality filter OTU (or other sample by observation) table Phylogenetic Tree Evolutionary relationship between OTUs Denoise 454 Data Database Submission α-diversity and rarefaction β-diversity and rarefaction PyroNoise, Denoiser (In development) e.g., Phylogenetic Diversity, Chao1, Observed Species e.g., Weighted and unweighted UniFrac, Bray- Curtis, Jaccard Pick OTUs and representative sequences Reference based BLAST, UCLUST, USEARCH De novo e.g., UCLUST, CD-HIT, MOTHUR, USEARCH Interactive visualizations e.g., PCoA plots, distance histograms, taxonomy charts, rarefaction plots, network visualization, jackknifed hierarchical clustering. Assign taxonomy BLAST, RDP Classifier Align sequences e.g., PyNAST, INFERNAL, MUSCLE, MAFFT Legend Currently supported for marker-gene data only Currently supported for general sample by observation data Build 'OTU table' i.e., sample by observation matrix Build phylogenetic tree e.g., FastTree, RAxML, ClearCut (i.e., 'upstream' step) Required step or input (i.e., 'downstream' step) Optional step or input
6 Samples to sequences Sequencing output (454, Illumina, Sanger) fastq, fasta, qual, or sff/trace files Metadata mapping file Pre-processing e.g., remove primer(s), demultiplex, quality filter Denoise 454 Data PyroNoise, Denoiser Database Submission (In development)
7 Error- correczng codes allow mulzplex sequencing >GCACCTGAGGACAGGCATGAGGAA >GCACCTGAGGACAGGGGAGGAGGA >TCACATGAACCTAGGCAGGACGAA >CTACCGGAGGACAGGCATGAGGAT >TCACATGAACCTAGGCAGGAGGAA >GCACCTGAGGACACGCAGGACGAC >CTACCGGAGGACAGGCAGGAGGAA >CTACCGGAGGACACACAGGAGGAA >GAACCTTCACATAGGCAGGAGGAT >TCACATGAACCTAGGGGCAAGGAA >GCACCTGAGGACAGGCAGGAGGAA >PC.634_1 FLP3FBN01ELBSX CTGGGCCGTGTCTCAGTCCCAATGTGGCCGTTTACCCTCTCAGGCCGGCTAC GCATCATCGCCTTGGTGGGCCGTTACCTCACCAACTAGCTAATGCGCCGCAG GTCCATCCATGTTCACGCCTTGATGGGCGCTTTAATATACTGAGCATGCGCT CTGTATACCTATCCGGTTTTAGCTACCGTTTCCAGCAGTTATCCCGGACACA TGGGCTAGG! >PC.354_3 FLP3FBN01EEWKD! TTGGACCGTGTCTCAGTTCCAATGTGGGGGCCTTCCTCTCAGAACCCCTATC CATCGAAGGCTTGGTGGGCCGTTACCCCGCCAACAACCTAATGGAACGCATC CCCATCGATGACCGAAGTTCTTTAATAGTTCTACCATGCGGAAGAACTATGC CATCGGGTATTAATCTTTCTTTCGAAAGGCTATCCCCGAGTCATCGGCAGGT TGGATACGTGTTACTCACCCGTGCGCCGGT! Micah Hamady, et al., Nature Methods, Error- correczng barcodes for pyrosequencing hundreds of samples in mulzplex.
8 Sequences to OTUs and Phylogeny Pick OTUs and representative sequences Reference based BLAST, UCLUST, USEARCH Assign taxonomy BLAST, RDP Classifier De novo e.g., UCLUST, CD-HIT, MOTHUR, USEARCH Align sequences e.g., PyNAST, INFERNAL, MUSCLE, MAFFT e.g. p Build 'OTU table' i.e., sample by observation matrix Build phylogenetic tree e.g., FastTree, RAxML, ClearCut
9 OTU Picking de- novo Clustering Algorithm Clustered Sequences TTGGAAGATGTCTCAGTTCCAG! TTGGAAGATGTCTCAGTTCCAG! TTGGAAGATGTCTCAGTTCCAG! TTGGAAGATGTCTCAGTTCCAG! TTGGGCCGTATGTCAGTCCCTA! TTGGAAGATGTCTCAGTTCCAG! TTGGGCCGTATGTCAGTCCCTA Experimental Sequences OTU1! OTUS OTU2! OTU3!
10 OTU Picking Closed Reference Reference! Sequences TTGGAAGATGTCTCAGTTCCAG! TTGGGCCGTATGTCAGTCCCTA! TTGGAAGATGTCTCAGTTCCAG! TTGGGCCGTATGTCAGTCCCTA Sequences that hit a reference TTGGAAGATGTCTCAGTTCCAG! TTGGAAGATGTCTCAGTTCCAG! TTGGAAGATGTCTCAGTTCCAG! Sequences that failed to hit TTGGAAGATGTCTCAGTTCCAG! TTGGGCCGTATGTCAGTCCCTA! TTGGAAGATGTCTCAGTTCCAG! TTGGGCCGTATGTCAGTCCCTA Experimental Sequences OTUS OTU1! OTU1! OTU1!
11 OTU Picking Open Reference Reference! Sequences TTGGAAGATGTCTCAGTTCCAG! TTGGGCCGTATGTCAGTCCCTA! TTGGAAGATGTCTCAGTTCCAG! TTGGGCCGTATGTCAGTCCCTA Sequences that hit a reference TTGGAAGATGTCTCAGTTCCAG! TTGGAAGATGTCTCAGTTCCAG! TTGGAAGATGTCTCAGTTCCAG! Sequences that failed to hit TTGGAAGATGTCTCAGTTCCAG! TTGGGCCGTATGTCAGTCCCTA! TTGGAAGATGTCTCAGTTCCAG! TTGGGCCGTATGTCAGTCCCTA Experimental Sequences Clustering Algorithm OTU4! OTU5! OTU6! OTU1! OTUS OTU2! OTU3!
12 CompuZng alpha and beta diversity OTU (or other sample by observation) table Phylogenetic Tree Evolutionary relationship between OTUs α-diversity and rarefaction e.g., Phylogenetic Diversity, Chao1, Observed Species β-diversity and rarefaction e.g., Weighted and unweighted UniFrac, Bray- Curtis, Jaccard
13 Comparing microbial communizes Who s there? How many are are there? α (i.e., within sample) diversity How similar are any two samples? Treatments? β (i.e., between sample) diversity
14 PhylogeneZc Diversity (PD): a qualitazve, phylogenezc α- diversity metric Sum of branch length covered by a sample Faith DP (1992) ConservaZon evaluazon and phylogenezc diversity. Biological ConservaZon. 61:1-10.
15 Unweighted UniFrac: a qualitazve, phylogenezc β- diversity metric IdenZcal communizes D = 0.0 Related communizes D ~ 0.5 Unrelated communizes D = 1.0 Percent of observed branch length that is unique to either sample Lozupone and Knight, 2005, Appl Environ Microbiol 71:8228
16 Clustering by UniFrac distance
17 Extract DNA and amplify marker gene with barcoded primers Pool amplicons and sequence >GCACCTGAGGACAGGCATGAGGAA >GCACCTGAGGACAGGGGAGGAGGA >TCACATGAACCTAGGCAGGACGAA >CTACCGGAGGACAGGCATGAGGAT >TCACATGAACCTAGGCAGGAGGAA >GCACCTGAGGACACGCAGGACGAC >CTACCGGAGGACAGGCAGGAGGAA >CTACCGGAGGACACACAGGAGGAA >GAACCTTCACATAGGCAGGAGGAT >TCACATGAACCTAGGGGCAAGGAA >GCACCTGAGGACAGGCAGGAGGAA Assign reads to samples RefSeq 1 RefSeq 2 RefSeq 3 RefSeq 4 RefSeq 5 RefSeq 6 RefSeq 7 RefSeq 8 RefSeq 9 RefSeq 10 Assign millions of sequences from thousands of samples to OTUs Compute UniFrac distances and compare samples
18 Key QIIME files Mapping file: per sample meta- data, user- defined OTU table: sample x OTU matrix, central to downstream analyses [now in biom format] Parameters file: defines analyses, for use with the workflow scripts (opzonal)
19 Parameters Can Be Set In a Few Ways qiime_config files Environment Variable $QIIME_CONFIG_FP User s home directory Parameter files Command line
20 Mapping file
21 Mapping file: always run check_id_map.py! = required field
22 OTU table (classic format) sample x OTU matrix
23 OTU table (classic format) sample x OTU matrix OTU idenzfiers
24 OTU table (classic format) sample x OTU matrix Sample idenzfiers
25 OTU table (classic format) sample x OTU matrix OpZonal per OTU taxonomic informazon
26 OTU tables are now in biological observazon matrix (.biom) format (QIIME dev and later) Google: biom format hsp://biom- format.org See convert_biom.py for translazng between classic and biom otu tables
27 sample x observa/on con/ngency matrix OTUs Samples Observa/on counts
28 sample x observa/on con/ngency matrix Functions Metagenomes Observa/on counts
29 sample x observa/on con/ngency matrix Samples Genomes Samples OTUs Marker gene (e.g., 16S) surveys Ortholog groups ComparaZve genomics Taxa Marker gene (e.g., 16S) surveys Functions Metagenomes Metagenomics Metabolites Samples Metabolomics... Metatranscriptomics
30 The Biological ObservaZon Matrix (BIOM) Format or: How I Learned To Stop Worrying and Love the Ome- ome JSON- based format for represenzng arbitrary sample x observazon conzngency tables with opzonal metadata McDonald et al., GigaScience (2012). hsp:// format.org
31 Running QIIME NaZve installazon on Mac (OS X) or Linux From laptops to 16,000+ core compute cluster qiime- deploy Ubuntu Virtual Box Cloud- based installazons hsp://ncar.janus.rc.colorado.edu/
32 Amazon ElasZc Compute Cloud (EC2)
33 Moving Pictures of the Human Microbiome Two subjects sampled daily, one for six months, one for 18 months Four body sites: tongue, palm of le{ hand, palm of right hand, and gut (via fecal swabs). Caporaso JG et al. (2011) Moving pictures of the human microbiome. Genome biology 12: R50.
34 Moving Pictures of the Human Microbiome InvesZgate the relazve temporal variability of body sites. Is there a temporal core microbiome? Technical points: do we observe the same conclusions on 454 and Illumina data?
35 Moving Pictures of the Human Microbiome: QIIME tutorial A small subset of the full data set to facilitate short run Zme: ~0.1% of the full sequence colleczon. Sequenced across six Illumina GAIIx lanes, with a subset of the samples also sequenced on 454.
36 Tutorial Click on the link in the wiki. Find your user name in the notebook. It will look something like: wvtreuren_stamps_2013.ipynb Click this link. It will open in a new window. Don t do anything else un)l we complete the next 4 slides.
37
38 IPython reference IPython acts like a hybrid python/bash environment. The way we interact with the IPython notebook is through the cells
39 IPython reference Commands prefixed by a '!' character are issued to the shell (just like what your terminal runs). Commands not prefixed with '!' are issued to python, and behave as they normally would in python. Each 'cell' of the notebook is executable. ShiR+Enter (or the play buton) is the way you execute (or re- execute) the commands in a given cell. You must click in the cell to gain focus in that cell, and then type ShiR+Enter or hit the play buton
40 IPython reference Each executable has a prefix that shows you its status (if it has been run, if it hasn t been run, or if its szll running) Hasn t been run Has been run SZll running
41 Tree Building Experimental Sequences TTGGAAGATGTCTCAGTTCCAGA! TTGGGCCGTATGTCAGTCCCTAAGGAG! CTGGGCCGTGTCTCAGTCCCAATCA! TTGGAAGATGTCTCAGTTCCAGGGGCTATAA! TTGGGCCGTATGTCAGTCCCTACGTAACA Phylogeny! CTG-CGCCGTGTCTCAGT CCTC--AA! TTGGAAGATGTCTCAGT----TCCAGA! TTGGGCCGTATGTCAGTCCCTAAGGAG! CTG-GGCG--TGTCTCAGTCCCAATCA! TTGGAAGATGT--CTCAGT-GCTATAA! TTGG---ATGTCAGTCCCTACGTAACA Aligned! Sequences CTG-CGCCGTGTCTCAGT CCTC--AA! CG! C! TTGGAAGATGTCTCAGT----TCCAGA! AA! A! TTGGGCCGTATGTCAGTCCCTAAGGAG! GC! A! CTG-GGCG--TGTCTCAGTCCCAATCA! GG! G! TTGGAAGATGT--CTCAGT-GCTATAA! AA! A! TTGG---ATGTCAGTCCCTACGTAACA - Masked and aligned! sequences
42 In the ancient times of We used KiNG for viewing 3D plots in QIIME.
43 It's 2013! Emperor
44 Description 3D visualizazon tool Cross- pla orm Integrates with QIIME and it's workflows Use case- driven Easy to use In aczve development hsp:// hsp://
45 hsp://24.media.tumblr.com/tumblr_m6q4dgigkw1qzjxifo1_1280.jpg
46
47
48 Issues, suggestions, feature requests? Contact us: o Or contact the QIIME Forum o hsp://groups.google.com/group/qiime- forum
49 Now try the Taxa Summary Plots and OTU Category Significance seczons on your own
Microbiome Analysis. Research Day 2012 Ranjit Kumar
Microbiome Analysis Research Day 2012 Ranjit Kumar Human Microbiome Microorganisms Bad or good? Human colon contains up to 100 trillion bacteria. Human microbiome - The community of bacteria that live
More informationIntroduction to taxonomic analysis of metagenomic amplicon and shotgun data with QIIME. Peter Sterk EBI Metagenomics Course 2014
Introduction to taxonomic analysis of metagenomic amplicon and shotgun data with QIIME Peter Sterk EBI Metagenomics Course 2014 1 Taxonomic analysis using next-generation sequencing Objective we want to
More informationContents 16S rrna SEQUENCING DATA ANALYSIS TUTORIAL WITH QIIME... 5
QIIME Analysis 1 Contents 16S rrna SEQUENCING DATA ANALYSIS TUTORIAL WITH QIIME... 5 Report Overview... 5 How to Obtain Microbiome Data... 6 How to Setup QIIME... 7 Essential files for QIIME... 7 Sequence
More informationCarl Woese. Used 16S rrna to developed a method to Identify any bacterium, and discovered a novel domain of life
METAGENOMICS Carl Woese Used 16S rrna to developed a method to Identify any bacterium, and discovered a novel domain of life His amazing discovery, coupled with his solitary behaviour, made many contemporary
More informationA FRAMEWORK FOR ANALYSIS OF METAGENOMIC SEQUENCING DATA
A FRAMEWORK FOR ANALYSIS OF METAGENOMIC SEQUENCING DATA A. MURAT EREN Department of Computer Science, University of New Orleans, 2000 Lakeshore Drive, New Orleans, LA 70148, USA Email: aeren@uno.edu MICHAEL
More informationDevelopment of NGS metabarcoding. characterization of aerobiological samples. Lucia Muggia
Development of NGS metabarcoding for the characterization of aerobiological samples Lucia Muggia Alberto Pallavicini, Elisa Banchi, Claudio G. Ametrano, David Stankovic, Silvia Ongaro, Enrico Tordoni,
More informationIntroduction to Microbial Community Analysis. Tommi Vatanen CS-E Statistical Genetics and Personalised Medicine
Introduction to Microbial Community Analysis Tommi Vatanen CS-E5890 - Statistical Genetics and Personalised Medicine Structure of the lecture Motivation: human microbiome Terminology Data types, analysis
More informationAn introduction into 16S rrna gene sequencing analysis. Stefan Boers
An introduction into 16S rrna gene sequencing analysis Stefan Boers Microbiome, microbiota or metagenomics? Microbiome The entire habitat, including the microorganisms, their genomes (i.e., genes) and
More informationFunctional analysis using EBI Metagenomics
Functional analysis using EBI Metagenomics Contents Tutorial information... 2 Tutorial learning objectives... 2 An introduction to functional analysis using EMG... 3 What are protein signatures?... 3 Assigning
More informationMetagenome Analysis With MG- RAST
Metagenome Analysis With MG- RAST Folker Meyer, PhD Argonne National Laboratory and University of Chicago http://metagenomics.anl.gov Palm Springs, March 2013 Acknowledgements Team: Dion Antonopoulos Daniela
More informationPhylogenetic methods for taxonomic profiling
Phylogenetic methods for taxonomic profiling Siavash Mirarab University of California at San Diego (UCSD) Joint work with Tandy Warnow, Nam-Phuong Nguyen, Mike Nute, Mihai Pop, and Bo Liu Phylogeny reconstruction
More informationOMNIgene GUT stabilizes the microbiome profile at ambient temperature for 60 days and during transport
OMNIgene GUT stabilizes the microbiome profile at ambient temperature for 60 days and during transport Evgueni Doukhanine, Anne Bouevitch, Ashlee Brown, Jessica Gage LaVecchia, Carlos Merino and Lindsay
More informationInfectious Disease Omics
Infectious Disease Omics Metagenomics Ernest Diez Benavente LSHTM ernest.diezbenavente@lshtm.ac.uk Course outline What is metagenomics? In situ, culture-free genomic characterization of the taxonomic and
More informationDavid Jacob Meltzer m. Supervisor: Dr. Umer Zeeshan Ijaz
AMPLIpyth: A Python Pipeline for Amplicon Processing David Jacob Meltzer 0803837m MSc Bioinformatics, Polyomics and Systems Biology Supervisor: Dr. Umer Zeeshan Ijaz A report submitted in partial fulfillment
More informationA comparison of sequencing platforms and bioinformatics pipelines for compositional analysis of the gut microbiome
Allali et al. BMC Microbiology (2017) 17:194 DOI 10.1186/s12866-017-1101-8 RESEARCH ARTICLE Open Access A comparison of sequencing platforms and bioinformatics pipelines for compositional analysis of the
More informationIntroduction to OTU Clustering. Susan Huse August 4, 2016
Introduction to OTU Clustering Susan Huse August 4, 2016 What is an OTU? Operational Taxonomic Units a.k.a. phylotypes a.k.a. clusters aggregations of reads based only on sequence similarity, independent
More informationSupplementary Figure and Table Legends
1 Supplementary Figure and Table Legends Figure S1: Whole-animal metabolic analysis. 12 week old WT and Dvl1 / were singly housed in CLAMS cages (Comprehensive Laboratory Animals Monitoring System) for
More informationEnabling reproducible data analysis for metagenomics. eresearch Africa Conference 2017 Gerrit Botha CBIO H3ABioNet 3 May 2017
Enabling reproducible data analysis for metagenomics eresearch Africa Conference 2017 Gerrit Botha CBIO H3ABioNet 3 May 2017 Outline 16S rrna analysis Current CBIO 16S rrna analysis setup H3ABioNet hackathon
More informationCDC s Advanced Molecular Detection (AMD) Sequence Data Analysis and Management
CDC s Advanced Molecular Detection (AMD) Sequence Data Analysis and Management Scott Sammons Technology Officer Office of Advanced Molecular Detection National Center for Emerging and Zoonotic Infectious
More informationLeonardo Mariño-Ramírez, PhD NCBI / NLM / NIH. BIOL 7210 A Computational Genomics 2/18/2015
Leonardo Mariño-Ramírez, PhD NCBI / NLM / NIH BIOL 7210 A Computational Genomics 2/18/2015 The $1,000 genome is here! http://www.illumina.com/systems/hiseq-x-sequencing-system.ilmn Bioinformatics bottleneck
More informationBioinformatic Suggestions on MiSeq-Based Microbial Community S
J. Microbiol. Biotechnol. (2015), 25(6), 765 770 http://dx.doi.org/10.4014/jmb.1409.09057 Review Research Article jmb Bioinformatic Suggestions on MiSeq-Based Microbial Community S Analysis Tatsuya Unno*
More informationBioinformatic tools for metagenomic data analysis
Bioinformatic tools for metagenomic data analysis MEGAN - blast-based tool for exploring taxonomic content MG-RAST (SEED, FIG) - rapid annotation of metagenomic data, phylogenetic classification and metabolic
More informationEvaluating the accuracy of amplicon-based microbiome computational pipelines on simulated human gut microbial communities
Golob et al. BMC Bioinformatics (2017) 18:283 DOI 10.1186/s12859-017-1690-0 RESEARCH ARTICLE Evaluating the accuracy of amplicon-based microbiome computational pipelines on simulated human gut microbial
More informationMETAGENOMICS. Aina Maria Mas Calafell Genomics
METAGENOMICS Aina Maria Mas Calafell Genomics Introduction Microbial communities Primary role in biogeochemical systems Study of microbial communities 1.- Culture-based methodologies Only isolated microbes
More informationSanger vs Next-Gen Sequencing
Tools and Algorithms in Bioinformatics GCBA815/MCGB815/BMI815, Fall 2017 Week-8: Next-Gen Sequencing RNA-seq Data Analysis Babu Guda, Ph.D. Professor, Genetics, Cell Biology & Anatomy Director, Bioinformatics
More informationMICROBIOME SOFTWARE: END OF BEGINNING.
MICROBIOME SOFTWARE: END OF BEGINNING. DR. CHARLES ROBERTSON DIVISION OF INFECTIOUS DISEASES, UNIVERSITY OF COLORADO SCHOOL OF MEDICINE DR. DANIEL N. FRANK, DIVISION OF INFECTIOUS DISEASES, SCHOOL OF MEDICINE
More informationMicrobially Mediated Plant Salt Tolerance and Microbiome based Solutions for Saline Agriculture
Microbially Mediated Plant Salt Tolerance and Microbiome based Solutions for Saline Agriculture Contents Introduction Abiotic Tolerance Approaches Reasons for failure Roots, microorganisms and soil-interaction
More informationUsing Rule Induction to Elucidate Co-Occurrence Patterns in Microbial Data. K. Kumar Thurimella. A thesis submitted to the
Using Rule Induction to Elucidate Co-Occurrence Patterns in Microbial Data by K. Kumar Thurimella A thesis submitted to the University of Colorado in partial fulfillment of the requirements for the degree
More informationWelcome to the NGS webinar series
Welcome to the NGS webinar series Webinar 1 NGS: Introduction to technology, and applications NGS Technology Webinar 2 Targeted NGS for Cancer Research NGS in cancer Webinar 3 NGS: Data analysis for genetic
More informationMicrobial Diversity and Assessment (III) Spring, 2007 Guangyi Wang, Ph.D. POST103B
Microbial Diversity and Assessment (III) Spring, 2007 Guangyi Wang, Ph.D. POST103B guangyi@hawaii.edu http://www.soest.hawaii.edu/marinefungi/ocn403webpage.htm Overview of Last Lecture Taxonomy (three
More informationBiochemistry 412. New Strategies, Technologies, & Applications For DNA Sequencing. 12 February 2008
Biochemistry 412 New Strategies, Technologies, & Applications For DNA Sequencing 12 February 2008 Note: Scale is wrong!! (at least for sequences) 10 6 In 1980, the sequencing cost per finished bp $1.00
More informationGenomics and High Performance Computing. Folker Meyer Argonne National Laboratory and University of Chicago
Genomics and High Performance Computing Folker Meyer and University of Chicago Brief intro: I am a computer scientist turned computational biologist My CS friends tell me I am a biologist My BIO friends
More informationChapter 12: Human Microbiome Analysis
Education Chapter 12: Human Microbiome Analysis Xochitl C. Morgan 1, Curtis Huttenhower 1,2 * 1 Department of Biostatistics, Harvard School of Public Health, Boston, Massachusetts, United States of America,
More informationData Analysis with CASAVA v1.8 and the MiSeq Reporter
Data Analysis with CASAVA v1.8 and the MiSeq Reporter Eric Smith, PhD Bioinformatics Scientist September 15 th, 2011 2010 Illumina, Inc. All rights reserved. Illumina, illuminadx, Solexa, Making Sense
More informationHuman-microbe mutualism: stability and resilience in health and disease
Human-microbe mutualism: stability and resilience in health and disease David A. Relman, Stanford University IOM Forum on Microbial Threats March 7, 2012 Our extended self : human-microbe mutualism (Based
More informationQuality assessment and control of sequence data. Naiara Rodríguez-Ezpeleta
Quality assessment and control of sequence data Naiara Rodríguez-Ezpeleta Workshop on Genomics 2014 Quality control is important Some of the artefacts/problems that can be detected with QC Sequencing Sequence
More informationRIPTIDE HIGH THROUGHPUT RAPID LIBRARY PREP (HT-RLP)
Application Note: RIPTIDE HIGH THROUGHPUT RAPID LIBRARY PREP (HT-RLP) Introduction: Innovations in DNA sequencing during the 21st century have revolutionized our ability to obtain nucleotide information
More informationProkaryotic Diversity of the Wastewater Outfalls, Reefs, and Inlets of Broward County
Nova Southeastern University NSUWorks Theses and Dissertations HCNSO Student Work 5-1-2014 Prokaryotic Diversity of the Wastewater Outfalls, Reefs, and Inlets of Broward County Alexandra Mandina Campbell
More informationFrom Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow
From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow Technical Overview Import VCF Introduction Next-generation sequencing (NGS) studies have created unanticipated challenges with
More informationSHAMAN : SHiny Application for Metagenomic ANalysis
SHAMAN : SHiny Application for Metagenomic ANalysis Stevenn Volant, Amine Ghozlane Hub Bioinformatique et Biostatistique C3BI, USR 3756 IP CNRS Biomics CITECH Ribosome ITS (1) : located between 18S and
More information16S rrna gene pyrosequencing of reference and clinical samples and investigation of the temperature stability of microbiome profiles
Hang et al. Microbiome 2014, 2:31 METHODOLOGY Open Access 16S rrna gene pyrosequencing of reference and clinical samples and investigation of the temperature stability of microbiome profiles Jun Hang 1*,
More informationELE4120 Bioinformatics. Tutorial 5
ELE4120 Bioinformatics Tutorial 5 1 1. Database Content GenBank RefSeq TPA UniProt 2. Database Searches 2 Databases A common situation for alignment is to search through a database to retrieve the similar
More informationWhy learn sequence database searching? Searching Molecular Databases with BLAST
Why learn sequence database searching? Searching Molecular Databases with BLAST What have I cloned? Is this really!my gene"? Basic Local Alignment Search Tool How BLAST works Interpreting search results
More informationImproved taxonomic assignment of human intestinal 16S rrna sequences by a dedicated reference database
Ritari et al. BMC Genomics (2015) 16:1056 DOI 10.1186/s12864-015-2265-y RESEARCH ARTICLE Open Access Improved taxonomic assignment of human intestinal 16S rrna sequences by a dedicated reference database
More informationDNA extraction protocols cause differences in 16S rrna amplicon sequencing efficiency but not in community profile composition or structure
DNA extraction protocols cause differences in 16S rrna amplicon sequencing efficiency but not in community profile composition or structure The Harvard community has made this article openly available.
More informationNext-Generation Sequencing. Technologies
Next-Generation Next-Generation Sequencing Technologies Sequencing Technologies Nicholas E. Navin, Ph.D. MD Anderson Cancer Center Dept. Genetics Dept. Bioinformatics Introduction to Bioinformatics GS011062
More informationIntroduction. Jullien M. Flynn 1, Emily A. Brown 1,2,Frederic J. J. Chain 1, Hugh J. MacIsaac 2 & Melania E. Cristescu 1. Abstract
Toward accurate molecular identification of species in complex environmental samples: testing the performance of sequence filtering and clustering methods Jullien M. Flynn 1, Emily A. Brown 1,2,Frederic
More informationM1D2: Diagnostic Primer Design 2/10/15
M1D2: Diagnostic Primer Design 2/10/15 Announcements 1. Expanded office hours for this week: Wednesday, 3-5pm in 16-319 Friday, 3-5pm in 16-319 Sunday, 3-5pm in 16-319 2. Weekly office hours (starting
More informationscgem Workflow Experimental Design Single cell DNA methylation primer design
scgem Workflow Experimental Design Single cell DNA methylation primer design The scgem DNA methylation assay uses qpcr to measure digestion of target loci by the methylation sensitive restriction endonuclease
More informationIntegrating Evolutionary, Ecological and Statistical Approaches to Metagenomics. A proposal to the Gordon and Betty Moore Foundation
Integrating Evolutionary, Ecological and Statistical Approaches to Metagenomics A proposal to the Gordon and Betty Moore Foundation Jonathan A. Eisen University of California, Davis U. C. Davis Genome
More informationTargeted Sequencing Using Droplet-Based Microfluidics. Keith Brown Director, Sales
Targeted Sequencing Using Droplet-Based Microfluidics Keith Brown Director, Sales brownk@raindancetech.com Who we are: is a Provider of Microdroplet-based Solutions The Company s RainStorm TM Technology
More informationKristin Tweel, PhD, MBA
Kristin Tweel, PhD, MBA Company Overview: Not-for-profit founded in 2000, enabled over $70M to date Identify areas where genomics and other omics can help Connect industry and academia Help identify and
More informationRESEARCH INSTITUTION: : BASELINE AND OIL SPILL IMPACTED MARINE SPONGE MICROBIAL COMMUNITIES AND GENE EXPRESSION ANALYSIS WITH METAGENOMICS
RESEARCH INSTITUTION: : BASELINE AND OIL SPILL IMPACTED MARINE SPONGE MICROBIAL COMMUNITIES AND GENE EXPRESSION ANALYSIS WITH METAGENOMICS Jose V Lopez 1, Rebecca Vega Thurber, Peter McCarthy, Patricia
More informationRNA-seq Data Analysis
Lecture 3. Clustering; Function/Pathway Enrichment analysis RNA-seq Data Analysis Qi Sun Bioinformatics Facility Biotechnology Resource Center Cornell University Lecture 1. Map RNA-seq read to genome Lecture
More informationMicrobiome analysis of skin undergoing acne treatments
Microbiome analysis of skin undergoing acne treatments Groups Sample size Time points Head Site Code Healthy, No treatment Acne, Receiving Spironolactone 4 0 2 0,1 Forehead Cheek Nose Chin Fh Ck No Ch
More informationIntrons early. Introns late
Introns early Introns late Self splicing RNA are an example for catalytic RNA that could have been present in RNA world. There is little reason to assume that the RNA world was not plagued by self-splicing
More informationKorilog. high-performance sequence similarity search tool & integration with KNIME platform. Patrick Durand, PhD, CEO. BIOINFORMATICS Solutions
KLAST high-performance sequence similarity search tool & integration with KNIME platform Patrick Durand, PhD, CEO Sequence analysis big challenge DNA sequence... Context 1. Modern sequencers produce huge
More informationSupplementary Information
Supplementary Information Title: Fat binding capacity and modulation of the gut microbiota both determine the effect of wheat bran fractions on adiposity Francesco Suriano 1,*, Laure B. Bindels 1,*, Joran
More informationarxiv: v1 [q-bio.gn] 25 Nov 2015
MetaScope - Fast and accurate identification of microbes in metagenomic sequencing data Benjamin Buchfink 1, Daniel H. Huson 1,2 & Chao Xie 2,3 arxiv:1511.08753v1 [q-bio.gn] 25 Nov 2015 1 Department of
More informationFinding Biology in the Human Microbiome. George Weinstock
Finding Biology in the Human Microbiome George Weinstock What s next for the Human Microbiome? George Weinstock Metagenomics Unfolds You are here Setting Up Descriptive Phase Hypothesis Testing Metagenomics
More informationRecent urbanization in China is correlated with a Westernized microbiome encoding increased virulence and antibiotic resistance genes
Winglee et al. Microbiome (2017) 5:121 DOI 10.1186/s40168-017-0338-7 RESEARCH Open Access Recent urbanization in China is correlated with a Westernized microbiome encoding increased virulence and antibiotic
More informationEuropean Union Reference Laboratory for Genetically Modified Food and Feed (EURL GMFF)
Guideline for the submission of DNA sequences derived from genetically modified organisms and associated annotations within the framework of Directive 2001/18/EC and Regulation (EC) No 1829/2003 European
More informationAssigning Sequences to Taxa CMSC828G
Assigning Sequences to Taxa CMSC828G Outline Objective (1 slide) MEGAN (17 slides) SAP (33 slides) Conclusion (1 slide) Objective Given an unknown, environmental DNA sequence: Make a taxonomic assignment
More informationPredictive functional profiling of microbial communities using 16S rrna marker gene sequences
Predictive functional profiling of microbial communities using 16S rrna marker gene sequences The Harvard community has made this article openly available. Please share how this access benefits you. Your
More informationPlan, Deploy and Configure Microsoft InTune
Plan, Deploy and Configure Microsoft InTune 5 Day Course AUDIENCE IT Pros that have experience with Windows 10 use, deployment and management Experience with any optional ios or Android devices. FORMAT
More informationHow much sequencing do I need? Emily Crisovan Genomics Core
How much sequencing do I need? Emily Crisovan Genomics Core How much sequencing? Three questions: 1. How much sequence is required for good experimental design? 2. What type of sequencing run is best?
More informationTurning Customers into Marketers Kim Johnston, VP of Marketing, Parallels Emily Johnson, Account Director, Banyan Branch
Turning Customers into Marketers Kim Johnston, VP of Marketing, Parallels Emily Johnson, Account Director, Banyan Branch 3 Key Messages 1. Your enthusiastic customers (aka Advocates ) are your best marketers
More informationDe Novo Assembly of High-throughput Short Read Sequences
De Novo Assembly of High-throughput Short Read Sequences Chuming Chen Center for Bioinformatics and Computational Biology (CBCB) University of Delaware NECC Third Skate Genome Annotation Workshop May 23,
More information(SHOTGUN) METAGENOMICS. Hélène Touzet, CNRS, CRIStAL
(SHOTGUN) METAGENOMICS Hélène Touzet, CNRS, CRIStAL helene.touzet@univ-lille.fr Shotgun sequencing for community samples Metagenomics potentially sequences all fragmented DNA in a community includes all
More informationMetagenomic Analysis in Human- Associated Projects
Metagenomic Analysis in Human- Associated Projects Wikimedia Commons Wikimedia Commons Daniel H. Huson Singapore Center for Environmental Life Science Engineering (SCELSE) ZBIT Center for Bioinformatics
More informationExploring Microbial Diversity and Taxonomy Using SSU rrna Hypervariable Tag Sequencing
Exploring Microbial Diversity and Taxonomy Using SSU rrna Hypervariable Tag Sequencing Susan M. Huse 1, Les Dethlefsen 2, Julie A. Huber 1, David Mark Welch 1, David A. Relman 2,3,4, Mitchell L. Sogin
More informationIntroduction to NGS Analysis Tools
National Center for Emerging and Zoonotic Infectious Diseases Introduction to NGS Analysis Tools Heather Carleton, PhD, MPH Team Lead, Enteric Diseases Bioinformatics, Enteric Diseases Laboratory Branch,
More informationMicrobial Biogeography of Public Restroom Surfaces
Microbial Biogeography of Public Restroom Surfaces Gilberto E. Flores 1, Scott T. Bates 1, Dan Knights 2, Christian L. Lauber 1, Jesse Stombaugh 3, Rob Knight 3,4, Noah Fierer 1,5 * 1 Cooperative Institute
More informationForest soil bacterial community analysis using high-throughput amplicon sequencing
DISSERTATIONES TECHNOLOGIAE CIRCUMIECTORIUM UNIVERSITATIS TARTUENSIS 27 JENS-KONRAD PREEM Forest soil bacterial community analysis using high-throughput amplicon sequencing 1 DISSERTATIONES TECHNOLOGIAE
More informationWater Quality and Waller Creek Dr. Kinney & UTBIOME Collaborators. What is in Waller Creek? A Wide Variety of Biota!
Water Quality and Waller Creek Dr. Kinney & UTBIOME Collaborators The Visible & The Invisible What is in Waller Creek? A Wide Variety of Biota! Yellow crowned Night Heron at 24th Street Bridge June 2003
More informationNEXT-GENERATION SEQUENCING AND BIOINFORMATICS
NEXT-GENERATION SEQUENCING AND BIOINFORMATICS Moore's law: the number of transistors in a dense integrated circuit doubles every two years Moore's law calculates and predicts the pace of improvement of
More information6 Keys to SharePoint User Adoption.
6 Keys to SharePoint User Adoption http://www.dmcinfo.com The key to SharePoint success has nothing to do with workflows or customizations. The most critical aspect of implementing this powerful tool is
More informationFiles for this Tutorial: All files needed for this tutorial are compressed into a single archive: [BLAST_Intro.tar.gz]
BLAST Exercise: Detecting and Interpreting Genetic Homology Adapted by W. Leung and SCR Elgin from Detecting and Interpreting Genetic Homology by Dr. J. Buhler Prequisites: None Resources: The BLAST web
More informationAP BIOLOGY. Investigation #2 Mathematical Modeling: Hardy-Weinberg. Slide 1 / 35. Slide 2 / 35. Slide 3 / 35. Investigation #2: Mathematical Modeling
New Jersey Center for Teaching and Learning Slide 1 / 35 Progressive Science Initiative This material is made freely available at www.njctl.org and is intended for the non-commercial use of students and
More informationOutline. Evolution. Adaptive convergence. Common similarity problems. Chapter 7: Similarity searches on sequence databases
Chapter 7: Similarity searches on sequence databases All science is either physics or stamp collection. Ernest Rutherford Outline Why is similarity important BLAST Protein and DNA Interpreting BLAST Individualizing
More informationSynthetic spike-in standards for high-throughput 16S rrna gene amplicon sequencing
Published online 15 December 2016 Nucleic Acids Research, 2017, Vol. 45, No. 4 e23 doi: 10.1093/nar/gkw984 Synthetic spike-in standards for high-throughput 16S rrna gene amplicon sequencing Dieter M. Tourlousse,
More informationTheory and Application of Multiple Sequence Alignments
Theory and Application of Multiple Sequence Alignments a.k.a What is a Multiple Sequence Alignment, How to Make One, and What to Do With It Brett Pickett, PhD History Structure of DNA discovered (1953)
More informationAP BIOLOGY. Investigation #3 Comparing DNA Sequences to Understand Evolutionary Relationships with BLAST. Slide 1 / 32. Slide 2 / 32.
New Jersey Center for Teaching and Learning Slide 1 / 32 Progressive Science Initiative This material is made freely available at www.njctl.org and is intended for the non-commercial use of students and
More informationBasic Bioinformatics: Homology, Sequence Alignment,
Basic Bioinformatics: Homology, Sequence Alignment, and BLAST William S. Sanders Institute for Genomics, Biocomputing, and Biotechnology (IGBB) High Performance Computing Collaboratory (HPC 2 ) Mississippi
More informationA proposal to the Gordon and Betty Moore Foundation
INTEGRATING EVOLUTIONARY, ECOLOGICAL AND STATISTICAL APPROACHES TO METAGENOMICS A proposal to the Gordon and Betty Moore Foundation Jonathan A. Eisen University of California, Davis U. C. Davis Genome
More informationT he diverse microbial communities that dwell in the human body are linked intimately with aspects of host
SUBJECT AREAS: BIOINFORMATICS COMPUTATIONAL BIOLOGY ENVIRONMENTAL MICROBIOLOGY BIODIVERSITY Received 14 June 2011 Accepted 7 November 2011 Published 25 November 2011 Correspondence and requests for materials
More informationNovel bacterial taxa in the human microbiome
Washington University School of Medicine Digital Commons@Becker Open Access Publications 2012 Novel bacterial taxa in the human microbiome Kristine M. Wylie Washington University School of Medicine in
More informationSupplemental Information. Temperature-Phased Conversion of Acid. Whey Waste Into Medium-Chain Carboxylic. Acids via Lactic Acid: No External e-donor
JOUL, Volume 2 Supplemental Information Temperature-Phased Conversion of Acid Whey Waste Into Medium-Chain Carboxylic Acids via Lactic Acid: No External e-donor Jiajie Xu, Jiuxiao Hao, Juan J.L. Guzman,
More informationALGORITHMS IN BIO INFORMATICS. Chapman & Hall/CRC Mathematical and Computational Biology Series A PRACTICAL INTRODUCTION. CRC Press WING-KIN SUNG
Chapman & Hall/CRC Mathematical and Computational Biology Series ALGORITHMS IN BIO INFORMATICS A PRACTICAL INTRODUCTION WING-KIN SUNG CRC Press Taylor & Francis Group Boca Raton London New York CRC Press
More informationCloudLCA: finding the lowest common ancestor in metagenome analysis using cloud computing
Protein Cell 2012, 3(2): 148 152 DOI 10.1007/s13238-012-2015-8 RESEARCH ARTICLE CloudLCA: finding the lowest common ancestor in metagenome analysis using cloud computing Guoguang Zhao 1,4*, Dechao Bu 1,4*,
More informationIntroduction to Bioinformatics and Gene Expression Technologies
Introduction to Bioinformatics and Gene Expression Technologies Utah State University Fall 2017 Statistical Bioinformatics (Biomedical Big Data) Notes 1 1 Vocabulary Gene: hereditary DNA sequence at a
More informationBarcoded primers used in multiplex amplicon pyrosequencing bias amplification
AEM Accepts, published online ahead of print on 2 September 2011 Appl. Environ. Microbiol. doi:10.1128/aem.05220-11 Copyright 2011, American Society for Microbiology and/or the Listed Authors/Institutions.
More informationBioinformatics and computational tools
Bioinformatics and computational tools Etienne P. de Villiers (PhD) International Livestock Research Institute Nairobi, Kenya International Livestock Research Institute Nairobi, Kenya ILRI works at the
More informationAssessing and Improving Methods Used in Operational Taxonomic Unit-Based Approaches for 16S rrna Gene Sequence Analysis
APPLIED AND ENVIRONMENTAL MICROBIOLOGY, May 2011, p. 3219 3226 Vol. 77, No. 10 0099-2240/11/$12.00 doi:10.1128/aem.02810-10 Copyright 2011, American Society for Microbiology. All Rights Reserved. Assessing
More informationHLA and Next Generation Sequencing it s all about the Data
HLA and Next Generation Sequencing it s all about the Data John Ord, NHSBT Colindale and University of Cambridge BSHI Annual Conference Manchester September 2014 Introduction In 2003 the first full public
More informationLast Update: 12/31/2017. Recommended Background Tutorial: An Introduction to NCBI BLAST
BLAST Exercise: Detecting and Interpreting Genetic Homology Adapted by T. Cordonnier, C. Shaffer, W. Leung and SCR Elgin from Detecting and Interpreting Genetic Homology by Dr. J. Buhler Recommended Background
More informationExperimental design and quantitative analysis of microbial community multiomics
Mallick et al. Genome Biology (2017) 18:228 DOI 10.1186/s13059-017-1359-z REVIEW Experimental design and quantitative analysis of microbial community multiomics Himel Mallick 1,2, Siyuan Ma 1,2, Eric A.
More informationStudent Learning Outcomes (SLOS)
Student Learning Outcomes (SLOS) KNOWLEDGE AND LEARNING SKILLS USE OF KNOWLEDGE AND LEARNING SKILLS - how to use Annhyb to save and manage sequences - how to use BLAST to compare sequences - how to get
More informationProduct presentation. Fujitsu HPC Gateway SC 16. November Copyright 2016 FUJITSU
Product presentation Fujitsu HPC Gateway SC 16 November 2016 0 Copyright 2016 FUJITSU In Brief: HPC Gateway Highlights 1 Copyright 2016 FUJITSU Convergent Stakeholder Needs HPC GATEWAY Intelligent Application
More information