Comparative genomics of clinical isolates of Pseudomonas fluorescens, including the discovery of a novel disease-associated subclade.

Similar documents
Metagenomics Computational Genomics

Enrichment of the lung microbiome with gut bacteria in sepsis and the acute respiratory distress syndrome

Supplementary Materials

Carl Woese. Used 16S rrna to developed a method to Identify any bacterium, and discovered a novel domain of life

Carl Woese. Used 16S rrna to develop a method to Identify any bacterium, and discovered a novel domain of life

Dr. Sabri M. Naser Department of Biology and Biotechnology An-Najah National University Nablus, Palestine

Cystic Fibrosis and anaerobes. Dr Michael Tunney, Clinical & Practice Research Group, School of Pharmacy, Queen s University Belfast.

Name: Ally Bonney. Date: January 29, 2015 February 24, Purpose

NEW DRUG FORMULATIONS TO BOOST FIGHT AGAINST RESPIRATORY ILLNESSES AND ANTIBIOTIC-RESISTANT SUPERBUGS

Isolation and Characterization of Two Antibiotic-Producing Bacteria

Infectious Disease Omics

Abstract... i. Committee Membership... iii. Foreword... vii. 1 Scope Introduction Standard Precautions References...

Contents. 1 Basic Molecular Microbiology of Bacteria... 1 Exp. 1.1 Isolation of Genomic DNA Introduction Principle...

Whole Genome Sequence Data Quality Control and Validation

Title: Genome sequence of lineage III Listeria monocytogenes strain HCC23

Detection of Mobile Genetic Elements (MGEs) in Bacterial Genomes

They are similar to one another but different from other species: They are capable of breeding: Artificial classification: Natural classification:

CBC Data Therapy. Metagenomics Discussion

Stay or Go: A Study on Oxygen Tension on the Biofilm Formation of Cystic Fibrosis Bacteria. Honors Project. In fulfillment of the Requirements for

History of the CFTR chase

Applications of Next Generation Sequencing in Metagenomics Studies

GREG GIBSON SPENCER V. MUSE

Microbiota and What the Clinical Gastroenterologist Needs to Know

The role of PHE s AMRHAI Reference Unit

Practical Bioinformatics for Life Scientists. Week 14, Lecture 27. István Albert Bioinformatics Consulting Center Penn State

Copyright is owned by the Author of the thesis. Permission is given for a copy to be downloaded by an individual for the purpose of research and

The core genome and beyond: comparative bacterial pathogenomics and functional gene analysis in foodborne pathogens

Next G eneration Generation Microbial Microbial Genomics : The H uman Human Microbiome P roject Project George Weinstock

The Human Microbiome: A New Paradigm? John Huss Department of Philosophy The University of Akron

Just the Facts: A Basic Introduction to the Science Underlying NCBI Resources

Identifying Bacteriophages in Metagenomic Data Sets

THE HUMAN MICROBIOME: RECENT DISCOVERIES AND APPLICATIONS TO MEDICINE

Microbially Mediated Plant Salt Tolerance and Microbiome based Solutions for Saline Agriculture

Chapter 10 Microbial Genetics: New Genes for Old Germs

Microbial Community Assembly and Dynamics:.from AMD biofilms to colonization of the premature infant gut

Functional annotation of metagenomes

BIOINFORMATICS TO ANALYZE AND COMPARE GENOMES

Faculty Mentor, Department of Microbiology and Molecular Genetics, Oklahoma State University

Forecast diagnostics for antimicrobial resistance (AMR)

Supplementary Figure 1. Design of the control microarray. a, Genomic DNA from the

Probiotic Strain Isolated from the Vagina of Healthy Women

TECHNIQUES FOR STUDYING METAGENOME DATASETS METAGENOMES TO SYSTEMS.

Conducting Microbiome study, a How to guide

Comparative Bioinformatics. BSCI348S Fall 2003 Midterm 1

Using Galaxy for the analysis of NGS-derived pathogen genomes in clinical microbiology

BIO PAL Problem Set Lecture 1 (Brooker Chapter 9) Molecular Structure of DNA and RNA

Mochamad Nurcholis. Food Technology Department Agricuktural Technology Faculty Brawijaya University 2013

Day 3. Examine gels from PCR. Learn about more molecular methods in microbial ecology

Introduction to DNA-Sequencing

HMP Data Set Documentation

Multilocus Sequence Typing - MLST. Characterisation of bacteria. Hospital outbreak of resistant bacteria. Community outbreak of diarrhoea

Corynebacterium pseudotuberculosis genome sequencing: Final Report

MAKING WHOLE GENOME ALIGNMENTS USABLE FOR BIOLOGISTS. EXAMPLES AND SAMPLE ANALYSES.

Biology 100. ALE #10. From Gene to Protein and Biotechnology Practice Problems DNA

N.H. Sea Grant Research Project Progress Report

Biology 4100 Minor Assignment 1 January 19, 2007

Table 2. Participating Faculty Members

NGS part 2: applications. Tobias Österlund

Microbiome: Metagenomics 4/4/2018

Providing clear solutions to microbiological challenges TM. cgmp/iso CLIA. Polyphasic Microbial Identification & DNA Fingerprinting

Antibiotic Resistance Genes: From The Farm To The Human Gut

Game Plan. Lecture. Lab. Growth Curve. Biofilms Review of basic genetics Bacterial gene structure Gene regulation Mutations

Genome Sequence Assembly

An introduction into 16S rrna gene sequencing analysis. Stefan Boers

Genetics: Analysis of Genes and Genomes, Ninth Edition. Jones & Bartlett Learning, LLC NOT FOR SALE OR DISTRIBUTION

Mini-Symposium MICROBIOTA. Free. Meet the speaker. 14. November :30 19:00 Bohnenkamp Haus. Everyone welcome. Sponsored by:

Antonio Del Casale. Microbion srl Verona University Spin-off - Italy

Lesson Overview Identifying the Substance of Genes

MicroSEQ TM ID Rapid Microbial Identification System:

Game Plan. Lecture. Lab. Biofilms Review of basic genetics Bacterial gene structure Gene regulation Mutations

Tapping the Potentials of Lichens Experimental Lichenology. The way ahead

Jan Company Introduction. Microbiome Drug Development Europe Assaf Oron, CBO

March Company Introduction. Microbiome R&D and Business Collaboration Forum: Europe Myriam Golembo, VP Development

Introduction to Bioinformatics

DOCTOR OF PHILOSOPHY PROGRAM IN ORAL BIOLOGY

ISOLATION AND MOLECULAR CHARACTERIZATION OF THERMOTOLERANT BACTERIA FROM MANIKARAN THERMAL SPRING OF HIMACHAL PRADESH, INDIA ABSTRACT

Microbiomes and metabolomes

Characterization and distribution of the Flavobacterium psychrophilum phage-host system from local to global scales AQUAPHAGE WP5

Why study sequence similarity?

Microbiomics I August 24th, Introduction. Robert Kraaij, PhD Erasmus MC, Internal Medicine

SAMPLE. Interpretive Criteria for Identification of Bacteria and Fungi by Targeted DNA Sequencing

Microbiome Analysis in Kawasaki Disease. Kristine M. Wylie, Susan C. Baker, George M. Weinstock, Stanford T. Shulman, and Anne H.

Challenges and opportunities for whole genome sequencing based surveillance of antibiotic resistance

I nternet Resources for Bioinformatics Data and Tools

Community-assisted genome annotation: The Pseudomonas example. Geoff Winsor, Simon Fraser University Burnaby (greater Vancouver), Canada

Introduction to Bioinformatics

Regents Review Assignment #4-JA10 Living Environment: Comet Part A Questions

Using New ThiNGS on Small Things. Shane Byrne

Bioinformatics for Microbial Biology

NATIONAL CENTER FOR CASE STUDY TEACHING IN SCIENCE

Supplemental Information. Cooperating Commensals Restore. Colonization Resistance to Vancomycin-Resistant Enterococcus faecium

11/19/2018 1:59 PM Approved (Changed Course) MICRO 5 Course Outline as of Fall 2017

Outline. Evolution. Adaptive convergence. Common similarity problems. Chapter 7: Similarity searches on sequence databases

Laboratory Testing for Diagnosis and Treatment of TB

Complete nucleotide sequence of pld-tex-kl, a 66-kb plasmid of Legionella dumoffii TEX-KL strain

DNA REPLICATION & BIOTECHNOLOGY Biology Study Review

on November 21, 2018 by guest

OncoMD User Manual Version 2.6. OncoMD: Cancer Analytics Platform

Our website:

At the age of big data sequencing, what's new about the naughty and efficient microbes within the WWTPs

Transcription:

Comparative genomics of clinical isolates of Pseudomonas fluorescens, including the discovery of a novel disease-associated subclade. by Brittan Starr Scales A dissertation submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy (Microbiology and Immunology) in The University of Michigan 2015 Doctoral Committee: Professor Gary B. Huffnagle, Chair Professor John Joseph LiPuma Professor Bethany B. Moore Professor Michele S. Swanson

DEDICATION Twenty- five years ago a little girl told her mother that when she grew up she wanted to be a Know-it-all-Scientist. This dissertation is dedicated to the single mother who didn t tell her little girl that such a goal was impossible, but instead, did everything in her power to make it happen. ii

TABLE OF CONTENTS DEDICATION LIST OF FIGURES ABSTRACT v x ii CHAPTER I. Microbiology, Genomics and Clinical Significance of the Pseudomonas fluorescens Species Complex, an Unappreciated Colonizer of Humans 1 I. Figures 27 I. References 38 II. Materials and Methods 62 II. References 70 III. Multi-locus Sequence Analysis (MLSA) of newly sequenced clinical Pseudomonas spp. isolates 73 III. Figures 82 III. References 86 III. Appendix 88 IV. Comparative Genomics of Pseudomonas fluorescens Subclade III Strains from Human Lungs 91 IV. Figures 118 IV. References 129 IV. Appendix 151 V. Comparative Genomics of a Novel Human-Adapted Subclade of P. fluorescens Species Complex Bacteria That Contain a Yop/Ysc- like Type III Secretion System 164 V. Figures 184 V. References 203 V. Appendix 211 iii

VI. Chronic Inflammation in the Lower Airways Favors the Growth of P. fluorescens Species Complex Bacteria 233 VI. Figures 246 VI. References 255 VI. Appendix 260 VII. Finale 269 VII. Figure 287 VII. References 288 iv

LIST OF FIGURES Chapter I Figure I.1 Reported P. fluorescens infections 27 Figure I.2 P. fluorescens isolates cultured over an eleven-year period by the 28 University of Michigan Hospital Microbiology lab. Figure I.3 The functional range and environmental niches of the Pseudomonas 29 genus, highlighting the broad distribution of the P. fluorescens species complex. Figure I.4 Species diversity within the P. fluorescens species complex. 30 Figure I.5 Scanning electron micrograph of P. fluorescens (courtesy of Science Source). 31 Figure I.6 Characteristics of P. fluorescens complex bacteria. 32 Figure I.7 Phylogenetic tree of 38 Pseudomonas type strain, based on 33 concatenated nine gene MLST analysis. Figure I.8 Summary information of fully sequenced bacterial strains from 34 the P. fluorescens species complex. Figure I.9 Phylogenetic tree of 38 Pseudomonas type strains, based on the 35 V3-V5 region sequence of the 16S rrna gene. Figure I.10 Scanning electron micrographs of P. fluorescens biofilms. 36 Figure I.11 Type-III secretion systems in P. fluorescens. 37 Chapter III Figure III.1 Phylogenetic tree of 60 sequenced clinical Pseudomonas spp. 82 strains with six-gene MLSA scheme. Figure III.2 Phylogenetic tree with eight-gene MLSA scheme to characterize 83 P. fluorescens subclade III. v

Figure III.3 Phylogenetic tree with seven-gene MLSA scheme to characterize 84 P. fluorescens subclade IV. Figure III.4 Phylogenetic tree of 56 sequenced clinical Pseudomonas spp. 85 strains with seven-gene MLSA scheme. Appendix Figure III.1 Housekeeping genes used for screening and classifying 88 the newly sequenced Pseudomonas spp. strains. Appendix Figure III.2 Housekeeping genes in the newly sequenced 89 Pseudomonas spp. strains. Chapter IV Figure IV.1 Date, location and source of P. fluorescens strains. 118 Figure IV.2 Assembly statistics of newly sequenced P. fluorescens strains. 119 Figure IV.3 Genomic features of newly sequenced P. fluorescens strains. 120 Figure IV.4 Phenogram based on average nucleotide identity (ANI) between 121 subclade III strains. Figure IV.5 Secondary metabolite genes and gene clusters in subclade III strains. 122 Figure IV.6 Pan, accessory and core analysis of the eleven subclade III strains. 123 Figure IV.7 The czc gene cluster in environmental and clinical subclade III strains. 124 Figure IV.8 G+C content across the genomes of newly sequenced subclade III strains. 125 Figure IV.9 Alignment of the pfit amino acid sequences from subclade III strains. 126 Figure IV.10 Genes involved in Type-II, III, IV, VI and WCI secretion system in 127 subclade III strains. Figure IV.11 Neighbor-joining tree based on the amino acid sequence of 128 czca homologues. Appendix Figure IV.1 Coverage map of AU2989 assembled scaffolds and contigs. 151 Appendix Figure IV.2 The nucleotide sequence of the 16S rrna sequences was 152 vi

used to query the NCBI nucleotide collection (nr/nt). Appendix Figure IV.3 Average nucleotide identity (ANI). 153 Appendix Figure IV.4 Secondary metabolite genes and reference organisms for blast. 154 Appendix Figure IV.5 Blast results of selected GC Islands. 157 Appendix Figure IV.6 Full annotation of genes involved in secretion systems in 159 subclade III strains. Appendix Figure IV.7 Alignment of FimA homologues. 161 Appendix Figure IV.8 Alignment of VasH homologues. 162 Appendix Figure IV.9 Phylogenetic tree of VasH homologues. 163 Chapter V Figure V.1 Date, location and source P. fluorescens strains. 184 Figure V.2 Growth properties of clinical and reference Pseudomonas strains. 185 Figure V.3 Assembly statistics of clinical strains of P. fluorescens sequenced 186 in this study. Figure V.4 Subclade IV isolation across the United States 188 Figure V.5 Genome features of P. fluorescens strains in this study. 189 Figure V.6 Genome size of newly sequenced and reference Pseudomonas strains. 190 Figure V.7 % G+C content of newly sequenced and reference Pseudomonas strains. 191 Figure V.8a G+C content across the genomes of newly sequenced subclade 192 IV strains. Figure V.8b G+C content across the genomes of subclade I and II strains. 193 Figure V.9 Pan, accessory and core analysis of the ten subclade IV strains. 194 Figure V.10 PCA built on shared gene clusters between Pseudomonas strains. 195 Figure V.11 Alignment of the pfit amino acid sequences from subclade I and II 196 P. fluorescens strains. vii

Figure V.12 Alignment of I2 nucleotide sequence subclade I, II and III 197 P. fluorescens strains. Figure V.13 Number of gene homologues annotated as part of the 198 Ysc/Yop- like type-iii secretion system. Figure V.14 Amplification of the ExoU protein with P. fluorescens 199 subclade IV-specific primers. Figure V.15 ExoU and SpcU protein phylogeny in Pseudomonas strains. 200 Figure V.16 Alignment of SpcU homologues. 201 Figure V.17 Model of the type III secretion system in P. fluorescens 202 subclade IV strains. Appendix Figure V.1 Average Nucleotide Identity (ANI). 211 Appendix Figure V.2 Genome size of newly sequenced and reference 212 Pseudomonas strains. Appendix Figure V.3 % G+C content of newly sequenced and reference 213 Pseudomonas strains. Appendix Figure V.4 Nucleotide blast of selected GC Islands. 214 Appendix Figure V.5 Annotated type III secretion genes in P. fluorescens 225 subclades IV and P. aeruginosa strains. Appendix Figure V.6 Statistics of annotated type III secretion genes. 228 Appendix Figure V.7 Development of P. fluorescens subclade IV-specific 229 ExoU primers. Appendix Figure V.8 Blastp with ExoU Subclade IV protein. 230 Appendix Figure V.9 Blastp with SpcU Subclade IV protein. 232 Chapter VI Figure VI.1 Inflamed mouse lung microbiome from in-house colony. 247 viii

Figure VI.2 Inflamed mouse lung microbiome from Jackson labs. 248 Figure VI.3 Inflamed mouse tongue microbiome from in-house colony. 249 Figure VI.4 Inflamed mouse tongue microbiome from Jackson labs. 250 Figure VI.5 Relative abundance of mouse lung- associated Pseudomonas 251 OTUs in five treatment subsets. Figure VI.6 Relative abundance of human lung- associated Pseudomonas OTUs 252 in three patient subsets. Figure VI.7 Pseudomonas abundance in COPD lung explants. 253 Figure VI.8 V3 - V5 16S rrna phylogenetic tree. 254 Appendix Figure VI.1 Blast results of Pseudomonas OTUs. 260 Appendix Figure VI.2 Blast alignment of Pseudomonas OTUs against reference 264 Pseudomonas genomes. Appendix Figure VI.3 Blast and alignment of the two mouse lung-associated 267 Streptococcus OTUs. Chapter VII Figure VII.1 Core, accessory and pan genomes. 287 ix

ABSTRACT Comparative genomics of clinical isolates of Pseudomonas fluorescens, including the discovery of a novel disease-associated subclade. by Brittan Starr Scales Chair: Gary Huffnagle Abstract title: Comparative genomics of clinical isolates of Pseudomonas fluorescens, including the discovery of a novel disease-associated subclade. Taxonomically, there are over 52 Pseudomonas species that group within the Pseudomonas fluorescens species-complex. They are gram-negative bacteria (gammaproteobacteria) that have the ability to flourish in many different environmental niches due their metabolic adaptability. Prior to beginning the project described in this dissertation, we had identified (using culture-independent analysis) that the lung microbiome of patients with severe chronic obstructive pulmonary disease (COPD) contained abundant levels of P. fluorescens. This finding was highly unexpected because P. fluorescens colonization had not been previously reported in chronic respiratory disease (human or veterinary). Furthermore, there was no taxonomic or genome information on clinical P. fluorescens strains in the published literature or NCBI database. This raised the question of whether there were differences between P. fluorescens strains isolated from clinical vs. environmental sources. To begin to address this question, I sequenced a collection of Pseudomonas spp. isolates from individuals with chronic lung disease, which included 22 clinical P. fluorescens strains. Twelve of the 22 could be grouped within previously defined subclades I, II and III of the P. fluorescens species-complex. However, the other 10 strains were distinct and could be clustered as a phylogenetically unique fourth subclade, which lacked any representative genomes (clinical or environmental) in the x

NCBI database. I performed additional comparative genomic analyses on all four subclades in order to identify genomic attributes that were associated with growth in humans (vs. the environment). Subclade III clinical isolates were almost indistinguishable from environmental isolates; however, the clinical isolates contained additional homologues of genes involved in metal toxicity resistance, an attribute that has been reported for chronic antibiotic exposure, such as would occur in the cystic fibrosis lung. In sharp contrast, subclade IV strains displayed marked reductions in genome size, gene diversity and GC content, as well as containing all of the elements for a Ysc family type III secretion system that was extremely similar to that found in P. aeruginosa. Furthermore, our subclade IV strains, collected from geographic locations across the U.S., had a very high level of shared nucleotide identity and a small accessory genome, suggesting that they may have evolved from living in multi-trophic environments to a life in a much narrower niche, perhaps human airways. We also identified that some colonies of inbred mice contained indigenous P. fluorescens in their lung microbiome (subclade III) and chronic inflammation was associated with an outgrowth of P. fluorescens. Expanding my analysis to human respiratory disease, I found an increase in P. fluorescens (largely subclade III) in lung samples from multiple diseases (COPD, IPF and lung transplant) but not healthy subjects. Altogether, these studies reveal that P. fluorescens is an under-appreciated colonizer of humans, particularly in the context of pulmonary inflammation, and lay the groundwork for future studies delineating the contribution and molecular mechanisms of this host-microbe interaction. xi