Title: Genome sequence of lineage III Listeria monocytogenes strain HCC23

Similar documents
Probiotic Strain Isolated from the Vagina of Healthy Women

Genome sequence of Acinetobacter baumannii MDR-TJ

Genome sequence of Enterobacter mori type strain LMG 25706, a

Complete Genome Sequence of Bifidobacterium longum subsp. longum KACC 91563

Complete Genome Sequence of the Polycyclic Aromatic Hydrocarbon-Degrading. Bacterium Alteromonas sp. Strain SN2

ACCEPTED. Korean patient isolate in an effort to understand the prevalence, antibiotic resistance, and

Corynebacterium pseudotuberculosis genome sequencing: Final Report

Bacterial Genetics. Stijn van der Veen

CECT 5716, a probiotic strain isolated from human milk

Leonardo Mariño-Ramírez, PhD NCBI / NLM / NIH. BIOL 7210 A Computational Genomics 2/18/2015

Matthew Tinning Australian Genome Research Facility. July 2012

JB Accepts, published online ahead of print on 24 June 2011 J. Bacteriol. doi: /jb

Comparative genomics of clinical isolates of Pseudomonas fluorescens, including the discovery of a novel disease-associated subclade.

Course summary. Today. PCR Polymerase chain reaction. Obtaining molecular data. Sequencing. DNA sequencing. Genome Projects.

Gene Prediction Group

From Infection to Genbank

Contact us for more information and a quotation

Chapter 10 Genetic Engineering: A Revolution in Molecular Biology

Genome Projects. Part III. Assembly and sequencing of human genomes

Molecular Biology: DNA sequencing

Prokaryotic Annotation Pipeline SOP HGSC, Baylor College of Medicine

Applied bioinformatics in genomics

Introductie en Toepassingen van Next-Generation Sequencing in de Klinische Virologie. Sander van Boheemen Medical Microbiology

Advanced Technology in Phytoplasma Research

Using New ThiNGS on Small Things. Shane Byrne

Whole Genome Sequencing for food safety FSA Chief Scientific Advisor Report and 2013 Listeria pilot study

GENETICS - CLUTCH CH.15 GENOMES AND GENOMICS.

Supplementary Materials

Developing the CRISPR Interference System to Understand Bacterial Gene Function

Complete Genome Sequence of Pathogenic Bacterium

Carl Woese. Used 16S rrna to develop a method to Identify any bacterium, and discovered a novel domain of life

Detection of Mobile Genetic Elements (MGEs) in Bacterial Genomes

Carl Woese. Used 16S rrna to developed a method to Identify any bacterium, and discovered a novel domain of life

Bacterial Genetics. Prof. Dr. Asem Shehabi Faculty of Medicine University of Jordan

The Basics of Understanding Whole Genome Next Generation Sequence Data

How to Use This Presentation

Sequencing techniques

THE RISE OF WHOLE GENOME SEQUENCING AS A SUBTYPING TOOL FOR MICROBIAL SOURCE TRACKING: FROM FUNDAMENTALS TO APPLICATIONS

SAMPLE. Interpretive Criteria for Identification of Bacteria and Fungi by Targeted DNA Sequencing

Phenotype analysis: biological-biochemical analysis. Genotype analysis: molecular and physical analysis

Name: Ally Bonney. Date: January 29, 2015 February 24, Purpose

Next Gen Sequencing. Expansion of sequencing technology. Contents

Next Generation Sequencing Lecture Saarbrücken, 19. March Sequencing Platforms

Rapid Transcriptome Characterization for a nonmodel organism using 454 pyrosequencing

Phenotype analysis: biological-biochemical analysis. Genotype analysis: molecular and physical analysis

Module 6 Microbial Genetics. Chapter 8

DNA Sequencing. Happiness Kumburu BSU- workshop Nov, 2016

Faramarz Valafar.

Overview of Next Generation Sequencing technologies. Céline Keime

Overview of CIDT Challenges and Opportunities

TECHNIQUES FOR STUDYING METAGENOME DATASETS METAGENOMES TO SYSTEMS.

PRESENTING SEQUENCES 5 GAATGCGGCTTAGACTGGTACGATGGAAC 3 3 CTTACGCCGAATCTGACCATGCTACCTTG 5

DNA Sequencing by Ion Torrent. Marc Lavergne CHEM 4590

M I C R O B I O L O G Y WITH DISEASES BY TAXONOMY, THIRD EDITION

Next Generation Sequencing Applications in Food Safety and Quality

METAGENOMICS. Aina Maria Mas Calafell Genomics

Pathogenic organisms no thanks: Use of next generation sequencing techniques in risk assessment and HACCP

Infectious Disease Omics

European Union Reference Laboratory for Genetically Modified Food and Feed (EURL GMFF)

Whole Genome Sequence Data Quality Control and Validation

Basic Bioinformatics: Homology, Sequence Alignment,

Name: - Bio A.P. DNA Replication & Protein Synthesis

Analysis Report. Institution : Macrogen Japan Name : Macrogen Japan Order Number : 1501APB-0004 Sample Name : 8380 Type of Analysis : De novo assembly

Functional annotation of metagenomes

Genome Analysis. Bacterial genome projects

Videos. Lesson Overview. Fermentation

A Population Genetics-Based and Phylogenetic Approach to Understanding the Evolution of Virulence in the Genus Listeria

DNA amplification and analysis: minipcr TM Food Safety Lab

JB Accepts, published online ahead of print on 26 February 2010 J. Bacteriol. doi: /jb

Next Generation Sequencing. Jeroen Van Houdt - Leuven 13/10/2017

Biosafety and the NIH Guidelines

Instructions and Certifications (Failure to follow may result in a delay in processing) ORC Use Only. Date Received: IBC Determination:

Introduction to Bioinformatics analysis of Metabarcoding data

BIOLOGY 205 Midterm II - 19 February Each of the following statements are correct regarding Eukaryotic genes and genomes EXCEPT?

Bioinformatics for Microbial Biology

SUPPLEMENTARY INFORMATION

Glossary of Commonly used Annotation Terms

Supplementary Figure 1. Design of the control microarray. a, Genomic DNA from the

Validating Bionumerics 7.6: A strategic approach from Oregon

Genome Sequence Assembly

Next Generation Sequencing. Tobias Österlund

Molecular Genetics Student Objectives

Whole Genome Sequencing for Enteric Pathogen Surveillance and Outbreak Investigations

Genetics. Chapter 9 - Microbial Genetics. Chromosome. Genes. Topics - Genetics - Flow of Genetics - Regulation - Mutation - Recombination

Videos. Bozeman Transcription and Translation: Drawing transcription and translation:

The Basics of Understanding Whole Genome Next Generation Sequence Data

A Novel Approach for Rapid Identification and Sequencing of Different Bacteriocins Produced by LAB Based on a Practical Immunity Class

Presence of Region of Difference (RD) 1 among clinical isolates of. Mycobacterium tuberculosis from India. ACCEPTED

Microbiome: Metagenomics 4/4/2018

Q2 (1 point). How many carbon atoms does a glucose molecule contain?

Biosc10 schedule reminders

High Throughput Sequencing Technologies. J Fass UCD Genome Center Bioinformatics Core Monday September 15, 2014

BSCI410-Liu/Spring 06 Exam #1 Feb. 23, 06

Genome Sequencing. I: Methods. MMG 835, SPRING 2016 Eukaryotic Molecular Genetics. George I. Mias

Chapter 7. Motif finding (week 11) Chapter 8. Sequence binning (week 11)

Nuts and bolts of phage genome sequencing. the 5 5 and 5 8 perspective. Allison Johnson & Anneke Padolina

NCBI web resources I: databases and Entrez

Antisense RNA Insert Design for Plasmid Construction to Knockdown Target Gene Expression

Exam 2 Key - Spring 2008 A#: Please see us if you have any questions!

The University of California, Santa Cruz (UCSC) Genome Browser

Transcription:

JB Accepts, published online ahead of print on 20 May 2011 J. Bacteriol. doi:10.1128/jb.05236-11 Copyright 2011, American Society for Microbiology and/or the Listed Authors/Institutions. All Rights Reserved. 1 Title: Genome sequence of lineage III Listeria monocytogenes strain HCC23 2 3 Running title: Listeria monocytogenes lineage III genome 4 5 6 Chelsea L. Steele 1, Janet R. Donaldson 2, Debarati Paul 1, Michelle M. Banes 1, Tony Arick 3, Susan M. Bridges 4, and Mark L. Lawrence 1 * 7 8 9 10 11 12 13 14 College of Veterinary Medicine, Mississippi State University, Mississippi State, Mississippi, 1 Department of Biological Sciences, Mississippi State University, Mississippi State, Mississippi, 2 Life Sciences and Biotechnology Institute, Mississippi State University, Mississippi State, Mississippi, 3 Department of Computer Sciences and Engineering, Mississippi State University, Mississippi State, Mississippi 4 15 16 17 18 * Corresponding author. Mailing address: Department of Basic Sciences, College of Veterinary Medicine, Mississippi State University, Mississippi State, MS 39762. Phone: 662-325-1205. Fax: 662-325-1031. Email: lawrence@cvm.msstate.edu. 19

20 21 22 23 More than 98% of reported human listeriosis cases are caused by Listeria monocytogenes serotypes within lineages I and II. Serotypes within lineage III (4a and 4c) are commonly isolated from environmental and food specimens. We report the first complete genome sequence of a lineage III isolate, HCC23, which will be used for comparative analysis.

24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 L. monocytogenes strain HCC23 is a serotype 4a lineage III strain that was isolated from a healthy channel catfish (Erdenlig et al., 2000). It is nonpathogenic in mice, even when given by injection (Erdenlig et al., 2000; Liu et al., 2006). Here we report the genome sequence of HCC23, which is the first strain to be sequenced from lineage III. The complete genome sequence of HCC23 was determined using 454 pyrosequencing on a GS-FLX at 454 Life Sciences (Roche) to generate an assembly with 15X depth of coverage. Quality filtered sequences from whole genome shotgun sequencing were assembled using the 454 Newbler assembler. The assembly produced 23 contigs, each at least 500 bases in length. Scaffolding was done with Projector 2 (van Hijum et al., 2005) using the L. monocytogenes strain F2365 genome sequence (NC_002973). Gaps were closed by direct Sanger sequencing of PCR templates with primer walking. Assembly of 454 generated contigs and Sanger reads was done using SeqmanPro (Lasergene). Assembled bases had a Phred-equivalent quality score of 40 or above, which means that the level of accuracy was 99.99%. The DNA sequence was submitted to the JCVI Annotation Service, where it was run through JCVI's prokaryotic annotation pipeline using Annotation Engine. The manual annotation tool Manatee was downloaded from SourceForge and used to manually review the output from the prokaryotic pipeline of the JCVI Annotation Service. A homopolymer analysis was done to ensure quality of sequence. PCR amplification and sequencing was done over all homopolymer regions containing 9 bases. Only one insertion was detected in one 10 bp run. However, frameshift analysis using the Microbial Genome Submission Check at NCBI identified 38 potential frameshifts. PCR amplification and Sanger sequencing resulted in correction of 28 frameshifts; the remaining ten potential frameshifts were shown to be authentic. Of the corrected frameshifts, 27 occurred in homopolymers 5 bp

48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 (composed of A/Ts). Thirteen were single base insertions, and fourteen were single base deletions. The completed genome of strain HCC23 is 2,976,212 bp in length and has a total of 3,011 predicted coding regions. The sequence has an average G+C content of 38.2%. Similar to other L. monocytogenes strains, HCC23 has six ribosomal operons and 67 total trna genes. Synteny is highly conserved between the HCC23 genome and the genomes of EGDe (lineage I; NC_003210) and F2365 (lineage II). Many listerial virulence proteins are encoded by HCC23, including internalins A and B and proteins in the prfa locus (Kathariou, 2002). However, some virulence proteins are not encoded, such as InlC, InlH, and InlJ (Engelbrecht et al., 1996; Raffelsbauer et al., 1998; Sabet et al., 2005). Proteins involved in energy metabolism and transport in strains HCC23, F2365 and EGDe are almost identical, and carbohydrate metabolism in these three strains appears conserved. The HCC23 genome sequence will allow whole genomic comparisons across all three L. monocytogenes lineages for phylogenic classification and better understanding the evolution of L. monocytogenes. More significantly, HCC23 offers the opportunity to conduct comparative genomics between pathogenic and nonpathogenic L. monocytogenes isolates to identify potential genetic differences responsible for pathogenicity. Nucleotide sequence accession number. The completed genome has been deposited in DDBJ/EMBL/GenBank under accession #CP001175. 67

68 69 70 71 72 Acknowledgements This project was supported by USDA AFRI grant number #2007-35201-17732 and USDA Agricultural Research Service (Agreement No. 58-6202-5-083). We thank JCVI for providing the JCVI Annotation Service and the manual annotation tool Manatee. We are grateful to the Mississippi State University Life Sciences and Biotechnology Institute for hosting Manatee.

73 74 75 76 77 References Engelbrecht, F., Chun, S. K., Ochs, C., Hess, J., Lottspeich, F., Goebel, W. & Sokolovic, Z. (1996). A new PrfA-regulated gene of Listeria monocytogenes encoding a small, secreted protein which belongs to the family of internalins. Mol Microbiol 21, 823-837. 78 79 80 81 Erdenlig, S., Ainsworth, A. J. & Austin, F. W. (2000). Pathogenicity and production of virulence factors by Listeria monocytogenes isolates from channel catfish. J Food Prot 63, 613-619. 82 83 84 Kathariou, S. (2002). Listeria monocytogenes virulence and pathogenicity, a food safety perspective. J Food Prot 65, 1811-1829. 85 86 87 88 Liu, D., Lawrence, M. L., Gorski, L., Mandrell, R. E., Ainsworth, A. J. & Austin, F. W. (2006). Listeria monocytogenes serotype 4b strains belonging to lineages I and III possess distinct molecular features. J Clin Microbiol 44, 214-217. 89 90 91 92 93 Raffelsbauer, D., Bubert, A., Engelbrecht, F., Scheinpflug, J., Simm, A., Hess, J., Kaufmann, S. H. & Goebel, W. (1998). The gene cluster inlc2de of Listeria monocytogenes contains additional new internalin genes and is important for virulence in mice. Mol Gen Genet 260, 144-158. 94

95 96 97 Sabet, C., Lecuit, M., Cabanes, D., Cossart, P. & Bierne, H. (2005). LPXTG protein InlJ, a newly identified internalin involved in Listeria monocytogenes virulence. Infect Immun 73, 6912-6922. 98 99 100 101 van Hijum, S. A., Zomer, A. L., Kuipers, O. P. & Kok, J. (2005). Projector 2: contig mapping for efficient gap-closure of prokaryotic genome sequence assemblies. Nucleic Acids Res 33, W560-566. 102 103 104