Complete genome sequence of Clostridium acetobutylicum. DSM 1731, a solvent producing strain with multi-replicon

Similar documents
Genome sequence of Acinetobacter baumannii MDR-TJ

Complete Genome Sequence of Pathogenic Bacterium

Complete Genome Sequence of the Polycyclic Aromatic Hydrocarbon-Degrading. Bacterium Alteromonas sp. Strain SN2

Leonardo Mariño-Ramírez, PhD NCBI / NLM / NIH. BIOL 7210 A Computational Genomics 2/18/2015

Title: Genome sequence of lineage III Listeria monocytogenes strain HCC23

China National Grid --- BioNode. Jun Wang Beijing Genomics Institute

Introduction to Bioinformatics

Genetic Engineering for Biofuels Production

Bio-Reagent Services. Custom Gene Services. Gateway to Smooth Molecular Biology! Your Innovation Partner in Drug Discovery!

The right answer, every time

De novo genome assembly with next generation sequencing data!! "

Optimizing Pentose Utilization in Clostridia for Improved Solvents Production from Lignocellulosic Hydrolysates

Providing clear solutions to microbiological challenges TM. cgmp/iso CLIA. Polyphasic Microbial Identification & DNA Fingerprinting

Lecture-2 Principles of microbiology. الدكتورة أسماء الصالح رقم المكتب 5T201 الموقع: إيميل

What the Genome of Raffaelea lauricola Can Tell Us About Laurel Wilt

Synthetic Biology. Sustainable Energy. Therapeutics Industrial Enzymes. Agriculture. Accelerating Discoveries, Expanding Possibilities. Design.

Genome Sequence Assembly

GeneMarkS-2: Raising Standards of Accuracy in Gene Recognition

Genomics and Bioinformatics GMS6231 (3 credits)

From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow

Infectious Disease Omics

Multiple choice questions (numbers in brackets indicate the number of correct answers)

Metabolic engineering of clostridium cellulolyticum for advanced biofuel production

I nternet Resources for Bioinformatics Data and Tools

Title: High-quality genome assembly of channel catfish, Ictalurus punctatus

Construction of plant complementation vector and generation of transgenic plants

JB Accepts, published online ahead of print on 26 February 2010 J. Bacteriol. doi: /jb

Mapping strategies for sequence reads

Recombinant DNA Technology. The Role of Recombinant DNA Technology in Biotechnology. yeast. Biotechnology. Recombinant DNA technology.

UCSC Genome Browser. Introduction to ab initio and evidence-based gene finding

ELE4120 Bioinformatics. Tutorial 5

A shotgun introduction to sequence assembly (with Velvet) MCB Brem, Eisen and Pachter

Year III Pharm.D Dr. V. Chitra

Technician: Dionne Lutz, BS: Biology & MsED Office: Kanbar Center, Room 704, 41 Cooper Sq. (212) (office)

DNA Structure and Analysis. Chapter 4: Background

Solutions to Quiz II

Problem Set 8. Answer Key

Genomics and Transcriptomics of Spirodela polyrhiza

Gene Expression Technology

PI NAME: Eleftherios Papoutsakis. Department of Chemical Engineering and the Delaware Biotechnology Institute, University of Delaware

AP Biology Gene Expression/Biotechnology REVIEW

M I C R O B I O L O G Y WITH DISEASES BY TAXONOMY, THIRD EDITION

Bioinformatic analysis of phage AB3, a phikmv-like virus infecting Acinetobacter baumannii

MICROBIAL GENETICS (BIO-375/575)

Genetics Lecture 21 Recombinant DNA

PLANT MOLECULAR BIOLOGY AND GENOMICS PCB 5530 Section 4205, 3 Credits FALL 2015

Microbial Diversity and Assessment (III) Spring, 2007 Guangyi Wang, Ph.D. POST103B

O C. 5 th C. 3 rd C. the national health museum

Link between iron homeostasis, oxidative mutagenesis and antibiotic resistance evolution

NCBI web resources I: databases and Entrez

Implementation of the VITEK MS and Its Use in Microbial Identification

QIAGEN s NGS Solutions for Biomarkers NGS & Bioinformatics team QIAGEN (Suzhou) Translational Medicine Co.,Ltd

Mate-pair library data improves genome assembly

Welcome to the NGS webinar series

Genome research in eukaryotes

METAGENOMICS. Aina Maria Mas Calafell Genomics

Small Genome Annotation and Data Management at TIGR

Bio 101 Sample questions: Chapter 10

Non-Organic-Based Isolation of Mammalian microrna using Norgen s microrna Purification Kit

Introduction to Bioinformatics and Gene Expression Technologies

Genomics and Gene Recognition Genes and Blue Genes

Guangdong Province Key Laboratory of Pharmacodynamic Constituents of TCM and New Drugs Research, Jinan University, Guangzhou, , China

Butanol Fermentation from Low-Value Sugar-Based Feedstocks by Clostridia

Transcription in Eukaryotes

Introduction to Bioinformatics

DNA RNA PROTEIN. Professor Andrea Garrison Biology 11 Illustrations 2010 Pearson Education, Inc. unless otherwise noted

Protein Synthesis Transcription And Translation Lab Answers

Analysis Report. Institution : Macrogen Japan Name : Macrogen Japan Order Number : 1501APB-0004 Sample Name : 8380 Type of Analysis : De novo assembly

Outline. General principles of clonal sequencing Analysis principles Applications CNV analysis Genome architecture

Introduction to Bioinformatics

Sequence Assembly and Alignment. Jim Noonan Department of Genetics

Name Per AP: CHAPTER 27: PROKARYOTES (Bacteria) p559,

Bundle 5 Test Review

Answers to Module 1. An obligate aerobe is an organism that has an absolute requirement of oxygen for growth.

How much sequencing do I need? Emily Crisovan Genomics Core

THE BENEFITS AND USES OF MICROBES

Name Class Date. Practice Test

measuring gene expression December 5, 2017

Sequence Analysis Lab Protocol

Bioinformatics, in general, deals with the following important biological data:

Sequencing technologies. Jose Blanca COMAV institute bioinf.comav.upv.es

Chapter 8 Recombinant DNA Technology. 10/1/ MDufilho

Microbial Biotechnology agustin krisna wardani

Gene-centered resources at NCBI

Activation of a Floral Homeotic Gene in Arabidopsis

Molecular Biology: DNA sequencing

Eukaryotic Gene Structure

Microbially Mediated Plant Salt Tolerance and Microbiome based Solutions for Saline Agriculture

CHAPTER 2A HOW DO YOU BEGIN TO CLONE A GENE? CHAPTER 2A STUDENT GUIDE 2013 Amgen Foundation. All rights reserved.

DNA Transcription. Visualizing Transcription. The Transcription Process

Higher Human Biology Unit 1: Human Cells Pupils Learning Outcomes


Serial Analysis of Gene Expression

Ultrasequencing: Methods and Applications of the New Generation Sequencing Platforms

Manipulating DNA. Nucleic acids are chemically different from other macromolecules such as proteins and carbohydrates.

Genetic Algorithms-Based Model for Multi-Project Human Resource Allocation

Introduction: Methods:

A Greedy Algorithm for Minimizing the Number of Primers in Multiple PCR Experiments

Biology 142 Advanced Topics in Genetics and Molecular Biology Course Syllabus Spring 2006

Dna And Replication Study Guide Answer Key

Transcription:

JB Accepts, published online ahead of print on 8 July 2011 J. Bacteriol. doi:10.1128/jb.05596-11 Copyright 2011, American Society for Microbiology and/or the Listed Authors/Institutions. All Rights Reserved. 1 Genome Announcements 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 Complete genome sequence of Clostridium acetobutylicum DSM 1731, a solvent producing strain with multi-replicon genome architecture Guanhui Bao 1,2, Runjiang Wang 1,2, Yan Zhu 1,2, Hongjun Dong 1,2, Shaoming Mao 1, Yanping Zhang 1, Zugen Chen 4, Yin Li 1*, Yanhe Ma 1,4 1 Institute of Microbiology, Chinese Academy of Sciences, Beijing, China 2 Graduate School of Chinese Academy of Sciences, Beijing, China 3 Department of Human Genetics, School of Medicine, University of California, Los Angeles, CA 90095 4 State Key Laboratory of Microbial Resources, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China *Corresponding author: Mailing address for Yin Li: Institute of Microbiology, Chinese Academy of Sciences, No.1 West Beichen Road, Chaoyang District, Beijing 100101, China. Phone: 86-10-64807485. Fax: 86-10-64807485. E-mail: yli@im.ac.cn. Abstract Clostridium acetobutylicum is an important microorganism for solvent production. We report the complete genome sequence of C. acetobutylicum DSM 1731, a genome

25 26 27 with multi-replicon architecture. Comparison with the sequenced type strain C. acetobutylicum ATCC 824, the genome of strain DSM1731 harbors a 1.7 kb insertion and a novel 11.1 kb plasmid, which might be acquired during evolution. 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 Acetone butanol ethanol (ABE) fermentation by Clostridium acetobutylicum is one of the oldest industrial fermentations ever known, and historically ranked the second next to the ethanol fermentation by yeast. C. acetobutylicum is a low-gc-content, gram-positive, spore-forming and obligate anaerobe (2). Recent omics studies including genomics (4), transcriptomics (12, 13), proteomics (9, 10) and metabolomics (1) have greatly improved our understandings to the physiology of this species. Here we report the complete genome sequence of Clostridium acetobutylicum DSM 1731, which was obtained from the German Collection of Microorganisms and Cell Cultures (DSMZ, Braunschweig, Germany). The genome was sequenced at the Department of Human Genetics in University of California Los Angeles, utilizing the Solexa technology according to Illumina s recommendations. A total of 6,043,147 reads, counting up to 217,553,292 bases were obtained, providing a 52X coverage. Assembly was performed by AMOScmp (5) and Velvet (14), using comparative genome assembly and de novo assembly. The gaps were filled by sequencing polymerase chain reaction (PCR) fragments using ABI 3730. The complete genome of C. acetobutylicum DSM 1731 consists of a single, circle chromosome of 3,942,462 bp (30.9% GC content), two plasmids: psmba (191,996 bp, 30.9% GC content) and psmbb (11,123 bp, 25.9% GC content).

50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 Putative protein coding sequences were identified by GeneMark.hmm (8) and Glimmer (3). trnascan-se (7) was used to predict transfer RNA genes and RNAmmer 1.2 server (6) was employed to detect rrna genes. BLASTP were applied to perform annotation, followed by manual curation. The chromosome of C. acetobutylicum DSM 1731 contains 3,765 predicated protein-coding genes, 11 rrna operons and 73 trna genes, while the two plasmids contains 176 (psmba) and 9 (psmbb) protein coding genes, respectively. Functional classification of all proteins was conducted using NCBI clusters of orthologous groups (COG), where 3072, 126, and 2 COG classified genes are located on the chromosome, psmba, and psmbb, respectively. Comparing with the genome sequence of C. acetobutylicum strain ATCC 824 (11) and EA 2018 (4), the most remarkable feature of strain DSM 1731 is the presence of a 11.1 kb plasmid, psmbb, which has not been previously reported in this species. In addition, 7 out of the 9 predicted genes of psmbb are hypothetical proteins, suggesting it plays an unknown role in clostridial physiology. Besides, 345 single nucleotide variations (SNVs) were identified between the chromosomes of strains DSM 1731 and ATCC 824. The largest insertion is 1,753 bp in the position of 1,278,525-1,280,278, located within a hypothetical protein gene. Consistent with strain EA 2018, the size of the megaplamid in strain DSM 1731 (psmba) is 4 bp smaller than that of strain ATCC824. Nucleotide sequence accession numbers The complete genome sequence of Clostridium acetobutylicum DSM 1731 has been deposited in GenBank under the accession numbers between CP002660 and

75 CP002662. 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 Acknowledgements This research was supported by the National Basic Research Programme of China (973 Project, 2011CBA00807) and the CAS/SAFEA International Partnership Program for Creative Research Teams. Yin Li is supported by the Hundreds of Talents Program of the Chinese Academy of Sciences. References 1. Amador-Noguez, D., X. J. Feng, J. Fan, N. Roquet, H. Rabitz, and J. D. Rabinowitz. 2010. Systems-Level Metabolic Flux Profiling Elucidates a Complete, Bifurcated Tricarboxylic Acid Cycle in Clostridium acetobutylicum. Journal of Bacteriology 192:4452-4461. 2. Awang, G. M., G. A. Jones, and W. M. Ingledew. 1988. The Acetone-Butanol-Ethanol Fermentation. Crc Critical Reviews in Microbiology 15:S33-S67. 3. Delcher, A. L., D. Harmon, S. Kasif, O. White, and S. L. Salzberg. 1999. Improved microbial gene identification with GLIMMER. Nucleic Acids Research 27:4636-4641. 4. Hu, S., H. Zheng, Y. Gu, J. Zhao, W. Zhang, Y. Yang, S. Wang, G. Zhao, S. Yang, and W. Jiang. 2011. Comparative genomic and transcriptomic analysis revealed genetic characteristics related to solvent formation and xylose utilization in Clostridium acetobutylicum EA 2018. Bmc Genomics 12:93. 5. Kurtz, S., A. Phillippy, A. L. Delcher, M. Smoot, M. Shumway, C. Antonescu, and S. L. Salzberg. 2004. Versatile and open software for

100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 comparing large genomes. Genome Biology 5:-. 6. Lagesen, K., P. Hallin, E. A. Rodland, H. H. Staerfeldt, T. Rognes, and D. W. Ussery. 2007. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Research 35:3100-3108. 7. Lowe, T. M., and S. R. Eddy. 1997. trnascan-se: A program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Research 25:955-964. 8. Lukashin, A. V., and M. Borodovsky. 1998. GeneMark.hmm: new solutions for gene finding. Nucleic Acids Research 26:1107-1115. 9. Mao, S. M., Y. A. M. Luo, T. R. Zhang, J. S. Li, G. A. H. Bao, Y. Zhu, Z. G. Chen, Y. P. Zhang, Y. Li, and Y. H. Ma. 2010. Proteome Reference Map and Comparative Proteomic Analysis between a Wild Type Clostridium acetobutylicum DSM 1731 and its Mutant with Enhanced Butanol Tolerance and Butanol Yield. Journal of Proteome Research 9:3046-3061. 10. Mao, S. M., Y. M. Luo, G. H. Bao, Y. P. Zhang, Y. Li, and Y. H. Ma. 2011. Comparative analysis on the membrane proteome of Clostridium acetobutylicum wild type strain and its butanol-tolerant mutant. Molecular Biosystems 7:1660-1677. 11. Nolling, J., G. Breton, M. V. Omelchenko, K. S. Makarova, Q. D. Zeng, R. Gibson, H. M. Lee, J. Dubois, D. Y. Qiu, J. Hitti, Y. I. Wolf, R. L. Tatusov, F. Sabathe, L. Doucette-Stamm, P. Soucaille, M. J. Daly, G. N. Bennett, E. V. Koonin, D. R. Smith, and G. S. C. P. Finishing. 2001. Genome sequence and comparative analysis of the solvent-producing bacterium Clostridium acetobutylicum. Journal of Bacteriology 183:4823-4838. 12. Tomas, C. A., K. V. Alsaker, H. P. J. Bonarius, W. T. Hendriksen, H. Yang,

125 126 127 128 129 130 131 132 133 134 135 136 J. A. Beamish, C. J. Paredes, and E. T. Papoutsakis. 2003. DNA array-based transcriptional analysis of asporogenous, nonsolventogenic Clostridium acetobutylicum strains SKO1 and M5. Journal of Bacteriology 185:4539-4547. 13. Tomas, C. A., N. E. Welker, and E. T. Papoutsakis. 2003. Overexpression of groesl in Clostridium acetobutylicum results in increased solvent production and tolerance, prolonged metabolism, and changes in the cell's transcriptional program. Appl Environ Microb 69:4951-4965. 14. Zerbino, D. R., and E. Birney. 2008. Velvet: Algorithms for de novo short read assembly using de Bruijn graphs. Genome Research 18:821-829. Downloaded from http://jb.asm.org/ on April 30, 2018 by guest