Background Reading and tools: Proposal:

Size: px
Start display at page:

Download "Background Reading and tools: Proposal:"

Transcription

1 Background Reading and tools: 1. Ivanov II, Atarashi K, Manel N, Brodie EL, Shima T, Karaoz U, Wei D, Goldfarb KC, Santee CA, Lynch SV, Tanoue T, Imaoka A, Itoh K, Takeda K, Umesaki Y, Honda K, Littman DR. Induction of intestinal Th17 cells by segmented filamentous bacteria. Cell Oct 30;139(3): Sczesnak A, Segata N, Qin X, Gevers D, Petrosino JF, Huttenhower C, Littman DR, Ivanov II. The genome of th17 cell-inducing segmented filamentous bacteria reveals extensive auxotrophy and adaptations to the intestinal environment. Cell Host Microbe Sep 15;10(3): Prakash T, Oshima K, Morita H, Fukuda S, Imaoka A, Kumar N, Sharma VK, Kim SW, Takahashi M, Saitou N, Taylor TD, Ohno H, Umesaki Y, Hattori M. Complete genome sequences of rat and mouse segmented filamentous bacteria, a potent inducer of th17 cell differentiation. Cell Host Microbe Sep 15;10(3): Sonnenburg JL, Xu J, Leip DD, Chen CH, Westover BP, Weatherford J, Buhler JD, Gordon JI. Glycan foraging in vivo by an intestine-adapted bacterial symbiont. Science Mar 25;307(5717): Perkins TT, Kingsley RA, Fookes MC, Gardner PP, James KD, et al. (2009) A Strand-Specific RNA Seq Analysis of the Transcriptome of the Typhoid Bacillus Salmonella Typhi. PLoS Genet 5(7): e Langmead et al. Genome Biology 2010, 11:R83; 7. MATLAB RNA SEQ ANALYSIS DEMO: eqdedemo.html 8. JAVA app that provides RNA seq QC: Proposal: The human gut, which is sterile at birth, is colonized by a large array of microbes over the course of life. With resident bacterial species, there are many complex and diverse interactions that occur between resident microbes as well as between microbes and the host. Segmented Filamentous Bacteria (SFB) are Gram-positive anaerobic bacteria which are currently unculturable and residents of the small intestine of various mammals. Their presence is associated with a reduction in the growth and colonization of pathogenic bacteria (Garland et al., 1982 and Heczko et al. 2000). SFB not only impact the microbiota, but SFB also promote the development of the normal mouse immune system by inducing numerous pro- and anti-inflammatory T cell responses as well as B cell responses, including IgA secretion, without causing any harm to the host (Ivanov et al. 2009). Thus understanding the mechanisms used by SFB to alter host biology and microbe-microbe interactions may inform causes and treatments for irritable bowel syndrome (IBS), pathogen resistance, as well as the gut immune development. SFB have been shown to adhere to the intestinal epithelium, a niche which typically contains few bacteria. This intimate association with the host may contribute to the ability of SFB to affect the colonization of other members of the microbiota, including pathogens, and may contribute to their unique and specific ability to affect the host s immune development. However it is unclear how SFB are able to localize to this privy niche. The unique nature of SFB provides a platform to query some important questions in the field including: what are the minimal gene sets or pathways needed to survive in gut and what are the mechanisms used by commensals to modulate immune system and affect other members of the microbiota. To begin to answer some of these questions, we will utilize

2 germ-free mice which lack a microbiota, and can be colonized with defined populations of bacteria, generating gnotobiotic mice. We will colonize two strains of germ-free mice with SFB in the presence and absence of Bacteroides thetaiotaomicron (B. theta), a human symbiont which can cleave and consume mucin glycans and also alters the mouse intestinal niche in numerous ways including inducing fucosylation of epithelial and mucin glycans. To understand how SFB colonize their unique niche, we will conduct transcriptional profiling studies by RNA sequencing of SFB and B. theta under each in vivo condition. By transcriptionally profiling SFB, we can identify the key pathways up regulated for life in the gut as well as candidate mechanisms used by commensals to affect the host s immune system and to impact each others survival in the gut. RNA sequencing studies have been conducted for bacteria (Perkins et. al. 2009) as well as eukaryotic organisms (Langmead et al. Genome Biology 2010), and it involves using Illumina platforms to sequence the cdna created from a sample s mrna (see figure below). Briefly RNA is extracted, cleaned up by the removal of contaminating DNA as well as ribosomal RNA (which can account for ~80% of the sample), converted to cdna, then a library is generated and sequenced. The programs created in this project will help not only to answer some of key biological questions in our datasets, but will also address important issues that researchers face when analyzing RNA sequencing data. For instance, there is no consensus on the best or most efficient way to normalize RNA seq data or compare samples to each other. Also, no program allows one to look at whether there are certain operons or pathways being differentially expressed across samples. Ultimately, the programs that created in this project will help to streamline the data analysis process not only for this project, but also for many others!

3 Weekly plan: Week 1: Choose project My project will include two main datasets and a third conceptual dataset: 1. SFB dataset: 4 samples of SFB mono-colonized mice and 4 samples from SFB and B. theta bi-colonized mice. 8 Total samples. This is the main dataset we want to work with to address biological questions! 2. CS dataset: For validation and other work, I also have 1 sample from a CS monocolonized mouse and one sample from a CS + BT bi-colonized mouse. 2 Total samples rd fake/tweakable dataset: For proof of concepts and working on programs, we can make our own simplified dataset. Chop up a small genome and insert other sequences, errors. Week 2: Reading/understanding project and getting comfortable with the data 1. Go over protocol for getting biological samples, sequencing, what RNA seq is, how RNA seq can be used, and what the needs are in the field, specifically for prokaryotes. 2. Go through a CLC Genomics Workbench (program available through CMGM) analysis: a. Understand the formats: input and output. b. Potential for programming, what is expected, what is needed. c. Use data generated by CLC Genomics Workbench as a starting-off point (let s not reinvent the wheel)! Week 3: Pre-data analysis 1. Determine what the minimal read length is required to map uniquely to a genome. Generate a theoretical genome of size x and a read length of y base pairs plot. Create a graph/dot plot. 2. Write programs to see how different parameters affect read mapping to one or more reference genomes. Generate a graph or dotplot demonstrating the false discovery rate and uniquely mapped reads for each. Also evaluate what would be the optimal parameter per condition. a. How does the sequence length affect the ability to map reads (36 vs. 72 bp length reads)? b. How does allowing for sequence mismatches affect the ability to map reads (0 vs. 1 vs.2 mismatches)? c. How does having paired-end vs. single-end reads affect the ability to map reads? d. What if parts of the genomes are highly similar? What if the genomes are very different? e. How do you deal with reads that don t uniquely map, i.e. map to more than one gene? Week 4: Perform quality control on RNA seq data 1. Can you identify whether the sequencing data consist of high quality reads? *JAVA Fastqc app a. Evaluate the quality scores that were assigned by the sequencing facility

4 i. Exclude low-quality scores b. Evaluate the sequences for potential sequencing biases or artifacts i. Nucleotide bias, GC bias, etc. 2. Can you write a program to identify what the range of expression is per gene in each sample? a. How does RNA contamination (rrna and trna) affect the signal per sample (graph)? How much RNA contamination would be too much? *spike rrna into your fake dataset* 3. Write a program to normalize the samples so that they can be compared to each other. a. Try excluding vs. including rrna sequences as well as sequences that map to other genomes or don t map uniquely. b. Try raw counts vs. RPKM values. c. Justify which method is best, what you would choose and why! 4. Use Matlab to average 4 samples together and determine the mean and standard deviation for each gene expression. a. Can you determine a cutoff for background signal vs. differences in gene expression? Week 5: Preliminary analysis of RNA seq data: Can you write programs or Matlab GUIs to answer the following questions: 1. What genes are being expressed highly in all of the in vivo conditions by SFB, i.e. what genes/pathways/metabolic processes appear to be important for living in vivo in the SFB niche? a. What are the most highly expressed genes in vivo? 2. What genes are being expressed differentially between the conditions, i.e. what genes/pathways are important for SFB functionally in different conditions? a. What are the most highly expressed genes in mono-association as compared to biassociation? b. What are the most highly expressed genes in bi-association as compared to monoassociation? 3. SFB is not a clonal population. Can you identify gene variants, SNPs, mutations, etc., and are they differentially expressed? a. We have DNA sequencing data suggesting that the SFB population contains multiple variants of flagellin. Can you detect these variants in the RNA seq data, and is one variant expressed more highly than the other? Week 6: Secondary analysis of RNA seq data: 1. Generate a program to identify highly expressed operons (based on genome proximity). What are the most highly expressed operons? 2. Which non-orfs are highly expressed (small RNAs)? 3. Can you identify potential immune modulators or microbiota modulators from the transcriptional data? Which genes are cell surface or secreted? Which genes resemble toxins?

5 Week 7: Conquer the biology/functionality: Using pathway tools, kegg (a database of metabolic maps and pathways), biocyc (collections of metabolic pathways, reactions, enzymes), let s analyze differences in metabolic pathways. Let s perform some functional analyses of the transcriptome! 1. Compare the list of pathways and metabolic processes to the genome as a whole. 2. Compare the representation of COG Functional Categories in the expression datasets compared to the genome as a whole. Is there an over or under-representation of certain COG categories in the expression data? Can you write a program to determine whether there is statistical significantly different utilization of certain COG categories? Remember to normalize to genome content. 3. Any metabolic pathways overrepresented in the transcriptional data? Between the two sets of samples? What pathways are utilized differentially by SFB in the mono and bi-associated conditions? 4. Can you identify any biases in the location of the genes that are highly expressed across the genome, or are these genes randomly distributed across the genome? Are there any biases as far as PSORT predictions? 5. Can you come up with a method to look at all of these different variables together and give an overall ranking of important genes/pathways/operons? Ideally this would narrow your list of genes to the most important ones. Think about how much weight each analysis should be given, possibly come up with a ranking system. Think about clustering your outputs by proximity (operons) + overlapping pathways between genes Week 8: Discuss biological implications, future studies, and interesting findings. 1. What are the 3 most exciting findings you made in this project and why do you think they are exciting? 2. What are some explanations for differences you have found in the transcriptional expression between the samples? 3. If there are differences in metabolic pathways, how can one validate them experimentally? 4. What implications does this have for the biology of the gut microbiota and the host? Week 9: Write up findings

Parts of a standard FastQC report

Parts of a standard FastQC report FastQC FastQC, written by Simon Andrews of Babraham Bioinformatics, is a very popular tool used to provide an overview of basic quality control metrics for raw next generation sequencing data. There are

More information

Introduction to RNA-Seq. David Wood Winter School in Mathematics and Computational Biology July 1, 2013

Introduction to RNA-Seq. David Wood Winter School in Mathematics and Computational Biology July 1, 2013 Introduction to RNA-Seq David Wood Winter School in Mathematics and Computational Biology July 1, 2013 Abundance RNA is... Diverse Dynamic Central DNA rrna Epigenetics trna RNA mrna Time Protein Abundance

More information

Transcriptome analysis

Transcriptome analysis Statistical Bioinformatics: Transcriptome analysis Stefan Seemann seemann@rth.dk University of Copenhagen April 11th 2018 Outline: a) How to assess the quality of sequencing reads? b) How to normalize

More information

ChIP-seq and RNA-seq. Farhat Habib

ChIP-seq and RNA-seq. Farhat Habib ChIP-seq and RNA-seq Farhat Habib fhabib@iiserpune.ac.in Biological Goals Learn how genomes encode the diverse patterns of gene expression that define each cell type and state. Protein-DNA interactions

More information

(A) Extrachromosomal DNA (B) RNA found in bacterial cells (C) Is part of the bacterial chromosome (D) Is part of the eukaryote chromosome

(A) Extrachromosomal DNA (B) RNA found in bacterial cells (C) Is part of the bacterial chromosome (D) Is part of the eukaryote chromosome Microbiology - Problem Drill 07: Microbial Genetics and Biotechnology No. 1 of 10 1. A plasmid is? (A) Extrachromosomal DNA (B) RNA found in bacterial cells (C) Is part of the bacterial chromosome (D)

More information

Applications of RNA -seq in Microbiology

Applications of RNA -seq in Microbiology Applications of RNA -seq in Microbiology STUDENT: CHEUNG YUK YAM, ANDY SUPERVISOR: PROF MAMIE HUI JOINT GRADUATE SEMINAR DEC 2016 DEPARTMENT OF MICROBIOLOGY, FACULTY OF MEDICINE, THE CHINESE UNIVERSITY

More information

Introduction to RNA-Seq in GeneSpring NGS Software

Introduction to RNA-Seq in GeneSpring NGS Software Introduction to RNA-Seq in GeneSpring NGS Software Dipa Roy Choudhury, Ph.D. Strand Scientific Intelligence and Agilent Technologies Learn more at www.genespring.com Introduction to RNA-Seq In a few years,

More information

Transcriptomics analysis with RNA seq: an overview Frederik Coppens

Transcriptomics analysis with RNA seq: an overview Frederik Coppens Transcriptomics analysis with RNA seq: an overview Frederik Coppens Platforms Applications Analysis Quantification RNA content Platforms Platforms Short (few hundred bases) Long reads (multiple kilobases)

More information

Module 6 Microbial Genetics. Chapter 8

Module 6 Microbial Genetics. Chapter 8 Module 6 Microbial Genetics Chapter 8 Structure and function of the genetic material Genetics science of o Study of what genes are, how they determine the characteristics of an organism, how they carry

More information

Genome annotation & EST

Genome annotation & EST Genome annotation & EST What is genome annotation? The process of taking the raw DNA sequence produced by the genome sequence projects and adding the layers of analysis and interpretation necessary

More information

From Genotype to Phenotype

From Genotype to Phenotype From Genotype to Phenotype Johanna Vilkki Green technology, Natural Resources Institute Finland Systems biology Genome Transcriptome genes mrna Genotyping methodology SNP TOOLS, WG SEQUENCING Functional

More information

RNA-Seq Analysis. Simon Andrews, Laura v

RNA-Seq Analysis. Simon Andrews, Laura v RNA-Seq Analysis Simon Andrews, Laura Biggins simon.andrews@babraham.ac.uk @simon_andrews v2018-10 RNA-Seq Libraries rrna depleted mrna Fragment u u u u NNNN Random prime + RT 2 nd strand synthesis (+

More information

Bi 8 Lecture 4. Ellen Rothenberg 14 January Reading: from Alberts Ch. 8

Bi 8 Lecture 4. Ellen Rothenberg 14 January Reading: from Alberts Ch. 8 Bi 8 Lecture 4 DNA approaches: How we know what we know Ellen Rothenberg 14 January 2016 Reading: from Alberts Ch. 8 Central concept: DNA or RNA polymer length as an identifying feature RNA has intrinsically

More information

Protein Synthesis: From Gene RNA Protein Trait

Protein Synthesis: From Gene RNA Protein Trait Protein Synthesis: From Gene RNA Protein Trait Human Genome The human genome contains about genes. Each gene is a of DNA (sequence of nitrogen bases) contained within each chromosome. Each chromosome contains

More information

Goals of pharmacogenomics

Goals of pharmacogenomics Goals of pharmacogenomics Use drugs better and use better drugs! People inherit/exhibit differences in drug: Absorption Metabolism and degradation of the drug Transport of drug to the target molecule Excretion

More information

Biosc10 schedule reminders

Biosc10 schedule reminders Biosc10 schedule reminders Review of molecular biology basics DNA Is each person s DNA the same, or unique? What does DNA look like? What are the three parts of each DNA nucleotide Which DNA bases pair,

More information

Deep Sequencing technologies

Deep Sequencing technologies Deep Sequencing technologies Gabriela Salinas 30 October 2017 Transcriptome and Genome Analysis Laboratory http://www.uni-bc.gwdg.de/index.php?id=709 Microarray and Deep-Sequencing Core Facility University

More information

Sequence Analysis Lab Protocol

Sequence Analysis Lab Protocol Sequence Analysis Lab Protocol You will need this handout of instructions The sequence of your plasmid from the ABI The Accession number for Lambda DNA J02459 The Accession number for puc 18 is L09136

More information

less sensitive than RNA-seq but more robust analysis pipelines expensive but quantitiatve standard but typically not high throughput

less sensitive than RNA-seq but more robust analysis pipelines expensive but quantitiatve standard but typically not high throughput Chapter 11: Gene Expression The availability of an annotated genome sequence enables massively parallel analysis of gene expression. The expression of all genes in an organism can be measured in one experiment.

More information

ChIP-seq and RNA-seq

ChIP-seq and RNA-seq ChIP-seq and RNA-seq Biological Goals Learn how genomes encode the diverse patterns of gene expression that define each cell type and state. Protein-DNA interactions (ChIPchromatin immunoprecipitation)

More information

Next Generation Sequencing

Next Generation Sequencing Next Generation Sequencing Complete Report Catalogue # and Service: IR16001 rrna depletion (human, mouse, or rat) IR11081 Total RNA Sequencing (80 million reads, 2x75 bp PE) Xxxxxxx - xxxxxxxxxxxxxxxxxxxxxx

More information

Themes: RNA and RNA Processing. Messenger RNA (mrna) What is a gene? RNA is very versatile! RNA-RNA interactions are very important!

Themes: RNA and RNA Processing. Messenger RNA (mrna) What is a gene? RNA is very versatile! RNA-RNA interactions are very important! Themes: RNA is very versatile! RNA and RNA Processing Chapter 14 RNA-RNA interactions are very important! Prokaryotes and Eukaryotes have many important differences. Messenger RNA (mrna) Carries genetic

More information

Integrated NGS Sample Preparation Solutions for Limiting Amounts of RNA and DNA. March 2, Steven R. Kain, Ph.D. ABRF 2013

Integrated NGS Sample Preparation Solutions for Limiting Amounts of RNA and DNA. March 2, Steven R. Kain, Ph.D. ABRF 2013 Integrated NGS Sample Preparation Solutions for Limiting Amounts of RNA and DNA March 2, 2013 Steven R. Kain, Ph.D. ABRF 2013 NuGEN s Core Technologies Selective Sequence Priming Nucleic Acid Amplification

More information

TECH NOTE Pushing the Limit: A Complete Solution for Generating Stranded RNA Seq Libraries from Picogram Inputs of Total Mammalian RNA

TECH NOTE Pushing the Limit: A Complete Solution for Generating Stranded RNA Seq Libraries from Picogram Inputs of Total Mammalian RNA TECH NOTE Pushing the Limit: A Complete Solution for Generating Stranded RNA Seq Libraries from Picogram Inputs of Total Mammalian RNA Stranded, Illumina ready library construction in

More information

Bi 8 Lecture 5. Ellen Rothenberg 19 January 2016

Bi 8 Lecture 5. Ellen Rothenberg 19 January 2016 Bi 8 Lecture 5 MORE ON HOW WE KNOW WHAT WE KNOW and intro to the protein code Ellen Rothenberg 19 January 2016 SIZE AND PURIFICATION BY SYNTHESIS: BASIS OF EARLY SEQUENCING complex mixture of aborted DNA

More information

DNA and RNA. Chapter 12

DNA and RNA. Chapter 12 DNA and RNA Chapter 12 History of DNA Late 1800 s scientists discovered that DNA is in the nucleus of the cell 1902 Walter Sutton proposed that hereditary material resided in the chromosomes in the nucleus

More information

dbcamplicons pipeline Amplicons

dbcamplicons pipeline Amplicons dbcamplicons pipeline Amplicons Matthew L. Settles Genome Center Bioinformatics Core University of California, Davis settles@ucdavis.edu; bioinformatics.core@ucdavis.edu Microbial community analysis Goal:

More information

DNA Function. DNA Heredity and Protein Synthesis

DNA Function. DNA Heredity and Protein Synthesis DNA Function DNA Heredity and Protein Synthesis 1 Review DNA made of Nucleotide bases Proteins made of Amino acids Describe how DNA is involved in protein synthesis DNA base sequence codes for amino acid

More information

Experimental Design. Sequencing. Data Quality Control. Read mapping. Differential Expression analysis

Experimental Design. Sequencing. Data Quality Control. Read mapping. Differential Expression analysis -Seq Analysis Quality Control checks Reproducibility Reliability -seq vs Microarray Higher sensitivity and dynamic range Lower technical variation Available for all species Novel transcript identification

More information

Measuring and Understanding Gene Expression

Measuring and Understanding Gene Expression Measuring and Understanding Gene Expression Dr. Lars Eijssen Dept. Of Bioinformatics BiGCaT Sciences programme 2014 Why are genes interesting? TRANSCRIPTION Genome Genomics Transcriptome Transcriptomics

More information

C3BI. VARIANTS CALLING November Pierre Lechat Stéphane Descorps-Declère

C3BI. VARIANTS CALLING November Pierre Lechat Stéphane Descorps-Declère C3BI VARIANTS CALLING November 2016 Pierre Lechat Stéphane Descorps-Declère General Workflow (GATK) software websites software bwa picard samtools GATK IGV tablet vcftools website http://bio-bwa.sourceforge.net/

More information

Biology 201 (Genetics) Exam #3 120 points 20 November Read the question carefully before answering. Think before you write.

Biology 201 (Genetics) Exam #3 120 points 20 November Read the question carefully before answering. Think before you write. Name KEY Section Biology 201 (Genetics) Exam #3 120 points 20 November 2006 Read the question carefully before answering. Think before you write. You will have up to 50 minutes to take this exam. After

More information

Solutions to Quiz II

Solutions to Quiz II MIT Department of Biology 7.014 Introductory Biology, Spring 2005 Solutions to 7.014 Quiz II Class Average = 79 Median = 82 Grade Range % A 90-100 27 B 75-89 37 C 59 74 25 D 41 58 7 F 0 40 2 Question 1

More information

Novel methods for RNA and DNA- Seq analysis using SMART Technology. Andrew Farmer, D. Phil. Vice President, R&D Clontech Laboratories, Inc.

Novel methods for RNA and DNA- Seq analysis using SMART Technology. Andrew Farmer, D. Phil. Vice President, R&D Clontech Laboratories, Inc. Novel methods for RNA and DNA- Seq analysis using SMART Technology Andrew Farmer, D. Phil. Vice President, R&D Clontech Laboratories, Inc. Agenda Enabling Single Cell RNA-Seq using SMART Technology SMART

More information

From DNA to Protein: Genotype to Phenotype

From DNA to Protein: Genotype to Phenotype 12 From DNA to Protein: Genotype to Phenotype 12.1 What Is the Evidence that Genes Code for Proteins? The gene-enzyme relationship is one-gene, one-polypeptide relationship. Example: In hemoglobin, each

More information

How to deal with your RNA-seq data?

How to deal with your RNA-seq data? How to deal with your RNA-seq data? Rachel Legendre, Thibault Dayris, Adrien Pain, Claire Toffano-Nioche, Hugo Varet École de bioinformatique AVIESAN-IFB 2017 1 Rachel Legendre Bioinformatics 27/11/2018

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION SUPPLEMENTARY INFORMATION doi:10.1038/nature12212 Supplementary Discussion Contamination Assessment We evaluated the amount of human contamination in our viral DNA preparations by identifying sequences

More information

Genomics and Gene Recognition Genes and Blue Genes

Genomics and Gene Recognition Genes and Blue Genes Genomics and Gene Recognition Genes and Blue Genes November 1, 2004 Prokaryotic Gene Structure prokaryotes are simplest free-living organisms studying prokaryotes can give us a sense what is the minimum

More information

CBC Data Therapy. Metatranscriptomics Discussion

CBC Data Therapy. Metatranscriptomics Discussion CBC Data Therapy Metatranscriptomics Discussion Metatranscriptomics Extract RNA, subtract rrna Sequence cdna QC Gene expression, function Institute for Systems Genomics: Computational Biology Core bioinformatics.uconn.edu

More information

Differential gene expression analysis using RNA-seq

Differential gene expression analysis using RNA-seq https://abc.med.cornell.edu/ Differential gene expression analysis using RNA-seq Applied Bioinformatics Core, August 2017 Friederike Dündar with Luce Skrabanek & Ceyda Durmaz Day 3 QC of aligned reads

More information

Total RNA isola-on End Repair of double- stranded cdna

Total RNA isola-on End Repair of double- stranded cdna Total RNA isola-on End Repair of double- stranded cdna mrna Isola8on using Oligo(dT) Magne8c Beads AAAAAAA A Adenyla8on (A- Tailing) A AAAAAAAAAAAA TTTTTTTTT AAAAAAA TTTTTTTTT TTTTTTTT TTTTTTTTT AAAAAAAA

More information

Videos. Lesson Overview. Fermentation

Videos. Lesson Overview. Fermentation Lesson Overview Fermentation Videos Bozeman Transcription and Translation: https://youtu.be/h3b9arupxzg Drawing transcription and translation: https://youtu.be/6yqplgnjr4q Objectives 29a) I can contrast

More information

SMRT-Cappable-seqreveals complex operon variants in bacteria

SMRT-Cappable-seqreveals complex operon variants in bacteria SMRT-Cappable-seqreveals complex operon variants in bacteria SMRTLeiden - Leiden Genome Technology Center (LGTC) June 12-14, 2018 Dr. Laurence Ettwiller @ettwiller New England Biolabs, Inc. Operon structures

More information

Unit 3c. Microbial Gene0cs

Unit 3c. Microbial Gene0cs Unit 3c Microbial Gene0cs Microbial Genetics! Gene0cs: the science of heredity Genome: the gene0c informa0on in the cell Genomics: the sequencing and molecular characteriza0on of genomes Gregor Mendel

More information

Annotating the Genome (H)

Annotating the Genome (H) Annotating the Genome (H) Annotation principles (H1) What is annotation? In general: annotation = explanatory note* What could be useful as an annotation of a DNA sequence? an amino acid sequence? What

More information

Transcriptome Assembly, Functional Annotation (and a few other related thoughts)

Transcriptome Assembly, Functional Annotation (and a few other related thoughts) Transcriptome Assembly, Functional Annotation (and a few other related thoughts) Monica Britton, Ph.D. Sr. Bioinformatics Analyst June 23, 2017 Differential Gene Expression Generalized Workflow File Types

More information

Gene is the basic physical and functional unit of heredity. A Gene, in molecular terms,

Gene is the basic physical and functional unit of heredity. A Gene, in molecular terms, Gene Structure-Introns, Exons and Pseudogenes... What is a gene? Gene is the basic physical and functional unit of heredity. A Gene, in molecular terms, is a nucleotide sequence necessary for the synthesis

More information

Analysis of RNA-seq Data. Feb 8, 2017 Peikai CHEN (PHD)

Analysis of RNA-seq Data. Feb 8, 2017 Peikai CHEN (PHD) Analysis of RNA-seq Data Feb 8, 2017 Peikai CHEN (PHD) Outline What is RNA-seq? What can RNA-seq do? How is RNA-seq measured? How to process RNA-seq data: the basics How to visualize and diagnose your

More information

1. ADHERE AND DEFEND: Our bacterium has entered the host. Now it needs to adhere and get past the normal microbiota.

1. ADHERE AND DEFEND: Our bacterium has entered the host. Now it needs to adhere and get past the normal microbiota. North Seattle College Stage 02 Colonization and Infection This explanatory model will tell the story of how one bacterium adheres to a host and, through binary fission, ends up making two daughter cells.

More information

RNA-Seq de novo assembly training

RNA-Seq de novo assembly training RNA-Seq de novo assembly training Training session aims Give you some keys elements to look at during read quality check. Transcriptome assembly is not completely a strait forward process : Multiple strategies

More information

Chapter 6 MOLECULAR BASIS OF INHERITANCE

Chapter 6 MOLECULAR BASIS OF INHERITANCE Chapter 6 MOLECULAR BASIS OF INHERITANCE 1mark each 1. Why is the ADA enzyme required in our body? 2. Which is not required for polypeptide synthesis: Termination codon, mrna, peptidyl tranferase, rrna?

More information

Supplement to: The Genomic Sequence of the Chinese Hamster Ovary (CHO)-K1 cell line

Supplement to: The Genomic Sequence of the Chinese Hamster Ovary (CHO)-K1 cell line Supplement to: The Genomic Sequence of the Chinese Hamster Ovary (CHO)-K1 cell line Table of Contents SUPPLEMENTARY TEXT:... 2 FILTERING OF RAW READS PRIOR TO ASSEMBLY:... 2 COMPARATIVE ANALYSIS... 2 IMMUNOGENIC

More information

AP Biology Gene Expression/Biotechnology REVIEW

AP Biology Gene Expression/Biotechnology REVIEW AP Biology Gene Expression/Biotechnology REVIEW Multiple Choice Identify the choice that best completes the statement or answers the question. 1. Gene expression can be a. regulated before transcription.

More information

NGS part 2: applications. Tobias Österlund

NGS part 2: applications. Tobias Österlund NGS part 2: applications Tobias Österlund tobiaso@chalmers.se NGS part of the course Week 4 Friday 13/2 15.15-17.00 NGS lecture 1: Introduction to NGS, alignment, assembly Week 6 Thursday 26/2 08.00-09.45

More information

TECH NOTE Stranded NGS libraries from FFPE samples

TECH NOTE Stranded NGS libraries from FFPE samples TECH NOTE Stranded NGS libraries from FFPE samples Robust performance with extremely degraded FFPE RNA (DV 200 >25%) Consistent library quality across a range of input amounts (5 ng 50 ng) Compatibility

More information

Introduction of RNA-Seq Analysis

Introduction of RNA-Seq Analysis Introduction of RNA-Seq Analysis Jiang Li, MS Bioinformatics System Engineer I Center for Quantitative Sciences(CQS) Vanderbilt University September 21, 2012 Goal of this talk 1. Act as a practical resource

More information

Name 10 Molecular Biology of the Gene Test Date Study Guide You must know: The structure of DNA. The major steps to replication.

Name 10 Molecular Biology of the Gene Test Date Study Guide You must know: The structure of DNA. The major steps to replication. Name 10 Molecular Biology of the Gene Test Date Study Guide You must know: The structure of DNA. The major steps to replication. The difference between replication, transcription, and translation. How

More information

BA, BSc, and MSc Degree Examinations

BA, BSc, and MSc Degree Examinations Examination Candidate Number: Desk Number: BA, BSc, and MSc Degree Examinations 2017-8 Department : BIOLOGY Title of Exam: Bacterial pathogenesis Time Allowed: 2 hours Marking Scheme: Total marks available

More information

RNA-Sequencing analysis

RNA-Sequencing analysis RNA-Sequencing analysis Markus Kreuz 25. 04. 2012 Institut für Medizinische Informatik, Statistik und Epidemiologie Content: Biological background Overview transcriptomics RNA-Seq RNA-Seq technology Challenges

More information

CHAPTER 21 LECTURE SLIDES

CHAPTER 21 LECTURE SLIDES CHAPTER 21 LECTURE SLIDES Prepared by Brenda Leady University of Toledo To run the animations you must be in Slideshow View. Use the buttons on the animation to play, pause, and turn audio/text on or off.

More information

Guidelines Analysis of RNA Quantity and Quality for Next-Generation Sequencing Projects

Guidelines Analysis of RNA Quantity and Quality for Next-Generation Sequencing Projects Title: Protocol number: Guidelines Analysis of RNA Quantity and Quality for Next-Generation Sequencing Projects GAF S002 Version: Version 4 Date: December 17 th 2015 Author: P. van der Vlies, C.C. van

More information

DNA, RNA & Proteins Chapter 13

DNA, RNA & Proteins Chapter 13 DNA, RNA & Proteins Chapter 13 DNA stands for. What is DNA? - The genetic information that controls the activity of a cell. - Located in the of every one of your cells. What is the structure of DNA like?

More information

METAGENOMICS. Aina Maria Mas Calafell Genomics

METAGENOMICS. Aina Maria Mas Calafell Genomics METAGENOMICS Aina Maria Mas Calafell Genomics Introduction Microbial communities Primary role in biogeochemical systems Study of microbial communities 1.- Culture-based methodologies Only isolated microbes

More information

Analysis of RNA-seq Data. Bernard Pereira

Analysis of RNA-seq Data. Bernard Pereira Analysis of RNA-seq Data Bernard Pereira The many faces of RNA-seq Applications Discovery Find new transcripts Find transcript boundaries Find splice junctions Comparison Given samples from different experimental

More information

Introducing QIAseq. Accelerate your NGS performance through Sample to Insight solutions. Sample to Insight

Introducing QIAseq. Accelerate your NGS performance through Sample to Insight solutions. Sample to Insight Introducing QIAseq Accelerate your NGS performance through Sample to Insight solutions Sample to Insight From Sample to Insight let QIAGEN enhance your NGS-based research High-throughput next-generation

More information

RNA-Seq with the Tuxedo Suite

RNA-Seq with the Tuxedo Suite RNA-Seq with the Tuxedo Suite Monica Britton, Ph.D. Sr. Bioinformatics Analyst September 2015 Workshop The Basic Tuxedo Suite References Trapnell C, et al. 2009 TopHat: discovering splice junctions with

More information

From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow

From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow Technical Overview Import VCF Introduction Next-generation sequencing (NGS) studies have created unanticipated challenges with

More information

NCEA Level 2 Biology (91159) 2017 page 1 of 6. Achievement Achievement with Merit Achievement with Excellence

NCEA Level 2 Biology (91159) 2017 page 1 of 6. Achievement Achievement with Merit Achievement with Excellence NCEA Level 2 Biology (91159) 2017 page 1 of 6 Assessment Schedule 2017 Biology: Demonstrate understanding of gene expression (91159) Assessment Criteria with Merit with Excellence Demonstrate understanding

More information

Course Information. Introduction to Algorithms in Computational Biology Lecture 1. Relations to Some Other Courses

Course Information. Introduction to Algorithms in Computational Biology Lecture 1. Relations to Some Other Courses Course Information Introduction to Algorithms in Computational Biology Lecture 1 Meetings: Lecture, by Dan Geiger: Mondays 16:30 18:30, Taub 4. Tutorial, by Ydo Wexler: Tuesdays 10:30 11:30, Taub 2. Grade:

More information

Basics of RNA-Seq. (With a Focus on Application to Single Cell RNA-Seq) Michael Kelly, PhD Team Lead, NCI Single Cell Analysis Facility

Basics of RNA-Seq. (With a Focus on Application to Single Cell RNA-Seq) Michael Kelly, PhD Team Lead, NCI Single Cell Analysis Facility 2018 ABRF Meeting Satellite Workshop 4 Bridging the Gap: Isolation to Translation (Single Cell RNA-Seq) Sunday, April 22 Basics of RNA-Seq (With a Focus on Application to Single Cell RNA-Seq) Michael Kelly,

More information

Carl Woese. Used 16S rrna to develop a method to Identify any bacterium, and discovered a novel domain of life

Carl Woese. Used 16S rrna to develop a method to Identify any bacterium, and discovered a novel domain of life METAGENOMICS Carl Woese Used 16S rrna to develop a method to Identify any bacterium, and discovered a novel domain of life His amazing discovery, coupled with his solitary behaviour, made many contemporary

More information

Videos. Bozeman Transcription and Translation: Drawing transcription and translation:

Videos. Bozeman Transcription and Translation:   Drawing transcription and translation: Videos Bozeman Transcription and Translation: https://youtu.be/h3b9arupxzg Drawing transcription and translation: https://youtu.be/6yqplgnjr4q Objectives 29a) I can contrast RNA and DNA. 29b) I can explain

More information

DNBseq TM SERVICE OVERVIEW Plant and Animal Whole Genome Re-Sequencing

DNBseq TM SERVICE OVERVIEW Plant and Animal Whole Genome Re-Sequencing TM SERVICE OVERVIEW Plant and Animal Whole Genome Re-Sequencing Plant and animal whole genome re-sequencing (WGRS) involves sequencing the entire genome of a plant or animal and comparing the sequence

More information

Supplementary Materials for

Supplementary Materials for advances.sciencemag.org/cgi/content/full/4/2/eaao0665/dc1 Supplementary Materials for Variant ribosomal RNA alleles are conserved and exhibit tissue-specific expression Matthew M. Parks, Chad M. Kurylo,

More information

Wet-lab Considerations for Illumina data analysis

Wet-lab Considerations for Illumina data analysis Wet-lab Considerations for Illumina data analysis Based on a presentation by Henriette O Geen Lutz Froenicke DNA Technologies and Expression Analysis Cores UCD Genome Center Complementary Approaches Illumina

More information

Biology 3201 Genetics Unit #5

Biology 3201 Genetics Unit #5 Biology 3201 Genetics Unit #5 Protein Synthesis Protein Synthesis Protein synthesis: this is the process whereby instructions from DNA are used to create polypeptides that make up a protein. This process

More information

MCB 102 University of California, Berkeley August 11 13, Problem Set 8

MCB 102 University of California, Berkeley August 11 13, Problem Set 8 MCB 102 University of California, Berkeley August 11 13, 2009 Isabelle Philipp Handout Problem Set 8 The answer key will be posted by Tuesday August 11. Try to solve the problem sets always first without

More information

Workflows and Pipelines for NGS analysis: Lessons from proteomics

Workflows and Pipelines for NGS analysis: Lessons from proteomics Workflows and Pipelines for NGS analysis: Lessons from proteomics Conference on Applying NGS in Basic research Health care and Agriculture 11 th Sep 2014 Debasis Dash Where are the protein coding genes

More information

The New Genome Analyzer IIx Delivering more data, faster, and easier than ever before. Jeremy Preston, PhD Marketing Manager, Sequencing

The New Genome Analyzer IIx Delivering more data, faster, and easier than ever before. Jeremy Preston, PhD Marketing Manager, Sequencing The New Genome Analyzer IIx Delivering more data, faster, and easier than ever before Jeremy Preston, PhD Marketing Manager, Sequencing Illumina Genome Analyzer: a Paradigm Shift 2000x gain in efficiency

More information

7.014 Problem Set 4 Answers to this problem set are to be turned in. Problem sets will not be accepted late. Solutions will be posted on the web.

7.014 Problem Set 4 Answers to this problem set are to be turned in. Problem sets will not be accepted late. Solutions will be posted on the web. MIT Department of Biology 7.014 Introductory Biology, Spring 2005 Name: Section : 7.014 Problem Set 4 Answers to this problem set are to be turned in. Problem sets will not be accepted late. Solutions

More information

RNA-Seq Software, Tools, and Workflows

RNA-Seq Software, Tools, and Workflows RNA-Seq Software, Tools, and Workflows Monica Britton, Ph.D. Sr. Bioinformatics Analyst September 1, 2016 Some mrna-seq Applications Differential gene expression analysis Transcriptional profiling Assumption:

More information

Introduction to Algorithms in Computational Biology Lecture 1

Introduction to Algorithms in Computational Biology Lecture 1 Introduction to Algorithms in Computational Biology Lecture 1 Background Readings: The first three chapters (pages 1-31) in Genetics in Medicine, Nussbaum et al., 2001. This class has been edited from

More information

How to Use This Presentation

How to Use This Presentation How to Use This Presentation To View the presentation as a slideshow with effects select View on the menu bar and click on Slide Show. To advance through the presentation, click the right-arrow key or

More information

Ch. 10 Notes DNA: Transcription and Translation

Ch. 10 Notes DNA: Transcription and Translation Ch. 10 Notes DNA: Transcription and Translation GOALS Compare the structure of RNA with that of DNA Summarize the process of transcription Relate the role of codons to the sequence of amino acids that

More information

DESIGNER GENES - BIOTECHNOLOGY

DESIGNER GENES - BIOTECHNOLOGY DESIGNER GENES - BIOTECHNOLOGY Technology to manipulate DNA techniques often called genetic engineering or Recombinant DNA Technology-Technology used to manipulate DNA Procedures often called genetic engineering

More information

What is Bioinformatics?

What is Bioinformatics? What is Bioinformatics? Bioinformatics is the field of science in which biology, computer science, and information technology merge to form a single discipline. - NCBI The ultimate goal of the field is

More information

11/17/14. Why would scientist want to make a mouse glow?

11/17/14. Why would scientist want to make a mouse glow? 11/17/14 Why would scientist want to make a mouse glow? 11/20 Your test today has ten words please use this time wisely. Chapter 8 Vocabulary Review Bacteriophage Viruses that infect bacteria, makes the

More information

02 Agenda Item 03 Agenda Item

02 Agenda Item 03 Agenda Item 01 Agenda Item 02 Agenda Item 03 Agenda Item SOLiD 3 System: Applications Overview April 12th, 2010 Jennifer Stover Field Application Specialist - SOLiD Applications Workflow for SOLiD Application Application

More information

Biology A: Chapter 9 Annotating Notes Protein Synthesis

Biology A: Chapter 9 Annotating Notes Protein Synthesis Name: Pd: Biology A: Chapter 9 Annotating Notes Protein Synthesis -As you read your textbook, please fill out these notes. -Read each paragraph state the big/main idea on the left side. -On the right side

More information

DNA Evolution of knowledge about gene. Contains information about RNAs and proteins. Polynucleotide chains; Double stranded molecule;

DNA Evolution of knowledge about gene. Contains information about RNAs and proteins. Polynucleotide chains; Double stranded molecule; Evolution of knowledge about gene G. Mendel Hereditary factors W.Johannsen, 1909 G.W.Beadle, E.L.Tatum, 1945 Ingram, 1957 Actual concepts The gene hereditary unit located in chromosomes Hypotheses One

More information

About Strand NGS. Strand Genomics, Inc All rights reserved.

About Strand NGS. Strand Genomics, Inc All rights reserved. About Strand NGS Strand NGS-formerly known as Avadis NGS, is an integrated platform that provides analysis, management and visualization tools for next-generation sequencing data. It supports extensive

More information

How much sequencing do I need? Emily Crisovan Genomics Core September 26, 2018

How much sequencing do I need? Emily Crisovan Genomics Core September 26, 2018 How much sequencing do I need? Emily Crisovan Genomics Core September 26, 2018 How much sequencing? Three questions: 1. How much sequence is required for good experimental design? 2. What type of sequencing

More information

Wake Acceleration Academy - Biology Note Guide Unit 5: Molecular Genetics

Wake Acceleration Academy - Biology Note Guide Unit 5: Molecular Genetics Wake Acceleration Academy - Biology Note Guide Unit 5: Molecular Genetics Extra Resources Website: http://waa-science.weebly.com Module 1: Overview of DNA Vocabulary Term Definition (You may use an Internet

More information

LECTURE 20. Repeated DNA Sequences. Prokaryotes:

LECTURE 20. Repeated DNA Sequences. Prokaryotes: LECTURE 20 Repeated DNA Sequences Prokaryotes: 1) Most DNA is in the form of unique sequences. Exceptions are the genes encoding ribosomal RNA (rdna, 10-20 copies) and various recognition sequences (e.g.,

More information

B.Sc V Semester Question Bank. B.Sc V Semester Question Bank. Prepared by Nitin Swamy, Department of Biotechnology, SACJ Page 1 of 6

B.Sc V Semester Question Bank. B.Sc V Semester Question Bank. Prepared by Nitin Swamy, Department of Biotechnology, SACJ Page 1 of 6 B.Sc V Semester Question Bank Prepared by Nitin Swamy, Department of Biotechnology, SACJ Page 1 of 6 UNIT I Genetic Material Different forms of DNA (DNA topology):- B-form, Z-form, D-form; Gene structure-introns,exons

More information

UAB DNA-Seq Analysis Workshop. John Osborne Research Associate Centers for Clinical and Translational Science

UAB DNA-Seq Analysis Workshop. John Osborne Research Associate Centers for Clinical and Translational Science + UAB DNA-Seq Analysis Workshop John Osborne Research Associate Centers for Clinical and Translational Science ozborn@uab.,edu + Thanks in advance You are the Guinea pigs for this workshop! At this point

More information

dbcamplicons pipeline Amplicons

dbcamplicons pipeline Amplicons dbcamplicons pipeline Amplicons Matthew L. Settles Genome Center Bioinformatics Core University of California, Davis settles@ucdavis.edu; bioinformatics.core@ucdavis.edu Microbial community analysis Goal:

More information

The gene. Fig. 1. The general structure of gene

The gene. Fig. 1. The general structure of gene The gene is the basic unit of heredity and carries the genetic information for a given protein and/or RNA molecule. In biochemical terms a gene represents a fragment of deoxyribonucleic acid (DNA), which

More information

G E N OM I C S S E RV I C ES

G E N OM I C S S E RV I C ES GENOMICS SERVICES ABOUT T H E N E W YOR K G E NOM E C E N T E R NYGC is an independent non-profit implementing advanced genomic research to improve diagnosis and treatment of serious diseases. Through

More information

Next- genera*on Sequencing. Lecture 13

Next- genera*on Sequencing. Lecture 13 Next- genera*on Sequencing Lecture 13 ChIP- seq Applica*ons iden%fy sequence varia%ons DNA- seq Iden%fy Pathogens RNA- seq Kahvejian et al, 2008 Protein-DNA interaction DNA is the informa*on carrier of

More information