(Practical) Bioinformatics for CRISPR/Cas9


1 (Practical) Bioinformatics for CRISPR/Cas9
Jacob Corn, IGI Workshop 2016
- Bioinformatics is (mostly) things you could do yourself, just done very fast

2 What makes these guides different?
- GAGTCCGAGCAGAAGAAGAA NGG (EMX1): highly active, relatively specific
- GACCCCCTCCACCCCGCCTC NGG (VEGFA site 2): highly active, many off-targets
- GCGACGAGTCTCATTCAAAC NGG (deubiquitinase): no activity at all

3 General guide design algorithm
- Find guides (the region depends on the experiment)
- Map them to a reference genome: guide site; potential off-targets; genes, transcripts, exons; other annotations (DNase, etc.)
- Score: features of the guide itself; annotations
We'll cover these in detail for a few popular approaches.

4 Finding guides = finding PAMs
GCAGAGTCCAAGCAGAAGAAGAACGGGTCGTA
- Look for PAMs
- Count backwards 20 bases
GAGTCCAAGCAGAAGAAGAA CGG
AGTCCAAGCAGAAGAAGAAC GGG
ACGACCGTTCTTCTTCTGCT CCA
AGCAGAAGAAGAACGGTCGT TGG
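The two steps on this slide (look for PAMs, count backwards 20 bases) can be sketched in a few lines of Python. This is a minimal illustration, not the method of any particular tool: the function names `revcomp` and `find_guides` are made up for this example, and it assumes an SpCas9-style NGG PAM and uppercase A/C/G/T input.

```python
# Minimal sketch: enumerate 20-nt protospacers followed by an NGG PAM on both strands.
# Function names are illustrative, not taken from any existing tool.

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def find_guides(seq, guide_len=20):
    """Return a list of (protospacer, pam, strand) tuples.

    For each strand, i is the index of the PAM's first base (the N of NGG);
    the protospacer is the guide_len bases immediately 5' of it.
    """
    guides = []
    for strand, s in (("+", seq), ("-", revcomp(seq))):
        for i in range(guide_len, len(s) - 2):
            if s[i + 1:i + 3] == "GG":
                guides.append((s[i - guide_len:i], s[i:i + 3], strand))
    return guides


if __name__ == "__main__":
    for protospacer, pam, strand in find_guides("GCAGAGTCCAAGCAGAAGAAGAACGGGTCGTA"):
        print(strand, protospacer, pam)
```

Scanning the reverse complement is what picks up guides that appear as CCN on the displayed strand.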

5 How might we map guides?
- A whole sea of a genome: where is a 20mer (or sequences like it)?
- Analogous to short-read next-gen sequencing
- BWA (Li and Durbin, 2009, Bioinformatics)
- Cas-OFFinder (Bae et al., 2014, Bioinformatics)
- Langmead et al., 2009, Genome Biology; Langmead et al., 2011, Curr Protocols Bioinformatics
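To make the "things you could do yourself, just done very fast" point concrete, here is a naive sketch of the search that the mapping tools perform efficiently. It is not how BWA or Cas-OFFinder work internally. It slides a window along the forward strand only, requires an NGG PAM, and counts mismatches. The names `hamming` and `scan_for_sites` and the default of 3 mismatches are assumptions for illustration.

```python
# Naive sketch of guide mapping: brute-force scan with a mismatch budget.
# Real tools use indexes or GPUs to do this genome-wide; this is only a toy.


def hamming(a, b):
    """Number of positions at which two equal-length strings differ."""
    return sum(x != y for x, y in zip(a, b))


def scan_for_sites(genome, guide, max_mismatches=3):
    """Return (position, site, mismatches) for forward-strand windows that are
    followed by an NGG PAM and differ from the guide at <= max_mismatches bases."""
    n = len(guide)
    hits = []
    for i in range(len(genome) - n - 2):
        pam = genome[i + n:i + n + 3]
        if pam[1:] != "GG":
            continue  # no PAM, so not a candidate site
        site = genome[i:i + n]
        mismatches = hamming(site, guide)
        if mismatches <= max_mismatches:
            hits.append((i, site, mismatches))
    return hits


if __name__ == "__main__":
    guide = "GAGTCCGAGCAGAAGAAGAA"
    toy_genome = "TTT" + guide + "AGG" + "CCCC" + "GAATCCGAGCAGAAGTAGAA" + "TGG" + "A"
    for hit in scan_for_sites(toy_genome, guide):
        print(hit)
```

A full search would also scan the reverse complement and, for tools that support them, bulges.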

6 Now we know guide coordinates
- chr : chr :
- Resources to find what's there (the choice of database is outside the scope of this workshop)
- Make sure you're using consistent genome versions (e.g. GRCh37 vs GRCh38)!

7 We want guides to be active
Easy things to avoid, but sometimes forgotten:
- 4+ T/U in a row → Pol III terminator
- Avoid homopolymers in general
- Keep GC content above 30% and below 80%
- Avoid stable secondary structure in the protospacer (Vienna fold prediction): no structure within the protospacer, and the protospacer doesn't interfere with the constant region
But surely it's more complicated than that. Do active guides have certain sequences?
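The simple sequence checks above are easy to automate. The sketch below covers the T/U run, homopolymer, and GC rules; it leaves out secondary structure, which needs a folding program. The function names and the homopolymer cutoff `max_run=4` are assumptions, since the slide gives no number for "homopolymers in general".

```python
# Minimal sketch of the "easy things to avoid" filters for a protospacer.
# Thresholds other than TTTT and the 30-80% GC window are illustrative guesses.
import re


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def longest_run(seq):
    """Length of the longest homopolymer run."""
    return max(len(m.group(0)) for m in re.finditer(r"(.)\1*", seq))


def passes_basic_filters(guide, min_gc=0.30, max_gc=0.80, max_run=4):
    guide = guide.upper().replace("U", "T")
    if "TTTT" in guide:
        return False  # 4+ T/U in a row can terminate Pol III transcription
    if longest_run(guide) > max_run:
        return False  # assumed cutoff for "avoid homopolymers in general"
    return min_gc < gc_fraction(guide) < max_gc


if __name__ == "__main__":
    for g in ("GAGTCCGAGCAGAAGAAGAA", "GACTTTTTGCAGCAGGACCA"):
        print(g, passes_basic_filters(g))
```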

8 A screen to find properties of active guides (single-cut gene knockout)
Doench et al., Nat Biotech 2014
- Make every guide across a surface marker gene
- Turn into a lentiviral library
- Sort for loss of protein
- Next-gen sequencing → active vs inactive guides?
[Figure: CD33 guide activity across 3 lines, good guides vs bad guides]
- Guide activity can be cell-line dependent
- Local sequence effects might be a problem
But what about on average?

9 Training features of active guides
- What sequences are enriched/depleted in the 20% most active guides?
[Figure labels: position 20 is good / position 20 is bad]
- Complex → we need a predictor
- Turn each base at each position (30mer) into a feature
- Also include dimers and GC content
- What separates the top 25% of guides from all others?
- Use machine learning to find sequence preferences → Rule Set 1
Doench et al., Nat Biotech 2014
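"Turn each base at each position into a feature" is what is usually called one-hot encoding. The sketch below shows one way to build such features for a 30mer. It is not the Rule Set 1 code; the feature names, the sparse-dictionary layout, and the assumption that the 20-nt protospacer starts at offset 4 of the 30mer are illustrative choices.

```python
# Minimal sketch of position-specific sequence features for a 30mer context.
# Only features that are "on" are stored (a sparse one-hot encoding).


def featurize(seq30, protospacer_start=4, protospacer_len=20):
    """Return a dict of feature name -> value for one 30mer."""
    seq30 = seq30.upper()
    feats = {}
    # Single-base features: which base sits at each position (1-based).
    for i, base in enumerate(seq30, start=1):
        feats[f"pos{i}_{base}"] = 1
    # Dimer features: which pair of adjacent bases starts at each position.
    for i in range(len(seq30) - 1):
        feats[f"pos{i + 1}_{seq30[i:i + 2]}"] = 1
    # GC content of the protospacer (assumed location within the 30mer).
    proto = seq30[protospacer_start:protospacer_start + protospacer_len]
    feats["gc_count"] = proto.count("G") + proto.count("C")
    return feats


if __name__ == "__main__":
    context = "ACGT" + "GAGTCCGAGCAGAAGAAGAA" + "AGG" + "TCA"
    features = featurize(context)
    print(len(features), features["gc_count"])
```

A classifier trained on such vectors (top guides vs all others) then assigns a weight to each feature, which is where plots like "position 20 is good" come from.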

10 Machine learning can be odd
- BEST learned properties: GC TC TA GA AC CC CG A A G G G G A C G
- WORST learned properties: GG GC G C G GG
- Matches the plot above (trained on this data)
- Seems to do well in back-prediction
- Calls good guides 50-75% of the time
TC T C
Doench et al., Nat Biotech 2014

11 Maybe we need even more features and training?
[Figure: contribution of the highest-ranked features, by feature #]
- Highest-ranked features: proximity to the 5' end of the gene; Tm; single- and double-base features (e.g. G at a given position)
→ Rule Set 2
Doench et al., Nat Biotech 2016

12 A tool to find active KO guides
- Machine learning works best when use = training
- Trained on: knockout by single indel; mouse and human cell lines; cell-surface expression, survival
- Finds highly active guides for KO pretty well
- Can falsely report good guides as bad
- Assumes One True Guide Sequence
- May not be good for CRISPRi/a
- Code is available on the website!
- Does what is advertised

13 But sequence is not the whole story????
- Cas9 preference: human, mouse (Doench et al.); zebrafish (Moreno-Mateos et al.)
- Guide activity correlates well between some cell lines and poorly between others

14 DNA repair also affects apparent guide activity
[Figure: -NOE vs +NOE]
Richardson and Ray et al., in Nature Communications

15 Finding active CRISPRi/a guides
Gilbert and Horlbeck et al., Cell 2014
- Tile guides around the transcription start site
- Look for phenotype
- dCas9-KRAB = CRISPRi: → +300
- dCas9-SunTag-VP64 = CRISPRa: -400 → -50
- Plus many more features

16 We want guides to be specific
Easy things to avoid:
- Repeat sequences
- Targeting a highly conserved part of a gene family
But is it more complicated? Based on data from Hsu et al., Nat Biotech 2013

17 Hsu et al., Nat Biotech 2013
Target-focused specificity measurements. Either:
- Make 1 guide, find all targets
- Make many guides, measure activity at one target
Target EMX1 (3 different places):
- Make all possible guide variants
- Which ones still target EMX1?

18 Guide seed region is stringent, non-seed is permissive
Hsu et al., Nat Biotech 2013
- Single mismatches can be tolerated in the non-seed region
- Alternative view: seed vs non-seed
AGTCCGAGCAGAAGAAGAA NGG (non-seed at the 5' end, seed next to the PAM)

19 Mismatches >= 3 are not tolerated
- Close mismatches are worse?
[Figure panels: mismatches together; mismatches apart; >= 3 mismatches apart]
Hsu et al., Nat Biotech 2013

20 Turning data into a score
- Trained weights from heatmap (training details a bit unclear)
- Combined as a product (like Σ, but products)
- Penalizes mismatches close together
- Penalizes total # of mismatches
pos W[e]
GAGTCCGAGCAGAAGAAGAA
GAATCCGAGCAGAAGAAGAA ( )
GAGTCCGAGCAGAAGAAGGA ( )
GAGTCCGAGCAGAAGTAGGA ( )*( )
Hsu et al., Nat Biotech 2013
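The three ingredients named on this slide (a product over per-position weights, a penalty for mismatches that sit close together, and a penalty for the total number of mismatches) can be combined as in the sketch below. The functional forms follow the commonly described formulation of this score but should be treated as assumptions here. The weights are placeholders invented for the example; they are NOT the trained W[e] values, which did not survive in this transcript.

```python
# Sketch of a Hsu-style single off-target score with three multiplied terms.
# PLACEHOLDER_WEIGHTS is hypothetical: it simply rises toward the PAM-proximal end.

PLACEHOLDER_WEIGHTS = [i / 25 for i in range(20)]  # not the published weights


def single_hit_score(guide, site, weights=PLACEHOLDER_WEIGHTS):
    """Score (0-100) for how likely `site` is to be cut by `guide`."""
    mm = [i for i, (g, s) in enumerate(zip(guide, site)) if g != s]
    if not mm:
        return 100.0
    # Term 1: product of (1 - W[e]) over mismatched positions e.
    position_term = 1.0
    for e in mm:
        position_term *= 1.0 - weights[e]
    # Term 2: penalize mismatches that are close together (small mean spacing).
    if len(mm) > 1:
        mean_gap = (mm[-1] - mm[0]) / (len(mm) - 1)
        spacing_term = 1.0 / (((19 - mean_gap) / 19) * 4 + 1)
    else:
        spacing_term = 1.0
    # Term 3: penalize the total number of mismatches.
    count_term = 1.0 / len(mm) ** 2
    return 100.0 * position_term * spacing_term * count_term


if __name__ == "__main__":
    guide = "GAGTCCGAGCAGAAGAAGAA"
    for site in (guide, "GAATCCGAGCAGAAGAAGAA", "GAGTCCGAGCAGAAGAAGGA", "GAGTCCGAGCAGAAGTAGGA"):
        print(site, round(single_hit_score(guide, site), 2))
```

With any weights that rise toward the PAM, a seed mismatch costs more than a distal one, which matches the seed/non-seed picture from Hsu et al.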

21 Total guide score
- Calculate score across all off-targets (unclear how a sequence is chosen as a potential off-target, but uses bowtie2)
- Score normalized to 100
- Many high-scoring off-targets decrease the score
- Off-targets in exons are noted but not extra penalized
- Reports exon hits, not introns, UTRs, etc.
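One way to get a score that is "normalized to 100" and drops as high-scoring off-targets accumulate is shown below. The exact formula is an assumption for illustration, and `guide_specificity` is a made-up name. It takes per-off-target scores on a 0-100 scale, such as those produced by a single-hit scoring function.

```python
# Sketch of aggregating per-off-target scores into one guide-level score.
# The 100 / (100 + sum) form is an assumed normalization, not a confirmed one.


def guide_specificity(off_target_scores):
    """Return 100 for a guide with no scored off-targets, falling toward 0
    as the summed off-target scores grow."""
    return 100.0 * 100.0 / (100.0 + sum(off_target_scores))


if __name__ == "__main__":
    print(guide_specificity([]))             # no off-targets found
    print(guide_specificity([2.5, 0.8]))     # a few weak off-targets
    print(guide_specificity([60.0, 45.0]))   # strong off-targets
```

The result depends entirely on which off-targets were found in the first place, so an incomplete search inflates the score.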

22 We can get fancier
- ~28,000 sgRNAs targeting human CD33: all on-target, all 1-base bulges, all 1-base mismatches
- 65 on-targets w/ phenotype; ~10,000 variants
- CFD: Cutting Frequency Determination
Doench et al., Nat Biotech 2016

23 How well does off-target prediction work?
Tsai et al., Nat Biotech 2014
- Both on- and off-target prediction are still in their infancy
- Plenty of room for improvement
What's wrong?

24 Just finding OTs makes a big difference??!!
- bowtie2 (MIT website) has trouble finding all potential off-targets to score
- Use Cas-OFFinder to find off-targets, score with Hsu-Zhang
[Figure: score vs website]
Haeussler et al., Genome Biology 2016; Doench et al., Nat Biotech 2016

25 Some easily accessible tools

26 crispr.mit input bp

27 crispr.mit output

28 Other tools: CHOPCHOP
- Heuristic ("arbitrary") scoring; not necessarily worse than trained!
- Allows any length sequence
- Easy T7E1 primer design

29 CHOPCHOP input

30 CHOPCHOP output

31 CCTop input

32 CCTop output
- Super fast
- Good enough(?)

33 Cas-OFFinder input
- Takes GUIDES as input
- Very slow
- Very methodical

34 There are a lot of tools out there

35 Benchling or DesktopGenetics demo as meta guide designers

36 Questions?

37 crispr.mit.edu
- Based on Hsu et al., Nat Biotech 2013
- Find all sgRNAs in a nt sequence
- What are likely off-targets?
- Supports many organisms: human, mouse, zebrafish, worm, fly, etc.
- Nice graphical interface
We'll talk more about this in detail.

38 CHOPCHOP: described in Montague et al., NAR 2014

Improving CRISPR-Cas9 Gene Knockout with a Validated Guide RNA Algorithm

Improving CRISPR-Cas9 Gene Knockout with a Validated Guide RNA Algorithm Improving CRISPR-Cas9 Gene Knockout with a Validated Guide RNA Algorithm Anja Smith Director R&D Dharmacon, part of GE Healthcare Imagination at work crrna:tracrrna program Cas9 nuclease Active crrna is

More information

CRISPR Design Considera1ons

CRISPR Design Considera1ons CRISPR Design Considera1ons Shifra Ben-Dor Bioinforma1cs Unit Life Sciences Core Facili1es Weizmann Ins1tute of Science June 2018 Jinek et al, elife 2013;2:e00471 Cradick et al Outline Define your

More information

Supplementary Figure 1

Supplementary Figure 1 number of cells, normalized number of cells, normalized number of cells, normalized Supplementary Figure CD CD53 Cd3e fluorescence intensity fluorescence intensity fluorescence intensity Supplementary

More information

Reviewers' Comments: Reviewer #1 (Remarks to the Author)

Reviewers' Comments: Reviewer #1 (Remarks to the Author) Reviewers' Comments: Reviewer #1 (Remarks to the Author) In this study, Rosenbluh et al reported direct comparison of two screening approaches: one is genome editing-based method using CRISPR-Cas9 (cutting,

More information

Pooled CRISPR guide RNA libraries for functional genomics screening: Do you know what s in your library?

Pooled CRISPR guide RNA libraries for functional genomics screening: Do you know what s in your library? Pooled CRISPR guide RNA libraries for functional genomics screening: Do you know what s in your library? Peter Sheffield R&D Scientific Program Director Agilent Technologies INTRODUCTION CRISPR: A Programmable

More information

C3BI. VARIANTS CALLING November Pierre Lechat Stéphane Descorps-Declère

C3BI. VARIANTS CALLING November Pierre Lechat Stéphane Descorps-Declère C3BI VARIANTS CALLING November 2016 Pierre Lechat Stéphane Descorps-Declère General Workflow (GATK) software websites software bwa picard samtools GATK IGV tablet vcftools website http://bio-bwa.sourceforge.net/

More information

Generation of App knock-in mice reveals deletion mutations protective against Alzheimer s. disease-like pathology. Nagata et al.

Generation of App knock-in mice reveals deletion mutations protective against Alzheimer s. disease-like pathology. Nagata et al. Generation of App knock-in mice reveals deletion mutations protective against Alzheimer s disease-like pathology Nagata et al. Supplementary Fig 1. Previous App knock-in model did not show Aβ accumulation

More information

You use the UCSC Genome Browser (www.genome.ucsc.edu) to assess the exonintron structure of each gene. You use four tracks to show each gene:

You use the UCSC Genome Browser (www.genome.ucsc.edu) to assess the exonintron structure of each gene. You use four tracks to show each gene: CRISPR-Cas9 genome editing Part 1: You would like to rapidly generate two different knockout mice using CRISPR-Cas9. The genes to be knocked out are Pcsk9 and Apoc3, both involved in lipid metabolism.

More information

Collect, analyze and synthesize. Annotation. Annotation for D. virilis. Evidence Based Annotation. GEP goals: Evidence for Gene Models 08/22/2017

Collect, analyze and synthesize. Annotation. Annotation for D. virilis. Evidence Based Annotation. GEP goals: Evidence for Gene Models 08/22/2017 Annotation Annotation for D. virilis Chris Shaffer July 2012 l Big Picture of annotation and then one practical example l This technique may not be the best with other projects (e.g. corn, bacteria) l

More information

Collect, analyze and synthesize. Annotation. Annotation for D. virilis. GEP goals: Evidence Based Annotation. Evidence for Gene Models 12/26/2018

Collect, analyze and synthesize. Annotation. Annotation for D. virilis. GEP goals: Evidence Based Annotation. Evidence for Gene Models 12/26/2018 Annotation Annotation for D. virilis Chris Shaffer July 2012 l Big Picture of annotation and then one practical example l This technique may not be the best with other projects (e.g. corn, bacteria) l

More information

Designing CRISPR mediated Gene disruptions with gblocks Gene Fragments

Designing CRISPR mediated Gene disruptions with gblocks Gene Fragments Designing CRISPR mediated Gene disruptions with gblocks Gene Fragments Adam Clore, PhD Manager, Synthetic Biology Design Integrated DNA technology Typical CRISPR timeline in Mammalian cell lines Design

More information

Analysis of neo-antigens to identify T-cell neo-epitopes in human Head & Neck cancer. Project XX1001. Customer Detail

Analysis of neo-antigens to identify T-cell neo-epitopes in human Head & Neck cancer. Project XX1001. Customer Detail Analysis of neo-antigens to identify T-cell neo-epitopes in human Head & Neck cancer Project XX Customer Detail Table of Contents. Bioinformatics analysis pipeline...3.. Read quality check. 3.2. Read alignment...3.3.

More information

Supplementary Materials. China

Supplementary Materials. China Supplementary Materials An Efficient Genotyping Method for Genome-modified Animals and Human Cells Generated with CRISPR/Cas9 System Xiaoxiao Zhu 1,2 *, Yajie Xu 1 *, Shanshan Yu 1 *, Lu Lu 1,2, Mingqin

More information

Galaxy Platform For NGS Data Analyses

Galaxy Platform For NGS Data Analyses Galaxy Platform For NGS Data Analyses Weihong Yan wyan@chem.ucla.edu Collaboratory Web Site http://qcb.ucla.edu/collaboratory http://collaboratory.lifesci.ucla.edu Workshop Outline ü Day 1 UCLA galaxy

More information

The first thing you will see is the opening page. SeqMonk scans your copy and make sure everything is in order, indicated by the green check marks.

The first thing you will see is the opening page. SeqMonk scans your copy and make sure everything is in order, indicated by the green check marks. Open Seqmonk Launch SeqMonk The first thing you will see is the opening page. SeqMonk scans your copy and make sure everything is in order, indicated by the green check marks. SeqMonk Analysis Page 1 Create

More information

Introduction to RNA-Seq in GeneSpring NGS Software

Introduction to RNA-Seq in GeneSpring NGS Software Introduction to RNA-Seq in GeneSpring NGS Software Dipa Roy Choudhury, Ph.D. Strand Scientific Intelligence and Agilent Technologies Learn more at www.genespring.com Introduction to RNA-Seq In a few years,

More information

TECHNOLOGIES. 3/19/18 Kayla Nygaard. https://i.ytimg.com/vi/pxw5yya-kh0/maxresdefault.jpg

TECHNOLOGIES. 3/19/18 Kayla Nygaard. https://i.ytimg.com/vi/pxw5yya-kh0/maxresdefault.jpg TECHNOLOGIES 3/19/18 Kayla Nygaard https://i.ytimg.com/vi/pxw5yya-kh0/maxresdefault.jpg CRISPR IN THE NEWS CRISPR in 2018: Coming to a Human Near You Sickle-cell treatment clinical trials CRISPR Therapeutics

More information

Introduction to CRISPR/Cas9 Background DNA Cleavage and Repair (NHEJ and HDR) Alternative Cas9 Variants Delivery of Cas9 and sgrna Library Products

Introduction to CRISPR/Cas9 Background DNA Cleavage and Repair (NHEJ and HDR) Alternative Cas9 Variants Delivery of Cas9 and sgrna Library Products Introduction to CRISPR/Cas9 Background DNA Cleavage and Repair (NHEJ and HDR) Alternative Cas9 Variants Delivery of Cas9 and sgrna Library Products which one is right for you? CRISPR Workflow abm s Toolbox

More information

SNP calling and VCF format

SNP calling and VCF format SNP calling and VCF format Laurent Falquet, Oct 12 SNP? What is this? A type of genetic variation, among others: Family of Single Nucleotide Aberrations Single Nucleotide Polymorphisms (SNPs) Single Nucleotide

More information

CRISPR 101: Optimizing Your Gene Editing Experiments

CRISPR 101: Optimizing Your Gene Editing Experiments CRISPR 101: Optimizing Your Gene Editing Experiments PRESENTER Michele Auldridge, Ph.D. Senior Scientist, R&D MODERATOR Beth Frey Product Manager Agenda 1 Technology Overview 2 Experimental Decisions 3

More information

A Guide to CRISPR/Cas9

A Guide to CRISPR/Cas9 Genome editing and beyond freepik A Guide to CRISPR/Cas9 The latest advance in genomic DNA editing is the Clustered Regularly Interspaced Short Palindromic Repeat (CRISPR)/Cas9 system. This simple-touse

More information

CRISPR GENOMIC SERVICES PRODUCT CATALOG

CRISPR GENOMIC SERVICES PRODUCT CATALOG CRISPR GENOMIC SERVICES PRODUCT CATALOG DESIGN BUILD ANALYZE The experts at Desktop Genetics can help you design, prepare and manufacture all of the components needed for your CRISPR screen. We provide

More information

Accelerating Genomic Computations 1000X with Hardware

Accelerating Genomic Computations 1000X with Hardware Accelerating Genomic Computations 1000X with Hardware Yatish Turakhia EE PhD candidate Stanford University Prof. Bill Dally (Electrical Engineering and Computer Science) Prof. Gill Bejerano (Computer Science,

More information

ChIP-seq analysis 2/28/2018

ChIP-seq analysis 2/28/2018 ChIP-seq analysis 2/28/2018 Acknowledgements Much of the content of this lecture is from: Furey (2012) ChIP-seq and beyond Park (2009) ChIP-seq advantages + challenges Landt et al. (2012) ChIP-seq guidelines

More information

Construct Design and Cloning Guide for Cas9-triggered homologous recombination

Construct Design and Cloning Guide for Cas9-triggered homologous recombination Construct Design and Cloning Guide for Cas9-triggered homologous recombination Written by Dan Dickinson (ddickins@live.unc.edu) and last updated December 2013. Reference: Dickinson DJ, Ward JD, Reiner

More information

How to deal with your RNA-seq data?

How to deal with your RNA-seq data? How to deal with your RNA-seq data? Rachel Legendre, Thibault Dayris, Adrien Pain, Claire Toffano-Nioche, Hugo Varet École de bioinformatique AVIESAN-IFB 2017 1 Rachel Legendre Bioinformatics 27/11/2018

More information

UAB DNA-Seq Analysis Workshop. John Osborne Research Associate Centers for Clinical and Translational Science

UAB DNA-Seq Analysis Workshop. John Osborne Research Associate Centers for Clinical and Translational Science + UAB DNA-Seq Analysis Workshop John Osborne Research Associate Centers for Clinical and Translational Science ozborn@uab.,edu + Thanks in advance You are the Guinea pigs for this workshop! At this point

More information

Mapping Next Generation Sequence Reads. Bingbing Yuan Dec. 2, 2010

Mapping Next Generation Sequence Reads. Bingbing Yuan Dec. 2, 2010 Mapping Next Generation Sequence Reads Bingbing Yuan Dec. 2, 2010 1 What happen if reads are not mapped properly? Some data won t be used, thus fewer reads would be aligned. Reads are mapped to the wrong

More information

Quantifying gene expression

Quantifying gene expression Quantifying gene expression Genome GTF (annotation)? Sequence reads FASTQ FASTQ (+reference transcriptome index) Quality control FASTQ Alignment to Genome: HISAT2, STAR (+reference genome index) (known

More information

Array Informatics. Mark Gerstein

Array Informatics. Mark Gerstein 1 Lectures.GersteinLab.org (c) Array Informatics Mark Gerstein CEGS Informatics Developing Tools and Technical Analyses Related to Genome Technologies Main Genome Technologies Tiling Arrays Next Generation

More information

Ensembl Tools. EBI is an Outstation of the European Molecular Biology Laboratory.

Ensembl Tools. EBI is an Outstation of the European Molecular Biology Laboratory. Ensembl Tools EBI is an Outstation of the European Molecular Biology Laboratory. Questions? We ve muted all the mics Ask questions in the Chat box in the webinar interface I will check the Chat box periodically

More information

Nature Biotechnology: doi: /nbt Supplementary Figure 1. In vitro validation of OTC sgrnas and donor template.

Nature Biotechnology: doi: /nbt Supplementary Figure 1. In vitro validation of OTC sgrnas and donor template. Supplementary Figure 1 In vitro validation of OTC sgrnas and donor template. (a) In vitro validation of sgrnas targeted to OTC in the MC57G mouse cell line by transient transfection followed by 4-day puromycin

More information

Systematic comparison of CRISPR/Cas9 and RNAi screens for essential genes

Systematic comparison of CRISPR/Cas9 and RNAi screens for essential genes CORRECTION NOTICE Nat. Biotechnol. doi:10.1038/nbt. 3567 Systematic comparison of CRISPR/Cas9 and RNAi screens for essential genes David W Morgens, Richard M Deans, Amy Li & Michael C Bassik In the version

More information

Successful gene expression studies using validated qpcr assays. Jan Hellemans, CEO Biogazelle webinar October 28 th, 2015

Successful gene expression studies using validated qpcr assays. Jan Hellemans, CEO Biogazelle webinar October 28 th, 2015 Successful gene expression studies using validated qpcr assays Jan Hellemans, CEO Biogazelle webinar October 28 th, 2015 Agenda Requirements for high quality qpcr assays Approaches for qpcr assay validation

More information

Basics of RNA-Seq. (With a Focus on Application to Single Cell RNA-Seq) Michael Kelly, PhD Team Lead, NCI Single Cell Analysis Facility

Basics of RNA-Seq. (With a Focus on Application to Single Cell RNA-Seq) Michael Kelly, PhD Team Lead, NCI Single Cell Analysis Facility 2018 ABRF Meeting Satellite Workshop 4 Bridging the Gap: Isolation to Translation (Single Cell RNA-Seq) Sunday, April 22 Basics of RNA-Seq (With a Focus on Application to Single Cell RNA-Seq) Michael Kelly,

More information

CRISPRseek Workshop Design of target-specific guide RNAs in CRISPR-Cas9 genome-editing systems

CRISPRseek Workshop Design of target-specific guide RNAs in CRISPR-Cas9 genome-editing systems April 2008 CRISPRseek Workshop Design of target-specific guide RNAs in CRISPR-Cas9 genome-editing systems Lihua Julie Zhu August 1st 2014 Outline Background and Motives CRISPRseek Functionality Dependency

More information

Optimized, chemically-modified crrna:tracrrna complexes for CRISPR gene editing

Optimized, chemically-modified crrna:tracrrna complexes for CRISPR gene editing Optimized, chemically-modified crrna:tracrrna complexes for CRISPR gene editing Mark Behlke MD, PhD Chief Scientific Officer February 24, 2016 1 Implementing CRISPR/Cas9 gene editing 2 To focus on RNA

More information

Database Searching and BLAST Dannie Durand

Database Searching and BLAST Dannie Durand Computational Genomics and Molecular Biology, Fall 2013 1 Database Searching and BLAST Dannie Durand Tuesday, October 8th Review: Karlin-Altschul Statistics Recall that a Maximal Segment Pair (MSP) is

More information

Question 2: There are 5 retroelements (2 LINEs and 3 LTRs), 6 unclassified elements (XDMR and XDMR_DM), and 7 satellite sequences.

Question 2: There are 5 retroelements (2 LINEs and 3 LTRs), 6 unclassified elements (XDMR and XDMR_DM), and 7 satellite sequences. Bio4342 Exercise 1 Answers: Detecting and Interpreting Genetic Homology (Answers prepared by Wilson Leung) Question 1: Low complexity DNA can be described as sequences that consist primarily of one or

More information

Mapping strategies for sequence reads

Mapping strategies for sequence reads Mapping strategies for sequence reads Ernest Turro University of Cambridge 21 Oct 2013 Quantification A basic aim in genomics is working out the contents of a biological sample. 1. What distinct elements

More information

Barcode Sequence Alignment and Statistical Analysis (Barcas) tool

Barcode Sequence Alignment and Statistical Analysis (Barcas) tool Barcode Sequence Alignment and Statistical Analysis (Barcas) tool 2016.10.05 Mun, Jihyeob and Kim, Seon-Young Korea Research Institute of Bioscience and Biotechnology Barcode-Sequencing Ø Genome-wide screening

More information

Bio5488 Practice Midterm (2018) 1. Next-gen sequencing

Bio5488 Practice Midterm (2018) 1. Next-gen sequencing 1. Next-gen sequencing 1. You have found a new strain of yeast that makes fantastic wine. You d like to sequence this strain to ascertain the differences from S. cerevisiae. To accurately call a base pair,

More information

domain. Bottom panel: hybrid surface/ribbon structure (PDB ID: 4UN3) of SpCas9 in complex with sgrna and target DNA. The REC3

domain. Bottom panel: hybrid surface/ribbon structure (PDB ID: 4UN3) of SpCas9 in complex with sgrna and target DNA. The REC3 Supplementary Figure 1 Yeast screening for high-specificity SpCas9 variants (a) Top panel: scheme of SpCas9 domains. The REC3 domain is part of the recognition lobe. BH: bridge helix. PI: PAM-interacting

More information

Lecture 5: Regulation

Lecture 5: Regulation Machine Learning in Computational Biology CSC 2431 Lecture 5: Regulation Instructor: Anna Goldenberg Central Dogma of Biology Transcription DNA RNA protein Process of producing RNA from DNA Constitutive

More information

Frequently Asked Questions

Frequently Asked Questions The DNA Files: Workshops and Activities The DNA Files workshops are an outreach component of The DNA Files public radio documentary series produced by SoundVision Productions with funding from the National

More information

Application of NGS (next-generation sequencing) for studying RNA regulation

Application of NGS (next-generation sequencing) for studying RNA regulation Application of NGS (next-generation sequencing) for studying RNA regulation SAIHST, SKKU Sung Wook Chi In this lecturre Intro: Sequencing Technology NGS (Next-Generation Sequencing) Sequencing of RNAs

More information

Sequence Annotation & Designing Gene-specific qpcr Primers (computational)

Sequence Annotation & Designing Gene-specific qpcr Primers (computational) James Madison University From the SelectedWorks of Ray Enke Ph.D. Fall October 31, 2016 Sequence Annotation & Designing Gene-specific qpcr Primers (computational) Raymond A Enke This work is licensed under

More information

Introduction to Short Read Alignment. UCD Genome Center Bioinformatics Core Tuesday 14 June 2016

Introduction to Short Read Alignment. UCD Genome Center Bioinformatics Core Tuesday 14 June 2016 Introduction to Short Read Alignment UCD Genome Center Bioinformatics Core Tuesday 14 June 2016 From reads to molecules Why align? Individual A Individual B ATGATAGCATCGTCGGGTGTCTGCTCAATAATAGTGCCGTATCATGCTGGTGTTATAATCGCCGCATGACATGATCAATGG

More information

Transcriptome analysis

Transcriptome analysis Statistical Bioinformatics: Transcriptome analysis Stefan Seemann seemann@rth.dk University of Copenhagen April 11th 2018 Outline: a) How to assess the quality of sequencing reads? b) How to normalize

More information

Systematic evaluation of spliced alignment programs for RNA- seq data

Systematic evaluation of spliced alignment programs for RNA- seq data Systematic evaluation of spliced alignment programs for RNA- seq data Pär G. Engström, Tamara Steijger, Botond Sipos, Gregory R. Grant, André Kahles, RGASP Consortium, Gunnar Rätsch, Nick Goldman, Tim

More information

Annotating your variants: Ensembl Variant Effect Predictor (VEP) Helen Sparrow Ensembl EMBL-EBI 2nd November 2016

Annotating your variants: Ensembl Variant Effect Predictor (VEP) Helen Sparrow Ensembl EMBL-EBI 2nd November 2016 Training materials Ensembl training materials are protected by a CC BY license http://creativecommons.org/licenses/by/4.0/ If you wish to re-use these materials, please credit Ensembl for their creation

More information

Genome Engineering with ZFNs, TALENs and CRISPR/Cas9

Genome Engineering with ZFNs, TALENs and CRISPR/Cas9 Genome Engineering with ZFNs, TALENs and CRISPR/Cas9 Designer Endonucleases ZFNs (zinc finger nucleases), TALENs (transcription activator-like effector nucleases) and CRISPR/Cas9 (clustered regularly interspaced

More information

Applying CRISPR in Environmental Health Research: From Cells to Human Populations Luoping Zhang

Applying CRISPR in Environmental Health Research: From Cells to Human Populations Luoping Zhang NASEM Workshop: The Promise of Genome Editing Tools in EHR Washington, D.C. January 10-11, 2018 Applying CRISPR in Environmental Health Research: From Cells to Human Populations Luoping Zhang School of

More information

CRISPR RNA-guided activation of endogenous human genes

CRISPR RNA-guided activation of endogenous human genes CRISPR RNA-guided activation of endogenous human genes Morgan L Maeder, Samantha J Linder, Vincent M Cascio, Yanfang Fu, Quan H Ho, J Keith Joung Supplementary Figure 1 Comparison of VEGF activation induced

More information

Genome annotation & EST

Genome annotation & EST Genome annotation & EST What is genome annotation? The process of taking the raw DNA sequence produced by the genome sequence projects and adding the layers of analysis and interpretation necessary

More information

A Brief History. Bootstrapping. Bagging. Boosting (Schapire 1989) Adaboost (Schapire 1995)

A Brief History. Bootstrapping. Bagging. Boosting (Schapire 1989) Adaboost (Schapire 1995) A Brief History Bootstrapping Bagging Boosting (Schapire 1989) Adaboost (Schapire 1995) What s So Good About Adaboost Improves classification accuracy Can be used with many different classifiers Commonly

More information

SUPPLEMENTAL MATERIALS

SUPPLEMENTAL MATERIALS SUPPLEMENL MERILS Eh-seq: RISPR epitope tagging hip-seq of DN-binding proteins Daniel Savic, E. hristopher Partridge, Kimberly M. Newberry, Sophia. Smith, Sarah K. Meadows, rian S. Roberts, Mark Mackiewicz,

More information

Figure S4 A-H : Initiation site properties and evolutionary changes

Figure S4 A-H : Initiation site properties and evolutionary changes A 0.3 Figure S4 A-H : Initiation site properties and evolutionary changes G-correction not used 0.25 Fraction of total counts 0.2 0.5 0. tag 2 tags 3 tags 4 tags 5 tags 6 tags 7tags 8tags 9 tags >9 tags

More information

Introduction to BIOINFORMATICS

Introduction to BIOINFORMATICS COURSE OF BIOINFORMATICS a.a. 2016-2017 Introduction to BIOINFORMATICS What is Bioinformatics? (I) The sinergy between biology and informatics What is Bioinformatics? (II) From: http://www.bioteach.ubc.ca/bioinfo2010/

More information

Parts of a standard FastQC report

Parts of a standard FastQC report FastQC FastQC, written by Simon Andrews of Babraham Bioinformatics, is a very popular tool used to provide an overview of basic quality control metrics for raw next generation sequencing data. There are

More information

Sequence Based Function Annotation

Sequence Based Function Annotation Sequence Based Function Annotation Qi Sun Bioinformatics Facility Biotechnology Resource Center Cornell University Sequence Based Function Annotation 1. Given a sequence, how to predict its biological

More information

Transcriptomics analysis with RNA seq: an overview Frederik Coppens

Transcriptomics analysis with RNA seq: an overview Frederik Coppens Transcriptomics analysis with RNA seq: an overview Frederik Coppens Platforms Applications Analysis Quantification RNA content Platforms Platforms Short (few hundred bases) Long reads (multiple kilobases)

More information

ChIP-seq and RNA-seq. Farhat Habib
