(Practical) Bioinformatics for CRISPR/Cas9
|
|
- Alvin Payne
- 5 years ago
- Views:
Transcription
1 (Practical) Bioinformatics for CRISPR/Cas9 Jacob Corn IGI Workshop 2016 Bioinformatics is (mostly) things you could do yourself Just done very fast
2 What makes these guides different? GAGTCCGAGCAGAAGAAGAA NGG EMX1 highly active relatively specific GACCCCCTCCACCCCGCCTC NGG VEGFA site2 highly active many off-targets GCGACGAGTCTCATTCAAAC NGG deubiquitinase no activity at all
3 General guide design algorithm Find guides region depends on experiment Map them to a reference genome guide site potential off-targets genes, transcripts, exons other annotations (DNAse, etc) Score features of guide itself annotations We ll cover these in detail for a few popular approaches
4 Finding guides = finding PAMs GCAGAGTCCAAGCAGAAGAAGAACGGGTCGTA look for PAMs count backwards 20 bases GAGTCCAAGCAGAAGAAGAA CGG AGTCCAAGCAGAAGAAGAAC GGG ACGACCGTTCTTCTTCTGCT CCA AGCAGAAGAAGAACGGTCGT TGG
5 How might we map guides? Whole sea of a genome Where is a 20mer? (or seqs like it) Analogous to short-read next gen sequencing BWA Cas-OFFinder Bae et al., 2014, Bioinformatics Li and Durbin, 2009, Bioinformatics Langmead et al., 2009, Genome Biology Langmead et al., 2011, Curr Protocols Bioinformatics
6 Now we know guide coordinates chr : chr : Resources to find what s there Choice of database outside the scope of this workshop Make sure you re using consistent genome versions! (e.g. GRCh37 vs GRCh38)
7 We want guides to be active Easy things to avoid, but sometimes forgotten: 4+ T/U in a row à Pol III terminator Avoid homopolymers in general GC > 30% or < 80% Avoid stable secondary structure in protospacer (Vienna fold prediction no structure in protospacer also protospacer doesn t interfere with constant region But surely it s more complicated than that Do active guides have certain sequences?
8 A screen to find properties of active guides (single cut gene knockout) Doench et al., Nat Biotech 2014 Make every guide across a surface marker gene Turn into a lentiviral library Sort for loss of protein Next-gen sequencing à active vs inactive guides? good guides CD33 guide activity across 3 lines bad guides Guide activity can be cell-line dependent Local sequence effects might be a problem But what about on average?
9 Training features of active guides What sequences are enriched/depleted in 20% most active guides? position 20 is good position 20 is bad Complex à We need a predictor Turn each base at each position (30mer) into a feature Also include dimers and GC content What separates top 25% guides from all others? Use machine learning to find sequence preferences à Rule Set 1 Doench et al., Nat Biotech 2014
10 Machine learning can be odd BEST learned properties GC TC TA GA AC CC CG A A G G G G A C G WORST learned properties GG GC G C G GG Matches what the plot above (trained on this data) Seems to do well in back-prediction Calls good guides 50-75% of the time TC T C Doench et al., Nat Biotech 2014
11 Maybe we need even more features and training? Contribution Highest ranked features Proximity to 5 end of gene Tm single and double base features e.g. G at position Feature # à Rule Set 2 Doench et al., Nat Biotech 2016
12 A tool to find active ko guides Machine learning works best when use = training Trained on Knockout by single indel Mouse and human cell lines Cell surface expression, survival Finds highly active guides for ko pretty well Can false-report good guides as bad Assumes One True Guide Sequence May not be good for CRISPRi/a Code is available on the website! Does what is advertised
13 But sequence is not the whole story???? Cas9 preference human mouse Doench et al. zebrafish Moreno-Mateos et al. some guides in different cells correlate well other cell lines correlate poorly
14 DNA repair also affects apparent guide activity -NOE +NOE NOE Richardson and Ray et al., in Nature Communications
15 Finding active CRISPRi/a guides Gilbert and Horlbeck et al., Cell 2014 Tile guides around transcription start site Look for phenotype dcas9-krab = CRISPRi Plus many more features à +300 dcas9-sun-vp64 = CRISPRa -400 à -50
16 We want guides to be specific Easy things to avoid: repeat sequences targeting highly conserved part of a gene family But is it more complicated? Based on data from Hsu et al., Nat Biotech 2013
17 Hsu et al, Nat Biotech 2013 Target-focused specificity measurements Either Make 1 guide, find all targets Make many guides, measure activity at one target Target EMX1(3 different places) Make all possible guide variants Which ones still target EMX1?
18 Guide seed region is stringent Non-seed is permissive Hsu et al, Nat Biotech 2013 single mismatches can be tolerated in non-seed region alternative seed vs non-seed AGTCCGAGCAGAAGAAGAA NGG non-seed seed
19 Mismatches >= 3 are not tolerated Close mismatches are worse? mismatches together mismatches apart >= 3 mismatches apart Hsu et al, Nat Biotech 2013
20 Turning data in to a score Trained weights from heatmap (training details a bit unclear) (likeσ, but products) penalizes mismatches close together penalizes total #of mismatches pos W[e] GAGTCCGAGCAGAAGAAGAA GAATCCGAGCAGAAGAAGAA ( ) GAGTCCGAGCAGAAGAAGGA ( ) GAGTCCGAGCAGAAGTAGGA ( )*( ) Hsu et al, Nat Biotech 2013
21 Total guide score Calculate score across all offtargets unclear how a sequence is chosen as potential offtarget, but uses bowtie2 Score normalized to 100 Many high-scoring off-targets decreases score Off-targets in exons are noted but not extra penalized Reports exon hits, not introns, UTRs, etc.
22 We can get fancier ~28,000 sgrnas targeting human CD33 all on-target all 1-base bulges all 1-base mismatches 65 on-targets w/ phenotype ~10,000 variants CFD Cutting Frequency Determination Doench et al, Nat Biotech 2016
23 How well off-target prediction work? Shengdar et al. Nat Biotech 2014 Both on- and off-target prediction are still in their infancy Plenty of room for improvement What s wrong?
24 Just finding OTs makes a big difference??!! bowtie2 (MIT website) has trouble finding all potential offtargets to score Use Cas-OFFinder to find offtargets, score with Hsu-Zhang ( score vs website ) Haeussler et al Genome Biology 2016 Doench et al, Nat Biotech 2016
25 Some easily accessible tools
26 crispr.mit input bp
27 crispr.mit output
28 Other tools CHOPCHOP Heuristic ( arbitary ) scoring Not necessarily worse than trained! Allows any length sequence Easy T7E1 primer design
29 CHOPCHOP input
30 CHOPCHOP output
31 CCTop input
32 CCTop Output Super fast Good enough(?)
33 Cas-OFFinder input Takes GUIDES as input Very slow Very methodical
34 There are a lot of tools out there
35 Benchling or DesktopGenetics demo as meta guide designers
36 Questions?
37 crispr.mit.edu Based on Hsu et al., Nat Biotech 2013 Find all sgrnas in a nt sequence What are likely off-targets? Supports many organisms human, mouse, zfish, worm, fly, etc Nice graphical interface We ll talk more about this in detail
38 CHOPCHOP described in Montague et al. NAR 2014
Improving CRISPR-Cas9 Gene Knockout with a Validated Guide RNA Algorithm
Improving CRISPR-Cas9 Gene Knockout with a Validated Guide RNA Algorithm Anja Smith Director R&D Dharmacon, part of GE Healthcare Imagination at work crrna:tracrrna program Cas9 nuclease Active crrna is
More informationCRISPR Design Considera1ons
CRISPR Design Considera1ons Shifra Ben-Dor Bioinforma1cs Unit Life Sciences Core Facili1es Weizmann Ins1tute of Science June 2018 Jinek et al, elife 2013;2:e00471 Cradick et al Outline Define your
More informationSupplementary Figure 1
number of cells, normalized number of cells, normalized number of cells, normalized Supplementary Figure CD CD53 Cd3e fluorescence intensity fluorescence intensity fluorescence intensity Supplementary
More informationReviewers' Comments: Reviewer #1 (Remarks to the Author)
Reviewers' Comments: Reviewer #1 (Remarks to the Author) In this study, Rosenbluh et al reported direct comparison of two screening approaches: one is genome editing-based method using CRISPR-Cas9 (cutting,
More informationPooled CRISPR guide RNA libraries for functional genomics screening: Do you know what s in your library?
Pooled CRISPR guide RNA libraries for functional genomics screening: Do you know what s in your library? Peter Sheffield R&D Scientific Program Director Agilent Technologies INTRODUCTION CRISPR: A Programmable
More informationC3BI. VARIANTS CALLING November Pierre Lechat Stéphane Descorps-Declère
C3BI VARIANTS CALLING November 2016 Pierre Lechat Stéphane Descorps-Declère General Workflow (GATK) software websites software bwa picard samtools GATK IGV tablet vcftools website http://bio-bwa.sourceforge.net/
More informationGeneration of App knock-in mice reveals deletion mutations protective against Alzheimer s. disease-like pathology. Nagata et al.
Generation of App knock-in mice reveals deletion mutations protective against Alzheimer s disease-like pathology Nagata et al. Supplementary Fig 1. Previous App knock-in model did not show Aβ accumulation
More informationYou use the UCSC Genome Browser (www.genome.ucsc.edu) to assess the exonintron structure of each gene. You use four tracks to show each gene:
CRISPR-Cas9 genome editing Part 1: You would like to rapidly generate two different knockout mice using CRISPR-Cas9. The genes to be knocked out are Pcsk9 and Apoc3, both involved in lipid metabolism.
More informationCollect, analyze and synthesize. Annotation. Annotation for D. virilis. Evidence Based Annotation. GEP goals: Evidence for Gene Models 08/22/2017
Annotation Annotation for D. virilis Chris Shaffer July 2012 l Big Picture of annotation and then one practical example l This technique may not be the best with other projects (e.g. corn, bacteria) l
More informationCollect, analyze and synthesize. Annotation. Annotation for D. virilis. GEP goals: Evidence Based Annotation. Evidence for Gene Models 12/26/2018
Annotation Annotation for D. virilis Chris Shaffer July 2012 l Big Picture of annotation and then one practical example l This technique may not be the best with other projects (e.g. corn, bacteria) l
More informationDesigning CRISPR mediated Gene disruptions with gblocks Gene Fragments
Designing CRISPR mediated Gene disruptions with gblocks Gene Fragments Adam Clore, PhD Manager, Synthetic Biology Design Integrated DNA technology Typical CRISPR timeline in Mammalian cell lines Design
More informationAnalysis of neo-antigens to identify T-cell neo-epitopes in human Head & Neck cancer. Project XX1001. Customer Detail
Analysis of neo-antigens to identify T-cell neo-epitopes in human Head & Neck cancer Project XX Customer Detail Table of Contents. Bioinformatics analysis pipeline...3.. Read quality check. 3.2. Read alignment...3.3.
More informationSupplementary Materials. China
Supplementary Materials An Efficient Genotyping Method for Genome-modified Animals and Human Cells Generated with CRISPR/Cas9 System Xiaoxiao Zhu 1,2 *, Yajie Xu 1 *, Shanshan Yu 1 *, Lu Lu 1,2, Mingqin
More informationGalaxy Platform For NGS Data Analyses
Galaxy Platform For NGS Data Analyses Weihong Yan wyan@chem.ucla.edu Collaboratory Web Site http://qcb.ucla.edu/collaboratory http://collaboratory.lifesci.ucla.edu Workshop Outline ü Day 1 UCLA galaxy
More informationThe first thing you will see is the opening page. SeqMonk scans your copy and make sure everything is in order, indicated by the green check marks.
Open Seqmonk Launch SeqMonk The first thing you will see is the opening page. SeqMonk scans your copy and make sure everything is in order, indicated by the green check marks. SeqMonk Analysis Page 1 Create
More informationIntroduction to RNA-Seq in GeneSpring NGS Software
Introduction to RNA-Seq in GeneSpring NGS Software Dipa Roy Choudhury, Ph.D. Strand Scientific Intelligence and Agilent Technologies Learn more at www.genespring.com Introduction to RNA-Seq In a few years,
More informationTECHNOLOGIES. 3/19/18 Kayla Nygaard. https://i.ytimg.com/vi/pxw5yya-kh0/maxresdefault.jpg
TECHNOLOGIES 3/19/18 Kayla Nygaard https://i.ytimg.com/vi/pxw5yya-kh0/maxresdefault.jpg CRISPR IN THE NEWS CRISPR in 2018: Coming to a Human Near You Sickle-cell treatment clinical trials CRISPR Therapeutics
More informationIntroduction to CRISPR/Cas9 Background DNA Cleavage and Repair (NHEJ and HDR) Alternative Cas9 Variants Delivery of Cas9 and sgrna Library Products
Introduction to CRISPR/Cas9 Background DNA Cleavage and Repair (NHEJ and HDR) Alternative Cas9 Variants Delivery of Cas9 and sgrna Library Products which one is right for you? CRISPR Workflow abm s Toolbox
More informationSNP calling and VCF format
SNP calling and VCF format Laurent Falquet, Oct 12 SNP? What is this? A type of genetic variation, among others: Family of Single Nucleotide Aberrations Single Nucleotide Polymorphisms (SNPs) Single Nucleotide
More informationCRISPR 101: Optimizing Your Gene Editing Experiments
CRISPR 101: Optimizing Your Gene Editing Experiments PRESENTER Michele Auldridge, Ph.D. Senior Scientist, R&D MODERATOR Beth Frey Product Manager Agenda 1 Technology Overview 2 Experimental Decisions 3
More informationA Guide to CRISPR/Cas9
Genome editing and beyond freepik A Guide to CRISPR/Cas9 The latest advance in genomic DNA editing is the Clustered Regularly Interspaced Short Palindromic Repeat (CRISPR)/Cas9 system. This simple-touse
More informationCRISPR GENOMIC SERVICES PRODUCT CATALOG
CRISPR GENOMIC SERVICES PRODUCT CATALOG DESIGN BUILD ANALYZE The experts at Desktop Genetics can help you design, prepare and manufacture all of the components needed for your CRISPR screen. We provide
More informationAccelerating Genomic Computations 1000X with Hardware
Accelerating Genomic Computations 1000X with Hardware Yatish Turakhia EE PhD candidate Stanford University Prof. Bill Dally (Electrical Engineering and Computer Science) Prof. Gill Bejerano (Computer Science,
More informationChIP-seq analysis 2/28/2018
ChIP-seq analysis 2/28/2018 Acknowledgements Much of the content of this lecture is from: Furey (2012) ChIP-seq and beyond Park (2009) ChIP-seq advantages + challenges Landt et al. (2012) ChIP-seq guidelines
More informationConstruct Design and Cloning Guide for Cas9-triggered homologous recombination
Construct Design and Cloning Guide for Cas9-triggered homologous recombination Written by Dan Dickinson (ddickins@live.unc.edu) and last updated December 2013. Reference: Dickinson DJ, Ward JD, Reiner
More informationHow to deal with your RNA-seq data?
How to deal with your RNA-seq data? Rachel Legendre, Thibault Dayris, Adrien Pain, Claire Toffano-Nioche, Hugo Varet École de bioinformatique AVIESAN-IFB 2017 1 Rachel Legendre Bioinformatics 27/11/2018
More informationUAB DNA-Seq Analysis Workshop. John Osborne Research Associate Centers for Clinical and Translational Science
+ UAB DNA-Seq Analysis Workshop John Osborne Research Associate Centers for Clinical and Translational Science ozborn@uab.,edu + Thanks in advance You are the Guinea pigs for this workshop! At this point
More informationMapping Next Generation Sequence Reads. Bingbing Yuan Dec. 2, 2010
Mapping Next Generation Sequence Reads Bingbing Yuan Dec. 2, 2010 1 What happen if reads are not mapped properly? Some data won t be used, thus fewer reads would be aligned. Reads are mapped to the wrong
More informationQuantifying gene expression
Quantifying gene expression Genome GTF (annotation)? Sequence reads FASTQ FASTQ (+reference transcriptome index) Quality control FASTQ Alignment to Genome: HISAT2, STAR (+reference genome index) (known
More informationArray Informatics. Mark Gerstein
1 Lectures.GersteinLab.org (c) Array Informatics Mark Gerstein CEGS Informatics Developing Tools and Technical Analyses Related to Genome Technologies Main Genome Technologies Tiling Arrays Next Generation
More informationEnsembl Tools. EBI is an Outstation of the European Molecular Biology Laboratory.
Ensembl Tools EBI is an Outstation of the European Molecular Biology Laboratory. Questions? We ve muted all the mics Ask questions in the Chat box in the webinar interface I will check the Chat box periodically
More informationNature Biotechnology: doi: /nbt Supplementary Figure 1. In vitro validation of OTC sgrnas and donor template.
Supplementary Figure 1 In vitro validation of OTC sgrnas and donor template. (a) In vitro validation of sgrnas targeted to OTC in the MC57G mouse cell line by transient transfection followed by 4-day puromycin
More informationSystematic comparison of CRISPR/Cas9 and RNAi screens for essential genes
CORRECTION NOTICE Nat. Biotechnol. doi:10.1038/nbt. 3567 Systematic comparison of CRISPR/Cas9 and RNAi screens for essential genes David W Morgens, Richard M Deans, Amy Li & Michael C Bassik In the version
More informationSuccessful gene expression studies using validated qpcr assays. Jan Hellemans, CEO Biogazelle webinar October 28 th, 2015
Successful gene expression studies using validated qpcr assays Jan Hellemans, CEO Biogazelle webinar October 28 th, 2015 Agenda Requirements for high quality qpcr assays Approaches for qpcr assay validation
More informationBasics of RNA-Seq. (With a Focus on Application to Single Cell RNA-Seq) Michael Kelly, PhD Team Lead, NCI Single Cell Analysis Facility
2018 ABRF Meeting Satellite Workshop 4 Bridging the Gap: Isolation to Translation (Single Cell RNA-Seq) Sunday, April 22 Basics of RNA-Seq (With a Focus on Application to Single Cell RNA-Seq) Michael Kelly,
More informationCRISPRseek Workshop Design of target-specific guide RNAs in CRISPR-Cas9 genome-editing systems
April 2008 CRISPRseek Workshop Design of target-specific guide RNAs in CRISPR-Cas9 genome-editing systems Lihua Julie Zhu August 1st 2014 Outline Background and Motives CRISPRseek Functionality Dependency
More informationOptimized, chemically-modified crrna:tracrrna complexes for CRISPR gene editing
Optimized, chemically-modified crrna:tracrrna complexes for CRISPR gene editing Mark Behlke MD, PhD Chief Scientific Officer February 24, 2016 1 Implementing CRISPR/Cas9 gene editing 2 To focus on RNA
More informationDatabase Searching and BLAST Dannie Durand
Computational Genomics and Molecular Biology, Fall 2013 1 Database Searching and BLAST Dannie Durand Tuesday, October 8th Review: Karlin-Altschul Statistics Recall that a Maximal Segment Pair (MSP) is
More informationQuestion 2: There are 5 retroelements (2 LINEs and 3 LTRs), 6 unclassified elements (XDMR and XDMR_DM), and 7 satellite sequences.
Bio4342 Exercise 1 Answers: Detecting and Interpreting Genetic Homology (Answers prepared by Wilson Leung) Question 1: Low complexity DNA can be described as sequences that consist primarily of one or
More informationMapping strategies for sequence reads
Mapping strategies for sequence reads Ernest Turro University of Cambridge 21 Oct 2013 Quantification A basic aim in genomics is working out the contents of a biological sample. 1. What distinct elements
More informationBarcode Sequence Alignment and Statistical Analysis (Barcas) tool
Barcode Sequence Alignment and Statistical Analysis (Barcas) tool 2016.10.05 Mun, Jihyeob and Kim, Seon-Young Korea Research Institute of Bioscience and Biotechnology Barcode-Sequencing Ø Genome-wide screening
More informationBio5488 Practice Midterm (2018) 1. Next-gen sequencing
1. Next-gen sequencing 1. You have found a new strain of yeast that makes fantastic wine. You d like to sequence this strain to ascertain the differences from S. cerevisiae. To accurately call a base pair,
More informationdomain. Bottom panel: hybrid surface/ribbon structure (PDB ID: 4UN3) of SpCas9 in complex with sgrna and target DNA. The REC3
Supplementary Figure 1 Yeast screening for high-specificity SpCas9 variants (a) Top panel: scheme of SpCas9 domains. The REC3 domain is part of the recognition lobe. BH: bridge helix. PI: PAM-interacting
More informationLecture 5: Regulation
Machine Learning in Computational Biology CSC 2431 Lecture 5: Regulation Instructor: Anna Goldenberg Central Dogma of Biology Transcription DNA RNA protein Process of producing RNA from DNA Constitutive
More informationFrequently Asked Questions
The DNA Files: Workshops and Activities The DNA Files workshops are an outreach component of The DNA Files public radio documentary series produced by SoundVision Productions with funding from the National
More informationApplication of NGS (next-generation sequencing) for studying RNA regulation
Application of NGS (next-generation sequencing) for studying RNA regulation SAIHST, SKKU Sung Wook Chi In this lecturre Intro: Sequencing Technology NGS (Next-Generation Sequencing) Sequencing of RNAs
More informationSequence Annotation & Designing Gene-specific qpcr Primers (computational)
James Madison University From the SelectedWorks of Ray Enke Ph.D. Fall October 31, 2016 Sequence Annotation & Designing Gene-specific qpcr Primers (computational) Raymond A Enke This work is licensed under
More informationIntroduction to Short Read Alignment. UCD Genome Center Bioinformatics Core Tuesday 14 June 2016
Introduction to Short Read Alignment UCD Genome Center Bioinformatics Core Tuesday 14 June 2016 From reads to molecules Why align? Individual A Individual B ATGATAGCATCGTCGGGTGTCTGCTCAATAATAGTGCCGTATCATGCTGGTGTTATAATCGCCGCATGACATGATCAATGG
More informationTranscriptome analysis
Statistical Bioinformatics: Transcriptome analysis Stefan Seemann seemann@rth.dk University of Copenhagen April 11th 2018 Outline: a) How to assess the quality of sequencing reads? b) How to normalize
More informationSystematic evaluation of spliced alignment programs for RNA- seq data
Systematic evaluation of spliced alignment programs for RNA- seq data Pär G. Engström, Tamara Steijger, Botond Sipos, Gregory R. Grant, André Kahles, RGASP Consortium, Gunnar Rätsch, Nick Goldman, Tim
More informationAnnotating your variants: Ensembl Variant Effect Predictor (VEP) Helen Sparrow Ensembl EMBL-EBI 2nd November 2016
Training materials Ensembl training materials are protected by a CC BY license http://creativecommons.org/licenses/by/4.0/ If you wish to re-use these materials, please credit Ensembl for their creation
More informationGenome Engineering with ZFNs, TALENs and CRISPR/Cas9
Genome Engineering with ZFNs, TALENs and CRISPR/Cas9 Designer Endonucleases ZFNs (zinc finger nucleases), TALENs (transcription activator-like effector nucleases) and CRISPR/Cas9 (clustered regularly interspaced
More informationApplying CRISPR in Environmental Health Research: From Cells to Human Populations Luoping Zhang
NASEM Workshop: The Promise of Genome Editing Tools in EHR Washington, D.C. January 10-11, 2018 Applying CRISPR in Environmental Health Research: From Cells to Human Populations Luoping Zhang School of
More informationCRISPR RNA-guided activation of endogenous human genes
CRISPR RNA-guided activation of endogenous human genes Morgan L Maeder, Samantha J Linder, Vincent M Cascio, Yanfang Fu, Quan H Ho, J Keith Joung Supplementary Figure 1 Comparison of VEGF activation induced
More informationGenome annotation & EST
Genome annotation & EST What is genome annotation? The process of taking the raw DNA sequence produced by the genome sequence projects and adding the layers of analysis and interpretation necessary
More informationA Brief History. Bootstrapping. Bagging. Boosting (Schapire 1989) Adaboost (Schapire 1995)
A Brief History Bootstrapping Bagging Boosting (Schapire 1989) Adaboost (Schapire 1995) What s So Good About Adaboost Improves classification accuracy Can be used with many different classifiers Commonly
More informationSUPPLEMENTAL MATERIALS
SUPPLEMENL MERILS Eh-seq: RISPR epitope tagging hip-seq of DN-binding proteins Daniel Savic, E. hristopher Partridge, Kimberly M. Newberry, Sophia. Smith, Sarah K. Meadows, rian S. Roberts, Mark Mackiewicz,
More informationFigure S4 A-H : Initiation site properties and evolutionary changes
A 0.3 Figure S4 A-H : Initiation site properties and evolutionary changes G-correction not used 0.25 Fraction of total counts 0.2 0.5 0. tag 2 tags 3 tags 4 tags 5 tags 6 tags 7tags 8tags 9 tags >9 tags
More informationIntroduction to BIOINFORMATICS
COURSE OF BIOINFORMATICS a.a. 2016-2017 Introduction to BIOINFORMATICS What is Bioinformatics? (I) The sinergy between biology and informatics What is Bioinformatics? (II) From: http://www.bioteach.ubc.ca/bioinfo2010/
More informationParts of a standard FastQC report
FastQC FastQC, written by Simon Andrews of Babraham Bioinformatics, is a very popular tool used to provide an overview of basic quality control metrics for raw next generation sequencing data. There are
More informationSequence Based Function Annotation
Sequence Based Function Annotation Qi Sun Bioinformatics Facility Biotechnology Resource Center Cornell University Sequence Based Function Annotation 1. Given a sequence, how to predict its biological
More informationTranscriptomics analysis with RNA seq: an overview Frederik Coppens
Transcriptomics analysis with RNA seq: an overview Frederik Coppens Platforms Applications Analysis Quantification RNA content Platforms Platforms Short (few hundred bases) Long reads (multiple kilobases)
More informationChIP-seq and RNA-seq. Farhat Habib
ChIP-seq and RNA-seq Farhat Habib fhabib@iiserpune.ac.in Biological Goals Learn how genomes encode the diverse patterns of gene expression that define each cell type and state. Protein-DNA interactions
More informationGenome 373: Mapping Short Sequence Reads II. Doug Fowler
Genome 373: Mapping Short Sequence Reads II Doug Fowler The final Will be in this room on June 6 th at 8:30a Will be focused on the second half of the course, but will include material from the first half
More informationNature Genetics: doi: /ng Supplementary Figure 1
Supplementary Figure 1 Ihh interacts preferentially with its upstream neighboring gene Nhej1. Genes are indicated by gray lines, and Ihh and Nhej1 are highlighted in blue. 4C seq performed in E14.5 limbs
More informationRNA-Seq with the Tuxedo Suite
RNA-Seq with the Tuxedo Suite Monica Britton, Ph.D. Sr. Bioinformatics Analyst September 2015 Workshop The Basic Tuxedo Suite References Trapnell C, et al. 2009 TopHat: discovering splice junctions with
More informationPrioritization: from vcf to finding the causative gene
Prioritization: from vcf to finding the causative gene vcf file making sense A vcf file from an exome sequencing project may easily contain 40-50 thousand variants. In order to optimize the search for
More informationSequence Analysis. II: Sequence Patterns and Matrices. George Bell, Ph.D. WIBR Bioinformatics and Research Computing
Sequence Analysis II: Sequence Patterns and Matrices George Bell, Ph.D. WIBR Bioinformatics and Research Computing Sequence Patterns and Matrices Multiple sequence alignments Sequence patterns Sequence
More informationmicrorna Shifra Ben-Dor March 2010
microrna Shifra Ben-Dor March 2010 Outline Biology of mirna Prediction of mirna mirna Databases Prediction of mirna Targets micrornas (mirna) Naturally expressed small RNAs Involved in regulation of target
More informationNature Methods: doi: /nmeth Supplementary Figure 1
Supplementary Figure 1 ATP1A1 variants with in-frame deletions are enriched in ouabain-resistant cell populations. (a) Total editing efficacy along with spectrum and frequency of individual indels as determined
More informationBacterial Genome Annotation
Bacterial Genome Annotation Bacterial Genome Annotation For an annotation you want to predict from the sequence, all of... protein-coding genes their stop-start the resulting protein the function the control
More informationISU CBC CRISPR Workshop. Michael Collingwood & Mollie Schubert
ISU CBC CRISPR Workshop Michael Collingwood & Mollie Schubert 1 Genome editing (Cas12a) 2 Alt-R CRISPR System complete workflow Design Cut Repair Analyze Cas9/Cas12a + Guide RNA (grna) 3 CRISPR-Cas9 genome
More informationWeek 1 BCHM 6280 Tutorial: Gene specific information using NCBI, Ensembl and genome viewers
Week 1 BCHM 6280 Tutorial: Gene specific information using NCBI, Ensembl and genome viewers Web resources: NCBI database: http://www.ncbi.nlm.nih.gov/ Ensembl database: http://useast.ensembl.org/index.html
More informationAnnotation Walkthrough Workshop BIO 173/273 Genomics and Bioinformatics Spring 2013 Developed by Justin R. DiAngelo at Hofstra University
Annotation Walkthrough Workshop NAME: BIO 173/273 Genomics and Bioinformatics Spring 2013 Developed by Justin R. DiAngelo at Hofstra University A Simple Annotation Exercise Adapted from: Alexis Nagengast,
More informationNon-coding Function & Variation, MPRAs II. Mike White Bio /5/18
Non-coding Function & Variation, MPRAs II Mike White Bio 5488 3/5/18 MPRA Review Problem 1: Where does your CRE DNA come from? DNA synthesis Genomic fragments Targeted regulome capture Problem 2: How do
More informationIntroduction to RNA-Seq. David Wood Winter School in Mathematics and Computational Biology July 1, 2013
Introduction to RNA-Seq David Wood Winter School in Mathematics and Computational Biology July 1, 2013 Abundance RNA is... Diverse Dynamic Central DNA rrna Epigenetics trna RNA mrna Time Protein Abundance
More informationSupplementary Figure 1 Activities of ABEs using extended sgrnas in HEK293T cells.
Supplementary Figure 1 Activities of ABEs using extended sgrnas in HEK293T cells. Base editing efficiencies of ABEs with extended sgrnas at Site 18 (a), Site 19 (b), the HBB-E2 site (c), and the HBB-E3
More informationSelective constraints on noncoding DNA of mammals. Peter Keightley Institute of Evolutionary Biology University of Edinburgh
Selective constraints on noncoding DNA of mammals Peter Keightley Institute of Evolutionary Biology University of Edinburgh Most mammalian noncoding DNA evolves rapidly Homo-Pan Divergence (%) 1.5 1.25
More informationTheoretical cloning project
Theoretical cloning project Needed to get credits Make it up yourself, don't copy Possible to do in groups of 2-4 students If you need help or an idea, ask! If you have no idea what to clone, I can give
More informationGenome edi3ng with the CRISPR-Cas9 system
CRISPR-Cas9 Genome Edi3ng Bootcamp AHA Council on Func3onal Genomics and Transla3onal Biology Narrated video link: hfps://youtu.be/h18hmftybnq Genome edi3ng with the CRISPR-Cas9 system Kiran Musunuru,
More informationNext-Generation Sequencing. Technologies
Next-Generation Next-Generation Sequencing Technologies Sequencing Technologies Nicholas E. Navin, Ph.D. MD Anderson Cancer Center Dept. Genetics Dept. Bioinformatics Introduction to Bioinformatics GS011062
More informationBCHM 6280 Tutorial: Gene specific information using NCBI, Ensembl and genome viewers
BCHM 6280 Tutorial: Gene specific information using NCBI, Ensembl and genome viewers Web resources: NCBI database: http://www.ncbi.nlm.nih.gov/ Ensembl database: http://useast.ensembl.org/index.html UCSC
More informationIn silico variant analysis: Challenges and Pitfalls
In silico variant analysis: Challenges and Pitfalls Fiona Cunningham Variation annotation coordinator EMBL-EBI www.ensembl.org Sequencing -> Variants -> Interpretation Structural variants SNP? In-dels
More informationNature Biotechnology: doi: /nbt Supplementary Figure 1
Supplementary Figure 1 Negative selection analysis of sgrnas targeting all Brd4 exons comparing day 2 to day 10 time points. Systematic evaluation of 64 Brd4 sgrnas in negative selection experiments, targeting
More informationExperimental Design. Sequencing. Data Quality Control. Read mapping. Differential Expression analysis
-Seq Analysis Quality Control checks Reproducibility Reliability -seq vs Microarray Higher sensitivity and dynamic range Lower technical variation Available for all species Novel transcript identification
More informationMachine Learning. HMM applications in computational biology
10-601 Machine Learning HMM applications in computational biology Central dogma DNA CCTGAGCCAACTATTGATGAA transcription mrna CCUGAGCCAACUAUUGAUGAA translation Protein PEPTIDE 2 Biological data is rapidly
More informationHigh-Throughput Bioinformatics: Re-sequencing and de novo assembly. Elena Czeizler
High-Throughput Bioinformatics: Re-sequencing and de novo assembly Elena Czeizler 13.11.2015 Sequencing data Current sequencing technologies produce large amounts of data: short reads The outputted sequences
More informationRNA-Seq Software, Tools, and Workflows
RNA-Seq Software, Tools, and Workflows Monica Britton, Ph.D. Sr. Bioinformatics Analyst September 1, 2016 Some mrna-seq Applications Differential gene expression analysis Transcriptional profiling Assumption:
More informationChIP-seq data analysis with Chipster. Eija Korpelainen CSC IT Center for Science, Finland
ChIP-seq data analysis with Chipster Eija Korpelainen CSC IT Center for Science, Finland chipster@csc.fi What will I learn? Short introduction to ChIP-seq Analyzing ChIP-seq data Central concepts Analysis
More informationmeasuring gene expression December 5, 2017
measuring gene expression December 5, 2017 transcription a usually short-lived RNA copy of the DNA is created through transcription RNA is exported to the cytoplasm to encode proteins some types of RNA
More informationShort Read Alignment to a Reference Genome
Short Read Alignment to a Reference Genome Shamith Samarajiwa CRUK Summer School in Bioinformatics Cambridge, September 2018 Aligning to a reference genome BWA Bowtie2 STAR GEM Pseudo Aligners for RNA-seq
More informationTesting Non-Transgenic CRISPR Technology for Wheat Improvement 13 TH IWGS - TULLN, AUSTRIA
Testing Non-Transgenic CRISPR Technology for Wheat Improvement KALI M BRANDT, HILARY L GUNN, BRETT L BUSCHKE, ADAM HEESACKER, NATHALIA MORET TI, ALEXANDER KARASEV, ROBERT S ZEMETRA 13 TH IWGS - TULLN,
More informationBioinformatics in next generation sequencing projects
Bioinformatics in next generation sequencing projects Rickard Sandberg Assistant Professor Department of Cell and Molecular Biology Karolinska Institutet May 2013 Standard sequence library generation Illumina
More informationGenerating stable cell lines expressing various Cas9 proteins from AAVS1 locus
IGI Protocol prepared by Chong Park 1 Generating stable cell lines expressing various Cas9 proteins from AAVS1 locus This is modified from original protocol from the Conklin laboratory (Mandegar et al.
More informationIntroduction. CS482/682 Computational Techniques in Biological Sequence Analysis
Introduction CS482/682 Computational Techniques in Biological Sequence Analysis Outline Course logistics A few example problems Course staff Instructor: Bin Ma (DC 3345, http://www.cs.uwaterloo.ca/~binma)
More informationOptimisation and Operations Research
Optimisation and Operations Research Lecture 17: Genetic Algorithms and Evolutionary Computing Matthew Roughan http://www.maths.adelaide.edu.au/matthew.roughan/ Lecture_notes/OORII/
More informationFigure S1: NUN preparation yields nascent, unadenylated RNA with a different profile from Total RNA.
Summary of Supplemental Information Figure S1: NUN preparation yields nascent, unadenylated RNA with a different profile from Total RNA. Figure S2: rrna removal procedure is effective for clearing out
More informationChIP-seq and RNA-seq
ChIP-seq and RNA-seq Biological Goals Learn how genomes encode the diverse patterns of gene expression that define each cell type and state. Protein-DNA interactions (ChIPchromatin immunoprecipitation)
More informationModel Selection, Evaluation, Diagnosis
Model Selection, Evaluation, Diagnosis INFO-4604, Applied Machine Learning University of Colorado Boulder October 31 November 2, 2017 Prof. Michael Paul Today How do you estimate how well your classifier
More informationTransfection of CRISPR/Cas9 Nuclease NLS ribonucleoprotein (RNP) into adherent mammalian cells using Lipofectamine RNAiMAX
Transfection of CRISPR/Cas9 Nuclease NLS ribonucleoprotein (RNP) into adherent mammalian cells using Lipofectamine RNAiMAX INTRODUCTION The CRISPR/Cas genome editing system consists of a single guide RNA
More information