Statistical Genomics and Bioinformatics Workshop. Genetic Association and RNA-Seq Studies
|
|
- Edmund Anthony
- 5 years ago
- Views:
Transcription
1 Statistical Genomics and Bioinformatics Workshop: Genetic Association and RNA-Seq Studies RNA Seq and Differential Expression Analysis Brooke L. Fridley, PhD University of Kansas Medical Center 1 Next-generation gap 2 1
2 Why sequence RNA (versus DNA)? Functional studies Genome may be constant but an experimental condition has a pronounced effect on gene expression Some molecular features can only be observed at the RNA level Alternative isoforms, fusion transcripts, RNA editing Predicting transcript sequence from genome sequence is difficult Alternative splicing, RNA editing, etc. 3 Why sequence RNA (versus DNA)? Interpreting mutations that do not have an obvious effect on protein sequence Regulatory mutations that affect what mrna isoform is expressed and how much e.g. splice sites, promoters, TFBS Prioritizing protein coding somatic mutations If the gene is not expressed, a mutation in that gene would be less interesting If the gene is expressed but only from the wild type allele, this might suggest loss-of-function If the mutant allele itself is expressed, this might suggest a candidate drug target 4 2
3 Challenges to RNA Studies Sample Purity?, quantity?, quality? RNAs consist of small exons that may be separated by large introns Mapping reads to genome is challenging The relative abundance of RNAs vary wildly orders of magnitude Since RNA sequencing works by random sampling, a small fraction of highly expressed genes may consume the majority of reads RNAs come in a wide range of sizes Small RNAs must be captured separately PolyA selection of large RNAs may result in 3 end bias RNA is fragile compared to DNA (easily degraded) 5 The evolution of transcriptomics Hybridization-based RNA seq is still a technology under active development 1995 P. Brown, et. al. Gene expression profiling using spotted cdna microarray: expression levels of known genes 2002 Affymetrix, whole genome expression profiling using tiling array: identifying and profiling novel genes and splicing variants 2008 many groups, mrna seq: direct sequencing of mrnas using next generation sequencing techniques (NGS) 6 3
4 RNA-Seq vs Microarrays General expression profiling Novel genes Alternative splicing Detect gene fusion Can use on any sequenced genome Better dynamic range Cleaner and more informative data Data analysis challenges 7 Advantages of RNA-Seq compared with other transcriptomics methods 8 4
5 Goals of RNA-seq Study Catalogue all species of transcript including: mrnas, non-coding RNAs and small RNAs Determine the transcriptional structure of genes in terms of: Start sites 5 and 3 ends Splicing patterns / novel isoforms Other post-transcriptional modifications Quantification of expression levels and comparison (different conditions, tissues, etc.) Gene and exon level Determine Allelic expression Gene Fusion Transcriptome for non-model organisms 9 Computation for ChIP seq and RNA seq studies Shirley Pepke, Barbara Wold & Ali Mortazavi Nature Methods 6, S22 S32 (2009) Published online: 15 October
6 Coding RNAs, genes Non coding RNAs Ribosomal RNA Types of RNAs Type Size Function microrna (mirna) nt regulation of gene expression small interfering RNA (sirna) nt antiviral mechanisms piwi interacting RNA (pirna) nt interaction with piwi proteins/spermatogenesis small nuclear RNA (snrna) nt RNA splicing small nucleolar RNA (snorna) modification of other RNAs 11 RNA Pre mrna intron splicing exon mature mrna AAAAAAAAAAAAAAA A Poly A tail 12 6
7 Alternative Splicing 13 Next-Gen Sequencing (NGS) Long RNAs are first converted into a library of cdna fragments through either: RNA fragmentation or DNA fragmentation 14 7
8 In contrast to small RNAs (like mirnas) larger RNA must be fragmented RNA fragmentation or cdna fragmentation (different techniques) Types of bias: RNA: depletion for ends cdna: biased 5 end Zhong Wang, Mark Gerstein & Michael Snyder. RNA Seq: a revolutionary tool for transcriptomics. Nature Reviews Genetics 10,57 63 (January 2009) 15 Sequencing adaptors (blue) are added to each cdna fragment and a short sequence is obtained from each cdna using high-throughput sequencing Technology Typical read length: bp depending on technology 16 8
9 Reads are aligned with the reference genome or transcriptome and classified as three types: exonic reads, junction reads and poly(a) end-reads. de novo assembly also possible for non-model organisms 17 These three types are used to generate a baseresolution expression profile for each gene Example: A yeast ORF with one intron 18 9
10 Un-replicated Experimental Design 1 biological replicate per treatment group Pros: Cheap, can be informative, prelim data Cons: Can only make inferences about the particular biological individuals, NOT the treatment groups Applications: Pilot studies (although can not assess variation), reference transcriptome assembly 19 Biological vs Technical Replicates Biological Replicates Multiple Unique Individuals/Samples Technical Replicates One Individual/Sample with some technical steps replicated Biological Variance > Technical Variance (typically) Biological replicates more useful as allows inferences about treatment 20 10
11 Sample Pooling Combining multiple samples / individuals / tissues during preparation into a single sample for assay. Pros: Necessary when not enough material per individual for assay. Cons: Measure variability between individuals is lost. A bad sample can bias pooled sample. 21 Multiplexing Each sample is indexed and combined into one pooled sample. Indexing allows one to identify reads for each subject Pros: Removes technical variation as source of confounding; cost effective Cons: Reduces depth or coverage per sample Lane Flow cell 22 11
12 Comparison of two designs for testing differential expression between treatments A and B. Treatment A is denoted by red tones and treatment B by blue tones. Auer P L, and Doerge R W Genetics 2010;185: Copyright 2010 by the Genetics Society of America Coverage Estimation Lander/Waterman equation for coverage C = LN / G C stands for coverage L is the read length N is the number of reads G is the haploid genome length verage_calculator.ilmn 12
13 What Coverage is needed? The amount of sequencing needed for a given sample is determined by the goals of the experiment and the nature of the RNA sample. More reads needed for alternative splicing / fusions Less reads needed for DE gene expression studies 15x is recommended for standard transcriptome studies Tarazone, et al (2011) Genome Research. Differential expression in RNA seq: A matter of depth 25 Coverage and Depth Number of detected genes (coverage) and costs increase with sequence depth (number of analyzed read) Calculation of coverage is less straightforward in transcriptome analysis (transcription activity varies) 26 13
14 ENCODE Standards DE_RNAseq_Standards_V1.0.pdf DE testing require only modest depths of sequencing: 30M pairend reads of length > 30NT, of which 20 25M are mappable to the genome or known transcriptome Experiments whose purpose is discovery of novel transcribed elements and strong quantification of known transcript isoforms requires more extensive sequencing. Experiments should be performed with two or more biological replicates, unless there is a compelling reason why this is impractical or wasteful. A biological replicate is defined as an independent growth of cells/tissue and subsequent analysis. Technical replicates not required, except to evaluate cases where biological variability is abnormally high. 27 Questions? 28 14
15 Example Workflow Brian T. Wilhelm, Josette Renée Landry. RNA Seq quantitative measurement of expression through massively parallel RNA sequencing. Methods Volume 48, Issue
16 Tuxedo Suite RNA-Seq Pipeline Sequencing Read alignment Transcript compilation Gene identification Differential expression RNA seq reads (2 x 100 bp) Bowtie/TopHat alignment (genome) Cufflinks Cufflinks (cuffcompare) Cuffdiff (A:B comparison) Raw sequence data (.fastq files) Reference genome (.fa file) Gene annotation (.gtf file) CummRbund Inputs Visualization 31 RNA-Seq - Bioinformatics challenges (I): Storing, retrieving and processing of large amounts of data Base calling Quality analysis for bases and reads => FastQ files Mapping/aligning RNA-Seq reads (Alternative: assemble contigs and align them to genome) Multiple alignment possible for some reads Sequencing errors and polymorphisms =>SAM/BAM files 16
17 RNA-Seq - Bioinformatics challenges (II): Exon junctions and poly(a) ends Identification of poly(a) -> long stretches of A(T) at end of reads Splice sites: Specific sequence context: CT AG dinucleotides Low expression for intronic regions Known or predicted splice sites Detection of new sites (e.g. via split read mapping) Overlapping genes RNA editing Secondary structure of transcripts Quantification of expression signals Mapping A multiread is a read that maps equally well to many reference sequences. read: AGTCGACTAGCTATTAGCATG 17
18 Read mapping vs. de novo assembly Options: Align and then assemble Assemble and then align Good reference No reference genome Align to: Genome transcriptome Haas and Zody, Nature Biotechnology 28, (2010) Genomic vs Transcript Mapping of Reads Exon A? Exon B???? Exon C????? Exon D Genome level mapping Transcript level mapping Exon A Exon B Exon C Exon D Mapping of reads at genome or transcript level 18
19 Advantages: Genomic Mapping Less likely to have multireads across different isoforms. One can get a sense of the coverage across exons. Disadvantages: It s a bit involved to estimate isoforms expression. Needs an (annotated) genome! (i.e. not great for non-model organisms) Transcriptome Mapping Advantages: Transcript-level expression Slightly easier to do. Disadvantages: Multiple isoforms can share an exon; can get multireads. Requires annotation to wrap to gene-level counts 19
20 RNA-Seq Quality Control Quality Control is Important to make sure experiment/data is valid Percentage of reads properly mapped / uniquely mapped 5 or 3 bias Per base sequence quality Per sequence GC content Sequence length Duplication levels Coverage (reads per base) Alignment of Reads Alignment using bowtie algorithm: Not more than 2 mismatches per read allowed Reads with multiple alignment discarded Read longer than 35 bp truncated to 35 bp Overlapping of alignment of reads with gene footprint from middle position of read 20
21 TopHat Pipeline: Splice Junctions Reads are mapped against a reference genome, and those reads that do not map are set aside. An initial consensus of mapped regions is computed by Maq. Sequences flanking potential donor/acceptor splice sites within neighboring regions are joined to form potential splice junctions. The initially unmapped (IUM) reads are indexed and aligned to these splice junction sequences. Dataset after Aligning Reads Gene Treatment 1 Treatment Total
22 Differential Expression (DE) Analysis We would like to test whether the proportion of reads aligning to gene 1 tends to be different for experimental units that received treatment 1 than for experimental units that received treatment out of out of out of vs. 13 out of out of out of Example 1: Need for Normalization Every gene in B is expressed in A at the same level. G = number of genes expressed in B A also contains a set of G genes that are expressed but not expressed in B. Thus, A has 2*G expressed genes and its RNA production is twice the size of sample B. Each sample is sequenced at approx. the same depth Without adjustment, a gene expressed in A and B will have ½ the number of reads as B, since the # reads is spread over twice as many genes. The normalization factor would be to adjust sample A by factor of 2. 22
23 Example 2: Need for Normalization Suppose you multiplex 4 samples to lane 1 and 2 samples to lane 2 of a flowcell. The 4 samples in lane 1 will have lower number of reads (counts) as compared to 2 samples in lane 2. Need to account for the total number of reads per sample (library size) Measure for expression: FPKM and RPKM Longer transcripts, more fragments/reads FPKM/RPKM measure average pair coverage per transcript FPKM: Fragments Per Kilobase per Million RPKM: Reads Per Kilobase per Million Paired-end RNA-Seq experiments produce two reads per fragment. Both reads may not be mappable. If we were to count reads rather than fragments, we might double-count some fragments but not others, leading to a skewed expression value. Thus, FPKM is calculated by counting fragments, not reads. If single-end experiment, FPKM = RPKM 23
24 RPKM RPKM: Reads Per Kilobase per Million mapped reads RPKM = C/(L x N) C: Number of mappable reads on a feature, such as an exon or transcript.. L: Length of feature (in kb) N: Total number of mappable reads (in millions) Distributions used for Modeling Count Data count of reads for subject on treatment for gene i = 1 N and g = 1,,G ~, 0! Observed in RNA-seq data that Over-dispersed Poisson (a.k.a. Negative Binomial or Poisson-Gamma Distribution) 24
25 Empirical Assessment of Over-Dispersion of NGS Counts Poisson Dist. Mean = Var Distributions used for Modeling Count Data Technical Variation follows a Poisson Distribution Biological Variation follows a Negative Binomial Distribution Let ~, library size for sample i. e., total number of reads dispersion parameter for gene represents the coefficient of variation of biological variation between the samples (able to separate biological from technical variation) =0 reduces to Poisson( ) and V (1+ ) DE is based on parameter Model used in edger 25
26 edger Package and Methods edger (Robinson, McCarthy, Smyth; 2010): models count data using an over-dispersed Poisson model estimates the gene-wise dispersions by conditional maximum likelihood, conditioning on the total count for that gene(smyth and Verbyla, 1996). Empirical Bayes (EB) procedure is used to shrink the dispersions towards a consensus value (i.e., borrowing information) (Robinson and Smyth, 2007). DE is assessed using an exact test analogous to Fisher s exact test, but adapted for over-dispersed data (Robinson and Smyth, 2008). Similar in idea to limma (Smyth, 2004) Other Methods/Programs for DE Analysis DESeq, BaySeq, Cuffdiff, SAMseq and many others Each has pros and cons with different assumptions. No single method will be optimal under all circumstances, and hence the method of choice in a particular situation depends on the experimental conditions. Suggest running more than one method to look for sensitivity of results. 26
27 Following DE analysis Visualization of results Volcano plot (FC vs p-value) Multiple Testing Correction FDR q-values Pathway Analysis IPA Network Analysis Validation Studies Technical validation of sequence data Confirm/replicate association results 27
28 Questions? 28
RNA-Sequencing analysis
RNA-Sequencing analysis Markus Kreuz 25. 04. 2012 Institut für Medizinische Informatik, Statistik und Epidemiologie Content: Biological background Overview transcriptomics RNA-Seq RNA-Seq technology Challenges
More informationSequence Analysis 2RNA-Seq
Sequence Analysis 2RNA-Seq Lecture 10 2/21/2018 Instructor : Kritika Karri kkarri@bu.edu Transcriptome Entire set of RNA transcripts in a given cell for a specific developmental stage or physiological
More informationIntroduction to RNA-Seq. David Wood Winter School in Mathematics and Computational Biology July 1, 2013
Introduction to RNA-Seq David Wood Winter School in Mathematics and Computational Biology July 1, 2013 Abundance RNA is... Diverse Dynamic Central DNA rrna Epigenetics trna RNA mrna Time Protein Abundance
More informationAnalysis of data from high-throughput molecular biology experiments Lecture 6 (F6, RNA-seq ),
Analysis of data from high-throughput molecular biology experiments Lecture 6 (F6, RNA-seq ), 2012-01-26 What is a gene What is a transcriptome History of gene expression assessment RNA-seq RNA-seq analysis
More informationTranscriptome analysis
Statistical Bioinformatics: Transcriptome analysis Stefan Seemann seemann@rth.dk University of Copenhagen April 11th 2018 Outline: a) How to assess the quality of sequencing reads? b) How to normalize
More informationRNA-Seq Analysis. Simon Andrews, Laura v
RNA-Seq Analysis Simon Andrews, Laura Biggins simon.andrews@babraham.ac.uk @simon_andrews v2018-10 RNA-Seq Libraries rrna depleted mrna Fragment u u u u NNNN Random prime + RT 2 nd strand synthesis (+
More informationIntroduction of RNA-Seq Analysis
Introduction of RNA-Seq Analysis Jiang Li, MS Bioinformatics System Engineer I Center for Quantitative Sciences(CQS) Vanderbilt University September 21, 2012 Goal of this talk 1. Act as a practical resource
More informationChIP-seq and RNA-seq. Farhat Habib
ChIP-seq and RNA-seq Farhat Habib fhabib@iiserpune.ac.in Biological Goals Learn how genomes encode the diverse patterns of gene expression that define each cell type and state. Protein-DNA interactions
More informationCanadian Bioinforma3cs Workshops
Canadian Bioinforma3cs Workshops www.bioinforma3cs.ca Module #: Title of Module 2 1 Module 3 Expression and Differen3al Expression (lecture) Obi Griffith & Malachi Griffith www.obigriffith.org ogriffit@genome.wustl.edu
More informationRNA
RNA sequencing Michael Inouye Baker Heart and Diabetes Institute Univ of Melbourne / Monash Univ Summer Institute in Statistical Genetics 2017 Integrative Genomics Module Seattle @minouye271 www.inouyelab.org
More informationAnalysis of RNA-seq Data. Bernard Pereira
Analysis of RNA-seq Data Bernard Pereira The many faces of RNA-seq Applications Discovery Find new transcripts Find transcript boundaries Find splice junctions Comparison Given samples from different experimental
More informationBackground Wikipedia Lee and Mahadavan, JCB, 2009 History (Platform Comparison) P Park, Nature Review Genetics, 2009 P Park, Nature Reviews Genetics, 2009 Rozowsky et al., Nature Biotechnology, 2009
More informationWheat CAP Gene Expression with RNA-Seq
Wheat CAP Gene Expression with RNA-Seq July 9 th -13 th, 2018 Overview of the workshop, Alina Akhunova http://www.ksre.k-state.edu/igenomics/workshops/ RNA-Seq Workshop Activities Lectures Laboratory Molecular
More informationChIP-seq and RNA-seq
ChIP-seq and RNA-seq Biological Goals Learn how genomes encode the diverse patterns of gene expression that define each cell type and state. Protein-DNA interactions (ChIPchromatin immunoprecipitation)
More informationExperimental Design. Sequencing. Data Quality Control. Read mapping. Differential Expression analysis
-Seq Analysis Quality Control checks Reproducibility Reliability -seq vs Microarray Higher sensitivity and dynamic range Lower technical variation Available for all species Novel transcript identification
More informationmeasuring gene expression December 5, 2017
measuring gene expression December 5, 2017 transcription a usually short-lived RNA copy of the DNA is created through transcription RNA is exported to the cytoplasm to encode proteins some types of RNA
More informationRNA-Seq with the Tuxedo Suite
RNA-Seq with the Tuxedo Suite Monica Britton, Ph.D. Sr. Bioinformatics Analyst September 2015 Workshop The Basic Tuxedo Suite References Trapnell C, et al. 2009 TopHat: discovering splice junctions with
More informationLong and short/small RNA-seq data analysis
Long and short/small RNA-seq data analysis GEF5, 4.9.2015 Sami Heikkinen, PhD, Dos. Topics 1. RNA-seq in a nutshell 2. Long vs short/small RNA-seq 3. Bioinformatic analysis work flows GEF5 / Heikkinen
More informationmeasuring gene expression December 11, 2018
measuring gene expression December 11, 2018 Intervening Sequences (introns): how does the cell get rid of them? Splicing!!! Highly conserved ribonucleoprotein complex recognizes intron/exon junctions and
More informationIntroduction to RNA-Seq
Introduction to RNA-Seq Monica Britton, Ph.D. Bioinformatics Analyst September 2014 Workshop Overview of Today s Activities Morning RNA-Seq Concepts, Terminology, and Work Flows Two-Condition Differential
More informationRNA-sequencing. Next Generation sequencing analysis Anne-Mette Bjerregaard. Center for biological sequence analysis (CBS)
RNA-sequencing Next Generation sequencing analysis 2016 Anne-Mette Bjerregaard Center for biological sequence analysis (CBS) Terms and definitions TRANSCRIPTOME The full set of RNA transcripts and their
More informationIntroduction to transcriptome analysis using High Throughput Sequencing technologies. D. Puthier 2012
Introduction to transcriptome analysis using High Throughput Sequencing technologies D. Puthier 2012 A typical RNA-Seq experiment Library construction Protocol variations Fragmentation methods RNA: nebulization,
More information1. Introduction Gene regulation Genomics and genome analyses
1. Introduction Gene regulation Genomics and genome analyses 2. Gene regulation tools and methods Regulatory sequences and motif discovery TF binding sites Databases 3. Technologies Microarrays Deep sequencing
More informationless sensitive than RNA-seq but more robust analysis pipelines expensive but quantitiatve standard but typically not high throughput
Chapter 11: Gene Expression The availability of an annotated genome sequence enables massively parallel analysis of gene expression. The expression of all genes in an organism can be measured in one experiment.
More informationWhole Transcriptome Analysis of Illumina RNA- Seq Data. Ryan Peters Field Application Specialist
Whole Transcriptome Analysis of Illumina RNA- Seq Data Ryan Peters Field Application Specialist Partek GS in your NGS Pipeline Your Start-to-Finish Solution for Analysis of Next Generation Sequencing Data
More informationRNA standards v May
Standards, Guidelines and Best Practices for RNA-Seq: 2010/2011 I. Introduction: Sequence based assays of transcriptomes (RNA-seq) are in wide use because of their favorable properties for quantification,
More informationFrom reads to results: differential. Alicia Oshlack Head of Bioinformatics
From reads to results: differential expression analysis with ihrna seq Alicia Oshlack Head of Bioinformatics Murdoch Childrens Research Institute Benefits and opportunities ii of RNA seq All transcripts
More informationIntroduction to RNA-Seq
Introduction to RNA-Seq Monica Britton, Ph.D. Sr. Bioinformatics Analyst March 2015 Workshop Overview of RNA-Seq Activities RNA-Seq Concepts, Terminology, and Work Flows Using Single-End Reads and a Reference
More informationRNA Seq: Methods and Applica6ons. Prat Thiru
RNA Seq: Methods and Applica6ons Prat Thiru 1 Outline Intro to RNA Seq Biological Ques6ons Comparison with Other Methods RNA Seq Protocol RNA Seq Applica6ons Annota6on Quan6fica6on Other Applica6ons Expression
More informationApplications of short-read
Applications of short-read sequencing: RNA-Seq and ChIP-Seq BaRC Hot Topics March 2013 George Bell, Ph.D. http://jura.wi.mit.edu/bio/education/hot_topics/ Sequencing applications RNA-Seq includes experiments
More informationEucalyptus gene assembly
Eucalyptus gene assembly ACGT Plant Biotechnology meeting Charles Hefer Bioinformatics and Computational Biology Unit University of Pretoria October 2011 About Eucalyptus Most valuable and widely planted
More informationAnalysis of RNA-seq Data. Feb 8, 2017 Peikai CHEN (PHD)
Analysis of RNA-seq Data Feb 8, 2017 Peikai CHEN (PHD) Outline What is RNA-seq? What can RNA-seq do? How is RNA-seq measured? How to process RNA-seq data: the basics How to visualize and diagnose your
More informationSequencing applications. Today's outline. Hands-on exercises. Applications of short-read sequencing: RNA-Seq and ChIP-Seq
Sequencing applications Applications of short-read sequencing: RNA-Seq and ChIP-Seq BaRC Hot Topics March 2013 George Bell, Ph.D. http://jura.wi.mit.edu/bio/education/hot_topics/ RNA-Seq includes experiments
More informationIntroduction to RNAseq Analysis. Milena Kraus Apr 18, 2016
Introduction to RNAseq Analysis Milena Kraus Apr 18, 2016 Agenda What is RNA sequencing used for? 1. Biological background 2. From wet lab sample to transcriptome a. Experimental procedure b. Raw data
More informationHigh performance sequencing and gene expression quantification
High performance sequencing and gene expression quantification Ana Conesa Genomics of Gene Expression Lab Centro de Investigaciones Príncipe Felipe Valencia aconesa@cipf.es Next Generation Sequencing NGS
More informationNGS Data Analysis and Galaxy
NGS Data Analysis and Galaxy University of Pretoria Pretoria, South Africa 14-18 October 2013 Dave Clements, Emory University http://galaxyproject.org/ Fourie Joubert, Burger van Jaarsveld Bioinformatics
More informationBioinformatics Advice on Experimental Design
Bioinformatics Advice on Experimental Design Where do I start? Please refer to the following guide to better plan your experiments for good statistical analysis, best suited for your research needs. Statistics
More informationRNA-Seq de novo assembly training
RNA-Seq de novo assembly training Training session aims Give you some keys elements to look at during read quality check. Transcriptome assembly is not completely a strait forward process : Multiple strategies
More informationIntegrative Genomics 1a. Introduction
2016 Course Outline Integrative Genomics 1a. Introduction ggibson.gt@gmail.com http://www.cig.gatech.edu 1a. Experimental Design and Hypothesis Testing (GG) 1b. Normalization (GG) 2a. RNASeq (MI) 2b. Clustering
More informationIntroduction to RNA-Seq in GeneSpring NGS Software
Introduction to RNA-Seq in GeneSpring NGS Software Dipa Roy Choudhury, Ph.D. Strand Scientific Intelligence and Agilent Technologies Learn more at www.genespring.com Introduction to RNA-Seq In a few years,
More informationRNAseq Applications in Genome Studies. Alexander Kanapin, PhD Wellcome Trust Centre for Human Genetics, University of Oxford
RNAseq Applications in Genome Studies Alexander Kanapin, PhD Wellcome Trust Centre for Human Genetics, University of Oxford RNAseq Protocols Next generation sequencing protocol cdna, not RNA sequencing
More informationBasics of RNA-Seq. (With a Focus on Application to Single Cell RNA-Seq) Michael Kelly, PhD Team Lead, NCI Single Cell Analysis Facility
2018 ABRF Meeting Satellite Workshop 4 Bridging the Gap: Isolation to Translation (Single Cell RNA-Seq) Sunday, April 22 Basics of RNA-Seq (With a Focus on Application to Single Cell RNA-Seq) Michael Kelly,
More informationNext Generation Sequencing
Next Generation Sequencing Complete Report Catalogue # and Service: IR16001 rrna depletion (human, mouse, or rat) IR11081 Total RNA Sequencing (80 million reads, 2x75 bp PE) Xxxxxxx - xxxxxxxxxxxxxxxxxxxxxx
More informationIntro to RNA-seq. July 13, 2015
Intro to RNA-seq July 13, 2015 Goal of the course To be able to effectively design, and interpret genomic studies of gene expression. We will focus on RNA-seq, but the class will provide a foothold into
More informationDeep sequencing of transcriptomes
1 / 40 Deep sequencing of transcriptomes An introduction to RNA-seq Michael Dondrup UNI BCCS 2. november 2010 2 / 40 Transcriptomics by Ultra-Fast Sequencing Microarrays have been the primary transcriptomics
More informationSO YOU WANT TO DO A: RNA-SEQ EXPERIMENT MATT SETTLES, PHD UNIVERSITY OF CALIFORNIA, DAVIS
SO YOU WANT TO DO A: RNA-SEQ EXPERIMENT MATT SETTLES, PHD UNIVERSITY OF CALIFORNIA, DAVIS SETTLES@UCDAVIS.EDU Bioinformatics Core Genome Center UC Davis BIOINFORMATICS.UCDAVIS.EDU DISCLAIMER This talk/workshop
More informationTranscriptomics analysis with RNA seq: an overview Frederik Coppens
Transcriptomics analysis with RNA seq: an overview Frederik Coppens Platforms Applications Analysis Quantification RNA content Platforms Platforms Short (few hundred bases) Long reads (multiple kilobases)
More informationVM origin. Okeanos: Image Trinity_U16 (upgrade to Ubuntu16.04, thanks to Alexandros Dimopoulos) X2go: LXDE
VM origin Okeanos: Image Trinity_U16 (upgrade to Ubuntu16.04, thanks to Alexandros Dimopoulos) X2go: LXDE NGS intro + Genome-Based Transcript Reconstruction and Analysis Using RNA-Seq Data Based on material
More information10/06/2014. RNA-Seq analysis. With reference assembly. Cormier Alexandre, PhD student UMR8227, Algal Genetics Group
RNA-Seq analysis With reference assembly Cormier Alexandre, PhD student UMR8227, Algal Genetics Group Summary 2 Typical RNA-seq workflow Introduction Reference genome Reference transcriptome Reference
More informationBenchmarking of RNA-seq data processing pipelines using whole transcriptome qpcr expression data
Benchmarking of RNA-seq data processing pipelines using whole transcriptome qpcr expression data Jan Hellemans 7th international qpcr & NGS Event - Freising March 24 th, 2015 Therapeutics lncrna oncology
More informationMeasuring and Understanding Gene Expression
Measuring and Understanding Gene Expression Dr. Lars Eijssen Dept. Of Bioinformatics BiGCaT Sciences programme 2014 Why are genes interesting? TRANSCRIPTION Genome Genomics Transcriptome Transcriptomics
More informationRNA-SEQUENCING ANALYSIS
RNA-SEQUENCING ANALYSIS Joseph Powell SISG- 2018 CONTENTS Introduction to RNA sequencing Data structure Analyses Transcript counting Alternative splicing Allele specific expression Discovery APPLICATIONS
More informationFFPE in your NGS Study
FFPE in your NGS Study Richard Corbett Canada s Michael Smith Genome Sciences Centre Vancouver, British Columbia Dec 6, 2017 Our mandate is to advance knowledge about cancer and other diseases and to use
More informationExperimental Design. Dr. Matthew L. Settles. Genome Center University of California, Davis
Experimental Design Dr. Matthew L. Settles Genome Center University of California, Davis settles@ucdavis.edu What is Differential Expression Differential expression analysis means taking normalized sequencing
More informationCSE 549: RNA-Seq aided gene finding
CSE 549: RNA-Seq aided gene finding Finding Genes We ll break gene finding methods into 3 main categories. ab initio latin from the beginning w/o experimental evidence comparative make use of knowledge
More informationAnalysis of RNA-seq Data
Analysis of RNA-seq Data A physicist and an engineer are in a hot-air balloon. Soon, they find themselves lost in a canyon somewhere. They yell out for help: "Helllloooooo! Where are we?" 15 minutes later,
More informationMapping strategies for sequence reads
Mapping strategies for sequence reads Ernest Turro University of Cambridge 21 Oct 2013 Quantification A basic aim in genomics is working out the contents of a biological sample. 1. What distinct elements
More informationGene Expression Technology
Gene Expression Technology Bing Zhang Department of Biomedical Informatics Vanderbilt University bing.zhang@vanderbilt.edu Gene expression Gene expression is the process by which information from a gene
More informationCBC Data Therapy. Metatranscriptomics Discussion
CBC Data Therapy Metatranscriptomics Discussion Metatranscriptomics Extract RNA, subtract rrna Sequence cdna QC Gene expression, function Institute for Systems Genomics: Computational Biology Core bioinformatics.uconn.edu
More informationCS-E5870 High-Throughput Bioinformatics RNA-seq analysis
CS-E5870 High-Throughput Bioinformatics RNA-seq analysis Harri Lähdesmäki Department of Computer Science Aalto University September 30, 2016 Acknowledgement for J Salojärvi and E Czeizler for the previous
More informationRNA-Seq Module 2 From QC to differential gene expression.
RNA-Seq Module 2 From QC to differential gene expression. Ying Zhang Ph.D, Informatics Analyst Research Informatics Support System (RISS) MSI Apr. 24, 2012 RNA-Seq Tutorials Tutorial 1: Introductory (Mar.
More information02 Agenda Item 03 Agenda Item
01 Agenda Item 02 Agenda Item 03 Agenda Item SOLiD 3 System: Applications Overview April 12th, 2010 Jennifer Stover Field Application Specialist - SOLiD Applications Workflow for SOLiD Application Application
More informationRNA-Seq data analysis course September 7-9, 2015
RNA-Seq data analysis course September 7-9, 2015 Peter-Bram t Hoen (LUMC) Jan Oosting (LUMC) Celia van Gelder, Jacintha Valk (BioSB) Anita Remmelzwaal (LUMC) Expression profiling DNA mrna protein Comprehensive
More informationRNA-Seq Workshop AChemS Sunil K Sukumaran Monell Chemical Senses Center Philadelphia
RNA-Seq Workshop AChemS 2017 Sunil K Sukumaran Monell Chemical Senses Center Philadelphia Benefits & downsides of RNA-Seq Benefits: High resolution, sensitivity and large dynamic range Independent of prior
More informationRNA-Seq. Joshua Ainsley, PhD Postdoctoral Researcher Lab of Leon Reijmers Neuroscience Department Tufts University
RNA-Seq Joshua Ainsley, PhD Postdoctoral Researcher Lab of Leon Reijmers Neuroscience Department Tufts University joshua.ainsley@tufts.edu Day five Alternative splicing Assembly RNA edits Alternative splicing
More informationIntroduction to transcriptome analysis using High Throughput Sequencing technologies. D. Puthier 2012
Introduction to transcriptome analysis using High Throughput Sequencing technologies D. Puthier 2012 Transcriptome: the old school Cyanine 5 (Cy5) Cy-3: - Excitation 550nm - Emission 570nm Cy-5: - Excitation
More informationTranscriptomics. Marta Puig Institut de Biotecnologia i Biomedicina Universitat Autònoma de Barcelona
Transcriptomics Marta Puig Institut de Biotecnologia i Biomedicina Universitat Autònoma de Barcelona Central dogma of molecular biology Central dogma of molecular biology Genome Complete DNA content of
More informationExperimental design of RNA-Seq Data
Experimental design of RNA-Seq Data RNA-seq course: The Power of RNA-seq Thursday June 6 th 2013, Marco Bink Biometris Overview Acknowledgements Introduction Experimental designs Randomization, Replication,
More informationTranscriptome Assembly, Functional Annotation (and a few other related thoughts)
Transcriptome Assembly, Functional Annotation (and a few other related thoughts) Monica Britton, Ph.D. Sr. Bioinformatics Analyst June 23, 2017 Differential Gene Expression Generalized Workflow File Types
More informationRNA sequencing Integra1ve Genomics module
RNA sequencing Integra1ve Genomics module Michael Inouye Centre for Systems Genomics University of Melbourne, Australia Summer Ins@tute in Sta@s@cal Gene@cs 2016 SeaBle, USA @minouye271 inouyelab.org This
More informationDavid M. Rocke Division of Biostatistics and Department of Biomedical Engineering University of California, Davis
David M. Rocke Division of Biostatistics and Department of Biomedical Engineering University of California, Davis Outline RNA-Seq for differential expression analysis Statistical methods for RNA-Seq: Structure
More informationSystematic evaluation of spliced alignment programs for RNA- seq data
Systematic evaluation of spliced alignment programs for RNA- seq data Pär G. Engström, Tamara Steijger, Botond Sipos, Gregory R. Grant, André Kahles, RGASP Consortium, Gunnar Rätsch, Nick Goldman, Tim
More informationQuantifying gene expression
Quantifying gene expression Genome GTF (annotation)? Sequence reads FASTQ FASTQ (+reference transcriptome index) Quality control FASTQ Alignment to Genome: HISAT2, STAR (+reference genome index) (known
More informationGeneScissors: a comprehensive approach to detecting and correcting spurious transcriptome inference owing to RNA-seq reads misalignment
GeneScissors: a comprehensive approach to detecting and correcting spurious transcriptome inference owing to RNA-seq reads misalignment Zhaojun Zhang, Shunping Huang, Jack Wang, Xiang Zhang, Fernando Pardo
More informationFinding Genes with Genomics Technologies
PLNT2530 Plant Biotechnology (2018) Unit 7 Finding Genes with Genomics Technologies Unless otherwise cited or referenced, all content of this presenataion is licensed under the Creative Commons License
More informationGene expression analysis. Biosciences 741: Genomics Fall, 2013 Week 5. Gene expression analysis
Gene expression analysis Biosciences 741: Genomics Fall, 2013 Week 5 Gene expression analysis From EST clusters to spotted cdna microarrays Long vs. short oligonucleotide microarrays vs. RT-PCR Methods
More informationFrom reads to results: differen1al expression analysis with RNA seq. Alicia Oshlack Bioinforma1cs Division Walter and Eliza Hall Ins1tute
From reads to results: differen1al expression analysis with RNA seq Alicia Oshlack Bioinforma1cs Division Walter and Eliza Hall Ins1tute Purported benefits and opportuni1es of RNA seq All transcripts are
More informationRNA-Seq Software, Tools, and Workflows
RNA-Seq Software, Tools, and Workflows Monica Britton, Ph.D. Sr. Bioinformatics Analyst September 1, 2016 Some mrna-seq Applications Differential gene expression analysis Transcriptional profiling Assumption:
More informationDifferential expression analysis for sequencing count data. Simon Anders
Differential expression analysis for sequencing count data Simon Anders Two applications of RNA-Seq Discovery find new transcripts find transcript boundaries find splice junctions Comparison Given samples
More informationSanger vs Next-Gen Sequencing
Tools and Algorithms in Bioinformatics GCBA815/MCGB815/BMI815, Fall 2017 Week-8: Next-Gen Sequencing RNA-seq Data Analysis Babu Guda, Ph.D. Professor, Genetics, Cell Biology & Anatomy Director, Bioinformatics
More informationRNAseq Differential Gene Expression Analysis Report
RNAseq Differential Gene Expression Analysis Report Customer Name: Institute/Company: Project: NGS Data: Bioinformatics Service: IlluminaHiSeq2500 2x126bp PE Differential gene expression analysis Sample
More informationBioinformatics Monthly Workshop Series. Speaker: Fan Gao, Ph.D Bioinformatics Resource Office The Picower Institute for Learning and Memory
Bioinformatics Monthly Workshop Series Speaker: Fan Gao, Ph.D Bioinformatics Resource Office The Picower Institute for Learning and Memory Schedule for Fall, 2015 PILM Bioinformatics Web Server (09/21/2015)
More informationGenomic resources. for non-model systems
Genomic resources for non-model systems 1 Genomic resources Whole genome sequencing reference genome sequence comparisons across species identify signatures of natural selection population-level resequencing
More informationMeasuring transcriptomes with RNA-Seq. BMI/CS 776 Spring 2016 Anthony Gitter
Measuring transcriptomes with RNA-Seq BMI/CS 776 www.biostat.wisc.edu/bmi776/ Spring 2016 Anthony Gitter gitter@biostat.wisc.edu Overview RNA-Seq technology The RNA-Seq quantification problem Generative
More informationComparative analysis of RNA-Seq data with DESeq2
Comparative analysis of RNA-Seq data with DESeq2 Simon Anders EMBL Heidelberg Two applications of RNA-Seq Discovery find new transcripts find transcript boundaries find splice junctions Comparison Given
More informationOptimal Calculation of RNA-Seq Fold-Change Values
International Journal of Computational Bioinformatics and In Silico Modeling Vol. 2, No. 6 (2013): 285-292 Research Article Open Access ISSN: 2320-0634 Optimal Calculation of RNA-Seq Fold-Change Values
More informationMeasuring transcriptomes with RNA-Seq
Measuring transcriptomes with RNA-Seq BMI/CS 776 www.biostat.wisc.edu/bmi776/ Spring 2017 Anthony Gitter gitter@biostat.wisc.edu These slides, excluding third-party material, are licensed under CC BY-NC
More informationDeep Sequencing technologies
Deep Sequencing technologies Gabriela Salinas 30 October 2017 Transcriptome and Genome Analysis Laboratory http://www.uni-bc.gwdg.de/index.php?id=709 Microarray and Deep-Sequencing Core Facility University
More informationOvercome limitations with RNA-Seq
Buyer s Guide Simple, customized RNA-Seq workflows Evaluating options for next-generation RNA sequencing Overcome limitations with RNA-Seq Next-generation sequencing (NGS) has revolutionized the study
More informationIntegrated NGS Sample Preparation Solutions for Limiting Amounts of RNA and DNA. March 2, Steven R. Kain, Ph.D. ABRF 2013
Integrated NGS Sample Preparation Solutions for Limiting Amounts of RNA and DNA March 2, 2013 Steven R. Kain, Ph.D. ABRF 2013 NuGEN s Core Technologies Selective Sequence Priming Nucleic Acid Amplification
More informationSingle-Cell Whole Transcriptome Profiling With the SOLiD. System
APPLICATION NOTE Single-Cell Whole Transcriptome Profiling Single-Cell Whole Transcriptome Profiling With the SOLiD System Introduction The ability to study the expression patterns of an individual cell
More informationRNA Sequencing. Next gen insight into transcriptomes , Elio Schijlen
RNA Sequencing Next gen insight into transcriptomes 05-06-2013, Elio Schijlen Transcriptome complete set of transcripts in a cell, and their quantity, for a specific developmental stage or physiological
More informationBST 226 Statistical Methods for Bioinformatics David M. Rocke. March 10, 2014 BST 226 Statistical Methods for Bioinformatics 1
BST 226 Statistical Methods for Bioinformatics David M. Rocke March 10, 2014 BST 226 Statistical Methods for Bioinformatics 1 NGS Technologies Illumina Sequencing HiSeq 2500 & MiSeq PacBio Sequencing PacBio
More informationNext-Generation Sequencing. Technologies
Next-Generation Next-Generation Sequencing Technologies Sequencing Technologies Nicholas E. Navin, Ph.D. MD Anderson Cancer Center Dept. Genetics Dept. Bioinformatics Introduction to Bioinformatics GS011062
More informationNext-generation sequencing technologies
Next-generation sequencing technologies Illumina: Summary https://www.youtube.com/watch?v=fcd6b5hraz8 Illumina platforms: Benchtop sequencers https://www.illumina.com/systems/sequencing-platforms.html
More informationMassive Analysis of cdna Ends for simultaneous Genotyping and Transcription Profiling in High Throughput
Next Generation (Sequencing) Tools for Nucleotide-Based Information Massive Analysis of cdna Ends for simultaneous Genotyping and Transcription Profiling in High Throughput Björn Rotter, PhD GenXPro GmbH,
More informationresequencing storage SNP ncrna metagenomics private trio de novo exome ncrna RNA DNA bioinformatics RNA-seq comparative genomics
RNA Sequencing T TM variation genetics validation SNP ncrna metagenomics private trio de novo exome mendelian ChIP-seq RNA DNA bioinformatics custom target high-throughput resequencing storage ncrna comparative
More informationApplication of NGS (nextgeneration. for studying RNA regulation. Sung Wook Chi. Sungkyunkwan University (SKKU) Samsung Medical Center (SMC)
Application of NGS (nextgeneration sequencing) for studying RNA regulation Samsung Advanced Institute of Heath Sciences and Technology (SAIHST) Sungkyunkwan University (SKKU) Samsung Research Institute
More informationRNA-Seq analysis using R: Differential expression and transcriptome assembly
RNA-Seq analysis using R: Differential expression and transcriptome assembly Beibei Chen Ph.D BICF 12/7/2016 Agenda Brief about RNA-seq and experiment design Gene oriented analysis Gene quantification
More information