Next generation sequencing (NGS)- RNA sequencing
|
|
- Gabriella Bell
- 6 years ago
- Views:
Transcription
1 Next generation sequencing (NGS)- RNA sequencing Vijayachitra Modhukur BIIT 1
2 NGS lectures Genomics Transcriptomics Epigenomics Protomics 2
3 NGS lectures Genomics Transcriptomics Epigenomics Protomics 3
4 Recap 4
5 Sequencing 5
6 Different generations sequencing 6
7 Second generation sequencing 7
8 NGS platforms 454 Solexa/Illumina SOLiD (ABI) Bp per run 400 Mb 2-3 Gb 3-6 Gb Read length bp (70-100) bp bp run time 10 hr 2.5 days 5 days Download 20 min 27 hr (44 min) ~1 day Analysis 2-5 hr 2 days 2-3 days Files Gb 1T 1 T 8
9 Massive amount of sequenced data 9
10 De novo alignment Reference alignment Sequence alignment 10
11 Short read mapping (Denovo) - ssp Let f1,f2 fk be the words in Σ*. We want to find shortest substring g εσ* such that fi is the substring of g Example: Lets say we have set of strings f1 = ACGTA, f2 = CTTGA, f3 = ACTT, f4 = GTAAC Find the shortest common superstring of these 4 string 11
12 Reference alignment Find locations where short read is identical to reference genome 12
13 NGS Analysis 13
14 Data analysis cpu/memory intensive 14
15 Quality scores Each base from a sequencer comes with a quality score Base-calling error probabilities Phred quality score Q = 10 log10 P higher quality score indicates a smaller probability of error 15
16 Quality scores 16
17 File formats 17
18 fastq Raw data 18
19 Reference assembly Spaced seed BWT Alignment methods Denovo assembly Greedy Assemblers Graph based Overlap layout consensus Graph based Debruign graph 19
20 RNA sequencing 20
21 Transcription 21
22 RNA world hypothesis 22
23 What is RNA-seq? Use of high-throughput sequencing technologies to assess the RNA content of a sample. 23 slides from Halisha Holloway
24 RNA-seq vs microarray 24 RNA-seq ID novel genes, transcripts, & exons Greater dynamic range Less bias due to genetic variation Repeatable No species-specific primer/probe design More accurate relative to qpcr Many more applications Microarray Well vetted QC and analysis methods Well characterized biases Quick turnaround from established core facilities Currently less expensive
25 RNA-Seq vs microarray 25
26 Why do an RNA-seq experiment? Detect differential expression Assess allele-specific expression Quantify alternative transcript usage Discover novel genes/transcripts, gene fusions Profile transcriptome Ribosome profiling to measure translation 26
27 Why do an RNA-seq experiment? Detect differential expression Assess allele-specific expression Quantify alternative transcript usage Discover novel genes/transcripts, gene fusions Profile transcriptome Ribosome profiling to measure translation 27 Skelly et al. 2011
28 Why do an RNA-seq experiment? Detect differential expression Assess allele-specific expression Quantify alternative transcript usage Discover novel genes/transcripts, gene fusions Profile transcriptome Ribosome profiling to measure translation 28
29 Why do an RNA-seq experiment? Detect differential expression Assess allele-specific expression Quantify alternative transcript usage Discover novel genes/transcripts, gene fusions Profile transcriptome Ribosome profiling to measure translation 29
30 Why do an RNA-seq experiment? Detect differential expression Assess allele-specific expression Quantify alternative transcript usage Pluripotent Stem Cell Cardiogenic Mesoderm Cardiac Precursors Cardiomyocytes Discover novel genes/transcripts, gene fusions Profile transcriptome Ribosome profiling to measure translation 30
31 Why do an RNA-seq experiment? Detect differential expression Assess allele-specific expression Quantify alternative transcript usage Discover novel genes/transcripts, gene fusions Profile transcriptome Ribosome profiling to measure translation 31
32 RNA-seq protocol 32
33 RNA-seq protocol Sample RNA Amplified cdna cdna fragments reads reverse transcription + PCR fragmentation sequencing machine CCTTCNCACTTCGTTTCCCAC TTTTTNCAGAGTTTTTTCTTG GAACANTCCAACGCTTGGTGA GGAAANAAGACCCTGTTGAGC CCCGGNGATCCGCTGGGACAA GCAGCATATTGATAGATAACT CTAGCTACGCGTACGCGATCG CATCTAGCATCGCGTTGCGTT CCCGCGCGCTTAGGCTACTCG TCACACATCTCTAGCTAGCAT CATGCTAGCTATGCCTATCTA CACCCCGGGGATATATAGGAT 33
34 RNA-seq CCTTCNCACTTCGTTTCCCACTTAGCGATAATTTG +HWUSI-EAS1789_0001:3:2:1708:1305#0/1 TTTTTNCAGAGTTTTTTCTTGAACTGGAAATTTTT +HWUSI-EAS1789_0001:3:2:2062:1304#0/1 a GAACANTCCAACGCTTGGTGAATTCTGCTTCACAA +HWUSI-EAS1789_0001:3:2:3194:1303#0/1 ZZ[[VBZZY][TWQQZ\ZS\[ZZXV GGAAANAAGACCCTGTTGAGCTTGACTCTAGTCTG +HWUSI-EAS1789_0001:3:2:3716:1304#0/1 CCCGGNGATCCGCTGGGACAAGCAGCATATTGATA +HWUSI-EAS1789_0001:3:2:5000:1304#0/1 aaaaabeeeeffffehhhhhhggdhhhhahhhadh?? 34
35 Coverage Coverage = Number of sequenced reads/size of the original genome The number of sequenced reads = Number of reads length of the reads 35
36 Some things to consider in experimental design 36
37 Plan it well Experimental design Biological replicates Reference genome? Good gene annotation? Read depth Read length Paired vs. single-end Biological variation Technical variation 37
38 Experimental design Biological replicates Reference genome? Good gene annotation? Read depth Read length Paired vs. single-end Plan it well 38
39 Plan it well Experimental design Biological replicates Reference genome? Good gene annotation? Read depth Read length Paired vs. single-end Fraction of transcripts with non zero FPKM (relative to 100%) Robustness of transcript identification as input data are removed 10% 5% 2% 1% 0.1% Cufflinks USeq-DESeq Fraction of total number of reads in jackknifed data set 39
40 How much data do we need? ~15-20K genes expressed in a tissue cell line. Genes are on average 3KB For 1x coverage using 100 bp reads, would need 600K sequence reads In reality, we need MUCH higher coverage to accurately estimate gene expression levels million reads 40
41 Plan it well Experimental design Biological replicates Reference genome? Good gene annotation? Read depth Read length Paired vs. single-end Uniq seq = 4read length Read length Unique seq x x x10 60 ~60 million coding bases in vertebrate genome 41
42 Experimental design Biological replicates Reference genome? Good gene annotation? Read depth Barcoding Read length Paired vs. single-end Plan it well 42
43 Power of paired-end reads Huge impact on read mapping Pairs give two locations to determine whether read is unique Critical for estimating transcript-level abundance Increases number of splice junction spanning reads 43
44 Comparison of two designs for testing differential expression between treatments A and B. Treatment A is denoted by red tones and treatment B by blue tones. Auer P L, and Doerge R W Genetics 2010;185: Copyright 2010 by the Genetics Society of America
45 RNA-seq pipeline 45
46 Typical RNA-seq experiment 46
47 RNA-seq informatics workflow 1. Qc and genome mapping 2. Splice junction fragments 3. Predict novel junctions/ exons 4. Counts 5. Normalize 6. Differential expression 7. Gene lists 47
48 Quality control 48
49 QC: Raw Data Sequence call quality 49
50 Sequence bias QC: Raw Data 50
51 Duplication level QC: Raw Data 51
52 Mapping 52
53 Mapping Align read to the genome Simple for genomic sequences Difficult for transcripts with splice junction 53
54 Junction reads 54
55 Tophat-pipeline 55
56 Alternative splicing 56
57 Alternative splicing 57
58 Cuff-links 58
59 RNA-seq complete pipeline 59
60 RNA seq-summarization 60
61 Normalization aims Comparable across features (genes, isoforms etc.,) Comparable across different samples (libraries) Between samples (libraries) Within sampes(libraries) Easily interprettable 61
62 Within library normalization Allows quantification of expression levels of each gene relative to each other s gene with in the library Longer transcripts have higher read counts( with same expression level) Widely used : RPKM (Reads per Kilobase per Million Base) 62
63 No.of mapped reads =3 lenth of transcript=300 bp Total no. of reads =10,000 RPKM-example RPK = 3/(300/1000) = 3/0.3 = 10 RPKM = 10 / (10,000/1,000,000) = 10/ 0.01 = 1000 RPKM =
64 Between library normalization Adjust by total number of reads in the library Smaller number of highly expressed genes can consume significant amount of sequences Solution: scaling factor Scaling the number of reads in a library to a common value Quantile normalization 64
65 Differential expression List genes changed significantly in abundace across different experimental conditions Not same as microarrays, since not log transformed If reads independently sampled from population, reads would follow multinomial distribution appx by Poisson Pr(X = k) =λ k e -k /k! 65
66 Several tools for differential expression Differential expression maximization (RSEM) Cuffdiff 29 Uses isoform levels in analysis Identifying differentially DegSeq 79 Uses a normal distribution expressed genes or transcript isoforms EdgeR 77 Differential Expression analysis of count data (DESeq) 78 Myrna 75 Cloud-based permutation method Read alignments and transcript models 66
67 Analysis of differentially expressed gene list 67
68 Gene ontology analysis Nucleic Acids Research, (A) (B) 68 Figure 1. (A) A typical user input and output scenario of g:profiler. User inserts a set of genes in the main text window and optionally adjusts query parameters. Results are provided either graphically or in textual format. Genes are presented in columns, and significant functional categories in rows. The analysis of an ordered list shows the length of the most significant query head. GO annotation evidence codes are coloured like a heat map, showing the strength of evidence between a gene and GO term. The legend is provided at the top of the page. It is displayed when the user clicks on the tree icon on the results page. The g:orth, g:convert and G:Sorter tools are directly linked to relevant genes from the current query. Additional examples are available in Supplementary Data. (B) Hierarchical relations between the resulting GO categories can be browsed by clicking on corresponding icons.
69 epithelium development Arrhythmogenic right ventricular... extracellular matrix organization Glucose metabolism Huntington's disease response to inorganic substance cell junction assembly protein N linked glycosylation v... intracellular protein transport protein N linked glycosylation DNA dependent transcription, ter... Leukocyte transendothelial migra... induction of apoptosis positive regulation of leukocyte... negative regulation of programme... Natural killer cell mediated cyt... intracellular protein kinase cas... Hematopoietic cell lineage Chemokine signaling pathway protein complex subunit organiza... regulation of protein kinase act... vesicle mediated transport Dopaminergic synapse Retrograde endocannabinoid signa... cytoskeleton organization Glutamatergic synapse synapse organization Opioid Signalling purine nucleoside triphosphate m... Calcium signaling pathway regulation of small GTPase media... negative regulation of cellular... regulation of cellular localization actin filament based process regulation of transporter activity Gastric acid secretion regulation of cell morphogenesis... secretion by cell Salivary secretion cognition GABAergic synapse transmembrane receptor protein t... generation of a signal involved... ion transmembrane transport Long term potentiation GTP catabolic process Morphine addiction positive regulation of cellular... Gene ontology Gosummaries A cell.line VS brain B G1 > G2: 2168 G1 < G2: 2132 D Tissue brain cell line C hematopoietic system muscle spindle organization chromosome organization interspecies interaction between... viral reproduction RNA processing DNA replication translation anaphase promoting complex depen... chromosome segregation nuclear division mitotic cell cycle cell cycle phase cell cycle checkpoint establishment of organelle local... regulation of mitosis DNA conformation change p53 signaling pathway protein complex subunit organiza... Cell Cycle Checkpoints cellular component biogenesis at... RNA transport regulation of cellular amino aci... DNA Replication ncrna metabolic process cellular macromolecular complex... positive regulation of ligase ac... negative regulation of ubiquitin... positive regulation of protein u... DNA recombination Cell Cycle, Mitotic Cell cycle response to DNA damage stimulus cell division mrna metabolic process regulation of ubiquitin protein... DNA damage response, signal tran... ion transport regulation of nervous system dev... behavior central nervous system development neuron projection morphogenesis neuron development multicellular organismal signaling neuron projection development regulation of synaptic transmission regulation of membrane potential axon guidance regulation of neuron differentia... neurotransmitter transport E Enrichment P value muscle VS hematopoietic.system G1 > G2: 1527 G1 < G2: 1159 cell morphogenesis involved in d... acetyl CoA metabolic process actin filament based process taxis wound healing Cardiac muscle contraction circulatory system process Focal adhesion Glucose Regulation of Insulin Se... energy derivation by oxidation o... Alzheimer's disease cell adhesion enzyme linked receptor protein s... ECM receptor interaction muscle structure development cardiovascular system development muscle system process generation of precursor metaboli... muscle tissue development Oxidative phosphorylation Hypertrophic cardiomyopathy (HCM) regulation of anatomical structu... tissue morphogenesis anatomical structure formation i... cell migration Parkinson's disease glucose metabolic process Dilated cardiomyopathy organ morphogenesis regulation of system process response to endogenous stimulus regulation of cell migration neuron projection morphogenesis positive regulation of cytokine... integrin mediated signaling pathway positive regulation of protein m... positive regulation of catalytic... regulation of hydrolase activity blood coagulation cell chemotaxis innate immune response response to other organism cell adhesion regulation of defense response inflammatory response hemopoiesis positive regulation of immune sy... cell activation regulation of immune response immune effector process response to cytokine stimulus interspecies interaction between... leukocyte migration actin polymerization or depolyme... positive regulation of lymphocyt... cytokine production lymphocyte proliferation hemostasis adaptive immune response regulation of protein phosphoryl... positive regulation of cytokine... peptidyl tyrosine phosphorylation vesicle mediated transport 69
70 Pathway analysis 70
71 And many more.. 71
72 Novel genomes How do we compute RNA-seq gene expression for novel genomes? Must have complete genome sequence (or contigs). Use predicted gene models (all protein BLASTX or EST vs genome data) to create an exon map or de novo assembly of transcripts from RNA-seq data Computationally huge problem: all-against-all similarity searching and multiple overlapping transcripts. 72
73 73
74 RNA seq analysis programs Table 1 Selected list of RNA-seq analysis programs Class Category Package Notes Uses Input Read mapping Unspliced Seed methods Short-read mapping package Smith-Waterman extension Aligning reads to a aligners a (SHRiMP) 41 reference transcriptome Burrows-Wheeler transform methods Stampy 39 Bowtie 43 BWA 44 Probabilistic model Incorporates quality scores Spliced aligners Exon-first methods MapSplice 52 Works with multiple unspliced SpliceMap 50 aligners Transcriptome reconstruction Genome-guided reconstruction Genomeindependent reconstruction Expression quantification Expression quantification Differential expression TopHat 51 Uses Bowtie alignments Seed-extend methods GSNAP 53 Can use SNP databases QPALMA 54 Smith-Waterman for large gaps Aligning reads to a reference genome. Allows for the identification of novel splice junctions Exon identification G.Mor.Se Assembles exons Identifying novel transcripts Genome-guided Scripture 28 Reports all isoforms using a known reference assembly Cufflinks 29 Reports a minimal set of isoforms genome Genome-independent assembly Velvet 61 Reports all isoforms Identifying novel genes and Reads TransABySS 56 transcript isoforms without a known reference genome Gene quantification Alexa-seq 47 Quantifies using differentially included exons Enhanced read analysis of gene expression (ERANGE) 20 Quantifies using union of exons Normalization by expected uniquely mappable area (NEUMA) 82 Quantifies using unique reads Isoform quantification Cufflinks 29 Maximum likelihood estimation of MISO 33 relative isoform expression RNA-seq by expectaion maximization (RSEM) 69 Reads and reference transcriptome Reads and reference genome Alignments to reference genome Quantifying gene expression Reads and transcript models Quantifying transcript isoform expression levels Cuffdiff 29 Uses isoform levels in analysis Identifying differentially DegSeq 79 Uses a normal distribution expressed genes or EdgeR 77 transcript isoforms Differential Expression analysis of count data (DESeq) 78 Myrna 75 Cloud-based permutation method Read alignments to isoforms Read alignments and transcript models 74
75 Comparison of tools 75
76 Challenges Several sequencing technolgies Complex normalization Difficulty to achieve mappability Accurate detection of splice junction Proper summarization methods needed Most challenging for novel genomes Not many algorithms exist for denovo assembly when compared to reference assembly. 76
77 Summary RNA-seq to study RNA content Quantitative than microarrays Can be used for studying different layers of transcription several factors to be considered in experimental design Mapping, transcript assembly, summarization, differential expression and visualization are the major steps in RNA-seq Gene ontology analysis, pathway analysis, integrative study followed by systems biology are the possible proceeding steps of RNA-seq gene lists. 77
Measuring transcriptomes with RNA-Seq. BMI/CS 776 Spring 2016 Anthony Gitter
Measuring transcriptomes with RNA-Seq BMI/CS 776 www.biostat.wisc.edu/bmi776/ Spring 2016 Anthony Gitter gitter@biostat.wisc.edu Overview RNA-Seq technology The RNA-Seq quantification problem Generative
More informationMeasuring transcriptomes with RNA-Seq
Measuring transcriptomes with RNA-Seq BMI/CS 776 www.biostat.wisc.edu/bmi776/ Spring 2017 Anthony Gitter gitter@biostat.wisc.edu These slides, excluding third-party material, are licensed under CC BY-NC
More informationChIP-seq and RNA-seq
ChIP-seq and RNA-seq Biological Goals Learn how genomes encode the diverse patterns of gene expression that define each cell type and state. Protein-DNA interactions (ChIPchromatin immunoprecipitation)
More informationChIP-seq and RNA-seq. Farhat Habib
ChIP-seq and RNA-seq Farhat Habib fhabib@iiserpune.ac.in Biological Goals Learn how genomes encode the diverse patterns of gene expression that define each cell type and state. Protein-DNA interactions
More informationEucalyptus gene assembly
Eucalyptus gene assembly ACGT Plant Biotechnology meeting Charles Hefer Bioinformatics and Computational Biology Unit University of Pretoria October 2011 About Eucalyptus Most valuable and widely planted
More informationIntroduction to RNA-Seq. David Wood Winter School in Mathematics and Computational Biology July 1, 2013
Introduction to RNA-Seq David Wood Winter School in Mathematics and Computational Biology July 1, 2013 Abundance RNA is... Diverse Dynamic Central DNA rrna Epigenetics trna RNA mrna Time Protein Abundance
More informationAnalysis of data from high-throughput molecular biology experiments Lecture 6 (F6, RNA-seq ),
Analysis of data from high-throughput molecular biology experiments Lecture 6 (F6, RNA-seq ), 2012-01-26 What is a gene What is a transcriptome History of gene expression assessment RNA-seq RNA-seq analysis
More informationSupplemental Material for: A high-throughput RNA-seq approach to profile transcriptional responses
Supplemental Material for: A high-throughput RNA-seq approach to profile transcriptional responses G. A. Moyerbrailean 1, G. O. Davis 1, C. T. Harvey 1, D. Watza 1, X. Wen 2, R. Pique-Regi 1,3,, F. Luca
More informationRNA-sequencing. Next Generation sequencing analysis Anne-Mette Bjerregaard. Center for biological sequence analysis (CBS)
RNA-sequencing Next Generation sequencing analysis 2016 Anne-Mette Bjerregaard Center for biological sequence analysis (CBS) Terms and definitions TRANSCRIPTOME The full set of RNA transcripts and their
More informationIntroduction to RNA-Seq in GeneSpring NGS Software
Introduction to RNA-Seq in GeneSpring NGS Software Dipa Roy Choudhury, Ph.D. Strand Scientific Intelligence and Agilent Technologies Learn more at www.genespring.com Introduction to RNA-Seq In a few years,
More informationHow it All Works. Sample. Data analysis. Library Prepara>on. Sequencing
Library PREP How it All Works Extract DNA Fragment Sample Data analysis Sequencing Library Prepara>on Polymerase Chain Reaction Polymerase Chain Reaction Polymerase Chain Reaction Polymerase Chain Reaction
More informationBackground Wikipedia Lee and Mahadavan, JCB, 2009 History (Platform Comparison) P Park, Nature Review Genetics, 2009 P Park, Nature Reviews Genetics, 2009 Rozowsky et al., Nature Biotechnology, 2009
More informationTranscriptome analysis
Statistical Bioinformatics: Transcriptome analysis Stefan Seemann seemann@rth.dk University of Copenhagen April 11th 2018 Outline: a) How to assess the quality of sequencing reads? b) How to normalize
More informationCOMPUTATIONAL PREDICTION AND CHARACTERIZATION OF A TRANSCRIPTOME USING CASSAVA (MANIHOT ESCULENTA) RNA-SEQ DATA
COMPUTATIONAL PREDICTION AND CHARACTERIZATION OF A TRANSCRIPTOME USING CASSAVA (MANIHOT ESCULENTA) RNA-SEQ DATA AOBAKWE MATSHIDISO, SCOTT HAZELHURST, CHRISSIE REY Wits Bioinformatics, University of the
More informationExperimental Design. Sequencing. Data Quality Control. Read mapping. Differential Expression analysis
-Seq Analysis Quality Control checks Reproducibility Reliability -seq vs Microarray Higher sensitivity and dynamic range Lower technical variation Available for all species Novel transcript identification
More informationStatistical Genomics and Bioinformatics Workshop. Genetic Association and RNA-Seq Studies
Statistical Genomics and Bioinformatics Workshop: Genetic Association and RNA-Seq Studies RNA Seq and Differential Expression Analysis Brooke L. Fridley, PhD University of Kansas Medical Center 1 Next-generation
More informationWhole Transcriptome Analysis of Illumina RNA- Seq Data. Ryan Peters Field Application Specialist
Whole Transcriptome Analysis of Illumina RNA- Seq Data Ryan Peters Field Application Specialist Partek GS in your NGS Pipeline Your Start-to-Finish Solution for Analysis of Next Generation Sequencing Data
More informationApplications of short-read
Applications of short-read sequencing: RNA-Seq and ChIP-Seq BaRC Hot Topics March 2013 George Bell, Ph.D. http://jura.wi.mit.edu/bio/education/hot_topics/ Sequencing applications RNA-Seq includes experiments
More informationSequencing applications. Today's outline. Hands-on exercises. Applications of short-read sequencing: RNA-Seq and ChIP-Seq
Sequencing applications Applications of short-read sequencing: RNA-Seq and ChIP-Seq BaRC Hot Topics March 2013 George Bell, Ph.D. http://jura.wi.mit.edu/bio/education/hot_topics/ RNA-Seq includes experiments
More informationmeasuring gene expression December 5, 2017
measuring gene expression December 5, 2017 transcription a usually short-lived RNA copy of the DNA is created through transcription RNA is exported to the cytoplasm to encode proteins some types of RNA
More informationRNA-SEQUENCING ANALYSIS
RNA-SEQUENCING ANALYSIS Joseph Powell SISG- 2018 CONTENTS Introduction to RNA sequencing Data structure Analyses Transcript counting Alternative splicing Allele specific expression Discovery APPLICATIONS
More informationmeasuring gene expression December 11, 2018
measuring gene expression December 11, 2018 Intervening Sequences (introns): how does the cell get rid of them? Splicing!!! Highly conserved ribonucleoprotein complex recognizes intron/exon junctions and
More informationAnalysis of RNA-seq Data
Analysis of RNA-seq Data A physicist and an engineer are in a hot-air balloon. Soon, they find themselves lost in a canyon somewhere. They yell out for help: "Helllloooooo! Where are we?" 15 minutes later,
More information1. Introduction Gene regulation Genomics and genome analyses
1. Introduction Gene regulation Genomics and genome analyses 2. Gene regulation tools and methods Regulatory sequences and motif discovery TF binding sites Databases 3. Technologies Microarrays Deep sequencing
More informationRNA-Sequencing analysis
RNA-Sequencing analysis Markus Kreuz 25. 04. 2012 Institut für Medizinische Informatik, Statistik und Epidemiologie Content: Biological background Overview transcriptomics RNA-Seq RNA-Seq technology Challenges
More informationBayesian Decomposition
Bayesian Decomposition Michael Ochs Making Proteins A Closer Look at Translation Post-Translational Modification RNA Splicing mirna Identifying Pathways A 1 2 3 B C D A B C D Bioinformatics www.promega.com
More informationSequence Analysis 2RNA-Seq
Sequence Analysis 2RNA-Seq Lecture 10 2/21/2018 Instructor : Kritika Karri kkarri@bu.edu Transcriptome Entire set of RNA transcripts in a given cell for a specific developmental stage or physiological
More informationAnalysis of RNA-seq Data. Feb 8, 2017 Peikai CHEN (PHD)
Analysis of RNA-seq Data Feb 8, 2017 Peikai CHEN (PHD) Outline What is RNA-seq? What can RNA-seq do? How is RNA-seq measured? How to process RNA-seq data: the basics How to visualize and diagnose your
More informationIntroduction to RNAseq Analysis. Milena Kraus Apr 18, 2016
Introduction to RNAseq Analysis Milena Kraus Apr 18, 2016 Agenda What is RNA sequencing used for? 1. Biological background 2. From wet lab sample to transcriptome a. Experimental procedure b. Raw data
More informationQuantifying gene expression
Quantifying gene expression Genome GTF (annotation)? Sequence reads FASTQ FASTQ (+reference transcriptome index) Quality control FASTQ Alignment to Genome: HISAT2, STAR (+reference genome index) (known
More informationRNA
RNA sequencing Michael Inouye Baker Heart and Diabetes Institute Univ of Melbourne / Monash Univ Summer Institute in Statistical Genetics 2017 Integrative Genomics Module Seattle @minouye271 www.inouyelab.org
More informationRNA-Seq Analysis. Simon Andrews, Laura v
RNA-Seq Analysis Simon Andrews, Laura Biggins simon.andrews@babraham.ac.uk @simon_andrews v2018-10 RNA-Seq Libraries rrna depleted mrna Fragment u u u u NNNN Random prime + RT 2 nd strand synthesis (+
More informationFrom reads to results: differential. Alicia Oshlack Head of Bioinformatics
From reads to results: differential expression analysis with ihrna seq Alicia Oshlack Head of Bioinformatics Murdoch Childrens Research Institute Benefits and opportunities ii of RNA seq All transcripts
More informationCBC Data Therapy. Metatranscriptomics Discussion
CBC Data Therapy Metatranscriptomics Discussion Metatranscriptomics Extract RNA, subtract rrna Sequence cdna QC Gene expression, function Institute for Systems Genomics: Computational Biology Core bioinformatics.uconn.edu
More informationIntroduction to RNA sequencing
Introduction to RNA sequencing Bioinformatics perspective Olga Dethlefsen NBIS, National Bioinformatics Infrastructure Sweden November 2017 Olga (NBIS) RNA-seq November 2017 1 / 49 Outline Why sequence
More informationPost-assembly Data Analysis
Assembled transcriptome Post-assembly Data Analysis Quantification: the expression level of each gene in each sample DE genes: genes differentially expressed between samples Clustering/network analysis
More informationless sensitive than RNA-seq but more robust analysis pipelines expensive but quantitiatve standard but typically not high throughput
Chapter 11: Gene Expression The availability of an annotated genome sequence enables massively parallel analysis of gene expression. The expression of all genes in an organism can be measured in one experiment.
More informationWheat CAP Gene Expression with RNA-Seq
Wheat CAP Gene Expression with RNA-Seq July 9 th -13 th, 2018 Overview of the workshop, Alina Akhunova http://www.ksre.k-state.edu/igenomics/workshops/ RNA-Seq Workshop Activities Lectures Laboratory Molecular
More informationNext Generation Sequencing
Next Generation Sequencing Complete Report Catalogue # and Service: IR16001 rrna depletion (human, mouse, or rat) IR11081 Total RNA Sequencing (80 million reads, 2x75 bp PE) Xxxxxxx - xxxxxxxxxxxxxxxxxxxxxx
More informationBenchmarking of RNA-seq data processing pipelines using whole transcriptome qpcr expression data
Benchmarking of RNA-seq data processing pipelines using whole transcriptome qpcr expression data Jan Hellemans 7th international qpcr & NGS Event - Freising March 24 th, 2015 Therapeutics lncrna oncology
More informationRNA-Seq Workshop AChemS Sunil K Sukumaran Monell Chemical Senses Center Philadelphia
RNA-Seq Workshop AChemS 2017 Sunil K Sukumaran Monell Chemical Senses Center Philadelphia Benefits & downsides of RNA-Seq Benefits: High resolution, sensitivity and large dynamic range Independent of prior
More informationIntroduction of RNA-Seq Analysis
Introduction of RNA-Seq Analysis Jiang Li, MS Bioinformatics System Engineer I Center for Quantitative Sciences(CQS) Vanderbilt University September 21, 2012 Goal of this talk 1. Act as a practical resource
More informationDeep sequencing of transcriptomes
1 / 40 Deep sequencing of transcriptomes An introduction to RNA-seq Michael Dondrup UNI BCCS 2. november 2010 2 / 40 Transcriptomics by Ultra-Fast Sequencing Microarrays have been the primary transcriptomics
More informationTranscriptomics analysis with RNA seq: an overview Frederik Coppens
Transcriptomics analysis with RNA seq: an overview Frederik Coppens Platforms Applications Analysis Quantification RNA content Platforms Platforms Short (few hundred bases) Long reads (multiple kilobases)
More informationComputational & Quantitative Biology Lecture 6 RNA Sequencing
Peter A. Sims Dept. of Systems Biology Dept. of Biochemistry & Molecular Biophysics Sulzberger Columbia Genome Center October 27, 2014 Computational & Quantitative Biology Lecture 6 RNA Sequencing We Have
More informationRNA-Seq with the Tuxedo Suite
RNA-Seq with the Tuxedo Suite Monica Britton, Ph.D. Sr. Bioinformatics Analyst September 2015 Workshop The Basic Tuxedo Suite References Trapnell C, et al. 2009 TopHat: discovering splice junctions with
More informationRNA-Seq data analysis course September 7-9, 2015
RNA-Seq data analysis course September 7-9, 2015 Peter-Bram t Hoen (LUMC) Jan Oosting (LUMC) Celia van Gelder, Jacintha Valk (BioSB) Anita Remmelzwaal (LUMC) Expression profiling DNA mrna protein Comprehensive
More informationFinding Genes with Genomics Technologies
PLNT2530 Plant Biotechnology (2018) Unit 7 Finding Genes with Genomics Technologies Unless otherwise cited or referenced, all content of this presenataion is licensed under the Creative Commons License
More informationHigh performance sequencing and gene expression quantification
High performance sequencing and gene expression quantification Ana Conesa Genomics of Gene Expression Lab Centro de Investigaciones Príncipe Felipe Valencia aconesa@cipf.es Next Generation Sequencing NGS
More informationDeep Sequencing technologies
Deep Sequencing technologies Gabriela Salinas 30 October 2017 Transcriptome and Genome Analysis Laboratory http://www.uni-bc.gwdg.de/index.php?id=709 Microarray and Deep-Sequencing Core Facility University
More informationComputational methods for transcriptome annotation and quantification using RNA-seq
Computational methods for transcriptome annotation and quantification using RNA-seq Manuel Garber 1, Manfred G Grabherr 1, Mitchell Guttman 1,2 & Cole Trapnell 1,3 High-throughput RNA sequencing (RNA-seq)
More informationNGS Data Analysis and Galaxy
NGS Data Analysis and Galaxy University of Pretoria Pretoria, South Africa 14-18 October 2013 Dave Clements, Emory University http://galaxyproject.org/ Fourie Joubert, Burger van Jaarsveld Bioinformatics
More information02 Agenda Item 03 Agenda Item
01 Agenda Item 02 Agenda Item 03 Agenda Item SOLiD 3 System: Applications Overview April 12th, 2010 Jennifer Stover Field Application Specialist - SOLiD Applications Workflow for SOLiD Application Application
More informationTranscriptome Assembly, Functional Annotation (and a few other related thoughts)
Transcriptome Assembly, Functional Annotation (and a few other related thoughts) Monica Britton, Ph.D. Sr. Bioinformatics Analyst June 23, 2017 Differential Gene Expression Generalized Workflow File Types
More informationRNA-Seq de novo assembly training
RNA-Seq de novo assembly training Training session aims Give you some keys elements to look at during read quality check. Transcriptome assembly is not completely a strait forward process : Multiple strategies
More informationSupplement to: The Genomic Sequence of the Chinese Hamster Ovary (CHO)-K1 cell line
Supplement to: The Genomic Sequence of the Chinese Hamster Ovary (CHO)-K1 cell line Table of Contents SUPPLEMENTARY TEXT:... 2 FILTERING OF RAW READS PRIOR TO ASSEMBLY:... 2 COMPARATIVE ANALYSIS... 2 IMMUNOGENIC
More informationIntroduction to transcriptome analysis using High Throughput Sequencing technologies. D. Puthier 2012
Introduction to transcriptome analysis using High Throughput Sequencing technologies D. Puthier 2012 A typical RNA-Seq experiment Library construction Protocol variations Fragmentation methods RNA: nebulization,
More informationMassive Analysis of cdna Ends for simultaneous Genotyping and Transcription Profiling in High Throughput
Next Generation (Sequencing) Tools for Nucleotide-Based Information Massive Analysis of cdna Ends for simultaneous Genotyping and Transcription Profiling in High Throughput Björn Rotter, PhD GenXPro GmbH,
More informationIntegrated NGS Sample Preparation Solutions for Limiting Amounts of RNA and DNA. March 2, Steven R. Kain, Ph.D. ABRF 2013
Integrated NGS Sample Preparation Solutions for Limiting Amounts of RNA and DNA March 2, 2013 Steven R. Kain, Ph.D. ABRF 2013 NuGEN s Core Technologies Selective Sequence Priming Nucleic Acid Amplification
More informationExperimental Design. Dr. Matthew L. Settles. Genome Center University of California, Davis
Experimental Design Dr. Matthew L. Settles Genome Center University of California, Davis settles@ucdavis.edu What is Differential Expression Differential expression analysis means taking normalized sequencing
More informationBasics of RNA-Seq. (With a Focus on Application to Single Cell RNA-Seq) Michael Kelly, PhD Team Lead, NCI Single Cell Analysis Facility
2018 ABRF Meeting Satellite Workshop 4 Bridging the Gap: Isolation to Translation (Single Cell RNA-Seq) Sunday, April 22 Basics of RNA-Seq (With a Focus on Application to Single Cell RNA-Seq) Michael Kelly,
More informationMapping strategies for sequence reads
Mapping strategies for sequence reads Ernest Turro University of Cambridge 21 Oct 2013 Quantification A basic aim in genomics is working out the contents of a biological sample. 1. What distinct elements
More informationNEXT GENERATION SEQUENCING. Farhat Habib
NEXT GENERATION SEQUENCING HISTORY HISTORY Sanger Dominant for last ~30 years 1000bp longest read Based on primers so not good for repetitive or SNPs sites HISTORY Sanger Dominant for last ~30 years 1000bp
More informationRNA Seq: Methods and Applica6ons. Prat Thiru
RNA Seq: Methods and Applica6ons Prat Thiru 1 Outline Intro to RNA Seq Biological Ques6ons Comparison with Other Methods RNA Seq Protocol RNA Seq Applica6ons Annota6on Quan6fica6on Other Applica6ons Expression
More informationGenomic resources. for non-model systems
Genomic resources for non-model systems 1 Genomic resources Whole genome sequencing reference genome sequence comparisons across species identify signatures of natural selection population-level resequencing
More informationPost-assembly Data Analysis
Assembled transcriptome Post-assembly Data Analysis Quantification: get expression for each gene in each sample Genes differentially expressed between samples Clustering/network analysis Identifying over-represented
More informationDe novo assembly in RNA-seq analysis.
De novo assembly in RNA-seq analysis. Joachim Bargsten Wageningen UR/PRI/Plant Breeding October 2012 Motivation Transcriptome sequencing (RNA-seq) Gene expression / differential expression Reconstruct
More informationAnalysis of RNA-seq Data. Bernard Pereira
Analysis of RNA-seq Data Bernard Pereira The many faces of RNA-seq Applications Discovery Find new transcripts Find transcript boundaries Find splice junctions Comparison Given samples from different experimental
More informationTECH NOTE Pushing the Limit: A Complete Solution for Generating Stranded RNA Seq Libraries from Picogram Inputs of Total Mammalian RNA
TECH NOTE Pushing the Limit: A Complete Solution for Generating Stranded RNA Seq Libraries from Picogram Inputs of Total Mammalian RNA Stranded, Illumina ready library construction in
More informationSCALABLE, REPRODUCIBLE RNA-Seq
SCALABLE, REPRODUCIBLE RNA-Seq SCALABLE, REPRODUCIBLE RNA-Seq Advances in the RNA sequencing workflow, from sample preparation through data analysis, are enabling deeper and more accurate exploration
More informationSystematic evaluation of spliced alignment programs for RNA- seq data
Systematic evaluation of spliced alignment programs for RNA- seq data Pär G. Engström, Tamara Steijger, Botond Sipos, Gregory R. Grant, André Kahles, RGASP Consortium, Gunnar Rätsch, Nick Goldman, Tim
More informationNext Generation Sequencing. Tobias Österlund
Next Generation Sequencing Tobias Österlund tobiaso@chalmers.se NGS part of the course Week 4 Friday 13/2 15.15-17.00 NGS lecture 1: Introduction to NGS, alignment, assembly Week 6 Thursday 26/2 08.00-09.45
More informationAnalytics Behind Genomic Testing
A Quick Guide to the Analytics Behind Genomic Testing Elaine Gee, PhD Director, Bioinformatics ARUP Laboratories 1 Learning Objectives Catalogue various types of bioinformatics analyses that support clinical
More information10/06/2014. RNA-Seq analysis. With reference assembly. Cormier Alexandre, PhD student UMR8227, Algal Genetics Group
RNA-Seq analysis With reference assembly Cormier Alexandre, PhD student UMR8227, Algal Genetics Group Summary 2 Typical RNA-seq workflow Introduction Reference genome Reference transcriptome Reference
More informationCanadian Bioinforma3cs Workshops
Canadian Bioinforma3cs Workshops www.bioinforma3cs.ca Module #: Title of Module 2 1 Module 3 Expression and Differen3al Expression (lecture) Obi Griffith & Malachi Griffith www.obigriffith.org ogriffit@genome.wustl.edu
More informationLong and short/small RNA-seq data analysis
Long and short/small RNA-seq data analysis GEF5, 4.9.2015 Sami Heikkinen, PhD, Dos. Topics 1. RNA-seq in a nutshell 2. Long vs short/small RNA-seq 3. Bioinformatic analysis work flows GEF5 / Heikkinen
More informationDifferential gene expression analysis using RNA-seq
https://abc.med.cornell.edu/ Differential gene expression analysis using RNA-seq Applied Bioinformatics Core, August 2017 Friederike Dündar with Luce Skrabanek & Ceyda Durmaz Day 3 QC of aligned reads
More informationIntroduction to RNA-Seq
Introduction to RNA-Seq Monica Britton, Ph.D. Bioinformatics Analyst September 2014 Workshop Overview of Today s Activities Morning RNA-Seq Concepts, Terminology, and Work Flows Two-Condition Differential
More informationGalaxy Platform For NGS Data Analyses
Galaxy Platform For NGS Data Analyses Weihong Yan wyan@chem.ucla.edu Collaboratory Web Site http://qcb.ucla.edu/collaboratory http://collaboratory.lifesci.ucla.edu Workshop Outline ü Day 1 UCLA galaxy
More informationRNA-Seq. Joshua Ainsley, PhD Postdoctoral Researcher Lab of Leon Reijmers Neuroscience Department Tufts University
RNA-Seq Joshua Ainsley, PhD Postdoctoral Researcher Lab of Leon Reijmers Neuroscience Department Tufts University joshua.ainsley@tufts.edu Day five Alternative splicing Assembly RNA edits Alternative splicing
More informationChIP-seq analysis 2/28/2018
ChIP-seq analysis 2/28/2018 Acknowledgements Much of the content of this lecture is from: Furey (2012) ChIP-seq and beyond Park (2009) ChIP-seq advantages + challenges Landt et al. (2012) ChIP-seq guidelines
More informationCS-E5870 High-Throughput Bioinformatics RNA-seq analysis
CS-E5870 High-Throughput Bioinformatics RNA-seq analysis Harri Lähdesmäki Department of Computer Science Aalto University September 30, 2016 Acknowledgement for J Salojärvi and E Czeizler for the previous
More informationGene Expression Technology
Gene Expression Technology Bing Zhang Department of Biomedical Informatics Vanderbilt University bing.zhang@vanderbilt.edu Gene expression Gene expression is the process by which information from a gene
More informationMachine Learning Methods for RNA-seq-based Transcriptome Reconstruction
Machine Learning Methods for RNA-seq-based Transcriptome Reconstruction Gunnar Rätsch Friedrich Miescher Laboratory Max Planck Society, Tübingen, Germany NGS Bioinformatics Meeting, Paris (March 24, 2010)
More informationSO YOU WANT TO DO A: RNA-SEQ EXPERIMENT MATT SETTLES, PHD UNIVERSITY OF CALIFORNIA, DAVIS
SO YOU WANT TO DO A: RNA-SEQ EXPERIMENT MATT SETTLES, PHD UNIVERSITY OF CALIFORNIA, DAVIS SETTLES@UCDAVIS.EDU Bioinformatics Core Genome Center UC Davis BIOINFORMATICS.UCDAVIS.EDU DISCLAIMER This talk/workshop
More informationIntroduction to transcriptome analysis using High Throughput Sequencing technologies. D. Puthier 2012
Introduction to transcriptome analysis using High Throughput Sequencing technologies D. Puthier 2012 Transcriptome: the old school Cyanine 5 (Cy5) Cy-3: - Excitation 550nm - Emission 570nm Cy-5: - Excitation
More informationNon-conserved intronic motifs in human and mouse are associated with a conserved set of functions
Non-conserved intronic motifs in human and mouse are associated with a conserved set of functions Aristotelis Tsirigos Bioinformatics & Pattern Discovery Group IBM Research Outline. Discovery of DNA motifs
More informationRNAseq Applications in Genome Studies. Alexander Kanapin, PhD Wellcome Trust Centre for Human Genetics, University of Oxford
RNAseq Applications in Genome Studies Alexander Kanapin, PhD Wellcome Trust Centre for Human Genetics, University of Oxford RNAseq Protocols Next generation sequencing protocol cdna, not RNA sequencing
More informationRNA standards v May
Standards, Guidelines and Best Practices for RNA-Seq: 2010/2011 I. Introduction: Sequence based assays of transcriptomes (RNA-seq) are in wide use because of their favorable properties for quantification,
More informationSMARTer Ultra Low RNA Kit for Illumina Sequencing Two powerful technologies combine to enable sequencing with ultra-low levels of RNA
SMARTer Ultra Low RNA Kit for Illumina Sequencing Two powerful technologies combine to enable sequencing with ultra-low levels of RNA The most sensitive cdna synthesis technology, combined with next-generation
More informationHow to deal with your RNA-seq data?
How to deal with your RNA-seq data? Rachel Legendre, Thibault Dayris, Adrien Pain, Claire Toffano-Nioche, Hugo Varet École de bioinformatique AVIESAN-IFB 2017 1 Rachel Legendre Bioinformatics 27/11/2018
More informationRNA-Seq analysis using R: Differential expression and transcriptome assembly
RNA-Seq analysis using R: Differential expression and transcriptome assembly Beibei Chen Ph.D BICF 12/7/2016 Agenda Brief about RNA-seq and experiment design Gene oriented analysis Gene quantification
More informationRapid Transcriptome Characterization for a nonmodel organism using 454 pyrosequencing
Rapid Transcriptome Characterization for a nonmodel organism using 454 pyrosequencing "#$%&'()*+,"(-*."#$%&/.,"*01*0.,(%-*.&0("2*01*3,$,45,"-*4#66&*71** 3"#)(82,"-*2&9:)($*)1*"(03&"2-*#)66(*.(8$6#*;
More informationIntroduction to Bioinformatics and Gene Expression Technologies
Introduction to Bioinformatics and Gene Expression Technologies Utah State University Fall 2017 Statistical Bioinformatics (Biomedical Big Data) Notes 1 1 Vocabulary Gene: hereditary DNA sequence at a
More informationIntroduction to Bioinformatics and Gene Expression Technologies
Vocabulary Introduction to Bioinformatics and Gene Expression Technologies Utah State University Fall 2017 Statistical Bioinformatics (Biomedical Big Data) Notes 1 Gene: Genetics: Genome: Genomics: hereditary
More informationTargeted RNA sequencing reveals the deep complexity of the human transcriptome.
Targeted RNA sequencing reveals the deep complexity of the human transcriptome. Tim R. Mercer 1, Daniel J. Gerhardt 2, Marcel E. Dinger 1, Joanna Crawford 1, Cole Trapnell 3, Jeffrey A. Jeddeloh 2,4, John
More informationRNA-Seq analysis workshop
RNA-Seq analysis workshop Zhangjun Fei Boyce Thompson Institute for Plant Research USDA Robert W. Holley Center for Agriculture and Health Cornell University Outline Background of RNA-Seq Application of
More informationGeneScissors: a comprehensive approach to detecting and correcting spurious transcriptome inference owing to RNA-seq reads misalignment
GeneScissors: a comprehensive approach to detecting and correcting spurious transcriptome inference owing to RNA-seq reads misalignment Zhaojun Zhang, Shunping Huang, Jack Wang, Xiang Zhang, Fernando Pardo
More informationGENETICS - CLUTCH CH.15 GENOMES AND GENOMICS.
!! www.clutchprep.com CONCEPT: OVERVIEW OF GENOMICS Genomics is the study of genomes in their entirety Bioinformatics is the analysis of the information content of genomes - Genes, regulatory sequences,
More informationCourse Presentation. Ignacio Medina Presentation
Course Index Introduction Agenda Analysis pipeline Some considerations Introduction Who we are Teachers: Marta Bleda: Computational Biologist and Data Analyst at Department of Medicine, Addenbrooke's Hospital
More information