RNA-Seq Software, Tools, and Workflows
|
|
- Elfrieda Hines
- 6 years ago
- Views:
Transcription
1 RNA-Seq Software, Tools, and Workflows Monica Britton, Ph.D. Sr. Bioinformatics Analyst September 1, 2016
2 Some mrna-seq Applications Differential gene expression analysis Transcriptional profiling Assumption: Changes in transcription/mrna levels correlate with phenotype (protein expression) Identification of splice variants Novel gene identification Transcriptome assembly SNP finding RNA editing
3 Experimental Design What biological question am I trying to answer? What types of samples (tissue, timepoints, etc.)? How much sequence do I need? Length of read? Platform? Single-end or paired-end? Barcoding? Pooling? Biological replicates: how many? Technical replicates: how many? Protocol considerations?
4 Strand-Specific (Directional) RNA-Seq Now the default for Illumina TruSeq kits Preserves orientation of RNA after reverse transcription to cdna Informs alignments to genome Determine which genomic DNA strand is transcribed Identify anti-sense transcription (e.g., lncrnas) Quantify expression levels more precisely Demarcate coding sequences in microbes with overlapping genes Very useful in transcriptome assemblies Allows precise construction of sense and anti-sense transcripts
5 Influence of the Organism Novel little/no previous sequencing (may need assembly) Non-Model some sequence available (draft genome or transcriptome assembly) Thousands of scaffolds, maybe tens of chromosomes Some annotation (ab initio, EST-based, etc.) Model genome fully sequenced and annotated Multiple genomes available for comparison Well-annotated transcriptome based on experimental evidence Genetic maps with markers available Basic research can be conducted to verify annotations (mutants available)
6 Differential Gene Expression Generalized Workflow
7 Read QC and Trimming Scythe Sickle Stringency of trimming depends on read length and downstream processing. Alignment can often tolerate some amount of contamination. Various sizes of trimmed reads can introduce biases in alignments. Transcriptome assembly works best with aggressive trimming.
8 QC and Alignment You may want to align your reads to a few different datasets, including rrna of your organism (or related organisms) Contaminant screening, if desirable Genome or transcriptome to generate counts for differential expression analysis First we ll go through alignment to a gene set/transcriptome, then we ll look at aligning to a genome
9 Choosing an Aligner for a Gene Set Reference Examples of gene sets: rrna sequences potential contaminant sequences (such as PhiX), a transcriptome (eukaryotic or bacterial), where each sequence in the fasta represents a different transcript (mrna) De novo transcriptome assembly A splicing-aware aligner is not needed bwa aln (<76 bp), bwa mem (>75 bp), or bowtie2 Requiring strand-specific orientation can be more difficult (requires filtering the bam file).
10 Library QC rrna Contamination mrna rrna rrna rrna mrna mrna Total RNA sample After PolyA isolation After rrna depletion Total RNA contains >95% rrna Poly A selected libraries should contain <5% rrna rrna-depleted libraries should contain <15% rrna Align reads to rrna sequences from organism or relatives Higher than expected or inconsistent rrna content may indicate degraded RNA or problems in library prep. Generally, don t need to remove rrna reads
11 Insert Size (determined from gene set alignments) Adapter ~60 bases cdna bases =Insert Size Adapter ~60 bases The insert is the cdna (or RNA) ligated between the adapters. (The SAM spec calls this the template.) Typical insert size is bases, but can be larger. Insert size distribution depends on library prep method. Overlapping paired-end reads Gapped paired-end reads Single-end read
12 Paired-End Reads Read INSERT SIZE (TLEN) Read 2 Read 1 INNER DISTANCE (+ or -) Read 2
13 Generating Raw Counts from Gene Set Alignments The sam/bam file can be filtered for just those pairs where both reads align in concordantly (proper pairs) to one transcript. sam2counts or samtools idxstats can be used for counting. Counts per transcript need to be added up to counts per gene before running stats software
14 Alignment to a Genome - Choosing a Reference Human/mouse: GENCODE (uses Ensembl IDs) ( but may need some manipulation to work with certain software Ensembl genomes ( and Biomart ( Illumina igenomes ( sequencing_software/igenome.html) provides indexes for some software, and files with extra info for Tophat/cufflinks. NCBI genomes ( Many specialized databases (Phytozome, Patric, VectorBase, FlyBase, WormBase) Do it yourself genome assembly and gene-finding (don t forget functional annotation)
15 Choosing an Aligner for a Genome Reference Eukaryotic genes are separated into exons and introns, so if you are using a genome reference, you need an aligner that is either splicing-aware or won t choke on spliced RNA reads. A splicing-aware aligner will recognize the difference between a short insert and a read that aligns across exon-intron boundaries Spliced Reads (joined by dashed line) Read Pairs (joined by solid line)
16 (Some) Splicing-Aware or Splicing-Agnostic Aligners Tophat2 (bowtie2) what we ll be using in these exercises. (remember, igenomes); also integrates with cufflinks to find novel genes and transcripts HISAT/HISAT2 successor to tophat2, and also works for alignment of populations (i.e., multiple human samples). I haven t tried this yet. It s on our Galaxy try it out and report what happens. GSNAP I haven t used this; can handle (and produce) non-standard file formats. STAR this is what I use. It s very fast and doesn t need a separate counting step. But it does need a lot of memory (>128G RAM to index a mammalian genome). bwa mem a local aligner, is not splicing-aware, but can handle local alignments (doesn t require the entire read to align).
17 Generating Raw Counts from Genome Alignments Reads aligning to a chromosome or scaffold must be parsed to assign them to a gene. We ll need a roadmap to indicate where each gene (exon) is located in the genome
18 Generating Raw Counts from Genome Alignments This roadmap is the GTF (Gene Transfer Format) file: The left columns list source, feature type, and genomic coordinates The right column includes attributes, including gene ID, etc. Let s look at the GTF fields in more detail
19 Fields in the GTF File (left to right) Sequence Name (i.e., chromosome, scaffold, etc.) chr12 Source (program that generated the gtf file or feature) unknown Feature (i.e., gene, exon, CDS, start codon, stop codon) CDS Start (starting location on sequence) End (end position on sequence) Score. Strand (+ or -) + Frame (0, 1, or 2: which is first base in codon, zero-based) 2 Attribute ( ; -delimited list of tags with additional info) gene_id "PRMT8"; gene_name "PRMT8"; p_id "P10933"; transcript_id "NM_019854"; tss_id "TSS4368";
20 An Unusual GTF File
21 Generating Gene Counts from Genome Alignments HTSeq-count runs rather slowly; doesn t work well on coordinate-sorted bam files. FeatureCounts Runs faster, and will work on coordinate-sorted bam files. Has more parameters than HTSeq-count. STAR Will generate counts equivalent to HTSeq-count with default parameters, in the same run as alignment. All of these can utilize strand-specific information in GTF file to assign reads from strand-specific libraries. All will also report reads that can t be assigned unambiguously to a gene feature.
22 Multiple-Mapping Reads ( Multireads ) Some reads will align to more than one place in the reference, due to: Common domains, gene families Paralogs, pseudogenes, polyploidy etc. This can distort counts, leading to misleading expression levels If a read can t be uniquely mapped, how should it be counted? Should it be ignored (not counted at all)? Should it be randomly assigned to one location among all the locations to which it aligns equally well? This may depend on the question you re asking and also depends on the alignment and counting software you use.
23 Reads That Align to Gene Features From: This read unambiguously aligns to Gene A This read also uniquely aligns to Gene A This read aligns uniquely, but could be unspliced This spliced read uniquely aligns to Gene A Genes A and B overlap, but the read is uniquely within Gene A Genes A and B overlap, the read is within Gene A, but overlaps Gene B Genes A and B overlap, and the read ambiguously maps to both genes
24 Read Count Definitions When using tools such as HTSeq-count, FeatureCounts, and STAR, the output table will include counts of reads that can be assigned to each gene and some additional lines for reads that can t be assigned, including: Ambiguous read overlaps more than one gene Multiple Mapping read maps equally well to more than one place in the genome No Feature read maps uniquely to genome, but in an intergenic (or intronic) region Unmapped read cannot be mapped to genome (based on run parameters)
25 Examples From Two Recent Experiments Genome Human (hg38) Mouse (mm10) Annotation GENCODE GENCODE Library Prep rrna depletion polya selection rrna 8.8% 5.6% Aligned to Genome 98.6% 98.1% Assigned to Genes (exons) 51.8% 84.1% Ambiguous 2.2% 1.3% Multimapping 12.9% 6.2% No Feature 31.6% 6.5% Unmapped 1.4% 1.9%
26 How Well Did Your Reads Align to the Reference? % Reads Mapped 100 Calculate percentage of reads mapped per sample Great! Good Incomplete Reference? Sample Contamination?
27 Checking Your Results Key genes that may confirm sample ID Knock-out or knock-down genes Genes identified in previous research Specific genes of interest Hypothesis testing Important pathways Experimental validation (e.g., qrt-pcr) Ideally, validation should be conducted on a different set of samples
28 Differential Gene Expression Generalized Workflow Bioinformatics analyses are in silico experiments The tools and parameters you choose will be influenced by factors including: Available reference/annotation Experimental design (e.g., pairwise vs. multi-factor) The right tools are the ones that best inform on your experiment Don t just shop for methods that give you the answer you want
29 Differential Gene Expression Generalized Workflow File Types Fastq Fasta (reference) gtf (gene features) sam/bam Text tables
30 Today s Exercises Differential Gene Expression Today, we ll analyze a few types of data you ll typically encounter: 1. Single-end reads ( tag counting with well-annotated genomes) 2. Paired-end reads (can also be used to discover novel transcripts in a genome with incomplete annotation) 3. Paired-end reads for differential gene expression when only a transcriptome is available (such as after a transcriptome assembly)
31 Today s Exercises Differential Gene Expression And we ll be using a few different software: 1. Tophat to align spliced reads to a genome 2. FeatureCounts to generate raw counts tables for 3. edger, for differential expression of genes 4. Cufflinks to find novel genes and transcripts 5. Bowtie2 to align reads to a transcriptome reference 6. sam2counts to extract raw counts from bowtie2 alignments
32 Gene Construction
33 Alignment and Differential Expression Read set(s) TopHat bam file(s) We will follow these steps in the first exercise. Existing annotation (GTF) Toptables, etc. edger/limma FeatureCounts
34 But, do we have all the genes? For organisms with genomes, gene models are stored in gtf files Assumptions: The gtf file contains annotation for ALL transcripts and genes All splice sites, start/stop codons, etc. are correct Are these assumptions correct for every sequenced organism? RNA-Seq reads can be used to independently construct genes and splice variants using limited or no annotation Method used depends on how much sequence information there is for the organism
35 Gene Construction (Alignment) vs. Assembly Genome- Sequenced Organisms Trinity software Novel or Non-Model Organisms Haas and Zody (2010) Nat. Biotech. 28:421-3
36 Gene / Transcriptome Construction Annotation can be improved even for well-annotated model organisms Identify all expressed exons Combine expressed exons into genes Find all splice variants for a gene Discover novel transcripts For newly sequenced organisms Validate ab initio annotation Comparison between different annotation sets Can assist in finding some types of contamination Reconstruction of rrna genes Genomic/mitochondrial DNA in RNA library preps.
37 Reference Annotation Based Transcript (RABT) Assembly Read set(s) TopHat bam file(s) Existing annotation (GTF) [optional] Cufflinks Cuffmerge Cuffcompare Read-set specific GTF(s) Merged GTF Final assembly (GTF and stats) Toptables, etc. edger/limma FeatureCounts
38 A Few Words About Bacterial RNA-Seq
39 Eukaryotic and Bacterial Gene Structures are Different Eukaryotes Gene structure includes introns and exons Splicing, poly-adenylation Each mrna is a discrete molecule when translated Bacteria / Prokaryotes Individual genes and groups of genes in operons Generally, no splicing, no polya One mrna can contain coding sequences for multiple proteins
40 Bacterial RNA-Seq Considerations rrna depletion strategies may leave considerable amounts of non-coding RNA molecules Splicing-aware aligners (such as Tophat) may not be useful Reads from polycistronic mrna may overlap two genes How would HTSeq-Count or FeatureCounts handle this? Compare alignments to the genome to alignments to transcriptome. Some aligners, such as bwa-mem, will report secondary alignments Transcriptome alignments can be used to generate counts table for edger Specialized software, such as Rockhopper (stand-alone,
Introduction to RNA-Seq
Introduction to RNA-Seq Monica Britton, Ph.D. Sr. Bioinformatics Analyst March 2015 Workshop Overview of RNA-Seq Activities RNA-Seq Concepts, Terminology, and Work Flows Using Single-End Reads and a Reference
More informationIntroduction to RNA-Seq
Introduction to RNA-Seq Monica Britton, Ph.D. Bioinformatics Analyst September 2014 Workshop Overview of Today s Activities Morning RNA-Seq Concepts, Terminology, and Work Flows Two-Condition Differential
More informationRNA-Seq with the Tuxedo Suite
RNA-Seq with the Tuxedo Suite Monica Britton, Ph.D. Sr. Bioinformatics Analyst September 2015 Workshop The Basic Tuxedo Suite References Trapnell C, et al. 2009 TopHat: discovering splice junctions with
More informationTranscriptome Assembly, Functional Annotation (and a few other related thoughts)
Transcriptome Assembly, Functional Annotation (and a few other related thoughts) Monica Britton, Ph.D. Sr. Bioinformatics Analyst June 23, 2017 Differential Gene Expression Generalized Workflow File Types
More information10/06/2014. RNA-Seq analysis. With reference assembly. Cormier Alexandre, PhD student UMR8227, Algal Genetics Group
RNA-Seq analysis With reference assembly Cormier Alexandre, PhD student UMR8227, Algal Genetics Group Summary 2 Typical RNA-seq workflow Introduction Reference genome Reference transcriptome Reference
More informationQuantifying gene expression
Quantifying gene expression Genome GTF (annotation)? Sequence reads FASTQ FASTQ (+reference transcriptome index) Quality control FASTQ Alignment to Genome: HISAT2, STAR (+reference genome index) (known
More informationRNA-seq Data Analysis
Lecture 3. Clustering; Function/Pathway Enrichment analysis RNA-seq Data Analysis Qi Sun Bioinformatics Facility Biotechnology Resource Center Cornell University Lecture 1. Map RNA-seq read to genome Lecture
More informationTranscriptomics analysis with RNA seq: an overview Frederik Coppens
Transcriptomics analysis with RNA seq: an overview Frederik Coppens Platforms Applications Analysis Quantification RNA content Platforms Platforms Short (few hundred bases) Long reads (multiple kilobases)
More informationHow to deal with your RNA-seq data?
How to deal with your RNA-seq data? Rachel Legendre, Thibault Dayris, Adrien Pain, Claire Toffano-Nioche, Hugo Varet École de bioinformatique AVIESAN-IFB 2017 1 Rachel Legendre Bioinformatics 27/11/2018
More informationRNA-Seq Analysis. Simon Andrews, Laura v
RNA-Seq Analysis Simon Andrews, Laura Biggins simon.andrews@babraham.ac.uk @simon_andrews v2018-10 RNA-Seq Libraries rrna depleted mrna Fragment u u u u NNNN Random prime + RT 2 nd strand synthesis (+
More informationGalaxy for Next Generation Sequencing 初探次世代序列分析平台 蘇聖堯 2013/9/12
Galaxy for Next Generation Sequencing 初探次世代序列分析平台 蘇聖堯 2013/9/12 What s Galaxy? Bringing Developers And Biologists Together. Reproducible Science Is Our Goal An open, web-based platform for data intensive
More informationTranscriptome analysis
Statistical Bioinformatics: Transcriptome analysis Stefan Seemann seemann@rth.dk University of Copenhagen April 11th 2018 Outline: a) How to assess the quality of sequencing reads? b) How to normalize
More informationExperimental Design. Sequencing. Data Quality Control. Read mapping. Differential Expression analysis
-Seq Analysis Quality Control checks Reproducibility Reliability -seq vs Microarray Higher sensitivity and dynamic range Lower technical variation Available for all species Novel transcript identification
More informationApplications of short-read
Applications of short-read sequencing: RNA-Seq and ChIP-Seq BaRC Hot Topics March 2013 George Bell, Ph.D. http://jura.wi.mit.edu/bio/education/hot_topics/ Sequencing applications RNA-Seq includes experiments
More informationIntroduction to RNA-Seq. David Wood Winter School in Mathematics and Computational Biology July 1, 2013
Introduction to RNA-Seq David Wood Winter School in Mathematics and Computational Biology July 1, 2013 Abundance RNA is... Diverse Dynamic Central DNA rrna Epigenetics trna RNA mrna Time Protein Abundance
More informationNGS Data Analysis and Galaxy
NGS Data Analysis and Galaxy University of Pretoria Pretoria, South Africa 14-18 October 2013 Dave Clements, Emory University http://galaxyproject.org/ Fourie Joubert, Burger van Jaarsveld Bioinformatics
More informationSequencing applications. Today's outline. Hands-on exercises. Applications of short-read sequencing: RNA-Seq and ChIP-Seq
Sequencing applications Applications of short-read sequencing: RNA-Seq and ChIP-Seq BaRC Hot Topics March 2013 George Bell, Ph.D. http://jura.wi.mit.edu/bio/education/hot_topics/ RNA-Seq includes experiments
More informationRNA-Seq Module 2 From QC to differential gene expression.
RNA-Seq Module 2 From QC to differential gene expression. Ying Zhang Ph.D, Informatics Analyst Research Informatics Support System (RISS) MSI Apr. 24, 2012 RNA-Seq Tutorials Tutorial 1: Introductory (Mar.
More informationRNA-Seq Workshop AChemS Sunil K Sukumaran Monell Chemical Senses Center Philadelphia
RNA-Seq Workshop AChemS 2017 Sunil K Sukumaran Monell Chemical Senses Center Philadelphia Benefits & downsides of RNA-Seq Benefits: High resolution, sensitivity and large dynamic range Independent of prior
More informationAnalysis of RNA-seq Data. Feb 8, 2017 Peikai CHEN (PHD)
Analysis of RNA-seq Data Feb 8, 2017 Peikai CHEN (PHD) Outline What is RNA-seq? What can RNA-seq do? How is RNA-seq measured? How to process RNA-seq data: the basics How to visualize and diagnose your
More informationSanger vs Next-Gen Sequencing
Tools and Algorithms in Bioinformatics GCBA815/MCGB815/BMI815, Fall 2017 Week-8: Next-Gen Sequencing RNA-seq Data Analysis Babu Guda, Ph.D. Professor, Genetics, Cell Biology & Anatomy Director, Bioinformatics
More informationSequence Analysis 2RNA-Seq
Sequence Analysis 2RNA-Seq Lecture 10 2/21/2018 Instructor : Kritika Karri kkarri@bu.edu Transcriptome Entire set of RNA transcripts in a given cell for a specific developmental stage or physiological
More informationIntroduction to RNA-Seq in GeneSpring NGS Software
Introduction to RNA-Seq in GeneSpring NGS Software Dipa Roy Choudhury, Ph.D. Strand Scientific Intelligence and Agilent Technologies Learn more at www.genespring.com Introduction to RNA-Seq In a few years,
More informationEnsembl workshop. Thomas Randall, PhD bioinformatics.unc.edu. handouts, papers, datasets
Ensembl workshop Thomas Randall, PhD tarandal@email.unc.edu bioinformatics.unc.edu www.unc.edu/~tarandal/ensembl handouts, papers, datasets Ensembl is a joint project between EMBL - EBI and the Sanger
More informationBasics of RNA-Seq. (With a Focus on Application to Single Cell RNA-Seq) Michael Kelly, PhD Team Lead, NCI Single Cell Analysis Facility
2018 ABRF Meeting Satellite Workshop 4 Bridging the Gap: Isolation to Translation (Single Cell RNA-Seq) Sunday, April 22 Basics of RNA-Seq (With a Focus on Application to Single Cell RNA-Seq) Michael Kelly,
More informationSO YOU WANT TO DO A: RNA-SEQ EXPERIMENT MATT SETTLES, PHD UNIVERSITY OF CALIFORNIA, DAVIS
SO YOU WANT TO DO A: RNA-SEQ EXPERIMENT MATT SETTLES, PHD UNIVERSITY OF CALIFORNIA, DAVIS SETTLES@UCDAVIS.EDU Bioinformatics Core Genome Center UC Davis BIOINFORMATICS.UCDAVIS.EDU DISCLAIMER This talk/workshop
More informationDeep Sequencing technologies
Deep Sequencing technologies Gabriela Salinas 30 October 2017 Transcriptome and Genome Analysis Laboratory http://www.uni-bc.gwdg.de/index.php?id=709 Microarray and Deep-Sequencing Core Facility University
More informationIntroduction to RNAseq Analysis. Milena Kraus Apr 18, 2016
Introduction to RNAseq Analysis Milena Kraus Apr 18, 2016 Agenda What is RNA sequencing used for? 1. Biological background 2. From wet lab sample to transcriptome a. Experimental procedure b. Raw data
More informationRNAseq Applications in Genome Studies. Alexander Kanapin, PhD Wellcome Trust Centre for Human Genetics, University of Oxford
RNAseq Applications in Genome Studies Alexander Kanapin, PhD Wellcome Trust Centre for Human Genetics, University of Oxford RNAseq Protocols Next generation sequencing protocol cdna, not RNA sequencing
More informationRNAseq Differential Gene Expression Analysis Report
RNAseq Differential Gene Expression Analysis Report Customer Name: Institute/Company: Project: NGS Data: Bioinformatics Service: IlluminaHiSeq2500 2x126bp PE Differential gene expression analysis Sample
More informationIntroduction to RNA sequencing
Introduction to RNA sequencing Bioinformatics perspective Olga Dethlefsen NBIS, National Bioinformatics Infrastructure Sweden November 2017 Olga (NBIS) RNA-seq November 2017 1 / 49 Outline Why sequence
More informationRNA-Sequencing analysis
RNA-Sequencing analysis Markus Kreuz 25. 04. 2012 Institut für Medizinische Informatik, Statistik und Epidemiologie Content: Biological background Overview transcriptomics RNA-Seq RNA-Seq technology Challenges
More informationGenome annotation & EST
Genome annotation & EST What is genome annotation? The process of taking the raw DNA sequence produced by the genome sequence projects and adding the layers of analysis and interpretation necessary
More informationVM origin. Okeanos: Image Trinity_U16 (upgrade to Ubuntu16.04, thanks to Alexandros Dimopoulos) X2go: LXDE
VM origin Okeanos: Image Trinity_U16 (upgrade to Ubuntu16.04, thanks to Alexandros Dimopoulos) X2go: LXDE NGS intro + Genome-Based Transcript Reconstruction and Analysis Using RNA-Seq Data Based on material
More informationTotal RNA isola-on End Repair of double- stranded cdna
Total RNA isola-on End Repair of double- stranded cdna mrna Isola8on using Oligo(dT) Magne8c Beads AAAAAAA A Adenyla8on (A- Tailing) A AAAAAAAAAAAA TTTTTTTTT AAAAAAA TTTTTTTTT TTTTTTTT TTTTTTTTT AAAAAAAA
More informationDifferential gene expression analysis using RNA-seq
https://abc.med.cornell.edu/ Differential gene expression analysis using RNA-seq Applied Bioinformatics Core, August 2017 Friederike Dündar with Luce Skrabanek & Ceyda Durmaz Day 3 QC of aligned reads
More informationRNA-sequencing. Next Generation sequencing analysis Anne-Mette Bjerregaard. Center for biological sequence analysis (CBS)
RNA-sequencing Next Generation sequencing analysis 2016 Anne-Mette Bjerregaard Center for biological sequence analysis (CBS) Terms and definitions TRANSCRIPTOME The full set of RNA transcripts and their
More informationLong and short/small RNA-seq data analysis
Long and short/small RNA-seq data analysis GEF5, 4.9.2015 Sami Heikkinen, PhD, Dos. Topics 1. RNA-seq in a nutshell 2. Long vs short/small RNA-seq 3. Bioinformatic analysis work flows GEF5 / Heikkinen
More informationAnalysis of data from high-throughput molecular biology experiments Lecture 6 (F6, RNA-seq ),
Analysis of data from high-throughput molecular biology experiments Lecture 6 (F6, RNA-seq ), 2012-01-26 What is a gene What is a transcriptome History of gene expression assessment RNA-seq RNA-seq analysis
More informationExperimental Design. Dr. Matthew L. Settles. Genome Center University of California, Davis
Experimental Design Dr. Matthew L. Settles Genome Center University of California, Davis settles@ucdavis.edu What is Differential Expression Differential expression analysis means taking normalized sequencing
More informationThe Ensembl Database. Dott.ssa Inga Prokopenko. Corso di Genomica
The Ensembl Database Dott.ssa Inga Prokopenko Corso di Genomica 1 www.ensembl.org Lecture 7.1 2 What is Ensembl? Public annotation of mammalian and other genomes Open source software Relational database
More informationAnalysis of RNA-seq Data. Bernard Pereira
Analysis of RNA-seq Data Bernard Pereira The many faces of RNA-seq Applications Discovery Find new transcripts Find transcript boundaries Find splice junctions Comparison Given samples from different experimental
More informationShuji Shigenobu. April 3, 2013 Illumina Webinar Series
Shuji Shigenobu April 3, 2013 Illumina Webinar Series RNA-seq RNA-seq is a revolutionary tool for transcriptomics using deepsequencing technologies. genome HiSeq2000@NIBB (Wang 2009 with modifications)
More informationUAB DNA-Seq Analysis Workshop. John Osborne Research Associate Centers for Clinical and Translational Science
+ UAB DNA-Seq Analysis Workshop John Osborne Research Associate Centers for Clinical and Translational Science ozborn@uab.,edu + Thanks in advance You are the Guinea pigs for this workshop! At this point
More informationChIP-seq and RNA-seq
ChIP-seq and RNA-seq Biological Goals Learn how genomes encode the diverse patterns of gene expression that define each cell type and state. Protein-DNA interactions (ChIPchromatin immunoprecipitation)
More informationChIP-seq and RNA-seq. Farhat Habib
ChIP-seq and RNA-seq Farhat Habib fhabib@iiserpune.ac.in Biological Goals Learn how genomes encode the diverse patterns of gene expression that define each cell type and state. Protein-DNA interactions
More informationDeep sequencing of transcriptomes
1 / 40 Deep sequencing of transcriptomes An introduction to RNA-seq Michael Dondrup UNI BCCS 2. november 2010 2 / 40 Transcriptomics by Ultra-Fast Sequencing Microarrays have been the primary transcriptomics
More informationmeasuring gene expression December 5, 2017
measuring gene expression December 5, 2017 transcription a usually short-lived RNA copy of the DNA is created through transcription RNA is exported to the cytoplasm to encode proteins some types of RNA
More informationGene Expression analysis with RNA-Seq data
Gene Expression analysis with RNA-Seq data C3BI Hands-on NGS course November 24th 2016 Frédéric Lemoine Plan 1. 2. Quality Control 3. Read Mapping 4. Gene Expression Analysis 5. Splicing/Transcript Analysis
More informationBenchmarking of RNA-seq data processing pipelines using whole transcriptome qpcr expression data
Benchmarking of RNA-seq data processing pipelines using whole transcriptome qpcr expression data Jan Hellemans 7th international qpcr & NGS Event - Freising March 24 th, 2015 Therapeutics lncrna oncology
More informationIntroduction of RNA-Seq Analysis
Introduction of RNA-Seq Analysis Jiang Li, MS Bioinformatics System Engineer I Center for Quantitative Sciences(CQS) Vanderbilt University September 21, 2012 Goal of this talk 1. Act as a practical resource
More informationBioinformatics Monthly Workshop Series. Speaker: Fan Gao, Ph.D Bioinformatics Resource Office The Picower Institute for Learning and Memory
Bioinformatics Monthly Workshop Series Speaker: Fan Gao, Ph.D Bioinformatics Resource Office The Picower Institute for Learning and Memory Schedule for Fall, 2015 PILM Bioinformatics Web Server (09/21/2015)
More informationNext Generation Sequencing
Next Generation Sequencing Complete Report Catalogue # and Service: IR16001 rrna depletion (human, mouse, or rat) IR11081 Total RNA Sequencing (80 million reads, 2x75 bp PE) Xxxxxxx - xxxxxxxxxxxxxxxxxxxxxx
More informationGuidelines Analysis of RNA Quantity and Quality for Next-Generation Sequencing Projects
Title: Protocol number: Guidelines Analysis of RNA Quantity and Quality for Next-Generation Sequencing Projects GAF S002 Version: Version 4 Date: December 17 th 2015 Author: P. van der Vlies, C.C. van
More informationRNA-Seq data analysis course September 7-9, 2015
RNA-Seq data analysis course September 7-9, 2015 Peter-Bram t Hoen (LUMC) Jan Oosting (LUMC) Celia van Gelder, Jacintha Valk (BioSB) Anita Remmelzwaal (LUMC) Expression profiling DNA mrna protein Comprehensive
More informationCBC Data Therapy. Metatranscriptomics Discussion
CBC Data Therapy Metatranscriptomics Discussion Metatranscriptomics Extract RNA, subtract rrna Sequence cdna QC Gene expression, function Institute for Systems Genomics: Computational Biology Core bioinformatics.uconn.edu
More informationIntroduction to NGS analyses
Introduction to NGS analyses Giorgio L Papadopoulos Institute of Molecular Biology and Biotechnology Bioinformatics Support Group 04/12/2015 Papadopoulos GL (IMBB, FORTH) IMBB NGS Seminar 04/12/2015 1
More informationMODULE 5: TRANSLATION
MODULE 5: TRANSLATION Lesson Plan: CARINA ENDRES HOWELL, LEOCADIA PALIULIS Title Translation Objectives Determine the codons for specific amino acids and identify reading frames by looking at the Base
More informationStatistical Genomics and Bioinformatics Workshop. Genetic Association and RNA-Seq Studies
Statistical Genomics and Bioinformatics Workshop: Genetic Association and RNA-Seq Studies RNA Seq and Differential Expression Analysis Brooke L. Fridley, PhD University of Kansas Medical Center 1 Next-generation
More informationUsing the Genome Browser: A Practical Guide. Travis Saari
Using the Genome Browser: A Practical Guide Travis Saari What is it for? Problem: Bioinformatics programs produce an overwhelming amount of data Difficult to understand anything from the raw data Data
More informationDifferential gene expression analysis using RNA-seq
https://abc.med.cornell.edu/ Differential gene expression analysis using RNA-seq Applied Bioinformatics Core, March 2018 Friederike Dündar with Luce Skrabanek & Paul Zumbo Day 1: Introduction into high-throughput
More informationRNA-Seq de novo assembly training
RNA-Seq de novo assembly training Training session aims Give you some keys elements to look at during read quality check. Transcriptome assembly is not completely a strait forward process : Multiple strategies
More information02 Agenda Item 03 Agenda Item
01 Agenda Item 02 Agenda Item 03 Agenda Item SOLiD 3 System: Applications Overview April 12th, 2010 Jennifer Stover Field Application Specialist - SOLiD Applications Workflow for SOLiD Application Application
More informationCollect, analyze and synthesize. Annotation. Annotation for D. virilis. GEP goals: Evidence Based Annotation. Evidence for Gene Models 12/26/2018
Annotation Annotation for D. virilis Chris Shaffer July 2012 l Big Picture of annotation and then one practical example l This technique may not be the best with other projects (e.g. corn, bacteria) l
More informationTECH NOTE Pushing the Limit: A Complete Solution for Generating Stranded RNA Seq Libraries from Picogram Inputs of Total Mammalian RNA
TECH NOTE Pushing the Limit: A Complete Solution for Generating Stranded RNA Seq Libraries from Picogram Inputs of Total Mammalian RNA Stranded, Illumina ready library construction in
More informationC3BI. VARIANTS CALLING November Pierre Lechat Stéphane Descorps-Declère
C3BI VARIANTS CALLING November 2016 Pierre Lechat Stéphane Descorps-Declère General Workflow (GATK) software websites software bwa picard samtools GATK IGV tablet vcftools website http://bio-bwa.sourceforge.net/
More informationRNA-Seq analysis workshop
RNA-Seq analysis workshop Zhangjun Fei Boyce Thompson Institute for Plant Research USDA Robert W. Holley Center for Agriculture and Health Cornell University Outline Background of RNA-Seq Application of
More informationRNA Seq: Methods and Applica6ons. Prat Thiru
RNA Seq: Methods and Applica6ons Prat Thiru 1 Outline Intro to RNA Seq Biological Ques6ons Comparison with Other Methods RNA Seq Protocol RNA Seq Applica6ons Annota6on Quan6fica6on Other Applica6ons Expression
More informationAnalysis of RNA-seq Data
Analysis of RNA-seq Data A physicist and an engineer are in a hot-air balloon. Soon, they find themselves lost in a canyon somewhere. They yell out for help: "Helllloooooo! Where are we?" 15 minutes later,
More informationNovel methods for RNA and DNA- Seq analysis using SMART Technology. Andrew Farmer, D. Phil. Vice President, R&D Clontech Laboratories, Inc.
Novel methods for RNA and DNA- Seq analysis using SMART Technology Andrew Farmer, D. Phil. Vice President, R&D Clontech Laboratories, Inc. Agenda Enabling Single Cell RNA-Seq using SMART Technology SMART
More informationCollect, analyze and synthesize. Annotation. Annotation for D. virilis. Evidence Based Annotation. GEP goals: Evidence for Gene Models 08/22/2017
Annotation Annotation for D. virilis Chris Shaffer July 2012 l Big Picture of annotation and then one practical example l This technique may not be the best with other projects (e.g. corn, bacteria) l
More informationRNA-SEQUENCING ANALYSIS
RNA-SEQUENCING ANALYSIS Joseph Powell SISG- 2018 CONTENTS Introduction to RNA sequencing Data structure Analyses Transcript counting Alternative splicing Allele specific expression Discovery APPLICATIONS
More informationWorkflows and Pipelines for NGS analysis: Lessons from proteomics
Workflows and Pipelines for NGS analysis: Lessons from proteomics Conference on Applying NGS in Basic research Health care and Agriculture 11 th Sep 2014 Debasis Dash Where are the protein coding genes
More informationComputational & Quantitative Biology Lecture 6 RNA Sequencing
Peter A. Sims Dept. of Systems Biology Dept. of Biochemistry & Molecular Biophysics Sulzberger Columbia Genome Center October 27, 2014 Computational & Quantitative Biology Lecture 6 RNA Sequencing We Have
More informationNGS part 2: applications. Tobias Österlund
NGS part 2: applications Tobias Österlund tobiaso@chalmers.se NGS part of the course Week 4 Friday 13/2 15.15-17.00 NGS lecture 1: Introduction to NGS, alignment, assembly Week 6 Thursday 26/2 08.00-09.45
More informationMapping strategies for sequence reads
Mapping strategies for sequence reads Ernest Turro University of Cambridge 21 Oct 2013 Quantification A basic aim in genomics is working out the contents of a biological sample. 1. What distinct elements
More informationBST 226 Statistical Methods for Bioinformatics David M. Rocke. March 10, 2014 BST 226 Statistical Methods for Bioinformatics 1
BST 226 Statistical Methods for Bioinformatics David M. Rocke March 10, 2014 BST 226 Statistical Methods for Bioinformatics 1 NGS Technologies Illumina Sequencing HiSeq 2500 & MiSeq PacBio Sequencing PacBio
More informationAn introduction to RNA-seq. Nicole Cloonan - 4 th July 2018 #UQWinterSchool #Bioinformatics #GroupTherapy
An introduction to RNA-seq Nicole Cloonan - 4 th July 2018 #UQWinterSchool #Bioinformatics #GroupTherapy The central dogma Genome = all DNA in an organism (genotype) Transcriptome = all RNA (molecular
More informationQIAseq Targeted Panel Analysis Plugin USER MANUAL
QIAseq Targeted Panel Analysis Plugin USER MANUAL User manual for QIAseq Targeted Panel Analysis 1.1 Windows, macos and Linux June 18, 2018 This software is for research purposes only. QIAGEN Aarhus Silkeborgvej
More informationmeasuring gene expression December 11, 2018
measuring gene expression December 11, 2018 Intervening Sequences (introns): how does the cell get rid of them? Splicing!!! Highly conserved ribonucleoprotein complex recognizes intron/exon junctions and
More informationWheat CAP Gene Expression with RNA-Seq
Wheat CAP Gene Expression with RNA-Seq July 9 th -13 th, 2018 Overview of the workshop, Alina Akhunova http://www.ksre.k-state.edu/igenomics/workshops/ RNA-Seq Workshop Activities Lectures Laboratory Molecular
More informationresequencing storage SNP ncrna metagenomics private trio de novo exome ncrna RNA DNA bioinformatics RNA-seq comparative genomics
RNA Sequencing T TM variation genetics validation SNP ncrna metagenomics private trio de novo exome mendelian ChIP-seq RNA DNA bioinformatics custom target high-throughput resequencing storage ncrna comparative
More informationIntroduction to DNA-Sequencing
informatics.sydney.edu.au sih.info@sydney.edu.au The Sydney Informatics Hub provides support, training, and advice on research data, analyses and computing. Talk to us about your computing infrastructure,
More informationIn vivo genome-wide profiling of RNA secondary structure reveals novel regulatory features
In vivo genome-wide profiling of RNA secondary structure reveals novel regulatory features Yiliang Ding, Yin Tang, Chun Kit Kwok, Yu Zhang, Philip C. Bevilacqua & Sarah M. Assmann (2014) Seminar RNA Bioinformatics
More informationEucalyptus gene assembly
Eucalyptus gene assembly ACGT Plant Biotechnology meeting Charles Hefer Bioinformatics and Computational Biology Unit University of Pretoria October 2011 About Eucalyptus Most valuable and widely planted
More informationab initio and Evidence-Based Gene Finding
ab initio and Evidence-Based Gene Finding A basic introduction to annotation Outline What is annotation? ab initio gene finding Genome databases on the web Basics of the UCSC browser Evidence-based gene
More informationMODULE 1: INTRODUCTION TO THE GENOME BROWSER: WHAT IS A GENE?
MODULE 1: INTRODUCTION TO THE GENOME BROWSER: WHAT IS A GENE? Lesson Plan: Title Introduction to the Genome Browser: what is a gene? JOYCE STAMM Objectives Demonstrate basic skills in using the UCSC Genome
More informationCourse Presentation. Ignacio Medina Presentation
Course Index Introduction Agenda Analysis pipeline Some considerations Introduction Who we are Teachers: Marta Bleda: Computational Biologist and Data Analyst at Department of Medicine, Addenbrooke's Hospital
More informationDe novo assembly in RNA-seq analysis.
De novo assembly in RNA-seq analysis. Joachim Bargsten Wageningen UR/PRI/Plant Breeding October 2012 Motivation Transcriptome sequencing (RNA-seq) Gene expression / differential expression Reconstruct
More informationHigh Throughput Sequencing the Multi-Tool of Life Sciences. Lutz Froenicke DNA Technologies and Expression Analysis Cores UCD Genome Center
High Throughput Sequencing the Multi-Tool of Life Sciences Lutz Froenicke DNA Technologies and Expression Analysis Cores UCD Genome Center Complementary Approaches Illumina Still-imaging of clusters (~1000
More informationParts of a standard FastQC report
FastQC FastQC, written by Simon Andrews of Babraham Bioinformatics, is a very popular tool used to provide an overview of basic quality control metrics for raw next generation sequencing data. There are
More informationRNA standards v May
Standards, Guidelines and Best Practices for RNA-Seq: 2010/2011 I. Introduction: Sequence based assays of transcriptomes (RNA-seq) are in wide use because of their favorable properties for quantification,
More informationGalaxy Platform For NGS Data Analyses
Galaxy Platform For NGS Data Analyses Weihong Yan wyan@chem.ucla.edu Collaboratory Web Site http://qcb.ucla.edu/collaboratory http://collaboratory.lifesci.ucla.edu Workshop Outline ü Day 1 UCLA galaxy
More informationTECH NOTE Stranded NGS libraries from FFPE samples
TECH NOTE Stranded NGS libraries from FFPE samples Robust performance with extremely degraded FFPE RNA (DV 200 >25%) Consistent library quality across a range of input amounts (5 ng 50 ng) Compatibility
More informationFrom Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow
From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow Technical Overview Import VCF Introduction Next-generation sequencing (NGS) studies have created unanticipated challenges with
More informationRNA Sequencing. Next gen insight into transcriptomes , Elio Schijlen
RNA Sequencing Next gen insight into transcriptomes 05-06-2013, Elio Schijlen Transcriptome complete set of transcripts in a cell, and their quantity, for a specific developmental stage or physiological
More informationGenomic resources. for non-model systems
Genomic resources for non-model systems 1 Genomic resources Whole genome sequencing reference genome sequence comparisons across species identify signatures of natural selection population-level resequencing
More informationDifferential gene expression analysis using RNA-seq
https://abc.med.cornell.edu/ Differential gene expression analysis using RNA-seq Applied Bioinformatics Core, August 2017 Friederike Dündar with Luce Skrabanek & Ceyda Durmaz Day 4 overview (brief) theoretical
More informationShort Read Alignment to a Reference Genome
Short Read Alignment to a Reference Genome Shamith Samarajiwa CRUK Summer School in Bioinformatics Cambridge, September 2018 Aligning to a reference genome BWA Bowtie2 STAR GEM Pseudo Aligners for RNA-seq
More informationThe first thing you will see is the opening page. SeqMonk scans your copy and make sure everything is in order, indicated by the green check marks.
Open Seqmonk Launch SeqMonk The first thing you will see is the opening page. SeqMonk scans your copy and make sure everything is in order, indicated by the green check marks. SeqMonk Analysis Page 1 Create
More information