Introduction to RNA-Seq. David Wood Winter School in Mathematics and Computational Biology July 1, 2013
|
|
- Tabitha Carmel Lucas
- 6 years ago
- Views:
Transcription
1 Introduction to RNA-Seq David Wood Winter School in Mathematics and Computational Biology July 1, 2013
2 Abundance RNA is... Diverse Dynamic Central DNA rrna Epigenetics trna RNA mrna Time Protein
3 Abundance RNA is... Diverse Dynamic Central DNA rrna Epigenetics trna RNA mrna Time Protein Qualitative Quantitative Integrative Understand the molecular basis of gene function. Classify and transform cellular states
4 RNA studies involve... Biological System Questions Project Available Resources Technology DB ~/bin
5 RNA studies involve... Biological System Questions Project Available Resources Technology DB ~/bin This talk: Focusing on reference based mammalian RNA-seq analysis
6 Transcriptional Complexity TSS TSS TSS pa pa pa pa TSS genomic DNA micrornas spliced intron TSS transcription start site protein coding regions pa polyadenylation signal translation start site non-coding regions polyadenylation
7 Transcriptional Complexity TSS TSS TSS pa pa pa pa TSS tirna PASR mirna genomic DNA micrornas spliced intron TSS transcription start site protein coding regions pa polyadenylation signal translation start site non-coding regions polyadenylation
8 Transcriptional Complexity TSS TSS TSS pa pa pa pa TSS tirna PASR Alu mirna genomic DNA micrornas spliced intron TSS transcription start site protein coding regions pa polyadenylation signal translation start site non-coding regions polyadenylation
9 Transcriptional Complexity Mutations Allelic Expression TSS TSS TSS pa pa pa pa TSS tirna PASR Alu mirna RNA Editing genomic DNA micrornas spliced intron TSS transcription start site protein coding regions pa polyadenylation signal translation start site non-coding regions polyadenylation
10 RNA-seq TSS TSS TSS pa pa pa pa TSS tirna PASR Alu mirna non-spliced reads mutations strand specific junction reads Cloonan et al. Nat Methods 2008; 5:
11 Advantages of RNA-seq Discovery genes, exons, junctions, UTRs, fusions (Present and Future) %#!!!!" %!!!!!",-./ " $#!!!!" $!!!!!" #!!!!"!" #&" #'" (!" ($" (%" ()" (*" (#" ((" (+" (&" ('" +!" +$" 5/06789":6-02;/" /1BCD" <06E>;?6/6"
12 Advantages of RNA-seq Discovery genes, exons, junctions, UTRs, fusions (Present and Future) Dynamic Range,-./ " %#!!!!" %!!!!!" $#!!!!" $!!!!!" #!!!!"!" #&" #'" (!" ($" (%" ()" (*" (#" ((" (+" (&" ('" +!" +$" 5/06789":6-02;/" /1BCD" <06E>;?6/6" Mortazavi et al. Nat. Methods 2008; 5:
13 Advantages of RNA-seq Discovery genes, exons, junctions, UTRs, fusions (Present and Future) Dynamic Range,-./ " %#!!!!" %!!!!!" $#!!!!" $!!!!!" #!!!!"!" #&" #'" (!" ($" (%" ()" (*" (#" ((" (+" (&" ('" +!" +$" 5/06789":6-02;/" /1BCD" <06E>;?6/6" Mortazavi et al. Nat. Methods 2008; 5: Nucleotide Specific
14 Typical experiment workflow Field / Clinic Wet Lab Dry Lab Run Experiment Design Experiment Sample Acquisition Field / Clinic / Lab Obtain RNA Make Library Sequencing 1 Base Calling Mapping 2 Library QC 2 Sample Acquisition Verification Validation Analysis Interpretation 3 3 Publish
15 Typical experiment workflow Field / Clinic Wet Lab Dry Lab Run Experiment Design Experiment Sample Acquisition Field / Clinic / Lab Obtain RNA Make Library Sequencing 1 Base Calling Mapping 2 Library QC 2 Sample Acquisition Verification Validation Analysis Interpretation 3 3 Publish
16 Typical experiment workflow Field / Clinic Wet Lab Dry Lab Run Experiment Design Experiment Sample Acquisition Field / Clinic / Lab Obtain RNA Make Library Sequencing 1 Base Calling Mapping 2 Library QC 2 Sample Acquisition Verification Validation Analysis Interpretation 3 3 Publish
17 Typical experiment workflow Field / Clinic Wet Lab Dry Lab Run Experiment Design Experiment Sample Acquisition Field / Clinic / Lab Obtain RNA Make Library Sequencing 1 Base Calling Mapping 2 Library QC 2 Sample Acquisition Verification Validation Analysis Interpretation 3 3 Publish
18 Library Construction trna (15%) 5% Deplete rrna Enrich polya RNA AA AA AA Target RNA rrna (80%) Profile (ribosomes) AA A Fragment cellular RNA Capture (tiling arrays) ds-cdna synthesis Sequencing Ligate adaptors + Amplify
19 Typical experiment workflow Field / Clinic Wet Lab Dry Lab Run Experiment Design Experiment Sample Acquisition Field / Clinic / Lab Obtain RNA Make Library Sequencing 1 Base Calling Mapping 2 Library QC 2 Sample Acquisition Verification Validation Analysis Interpretation 3 3 Publish
20 RNA-seq Mapping Challenge #1: Introns
21 RNA-seq Mapping Challenge #1: Introns Align to database of junctions or transcriptome Split Read Alignments Wood et al. Bioinformatics 2011; 27: Trapnell et al. Bioinformatics 2009; 25:
22 RNA-seq Mapping Challenge #1: Introns Align to database of junctions or transcriptome Split Read Alignments Wood et al. Bioinformatics 2011; 27: Trapnell et al. Bioinformatics 2009; 25: Challenge #2: Correctness Sufficient Overlap Sufficient Evidence
23 RNA-seq Mapping Challenge #1: Introns Align to database of junctions or transcriptome Split Read Alignments Wood et al. Bioinformatics 2011; 27: Trapnell et al. Bioinformatics 2009; 25: Challenge #2: Correctness Challenge #3: Multi-mappers Sufficient Overlap Sufficient Evidence Align to the transcriptome Sequence Similarity
24 RNA-seq Mapping Data QC (clipping) Align to Filter Set Align to genome Align to junctions Split read Alignment Exclude Flag and Exclude Choose Alignments, Disambiguate Tophat: Trapnell et al. Bioinformatics 2009; 25:
25 RNA-seq Mapping Data QC (clipping) Align to Filter Set Align to genome Align to junctions Split read Alignment Exclude Flag and Exclude Choose Alignments, Disambiguate Tophat: Trapnell et al. Bioinformatics 2009; 25: BAM BAM BAM Alignment Filtering Library QC Analysis
26 RNA-seq Mapping rrna, trna? reference? diploid? gene model? ESTs? Algorithm? Data QC (clipping) Align to Filter Set Align to genome Align to junctions Split read Alignment Exclude Flag and Exclude Choose Alignments, Disambiguate Tophat: Trapnell et al. Bioinformatics 2009; 25: BAM BAM BAM Alignment Filtering Library QC Analysis
27 Typical experiment workflow Field / Clinic Wet Lab Dry Lab Run Experiment Design Experiment Sample Acquisition Field / Clinic / Lab Obtain RNA Make Library Sequencing 1 Base Calling Mapping 2 Library QC 2 Sample Acquisition Verification Validation Analysis Interpretation 3 3 Publish
28 Library Quality Control (QC) trna (15%) 5% Deplete rrna Enrich polya RNA AA AA AA Target RNA rrna (80%) Profile (ribosomes) AA A Fragment cellular RNA Capture (tiling arrays) ds-cdna synthesis Sequencing Ligate adaptors + Amplify
29 Library Quality Control (QC) trna (15%) 5% Deplete rrna Enrich polya RNA AA AA AA Target RNA Affects RNA content (Expression quantification) rrna (80%) Profile (ribosomes) AA A Fragment cellular RNA Capture (tiling arrays) ds-cdna synthesis Sequencing Ligate adaptors + Amplify
30 Library Quality Control (QC) trna (15%) 5% Deplete rrna Enrich polya RNA AA AA AA Target RNA Affects RNA content (Expression quantification) rrna (80%) cellular RNA Profile (ribosomes) Capture (tiling arrays) AA A Fragment Affects Insert Size (transcript identification) ds-cdna synthesis Sequencing Ligate adaptors + Amplify
31 Library Quality Control (QC) trna (15%) 5% Deplete rrna Enrich polya RNA AA AA AA Target RNA Affects RNA content (Expression quantification) rrna (80%) cellular RNA Profile (ribosomes) Capture (tiling arrays) AA A Fragment Affects Insert Size (transcript identification) ds-cdna synthesis Affects Strand Specificity Sequencing Ligate adaptors + Amplify
32 Library Quality Control (QC) trna (15%) 5% Deplete rrna Enrich polya RNA AA AA AA Target RNA Affects RNA content (Expression quantification) rrna (80%) cellular RNA Profile (ribosomes) Capture (tiling arrays) AA A Fragment Affects Insert Size (transcript identification) ds-cdna synthesis Affects Strand Specificity Sequencing Ligate adaptors + Amplify Affects Library Complexity (Tag uniqueness)
33 Library Quality Control (QC) trna (15%) 5% Deplete rrna Enrich polya RNA AA AA AA Target RNA Affects RNA content (Expression quantification) rrna (80%) cellular RNA Profile (ribosomes) Capture (tiling arrays) AA A Fragment Affects Insert Size (transcript identification) ds-cdna synthesis Affects Strand Specificity Affects Mapping Rate Paired-end? Sequencing Ligate adaptors + Amplify Affects Library Complexity (Tag uniqueness)
34 Typical experiment workflow Field / Clinic Wet Lab Dry Lab Run Experiment Design Experiment Sample Acquisition Field / Clinic / Lab Obtain RNA Make Library Sequencing 1 Base Calling Mapping 2 Library QC 2 Sample Acquisition Verification Validation Analysis Interpretation 3 3 Publish
35 Calculate Gene Expression Gene A 3500nt (700 reads) Gene B 400nt (160 reads)
36 Calculate Gene Expression Gene A 3500nt (700 reads) Gene B 400nt (160 reads) RPKM = 2.0 RPKM = 4.0 Reads Per Kilobase per Million 10 RPKM = R L N R = Gene Read Count L = Length of gene N = Library Size Mortazavi et al. Nat. Methods 2008; 5:
37 Further Normalisation Repeat Normalise to mappable gene length Koehler et al. Bioinformatics 2010
38 Further Normalisation Repeat Normalise to mappable gene length Scale Expression Values by TMM Koehler et al. Bioinformatics 2010 Cellular RNA Cond. 1 Cond. 2 Robinson et al. Genome Biology 2010; 11:R25
39 Further Normalisation Repeat Normalise to mappable gene length Scale Expression Values by TMM Koehler et al. Bioinformatics 2010 Cellular RNA RPKM Cond. 1 Cond. 2 Cond. 1 Cond. 2 Robinson et al. Genome Biology 2010; 11:R25
40 Further Normalisation Repeat Normalise to mappable gene length Scale Expression Values by TMM Koehler et al. Bioinformatics 2010 Robinson et al. Genome Biology 2010; 11:R25 Benjamini et al. NAR; 2012 Normalise to GC content of region
41 Calculate Feature Expression
42 Calculate Feature Expression Exonic Region
43 Calculate Feature Expression Exonic Region Exon Junction
44 Calculate Feature Expression Exonic Region Exon Junction Intronic Region
45 Calculate Feature Expression Exonic Region Exon Junction Intronic Region Exon Boundary
46 Calculate Feature Expression Exonic Region Exon Junction Intronic Region Exon Boundary Intergenic Region
47 Calculate Feature Expression Calculate RPKM for any feature Exonic Region Exon Junction Intronic Region Exon Boundary Intergenic Region
48 Calculate Feature Expression Calculate RPKM for any feature Exonic Region Exon Junction Intronic Region Exon Boundary Intergenic Region Extended 3 UTR
49 Calculate Feature Expression Calculate RPKM for any feature Exonic Region Exon Junction Intronic Region Exon Boundary Intergenic Region Extended 3 UTR Retained Intron
50 Calculate Transcript Expression
51 Calculate Transcript Expression diagnostic feature
52 Calculate Transcript Expression diagnostic feature Approach #1: Expression calculated using diagnostic features Strong Evidence Easy to calculate Sampling Variability Lacks statistical robustness Dependent on gene model Excludes Transcripts ALEXA-seq: Griffith et al. Nat. Methods 2010; 11:R25
53 Calculate Transcript Expression
54 Calculate Transcript Expression Approach #2: Expression estimated Construct bipartite graph, then finds minimum path Cufflinks: Trapnell et al. Nat. Biotech. 2010, 28:
55 Calculate Transcript Expression Approach #2: Expression estimated Construct bipartite graph, then finds minimum path Cufflinks: Trapnell et al. Nat. Biotech. 2010, 28: Estimates expression for all transcripts Incorporates ambiguous reads Model can fail in complex / highly expressed regions More statistically robust Error rate largely unknown
56 Expressed or not? Cond. 1 Cond. 2 Cond. 3 Frequency not expressed expressed Need to determine expression cut-off value log2 (expression)
57 Expressed or not? 1 Expressed if > 1 RPKM Has literature support Lacks sensitivity Arbitrary
58 Expressed or not? 1 Expressed if > 1 RPKM Has literature support Lacks sensitivity Arbitrary 2 Expressed if above intergenic background Frequency log2 Expression 95th percentile
59 Expressed or not? 1 Expressed if > 1 RPKM Has literature support Lacks sensitivity Arbitrary 2 Expressed if above intergenic background Cut-off based on empirical evidence Still somewhat arbitrary Frequency log2 Expression 95th percentile
60 Expressed or not? Expressed if > 1 RPKM Expressed if above intergenic background Incorporate replicate information Has literature support Cut-off based on empirical evidence np IDR Based on observed reproducibility Lacks sensitivity Still somewhat arbitrary Requires replicates log2 (expression) bins Arbitrary Frequency Rep 1 vs Rep 2 Rep 2 vs Rep 1 Mean Cut off log2 Expression th percentile
61 Expressed or not? 1 Expressed if > 1 RPKM Has literature support Lacks sensitivity Arbitrary 2 Expressed if above intergenic background Cut-off based on empirical evidence Still somewhat arbitrary Frequency log2 Expression 95th percentile 3 Incorporate replicate information Based on observed reproducibility Requires replicates np IDR Rep 1 vs Rep 2 Rep 2 vs Rep 1 Mean Cut off log2 (expression) bins
62 Expressed or not? 1 Expressed if > 1 RPKM Has literature support Lacks sensitivity Arbitrary 2 Expressed if above intergenic background Cut-off based on empirical evidence Still somewhat arbitrary Frequency log2 Expression 95th percentile 3 Incorporate replicate information Based on observed reproducibility Requires replicates np IDR Rep 1 vs Rep 2 Rep 2 vs Rep 1 Mean Cut off log2 (expression) bins Choose what is reasonable for your experiment, be consistent!
63 Nucleotide-Resolution Analysis Imprinting ICR
64 Nucleotide-Resolution Analysis Imprinting eqtl sqtl
65 Nucleotide-Resolution Analysis Imprinting eqtl sqtl Complex Traits
66 Nucleotide-Resolution Analysis Imprinting eqtl sqtl Complex Traits Allelic Fraction A B C SNPs
67 Nucleotide-Resolution Analysis Imprinting eqtl sqtl Complex Traits A B C SNPs Allelic Fraction Density Expected Mean Observed Mean Reference bias Fraction of RNA seq Reads Matching Reference Allele Degner et al. Bioinformatics 2009
68 Nucleotide-Resolution Analysis Imprinting eqtl sqtl Complex Traits A B C SNPs Allelic Fraction Density Expected Mean Observed Mean Reference bias Map to a diploid genome Fraction of RNA seq Reads Matching Reference Allele AlleleSeq: Rozowsky et al. Mol. Sys. Bio 2011 Degner et al. Bioinformatics 2009
69 Typical experiment workflow Field / Clinic Wet Lab Dry Lab Run Experiment Design Experiment Sample Acquisition Field / Clinic / Lab Obtain RNA Make Library Sequencing 1 Base Calling Mapping 2 Library QC 2 Sample Acquisition Verification Validation Analysis Interpretation 3 3 Publish
70 The future of RNA-seq (now) Single Cell Shalek, et al. Nature 2013
71 The future of RNA-seq (now) Single Cell Huge Cohort Genotype-Tissue Expression project (GTEx) 900 donors 30,000 RNA-seq data sets! Shalek, et al. Nature 2013 Lonsdale, et al. Nature Genetics 2013
72 Summary 1 Choose an alignment approach suitable for your experiment, available resources and tools 2 Assess library quality, specifically rrna contamination, insert size, strand specificity and library complexity 3 Gene and Feature Expression can be calculated using count data, and normalised by length, library size and GC content 4 Transcript expression calculation requires alternative approaches and algorithms, which although common, are largely unproven 5 RNA-seq can interrogate nucleotide specific questions, but be careful of alignment biases (diploid mapping can help here)
73 Questions and References Cloonan et al. Nat Methods 2008; Stem cell transcriptome profiling via massive-scale mrna sequencing Mortazavi et al. Nat. Methods 2008; Mapping and quantifying mammalian transcriptomes by RNA-Seq Wood et al. Bioinformatics 2011; X-MATE: A flexible system for mapping short read data Trapnell et al. Bioinformatics 2009; TopHat: discovering splice junctions with RNA-Seq Koehler et al. Bioinformatics The Uniqueome: A mappability resource for short-tag sequencing Robinson et al. Genome Biology 2010; A scaling normalization method for differential expression analysis of RNA-seq data. Benjamini et al. NAR; Summarizing and correcting the GC content bias in high-throughput sequencing Griffith et al. Nat. Methods 2010; Alternative expression analysis by RNA sequencing. Trapnell et al. Nat. Biotech. 2010; Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform Degner et al. Bioinformatics 2009; Effect of read-mapping biases on detecting allele-specific expression from RNA-sequencing Rozowsky et al. Mol. Sys. Bio 2011; AlleleSeq: analysis of allele-specific expression and binding in a Shalek, et al. Nature 2013; Single-cell transcriptomics reveals bimodality in expression and splicing in immune cells Lonsdale, et al. Nature Genetics 2013; The Genotype-Tissue Expression (GTEx) project.
RNA-Sequencing analysis
RNA-Sequencing analysis Markus Kreuz 25. 04. 2012 Institut für Medizinische Informatik, Statistik und Epidemiologie Content: Biological background Overview transcriptomics RNA-Seq RNA-Seq technology Challenges
More informationRNA-Seq with the Tuxedo Suite
RNA-Seq with the Tuxedo Suite Monica Britton, Ph.D. Sr. Bioinformatics Analyst September 2015 Workshop The Basic Tuxedo Suite References Trapnell C, et al. 2009 TopHat: discovering splice junctions with
More informationmeasuring gene expression December 5, 2017
measuring gene expression December 5, 2017 transcription a usually short-lived RNA copy of the DNA is created through transcription RNA is exported to the cytoplasm to encode proteins some types of RNA
More informationSNPs - GWAS - eqtls. Sebastian Schmeier
SNPs - GWAS - eqtls s.schmeier@gmail.com http://sschmeier.github.io/bioinf-workshop/ 17.08.2015 Overview Single nucleotide polymorphism (refresh) SNPs effect on genes (refresh) Genome-wide association
More informationSMARTer Ultra Low RNA Kit for Illumina Sequencing Two powerful technologies combine to enable sequencing with ultra-low levels of RNA
SMARTer Ultra Low RNA Kit for Illumina Sequencing Two powerful technologies combine to enable sequencing with ultra-low levels of RNA The most sensitive cdna synthesis technology, combined with next-generation
More informationWhole Transcriptome Analysis of Illumina RNA- Seq Data. Ryan Peters Field Application Specialist
Whole Transcriptome Analysis of Illumina RNA- Seq Data Ryan Peters Field Application Specialist Partek GS in your NGS Pipeline Your Start-to-Finish Solution for Analysis of Next Generation Sequencing Data
More informationMapping strategies for sequence reads
Mapping strategies for sequence reads Ernest Turro University of Cambridge 21 Oct 2013 Quantification A basic aim in genomics is working out the contents of a biological sample. 1. What distinct elements
More informationRNA-Seq Workshop AChemS Sunil K Sukumaran Monell Chemical Senses Center Philadelphia
RNA-Seq Workshop AChemS 2017 Sunil K Sukumaran Monell Chemical Senses Center Philadelphia Benefits & downsides of RNA-Seq Benefits: High resolution, sensitivity and large dynamic range Independent of prior
More informationMachine Learning Methods for RNA-seq-based Transcriptome Reconstruction
Machine Learning Methods for RNA-seq-based Transcriptome Reconstruction Gunnar Rätsch Friedrich Miescher Laboratory Max Planck Society, Tübingen, Germany NGS Bioinformatics Meeting, Paris (March 24, 2010)
More informationGene Expression Technology
Gene Expression Technology Bing Zhang Department of Biomedical Informatics Vanderbilt University bing.zhang@vanderbilt.edu Gene expression Gene expression is the process by which information from a gene
More informationIntroduction to RNA-Seq
Introduction to RNA-Seq Monica Britton, Ph.D. Sr. Bioinformatics Analyst March 2015 Workshop Overview of RNA-Seq Activities RNA-Seq Concepts, Terminology, and Work Flows Using Single-End Reads and a Reference
More informationTranscription in Eukaryotes
Transcription in Eukaryotes Biology I Hayder A Giha Transcription Transcription is a DNA-directed synthesis of RNA, which is the first step in gene expression. Gene expression, is transformation of the
More informationRNA-Seq Software, Tools, and Workflows
RNA-Seq Software, Tools, and Workflows Monica Britton, Ph.D. Sr. Bioinformatics Analyst September 1, 2016 Some mrna-seq Applications Differential gene expression analysis Transcriptional profiling Assumption:
More informationIntroduction to transcriptome analysis using High Throughput Sequencing technologies. D. Puthier 2012
Introduction to transcriptome analysis using High Throughput Sequencing technologies D. Puthier 2012 A typical RNA-Seq experiment Library construction Protocol variations Fragmentation methods RNA: nebulization,
More informationNon-Organic-Based Isolation of Mammalian microrna using Norgen s microrna Purification Kit
Application Note 13 RNA Sample Preparation Non-Organic-Based Isolation of Mammalian microrna using Norgen s microrna Purification Kit B. Lam, PhD 1, P. Roberts, MSc 1 Y. Haj-Ahmad, M.Sc., Ph.D 1,2 1 Norgen
More informationGene Expression: Transcription
Gene Expression: Transcription The majority of genes are expressed as the proteins they encode. The process occurs in two steps: Transcription = DNA RNA Translation = RNA protein Taken together, they make
More informationLong and short/small RNA-seq data analysis
Long and short/small RNA-seq data analysis GEF5, 4.9.2015 Sami Heikkinen, PhD, Dos. Topics 1. RNA-seq in a nutshell 2. Long vs short/small RNA-seq 3. Bioinformatic analysis work flows GEF5 / Heikkinen
More informationAssay Standards Working Group Nov 2012 Assay Standards Working Group Recommendations, November 2012
Assay Standards Working Group Recommendations, November 2012 Contents Assay Standards Working Group Recommendations, August 2012... 1 Contents... 1 Introduction... 2 1: Reference Epigenome Criteria...
More informationGene Regulation Solutions. Microarrays and Next-Generation Sequencing
Gene Regulation Solutions Microarrays and Next-Generation Sequencing Gene Regulation Solutions The Microarrays Advantage Microarrays Lead the Industry in: Comprehensive Content SurePrint G3 Human Gene
More informationRNAseq Differential Gene Expression Analysis Report
RNAseq Differential Gene Expression Analysis Report Customer Name: Institute/Company: Project: NGS Data: Bioinformatics Service: IlluminaHiSeq2500 2x126bp PE Differential gene expression analysis Sample
More informationWelcome to the NGS webinar series
Welcome to the NGS webinar series Webinar 1 NGS: Introduction to technology, and applications NGS Technology Webinar 2 Targeted NGS for Cancer Research NGS in cancer Webinar 3 NGS: Data analysis for genetic
More informationMicroarray Gene Expression Analysis at CNIO
Microarray Gene Expression Analysis at CNIO Orlando Domínguez Genomics Unit Biotechnology Program, CNIO 8 May 2013 Workflow, from samples to Gene Expression data Experimental design user/gu/ubio Samples
More informationRNA Seq: Methods and Applica6ons. Prat Thiru
RNA Seq: Methods and Applica6ons Prat Thiru 1 Outline Intro to RNA Seq Biological Ques6ons Comparison with Other Methods RNA Seq Protocol RNA Seq Applica6ons Annota6on Quan6fica6on Other Applica6ons Expression
More informationCourse Presentation. Ignacio Medina Presentation
Course Index Introduction Agenda Analysis pipeline Some considerations Introduction Who we are Teachers: Marta Bleda: Computational Biologist and Data Analyst at Department of Medicine, Addenbrooke's Hospital
More informationUnit 6: Molecular Genetics & DNA Technology Guided Reading Questions (100 pts total)
Name: AP Biology Biology, Campbell and Reece, 7th Edition Adapted from chapter reading guides originally created by Lynn Miriello Chapter 16 The Molecular Basis of Inheritance Unit 6: Molecular Genetics
More informationAGILENT S BIOINFORMATICS ANALYSIS SOFTWARE
ACCELERATING PROGRESS IS IN OUR GENES AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE GENESPRING GENE EXPRESSION (GX) MASS PROFILER PROFESSIONAL (MPP) PATHWAY ARCHITECT (PA) See Deeper. Reach Further. BIOINFORMATICS
More informationMultiple choice questions (numbers in brackets indicate the number of correct answers)
1 Multiple choice questions (numbers in brackets indicate the number of correct answers) February 1, 2013 1. Ribose is found in Nucleic acids Proteins Lipids RNA DNA (2) 2. Most RNA in cells is transfer
More informationIntroduction to RNA sequencing
Introduction to RNA sequencing Bioinformatics perspective Olga Dethlefsen NBIS, National Bioinformatics Infrastructure Sweden November 2017 Olga (NBIS) RNA-seq November 2017 1 / 49 Outline Why sequence
More informationGreen Center Computational Core ChIP- Seq Pipeline, Just a Click Away
Green Center Computational Core ChIP- Seq Pipeline, Just a Click Away Venkat Malladi Computational Biologist Computational Core Cecil H. and Ida Green Center for Reproductive Biology Science Introduc
More informationQIAGEN s NGS Solutions for Biomarkers NGS & Bioinformatics team QIAGEN (Suzhou) Translational Medicine Co.,Ltd
QIAGEN s NGS Solutions for Biomarkers NGS & Bioinformatics team QIAGEN (Suzhou) Translational Medicine Co.,Ltd 1 Our current NGS & Bioinformatics Platform 2 Our NGS workflow and applications 3 QIAGEN s
More informationAbout Strand NGS. Strand Genomics, Inc All rights reserved.
About Strand NGS Strand NGS-formerly known as Avadis NGS, is an integrated platform that provides analysis, management and visualization tools for next-generation sequencing data. It supports extensive
More informationIntroduction to the UCSC genome browser
Introduction to the UCSC genome browser Dominik Beck NHMRC Peter Doherty and CINSW ECR Fellow, Senior Lecturer Lowy Cancer Research Centre, UNSW and Centre for Health Technology, UTS SYDNEY NSW AUSTRALIA
More informationSingle Nucleotide Variant Analysis. H3ABioNet May 14, 2014
Single Nucleotide Variant Analysis H3ABioNet May 14, 2014 Outline What are SNPs and SNVs? How do we identify them? How do we call them? SAMTools GATK VCF File Format Let s call variants! Single Nucleotide
More informationBio 101 Sample questions: Chapter 10
Bio 101 Sample questions: Chapter 10 1. Which of the following is NOT needed for DNA replication? A. nucleotides B. ribosomes C. Enzymes (like polymerases) D. DNA E. all of the above are needed 2 The information
More informationChapter 15 Gene Technologies and Human Applications
Chapter Outline Chapter 15 Gene Technologies and Human Applications Section 1: The Human Genome KEY IDEAS > Why is the Human Genome Project so important? > How do genomics and gene technologies affect
More informationIntroduction to Bioinformatics and Gene Expression Technologies
Introduction to Bioinformatics and Gene Expression Technologies Utah State University Fall 2017 Statistical Bioinformatics (Biomedical Big Data) Notes 1 1 Vocabulary Gene: hereditary DNA sequence at a
More informationGenomics and Transcriptomics of Spirodela polyrhiza
Genomics and Transcriptomics of Spirodela polyrhiza Doug Bryant Bioinformatics Core Facility & Todd Mockler Group, Donald Danforth Plant Science Center Desired Outcomes High-quality genomic reference sequence
More informationless sensitive than RNA-seq but more robust analysis pipelines expensive but quantitiatve standard but typically not high throughput
Chapter 11: Gene Expression The availability of an annotated genome sequence enables massively parallel analysis of gene expression. The expression of all genes in an organism can be measured in one experiment.
More informationIntroductory Next Gen Workshop
Introductory Next Gen Workshop http://www.illumina.ucr.edu/ http://www.genomics.ucr.edu/ Workshop Objectives Workshop aimed at those who are new to Illumina sequencing and will provide: - a basic overview
More informationRNA-Seq analysis workshop
RNA-Seq analysis workshop Zhangjun Fei Boyce Thompson Institute for Plant Research USDA Robert W. Holley Center for Agriculture and Health Cornell University Outline Background of RNA-Seq Application of
More informationThe Genetic Code and Transcription. Chapter 12 Honors Genetics Ms. Susan Chabot
The Genetic Code and Transcription Chapter 12 Honors Genetics Ms. Susan Chabot TRANSCRIPTION Copy SAME language DNA to RNA Nucleic Acid to Nucleic Acid TRANSLATION Copy DIFFERENT language RNA to Amino
More informationHello! Outline. Cell Biology: RNA and Protein synthesis. In all living cells, DNA molecules are the storehouses of information. 6.
Cell Biology: RNA and Protein synthesis In all living cells, DNA molecules are the storehouses of information Hello! Outline u 1. Key concepts u 2. Central Dogma u 3. RNA Types u 4. RNA (Ribonucleic Acid)
More informationYear III Pharm.D Dr. V. Chitra
Year III Pharm.D Dr. V. Chitra 1 Genome entire genetic material of an individual Transcriptome set of transcribed sequences Proteome set of proteins encoded by the genome 2 Only one strand of DNA serves
More informationDemo of mrna NGS Concluding Report
Demo of mrna NGS Concluding Report Project: Demo Report Customer: Dr. Demo Company/Institute: Exiqon AS Date: 09-Mar-2015 Performed by Exiqon A/S Company Reg.No.(CVR) 18 98 44 31 Skelstedet 16 DK-2950,
More informationOutline. Introduction to ab initio and evidence-based gene finding. Prokaryotic gene predictions
Outline Introduction to ab initio and evidence-based gene finding Overview of computational gene predictions Different types of eukaryotic gene predictors Common types of gene prediction errors Wilson
More informationHigher Human Biology Unit 1: Human Cells Pupils Learning Outcomes
Higher Human Biology Unit 1: Human Cells Pupils Learning Outcomes 1.1 Division and Differentiation in Human Cells I can state that cellular differentiation is the process by which a cell develops more
More informationBio11 Announcements. Ch 21: DNA Biology and Technology. DNA Functions. DNA and RNA Structure. How do DNA and RNA differ? What are genes?
Bio11 Announcements TODAY Genetics (review) and quiz (CP #4) Structure and function of DNA Extra credit due today Next week in lab: Case study presentations Following week: Lab Quiz 2 Ch 21: DNA Biology
More informationDNA Structure & the Genome. Bio160 General Biology
DNA Structure & the Genome Bio160 General Biology Lecture Outline I. DNA A nucleic acid II. Chromosome Structure III. Chromosomes and Genes IV. DNA vs. RNA I. DNA A Nucleic Acid Structure of DNA: Remember:
More information132 Grundlagen der Bioinformatik, SoSe 14, D. Huson, June 22, This exposition is based on the following source, which is recommended reading:
132 Grundlagen der Bioinformatik, SoSe 14, D. Huson, June 22, 214 1 Gene Prediction Using HMMs This exposition is based on the following source, which is recommended reading: 1. Chris Burge and Samuel
More information8/21/2014. From Gene to Protein
From Gene to Protein Chapter 17 Objectives Describe the contributions made by Garrod, Beadle, and Tatum to our understanding of the relationship between genes and enzymes Briefly explain how information
More informationGene Identification in silico
Gene Identification in silico Nita Parekh, IIIT Hyderabad Presented at National Seminar on Bioinformatics and Functional Genomics, at Bioinformatics centre, Pondicherry University, Feb 15 17, 2006. Introduction
More informationGrundlagen der Bioinformatik, SoSe 11, D. Huson, July 4, This exposition is based on the following source, which is recommended reading:
Grundlagen der Bioinformatik, SoSe 11, D. Huson, July 4, 211 155 12 Gene Prediction Using HMMs This exposition is based on the following source, which is recommended reading: 1. Chris Burge and Samuel
More informationChapter 12. DNA TRANSCRIPTION and TRANSLATION
Chapter 12 DNA TRANSCRIPTION and TRANSLATION 12-3 RNA and Protein Synthesis WARM UP What are proteins? Where do they come from? From DNA to RNA to Protein DNA in our cells carry the instructions for making
More informationChIP-Seq Data Analysis. J Fass UCD Genome Center Bioinformatics Core Wednesday 15 June 2015
ChIP-Seq Data Analysis J Fass UCD Genome Center Bioinformatics Core Wednesday 15 June 2015 What s the Question? Where do Transcription Factors (TFs) bind genomic DNA 1? (Where do other things bind DNA
More informationChapter 13. From DNA to Protein
Chapter 13 From DNA to Protein Proteins All proteins consist of polypeptide chains A linear sequence of amino acids Each chain corresponds to the nucleotide base sequenceof a gene The Path From Genes to
More informationSCALABLE, REPRODUCIBLE RNA-Seq
SCALABLE, REPRODUCIBLE RNA-Seq SCALABLE, REPRODUCIBLE RNA-Seq Advances in the RNA sequencing workflow, from sample preparation through data analysis, are enabling deeper and more accurate exploration
More informationAdv Biology: DNA and RNA Study Guide
Adv Biology: DNA and RNA Study Guide Chapter 12 Vocabulary -Notes What experiments led up to the discovery of DNA being the hereditary material? o The discovery that DNA is the genetic code involved many
More informationReview of Protein (one or more polypeptide) A polypeptide is a long chain of..
Gene expression Review of Protein (one or more polypeptide) A polypeptide is a long chain of.. In a protein, the sequence of amino acid determines its which determines the protein s A protein with an enzymatic
More informationFunctional Genomics Overview RORY STARK PRINCIPAL BIOINFORMATICS ANALYST CRUK CAMBRIDGE INSTITUTE 18 SEPTEMBER 2017
Functional Genomics Overview RORY STARK PRINCIPAL BIOINFORMATICS ANALYST CRUK CAMBRIDGE INSTITUTE 18 SEPTEMBER 2017 Agenda What is Functional Genomics? RNA Transcription/Gene Expression Measuring Gene
More informationFrom Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow
From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow Technical Overview Import VCF Introduction Next-generation sequencing (NGS) studies have created unanticipated challenges with
More informationDNA. bioinformatics. epigenetics methylation structural variation. custom. assembly. gene. tumor-normal. mendelian. BS-seq. prediction.
Epigenomics T TM activation SNP target ncrna validation metagenomics genetics private RRBS-seq de novo trio RIP-seq exome mendelian comparative genomics DNA NGS ChIP-seq bioinformatics assembly tumor-normal
More informationLecture #1. Introduction to microarray technology
Lecture #1 Introduction to microarray technology Outline General purpose Microarray assay concept Basic microarray experimental process cdna/two channel arrays Oligonucleotide arrays Exon arrays Comparing
More informationRNA spike-in controls & analysis methods for trustworthy genome-scale measurements
RNA spike-in controls & analysis methods for trustworthy genome-scale measurements Sarah A. Munro, Ph.D. Genome-Scale Measurements Group ABRF Meeting March 29, 2015 Overview External RNA Controls Consortium
More information7.2 Protein Synthesis. From DNA to Protein Animation
7.2 Protein Synthesis From DNA to Protein Animation Proteins Why are proteins so important? They break down your food They build up muscles They send signals through your brain that control your body They
More informationINTRODUCTION TO REVERSE TRANSCRIPTION PCR (RT-PCR) ABCF 2016 BecA-ILRI Hub, Nairobi 21 st September 2016 Roger Pelle Principal Scientist
INTRODUCTION TO REVERSE TRANSCRIPTION PCR (RT-PCR) ABCF 2016 BecA-ILRI Hub, Nairobi 21 st September 2016 Roger Pelle Principal Scientist Objective of PCR To provide a solution to one of the most pressing
More informationChapter 12 Packet DNA 1. What did Griffith conclude from his experiment? 2. Describe the process of transformation.
Chapter 12 Packet DNA and RNA Name Period California State Standards covered by this chapter: Cell Biology 1. The fundamental life processes of plants and animals depend on a variety of chemical reactions
More informationSOLiD Total RNA-Seq Kit SOLiD RNA Barcoding Kit
SOLiD Total RNA-Seq Kit SOLiD RNA Barcoding Kit Agenda SOLiD Total RNAseq Kit Overview Kit Configurations Barcoding Kit Introduction New Small RNA and WT Workflow Small RNA Workflow Step-by-step Workflow
More informationMicroarrays: since we use probes we obviously must know the sequences we are looking at!
These background are needed: 1. - Basic Molecular Biology & Genetics DNA replication Transcription Post-transcriptional RNA processing Translation Post-translational protein modification Gene expression
More informationProtein Synthesis: Transcription and Translation
Protein Synthesis: Transcription and Translation Proteins In living things, proteins are in charge of the expression of our traits (hair/eye color, ability to make insulin, predisposition for cancer, etc.)
More informationRNA-seq Data Analysis
Lecture 3. Clustering; Function/Pathway Enrichment analysis RNA-seq Data Analysis Qi Sun Bioinformatics Facility Biotechnology Resource Center Cornell University Lecture 1. Map RNA-seq read to genome Lecture
More informationMolecular Markers CRITFC Genetics Workshop December 9, 2014
Molecular Markers CRITFC Genetics Workshop December 9, 2014 Molecular Markers Tools that allow us to collect information about an individual, a population, or a species Application in fisheries mating
More informationCM581A2: NEXT GENERATION SEQUENCING PLATFORMS AND LIBRARY GENERATION
CM581A2: NEXT GENERATION SEQUENCING PLATFORMS AND LIBRARY GENERATION Fall 2015 Instructors: Coordinator: Carol Wilusz, Associate Professor MIP, CMB Instructor: Dan Sloan, Assistant Professor, Biology,
More informationTop 5 Lessons Learned From MAQC III/SEQC
Top 5 Lessons Learned From MAQC III/SEQC Weida Tong, Ph.D Division of Bioinformatics and Biostatistics, NCTR/FDA Weida.tong@fda.hhs.gov; 870 543 7142 1 MicroArray Quality Control (MAQC) An FDA led community
More informationBIO 311C Spring Lecture 36 Wednesday 28 Apr.
BIO 311C Spring 2010 1 Lecture 36 Wednesday 28 Apr. Synthesis of a Polypeptide Chain 5 direction of ribosome movement along the mrna 3 ribosome mrna NH 2 polypeptide chain direction of mrna movement through
More informationFigure S1: NUN preparation yields nascent, unadenylated RNA with a different profile from Total RNA.
Summary of Supplemental Information Figure S1: NUN preparation yields nascent, unadenylated RNA with a different profile from Total RNA. Figure S2: rrna removal procedure is effective for clearing out
More informationCH 17 :From Gene to Protein
CH 17 :From Gene to Protein Defining a gene gene gene Defining a gene is problematic because one gene can code for several protein products, some genes code only for RNA, two genes can overlap, and there
More informationAgilent Genomics Software Future Directions
Agilent Genomics Software Future Directions Michael Rosenberg, PhD Director, Genomics Software Agilent: A Focused Measurement Company Serving Diverse End Markets Electronic Measurement 2008 Revenue: $3.6
More informationAnalysis of Biological Sequences SPH
Analysis of Biological Sequences SPH 140.638 swheelan@jhmi.edu nuts and bolts meet Tuesdays & Thursdays, 3:30-4:50 no exam; grade derived from 3-4 homework assignments plus a final project (open book,
More informationBIOLOGY Dr.Locke Lecture# 27 An Introduction to Polymerase Chain Reaction (PCR)
BIOLOGY 207 - Dr.Locke Lecture# 27 An Introduction to Polymerase Chain Reaction (PCR) Required readings and problems: Reading: Open Genetics, Chapter 8.1 Problems: Chapter 8 Optional Griffiths (2008) 9
More informationNext Generation Sequencing. Target Enrichment
Next Generation Sequencing Target Enrichment Next Generation Sequencing Your Partner in Every Step from Sample to Data NGS: Revolutionizing Genetic Analysis with Single-Molecule Resolution Next generation
More informationAnalysis of Differential Gene Expression in Cattle Using mrna-seq
Analysis of Differential Gene Expression in Cattle Using mrna-seq mrna-seq A rough guide for green horns Animal and Grassland Research and Innovation Centre Animal and Bioscience Research Department Teagasc,
More informationRegulation of eukaryotic transcription:
Promoter definition by mass genome annotation data: in silico primer extension EMBNET course Bioinformatics of transcriptional regulation Jan 28 2008 Christoph Schmid Regulation of eukaryotic transcription:
More informationFrom DNA to Protein: Genotype to Phenotype
12 From DNA to Protein: Genotype to Phenotype 12.1 What Is the Evidence that Genes Code for Proteins? The gene-enzyme relationship is one-gene, one-polypeptide relationship. Example: In hemoglobin, each
More informationTarget Enrichment Strategies for Next Generation Sequencing
Target Enrichment Strategies for Next Generation Sequencing Anuj Gupta, PhD Agilent Technologies, New Delhi Genotypic Conference, Sept 2014 NGS Timeline Information burst Nearly 30,000 human genomes sequenced
More informationGenome Annotation. What Does Annotation Describe??? Genome duplications Genes Mobile genetic elements Small repeats Genetic diversity
Genome Annotation Genome Sequencing Costliest aspect of sequencing the genome o But Devoid of content Genome must be annotated o Annotation definition Analyzing the raw sequence of a genome and describing
More informationGenomics and Gene Recognition Genes and Blue Genes
Genomics and Gene Recognition Genes and Blue Genes November 3, 2004 Eukaryotic Gene Structure eukaryotic genomes are considerably more complex than those of prokaryotes eukaryotic cells have organelles
More informationComputational Biology I LSM5191
Computational Biology I LSM5191 Lecture 5 Notes: Genetic manipulation & Molecular Biology techniques Broad Overview of: Enzymatic tools in Molecular Biology Gel electrophoresis Restriction mapping DNA
More informationEukaryotic Gene Prediction. Wei Zhu May 2007
Eukaryotic Gene Prediction Wei Zhu May 2007 In nature, nothing is perfect... - Alice Walker Gene Structure What is Gene Prediction? Gene prediction is the problem of parsing a sequence into nonoverlapping
More informationSection 10.3 Outline 10.3 How Is the Base Sequence of a Messenger RNA Molecule Translated into Protein?
Section 10.3 Outline 10.3 How Is the Base Sequence of a Messenger RNA Molecule Translated into Protein? Messenger RNA Carries Information for Protein Synthesis from the DNA to Ribosomes Ribosomes Consist
More informationEukaryotic Gene Structure
Eukaryotic Gene Structure Terminology Genome entire genetic material of an individual Transcriptome set of transcribed sequences Proteome set of proteins encoded by the genome 2 Gene Basic physical and
More informationBundle 6 Test Review
Bundle 6 Test Review DNA vs. RNA DNA Replication Gene Mutations- Protein Synthesis 1. Label the different components and complete the complimentary base pairing. What is this molecule called? Deoxyribonucleic
More informationRNA : functional role
RNA : functional role Hamad Yaseen, PhD MLS Department, FAHS Hamad.ali@hsc.edu.kw RNA mrna rrna trna 1 From DNA to Protein -Outline- From DNA to RNA From RNA to Protein From DNA to RNA Transcription: Copying
More informationThemes: RNA and RNA Processing. Messenger RNA (mrna) What is a gene? RNA is very versatile! RNA-RNA interactions are very important!
Themes: RNA is very versatile! RNA and RNA Processing Chapter 14 RNA-RNA interactions are very important! Prokaryotes and Eukaryotes have many important differences. Messenger RNA (mrna) Carries genetic
More informationQuantitative Real Time PCR USING SYBR GREEN
Quantitative Real Time PCR USING SYBR GREEN SYBR Green SYBR Green is a cyanine dye that binds to double stranded DNA. When it is bound to D.S. DNA it has a much greater fluorescence than when bound to
More informationDNA Structure and Replication, and Virus Structure and Replication Test Review
DNA Structure and Replication, and Virus Structure and Replication Test Review What does DNA stand for? Deoxyribonucleic Acid DNA is what type of macromolecule? DNA is a nucleic acid The building blocks
More informationProtein Synthesis Notes
Protein Synthesis Notes Protein Synthesis: Overview Transcription: synthesis of mrna under the direction of DNA. Translation: actual synthesis of a polypeptide under the direction of mrna. Transcription
More informationApplications and Uses. (adapted from Roche RealTime PCR Application Manual)
What Can You Do With qpcr? Applications and Uses (adapted from Roche RealTime PCR Application Manual) What is qpcr? Real time PCR also known as quantitative PCR (qpcr) measures PCR amplification as it
More informationMOLECULAR GENETICS PROTEIN SYNTHESIS. Molecular Genetics Activity #2 page 1
AP BIOLOGY MOLECULAR GENETICS ACTIVITY #2 NAME DATE HOUR PROTEIN SYNTHESIS Molecular Genetics Activity #2 page 1 GENETIC CODE PROTEIN SYNTHESIS OVERVIEW Molecular Genetics Activity #2 page 2 PROTEIN SYNTHESIS
More informationChromosomes. Chromosomes. Genes. Strands of DNA that contain all of the genes an organism needs to survive and reproduce
Chromosomes Chromosomes Strands of DNA that contain all of the genes an organism needs to survive and reproduce Genes Segments of DNA that specify how to build a protein genes may specify more than one
More informationLecture for Wednesday. Dr. Prince BIOL 1408
Lecture for Wednesday Dr. Prince BIOL 1408 THE FLOW OF GENETIC INFORMATION FROM DNA TO RNA TO PROTEIN Copyright 2009 Pearson Education, Inc. Genes are expressed as proteins A gene is a segment of DNA that
More information