elin grundberg JULY 2018 JK - PRESENTATION- 11 MINS [MIXED RESPONDENTS] [Other comments:]

Size: px
Start display at page:

Download "elin grundberg JULY 2018 JK - PRESENTATION- 11 MINS [MIXED RESPONDENTS] [Other comments:]"

Transcription

1 elin grundberg JULY 2018 JK - PRESENTATION- 11 MINS [MIXED RESPONDENTS] [Other comments:] So, we're going to be talking about epigenomic research, so, Elin, are you around? Come over. You can introduce yourself Elin, so, please, go ahead. Thank you. Yes, I am Elin Grundberg, and previously of MacEwan University in Canada, now relocated to the US, so, I will talk to you with both hats on, MacEwan, and the new institution, the Children's Mercy Kansas City. First, thank you Biobank for this invitation. I'm very excited to be here. I see myself talking on behalf of many of you here that I've spoken to earlier today, and on s, that are excited to see the possibilities of expanding the molecular profiling in UK Biobank, including epigenetic profiling, and I will show you some of our insights in doing this enlarged cohort, and, hopefully, we'll come up with a strong proposal and convince funders that this is the next layer of information that we need in UK Biobank. So, during the day today, I think we got an amazing overview about UK Biobank as a research, with deep phenotypes, but also, on the genetic side, genotyping spanning out from [unclear word 0:01:12.6] and now with Regeneron and [unclear word 0:01:14.8], with exome sequencing, as well as the whole genome sequencing, and similarly on the environmental lifestyle factor, how this is very well covered with the deep catalogue of information. So, what I will try to convince you is that we can use epigenetics to connect environmental with genetic factors and learn phenotypic risk and disease biology, and I will talk about how we can use epigenetic changes, or epigenetic variation, to interpret or translate genetic variation into regulatory causal pathways, and that epigenetic variation might be superior to the previous transcriptomic variation that was popularly use in complex disease studies, but also show you that environmental lifestyle factors actually can impact our epigenome, and cause epigenetic modifications, and that, in a similar way, can impact and identify causal or regulatory pathways for complex disease, and just touch upon the fact that, compared to genetic factors, the environmental factors encourage some epigenetic changes. We can get a consequence of that, based on the disease. So, as probably many of you know, this is what we do when we do an epigenome-wide association study, very much similar to a genome-wide association study. We pick our study populations of interest, either a case control, or a population base, and we do epigenetic profiling, either dependent on funding, or interest targeted by array, or we do whole genome or at least [unclear word 0:02:55.3] the sequencing base, and then we do large-scale association studies to find regions of the epigenome, in terms of marks that have an association to our disease of interest. I think many of you would agree with me that the most commonly used epigenetic trait in large-scale population-based studies is DNA methylation, and I will list three reasons why I think that's the trait to use. Obviously, we're dealing with very robust and reproduceable assets. Most of them are based on bisulphite conversion of our DNA, and we do have automated solutions in place, meaning that we can do this now in large scale, and we can both have microarray-based method, as well as NGS, that do this in a very accurate manner.

2 2 Obviously, DNA methylation benefits from being a quantitative trait. We're mostly dealing with bulk tissue, so we're dealing with a distribution of zero to 100 per cent of our trait, which makes it easier than other reprogenetic trace to do statistical analysis, but, obviously, as we're dealing with a single base pair, our solutions vary, and we do have up to 30 million of these sites that we can measure genome-wide, and also, compared to transcriptomic [unclear word 0:04:18.2], epigenetic, or DNA methylation is a stable trait, and we can use this, actually, as a predictor of regulatory element. So, what have we learned in doing epigenome variation, or DNA methylation in large scales, based on [unclear word 0:04:33.6]? We started, and I have a long-standing collaboration with The Twins UK, so we did this a couple of years ago now, where we were interested to see, if we do this genome-wide, using whole genome bisulphite sequencing, how is the DNA methylation variation dependent across individuals, across tissue, and across regions of the genome, and it was really striking to see that DNA methylation variation is very region-dependent. We do see various static patterns of DNA methylation in certain areas, such as the promotors, whereas the variability is really enriched in more distal regulatory elements as the enhancers. So, we were not keen on using the larger scale, the targeted array at that time, the 450k array, we saw that it's really a biosource attainment with a large proportion of those CPGs being in promotors. Similarly, we noticed that where we have an association with a complex, or a disease trait, these are also in a similar way as we've seen for genetic variants, enriched in these enhancer elements. Obviously, at that time, and still, the whole genome bisulphite sequencing is costly, but, also, that we're dealing with a large proportion that actually might not be very useful, as it's not variable across individuals. We wanted to see how we can improve the profiling of DNA methylation by still restricting to certain areas, so we initiated a collaboration with Roche NimbleGen to follow the same procedures as a whole genome bisulphite sequencing, so, a shotgun library, but instead of sequencing that full library, we target and enrich, similar as in an exome sequencing experiment, we enrich for regulatory elements which we were interested to do, and by doing so, we can do multiplexing, we can cut down the cost, and we can increase our coverage of the regions that we were interested. So, we're designing these panels based on our tissue of interest. Obviously, blood is a tissue that we do have most samples available from, so we design panels based on regulatory regions, specifically targeted for blood, so making sure that we're covering all regulatory elements in all, or at least the data that's available from different purified blood cells, and also, we include other types of regions that we think is useful. For instance, the CPGs that have been added on other types of methods. The benefit from this, we did genome-type at the same time so we make sure that we include also genetic variants, rarer variants that we can then genome-type at the same time. The bottom figures is basically showing what this looks like. Basically, the red bar showing the regions that we cover, and then the So, basically, this is what we cover, and this is when we're sequencing, and you can compare that with a whole genome. Obviously, it's not fully covered, but compared with 450k, we do cover densely the regions that we're interested in. By having full, or dense coverage of regulatory elements, we can also start to see the pattern of ethnogenetic association within an

3 3 element, and we have started to detect very interesting features. This is showing a statistical significant association with a trait, and those that are enhanced, there seems to be preferably mapping at the midpoint of the enhancers, and if we then limit to promotors, the significant disease epigenetic variants have a preferable association towards the gene regions. So, if we then mirror that, and see what do we have available on arrays, we do see that there's an underrepresentation of that region, specifically on promotors, which is the majority of the regions on this array, so we were kind of still interested to start, or continue to do this dense coverage of the regulatory elements, so if we do that in disease studies, how do they look? We used cigarette smoking as one of the proof of principle, environmental factors, that epigenome or environmental factor actually changed our epigenome quite dramatically, but also learn what is the advantage of doing dense regulatory sequencing over scatters, so we obviously confirmed this is the most famous chromosome five regions for smoking, the HRR. That is the most reproduceable, or replicable finding in epigenome-wide association studies, but this is the enhancer that has been strongly associated with the activity, and expression of this gene, but we do also see two additional regulatory elements that shows signals and been affected by smoking. But, maybe most interesting is that we would look in these specific regions that we know has strongest effect, based on smoking, where the blue line is actually seeing if we sequenced that element in higher depth, we do fine-map the association, and have a CPG with a much better predictable scores when we're trying to predict an individual's smoking status. Then the green one, which is the CPG that has been on the array based, but also we were interested to see if we have this preferentially mapping of epigenetic variants within regulatory element. Is that useful, does it tell us something, so we can zoom in and look at these associations in the midpoints, as well as in the downstream of the promotor, and actually learn how the transcription regulation is linked to this. We find, in this case, it's metabolic disease associations, and we see strong enrichment of transcription factors that are binding in these specific regions, and we can follow up and do additional thing. We can obviously do our replication, but we can link it to gene expression, and then do similar thing. Instead of doing the regulatory element and transcription factor, per se, we can see the downstream genes and do pathway analysis, and actually see that similar pathways are coming up as our transcription factor binding analysis. Then we zoom in on one of these genes that has been strongly linked to this metabolic trait. We know it's the vascular and endothelial growth factor. There is a genome-wide association study using the UK Biobank data here, strongly showing an association as downstream with the gene. We identify a regulatory element, rather far upstream of the gene. It's a tissue-specific enhancer, robustly associated with a metabolic trait that's relatively linked to the GWAS trait, and, obviously, with the genetic information in these cohorts, we could integrate that and see that the association of epigenetic variation in this region is non-genetic, so that's basically, potentially indicating that we have an additive effect of a locus that is relevant to this disease. Obviously, as I mentioned, the key feature of the epigenetic or molecular profiling is to understand genetic susceptibility, and, obviously, we can do this too with DNA methylation, and, as I said, even better

4 4 than using gene expression. I take this example which Nicole Serrant, as we heard earlier today, published together with Tommy [?Preston 0:12:25.3] at our institution a few years ago, using the blueprint data, basically collecting a large number of individuals with various different blood cells, and different layers of molecular trait, and trying to see what is the co-localisation of a disease, where it shows molecular, and, actually, that the phytogene expression only accounts for a minority of the disease link, where the majority is actually epigenetic variation, and specifically DNA methylation as a strong molecular trait, explaining the link between a genetic variance and the disease outcome. Again, we can zoom in and take an example to try to link it to data from the UK Biobank. This is an interesting signal from the Blood Pressure Consortium, using UK Biobank data, and similar, here is a tissue-specific only occurring in blood tissue, a strong regulatory element with robust epigenetic variation of DNA methylation within the GDF7 gene, and, compared to the previous example I took if we integrate genetic information with this DNA methylation, we see a perfectly nicely co-localisation where further integration with gene expression data actually can indicate that we're dealing with this gene that seems to be genetically modulated through epigenetic variation at this locus. Finally, as we're dealing with epigenetic tissue specificity, and talking about blood tissue, a tissue heterogeneity is a key thing to consider. It matters, it's important, and we need to account for that. I, personally, was less worried about this than the previous year, basically, since we know we can actually correct for this relatively well. Many of you have seen the vast amount of available packages, basically, reference-free, referencebased, and mainly our supporting array-based data. What we actually do, we can actually do supervised prediction of cell counts in our next generation sequencing-based data, by basically identifying CPGs that are very specific to a blood cell type, and we can use publicly available data for up to [unclear word 0:14:49.5] purified cell type, really by using stringent criteria, and identify hundreds of CPGs that are only active or hypermethylating in a cell type, and using those then to fetch the average methylation in whole blood data, we do see that they vey well correlate with our lab tests, meaning that we actually can use this in statistical analysis, in cohorts where blood counts are not available, and, actually adjusting for this in a much more accurate manner than if we have the CBC test alone. So, I'll summarise, and try to highlight why I think epigenetics is an important and useful trait in large resources, such as the UK Biobank. We do know that we can discover regulatory pathway that underlies complex trait and diseases. These can both be genetically-driven, but with epigenetic, we can actually pick up environmental, or lifestyle-driven effects. We believe that the higher resolution profiling regulatory element has a great advantage, as it has this opportunity to add additional insight into regulatory pathways. DNA methylation really is a perfect molecular trait to annotate the punitive functional consequences of genetic variant, and, again, if we're doing very dense coverage of DNA methylation, we'll be able to cover the majority of these regulatory disturbances that non-coding genetic variants cause. Finally, the opportunity to add epigenetic profiling, really and have the potential to allow us to build this combined risk scores, maybe, including epigenetic variance as well, and for having prediction of later life

5 5 outcomes, and, obviously, we're really dependent on large cohorts and [unclear word 0:16:46.3] type studies in order to do that, and obviously UK Biobank is the perfect tool for that, so I will end there. Just highlight everyone who has been involved in our project, but specifically the UK Biobank team, with Rory Collins, and Paul Matthews, and both Jimmy Bell and Gemma that's here, that's been very supportive in discussing this, and hopefully we'll come up with a strong proposal and convince funders that epigenetic profiling is the next trait to be asset. Thank you. [Applause] [END OF TRANSCRIPT]

Aris Baras JULY 2018 CRI - PRESENTATION - 17 MINS [MALE RESPONDENT] [Other comments:]

Aris Baras JULY 2018 CRI - PRESENTATION - 17 MINS [MALE RESPONDENT] [Other comments:] Aris Baras JULY 2018 CRI - PRESENTATION - 17 MINS [MALE RESPONDENT] [Other comments:] Thanks to Sekar for taking us through a good example about having this population scale of genome typing data available

More information

SNPs - GWAS - eqtls. Sebastian Schmeier

SNPs - GWAS - eqtls. Sebastian Schmeier SNPs - GWAS - eqtls s.schmeier@gmail.com http://sschmeier.github.io/bioinf-workshop/ 17.08.2015 Overview Single nucleotide polymorphism (refresh) SNPs effect on genes (refresh) Genome-wide association

More information

NGS Approaches to Epigenomics

NGS Approaches to Epigenomics I519 Introduction to Bioinformatics, 2013 NGS Approaches to Epigenomics Yuzhen Ye (yye@indiana.edu) School of Informatics & Computing, IUB Contents Background: chromatin structure & DNA methylation Epigenomic

More information

Introducing QIAseq. Accelerate your NGS performance through Sample to Insight solutions. Sample to Insight

Introducing QIAseq. Accelerate your NGS performance through Sample to Insight solutions. Sample to Insight Introducing QIAseq Accelerate your NGS performance through Sample to Insight solutions Sample to Insight From Sample to Insight let QIAGEN enhance your NGS-based research High-throughput next-generation

More information

DNA METHYLATION RESEARCH TOOLS

DNA METHYLATION RESEARCH TOOLS SeqCap Epi Enrichment System Revolutionize your epigenomic research DNA METHYLATION RESEARCH TOOLS Methylated DNA The SeqCap Epi System is a set of target enrichment tools for DNA methylation assessment

More information

Axiom Biobank Genotyping Solution

Axiom Biobank Genotyping Solution TCCGGCAACTGTA AGTTACATCCAG G T ATCGGCATACCA C AGTTAATACCAG A Axiom Biobank Genotyping Solution The power of discovery is in the design GWAS has evolved why and how? More than 2,000 genetic loci have been

More information

Bioinformatics Advice on Experimental Design

Bioinformatics Advice on Experimental Design Bioinformatics Advice on Experimental Design Where do I start? Please refer to the following guide to better plan your experiments for good statistical analysis, best suited for your research needs. Statistics

More information

Crash-course in genomics

Crash-course in genomics Crash-course in genomics Molecular biology : How does the genome code for function? Genetics: How is the genome passed on from parent to child? Genetic variation: How does the genome change when it is

More information

What Can the Epigenome Teach Us About Cellular States and Diseases?

What Can the Epigenome Teach Us About Cellular States and Diseases? What Can the Epigenome Teach Us About Cellular States and Diseases? (a computer scientist s view) Luca Pinello Outline Epigenetic: the code over the code What can we learn from epigenomic data? Resources

More information

The MiniSeq System. Explore the possibilities. Discover demonstrated NGS workflows for molecular biology applications.

The MiniSeq System. Explore the possibilities. Discover demonstrated NGS workflows for molecular biology applications. The MiniSeq System. Explore the possibilities. Discover demonstrated NGS workflows for molecular biology applications. Let your work flow with Illumina NGS. The MiniSeq System delivers powerful and cost-effective

More information

Introduction to Bioinformatics and Gene Expression Technology

Introduction to Bioinformatics and Gene Expression Technology Vocabulary Introduction to Bioinformatics and Gene Expression Technology Utah State University Spring 2014 STAT 5570: Statistical Bioinformatics Notes 1.1 Gene: Genetics: Genome: Genomics: hereditary DNA

More information

Whole genome sequencing in the UK Biobank

Whole genome sequencing in the UK Biobank Whole genome sequencing in the UK Biobank Part of the UK Government s Industrial Strategy Challenge Fund (ISCF) for the Data to Early Diagnosis and Precision Medicine initiative Aim to produce deep characterisation

More information

NimbleGen Arrays and LightCycler 480 System: A Complete Workflow for DNA Methylation Biomarker Discovery and Validation.

NimbleGen Arrays and LightCycler 480 System: A Complete Workflow for DNA Methylation Biomarker Discovery and Validation. Cancer Research Application Note No. 9 NimbleGen Arrays and LightCycler 480 System: A Complete Workflow for DNA Methylation Biomarker Discovery and Validation Tomasz Kazimierz Wojdacz, PhD Institute of

More information

Result Tables The Result Table, which indicates chromosomal positions and annotated gene names, promoter regions and CpG islands, is the best way for

Result Tables The Result Table, which indicates chromosomal positions and annotated gene names, promoter regions and CpG islands, is the best way for Result Tables The Result Table, which indicates chromosomal positions and annotated gene names, promoter regions and CpG islands, is the best way for you to discover methylation changes at specific genomic

More information

From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow

From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow Technical Overview Import VCF Introduction Next-generation sequencing (NGS) studies have created unanticipated challenges with

More information

Massive Analysis of cdna Ends for simultaneous Genotyping and Transcription Profiling in High Throughput

Massive Analysis of cdna Ends for simultaneous Genotyping and Transcription Profiling in High Throughput Next Generation (Sequencing) Tools for Nucleotide-Based Information Massive Analysis of cdna Ends for simultaneous Genotyping and Transcription Profiling in High Throughput Björn Rotter, PhD GenXPro GmbH,

More information

2/19/13. Contents. Applications of HMMs in Epigenomics

2/19/13. Contents. Applications of HMMs in Epigenomics 2/19/13 I529: Machine Learning in Bioinformatics (Spring 2013) Contents Applications of HMMs in Epigenomics Yuzhen Ye School of Informatics and Computing Indiana University, Bloomington Spring 2013 Background:

More information

Redefine what s possible with the Axiom Genotyping Solution

Redefine what s possible with the Axiom Genotyping Solution Redefine what s possible with the Axiom Genotyping Solution From discovery to translation on a single platform The Axiom Genotyping Solution enables enhanced genotyping studies to accelerate your research

More information

Applications of HMMs in Epigenomics

Applications of HMMs in Epigenomics I529: Machine Learning in Bioinformatics (Spring 2013) Applications of HMMs in Epigenomics Yuzhen Ye School of Informatics and Computing Indiana University, Bloomington Spring 2013 Contents Background:

More information

Potential of human genome sequencing. Paul Pharoah Reader in Cancer Epidemiology University of Cambridge

Potential of human genome sequencing. Paul Pharoah Reader in Cancer Epidemiology University of Cambridge Potential of human genome sequencing Paul Pharoah Reader in Cancer Epidemiology University of Cambridge Key considerations Strength of association Exposure Genetic model Outcome Quantitative trait Binary

More information

Ecological genomics and molecular adaptation: state of the Union and some research goals for the near future.

Ecological genomics and molecular adaptation: state of the Union and some research goals for the near future. Ecological genomics and molecular adaptation: state of the Union and some research goals for the near future. Louis Bernatchez Genomics and Conservation of Aquatic Resources Université LAVAL! Molecular

More information

Genomic solutions for complex disease

Genomic solutions for complex disease Genomic solutions for complex disease Power your with our genomic solutions Access a breadth of applications. Gain a depth of insights. To enhance their understanding of complex disease, researchers are

More information

www.illumina.com/hiseq www.illumina.com FOR RESEARCH USE ONLY 2012 2014 Illumina, Inc. All rights reserved. Illumina, BaseSpace, cbot, CSPro, Genetic Energy, HiSeq, Nextera, TruSeq, the pumpkin orange

More information

INTRODUCTION TO MOLECULAR GENETICS. Andrew McQuillin Molecular Psychiatry Laboratory UCL Division of Psychiatry 22 Sept 2017

INTRODUCTION TO MOLECULAR GENETICS. Andrew McQuillin Molecular Psychiatry Laboratory UCL Division of Psychiatry 22 Sept 2017 INTRODUCTION TO MOLECULAR GENETICS Andrew McQuillin Molecular Psychiatry Laboratory UCL Division of Psychiatry 22 Sept 2017 Learning Objectives Understand: The distinction between Quantitative Genetic

More information

2/10/17. Contents. Applications of HMMs in Epigenomics

2/10/17. Contents. Applications of HMMs in Epigenomics 2/10/17 I529: Machine Learning in Bioinformatics (Spring 2017) Contents Applications of HMMs in Epigenomics Yuzhen Ye School of Informatics and Computing Indiana University, Bloomington Spring 2017 Background:

More information

DNA. bioinformatics. genomics. personalized. variation NGS. trio. custom. assembly gene. tumor-normal. de novo. structural variation indel.

DNA. bioinformatics. genomics. personalized. variation NGS. trio. custom. assembly gene. tumor-normal. de novo. structural variation indel. DNA Sequencing T TM variation DNA amplicon mendelian trio genomics NGS bioinformatics tumor-normal custom SNP resequencing target validation de novo prediction personalized comparative genomics exome private

More information

Global Screening Array (GSA)

Global Screening Array (GSA) Technical overview - Infinium Global Screening Array (GSA) with optional Multi-disease drop in (MD) The Infinium Global Screening Array (GSA) combines a highly optimized, universal genome-wide backbone,

More information

DNA. bioinformatics. epigenetics methylation structural variation. custom. assembly. gene. tumor-normal. mendelian. BS-seq. prediction.

DNA. bioinformatics. epigenetics methylation structural variation. custom. assembly. gene. tumor-normal. mendelian. BS-seq. prediction. Epigenomics T TM activation SNP target ncrna validation metagenomics genetics private RRBS-seq de novo trio RIP-seq exome mendelian comparative genomics DNA NGS ChIP-seq bioinformatics assembly tumor-normal

More information

Welcome to this IBM podcast. What is product. line engineering? I'm Angelique Matheny with IBM. It's not

Welcome to this IBM podcast. What is product. line engineering? I'm Angelique Matheny with IBM. It's not IBM Podcast [ MUSIC ] MATHENY: Welcome to this IBM podcast. What is product line engineering? I'm Angelique Matheny with IBM. It's not easy to build a smarter product. Now try to build more than one at

More information

From Genotype to Phenotype

From Genotype to Phenotype From Genotype to Phenotype Johanna Vilkki Green technology, Natural Resources Institute Finland Systems biology Genome Transcriptome genes mrna Genotyping methodology SNP TOOLS, WG SEQUENCING Functional

More information

Bi 1x Spring 2016: LacI Titration

Bi 1x Spring 2016: LacI Titration Bi 1x Spring 2016: LacI Titration 1 Overview In this experiment, you will measure the effect of various mutated LacI repressor ribosome binding sites in an E. coli cell by measuring the expression of a

More information

Gene Regulation Solutions. Microarrays and Next-Generation Sequencing

Gene Regulation Solutions. Microarrays and Next-Generation Sequencing Gene Regulation Solutions Microarrays and Next-Generation Sequencing Gene Regulation Solutions The Microarrays Advantage Microarrays Lead the Industry in: Comprehensive Content SurePrint G3 Human Gene

More information

Computational Workflows for Genome-Wide Association Study: I

Computational Workflows for Genome-Wide Association Study: I Computational Workflows for Genome-Wide Association Study: I Department of Computer Science Brown University, Providence sorin@cs.brown.edu October 16, 2014 Outline 1 Outline 2 3 Monogenic Mendelian Diseases

More information

7. For What Molecule Do Genes Contain The

7. For What Molecule Do Genes Contain The 7. For What Molecule Do Genes Contain The Instructions For Building For what molecule do genes contain the instructions for building? Are there many or few acetyl molecules attached to the genes associated

More information

MITOCW MIT7_01SCF11_track10_300k.mp4

MITOCW MIT7_01SCF11_track10_300k.mp4 MITOCW MIT7_01SCF11_track10_300k.mp4 Now having cut DNA, we then need to do something else. What's the next thing we have to do? Paste it. Paste it right. Oh, I put past. It should be paste. There we go.

More information

Bi8 Lecture 19. Review and Practice Questions March 8th 2016

Bi8 Lecture 19. Review and Practice Questions March 8th 2016 Bi8 Lecture 19 Review and Practice Questions March 8th 2016 Common Misconceptions Where Words Matter Common Misconceptions DNA vs RNA vs Protein Su(H) vs Su(H) Replication Transcription Transcribed vs

More information

Introduction to RNA-Seq in GeneSpring NGS Software

Introduction to RNA-Seq in GeneSpring NGS Software Introduction to RNA-Seq in GeneSpring NGS Software Dipa Roy Choudhury, Ph.D. Strand Scientific Intelligence and Agilent Technologies Learn more at www.genespring.com Introduction to RNA-Seq In a few years,

More information

latestdevelopments relevant for the Ag sector André Eggen Agriculture Segment Manager, Europe

latestdevelopments relevant for the Ag sector André Eggen Agriculture Segment Manager, Europe Overviewof Illumina s latestdevelopments relevant for the Ag sector André Eggen Agriculture Segment Manager, Europe Seminar der Studienrichtung Tierwissenschaften, TÜM, July 1, 2009 Overviewof Illumina

More information

resequencing storage SNP ncrna metagenomics private trio de novo exome ncrna RNA DNA bioinformatics RNA-seq comparative genomics

resequencing storage SNP ncrna metagenomics private trio de novo exome ncrna RNA DNA bioinformatics RNA-seq comparative genomics RNA Sequencing T TM variation genetics validation SNP ncrna metagenomics private trio de novo exome mendelian ChIP-seq RNA DNA bioinformatics custom target high-throughput resequencing storage ncrna comparative

More information

AN EVALUATION OF POWER TO DETECT LOW-FREQUENCY VARIANT ASSOCIATIONS USING ALLELE-MATCHING TESTS THAT ACCOUNT FOR UNCERTAINTY

AN EVALUATION OF POWER TO DETECT LOW-FREQUENCY VARIANT ASSOCIATIONS USING ALLELE-MATCHING TESTS THAT ACCOUNT FOR UNCERTAINTY AN EVALUATION OF POWER TO DETECT LOW-FREQUENCY VARIANT ASSOCIATIONS USING ALLELE-MATCHING TESTS THAT ACCOUNT FOR UNCERTAINTY E. ZEGGINI and J.L. ASIMIT Wellcome Trust Sanger Institute, Hinxton, CB10 1HH,

More information

What is Epigenetics? Watch the video

What is Epigenetics? Watch the video EPIGENETICS What is Epigenetics? The study of environmental factors on gene expression in DNA. The molecule is called methylation controls when genes are turned on. Methylation turns off genes. Acetylation

More information

BTRY 7210: Topics in Quantitative Genomics and Genetics

BTRY 7210: Topics in Quantitative Genomics and Genetics BTRY 7210: Topics in Quantitative Genomics and Genetics Jason Mezey Biological Statistics and Computational Biology (BSCB) Department of Genetic Medicine jgm45@cornell.edu Spring 2015, Thurs.,12:20-1:10

More information

res2)=)colocalisation(eqtl_file="icoslg_eqtl.txt",) gwas_file="uc_gwas_icoslg_locus.txt",)p1)=)1e+4,)p2)=)1e+4,)p12)=)1e+5,) region)=)200000)$

res2)=)colocalisation(eqtl_file=icoslg_eqtl.txt,) gwas_file=uc_gwas_icoslg_locus.txt,)p1)=)1e+4,)p2)=)1e+4,)p12)=)1e+5,) region)=)200000)$ Homework 7 res1)=)colocalisation(eqtl_file="ptk2b_eqtl.txt",) gwas_file="ad_gwas_ptk2b_locus.txt",)p1)=)1e+4,)p2)=)1e+4,)p12)=)1e+5,) region)=)200000)$ ##))1.02e+06))2.43e+05))6.80e+04))1.51e+02))9.84e+01)$

More information

Basics of RNA-Seq. (With a Focus on Application to Single Cell RNA-Seq) Michael Kelly, PhD Team Lead, NCI Single Cell Analysis Facility

Basics of RNA-Seq. (With a Focus on Application to Single Cell RNA-Seq) Michael Kelly, PhD Team Lead, NCI Single Cell Analysis Facility 2018 ABRF Meeting Satellite Workshop 4 Bridging the Gap: Isolation to Translation (Single Cell RNA-Seq) Sunday, April 22 Basics of RNA-Seq (With a Focus on Application to Single Cell RNA-Seq) Michael Kelly,

More information

From reads to results: differential. Alicia Oshlack Head of Bioinformatics

From reads to results: differential. Alicia Oshlack Head of Bioinformatics From reads to results: differential expression analysis with ihrna seq Alicia Oshlack Head of Bioinformatics Murdoch Childrens Research Institute Benefits and opportunities ii of RNA seq All transcripts

More information

Gene expression microarrays and assays. Because your results can t wait

Gene expression microarrays and assays. Because your results can t wait Gene expression microarrays and assays Because your results can t wait A simple path from data to decision-making The power of expression microarrays Transcriptome-wide analysis can be complex. Matching

More information

Introduction to RNA-Seq. David Wood Winter School in Mathematics and Computational Biology July 1, 2013

Introduction to RNA-Seq. David Wood Winter School in Mathematics and Computational Biology July 1, 2013 Introduction to RNA-Seq David Wood Winter School in Mathematics and Computational Biology July 1, 2013 Abundance RNA is... Diverse Dynamic Central DNA rrna Epigenetics trna RNA mrna Time Protein Abundance

More information

Exome Sequencing Exome sequencing is a technique that is used to examine all of the protein-coding regions of the genome.

Exome Sequencing Exome sequencing is a technique that is used to examine all of the protein-coding regions of the genome. Glossary of Terms Genetics is a term that refers to the study of genes and their role in inheritance the way certain traits are passed down from one generation to another. Genomics is the study of all

More information

CS273B: Deep Learning in Genomics and Biomedicine. Recitation 1 30/9/2016

CS273B: Deep Learning in Genomics and Biomedicine. Recitation 1 30/9/2016 CS273B: Deep Learning in Genomics and Biomedicine. Recitation 1 30/9/2016 Topics Genetic variation Population structure Linkage disequilibrium Natural disease variants Genome Wide Association Studies Gene

More information

Introduction to Bioinformatics and Gene Expression Technologies

Introduction to Bioinformatics and Gene Expression Technologies Introduction to Bioinformatics and Gene Expression Technologies Utah State University Fall 2017 Statistical Bioinformatics (Biomedical Big Data) Notes 1 1 Vocabulary Gene: hereditary DNA sequence at a

More information

Introduction to Bioinformatics and Gene Expression Technologies

Introduction to Bioinformatics and Gene Expression Technologies Vocabulary Introduction to Bioinformatics and Gene Expression Technologies Utah State University Fall 2017 Statistical Bioinformatics (Biomedical Big Data) Notes 1 Gene: Genetics: Genome: Genomics: hereditary

More information

The New Genome Analyzer IIx Delivering more data, faster, and easier than ever before. Jeremy Preston, PhD Marketing Manager, Sequencing

The New Genome Analyzer IIx Delivering more data, faster, and easier than ever before. Jeremy Preston, PhD Marketing Manager, Sequencing The New Genome Analyzer IIx Delivering more data, faster, and easier than ever before Jeremy Preston, PhD Marketing Manager, Sequencing Illumina Genome Analyzer: a Paradigm Shift 2000x gain in efficiency

More information

Serial Analysis of Gene Expression

Serial Analysis of Gene Expression Serial Analysis of Gene Expression Cloning of Tissue-Specific Genes Using SAGE and a Novel Computational Substraction Approach. Genomic (2001) Hung-Jui Shih Outline of Presentation SAGE EST Article TPE

More information

Deep Sequencing technologies

Deep Sequencing technologies Deep Sequencing technologies Gabriela Salinas 30 October 2017 Transcriptome and Genome Analysis Laboratory http://www.uni-bc.gwdg.de/index.php?id=709 Microarray and Deep-Sequencing Core Facility University

More information

GENOMICS WORKFLOW SOLUTIONS THAT GO WHERE THE SCIENCE LEADS. Genomics Solutions Portfolio

GENOMICS WORKFLOW SOLUTIONS THAT GO WHERE THE SCIENCE LEADS. Genomics Solutions Portfolio GENOMICS WORKFLOW SOLUTIONS THAT GO WHERE THE SCIENCE LEADS Genomics Solutions Portfolio WORKFLOW SOLUTIONS FROM EXTRACTION TO ANALYSIS Application-based answers for every step of your workflow Scientists

More information

Understanding the Genetic Architecture of Complex Human Diseases: Biomedicine, Statistics, and Large Genomic Data Sets Josée Dupuis

Understanding the Genetic Architecture of Complex Human Diseases: Biomedicine, Statistics, and Large Genomic Data Sets Josée Dupuis Understanding the Genetic Architecture of Complex Human Diseases: Biomedicine, Statistics, and Large Genomic Data Sets Josée Dupuis Professor and Interim Chair Department of Biostatistics Boston University

More information

Transcriptome Assembly, Functional Annotation (and a few other related thoughts)

Transcriptome Assembly, Functional Annotation (and a few other related thoughts) Transcriptome Assembly, Functional Annotation (and a few other related thoughts) Monica Britton, Ph.D. Sr. Bioinformatics Analyst June 23, 2017 Differential Gene Expression Generalized Workflow File Types

More information

Section C: The Control of Gene Expression

Section C: The Control of Gene Expression Section C: The Control of Gene Expression 1. Each cell of a multicellular eukaryote expresses only a small fraction of its genes 2. The control of gene expression can occur at any step in the pathway from

More information

Total genomic solutions for biobanks. Maximizing the value of your specimens.

Total genomic solutions for biobanks. Maximizing the value of your specimens. Total genomic solutions for biobanks. Maximizing the value of your specimens. Unlock the true potential of your biological samples. Greater understanding. Increased value. Value-driven biobanking. Now

More information

Satellite Education Workshop (SW4): Epigenomics: Design, Implementation and Analysis for RNA-seq and Methyl-seq Experiments

Satellite Education Workshop (SW4): Epigenomics: Design, Implementation and Analysis for RNA-seq and Methyl-seq Experiments Satellite Education Workshop (SW4): Epigenomics: Design, Implementation and Analysis for RNA-seq and Methyl-seq Experiments Saturday March 17, 2012 Orlando, Florida Workshop Description: This full day

More information

Lecture 22 Eukaryotic Genes and Genomes III

Lecture 22 Eukaryotic Genes and Genomes III Lecture 22 Eukaryotic Genes and Genomes III In the last three lectures we have thought a lot about analyzing a regulatory system in S. cerevisiae, namely Gal regulation that involved a hand full of genes.

More information

BBMRI.NL A story of Sharing Eline Slagboom Molecular Epidemiology, Leiden University Medical Center Brussel 20 Juni 2017

BBMRI.NL A story of Sharing Eline Slagboom Molecular Epidemiology, Leiden University Medical Center Brussel 20 Juni 2017 BBMRI.NL A story of Sharing Eline Slagboom Molecular Epidemiology, Leiden University Medical Center Brussel 20 Juni 2017 BBMRI-NL TOWARDS A NATIONAL BIOBANKING INFRASTRUCTURE Founded in 2009 as the Dutch

More information

About Strand NGS. Strand Genomics, Inc All rights reserved.

About Strand NGS. Strand Genomics, Inc All rights reserved. About Strand NGS Strand NGS-formerly known as Avadis NGS, is an integrated platform that provides analysis, management and visualization tools for next-generation sequencing data. It supports extensive

More information

Go to Bottom Left click WashU Epigenome Browser. Click

Go to   Bottom Left click WashU Epigenome Browser. Click Now you are going to look at the Human Epigenome Browswer. It has a more sophisticated but weirder interface than the UCSC Genome Browser. All the data that you will view as tracks is in reality just files

More information

Targeted RNA sequencing reveals the deep complexity of the human transcriptome.

Targeted RNA sequencing reveals the deep complexity of the human transcriptome. Targeted RNA sequencing reveals the deep complexity of the human transcriptome. Tim R. Mercer 1, Daniel J. Gerhardt 2, Marcel E. Dinger 1, Joanna Crawford 1, Cole Trapnell 3, Jeffrey A. Jeddeloh 2,4, John

More information

Show notes for today's conversation are available at the podcast web site.

Show notes for today's conversation are available at the podcast web site. Protecting Information Privacy: How To and Lessons Learned Transcript Part 1: Why Should Privacy Be on A Business Leader s Radar Screen? Julia Allen: Welcome to CERT's Podcast Series: Security for Business

More information

CLONING. By Mario De Greiff Meraz

CLONING. By Mario De Greiff Meraz CLONING By Mario De Greiff Meraz WHAT IS ITS PURPOSE? In biochemistry (which is a mix of chemistry and biology) a clone is a replicate (a fragment of DNA placed in an organism) so that there is enough

More information

Deep learning sequence-based ab initio prediction of variant effects on expression and disease risk

Deep learning sequence-based ab initio prediction of variant effects on expression and disease risk Summer Review 7 Deep learning sequence-based ab initio prediction of variant effects on expression and disease risk Jian Zhou 1,2,3, Chandra L. Theesfeld 1, Kevin Yao 3, Kathleen M. Chen 3, Aaron K. Wong

More information

Genome-wide association studies (GWAS) Part 1

Genome-wide association studies (GWAS) Part 1 Genome-wide association studies (GWAS) Part 1 Matti Pirinen FIMM, University of Helsinki 03.12.2013, Kumpula Campus FIMM - Institiute for Molecular Medicine Finland www.fimm.fi Published Genome-Wide Associations

More information

The first and only fully-integrated microarray instrument for hands-free array processing

The first and only fully-integrated microarray instrument for hands-free array processing The first and only fully-integrated microarray instrument for hands-free array processing GeneTitan Instrument Transform your lab with a GeneTitan Instrument and experience the unparalleled power of streamlining

More information

Human Genetic Variation. Ricardo Lebrón Dpto. Genética UGR

Human Genetic Variation. Ricardo Lebrón Dpto. Genética UGR Human Genetic Variation Ricardo Lebrón rlebron@ugr.es Dpto. Genética UGR What is Genetic Variation? Origins of Genetic Variation Genetic Variation is the difference in DNA sequences between individuals.

More information

Measuring and Understanding Gene Expression

Measuring and Understanding Gene Expression Measuring and Understanding Gene Expression Dr. Lars Eijssen Dept. Of Bioinformatics BiGCaT Sciences programme 2014 Why are genes interesting? TRANSCRIPTION Genome Genomics Transcriptome Transcriptomics

More information

Introduction to Microarray Analysis

Introduction to Microarray Analysis Introduction to Microarray Analysis Methods Course: Gene Expression Data Analysis -Day One Rainer Spang Microarrays Highly parallel measurement devices for gene expression levels 1. How does the microarray

More information

Do I need to open a store?

Do I need to open a store? Do you know anyone who doesn't own at least one t-shirt? Almost every person owns at least one t-shirt. Take a look at a street in Spring, Summer or Autumn. How many of these pedestrians wear t-shirts?

More information

Experimental design of RNA-Seq Data

Experimental design of RNA-Seq Data Experimental design of RNA-Seq Data RNA-seq course: The Power of RNA-seq Thursday June 6 th 2013, Marco Bink Biometris Overview Acknowledgements Introduction Experimental designs Randomization, Replication,

More information

solid S Y S T E M s e q u e n c i n g See the Difference Discover the Quality Genome

solid S Y S T E M s e q u e n c i n g See the Difference Discover the Quality Genome solid S Y S T E M s e q u e n c i n g See the Difference Discover the Quality Genome See the Difference With a commitment to your peace of mind, Life Technologies provides a portfolio of robust and scalable

More information

AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE

AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE ACCELERATING PROGRESS IS IN OUR GENES AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE GENESPRING GENE EXPRESSION (GX) MASS PROFILER PROFESSIONAL (MPP) PATHWAY ARCHITECT (PA) See Deeper. Reach Further. BIOINFORMATICS

More information

Illumina s Suite of Targeted Resequencing Solutions

Illumina s Suite of Targeted Resequencing Solutions Illumina s Suite of Targeted Resequencing Solutions Colin Baron Sr. Product Manager Sequencing Applications 2011 Illumina, Inc. All rights reserved. Illumina, illuminadx, Solexa, Making Sense Out of Life,

More information

Genomics Resources in WHI. WHI ( ) Extension Study Steering Committee Meeting Seattle, WA May 05-06, 2011

Genomics Resources in WHI. WHI ( ) Extension Study Steering Committee Meeting Seattle, WA May 05-06, 2011 Genomics Resources in WHI WHI (2010-2015) Extension Study Steering Committee Meeting Seattle, WA May 05-06, 2011 WHI Genomic Resources in dbgap Outcomes and traits in AA and Hispanics GWAS-SHARe Sequencing-ESP

More information

Tutorial. Bisulfite Sequencing. Sample to Insight. September 15, 2016

Tutorial. Bisulfite Sequencing. Sample to Insight. September 15, 2016 Bisulfite Sequencing September 15, 2016 Sample to Insight CLC bio, a QIAGEN Company Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.clcbio.com support-clcbio@qiagen.com Bisulfite

More information

RNA-Seq Now What? BIS180L Professor Maloof May 24, 2018

RNA-Seq Now What? BIS180L Professor Maloof May 24, 2018 RNA-Seq Now What? BIS180L Professor Maloof May 24, 2018 We have differentially expressed genes, what do we want to know about them? We have differentially expressed genes, what do we want to know about

More information

Following text taken from Suresh Kumar. Bioinformatics Web - Comprehensive educational resource on Bioinformatics. 6th May.2005

Following text taken from Suresh Kumar. Bioinformatics Web - Comprehensive educational resource on Bioinformatics. 6th May.2005 Bioinformatics is the recording, annotation, storage, analysis, and searching/retrieval of nucleic acid sequence (genes and RNAs), protein sequence and structural information. This includes databases of

More information

7.03, 2006, Lecture 23 Eukaryotic Genes and Genomes IV

7.03, 2006, Lecture 23 Eukaryotic Genes and Genomes IV 1 Fall 2006 7.03 7.03, 2006, Lecture 23 Eukaryotic Genes and Genomes IV In the last three lectures we have thought a lot about analyzing a regulatory system in S. cerevisiae, namely Gal regulation that

More information

7.05, 2005, Lecture 23 Eukaryotic Genes and Genomes IV

7.05, 2005, Lecture 23 Eukaryotic Genes and Genomes IV 7.05, 2005, Lecture 23 Eukaryotic Genes and Genomes IV In the last three lectures we have thought a lot about analyzing a regulatory system in S. cerevisiae, namely Gal regulation that involved a hand

More information

Machine learning applications in genomics: practical issues & challenges. Yuzhen Ye School of Informatics and Computing, Indiana University

Machine learning applications in genomics: practical issues & challenges. Yuzhen Ye School of Informatics and Computing, Indiana University Machine learning applications in genomics: practical issues & challenges Yuzhen Ye School of Informatics and Computing, Indiana University Reference Machine learning applications in genetics and genomics

More information

Integrated Environmental Assessment and Management

Integrated Environmental Assessment and Management Integrated Environmental Assessment and Management IEAM Podcast 8.4 Transcript* Title: California Sediment Quality Objectives, with Steve Bay JENNY SHAW: Hello, and welcome to the podcast for the journal,

More information

All research lines at the LMU have a strong methodological focus

All research lines at the LMU have a strong methodological focus Content: Genetic Epidemiology and Statistical Genetics in Complex Diseases... 2 Molecular Epidemiology of Complex Phenotypes... 2 Clinical Trials and Translational Medicine... 3 Clinical Epidemiology and

More information

The Genomic Transformation of Health

The Genomic Transformation of Health The Genomic Transformation of Health or: an introduction to the potential of genomics in healthcare Dr Tom Connor School of Biosciences and Systems Immunity University Research Institute Cardiff University

More information

ILLUMINA SEQUENCING SYSTEMS

ILLUMINA SEQUENCING SYSTEMS ILLUMINA SEQUENCING SYSTEMS PROVEN QUALITY. TRUSTED SOLUTIONS. Every day, researchers are using Illumina next-generation sequencing (NGS) systems to better understand human health and disease, as well

More information

Genetics One Gene, One Phenotype

Genetics One Gene, One Phenotype Review Session Genomics Tuesday 4pm Chapter 19* Site TBA Genetics One Gene, One Phenotype Genotype Sickle Cell Anemia Amino Acid Auxotrophy Antibiotic Resistance Phenotype One Gene, One Phenotype This

More information

MITOCW MIT7_01SCF11_track21_300k.mp4

MITOCW MIT7_01SCF11_track21_300k.mp4 MITOCW MIT7_01SCF11_track21_300k.mp4 Recombinant DNA, often referred to also as genetic engineering. This is a series of techniques, series of methods that allow us to manipulate DNA for a variety of reasons.

More information

Session 8. Differential gene expression analysis using RNAseq data

Session 8. Differential gene expression analysis using RNAseq data Functional and Comparative Genomics 2018 Session 8. Differential gene expression analysis using RNAseq data Tutors: Hrant Hovhannisyan, PhD student, email: grant.hovhannisyan@gmail.com Uciel Chorostecki,

More information

Briefly, this exercise can be summarised by the follow flowchart:

Briefly, this exercise can be summarised by the follow flowchart: Workshop exercise Data integration and analysis In this exercise, we would like to work out which GWAS (genome-wide association study) SNP associated with schizophrenia is most likely to be functional.

More information

Venture Capital Is Fueling Clean Tech

Venture Capital Is Fueling Clean Tech Early stage venture capital increasingly is a key source of finance for clean tech and clean energy, says Andrew Chung, a partner at Khosla Ventures, a venture capital firm that focuses on both areas.

More information

Impact of gdna Integrity on the Outcome of DNA Methylation Studies

Impact of gdna Integrity on the Outcome of DNA Methylation Studies Impact of gdna Integrity on the Outcome of DNA Methylation Studies Application Note Nucleic Acid Analysis Authors Emily Putnam, Keith Booher, and Xueguang Sun Zymo Research Corporation, Irvine, CA, USA

More information

RNA-SEQUENCING ANALYSIS

RNA-SEQUENCING ANALYSIS RNA-SEQUENCING ANALYSIS Joseph Powell SISG- 2018 CONTENTS Introduction to RNA sequencing Data structure Analyses Transcript counting Alternative splicing Allele specific expression Discovery APPLICATIONS

More information

Personal Genomics Platform White Paper Last Updated November 15, Executive Summary

Personal Genomics Platform White Paper Last Updated November 15, Executive Summary Executive Summary Helix is a personal genomics platform company with a simple but powerful mission: to empower every person to improve their life through DNA. Our platform includes saliva sample collection,

More information

LARGE DATA AND BIOMEDICAL COMPUTATIONAL PIPELINES FOR COMPLEX DISEASES

LARGE DATA AND BIOMEDICAL COMPUTATIONAL PIPELINES FOR COMPLEX DISEASES 1 LARGE DATA AND BIOMEDICAL COMPUTATIONAL PIPELINES FOR COMPLEX DISEASES Ezekiel Adebiyi, PhD Professor and Head, Covenant University Bioinformatics Research and CU NIH H3AbioNet node Covenant University,

More information

Target Enrichment Strategies for Next Generation Sequencing

Target Enrichment Strategies for Next Generation Sequencing Target Enrichment Strategies for Next Generation Sequencing Anuj Gupta, PhD Agilent Technologies, New Delhi Genotypic Conference, Sept 2014 NGS Timeline Information burst Nearly 30,000 human genomes sequenced

More information

High Throughput Sequencing the Multi-Tool of Life Sciences. Lutz Froenicke DNA Technologies and Expression Analysis Cores UCD Genome Center

High Throughput Sequencing the Multi-Tool of Life Sciences. Lutz Froenicke DNA Technologies and Expression Analysis Cores UCD Genome Center High Throughput Sequencing the Multi-Tool of Life Sciences Lutz Froenicke DNA Technologies and Expression Analysis Cores UCD Genome Center Complementary Approaches Illumina Still-imaging of clusters (~1000

More information