BTRY 7210: Topics in Quantitative Genomics and Genetics

Size: px
Start display at page:

Download "BTRY 7210: Topics in Quantitative Genomics and Genetics"

Transcription

1 BTRY 7210: Topics in Quantitative Genomics and Genetics Jason Mezey Biological Statistics and Computational Biology (BSCB) Department of Genetic Medicine January 29, 2015

2 Why you re here (=eqtl): Spring 2015 Course Announcement BTRY 7210 Topics in Quantitative Genomics and Genetics Professor: Jason Mezey Biological Statistics and Computational Biology Time: Thurs. 12:20-1:10 PM Room: 224 Weill Hall COURSE DESCRIPTION: We will consider the problem of identifying and leveraging expression Quantitative Trait Loci (eqtl) when analyzing genome-wide data. The class will include a TBD ratio of lectures by the instructor : reading / discussion of papers. Students taking the class for a grade will be required to produce a single, minicritique report of current papers touching on topics covered in the class. General topics areas that will be considered will include: probability and statistics necessary for understanding eqtl analysis, the basics of eqtl analysis, quality control and model checking in eqtl analysis, advanced eqtl analysis techniques including hidden factor analysis, extending analyses to xqtl, biological value and interpretation of eqtl, combining eqtl and other bioinformatics data for biological discovery, structure and application of probabilistic graphical models that make use of eqtl for network analysis and discovery. GRADING: S/U or Audit. CREDITS: 1 SUGGESTED PREREQUISITES: Quantitative Genomics and Genetics (BTRY 6830 / 4830) and/or background in statistics and/or background in eqtl, GWAS or related genetic mapping analyses

3 Today Logistics reminders Introduction to eqtl part 2

4 Logistics Format: a combination of lectures and possibly discussion of papers (that I will select) focused on specific subjects Updated info. on the class website (bottom of classes page): Make sure you are on the listserv (!!!) ( me to join / remove): mezey-groupm-l@cornell.edu If you can register for the class, please do (=Audit) - you may also register S/U (either leading a discussion or a minor ~10-20 hour final report will be required)

5 The eqtl concept II expression Quantitative Trait Locus (eqtl) - a polymorphic locus where an experimental exchange of one allele for another produces a change in expression on average under specified conditions: A 1 A 2 Y The allelic states defined by the original mutation event define the causal polymorphism of the eqtl Intuitive example: if rs27290 was a causal allele, changing A -> G would change the measured expression of ERAP2 eqtl ERAP2 expression A/A A/G G/G rs27290 genotype

6 Detecting eqtl from the analysis of genome-wide data I Since eqtl reflect a case where different allelic combinations (genotypes) lead to different levels of gene expression, we could in theory discover an eqtl by testing for an association between measured genotypes and gene expression levels Most eqtl are discovered using this type of approach A typical (human) eqtl experiment includes m (= ~10-30K) expression variables and N (= ~0.1-10mil) genotypes measured in n individuals sampled from a population A typical (most!) analysis of such data proceeds by performing independent statistical tests of (a subset of) genotype-expression pairs, where tests that are significant after a multiple test correct (e.g. Bonferroni), are assumed to indicate an eqtl

7 Detecting eqtl from the analysis of genome-wide data II eqtl We (almost) always detect eqtl by testing (non-causal) markers in LD with the causal polymorphism rs27290 ERAP2 expression eqtl A/A A/G G/G rs27290 genotype Copyright: Journal of Diabetes and its Complications; Science Direct; Vendramini et al ERAP2 expression T/T A/A T/G A/G G/G T45G rs27290 genotype

8 Genome-wide scan for eqtl: typical outcome eqtl (p < ) ERAP2 expression A/A A/G G/G rs27290 genotype no eqtl (n.s.) ERAP2 expression T/T T/C C/C rs genotype

9 Statistical foundation I (see BTRY 6830 lectures on class site) We need to begin by defining our sample space for an eqtl A experiment: For each individual in our sample { space, } we are interested in pairs of sample outcomes {(a single pair at a time!): Where (Ω g is the set { of possible genotype outcomes for an individual at a locus and Ω P is the set of values of the expression variable for an individual Note that for a diploid, with { two alleles } (typical for humans!): Ω g = {A 1 A 1,A 1 A 2,A 2 A 2 } Ω = {possible individuals} Ω = {Ω g Ω P } F F

10 Statistical foundation II Next, we need to define { the probability model: Pr(F Ω )=Pr(F g,p ) We will define two (types) or random R variables (* = state does F not matter): F Y :(, Ω P ) R Y = measurable expression value X :(Ω g, ) R X(A 1 A 1 )= 1,X(A 1 A 2 )=0,X(A 2 A 2 ) = 1 Note that the probability model induces a (joint) probability distribution on the these random variables: Pr(Y,X)

11 Statistical foundation III To assess whether the marker genotype indicates an eqtl, we need to assess the following hypothesis: H 0 : Cov(Y,X) = 0 H A : Cov(Y,X) = 0 To do this, we will collect a sample of size n of expression and genotype pairs (y, x) and define a statistic T(y, x), for which we know the distribution under the null hypothesis, such that we can calculate a p-value: pval = Pr(T t H 0 : true) p-value - the probability of obtaining a value of a statistic, or more extreme, conditional on H0 being true To analyze the data from a genome-wide eqtl experiment, we calculate a p- values for each of (a subset of) the total set of expression-genotype pairs and for cases where we reject the null (at an appropriate multiple test corrected type I error), we assume that this indicates an eqtl Note that we usually consider a run of contiguous genotypes for which we reject the null for the same expression variable to indicate the position of a single causal eqtl polymorphism

12 Genome-wide scan for eqtl: typical outcome eqtl (p < ) ERAP2 expression A/A A/G G/G rs27290 genotype no eqtl (n.s.) ERAP2 expression T/T T/C C/C rs genotype

13 Typical outcome: zooming in and cis- v trans- This is a cis- eqtl because the significant genotypes are in the same location as the expressed gene (otherwise, it would be a trans- eqtl) Most eqtl are cis-, which makes biological sense

14 What we will discuss 1: genomewide identification of eqtl one gene, one SNP one gene, multiple SNPs all genes, all SNPs. one gene, all SNPs.

15 What we will discuss II: reducing eqtl false positives Population structure and and hidden factors can cause false positive associations = correlations - that don t represent true true genetic genetic effects. effects These effects are visible on the p-value heatmap: population structure hidden factor Usually we can remove these artifacts by including appropriate covariates in our analysis We can sometimes remove these artifacts by including appropriate covariates in our analysis in a mixed model (we will also consider binary phenotype!) or by using a hidden factor analysis

16 What we will discuss III: can we identify causal genotypes (alleles)?

17 What we discuss IV: leveraging eqtl for annotation? eqtl SNPs tend to be in disequilibrium (LD) blocks enriched for ENCODE Transcription Factor motifs, suggesting a functional mechanism

18 What we will discuss V: leveraging eqtl for GWAS eqtl co-localize with disease loci identified in GWAS, indicating a common genetic basis and a method for identifying candidate causal polymorphisms for disease risk

19 What we will discuss VI: leveraging eqtl for network inference eqtl are used within Probabilistic Graphical Modeling (PGM) frameworks to discover new network / pathway / regulatory relationships

20 That s it for today Reminder: please me to join the listserv (!!)

Quantitative Genomics and Genetics BTRY 4830/6830; PBSB

Quantitative Genomics and Genetics BTRY 4830/6830; PBSB Quantitative Genomics and Genetics BTRY 4830/6830; PBSB.5201.01 Jason Mezey Biological Statistics and Computational Biology (BSCB) Department of Genetic Medicine Institute for Computational Biomedicine

More information

Quantitative Genomics and Genetics BTRY 4830/6830; PBSB

Quantitative Genomics and Genetics BTRY 4830/6830; PBSB Quantitative Genomics and Genetics BTRY 4830/6830; PBSB.5201.03 Jason Mezey Biological Statistics and Computational Biology (BSCB) Department of Genetic Medicine Institute for Computational Biomedicine

More information

By the end of this lecture you should be able to explain: Some of the principles underlying the statistical analysis of QTLs

By the end of this lecture you should be able to explain: Some of the principles underlying the statistical analysis of QTLs (3) QTL and GWAS methods By the end of this lecture you should be able to explain: Some of the principles underlying the statistical analysis of QTLs Under what conditions particular methods are suitable

More information

Computational Workflows for Genome-Wide Association Study: I

Computational Workflows for Genome-Wide Association Study: I Computational Workflows for Genome-Wide Association Study: I Department of Computer Science Brown University, Providence sorin@cs.brown.edu October 16, 2014 Outline 1 Outline 2 3 Monogenic Mendelian Diseases

More information

Genome-Wide Association Studies (GWAS): Computational Them

Genome-Wide Association Studies (GWAS): Computational Them Genome-Wide Association Studies (GWAS): Computational Themes and Caveats October 14, 2014 Many issues in Genomewide Association Studies We show that even for the simplest analysis, there is little consensus

More information

CS273B: Deep Learning in Genomics and Biomedicine. Recitation 1 30/9/2016

CS273B: Deep Learning in Genomics and Biomedicine. Recitation 1 30/9/2016 CS273B: Deep Learning in Genomics and Biomedicine. Recitation 1 30/9/2016 Topics Genetic variation Population structure Linkage disequilibrium Natural disease variants Genome Wide Association Studies Gene

More information

http://genemapping.org/ Epistasis in Association Studies David Evans Law of Independent Assortment Biological Epistasis Bateson (99) a masking effect whereby a variant or allele at one locus prevents

More information

Genetic Variation and Genome- Wide Association Studies. Keyan Salari, MD/PhD Candidate Department of Genetics

Genetic Variation and Genome- Wide Association Studies. Keyan Salari, MD/PhD Candidate Department of Genetics Genetic Variation and Genome- Wide Association Studies Keyan Salari, MD/PhD Candidate Department of Genetics How many of you did the readings before class? A. Yes, of course! B. Started, but didn t get

More information

Association Mapping in Plants PLSC 731 Plant Molecular Genetics Phil McClean April, 2010

Association Mapping in Plants PLSC 731 Plant Molecular Genetics Phil McClean April, 2010 Association Mapping in Plants PLSC 731 Plant Molecular Genetics Phil McClean April, 2010 Traditional QTL approach Uses standard bi-parental mapping populations o F2 or RI These have a limited number of

More information

Exploring the Genetic Basis of Congenital Heart Defects

Exploring the Genetic Basis of Congenital Heart Defects Exploring the Genetic Basis of Congenital Heart Defects Sanjay Siddhanti Jordan Hannel Vineeth Gangaram szsiddh@stanford.edu jfhannel@stanford.edu vineethg@stanford.edu 1 Introduction The Human Genome

More information

Answers to additional linkage problems.

Answers to additional linkage problems. Spring 2013 Biology 321 Answers to Assignment Set 8 Chapter 4 http://fire.biol.wwu.edu/trent/trent/iga_10e_sm_chapter_04.pdf Answers to additional linkage problems. Problem -1 In this cell, there two copies

More information

Genome-wide association studies (GWAS) Part 1

Genome-wide association studies (GWAS) Part 1 Genome-wide association studies (GWAS) Part 1 Matti Pirinen FIMM, University of Helsinki 03.12.2013, Kumpula Campus FIMM - Institiute for Molecular Medicine Finland www.fimm.fi Published Genome-Wide Associations

More information

Linking Genetic Variation to Important Phenotypes

Linking Genetic Variation to Important Phenotypes Linking Genetic Variation to Important Phenotypes BMI/CS 776 www.biostat.wisc.edu/bmi776/ Spring 2018 Anthony Gitter gitter@biostat.wisc.edu These slides, excluding third-party material, are licensed under

More information

Linkage Disequilibrium. Adele Crane & Angela Taravella

Linkage Disequilibrium. Adele Crane & Angela Taravella Linkage Disequilibrium Adele Crane & Angela Taravella Overview Introduction to linkage disequilibrium (LD) Measuring LD Genetic & demographic factors shaping LD Model predictions and expected LD decay

More information

KNN-MDR: a learning approach for improving interactions mapping performances in genome wide association studies

KNN-MDR: a learning approach for improving interactions mapping performances in genome wide association studies Abo Alchamlat and Farnir BMC Bioinformatics (2017) 18:184 DOI 10.1186/s12859-017-1599-7 METHODOLOGY ARTICLE Open Access KNN-MDR: a learning approach for improving interactions mapping performances in genome

More information

Beyond single genes or proteins

Beyond single genes or proteins Beyond single genes or proteins Marylyn D Ritchie, PhD Professor, Biochemistry and Molecular Biology Director, Center for Systems Genomics The Pennsylvania State University Traditional Approach Genome-wide

More information

Mapping and Mapping Populations

Mapping and Mapping Populations Mapping and Mapping Populations Types of mapping populations F 2 o Two F 1 individuals are intermated Backcross o Cross of a recurrent parent to a F 1 Recombinant Inbred Lines (RILs; F 2 -derived lines)

More information

PERSPECTIVES. A gene-centric approach to genome-wide association studies

PERSPECTIVES. A gene-centric approach to genome-wide association studies PERSPECTIVES O P I N I O N A gene-centric approach to genome-wide association studies Eric Jorgenson and John S. Witte Abstract Genic variants are more likely to alter gene function and affect disease

More information

After the association: Functional and Biological Validation of Variants

After the association: Functional and Biological Validation of Variants After the association: Functional and Biological Validation of Variants Jason L. Stein Geschwind Laboratory / Imaging Genetics Center University of California, Los Angeles (but soon to be at UNC-Chapel

More information

A/A;b/b x a/a;b/b. The doubly heterozygous F1 progeny generally show a single phenotype, determined by the dominant alleles of the two genes.

A/A;b/b x a/a;b/b. The doubly heterozygous F1 progeny generally show a single phenotype, determined by the dominant alleles of the two genes. Name: Date: Title: Gene Interactions in Corn. Introduction. The phenotype of an organism is determined, at least in part, by its genotype. Thus, given the genotype of an organism, and an understanding

More information

H3A - Genome-Wide Association testing SOP

H3A - Genome-Wide Association testing SOP H3A - Genome-Wide Association testing SOP Introduction File format Strand errors Sample quality control Marker quality control Batch effects Population stratification Association testing Replication Meta

More information

Exam 1 Answers Biology 210 Sept. 20, 2006

Exam 1 Answers Biology 210 Sept. 20, 2006 Exam Answers Biology 20 Sept. 20, 2006 Name: Section:. (5 points) Circle the answer that gives the maximum number of different alleles that might exist for any one locus in a normal mammalian cell. A.

More information

An introduction to genetics and molecular biology

An introduction to genetics and molecular biology An introduction to genetics and molecular biology Cavan Reilly September 5, 2017 Table of contents Introduction to biology Some molecular biology Gene expression Mendelian genetics Some more molecular

More information

Statistical challenges to genome-wide association study

Statistical challenges to genome-wide association study 1 Statistical challenges to genome-wide association study Naoyuki Kamatani, M.D., Ph.D. 1. Director and Professor, Institute of Rheumatology, Tokyo Women s Medical University 2. Director, Medical Informatics

More information

General aspects of genome-wide association studies

General aspects of genome-wide association studies General aspects of genome-wide association studies Abstract number 20201 Session 04 Correctly reporting statistical genetics results in the genomic era Pekka Uimari University of Helsinki Dept. of Agricultural

More information

Human linkage analysis. fundamental concepts

Human linkage analysis. fundamental concepts Human linkage analysis fundamental concepts Genes and chromosomes Alelles of genes located on different chromosomes show independent assortment (Mendel s 2nd law) For 2 genes: 4 gamete classes with equal

More information

POPULATION GENETICS Winter 2005 Lecture 18 Quantitative genetics and QTL mapping

POPULATION GENETICS Winter 2005 Lecture 18 Quantitative genetics and QTL mapping POPULATION GENETICS Winter 2005 Lecture 18 Quantitative genetics and QTL mapping - from Darwin's time onward, it has been widely recognized that natural populations harbor a considerably degree of genetic

More information

ARTICLE Sherlock: Detecting Gene-Disease Associations by Matching Patterns of Expression QTL and GWAS

ARTICLE Sherlock: Detecting Gene-Disease Associations by Matching Patterns of Expression QTL and GWAS ARTICLE Sherlock: Detecting Gene-Disease Associations by Matching Patterns of Expression QTL and GWAS Xin He, 1,2 Chris K. Fuller, 1 Yi Song, 1 Qingying Meng, 3 Bin Zhang, 4 Xia Yang, 3 and Hao Li 1, *

More information

Chapter 11: Genome-Wide Association Studies

Chapter 11: Genome-Wide Association Studies Education Chapter 11: Genome-Wide Association Studies William S. Bush 1 *, Jason H. Moore 2 1 Department of Biomedical Informatics, Center for Human Genetics Research, Vanderbilt University Medical School,

More information

Machine learning applications in genomics: practical issues & challenges. Yuzhen Ye School of Informatics and Computing, Indiana University

Machine learning applications in genomics: practical issues & challenges. Yuzhen Ye School of Informatics and Computing, Indiana University Machine learning applications in genomics: practical issues & challenges Yuzhen Ye School of Informatics and Computing, Indiana University Reference Machine learning applications in genetics and genomics

More information

Genetic Association Studies

Genetic Association Studies 1 Genetic Association Studies Recent technological advancements allowing for large-scale sequencing efforts present an exciting opportunity to uncover the genetic underpinnings of complex diseases. In

More information

Axiom mydesign Custom Array design guide for human genotyping applications

Axiom mydesign Custom Array design guide for human genotyping applications TECHNICAL NOTE Axiom mydesign Custom Genotyping Arrays Axiom mydesign Custom Array design guide for human genotyping applications Overview In the past, custom genotyping arrays were expensive, required

More information

Illumina s GWAS Roadmap: next-generation genotyping studies in the post-1kgp era

Illumina s GWAS Roadmap: next-generation genotyping studies in the post-1kgp era Illumina s GWAS Roadmap: next-generation genotyping studies in the post-1kgp era Anthony Green Sr. Genotyping Sales Specialist North America 2010 Illumina, Inc. All rights reserved. Illumina, illuminadx,

More information

Designing Genome-Wide Association Studies: Sample Size, Power, Imputation, and the Choice of Genotyping Chip

Designing Genome-Wide Association Studies: Sample Size, Power, Imputation, and the Choice of Genotyping Chip : Sample Size, Power, Imputation, and the Choice of Genotyping Chip Chris C. A. Spencer., Zhan Su., Peter Donnelly ", Jonathan Marchini " * Department of Statistics, University of Oxford, Oxford, United

More information

MAPPING GENES TO TRAITS IN DOGS USING SNPs

MAPPING GENES TO TRAITS IN DOGS USING SNPs MAPPING GENES TO TRAITS IN DOGS USING SNPs OVERVIEW This instructor guide provides support for a genetic mapping activity, based on the dog genome project data. The activity complements a 29-minute lecture

More information

OBJECTIVES-ACTIVITIES 2-4

OBJECTIVES-ACTIVITIES 2-4 OBJECTIVES-ACTIVITIES 2-4 Germplasm Phenotyping Genomics PBA BIMS MAB Pipeline Implementation GOALS, ACTIVITIES, & DELIVERABLES Cameron Peace, project co-director & MAB Pipeline Team leader Outline of

More information

Section 4 - Guidelines for DNA Technology. Version October, 2017

Section 4 - Guidelines for DNA Technology. Version October, 2017 Section 4 - Guidelines for DNA Technology Section 4 DNA Technology Table of Contents Overview 1 Molecular genetics... 4 1.1 Introduction... 4 1.2 Current and potential uses of DNA technologies... 4 1.2.1

More information

Biology 142 Advanced Topics in Genetics and Molecular Biology Course Syllabus Spring 2006

Biology 142 Advanced Topics in Genetics and Molecular Biology Course Syllabus Spring 2006 Biology 142 Advanced Topics in Genetics and Molecular Biology Course Syllabus Spring 2006 Faculty Information: Dr. Nitya Jacob, Office: Room 104, Pierce Hall; Phone: 770-784-8346 Office Hours: T 9:30-10:30

More information

Haplotype phasing in large cohorts: Modeling, search, or both?

Haplotype phasing in large cohorts: Modeling, search, or both? Haplotype phasing in large cohorts: Modeling, search, or both? Po-Ru Loh Harvard T.H. Chan School of Public Health Department of Epidemiology Broad MIA Seminar, 3/9/16 Overview Background: Haplotype phasing

More information

The 150+ Tomato Genome (re-)sequence Project; Lessons Learned and Potential

The 150+ Tomato Genome (re-)sequence Project; Lessons Learned and Potential The 150+ Tomato Genome (re-)sequence Project; Lessons Learned and Potential Applications Richard Finkers Researcher Plant Breeding, Wageningen UR Plant Breeding, P.O. Box 16, 6700 AA, Wageningen, The Netherlands,

More information

DO NOT OPEN UNTIL TOLD TO START

DO NOT OPEN UNTIL TOLD TO START DO NOT OPEN UNTIL TOLD TO START BIO 312, Section 1, Spring 2011 February 21, 2011 Exam 1 Name (print neatly) Instructor 7 digit student ID INSTRUCTIONS: 1. There are 11 pages to the exam. Make sure you

More information

Author's response to reviews

Author's response to reviews Author's response to reviews Title: A pooling-based genome-wide analysis identifies new potential candidate genes for atopy in the European Community Respiratory Health Survey (ECRHS) Authors: Francesc

More information

Conifer Translational Genomics Network Coordinated Agricultural Project

Conifer Translational Genomics Network Coordinated Agricultural Project Conifer Translational Genomics Network Coordinated Agricultural Project Genomics in Tree Breeding and Forest Ecosystem Management ----- Module 11 Association Genetics Nicholas Wheeler & David Harry Oregon

More information

Title: Powerful SNP Set Analysis for Case-Control Genome Wide Association Studies. Running Title: Powerful SNP Set Analysis. Hill, NC. MD.

Title: Powerful SNP Set Analysis for Case-Control Genome Wide Association Studies. Running Title: Powerful SNP Set Analysis. Hill, NC. MD. Title: Powerful SNP Set Analysis for Case-Control Genome Wide Association Studies Running Title: Powerful SNP Set Analysis Michael C. Wu 1, Peter Kraft 2,3, Michael P. Epstein 4, Deanne M. Taylor 2, Stephen

More information

DNA Structure & the Genome. Bio160 General Biology

DNA Structure & the Genome. Bio160 General Biology DNA Structure & the Genome Bio160 General Biology Lecture Outline I. DNA A nucleic acid II. Chromosome Structure III. Chromosomes and Genes IV. DNA vs. RNA I. DNA A Nucleic Acid Structure of DNA: Remember:

More information

Lees J.A., Vehkala M. et al., 2016 In Review

Lees J.A., Vehkala M. et al., 2016 In Review Sequence element enrichment analysis to determine the genetic basis of bacterial phenotypes Lees J.A., Vehkala M. et al., 2016 In Review Journal Club Triinu Kõressaar 16.03.2016 Introduction Bacterial

More information

A strategy for multiple linkage disequilibrium mapping methods to validate additive QTL. Abstracts

A strategy for multiple linkage disequilibrium mapping methods to validate additive QTL. Abstracts Proceedings 59th ISI World Statistics Congress, 5-30 August 013, Hong Kong (Session CPS03) p.3947 A strategy for multiple linkage disequilibrium mapping methods to validate additive QTL Li Yi 1,4, Jong-Joo

More information

AP BIOLOGY Population Genetics and Evolution Lab

AP BIOLOGY Population Genetics and Evolution Lab AP BIOLOGY Population Genetics and Evolution Lab In 1908 G.H. Hardy and W. Weinberg independently suggested a scheme whereby evolution could be viewed as changes in the frequency of alleles in a population

More information

PLINK gplink Haploview

PLINK gplink Haploview PLINK gplink Haploview Whole genome association software tutorial Shaun Purcell Center for Human Genetic Research, Massachusetts General Hospital, Boston, MA Broad Institute of Harvard & MIT, Cambridge,

More information

Bayesian Networks as framework for data integration

Bayesian Networks as framework for data integration Bayesian Networks as framework for data integration Jun Zhu, Ph. D. Department of Genomics and Genetic Sciences Icahn Institute of Genomics and Multiscale Biology Icahn Medical School at Mount Sinai New

More information

MAS refers to the use of DNA markers that are tightly-linked to target loci as a substitute for or to assist phenotypic screening.

MAS refers to the use of DNA markers that are tightly-linked to target loci as a substitute for or to assist phenotypic screening. Marker assisted selection in rice Introduction The development of DNA (or molecular) markers has irreversibly changed the disciplines of plant genetics and plant breeding. While there are several applications

More information

Analysis of Microarray Data

Analysis of Microarray Data Analysis of Microarray Data Lecture 3: Visualization and Functional Analysis George Bell, Ph.D. Senior Bioinformatics Scientist Bioinformatics and Research Computing Whitehead Institute Outline Review

More information

Evaluation of Genome wide SNP Haplotype Blocks for Human Identification Applications

Evaluation of Genome wide SNP Haplotype Blocks for Human Identification Applications Ranajit Chakraborty, Ph.D. Evaluation of Genome wide SNP Haplotype Blocks for Human Identification Applications Overview Some brief remarks about SNPs Haploblock structure of SNPs in the human genome Criteria

More information

Linkage & Genetic Mapping in Eukaryotes. Ch. 6

Linkage & Genetic Mapping in Eukaryotes. Ch. 6 Linkage & Genetic Mapping in Eukaryotes Ch. 6 1 LINKAGE AND CROSSING OVER! In eukaryotic species, each linear chromosome contains a long piece of DNA A typical chromosome contains many hundred or even

More information

SNPassoc: an R package to perform whole genome association studies

SNPassoc: an R package to perform whole genome association studies SNPassoc: an R package to perform whole genome association studies Juan R González, Lluís Armengol, Xavier Solé, Elisabet Guinó, Josep M Mercader, Xavier Estivill, Víctor Moreno November 16, 2006 Contents

More information

Gene Linkage and Genetic. Mapping. Key Concepts. Key Terms. Concepts in Action

Gene Linkage and Genetic. Mapping. Key Concepts. Key Terms. Concepts in Action Gene Linkage and Genetic 4 Mapping Key Concepts Genes that are located in the same chromosome and that do not show independent assortment are said to be linked. The alleles of linked genes present together

More information

Haplotypes Personalized Medicine: Understanding Your Own Genome Fall 2014

Haplotypes Personalized Medicine: Understanding Your Own Genome Fall 2014 Haplotypes 02-223 Personalized Medicine: Understanding Your Own Genome Fall 2014 Terminology Review llele: different forms of genecc variacons at a given gene or genecc locus Locus 1 has two alleles, and

More information

GENETIC ALGORITHMS. Narra Priyanka. K.Naga Sowjanya. Vasavi College of Engineering. Ibrahimbahg,Hyderabad.

GENETIC ALGORITHMS. Narra Priyanka. K.Naga Sowjanya. Vasavi College of Engineering. Ibrahimbahg,Hyderabad. GENETIC ALGORITHMS Narra Priyanka K.Naga Sowjanya Vasavi College of Engineering. Ibrahimbahg,Hyderabad mynameissowji@yahoo.com priyankanarra@yahoo.com Abstract Genetic algorithms are a part of evolutionary

More information

Supplementary Note: Detecting population structure in rare variant data

Supplementary Note: Detecting population structure in rare variant data Supplementary Note: Detecting population structure in rare variant data Inferring ancestry from genetic data is a common problem in both population and medical genetic studies, and many methods exist to

More information

S SG. Metabolomics meets Genomics. Hemant K. Tiwari, Ph.D. Professor and Head. Metabolomics: Bench to Bedside. ection ON tatistical.

S SG. Metabolomics meets Genomics. Hemant K. Tiwari, Ph.D. Professor and Head. Metabolomics: Bench to Bedside. ection ON tatistical. S SG ection ON tatistical enetics Metabolomics meets Genomics Hemant K. Tiwari, Ph.D. Professor and Head Section on Statistical Genetics Department of Biostatistics School of Public Health Metabolomics:

More information

Genetic load. For the organism as a whole (its genome, and the species), what is the fitness cost of deleterious mutations?

Genetic load. For the organism as a whole (its genome, and the species), what is the fitness cost of deleterious mutations? Genetic load For the organism as a whole (its genome, and the species), what is the fitness cost of deleterious mutations? Anth/Biol 5221, 25 October 2017 We saw that the expected frequency of deleterious

More information

ARTICLE Leveraging Multi-ethnic Evidence for Mapping Complex Traits in Minority Populations: An Empirical Bayes Approach

ARTICLE Leveraging Multi-ethnic Evidence for Mapping Complex Traits in Minority Populations: An Empirical Bayes Approach ARTICLE Leveraging Multi-ethnic Evidence for Mapping Complex Traits in Minority Populations: An Empirical Bayes Approach Marc A. Coram, 1,8,9 Sophie I. Candille, 2,8 Qing Duan, 3 Kei Hang K. Chan, 4 Yun

More information

Concepts and relevance of genome-wide association studies

Concepts and relevance of genome-wide association studies Science Progress (2016), 99(1), 59 67 Paper 1500149 doi:10.3184/003685016x14558068452913 Concepts and relevance of genome-wide association studies ANDREAS SCHERER and G. BRYCE CHRISTENSEN Dr Andreas Scherer

More information

Molecular markers in plant breeding

Molecular markers in plant breeding Molecular markers in plant breeding Jumbo MacDonald et al., MAIZE BREEDERS COURSE Palace Hotel Arusha, Tanzania 4 Sep to 16 Sep 2016 Molecular Markers QTL Mapping Association mapping GWAS Genomic Selection

More information

Age-Adjusted Death Rates for Coronary Heart Disease, U.S.,

Age-Adjusted Death Rates for Coronary Heart Disease, U.S., Age-Adjusted Death Rates for Coronary Heart Disease, U.S., 1950-2004 Deaths/100,000 Population 600 500 400 300 200 100 Risk Factors U.S. Actual U.S. "Could Be" (Based on Japan Actual) 0 1950 1960 1970

More information

Strategy for applying genome-wide selection in dairy cattle

Strategy for applying genome-wide selection in dairy cattle J. Anim. Breed. Genet. ISSN 0931-2668 ORIGINAL ARTICLE Strategy for applying genome-wide selection in dairy cattle L.R. Schaeffer Department of Animal and Poultry Science, Centre for Genetic Improvement

More information

Association mapping of Sclerotinia stalk rot resistance in domesticated sunflower plant introductions

Association mapping of Sclerotinia stalk rot resistance in domesticated sunflower plant introductions Association mapping of Sclerotinia stalk rot resistance in domesticated sunflower plant introductions Zahirul Talukder 1, Brent Hulke 2, Lili Qi 2, and Thomas Gulya 2 1 Department of Plant Sciences, NDSU

More information

SNP calling and VCF format

SNP calling and VCF format SNP calling and VCF format Laurent Falquet, Oct 12 SNP? What is this? A type of genetic variation, among others: Family of Single Nucleotide Aberrations Single Nucleotide Polymorphisms (SNPs) Single Nucleotide

More information

Reading Between the Genes: Computational Models to Discover Function from Noncoding DNA

Reading Between the Genes: Computational Models to Discover Function from Noncoding DNA Reading Between the Genes: Computational Models to Discover Function from Noncoding DNA Yves A. Lussier, Joanne Berghout, Francesca Vitali, Kenneth S. Ramos Center for Biomedical Informatics and Biostatistics,

More information

Population and Community Dynamics. The Hardy-Weinberg Principle

Population and Community Dynamics. The Hardy-Weinberg Principle Population and Community Dynamics The Hardy-Weinberg Principle Key Terms Population: same species, same place, same time Gene: unit of heredity. Controls the expression of a trait. Can be passed to offspring.

More information

Conifer Translational Genomics Network Coordinated Agricultural Project

Conifer Translational Genomics Network Coordinated Agricultural Project Conifer Translational Genomics Network Coordinated Agricultural Project Genomics in Tree Breeding and Forest Ecosystem Management ----- Module 2 Genes, Genomes, and Mendel Nicholas Wheeler & David Harry

More information

Question. In the last 100 years. What is Feed Efficiency? Genetics of Feed Efficiency and Applications for the Dairy Industry

Question. In the last 100 years. What is Feed Efficiency? Genetics of Feed Efficiency and Applications for the Dairy Industry Question Genetics of Feed Efficiency and Applications for the Dairy Industry Can we increase milk yield while decreasing feed cost? If so, how can we accomplish this? Stephanie McKay University of Vermont

More information

Genomic resources and gene/qtl discovery in cereals

Genomic resources and gene/qtl discovery in cereals Genomic resources and gene/qtl discovery in cereals Roberto Tuberosa Dept. of Agroenvironmental Sciences & Technology University of Bologna, Italy The ABDC Congress 1-4 March 2010 Gudalajara, Mexico Outline

More information

Logistics. Final exam date. Project Presentation. Plan for this week. Evolutionary Algorithms. Crossover and Mutation

Logistics. Final exam date. Project Presentation. Plan for this week. Evolutionary Algorithms. Crossover and Mutation Logistics Crossover and Mutation Assignments Checkpoint -- Problem Graded -- comments on mycourses Checkpoint --Framework Mostly all graded -- comments on mycourses Checkpoint -- Genotype / Phenotype Due

More information

LAB ACTIVITY ONE POPULATION GENETICS AND EVOLUTION 2017

LAB ACTIVITY ONE POPULATION GENETICS AND EVOLUTION 2017 OVERVIEW In this lab you will: 1. learn about the Hardy-Weinberg law of genetic equilibrium, and 2. study the relationship between evolution and changes in allele frequency by using your class to represent

More information

Pathway approach for candidate gene identification and introduction to metabolic pathway databases.

Pathway approach for candidate gene identification and introduction to metabolic pathway databases. Marker Assisted Selection in Tomato Pathway approach for candidate gene identification and introduction to metabolic pathway databases. Identification of polymorphisms in data-based sequences MAS forward

More information

Figure S4 A-H : Initiation site properties and evolutionary changes

Figure S4 A-H : Initiation site properties and evolutionary changes A 0.3 Figure S4 A-H : Initiation site properties and evolutionary changes G-correction not used 0.25 Fraction of total counts 0.2 0.5 0. tag 2 tags 3 tags 4 tags 5 tags 6 tags 7tags 8tags 9 tags >9 tags

More information

Quantitative Genetics

Quantitative Genetics Quantitative Genetics Polygenic traits Quantitative Genetics 1. Controlled by several to many genes 2. Continuous variation more variation not as easily characterized into classes; individuals fall into

More information

Genome-wide association studies. Gene regulation and the genomics of complex traits. Relating Variation to Phenotype. Relating Variation to Phenotype

Genome-wide association studies. Gene regulation and the genomics of complex traits. Relating Variation to Phenotype. Relating Variation to Phenotype Genome-wide association studies Gene regulation and the genomics of complex traits Eric R. Gamazon Vanderbilt University; Academic Medical Center, University of Amsterdam Nancy J. Cox Vanderbilt University

More information

Stefano Monti. Workshop Format

Stefano Monti. Workshop Format Gad Getz Stefano Monti Michael Reich {gadgetz,smonti,mreich}@broad.mit.edu http://www.broad.mit.edu/~smonti/aws Broad Institute of MIT & Harvard October 18-20, 2006 Cambridge, MA Workshop Format Morning

More information

Concepts: What are RFLPs and how do they act like genetic marker loci?

Concepts: What are RFLPs and how do they act like genetic marker loci? Restriction Fragment Length Polymorphisms (RFLPs) -1 Readings: Griffiths et al: 7th Edition: Ch. 12 pp. 384-386; Ch.13 pp404-407 8th Edition: pp. 364-366 Assigned Problems: 8th Ch. 11: 32, 34, 38-39 7th

More information

We can use a Punnett Square to determine how the gametes will recombine in the next, or F2 generation.

We can use a Punnett Square to determine how the gametes will recombine in the next, or F2 generation. AP Lab 7: The Mendelian Genetics of Corn Objectives: In this laboratory investigation, you will: Use corn to study genetic crosses, recognize contrasting phenotypes, collect data from F 2 ears of corn,

More information

less sensitive than RNA-seq but more robust analysis pipelines expensive but quantitiatve standard but typically not high throughput

less sensitive than RNA-seq but more robust analysis pipelines expensive but quantitiatve standard but typically not high throughput Chapter 11: Gene Expression The availability of an annotated genome sequence enables massively parallel analysis of gene expression. The expression of all genes in an organism can be measured in one experiment.

More information

Cowpea Breeding. Ainong Shi. University of Arkansas

Cowpea Breeding. Ainong Shi. University of Arkansas Cowpea Breeding Ainong Shi University of Arkansas UAF Vegetable Breeding UAF AR USA International Collaboration Classic Breeding Molecular Breeding Student Training Classic breeding such as crossing, generation

More information

University of Groningen. The value of haplotypes Vries, Anne René de

University of Groningen. The value of haplotypes Vries, Anne René de University of Groningen The value of haplotypes Vries, Anne René de IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's PDF) if you wish to cite from it. Please check the document

More information

Péter Antal Ádám Arany Bence Bolgár András Gézsi Gergely Hajós Gábor Hullám Péter Marx András Millinghoffer László Poppe Péter Sárközy BIOINFORMATICS

Péter Antal Ádám Arany Bence Bolgár András Gézsi Gergely Hajós Gábor Hullám Péter Marx András Millinghoffer László Poppe Péter Sárközy BIOINFORMATICS Péter Antal Ádám Arany Bence Bolgár András Gézsi Gergely Hajós Gábor Hullám Péter Marx András Millinghoffer László Poppe Péter Sárközy BIOINFORMATICS The Bioinformatics book covers new topics in the rapidly

More information

Improved Power by Use of a Weighted Score Test for Linkage Disequilibrium Mapping

Improved Power by Use of a Weighted Score Test for Linkage Disequilibrium Mapping Improved Power by Use of a Weighted Score Test for Linkage Disequilibrium Mapping Tao Wang and Robert C. Elston REPORT Association studies offer an exciting approach to finding underlying genetic variants

More information

Whole genome sequencing in drug discovery research: a one fits all solution?

Whole genome sequencing in drug discovery research: a one fits all solution? Whole genome sequencing in drug discovery research: a one fits all solution? Marc Sultan, September 24th, 2015 Biomarker Development, Translational Medicine, Novartis On behalf of the BMD WGS pilot team:

More information

ACCEPTED. Victoria J. Wright Corresponding author.

ACCEPTED. Victoria J. Wright Corresponding author. The Pediatric Infectious Disease Journal Publish Ahead of Print DOI: 10.1097/INF.0000000000001183 Genome-wide association studies in infectious diseases Eleanor G. Seaby 1, Victoria J. Wright 1, Michael

More information

EVOLUTION/HERDEDITY UNIT Unit 1 Part 8A Chapter 23 Activity Lab #11 A POPULATION GENETICS AND EVOLUTION

EVOLUTION/HERDEDITY UNIT Unit 1 Part 8A Chapter 23 Activity Lab #11 A POPULATION GENETICS AND EVOLUTION AP BIOLOGY EVOLUTION/HERDEDITY UNIT Unit Part 8A Chapter Activity Lab # A NAME DATE PERIOD POPULATION GENETICS AND EVOLUTION In 908 G. H. Hardy and W. Weinberg independently suggest a scheme whereby evolution

More information

Derrek Paul Hibar

Derrek Paul Hibar Derrek Paul Hibar derrek.hibar@ini.usc.edu Obtain the ADNI Genetic Data Quality Control Procedures Missingness Testing for relatedness Minor allele frequency (MAF) Hardy-Weinberg Equilibrium (HWE) Testing

More information

Phasing of 2-SNP Genotypes based on Non-Random Mating Model

Phasing of 2-SNP Genotypes based on Non-Random Mating Model Phasing of 2-SNP Genotypes based on Non-Random Mating Model Dumitru Brinza and Alexander Zelikovsky Department of Computer Science, Georgia State University, Atlanta, GA 30303 {dima,alexz}@cs.gsu.edu Abstract.

More information

POPULATION GENETICS AND EVOLUTION

POPULATION GENETICS AND EVOLUTION AP BIOLOGY EVOLUTION ACTIVITY # NAME DATE HOUR POPULATION GENETICS AND EVOLUTION INTRODUCTION In 908 G. H. Hardy and W. Weinberg independently suggest a scheme whereby evolution could be viewed as changes

More information

Genetics of extreme body size evolution in mice from Gough Island

Genetics of extreme body size evolution in mice from Gough Island Genetics of extreme body size evolution in mice from Gough Island Karl Broman Biostatistics & Medical Informatics, UW Madison kbroman.org github.com/kbroman @kwbroman bit.ly/bds2017-07 This is a collaboration

More information

A Statistical Framework for Joint eqtl Analysis in Multiple Tissues

A Statistical Framework for Joint eqtl Analysis in Multiple Tissues A Statistical Framework for Joint eqtl Analysis in Multiple Tissues Timothée Flutre 1,2., Xiaoquan Wen 3., Jonathan Pritchard 1,4, Matthew Stephens 1,5 * 1 Department of Human Genetics, University of Chicago,

More information

An introductory overview of the current state of statistical genetics

An introductory overview of the current state of statistical genetics An introductory overview of the current state of statistical genetics p. 1/9 An introductory overview of the current state of statistical genetics Cavan Reilly Division of Biostatistics, University of

More information

Runs of Homozygosity Analysis Tutorial

Runs of Homozygosity Analysis Tutorial Runs of Homozygosity Analysis Tutorial Release 8.7.0 Golden Helix, Inc. March 22, 2017 Contents 1. Overview of the Project 2 2. Identify Runs of Homozygosity 6 Illustrative Example...............................................

More information