Identifying genetic variants for complex disorders
|
|
- Bartholomew Gray
- 5 years ago
- Views:
Transcription
1 Identifying genetic variants for complex disorders Peristera Paschou, PhD Associate Professor of Population Genetics Dept. of Molecular Biology and Genetics Democritus University of Thrace Marianthi Georgitsi, PhD Assistant Professor of Medical Biology-Genetics Dept. of Medicine Aristotle University of Thessaloniki
2 We are quite similar, but we are different The average genome (2x3 billion bp) contains: 3-4 million single nucleotide variations, compared to the reference sequence (Single Nucleotide Polymorphisms SNPs) ~0.5 million small insertions or deletions indels (1-100bp) ~5,000 larger insertions or deletions (>100bp) Variation across all (~23,000) genes - the exome ~18,000 variants ~8-9,000 functional variants ~95% of variants are common ~ genes with new mutations ~ knock-out mutations
3 Monogenic Disorders vs Complex Disorders
4 Single gene disorders Low impact on public health cost One gene Mendelian inheritance Rare mutations Classical genetics approaches Examples: Huntington s disease Cystic fibrosis Multifactorial disorders (Complex traits) Serious impact on public health cost Multiple genes Complex pattern of inheritance Common and rare genetic variants Genome scans new technologies Examples: Stroke Diabetes Schizophrenia
5 Interaction of genetic and environmental factors Gastric ulcer Duchenne muscular dystrophy Hemophilia Diabetes Schizophrenia Bipolar disorder GENETICS ENVIRONMENT Phenylketonuria Galactosemia Cardiovascular disease Ankylosing spondylitis Car accident Rare disorders Simple genetic basis Unifactorial Common disorders Complex genetic basis Multifactorial
6 λ = Relative risk of symptom development in relatives (sibs) of affected individuals Disorder λ Schizophrenia 12 Autism spectrum disorder 150 Bipolar disorder 7 Type I diabetes 35 Crohn s disease 25 Multiple sclerosis 24 λ=1 risk in general population Nussbaum et al: Thompson and Thompson s Genetics in Medicine, 2007, p 153.
7 Gene-hunting strategies for multifactorial disorders Candidate gene approach Disorder Etiological pathway Gene Mapping Positional cloning Disorder Mapping Gene Etiological pathway
8 Gene identification -Size of the human genome 3 x 10 9 bp -23,000 genes -Large portion of the genome function is not known. Claude Maunet Haystack at the Sunset near Giverny
9 Looking for genes in 23 pairs of chromosomes
10 SNPs can be used to create dense marker maps microsatellites SNPs Recent genome-wide association studies use millions of SNPs.
11 Positional cloning Clinical Phenotype Indications for a genetic basis? Study design families Sib pairs Single patients Sample and data collection Analysis Linkage analysis Association studies Define candidate regions Physical mapping/ gene identification
12 Linkage analysis Large families Sib-pair studies
13 1989 The cystic fibrosis gene is identified First successful positional cloning study
14 Linkage analysis What is the probability that the disease-causing mutation and the polymorphism we are studying are linked (at a specific genetic distance) vs non-linked?
15 Recombination and linkage Two loci are linked when little or no recombination occurs among them. Recombination: Exchange of DNA segments among homologous regions during meiosis and gamete formation. gametes
16 Recombination and linkage Two loci at different chromosomes cannot be linked. Linkage increases as physical distance decreases. 1cM distance among two genetic loci, means that 1% of produced gametes will be recombined.
17 Logarithm of the odds LOD score Logarithm of the probability of linkage Ratio of the probability that the two loci are linked vs the probability that they are not linked. likelihood of obtaining the test data if the two loci are indeed linked, to the likelihood of observing the same data purely by chance. LOD = 3.0 = odds of 1000/1 for linkage Corresponds to error of 5%
18 Linkage analysis Great power for gene identification for mendelian disorders Limited success when used for multifactorial disorders 1,2 1,2 1,2 1,2 1,1 1,1 2,2 1,1 2,2 2,2 1,1 2,2
19 Association studies Population studies trios
20 Genetic Association Studies Aim: To unravel associations between genetic data (ie alleles or genotypes) of commonly occurring genetic variants with information regarding a trait or a medical phenotype (ie disease) under study, using statistical analyses and a large enough sample size (typically cases versus controls), in order to support the statistics that these variants contribute to trait/disease risk. Examples of complex diseases Type II Diabetes Mellitus Obesity Cardiovascular diseases Cancer (non-hereditary) Osteoarthritis Autoimmune disorders Alzheimer s disease. Schizophrenia Autism Bipolar Disorder Obsessive Compulsive Disorder Learning disabilities (Dyslexia). 13/04/2016
21 Linkage disequilibrium (LD) The non random association of alleles at different loci Essential tool for genetic association studies
22 Example Linkage Disequilibrium SNP1: A 50% C 50% SNP2: A 50% G 50% SNP1 SNP2 expected frequency of haplotypes A A 0,5 x 0,5 A G 0,5 x 0,5 C A 0,5 x 0,5 C G 0,5 x 0,5 If total LD exists only 2 haplotypes will be observed (eg) A G C A
23 LD around a mutation generations k g
24 Extant chromosomes mutation Ancestral chromosome Ancestral DNA sequence Novel DNA sequence due to recombination Jobling, Hurles & Tyler-Smith. Human Evolutionary Genetics
25 Extant chromosomes mutation Ancestral chromosome Ancestral DNA sequence Novel DNA sequence due to recombination
26 Extant chromosomes polymorphism mutation Ancestral chromosome Ancestral DNA sequence Novel DNA sequence due to recombination
27 Extant chromosomes mutation SNPs Ancestral chromosome Ancestral DNA sequence Novel DNA sequence due to recombination
28 Genome-wide Association Studies (GWAS) Ideally Identify all SNPs (eg ) Collect a very large sample (eg 1,000 patients and 1,000 control individuals) Genotype all individuals for all SNPs 30 billion genotypes! Cost in 2002: 50 cents/genotype. $15 billion for each disorder!
29 Genome structure allows the selection of tagging SNPs mutation SNP mutation Hirschhorn & Daly, Nat Rev Genet 2005 Candidate gene or GWAS signal SNPs suffice for a rough scan of the human genome Paschou et al. Genome Research 2007 De Bakker et al. Nature Genetics 2005
30 Genetic Association Studies versus Linkage studies Appropriate for complex phenotypes Increased genetic (locus) heterogeneity (ie multigenic variance) many genes, many variants Common variance ( common disease common variants concept) modest disease risk per variant Inadequate power to detect associations Large numbers of cases and controls (healthy individuals), or family-based associations (trios, ie affected child and both parents) or extreme phenotypes Mostly SNPs (taggingsnps): A single or a few SNPs within a chromosomal region that capture(s) (ie tags ) most of the common DNA variation in this particular region, owing to the effect of Linkage Disequilibrium (LD). Associated SNPs most often not coding (they can be in LD with the causal variant or have a regulatory effect) Appropriate for Mendelian traits Reduced genetic heterogeneity (one or a few genes) Typically rare variants ( rare disease rare variants concept) in all affected individuals Large, multigenerational pedigrees detection power (parametric, non-parametric linkage analyses) Typically microsatellite markers, SNPs, or long stretches of chromosomal homozygosity (for recessive traits only) Note: LD is defined as the phenomenon of coinheritance (non-random association) of studied genetic marker (SNPs) alleles, unlikely to be separated by homologous recombination ( linked markers) within a population. 13/04/
31
32 Technology reduces the cost 2007 Candidate genes 1$ per SNP SNPs
33 Technology reduces the cost 2007 Candidate genes 1$ per SNP SNPs Genome genotyping (GWAS) 250$ 1 million SNPs
34 Genome-wide association studies (GWAS) Patients Thousands of samples The more the better! Controls Genotyping eg 600,000 SNPs Comparison of alleles Statistical analysis
35
36 September 2012: studies associated SNPs
37 Current status of available SNP genotyping platforms A comparison 13/04/2016
38 Workflow of a typical GWAS study Sampl e collecti on Peripheral blood Buccal cells DNA extraction and Quantifica tion Manual purification Kit-based purification DNA Plating 96-well plates 384-well plates Array run QC Pathwa y analyse s Statistica l analysis (traitdependent) Replication study (stage 2) ANOVA Chi-square Fisher s Exact Covariate adjustment Multiple testing Sample size Population Phenotypic criteria Fluorospectrophotometry Spectrophotometry Metaanalyse s Validation (technical, functional) Other genotyping platform In silico analysis (SNPs in LD) Gene expression analysis Cell-based analysis Animal models Resequencing of region Marianthi Georgitsi 13/04/2016
39 T2DM associated genes before 2007 Confirmed susceptibility genes in the pre-gwas era Altshuler et al, Nature Genetics 2000 Gloyn et al, Diabetes 2003 Grant et al, Nature Genetics 2006
40 T2DM associated genes after 2007 Candidate gene studies GWAS for T2DM GWAS for DM associated traits Meta-analysis studies
41 Body Mass Index GWAS and meta-analysis women of African ancestry >3.2 million SNPs 39,144 men and women of African ancestry Monda et al. Nature Genetics 2013
42 Alleles associated with multifactorial disorders differ in frequency around the world CORONA ET AL. 12TH INTERNATIONAL CONGRESS OF HUMAN GENETICS, 2011
43 Different genetic structure in different populations Significant differentiation even among European populations SNPs in different European populations. Adygei Sardinian Russian Italian/Tuscan Orcadian French Basque Paschou et al. Journal of Medical Genetics 2010 Paschou et al. PLoS Genetics 2008 Paschou et al. PLoS Genetics 2007
44 For each population, the average individual as well as standard deviation is shown Stathias et al. Annals of Human Genetics 2012 Principal Components Analysis 1,200 SNPs CEU Greeks TSI (Italians) (North Europe) (South Europe)
45 Population 1 cases Population 2 Population stratification is a confounding factor in genetic association studies controls aa Aa AA
46 Using genetic markers to correct for population stratification in genome-wide association studies European Asian African Paschou et al PLoS Genet 2007 Paschou et al PLoS Genet 2008
47 How many more genes are we looking for? Type 2 Diabetes already 45 genes associated DIAbetes Genetics Replication And Metaanalysis (DIAGRAM) Consortium 34,840 patients 114,981 controls 10 additional genes Morris et al. Nature Genetics 2012
48 Type 2 Diabetes associated genes account for 10-28% of heritability
49 The case of the missing heritability (Maher, Nature 2008)
50 What GWAS studies have to offer Pros and Cons Pros (+) High-throughput analysis (hundreds of thousands up to million variants) Large-scale projects with thousands participants Suitable for complex, non-mendelian disorders, quantitative traits, eqtls, Variety of software tools for computerized analysis Dataset repositories for meta-analyses continuously curated and updated Pathway analyses Certified service providers worldwide... Missing heritability of common diseases/disorders Marianthi Georgitsi Cons (-) They target pre-defined variants (biased) Not suitable for identification (de novo) studies Not suitable for studying Mendelian disorders Alleles confer small effect sizes [small Odds Ratios (typically OR<1.5)] False-positive (population stratification, genotyping errors, selection bias, etc) or falsenegative results (insensitivity to rare variants, lack of genetic variants from platforms, lack of variation in a SNP in the population under study) The richer they are in context, the more expensive Their analysis requires training in bioinformatics and computationally intensive analyses and infrastructure Functional approaches to interpret the data are needed (gene expression analysis, cell and animal model manipulations, etc)...
51 What GWAS studies have to offer Prof. Scharf J, Massachusetts General Hospital, Boston, personal communication, 2015 Missing heritability not targeted by GWAS studies Rare variants (MAF<1%) Structural variants Gene-gene interactions Gene-environment interactions Population isolates and population extremes Human Genome Low-hanging fruit? (high penetrance, high frequency) Marianthi Georgitsi
52 Where is the missing heritability? Common disease rare variant hypothesis McCarthy et al. Nat Rev Genet 2008
53 Technology reduces the cost 2007 Candidate genes 1$ per SNP SNPs Genome genotyping (GWAS) 250$ 1 million SNPs
54 Technology reduces the cost ? Candidate genes Genome genotyping (GWAS) Exome sequencing (NGS) Whole genome sequencing (NGS) 1$ per SNP 250$ 1000$ < 1000$ SNPs 1 million SNPs All exons The whole genome
55
56 1000 genomes project
57 Nature 526, (01 October 2015)
58 Exome sequencing identifies the cause of a Mendelian disorder Four affected individuals in three independent kindreds, Exome sequencing at 40x coverage. Filtering against public SNP databases and eight HapMap exomes identified a single candidate gene DHODH (Dihydroorotate dehydrogenase), encodes a key enzyme in the pyrimidine de novo biosynthesis pathway. Sanger sequencing confirmed the presence of DHODH mutations in three additional families with Miller syndrome. Ng et al. Nature Genetics 2010
59 Exome sequencing identifies MLL2 mutations as a cause of Kabuki syndrome Ten unrelated individuals. Seven individuals had nonsense or frameshift mutations at MLL2 MLL2 - encodes a Trithorax-group histone methyltransferase Sanger sequencing revealed MLL2 mutations in 2 of the remaining individuals De novo or transmitted according to phenotype Ng et al. Nature Genetics 2010
60 Exome sequencing to identify rare variants in complex disorders? Large families unrelateds trios Comparing the extremes of a phenotypic distribution Bamshad et al. Nature Reviews Genetics, 2011
61 The Exome Sequencing Project Sequencing the exomes of >7,000 individuals with extreme phenotypes Goal: find genes that underlie common cardiovascular disease (eg earlyonset myocardial infarction and stroke) and lung disease (eg chronic obstructive pulmonary disease) Funded by the US National Heart, Lung, and Blood Institute Fu et al. Nature 2013 Boileau et al. Nature Genetics 2012 Emond et al. Nature Genetics 2012 Tennessen et al. Science 2012
62 Data analysis of a typical NGS experiment Illumina site:
63
64
65 Workflow to identify disease variants in exome sequencing data Pabinger et al. Briefings in Bioinformatics, 2013
66 Making sense of NGS data Factors affecting the identification of causal alleles against a background of common polymorphisms: ~ SNPs (exome-seq) ~2% of SNPs identified per individual by WES is novel ~ SNPs (exome-seq) What is the mode of inheritance of a trait? Does population structure affect causal alleles? Does the phenotype arise de novo or due to inherited variants? Locus heterogeneity of a trait (=how many genes affect the manifestation of a trait) How large sample size in order to identify traitassociated alleles? What analytical framework (software, pipeline) to be used? Marianthi Georgitsi
67
68 20,000-25,000 exonic variants What is the effect? Coding? Intronic? Deleterious? Non-functional? Amino-acid change? Regulatory?
69 Who else has the variant? Human Variation Databases Goal: Determine presence of variant in others dbsnp - Includes everything!! - SNPs have information about origin -VCF available ClinSeq Exomes with phen. (dbgap) - CCDS and knowngenes - soon in dbsnp, VCF to be available 1000 Genomes - 1,094 low coverage genomes (~4x) - >=1% sensitivity - In dbsnp, VCF available Exomes CCDS (coming soon) NHLBI Exome Sequencing Project - 2,500 Exomes with phen. (dbgap) - in dbsnp, VCF available
70 Potential frequencies of causal variants in complex traits Cirulli and Goldstein, Nature Reviews Genetics 2010
71 How NGS has advanced our knowledge in disease predisposition and disease gene identification Chong et al., AJHG, 97(2): , 2015
72
73 Risk of developing a disease Lifestyle changes Personalized treatment (pharmacogenomics) Less toxicity & ADR / Better response Family planning Prenatal screening / Genetic counseling Presymptomatic or predictive screening Disease management and follow-up
74
75
76
77
78
79
80
81
EPIB 668 Genetic association studies. Aurélie LABBE - Winter 2011
EPIB 668 Genetic association studies Aurélie LABBE - Winter 2011 1 / 71 OUTLINE Linkage vs association Linkage disequilibrium Case control studies Family-based association 2 / 71 RECAP ON GENETIC VARIANTS
More informationComputational Workflows for Genome-Wide Association Study: I
Computational Workflows for Genome-Wide Association Study: I Department of Computer Science Brown University, Providence sorin@cs.brown.edu October 16, 2014 Outline 1 Outline 2 3 Monogenic Mendelian Diseases
More informationAssociation studies (Linkage disequilibrium)
Positional cloning: statistical approaches to gene mapping, i.e. locating genes on the genome Linkage analysis Association studies (Linkage disequilibrium) Linkage analysis Uses a genetic marker map (a
More informationHuman Genetic Variation. Ricardo Lebrón Dpto. Genética UGR
Human Genetic Variation Ricardo Lebrón rlebron@ugr.es Dpto. Genética UGR What is Genetic Variation? Origins of Genetic Variation Genetic Variation is the difference in DNA sequences between individuals.
More informationAssociation Mapping. Mendelian versus Complex Phenotypes. How to Perform an Association Study. Why Association Studies (Can) Work
Genome 371, 1 March 2010, Lecture 13 Association Mapping Mendelian versus Complex Phenotypes How to Perform an Association Study Why Association Studies (Can) Work Introduction to LOD score analysis Common
More informationUnderstanding genetic association studies. Peter Kamerman
Understanding genetic association studies Peter Kamerman Outline CONCEPTS UNDERLYING GENETIC ASSOCIATION STUDIES Genetic concepts: - Underlying principals - Genetic variants - Linkage disequilibrium -
More informationExome Sequencing Exome sequencing is a technique that is used to examine all of the protein-coding regions of the genome.
Glossary of Terms Genetics is a term that refers to the study of genes and their role in inheritance the way certain traits are passed down from one generation to another. Genomics is the study of all
More informationSUPPLEMENTARY INFORMATION
doi:10.1038/nature13127 Factors to consider in assessing candidate pathogenic mutations in presumed monogenic conditions The questions itemized below expand upon the definitions in Table 1 and are provided
More informationThe Human Genome Project has always been something of a misnomer, implying the existence of a single human genome
The Human Genome Project has always been something of a misnomer, implying the existence of a single human genome Of course, every person on the planet with the exception of identical twins has a unique
More informationPotential of human genome sequencing. Paul Pharoah Reader in Cancer Epidemiology University of Cambridge
Potential of human genome sequencing Paul Pharoah Reader in Cancer Epidemiology University of Cambridge Key considerations Strength of association Exposure Genetic model Outcome Quantitative trait Binary
More informationCS273B: Deep Learning in Genomics and Biomedicine. Recitation 1 30/9/2016
CS273B: Deep Learning in Genomics and Biomedicine. Recitation 1 30/9/2016 Topics Genetic variation Population structure Linkage disequilibrium Natural disease variants Genome Wide Association Studies Gene
More informationGenome-Wide Association Studies (GWAS): Computational Them
Genome-Wide Association Studies (GWAS): Computational Themes and Caveats October 14, 2014 Many issues in Genomewide Association Studies We show that even for the simplest analysis, there is little consensus
More informationIntroduction to Add Health GWAS Data Part I. Christy Avery Department of Epidemiology University of North Carolina at Chapel Hill
Introduction to Add Health GWAS Data Part I Christy Avery Department of Epidemiology University of North Carolina at Chapel Hill Outline Introduction to genome-wide association studies (GWAS) Research
More informationGenome-wide association studies (GWAS) Part 1
Genome-wide association studies (GWAS) Part 1 Matti Pirinen FIMM, University of Helsinki 03.12.2013, Kumpula Campus FIMM - Institiute for Molecular Medicine Finland www.fimm.fi Published Genome-Wide Associations
More informationAxiom Biobank Genotyping Solution
TCCGGCAACTGTA AGTTACATCCAG G T ATCGGCATACCA C AGTTAATACCAG A Axiom Biobank Genotyping Solution The power of discovery is in the design GWAS has evolved why and how? More than 2,000 genetic loci have been
More informationCrash-course in genomics
Crash-course in genomics Molecular biology : How does the genome code for function? Genetics: How is the genome passed on from parent to child? Genetic variation: How does the genome change when it is
More informationGenetic Variation and Genome- Wide Association Studies. Keyan Salari, MD/PhD Candidate Department of Genetics
Genetic Variation and Genome- Wide Association Studies Keyan Salari, MD/PhD Candidate Department of Genetics How many of you did the readings before class? A. Yes, of course! B. Started, but didn t get
More informationPopulation and Statistical Genetics including Hardy-Weinberg Equilibrium (HWE) and Genetic Drift
Population and Statistical Genetics including Hardy-Weinberg Equilibrium (HWE) and Genetic Drift Heather J. Cordell Professor of Statistical Genetics Institute of Genetic Medicine Newcastle University,
More informationStructure, Measurement & Analysis of Genetic Variation
Structure, Measurement & Analysis of Genetic Variation Sven Cichon, PhD Professor of Medical Genetics, Director, Division of Medcial Genetics, University of Basel Institute of Neuroscience and Medicine
More informationApplied Bioinformatics
Applied Bioinformatics In silico and In clinico characterization of genetic variations Assistant Professor Department of Biomedical Informatics Center for Human Genetics Research ATCAAAATTATGGAAGAA ATCAAAATCATGGAAGAA
More informationRedefine what s possible with the Axiom Genotyping Solution
Redefine what s possible with the Axiom Genotyping Solution From discovery to translation on a single platform The Axiom Genotyping Solution enables enhanced genotyping studies to accelerate your research
More informationAn introduction to genetics and molecular biology
An introduction to genetics and molecular biology Cavan Reilly September 5, 2017 Table of contents Introduction to biology Some molecular biology Gene expression Mendelian genetics Some more molecular
More informationProstate Cancer Genetics: Today and tomorrow
Prostate Cancer Genetics: Today and tomorrow Henrik Grönberg Professor Cancer Epidemiology, Deputy Chair Department of Medical Epidemiology and Biostatistics ( MEB) Karolinska Institutet, Stockholm IMPACT-Atanta
More informationPersonal Genomics Platform White Paper Last Updated November 15, Executive Summary
Executive Summary Helix is a personal genomics platform company with a simple but powerful mission: to empower every person to improve their life through DNA. Our platform includes saliva sample collection,
More informationFamilial Breast Cancer
Familial Breast Cancer SEARCHING THE GENES Samuel J. Haryono 1 Issues in HSBOC Spectrum of mutation testing in familial breast cancer Variant of BRCA vs mutation of BRCA Clinical guideline and management
More informationUK Biobank Axiom Array
DATA SHEET Advancing human health studies with powerful genotyping technology Array highlights The Applied Biosystems UK Biobank Axiom Array is a powerful array for translational research. Designed using
More informationWhy learn linkage analysis?
Why learn linkage analysis? - and some basic genetics Kaja Selmer 2013 Outline What is linkage analysis and why learn it? An example of a successful linkage analysis story Basic genetics DNA content and
More informationGenome Wide Association Studies
Genome Wide Association Studies Liz Speliotes M.D., Ph.D., M.P.H. Instructor of Medicine and Gastroenterology Massachusetts General Hospital Harvard Medical School Fellow Broad Institute Outline Introduction
More informationHuman linkage analysis. fundamental concepts
Human linkage analysis fundamental concepts Genes and chromosomes Alelles of genes located on different chromosomes show independent assortment (Mendel s 2nd law) For 2 genes: 4 gamete classes with equal
More informationGenomics: Human variation
Genomics: Human variation Lecture 1 Introduction to Human Variation Dr Colleen J. Saunders, PhD South African National Bioinformatics Institute/MRC Unit for Bioinformatics Capacity Development, University
More informationH3A - Genome-Wide Association testing SOP
H3A - Genome-Wide Association testing SOP Introduction File format Strand errors Sample quality control Marker quality control Batch effects Population stratification Association testing Replication Meta
More informationS G. Design and Analysis of Genetic Association Studies. ection. tatistical. enetics
S G ection ON tatistical enetics Design and Analysis of Genetic Association Studies Hemant K Tiwari, Ph.D. Professor & Head Section on Statistical Genetics Department of Biostatistics School of Public
More informationSNPs - GWAS - eqtls. Sebastian Schmeier
SNPs - GWAS - eqtls s.schmeier@gmail.com http://sschmeier.github.io/bioinf-workshop/ 17.08.2015 Overview Single nucleotide polymorphism (refresh) SNPs effect on genes (refresh) Genome-wide association
More informationTHE HEALTH AND RETIREMENT STUDY: GENETIC DATA UPDATE
: GENETIC DATA UPDATE April 30, 2014 Biomarker Network Meeting PAA Jessica Faul, Ph.D., M.P.H. Health and Retirement Study Survey Research Center Institute for Social Research University of Michigan HRS
More informationBioinformatic Analysis of SNP Data for Genetic Association Studies EPI573
Bioinformatic Analysis of SNP Data for Genetic Association Studies EPI573 Mark J. Rieder Department of Genome Sciences mrieder@u.washington washington.edu Epidemiology Studies Cohort Outcome Model to fit/explain
More informationWhat is genetic variation?
enetic Variation Applied Computational enomics, Lecture 05 https://github.com/quinlan-lab/applied-computational-genomics Aaron Quinlan Departments of Human enetics and Biomedical Informatics USTAR Center
More informationPUBH 8445: Lecture 1. Saonli Basu, Ph.D. Division of Biostatistics School of Public Health University of Minnesota
PUBH 8445: Lecture 1 Saonli Basu, Ph.D. Division of Biostatistics School of Public Health University of Minnesota saonli@umn.edu Statistical Genetics It can broadly be classified into three sub categories:
More informationInfinium Global Screening Array-24 v1.0 A powerful, high-quality, economical array for population-scale genetic studies.
Infinium Global Screening Array-24 v1.0 A powerful, high-quality, economical array for population-scale genetic studies. Highlights Global Content Includes a multiethnic genome-wide backbone, expertly
More informationDNA Collection. Data Quality Control. Whole Genome Amplification. Whole Genome Amplification. Measure DNA concentrations. Pros
DNA Collection Data Quality Control Suzanne M. Leal Baylor College of Medicine sleal@bcm.edu Copyrighted S.M. Leal 2016 Blood samples For unlimited supply of DNA Transformed cell lines Buccal Swabs Small
More informationGenome-Wide Association Studies. Ryan Collins, Gerissa Fowler, Sean Gamberg, Josselyn Hudasek & Victoria Mackey
Genome-Wide Association Studies Ryan Collins, Gerissa Fowler, Sean Gamberg, Josselyn Hudasek & Victoria Mackey Introduction The next big advancement in the field of genetics after the Human Genome Project
More informationProf. Dr. Konstantin Strauch
Genetic Epidemiology and Personalized Medicine Prof. Dr. Konstantin Strauch IBE - Lehrstuhl für Genetische Epidemiologie Ludwig-Maximilians-Universität Institut für Genetische Epidemiologie Helmholtz-Zentrum
More informationEPIB 668 Introduction to linkage analysis. Aurélie LABBE - Winter 2011
EPIB 668 Introduction to linkage analysis Aurélie LABBE - Winter 2011 1 / 49 OUTLINE Meiosis and recombination Linkage: basic idea Linkage between 2 locis Model based linkage analysis (parametric) Example
More informationGlobal Screening Array (GSA)
Technical overview - Infinium Global Screening Array (GSA) with optional Multi-disease drop in (MD) The Infinium Global Screening Array (GSA) combines a highly optimized, universal genome-wide backbone,
More informationB I O I N F O R M A T I C S
Bioinformatics LECTURE 3-16 B I O I N F O R M A T I C S Kristel Van Steen, PhD 2 Montefiore Institute - Systems and Modeling GIGA - Bioinformatics ULg kristel.vansteen@ulg.ac.be Bioinformatics LECTURE
More informationIntroduction to Quantitative Genomics / Genetics
Introduction to Quantitative Genomics / Genetics BTRY 7210: Topics in Quantitative Genomics and Genetics September 10, 2008 Jason G. Mezey Outline History and Intuition. Statistical Framework. Current
More informationVariant prioritization in NGS studies: Annotation and Filtering "
Variant prioritization in NGS studies: Annotation and Filtering Colleen J. Saunders (PhD) DST/NRF Innovation Postdoctoral Research Fellow, South African National Bioinformatics Institute/MRC Unit for Bioinformatics
More informationREVIEWS GENOME-WIDE ASSOCIATION STUDIES FOR COMMON DISEASES AND COMPLEX TRAITS. Joel N. Hirschhorn* and Mark J. Daly*
GENOME-WIDE ASSOCIATION STUDIES FOR COMMON DISEASES AND COMPLEX TRAITS Joel N. Hirschhorn* and Mark J. Daly* Abstract Genetic factors strongly affect susceptibility to common diseases and also influence
More informationAlgorithms for Genetics: Introduction, and sources of variation
Algorithms for Genetics: Introduction, and sources of variation Scribe: David Dean Instructor: Vineet Bafna 1 Terms Genotype: the genetic makeup of an individual. For example, we may refer to an individual
More informationGenetics and Bioinformatics
Genetics and Bioinformatics Kristel Van Steen, PhD 2 Montefiore Institute - Systems and Modeling GIGA - Bioinformatics ULg kristel.vansteen@ulg.ac.be Lecture 3: Genome-wide Association Studies 1 Setting
More informationUnderstanding the science and technology of whole genome sequencing
Understanding the science and technology of whole genome sequencing Dag Undlien Department of Medical Genetics Oslo University Hospital University of Oslo and The Norwegian Sequencing Centre d.e.undlien@medisin.uio.no
More informationHuman Genetics and Gene Mapping of Complex Traits
Human Genetics and Gene Mapping of Complex Traits Advanced Genetics, Spring 2018 Human Genetics Series Thursday 4/5/18 Nancy L. Saccone, Ph.D. Dept of Genetics nlims@genetics.wustl.edu / 314-747-3263 What
More informationPOLYMORPHISM AND VARIANT ANALYSIS. Matt Hudson Crop Sciences NCSA HPCBio IGB University of Illinois
POLYMORPHISM AND VARIANT ANALYSIS Matt Hudson Crop Sciences NCSA HPCBio IGB University of Illinois Outline How do we predict molecular or genetic functions using variants?! Predicting when a coding SNP
More informationHuman linkage analysis. fundamental concepts
Human linkage analysis fundamental concepts Genes and chromosomes Alelles of genes located on different chromosomes show independent assortment (Mendel s 2nd law) For 2 genes: 4 gamete classes with equal
More informationHuman Genetics and Gene Mapping of Complex Traits
Human Genetics and Gene Mapping of Complex Traits Advanced Genetics, Spring 2015 Human Genetics Series Thursday 4/02/15 Nancy L. Saccone, nlims@genetics.wustl.edu ancestral chromosome present day chromosomes:
More informationMulti-SNP Models for Fine-Mapping Studies: Application to an. Kallikrein Region and Prostate Cancer
Multi-SNP Models for Fine-Mapping Studies: Application to an association study of the Kallikrein Region and Prostate Cancer November 11, 2014 Contents Background 1 Background 2 3 4 5 6 Study Motivation
More informationPOPULATION GENETICS studies the genetic. It includes the study of forces that induce evolution (the
POPULATION GENETICS POPULATION GENETICS studies the genetic composition of populations and how it changes with time. It includes the study of forces that induce evolution (the change of the genetic constitution)
More informationIntroduction to Genetics and Pharmacogenomics
Introduction to Genetics and Pharmacogenomics Ching-Lung Cheung, PhD Assistant Professor, Department of Pharmacology and Pharmacy, Centre for Genomic Sciences, HKU Survey on pharmacogenomic knowledge Survey
More informationDanika Bannasch DVM PhD. School of Veterinary Medicine University of California Davis
Genetics 101 Danika Bannasch DVM PhD Maxine Adler Endowed Chair in Genetics School of Veterinary Medicine University of California Davis Outline Basic genetics: The Rules Not so basic genetics: The exceptions
More informationGenomic Research: Issues to Consider. IRB Brown Bag August 28, 2014 Sharon Aufox, MS, LGC
Genomic Research: Issues to Consider IRB Brown Bag August 28, 2014 Sharon Aufox, MS, LGC Outline Key genomic terms and concepts Issues in genomic research Consent models Types of findings Returning results
More informationHuman Chromosomes Section 14.1
Human Chromosomes Section 14.1 In Today s class. We will look at Human chromosome and karyotypes Autosomal and Sex chromosomes How human traits are transmitted How traits can be traced through entire families
More informationGenetics and Psychiatric Disorders Lecture 1: Introduction
Genetics and Psychiatric Disorders Lecture 1: Introduction Amanda J. Myers LABORATORY OF FUNCTIONAL NEUROGENOMICS All slides available @: http://labs.med.miami.edu/myers Click on courses First two links
More informationPopulation Genetics Sequence Diversity Molecular Evolution. Physiology Quantitative Traits Human Diseases
Population Genetics Sequence Diversity Molecular Evolution Physiology Quantitative Traits Human Diseases Bioinformatics problems in medicine related to physiology and quantitative traits Note: Genetics
More informationText Reference: Ch and 12-2
Text Reference: Ch. 12-1 and 12-2 Name Date Block Part I: Short Answer/ Completion 1. What combination of sex chromosomes produces a female? 2. What combination of sex chromosomes produces a male? 3. Which
More informationPharmacogenetics: A SNPshot of the Future. Ani Khondkaryan Genomics, Bioinformatics, and Medicine Spring 2001
Pharmacogenetics: A SNPshot of the Future Ani Khondkaryan Genomics, Bioinformatics, and Medicine Spring 2001 1 I. What is pharmacogenetics? It is the study of how genetic variation affects drug response
More informationLesson Overview. Studying the Human Genome. Lesson Overview Studying the Human Genome
Lesson Overview 14.3 Studying the Human Genome THINK ABOUT IT Just a few decades ago, computers were gigantic machines found only in laboratories and universities. Today, many of us carry small, powerful
More informationAge-Adjusted Death Rates for Coronary Heart Disease, U.S.,
Age-Adjusted Death Rates for Coronary Heart Disease, U.S., 1950-2004 Deaths/100,000 Population 600 500 400 300 200 100 Risk Factors U.S. Actual U.S. "Could Be" (Based on Japan Actual) 0 1950 1960 1970
More informationNext Genera*on Sequencing II: Personal Genomics. Jim Noonan Department of Gene*cs
Next Genera*on Sequencing II: Personal Genomics Jim Noonan Department of Gene*cs Personal genome sequencing Iden*fying the gene*c basis of phenotypic diversity among humans Gene*c risk factors for disease
More informationSingle Nucleotide Variant Analysis. H3ABioNet May 14, 2014
Single Nucleotide Variant Analysis H3ABioNet May 14, 2014 Outline What are SNPs and SNVs? How do we identify them? How do we call them? SAMTools GATK VCF File Format Let s call variants! Single Nucleotide
More informationExploring the Genetic Basis of Congenital Heart Defects
Exploring the Genetic Basis of Congenital Heart Defects Sanjay Siddhanti Jordan Hannel Vineeth Gangaram szsiddh@stanford.edu jfhannel@stanford.edu vineethg@stanford.edu 1 Introduction The Human Genome
More informationGenetic dissection of complex traits, crop improvement through markerassisted selection, and genomic selection
Genetic dissection of complex traits, crop improvement through markerassisted selection, and genomic selection Awais Khan Adaptation and Abiotic Stress Genetics, Potato and sweetpotato International Potato
More informationSequence Variations. Baxevanis and Ouellette, Chapter 7 - Sequence Polymorphisms. NCBI SNP Primer:
Sequence Variations Baxevanis and Ouellette, Chapter 7 - Sequence Polymorphisms NCBI SNP Primer: http://www.ncbi.nlm.nih.gov/about/primer/snps.html Overview Mutation and Alleles Linkage Genetic variation
More informationEstimation problems in high throughput SNP platforms
Estimation problems in high throughput SNP platforms Rob Scharpf Department of Biostatistics Johns Hopkins Bloomberg School of Public Health November, 8 Outline Introduction Introduction What is a SNP?
More informationINTRODUCTION TO MOLECULAR GENETICS. Andrew McQuillin Molecular Psychiatry Laboratory UCL Division of Psychiatry 22 Sept 2017
INTRODUCTION TO MOLECULAR GENETICS Andrew McQuillin Molecular Psychiatry Laboratory UCL Division of Psychiatry 22 Sept 2017 Learning Objectives Understand: The distinction between Quantitative Genetic
More informationMONTE CARLO PEDIGREE DISEQUILIBRIUM TEST WITH MISSING DATA AND POPULATION STRUCTURE
MONTE CARLO PEDIGREE DISEQUILIBRIUM TEST WITH MISSING DATA AND POPULATION STRUCTURE DISSERTATION Presented in Partial Fulfillment of the Requirements for the Degree Doctor of Philosophy in the Graduate
More informationUKPMC Funders Group Author Manuscript Nature. Author manuscript; available in PMC 2011 April 1.
UKPMC Funders Group Author Manuscript Published in final edited form as: Nature. 2010 October 28; 467(7319): 1061 1073. doi:10.1038/nature09534. A map of human genome variation from population scale sequencing
More informationBA, BSc, and MSc Degree Examinations
Examination Candidate Number: Desk Number: BA, BSc, and MSc Degree Examinations 2017-8 Department : BIOLOGY Title of Exam: Human genetics Time Allowed: 2 hours Marking Scheme: Total marks available for
More informationChapter 11: Genome-Wide Association Studies
Education Chapter 11: Genome-Wide Association Studies William S. Bush 1 *, Jason H. Moore 2 1 Department of Biomedical Informatics, Center for Human Genetics Research, Vanderbilt University Medical School,
More informationBy the end of this lecture you should be able to explain: Some of the principles underlying the statistical analysis of QTLs
(3) QTL and GWAS methods By the end of this lecture you should be able to explain: Some of the principles underlying the statistical analysis of QTLs Under what conditions particular methods are suitable
More informationMultiple Sclerosis: Recent Insights from Genomics. Bruce Cree, MD, PhD, MAS University of California San Francisco. Disclosure
Multiple Sclerosis: Recent Insights from Genomics Bruce Cree, MD, PhD, MAS University of California San Francisco Disclosure Bruce Cree has received personal compensation for consulting from Abbvie, Biogen
More informationFactors affecting statistical power in the detection of genetic association
Review series Factors affecting statistical power in the detection of genetic association Derek Gordon 1 and Stephen J. Finch 2 1 Laboratory of Statistical Genetics, Rockefeller University, New York, New
More informationTrudy F C Mackay, Department of Genetics, North Carolina State University, Raleigh NC , USA.
Question & Answer Q&A: Genetic analysis of quantitative traits Trudy FC Mackay What are quantitative traits? Quantitative, or complex, traits are traits for which phenotypic variation is continuously distributed
More informationLecture 23: Causes and Consequences of Linkage Disequilibrium. November 16, 2012
Lecture 23: Causes and Consequences of Linkage Disequilibrium November 16, 2012 Last Time Signatures of selection based on synonymous and nonsynonymous substitutions Multiple loci and independent segregation
More informationSupplementary Note: Detecting population structure in rare variant data
Supplementary Note: Detecting population structure in rare variant data Inferring ancestry from genetic data is a common problem in both population and medical genetic studies, and many methods exist to
More informationWhy do we need statistics to study genetics and evolution?
Why do we need statistics to study genetics and evolution? 1. Mapping traits to the genome [Linkage maps (incl. QTLs), LOD] 2. Quantifying genetic basis of complex traits [Concordance, heritability] 3.
More informationLet s call the recessive allele r and the dominant allele R. The allele and genotype frequencies in the next generation are:
Problem Set 8 Genetics 371 Winter 2010 1. In a population exhibiting Hardy-Weinberg equilibrium, 23% of the individuals are homozygous for a recessive character. What will the genotypic, phenotypic and
More informationProduced by the Centre for Genetics Education. Internet: 5
Important points Genes are made of DNA The information in the DNA is in the form of a chemical code made up of four letters (A, T, C and G). Each word in the information is made up of three of these four
More informationWhole Genome Sequencing. Biostatistics 666
Whole Genome Sequencing Biostatistics 666 Genomewide Association Studies Survey 500,000 SNPs in a large sample An effective way to skim the genome and find common variants associated with a trait of interest
More informationAnswers to additional linkage problems.
Spring 2013 Biology 321 Answers to Assignment Set 8 Chapter 4 http://fire.biol.wwu.edu/trent/trent/iga_10e_sm_chapter_04.pdf Answers to additional linkage problems. Problem -1 In this cell, there two copies
More informationIntroduction to Next Generation Sequencing (NGS) Andrew Parrish Exeter, 2 nd November 2017
Introduction to Next Generation Sequencing (NGS) Andrew Parrish Exeter, 2 nd November 2017 Topics to cover today What is Next Generation Sequencing (NGS)? Why do we need NGS? Common approaches to NGS NGS
More informationSEQUENCING. M Ataei, PhD. Feb 2016
CLINICAL NEXT GENERATION SEQUENCING M Ataei, PhD Tehran Medical Genetics Laboratory Feb 2016 Overview 2 Background NGS in non-invasive prenatal diagnosis (NIPD) 3 Background Background 4 In the 1970s,
More informationb. (3 points) The expected frequencies of each blood type in the deme if mating is random with respect to variation at this locus.
NAME EXAM# 1 1. (15 points) Next to each unnumbered item in the left column place the number from the right column/bottom that best corresponds: 10 additive genetic variance 1) a hermaphroditic adult develops
More informationPERSPECTIVES. A gene-centric approach to genome-wide association studies
PERSPECTIVES O P I N I O N A gene-centric approach to genome-wide association studies Eric Jorgenson and John S. Witte Abstract Genic variants are more likely to alter gene function and affect disease
More informationSYLLABUS AND SAMPLE QUESTIONS FOR JRF IN BIOLOGICAL ANTHROPOLGY 2011
SYLLABUS AND SAMPLE QUESTIONS FOR JRF IN BIOLOGICAL ANTHROPOLGY 2011 SYLLABUS 1. Introduction: Definition and scope; subdivisions of anthropology; application of genetics in anthropology. 2. Human evolution:
More informationCS 262 Lecture 14 Notes Human Genome Diversity, Coalescence and Haplotypes
CS 262 Lecture 14 Notes Human Genome Diversity, Coalescence and Haplotypes Coalescence Scribe: Alex Wells 2/18/16 Whenever you observe two sequences that are similar, there is actually a single individual
More informationProfessor Jane Farrar School of Genetics & Microbiology, TCD.
Lecture 1 Genetics An Overview Professor Jane Farrar School of Genetics & Microbiology, TCD. CAMPBELL BIOLOGY Campbell, Reece and Mitchell CONCEPTS OF GENTICS Klug and Cummings A Whistle-Stop Tour of 150
More informationTake Home Message. Molecular Imaging Genomics. How to do Genetics. Questions for the Study of. Two Common Methods for Gene Localization
Take Home Message Molecular Imaging Genomics David C. Glahn, PhD Olin Neuropsychiatry Research Center & Department of Psychiatry, Yale University This may be the most important thing I say in this lecture.
More informationAnalytics Behind Genomic Testing
A Quick Guide to the Analytics Behind Genomic Testing Elaine Gee, PhD Director, Bioinformatics ARUP Laboratories 1 Learning Objectives Catalogue various types of bioinformatics analyses that support clinical
More informationLinking Genetic Variation to Important Phenotypes
Linking Genetic Variation to Important Phenotypes BMI/CS 776 www.biostat.wisc.edu/bmi776/ Spring 2018 Anthony Gitter gitter@biostat.wisc.edu These slides, excluding third-party material, are licensed under
More informationLD Mapping and the Coalescent
Zhaojun Zhang zzj@cs.unc.edu April 2, 2009 Outline 1 Linkage Mapping 2 Linkage Disequilibrium Mapping 3 A role for coalescent 4 Prove existance of LD on simulated data Qualitiative measure Quantitiave
More informationDNA. bioinformatics. genomics. personalized. variation NGS. trio. custom. assembly gene. tumor-normal. de novo. structural variation indel.
DNA Sequencing T TM variation DNA amplicon mendelian trio genomics NGS bioinformatics tumor-normal custom SNP resequencing target validation de novo prediction personalized comparative genomics exome private
More information