Coordinating Center Report. Paul Harris, Marylyn Ritchie, Josh Denny, & Brad Malin

Size: px
Start display at page:

Download "Coordinating Center Report. Paul Harris, Marylyn Ritchie, Josh Denny, & Brad Malin"

Transcription

1 Coordinating Center Report Paul Harris, Marylyn Ritchie, Josh Denny, & Brad Malin

2 accelerate phenotype development & sharing expand genomic data integration enhance methods to determine re-identification & privacy risks Coordinating Center provide logistical support & project facilitation sites External Scientific Panel

3 Accelerate phenotype development & sharing

4 Phenotyping Efficiency and Transportability: PheKB Designed to enable a workflow and integrated tools and standards that guide the user in navigating from early stage development to final public sharing. emerge, PCORnet, NIH Collaboratory, PGRN, PGPop 414 active users, 52 institutions 30 publicly available phenotypes, 62 phenotypes in development 149 implementations (for 53 phenotypes) 3,086 views/month..increasing the efficiency and accuracy of phenotype definition

5 PheKB phenotype views Type II DM ADHD MidSouth CDRN Coronary Heart Disease Atrial fibrillation Demonstration Drug Induced Liver Injury Cardiac Conduction Dementia Severe Early Childhood Obesity Ace-inhibitor induced cough Type-2 Diabetes Demonstration Crohns Disease Demonstration Peripheral Arterial Disease Cataracts Diabetic Retinopathy Height Hypothyroidism Lipids Rheumatoid Arthritis Demonstration Multiple Sclerosis Demonstration Red blood cell indices High Density Lipoproteins White blood cell indices Unique Page Views

6 Phenotype Modalities used in 92 PheKB phenotypes Data modalities or methods Number of phenotypes utilizing these features Public (N=30) Non-Public* (N=62) Percent of Total ICD-9 Codes % Medications % Natural Language Processing % CPT Codes % Laboratory test results %

7 Sharing to other networks Key example - T2D algorithm reused in 40 non-emerge projects NIH Collaboratory, PCORnet sites using PheKB now 7 phenotypes in PheKB currently amongst these 25 non-emerge papers reference PheKB as a source

8 Data Sharing The Data Dictionary/Data Validation Tool (Luke Rasmussen) Verifies a data dictionary file adherence to emerge and dbgap standards Verifies a data set against a data dictionary Guarantees that shared files are consistently formatted to minimize time needed to clean/reformat data..increasing the efficiency and accuracy of data sharing

9 Results Dissemination and Re-use: PheWAS Catalog An interactive tool of posted results from a large PheWAS study. Currently disseminating the first of many emerge PheWAS study results. PheWAS results for >3000 SNPs identified in GWAS studies search SNPs, phenotypes, genes make/save graphs export data sets 2,300 users visit to the site

10 R PheWAS Package ex=generateexample() EHR Biobank ICD9 Codes Demographics R PheWAS Package PheWAS Code Translation Phenotypes Genotypes Association Testing Results Plotting

11 Some emerge PheWAS examples GWAS QRS duration (VU) GWAS Platelet size/volume (Mayo) GWAS Herpes Zoster (GHC/UW) Pediatric PheWAS (CCHMC and CHOP) Null variant studies (Network) CNV PheWAS (VU/GHC/UW/CHOP) Neanderthal PheWAS (VU, UW) PCSK9 (Mayo) FTO introns (Adult and pediatric) PheWAS on autism comorbidities HLA variants, Functional variants (Marshfield) Developing text and NLP-based PheWAS

12 Shameer et al. Hum Genet 2014 PheWAS of Platelet size (a) and volume (b)

13 Hypothesis Generation and Testing: emerge Record Counter (erc) Web-based research tool for members of the emerge community with over 55,000 subjects with demographic, diagnosis, and procedure codes. Drag and Drop feature that utilized basic Boolean-logic Used to assess how well a phenotype might do in a genotyped population as well as for site-level assessment improving early stage feasibility to produce the highest rate of return on resources.

14 Medication extraction and curation Developed and distributed MedEx (emerge 1) and MedEx-UIMA (emerge 2) MedEx-UIMA maps to RxNorm codes for specific drug and generic ingredient Have used these and other tools to clean SPHINX medication data Data dictionaries for collection and validation of central data for emerge GWAS and emerge-pgx sets

15 Expand genomic data integration

16 Evaluation of quality control for merging genomic datasets Dataset IMPUTE2 Imputed Data # Genotyped Samples Merged emerge-i 1M 2,634 Merged emerge-i ,029 Geisinger 3,111 Group Health/ U Wash 731 Marshfield/Essentia/PSU 616 Mayo Clinic 3,121 Mount Sinai 6,290 Northwestern 2,951 Vanderbilt 7,616 CCHMC/BCH 5,346 CHOP 6,850 Total All IMPUTE2 Imputed Samples 55,292 Developed emerge imputation pipeline Round 1: BEAGLE Round 2: IMPUTE2 Merged imputed dataset released to dbgap Determined subtle batch effects which need to be corrected for by site, platform, sex, and PCs

17

18 Extraction of additional information from merged GWAS datasets CNV calling in emerge-i data All intensity files available for emerge-i CNV calling done by CC and by CHOP QC ed CNV dataset available from CC CNV calling requested in emerge-ii data CC solicited intensity files for emerge-ii Limited sets have been submitted thus far

19 Evaluation of genetic models for clinical relevance Null variant PheWAS project led by CC Selected 25 null variants Adult and pediatric PheWAS conducted Manuscript in progress

20 emerge-pgx CC developed variant calling pipeline for emerge-pgx in collaboration with genomics workgroup Performed variant calling for all emerge-pgx samples Concordance analyses across sites and platforms (manuscript in progress) Developed SPHINX for disseminating variant information to the community (manuscript submitted)

21 Average of 200 Unique users per month. 2,334 unique users since release and 7,447 page views since release

22 Impact & Leadership Led a special issue to Frontiers in Genetics

23 Enhance methods to determine re-identification & privacy risks

24 emerge Data Privacy Advancements Reviews of the Field Biobanking Malin. Identifiability in biobanks: models, measures, and mitigation strategies. Human Genetics Attacks & Protections Naveed. Privacy in the genomic era. ACM Computing Surveys. In Press. Evaluations Risk-Utility Frontiers Across emerge States W. Xia. R-U policy frontiers for health data de-identification PheWAS Anonymization Across emerge Sites R. Heatherly. Multi-site anonymization. JAMIA 2 nd round. R. Heatherly. Size matters. J Biomedical Informatics Technology Enhancements Natural Language De-identification Carrell. Hiding in plain sight: use of realistic surrogates to reduce exposure of protected health information in clinical text. JAMIA Li. De-identification of clinical narratives through writing complexity measures. IJMI Risk-Utility Frontiers for De-identification W. Xia. Efficient discovery of de-identification policies. ACM CODASPY Secure Multiparty Computation W. Xie, et al. SecureMA: Protecting participant privacy in genetic association meta-analysis. Bioinformatics Policy Assessments Diagnosis Re-identification Adversarial Knowledge R. Heatherly. Enabling genomic-phenomic association discovery without sacrificing anonymity. PLoS One Game Theory for Demographic Re-identification Z. Wan. A Game theoretic framework for analyzing re-identification risks. PLoS One W. Xia. Process-based data privacy. In review.

25 Policy: Stackelberg Games (Z. Wan, et al. PLoS One. 2015) Sharing Strategy 1 Utility 1 Risk B Publisher Benefit per record = $1200 Cost per violation = $300 ~35000 Census records 5-digit ZIP Age (Years) Race Sex No Attack Scenarios exist! Recipient s Best Strategy Attack Strategy A Utility A Risk A Attack Strategy B Utility B Risk B Attack Strategy C Utility C Risk C Recipient

26 Policy: The Homer Attack CONTROLS (e.g., No diabetes) Association Studies (e.g., Sphinx) SNP 1 SNP 2 SNP n Allele AA AT TT CC CG GG GG GT TT PRIVATE adversary Name SNP 1 SNP 2 SNP n Alice AA CG TT Bob AT GG GT Probability CASES (e.g., Diabetes) SNP 1 SNP 2 SNP n Allele AA AT TT CC CG GG GG GT TT Probability General Population (e.g., HapMAP) SNP 1 SNP 2 SNP n Allele AA AT TT CC CG GG GG GT TT Probability

27 Policy: Sphinx Genomic Inference Game Privacy risk: Likelihood statistic* Are you in the Sphinx pool or the population? More genomic regions we disclose, greater the risk Utility: Proportional to number of regions you disclose Can tune to account for associations with phenotype *Sankararaman S, Obozinski G, Jordan MI, Halperin E. Genomic privacy and limits of individual detection in a pool. Nature Genetics Sep;41(9):965-7.

28 Policy: Sphinx Analysis (Z. Wan, et al. In Development) Utility for emerge & Adversary: Each record worth $99 (based on 23andme) Each SNP has worth proportional to # instances in NHGRI GWAS catalog Probability of detection set to 1 Expected Penalty: is probability of detection * fine

29 Experiment: Sensitivity to Penalty Configuration All SNPs sorted according to MAF (ascending). Utility for each SNP is uniformly distributed Pre-processing keeps 322 SNPs The target set consists of 500 random samples from the pool and the reference respectively. The maximal benefit the publisher gain from publishing all these 322 SNPs is $17,720

30 Optimal # of Released SNPs

31 Major Finding Larger the expected penalty the more SNPs publishable When the expected penalty is equal to the benefit of a successful inference attack, the publisher can publish all the SNPs.

32 It s a Much Larger Analysis If only we had time. Larger publisher s utility more SNPs publishable Alternative: All SNPs sorted in descending MAF order Allows more SNPs to be published! The utility distribution for all 218 independent SNPs

33 Provide logistical support & project facilitation

34 Providing a National Home

35 Tools for project facilitation

36 Supporting and Streamlining Publications Policy, Process & Reporting Assess, Build, Report Meeting Support, Communication, and Project Management

37 Supporting and Streamlining Publications Policy, Process & Reporting Assess, Build, Report Meeting Support, Communication, and Project Management

38 Supporting and Streamlining Publications Policy, Process & Reporting Assess, Build, Report Meeting Support, Communication, and Project Management During emerge Phase II, the CC has provided support for: 700+ workgroup calls 16 Steering Committee/ESP Meetings 8 ESP Packets

39 emerge Coordinating Center Team Paul Harris Marylyn Ritchie Melissa Basford Bradley Malin Josh Denny Sarah Stallings Jacqueline Kirby Mollie Bodin Claar Adam Hardebeck Brianne Derveloy Peter Speltz Shefali Verma Anurag Verma John Wallace Yuki Bradford Sarah Pendergrass Alex Frase Scott Dudek Anna Okula Anastasia Lucas Jason Tan Lixen Chen Yi Jiang Lisa Bastarache Martha Shrubsole Raymond Heatherly Weiyi Xia Zhiyu Wan Jonathan Schildcrout Nathaniel Mercaldo Kyle Brothers

emerge-ii site report Vanderbilt

emerge-ii site report Vanderbilt emerge-ii site report Vanderbilt 29 June 2015 Vanderbilt activities emerge II PGx implementation locally and emerge-pgx SCN5A/KCNH2 project provider attitudes Phenotype contributions Methods development

More information

Personal Genomics Platform White Paper Last Updated November 15, Executive Summary

Personal Genomics Platform White Paper Last Updated November 15, Executive Summary Executive Summary Helix is a personal genomics platform company with a simple but powerful mission: to empower every person to improve their life through DNA. Our platform includes saliva sample collection,

More information

Joshua C. Denny Dan M. Roden Laura J. Rasmussen-Torvik. Workgroup Leads

Joshua C. Denny Dan M. Roden Laura J. Rasmussen-Torvik. Workgroup Leads Joshua C. Denny Dan M. Roden Laura J. Rasmussen-Torvik Workgroup Leads emerge PGx Plenary Session Accomplishments PGx Implementation PGx Cohort Description Sharing PGx Data Publications Ongoing Work Updates

More information

UK Biobank Axiom Array

UK Biobank Axiom Array DATA SHEET Advancing human health studies with powerful genotyping technology Array highlights The Applied Biosystems UK Biobank Axiom Array is a powerful array for translational research. Designed using

More information

Linking Genetic Variation to Important Phenotypes

Linking Genetic Variation to Important Phenotypes Linking Genetic Variation to Important Phenotypes BMI/CS 776 www.biostat.wisc.edu/bmi776/ Spring 2018 Anthony Gitter gitter@biostat.wisc.edu These slides, excluding third-party material, are licensed under

More information

Introduction to statistics for Genome- Wide Association Studies (GWAS) Day 2 Section 8

Introduction to statistics for Genome- Wide Association Studies (GWAS) Day 2 Section 8 Introduction to statistics for Genome- Wide Association Studies (GWAS) 1 Outline Background on GWAS Presentation of GenABEL Data checking with GenABEL Data analysis with GenABEL Display of results 2 R

More information

Age-Adjusted Death Rates for Coronary Heart Disease, U.S.,

Age-Adjusted Death Rates for Coronary Heart Disease, U.S., Age-Adjusted Death Rates for Coronary Heart Disease, U.S., 1950-2004 Deaths/100,000 Population 600 500 400 300 200 100 Risk Factors U.S. Actual U.S. "Could Be" (Based on Japan Actual) 0 1950 1960 1970

More information

Beyond single genes or proteins

Beyond single genes or proteins Beyond single genes or proteins Marylyn D Ritchie, PhD Professor, Biochemistry and Molecular Biology Director, Center for Systems Genomics The Pennsylvania State University Traditional Approach Genome-wide

More information

Pharmacogenomics and Health Policy

Pharmacogenomics and Health Policy Pharmacogenomics and Health Policy Amalia M. Issa, PhD, MPH Founding Director, Program in Personalized Medicine & Targeted Therapeutics Professor and Chair Department of Health Policy and Public Health

More information

Privacy Preserving Data Mining in Application

Privacy Preserving Data Mining in Application Brookings Institution Expert Workshop on Statistical & Epidemiological Issues in Active Medical Product Surveillance A Brief Introduction to Privacy Enhancing Technologies for Surveillance Purposes Bradley

More information

Exploring the Genetic Basis of Congenital Heart Defects

Exploring the Genetic Basis of Congenital Heart Defects Exploring the Genetic Basis of Congenital Heart Defects Sanjay Siddhanti Jordan Hannel Vineeth Gangaram szsiddh@stanford.edu jfhannel@stanford.edu vineethg@stanford.edu 1 Introduction The Human Genome

More information

Illumina s GWAS Roadmap: next-generation genotyping studies in the post-1kgp era

Illumina s GWAS Roadmap: next-generation genotyping studies in the post-1kgp era Illumina s GWAS Roadmap: next-generation genotyping studies in the post-1kgp era Anthony Green Sr. Genotyping Sales Specialist North America 2010 Illumina, Inc. All rights reserved. Illumina, illuminadx,

More information

The Electronic Medical Records and Genomics (emerge) Network: past, present, and future

The Electronic Medical Records and Genomics (emerge) Network: past, present, and future American College of Medical Genetics and Genomics Review Open The Electronic Medical Records and Genomics (emerge) Network: past, present, and future Omri Gottesman, MD 1, Helena Kuivaniemi, MD, PhD 2,

More information

Complete automation for NGS interpretation and reporting with evidence-based clinical decision support

Complete automation for NGS interpretation and reporting with evidence-based clinical decision support Brochure Bioinformatics for Clinical Oncology Testing Complete automation for NGS interpretation and reporting with evidence-based clinical decision support Sample to Insight Powering clinical insights

More information

Genomic Research: Issues to Consider. IRB Brown Bag August 28, 2014 Sharon Aufox, MS, LGC

Genomic Research: Issues to Consider. IRB Brown Bag August 28, 2014 Sharon Aufox, MS, LGC Genomic Research: Issues to Consider IRB Brown Bag August 28, 2014 Sharon Aufox, MS, LGC Outline Key genomic terms and concepts Issues in genomic research Consent models Types of findings Returning results

More information

Re-Identification of DNA through an Automated Linkage Process

Re-Identification of DNA through an Automated Linkage Process Re-Identification of DNA through an Automated Linkage Process Bradley Malin and Latanya Sweeney Laboratory for International Data Privacy School of Computer Science and Heinz School of Public Policy Carnegie

More information

Regulatory and Ethical Issues in Repository Research

Regulatory and Ethical Issues in Repository Research Regulatory and Ethical Issues in Repository Research Kelly Fryer-Edwards ITHS Bioethics Core and UW Bioethics Shannon Sewards Human Subjects Division Agenda Review of Regulatory and Ethical Issues Creating

More information

From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow

From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow Technical Overview Import VCF Introduction Next-generation sequencing (NGS) studies have created unanticipated challenges with

More information

High Cross-Platform Genotyping Concordance of Axiom High-Density Microarrays and Eureka Low-Density Targeted NGS Assays

High Cross-Platform Genotyping Concordance of Axiom High-Density Microarrays and Eureka Low-Density Targeted NGS Assays High Cross-Platform Genotyping Concordance of Axiom High-Density Microarrays and Eureka Low-Density Targeted NGS Assays Ali Pirani and Mohini A Patil ISAG July 2017 The world leader in serving science

More information

S SG. Metabolomics meets Genomics. Hemant K. Tiwari, Ph.D. Professor and Head. Metabolomics: Bench to Bedside. ection ON tatistical.

S SG. Metabolomics meets Genomics. Hemant K. Tiwari, Ph.D. Professor and Head. Metabolomics: Bench to Bedside. ection ON tatistical. S SG ection ON tatistical enetics Metabolomics meets Genomics Hemant K. Tiwari, Ph.D. Professor and Head Section on Statistical Genetics Department of Biostatistics School of Public Health Metabolomics:

More information

Derrek Paul Hibar

Derrek Paul Hibar Derrek Paul Hibar derrek.hibar@ini.usc.edu Obtain the ADNI Genetic Data Quality Control Procedures Missingness Testing for relatedness Minor allele frequency (MAF) Hardy-Weinberg Equilibrium (HWE) Testing

More information

Testimony of Christopher Newton-Cheh, MD, MPH Volunteer for the American Heart Association

Testimony of Christopher Newton-Cheh, MD, MPH Volunteer for the American Heart Association Testimony of Christopher Newton-Cheh, MD, MPH Volunteer for the American Heart Association Before the House Energy and Commerce Subcommittee on Health 21st Century Cures: Examining the Regulation of Laboratory

More information

State of the Art in Data Management for Precision Medicine & Genomics. March 8, pm 3 pm ET

State of the Art in Data Management for Precision Medicine & Genomics. March 8, pm 3 pm ET State of the Art in Data Management for Precision Medicine & Genomics March 8, 2017 2 pm 3 pm ET Housekeeping Issues All participants are muted To ask a question or make a comment, please submit via the

More information

ICH Topic E16 Genomic Biomarkers Related to Drug Response: Context, Structure and Format of Qualification Submissions. Step 3

ICH Topic E16 Genomic Biomarkers Related to Drug Response: Context, Structure and Format of Qualification Submissions. Step 3 European Medicines Agency June 2009 EMEA/CHMP/ICH/380636/2009 ICH Topic E16 Genomic Biomarkers Related to Drug Response: Context, Structure and Format of Qualification Submissions Step 3 NOTE FOR GUIDANCE

More information

New Frontiers in Personalized Medicine

New Frontiers in Personalized Medicine New Frontiers in Personalized Medicine Oracle Open World Shanghai 2013 Neil de Crescenzo SVP and GM 1 Safe Harbor Statement The following is intended to outline our general product direction. It is intended

More information

Comments on Use of Databases for Establishing the Clinical Relevance of Human Genetic Variants

Comments on Use of Databases for Establishing the Clinical Relevance of Human Genetic Variants Division of Dockets Management (HFA-305) Food and Drug Administration 5630 Fishers Lane, Rm. 1061 Rockville, MD 20852 The American Society of Human Genetics 9650 Rockville Pike Bethesda, MD 20814 24 December

More information

Biomedical Big Data and Precision Medicine

Biomedical Big Data and Precision Medicine Biomedical Big Data and Precision Medicine Jie Yang Department of Mathematics, Statistics, and Computer Science University of Illinois at Chicago October 8, 2015 1 Explosion of Biomedical Data 2 Types

More information

PLINK gplink Haploview

PLINK gplink Haploview PLINK gplink Haploview Whole genome association software tutorial Shaun Purcell Center for Human Genetic Research, Massachusetts General Hospital, Boston, MA Broad Institute of Harvard & MIT, Cambridge,

More information

Whole genome sequencing in drug discovery research: a one fits all solution?

Whole genome sequencing in drug discovery research: a one fits all solution? Whole genome sequencing in drug discovery research: a one fits all solution? Marc Sultan, September 24th, 2015 Biomarker Development, Translational Medicine, Novartis On behalf of the BMD WGS pilot team:

More information

MedSavant: An open source platform for personal genome interpretation

MedSavant: An open source platform for personal genome interpretation MedSavant: An open source platform for personal genome interpretation Marc Fiume 1, James Vlasblom 2, Ron Ammar 3, Orion Buske 1, Eric Smith 1, Andrew Brook 1, Sergiu Dumitriu 2, Christian R. Marshall

More information

Gap hunting to characterize clustered probe signals in Illumina methylation array data

Gap hunting to characterize clustered probe signals in Illumina methylation array data DOI 10.1186/s13072-016-0107-z Epigenetics & Chromatin RESEARCH Gap hunting to characterize clustered probe signals in Illumina methylation array data Shan V. Andrews 1,2, Christine Ladd Acosta 1,2,3, Andrew

More information

Data Sources and Biobanks in the Asia-Pacific Region. Wei Zhou, MD, Ph.D. Department of Epidemiology, Merck Research Laboratories October 23, 2014

Data Sources and Biobanks in the Asia-Pacific Region. Wei Zhou, MD, Ph.D. Department of Epidemiology, Merck Research Laboratories October 23, 2014 Data Sources and Biobanks in the Asia-Pacific Region Wei Zhou, MD, Ph.D. Department of Epidemiology, Merck Research Laboratories October 23, 2014 1 Disclosures Wei Zhou is currently an employee of Merck

More information

Introduction to Bioinformatics

Introduction to Bioinformatics Introduction to Bioinformatics Alla L Lapidus, Ph.D. SPbSU St. Petersburg Term Bioinformatics Term Bioinformatics was invented by Paulien Hogeweg (Полина Хогевег) and Ben Hesper in 1970 as "the study of

More information

Axiom mydesign Custom Array design guide for human genotyping applications

Axiom mydesign Custom Array design guide for human genotyping applications TECHNICAL NOTE Axiom mydesign Custom Genotyping Arrays Axiom mydesign Custom Array design guide for human genotyping applications Overview In the past, custom genotyping arrays were expensive, required

More information

Haplotype phasing in large cohorts: Modeling, search, or both?

Haplotype phasing in large cohorts: Modeling, search, or both? Haplotype phasing in large cohorts: Modeling, search, or both? Po-Ru Loh Harvard T.H. Chan School of Public Health Department of Epidemiology Broad MIA Seminar, 3/9/16 Overview Background: Haplotype phasing

More information

Population Genetics & Drug Discovery

Population Genetics & Drug Discovery Population Genetics & Drug Discovery examples from Finland Mark J. Daly Chief, Analytic and Translational Genetics Unit Massachusetts General Hospital Co-director, Medical and Population Genetics Broad

More information

http://genemapping.org/ Epistasis in Association Studies David Evans Law of Independent Assortment Biological Epistasis Bateson (99) a masking effect whereby a variant or allele at one locus prevents

More information

Bioinformatics. Ingo Ruczinski. Some selected examples... and a bit of an overview

Bioinformatics. Ingo Ruczinski. Some selected examples... and a bit of an overview Bioinformatics Some selected examples... and a bit of an overview Department of Biostatistics Johns Hopkins Bloomberg School of Public Health July 19, 2007 @ EnviroHealth Connections Bioinformatics and

More information

Big Data Era in Medicine

Big Data Era in Medicine Big Data Era in Medicine brought by Genome Omics Information Hiroshi Tanaka Tohoku Medical Megabank Organization Tohoku University and Dept. Biomedical Informatics Tokyo Medical and Dental University the

More information

The Clinical Pharmacogenetics Implementation Consortium (CPIC): supporting the adoption of pharmacogenetics into the EHR

The Clinical Pharmacogenetics Implementation Consortium (CPIC): supporting the adoption of pharmacogenetics into the EHR The Clinical Pharmacogenetics Implementation Consortium (CPIC): supporting the adoption of pharmacogenetics into the EHR James M. Hoffman, Pharm.D. M.S. Chief Patient Safety Officer Associate Member, Pharmaceutical

More information

IIS Competency Domain Model

IIS Competency Domain Model IIS Competency Domain Model Knowledge, Skills and Abilities for IIS Job Roles PHII Academy 18 November 2015 www.informaticsacademy.org Standards and Interoperability Applies informatics standards to ensure

More information

ICH E18: Guideline for Genomic Sampling and Management of Genomic Data. Prepared by the ICH E18 Expert Working Group

ICH E18: Guideline for Genomic Sampling and Management of Genomic Data. Prepared by the ICH E18 Expert Working Group ICH E18: Guideline for Genomic Sampling and Prepared by the ICH E18 Expert Working Group October 2017 International Council for Harmonisation of Technical Requirements for Pharmaceuticals for Human Use

More information

Accelerate High Throughput Analysis for Genome Sequencing with GPU

Accelerate High Throughput Analysis for Genome Sequencing with GPU Accelerate High Throughput Analysis for Genome Sequencing with GPU ATIP - A*CRC Workshop on Accelerator Technologies in High Performance Computing May 7-10, 2012 Singapore BingQiang WANG, Head of Scalable

More information

VCGDB: A Virtual and Dynamic Genome Database of the Chinese Population

VCGDB: A Virtual and Dynamic Genome Database of the Chinese Population VCGDB: A Virtual and Dynamic Genome Database of the Chinese Population Jiayan Wu Associate Professor Director of Science and Technology Department Director of Core Facility Beijing Institute of Genomics,

More information

Aristos Aristodimou, Athos Antoniades, Constantinos Pattichis University of Cyprus David Tian, Ann Gledson, John Keane University of Manchester

Aristos Aristodimou, Athos Antoniades, Constantinos Pattichis University of Cyprus David Tian, Ann Gledson, John Keane University of Manchester Aristos Aristodimou, Athos Antoniades, Constantinos Pattichis University of Cyprus David Tian, Ann Gledson, John Keane University of Manchester Linked2Safety: FP7 project funded by the European Commission

More information

TOTAL CANCER CARE: CREATING PARTNERSHIPS TO ADDRESS PATIENT NEEDS

TOTAL CANCER CARE: CREATING PARTNERSHIPS TO ADDRESS PATIENT NEEDS TOTAL CANCER CARE: CREATING PARTNERSHIPS TO ADDRESS PATIENT NEEDS William S. Dalton, PhD, MD CEO, M2Gen & Director, Personalized Medicine Institute, Moffitt Cancer Center JULY 15, 2013 MOFFITT CANCER CENTER

More information

By the end of this lecture you should be able to explain: Some of the principles underlying the statistical analysis of QTLs

By the end of this lecture you should be able to explain: Some of the principles underlying the statistical analysis of QTLs (3) QTL and GWAS methods By the end of this lecture you should be able to explain: Some of the principles underlying the statistical analysis of QTLs Under what conditions particular methods are suitable

More information

ICH guideline E18 on genomic sampling and management of genomic data

ICH guideline E18 on genomic sampling and management of genomic data 1 2 3 22 January 2016 EMA/CHMP/ICH/11623/2016 Committee for Human Medicinal Products 4 5 6 ICH guideline E18 on genomic sampling and management of genomic data Step 3 Adopted by CHMP for release for consultation

More information

Personal Genomic Information Management and Personalized Medicine: Challenges, Current Solutions, and Roles of HIM Professionals

Personal Genomic Information Management and Personalized Medicine: Challenges, Current Solutions, and Roles of HIM Professionals Personal Genomic Information Management and Personalized Medicine: Challenges, Current Solutions, and Roles of HIM Professionals Personal Genomic Information Management and Personalized Medicine: Challenges,

More information

ELE4120 Bioinformatics. Tutorial 5

ELE4120 Bioinformatics. Tutorial 5 ELE4120 Bioinformatics Tutorial 5 1 1. Database Content GenBank RefSeq TPA UniProt 2. Database Searches 2 Databases A common situation for alignment is to search through a database to retrieve the similar

More information

Supplementary Note: Detecting population structure in rare variant data

Supplementary Note: Detecting population structure in rare variant data Supplementary Note: Detecting population structure in rare variant data Inferring ancestry from genetic data is a common problem in both population and medical genetic studies, and many methods exist to

More information

Personalized. Health in Canada

Personalized. Health in Canada Personalized Health in Canada Canadian Institutes of Health Research Personalized Medicine Signature Initiative 2010-2013 0 Dr. Morag Park CIHR Institute of Cancer Research Dr. Paul Lasko CIHR Institute

More information

Author's response to reviews

Author's response to reviews Author's response to reviews Title: A pooling-based genome-wide analysis identifies new potential candidate genes for atopy in the European Community Respiratory Health Survey (ECRHS) Authors: Francesc

More information

The Yale Open Data Access (YODA) Project: Lessons Learned in Data Sharing

The Yale Open Data Access (YODA) Project: Lessons Learned in Data Sharing The Yale Open Data Access (YODA) Project: Lessons Learned in Data Sharing Grand Rounds: A Shared Forum of the NIH HCS Collaboratory and PCORnet June 30, 2017 Joseph S. Ross, MD, MHS Section of General

More information

KNN-MDR: a learning approach for improving interactions mapping performances in genome wide association studies

KNN-MDR: a learning approach for improving interactions mapping performances in genome wide association studies Abo Alchamlat and Farnir BMC Bioinformatics (2017) 18:184 DOI 10.1186/s12859-017-1599-7 METHODOLOGY ARTICLE Open Access KNN-MDR: a learning approach for improving interactions mapping performances in genome

More information

ACCEPTED. Victoria J. Wright Corresponding author.

ACCEPTED. Victoria J. Wright Corresponding author. The Pediatric Infectious Disease Journal Publish Ahead of Print DOI: 10.1097/INF.0000000000001183 Genome-wide association studies in infectious diseases Eleanor G. Seaby 1, Victoria J. Wright 1, Michael

More information

Short Course: Adaptive Clinical Trials

Short Course: Adaptive Clinical Trials Short Course: Adaptive Clinical Trials Presented at the 2 Annual Meeting of the Society for Clinical Trials Vancouver, Canada Roger J. Lewis, MD, PhD Department of Emergency Medicine Harbor-UCLA Medical

More information

Exploring genomic databases: Practical session "

Exploring genomic databases: Practical session Exploring genomic databases: Practical session Work through the following practical exercises on your own. The objective of these exercises is to become familiar with the information available in each

More information

Péter Antal Ádám Arany Bence Bolgár András Gézsi Gergely Hajós Gábor Hullám Péter Marx András Millinghoffer László Poppe Péter Sárközy BIOINFORMATICS

Péter Antal Ádám Arany Bence Bolgár András Gézsi Gergely Hajós Gábor Hullám Péter Marx András Millinghoffer László Poppe Péter Sárközy BIOINFORMATICS Péter Antal Ádám Arany Bence Bolgár András Gézsi Gergely Hajós Gábor Hullám Péter Marx András Millinghoffer László Poppe Péter Sárközy BIOINFORMATICS The Bioinformatics book covers new topics in the rapidly

More information

Targeted resequencing

Targeted resequencing Targeted resequencing Sarah Calvo, Ph.D. Computational Biologist Vamsi Mootha laboratory Snapshots of Genome Wide Analysis in Human Disease (MPG), 4/20/2010 Vamsi Mootha, PI How can I assess a small genomic

More information

Overview of Health Informatics. ITI BMI-Dept

Overview of Health Informatics. ITI BMI-Dept Overview of Health Informatics ITI BMI-Dept Fellowship Week 5 Overview of Health Informatics ITI, BMI-Dept Day 10 7/5/2010 2 Agenda 1-Bioinformatics Definitions 2-System Biology 3-Bioinformatics vs Computational

More information

Detecting ancient admixture using DNA sequence data

Detecting ancient admixture using DNA sequence data Detecting ancient admixture using DNA sequence data October 10, 2008 Jeff Wall Institute for Human Genetics UCSF Background Origin of genus Homo 2 2.5 Mya Out of Africa (part I)?? 1.6 1.8 Mya Further spread

More information

AMERICAN HEART ASSOCIATION RESEARCH FACTS FY

AMERICAN HEART ASSOCIATION RESEARCH FACTS FY AMERICAN HEART ASSOCIATION RESEARCH FACTS FY 2016-2017 LIFE IS WHY. RESEARCH IS HOW. The AHA research program has tremendous impact on science discovery and researchers careers. New knowledge that results

More information

Webinar Tips Please mute your phone. Please do not put the call on hold. There will be time for questions after the presentation.

Webinar Tips Please mute your phone. Please do not put the call on hold. There will be time for questions after the presentation. Webinar Tips Please mute your phone. Please do not put the call on hold. There will be time for questions after the presentation. Carolinas Collaborative Pilot RFA Webinar May 8, 2017 Partners Overview

More information

A Codon Frequency Obfuscation Heuristic for Raw Genomic Data Privacy

A Codon Frequency Obfuscation Heuristic for Raw Genomic Data Privacy A Codon Frequency Obfuscation Heuristic for Raw Genomic Data Privacy Kato Mivule Computer Science Department Bowie State University kmivule@gmail.com Abstract Genomic data provides clinical researchers

More information

General aspects of genome-wide association studies

General aspects of genome-wide association studies General aspects of genome-wide association studies Abstract number 20201 Session 04 Correctly reporting statistical genetics results in the genomic era Pekka Uimari University of Helsinki Dept. of Agricultural

More information

The Future of HealthCare Information Technology

The Future of HealthCare Information Technology HST.921 / HST.922 Information Technology in the Health Care System of the Future, Spring 2009 Harvard-MIT Division of Health Sciences and Technology Course Directors: Dr. Steven Locke, Dr. Bryan Bergeron,

More information

CS273B: Deep Learning in Genomics and Biomedicine. Recitation 1 30/9/2016

CS273B: Deep Learning in Genomics and Biomedicine. Recitation 1 30/9/2016 CS273B: Deep Learning in Genomics and Biomedicine. Recitation 1 30/9/2016 Topics Genetic variation Population structure Linkage disequilibrium Natural disease variants Genome Wide Association Studies Gene

More information

Spy vs Spy: Anonymous Messaging

Spy vs Spy: Anonymous Messaging Spy vs Spy: Anonymous Messaging Sewoong Oh University of Illinois at Urbana-Champaign Joint work with Giulia Fanti, Peter Kairouz and Pramod Viswanath Anonymous Social Media provide meta-data privacy Anonymous

More information

SNP calling and VCF format

SNP calling and VCF format SNP calling and VCF format Laurent Falquet, Oct 12 SNP? What is this? A type of genetic variation, among others: Family of Single Nucleotide Aberrations Single Nucleotide Polymorphisms (SNPs) Single Nucleotide

More information

Governance Policies for PCORnet, the National Patient-Centered Clinical Research Network

Governance Policies for PCORnet, the National Patient-Centered Clinical Research Network Governance Policies for PCORnet, the National Patient-Centered Clinical Research Network 1.0 Leadership and Decision-Making When reading this document it is important to note that there are several categories

More information

A Random Forest proximity matrix as a new measure for gene annotation *

A Random Forest proximity matrix as a new measure for gene annotation * A Random Forest proximity matrix as a new measure for gene annotation * Jose A. Seoane 1, Ian N.M. Day 1, Juan P. Casas 2, Colin Campbell 3 and Tom R. Gaunt 1,4 1 Bristol Genetic Epidemiology Labs. School

More information

Legislation and Facilitation of Health Data in Denmark Short update. Mads Melbye, MD, DMSc Statens Serum Institut, Copenhagen, Denmark

Legislation and Facilitation of Health Data in Denmark Short update. Mads Melbye, MD, DMSc Statens Serum Institut, Copenhagen, Denmark Legislation and Facilitation of Health Data in Denmark Short update Mads Melbye, MD, DMSc Statens Serum Institut, Copenhagen, Denmark Legislation short facts Register research: exempted from informed consent

More information

Custom TaqMan Assays DESIGN AND ORDERING GUIDE. For SNP Genotyping and Gene Expression Assays. Publication Number Revision G

Custom TaqMan Assays DESIGN AND ORDERING GUIDE. For SNP Genotyping and Gene Expression Assays. Publication Number Revision G Custom TaqMan Assays DESIGN AND ORDERING GUIDE For SNP Genotyping and Gene Expression Assays Publication Number 4367671 Revision G For Research Use Only. Not for use in diagnostic procedures. Manufacturer:

More information

Association Mapping in Plants PLSC 731 Plant Molecular Genetics Phil McClean April, 2010

Association Mapping in Plants PLSC 731 Plant Molecular Genetics Phil McClean April, 2010 Association Mapping in Plants PLSC 731 Plant Molecular Genetics Phil McClean April, 2010 Traditional QTL approach Uses standard bi-parental mapping populations o F2 or RI These have a limited number of

More information

Computational methods for the analysis of rare variants

Computational methods for the analysis of rare variants Computational methods for the analysis of rare variants Shamil Sunyaev Harvard-M.I.T. Health Sciences & Technology Division Combine all non-synonymous variants in a single test Theory: 1) Most new missense

More information

University of Groningen. The value of haplotypes Vries, Anne René de

University of Groningen. The value of haplotypes Vries, Anne René de University of Groningen The value of haplotypes Vries, Anne René de IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's PDF) if you wish to cite from it. Please check the document

More information

ACCELERATING GENOMIC ANALYSIS ON THE CLOUD. Enabling the PanCancer Analysis of Whole Genomes (PCAWG) consortia to analyze thousands of genomes

ACCELERATING GENOMIC ANALYSIS ON THE CLOUD. Enabling the PanCancer Analysis of Whole Genomes (PCAWG) consortia to analyze thousands of genomes ACCELERATING GENOMIC ANALYSIS ON THE CLOUD Enabling the PanCancer Analysis of Whole Genomes (PCAWG) consortia to analyze thousands of genomes Enabling the PanCancer Analysis of Whole Genomes (PCAWG) consortia

More information

Concepts and relevance of genome-wide association studies

Concepts and relevance of genome-wide association studies Science Progress (2016), 99(1), 59 67 Paper 1500149 doi:10.3184/003685016x14558068452913 Concepts and relevance of genome-wide association studies ANDREAS SCHERER and G. BRYCE CHRISTENSEN Dr Andreas Scherer

More information

Genomic Medicine in France

Genomic Medicine in France Genomic Medicine in France Catherine Nguyen, PhD Director ITMO GGB Genetic, Genomic & Bioinformatic Advancing Disease Modeling in Animal-Based research in Support of Precision Medecine: a Workshop ILAR

More information

ATHENA: A TOOL FOR META-DIMENSIONAL ANALYSIS APPLIED TO GENOTYPES AND GENE EXPRESSION DATA TO PREDICT HDL CHOLESTEROL LEVELS

ATHENA: A TOOL FOR META-DIMENSIONAL ANALYSIS APPLIED TO GENOTYPES AND GENE EXPRESSION DATA TO PREDICT HDL CHOLESTEROL LEVELS ATHENA: A TOOL FOR META-DIMENSIONAL ANALYSIS APPLIED TO GENOTYPES AND GENE EXPRESSION DATA TO PREDICT HDL CHOLESTEROL LEVELS EMILY R. HOLZINGER Center for Human Genetics Research, Vanderbilt University

More information

Using the Trio Workflow in Partek Genomics Suite v6.6

Using the Trio Workflow in Partek Genomics Suite v6.6 Using the Trio Workflow in Partek Genomics Suite v6.6 This user guide will illustrate the use of the Trio/Duo workflow in Partek Genomics Suite (PGS) and discuss the basic functions available within the

More information

Clinical trial information leaflet and consent

Clinical trial information leaflet and consent Informed consent 1(7) Clinical trial information leaflet and consent General You must provide sufficient information on the rights of clinical trial subjects, the purpose and nature of the trial, the methodologies

More information

Next Generation Sequencing. Target Enrichment

Next Generation Sequencing. Target Enrichment Next Generation Sequencing Target Enrichment Next Generation Sequencing Your Partner in Every Step from Sample to Data NGS: Revolutionizing Genetic Analysis with Single-Molecule Resolution Next generation

More information

BBMRI.NL A story of Sharing Eline Slagboom Molecular Epidemiology, Leiden University Medical Center Brussel 20 Juni 2017

BBMRI.NL A story of Sharing Eline Slagboom Molecular Epidemiology, Leiden University Medical Center Brussel 20 Juni 2017 BBMRI.NL A story of Sharing Eline Slagboom Molecular Epidemiology, Leiden University Medical Center Brussel 20 Juni 2017 BBMRI-NL TOWARDS A NATIONAL BIOBANKING INFRASTRUCTURE Founded in 2009 as the Dutch

More information

PROTOCOL FOR A RESEARCH SAMPLE REPOSITORY FOR ALLOGENEIC HEMATOPOIETIC STEM CELL TRANSPLANTATION, OTHER CELLULAR THERAPIES AND MARROW TOXIC INJURIES

PROTOCOL FOR A RESEARCH SAMPLE REPOSITORY FOR ALLOGENEIC HEMATOPOIETIC STEM CELL TRANSPLANTATION, OTHER CELLULAR THERAPIES AND MARROW TOXIC INJURIES Center for International Blood and Marrow Transplant Research PROTOCOL FOR A RESEARCH SAMPLE REPOSITORY FOR ALLOGENEIC HEMATOPOIETIC STEM CELL TRANSPLANTATION, OTHER CELLULAR THERAPIES AND MARROW TOXIC

More information

The Sentieon Genomic Tools Improved Best Practices Pipelines for Analysis of Germline and Tumor-Normal Samples

The Sentieon Genomic Tools Improved Best Practices Pipelines for Analysis of Germline and Tumor-Normal Samples The Sentieon Genomic Tools Improved Best Practices Pipelines for Analysis of Germline and Tumor-Normal Samples Andreas Scherer, Ph.D. President and CEO Dr. Donald Freed, Bioinformatics Scientist, Sentieon

More information

Genetics and Inflammatory Bowel Disease

Genetics and Inflammatory Bowel Disease Genetics and Inflammatory Bowel Disease Louis Ghanem M.D, Ph.D. Attending Physician, The Children s Hospital of Philadelphia Division of Gastroenterology, Hepatology and Nutrition Etiology of IBD Genetic

More information

The Views of Participants in DNA Biobanks

The Views of Participants in DNA Biobanks The Views of Participants in DNA Biobanks Kelly E. Ormond, I Maureen E. Smith, II & Wendy A. Wolf III Abstract Biobanks are generally created with the long-term goal of establishing genotype-phenotype

More information

TaqPath ProAmp Master Mixes

TaqPath ProAmp Master Mixes PRODUCT BULLETIN es es Applied Biosystems TaqPath ProAmp Master Mixes are versatile master mixes developed for high-throughput genotyping and copy number variation (CNV) analysis protocols that require

More information

Runs of Homozygosity Analysis Tutorial

Runs of Homozygosity Analysis Tutorial Runs of Homozygosity Analysis Tutorial Release 8.7.0 Golden Helix, Inc. March 22, 2017 Contents 1. Overview of the Project 2 2. Identify Runs of Homozygosity 6 Illustrative Example...............................................

More information

A GENOTYPE CALLING ALGORITHM FOR AFFYMETRIX SNP ARRAYS

A GENOTYPE CALLING ALGORITHM FOR AFFYMETRIX SNP ARRAYS Bioinformatics Advance Access published November 2, 2005 The Author (2005). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org

More information

Quantitative Genomics and Genetics BTRY 4830/6830; PBSB

Quantitative Genomics and Genetics BTRY 4830/6830; PBSB Quantitative Genomics and Genetics BTRY 4830/6830; PBSB.5201.01 Jason Mezey Biological Statistics and Computational Biology (BSCB) Department of Genetic Medicine Institute for Computational Biomedicine

More information

European Induced Pluripotent Stem Cell Bank

European Induced Pluripotent Stem Cell Bank European Induced Pluripotent Stem Cell Bank Background Researchers are looking for ways to validate pathways using human data. ips cells are one approach to this. ips research and banking is fragmented

More information

Create a Planned Run. Using the Ion AmpliSeq Pharmacogenomics Research Panel Plugin USER BULLETIN. Publication Number MAN Revision A.

Create a Planned Run. Using the Ion AmpliSeq Pharmacogenomics Research Panel Plugin USER BULLETIN. Publication Number MAN Revision A. USER BULLETIN Create a Planned Run Using the Ion AmpliSeq Pharmacogenomics Research Panel Plugin Publication Number MAN0013730 Revision A.0 For Research Use Only. Not for use in diagnostic procedures.

More information

ARTICLE Sherlock: Detecting Gene-Disease Associations by Matching Patterns of Expression QTL and GWAS

ARTICLE Sherlock: Detecting Gene-Disease Associations by Matching Patterns of Expression QTL and GWAS ARTICLE Sherlock: Detecting Gene-Disease Associations by Matching Patterns of Expression QTL and GWAS Xin He, 1,2 Chris K. Fuller, 1 Yi Song, 1 Qingying Meng, 3 Bin Zhang, 4 Xia Yang, 3 and Hao Li 1, *

More information

Overall Architecture and Distributed Analysis Tools

Overall Architecture and Distributed Analysis Tools Overall Architecture and Distributed Analysis Tools Daniella Meeker, PhD Department of Preventive Medicine University of Southern California Michael E. Matheny, MD, MS, MPH Tennessee Valley Healthcare

More information