Coordinating Center Report. Paul Harris, Marylyn Ritchie, Josh Denny, & Brad Malin
1 Coordinating Center Report Paul Harris, Marylyn Ritchie, Josh Denny, & Brad Malin
2 Coordinating Center aims (serving the sites and the External Scientific Panel):
- Accelerate phenotype development & sharing
- Expand genomic data integration
- Enhance methods to determine re-identification & privacy risks
- Provide logistical support & project facilitation
3 Accelerate phenotype development & sharing
4 Phenotyping Efficiency and Transportability: PheKB
Designed to provide a workflow, with integrated tools and standards, that guides the user from early-stage development to final public sharing.
- eMERGE, PCORnet, NIH Collaboratory, PGRN, PGPop
- 414 active users, 52 institutions
- 30 publicly available phenotypes, 62 phenotypes in development
- 149 implementations (for 53 phenotypes)
- 3,086 views/month
...increasing the efficiency and accuracy of phenotype definition
5 PheKB phenotype views
[Chart: unique page views per phenotype. Phenotypes shown: Type II DM; ADHD; MidSouth CDRN Coronary Heart Disease; Atrial fibrillation Demonstration; Drug Induced Liver Injury; Cardiac Conduction; Dementia; Severe Early Childhood Obesity; Ace-inhibitor induced cough; Type-2 Diabetes Demonstration; Crohns Disease Demonstration; Peripheral Arterial Disease; Cataracts; Diabetic Retinopathy; Height; Hypothyroidism; Lipids; Rheumatoid Arthritis Demonstration; Multiple Sclerosis Demonstration; Red blood cell indices; High Density Lipoproteins; White blood cell indices.]
6 Phenotype modalities used in 92 PheKB phenotypes
[Table: number of phenotypes using each data modality or method, for public (N=30) and non-public* (N=62) phenotypes, with percent of total. Rows: ICD-9 codes; medications; natural language processing; CPT codes; laboratory test results. Cell values are not present in the transcription.]
7 Sharing to other networks
- Key example: T2D algorithm reused in 40 non-eMERGE projects
- NIH Collaboratory and PCORnet sites are using PheKB now; 7 of their phenotypes are currently in PheKB
- 25 non-eMERGE papers reference PheKB as a source
8 Data Sharing: The Data Dictionary/Data Validation Tool (Luke Rasmussen)
- Verifies a data dictionary file's adherence to eMERGE and dbGaP standards
- Verifies a data set against a data dictionary
- Guarantees that shared files are consistently formatted, minimizing the time needed to clean/reformat data
...increasing the efficiency and accuracy of data sharing
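The second check above (a data set verified against a data dictionary) can be sketched in a few lines. This is only an illustration of the idea, not the eMERGE tool itself: the dictionary layout, the column names, and the `validate_rows` helper are all hypothetical.

```python
# Hypothetical data dictionary: column name -> expected type and constraints.
# The real eMERGE/dbGaP dictionary format is not shown in the slides.
DATA_DICTIONARY = {
    "subject_id": {"type": str, "required": True},
    "sex": {"type": str, "required": True, "allowed": {"M", "F", "U"}},
    "age": {"type": int, "required": False, "min": 0, "max": 120},
}


def validate_rows(rows, dictionary):
    """Return a list of (row_number, column, message) problems."""
    problems = []
    for row_number, row in enumerate(rows, start=1):
        # Columns present in the data but absent from the dictionary.
        for column in row:
            if column not in dictionary:
                problems.append((row_number, column, "not in data dictionary"))
        for column, spec in dictionary.items():
            value = (row.get(column) or "").strip()
            if value == "":
                if spec.get("required"):
                    problems.append((row_number, column, "missing required value"))
                continue
            if spec["type"] is int:
                try:
                    number = int(value)
                except ValueError:
                    problems.append((row_number, column, "not an integer"))
                    continue
                if number < spec.get("min", number) or number > spec.get("max", number):
                    problems.append((row_number, column, "out of range"))
            if "allowed" in spec and value not in spec["allowed"]:
                problems.append((row_number, column, "value not allowed"))
    return problems


rows = [
    {"subject_id": "S1", "sex": "F", "age": "34"},
    {"subject_id": "S2", "sex": "X", "age": "200"},
    {"subject_id": "", "sex": "M", "age": "abc"},
]
for problem in validate_rows(rows, DATA_DICTIONARY):
    print(problem)
```

Rows are plain dictionaries of strings here, which is also what `csv.DictReader` yields, so the same function could be pointed at a delimited file.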
9 Results Dissemination and Re-use: PheWAS Catalog
An interactive tool for browsing posted results from a large PheWAS study. Currently disseminating the first of many eMERGE PheWAS study results.
- PheWAS results for >3000 SNPs identified in GWAS studies
- Search SNPs, phenotypes, genes
- Make/save graphs
- Export data sets
- 2,300 user visits to the site
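10 R PheWAS Package
[Workflow diagram: the EHR supplies ICD9 codes and demographics and the biobank supplies genotypes. Inside the R PheWAS package, PheWAS code translation turns ICD9 codes into phenotypes; phenotypes and genotypes go through association testing; results are then plotted. Example data call shown on the slide: ex=generateExample()]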
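The flow in this diagram (code translation, then one association test per phenotype) can be illustrated with a small standalone sketch. It does not use or reproduce the R PheWAS package: the code-to-phenotype map is a made-up stand-in, and a plain 2x2 chi-square test is swapped in for whatever regression model a real analysis would use.

```python
import math

# Hypothetical ICD-9 -> phenotype grouping, for illustration only.
CODE_TO_PHENOTYPE = {
    "250.00": "type_2_diabetes",
    "250.02": "type_2_diabetes",
    "427.31": "atrial_fibrillation",
}


def translate_codes(codes_by_subject):
    """Map each subject's ICD-9 codes to a set of phenotype labels."""
    return {
        subject: {CODE_TO_PHENOTYPE[c] for c in codes if c in CODE_TO_PHENOTYPE}
        for subject, codes in codes_by_subject.items()
    }


def chi_square_2x2(a, b, c, d):
    """Pearson chi-square (1 df) and p-value for the table [[a, b], [c, d]]."""
    n = a + b + c + d
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    if denom == 0:
        return 0.0, 1.0  # a margin is empty, so the test is uninformative
    chi2 = n * (a * d - b * c) ** 2 / denom
    # Upper tail of a chi-square with 1 df, via the complementary error function.
    return chi2, math.erfc(math.sqrt(chi2 / 2))


def phewas(phenotypes_by_subject, carriers):
    """Test one variant (a set of carrier IDs) against every phenotype."""
    all_phenotypes = set().union(*phenotypes_by_subject.values())
    results = []
    for phenotype in sorted(all_phenotypes):
        a = b = c = d = 0
        for subject, phenos in phenotypes_by_subject.items():
            case = phenotype in phenos
            carrier = subject in carriers
            if carrier and case:
                a += 1
            elif carrier:
                b += 1
            elif case:
                c += 1
            else:
                d += 1
        chi2, p_value = chi_square_2x2(a, b, c, d)
        results.append((phenotype, chi2, p_value))
    # Smallest p-value first, as a results table would usually be ranked.
    return sorted(results, key=lambda r: r[2])


codes = {"s1": ["250.00"], "s2": ["250.02", "427.31"], "s3": [], "s4": ["999"]}
for row in phewas(translate_codes(codes), carriers={"s1", "s2"}):
    print(row)
```

With only four toy subjects the p-values mean nothing; the point is the shape of the loop, one test per phenotype for a fixed variant.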
11 Some eMERGE PheWAS examples
- GWAS QRS duration (VU)
- GWAS Platelet size/volume (Mayo)
- GWAS Herpes Zoster (GHC/UW)
- Pediatric PheWAS (CCHMC and CHOP)
- Null variant studies (Network)
- CNV PheWAS (VU/GHC/UW/CHOP)
- Neanderthal PheWAS (VU, UW)
- PCSK9 (Mayo)
- FTO introns (Adult and pediatric)
- PheWAS on autism comorbidities
- HLA variants, Functional variants (Marshfield)
- Developing text and NLP-based PheWAS
12 Shameer et al. Hum Genet 2014 PheWAS of Platelet size (a) and volume (b)
13 Hypothesis Generation and Testing: eMERGE Record Counter (eRC)
- Web-based research tool for members of the eMERGE community, covering over 55,000 subjects with demographic, diagnosis, and procedure codes
- Drag-and-drop feature that uses basic Boolean logic
- Used to assess how well a phenotype might do in a genotyped population, as well as for site-level assessment
...improving early-stage feasibility to produce the highest rate of return on resources.
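The Boolean-logic counting described above amounts to set operations over each subject's codes. The sketch below is a hypothetical illustration of that idea, not the eRC implementation; the record layout and the `count_subjects` helper are assumptions.

```python
def subjects_with(records, code):
    """Return the IDs of subjects whose record contains the given code."""
    return {sid for sid, codes in records.items() if code in codes}


def count_subjects(records, all_of=(), any_of=(), none_of=()):
    """Count subjects matching (AND all_of) AND (OR any_of) AND NOT (none_of)."""
    matched = set(records)
    for code in all_of:
        matched &= subjects_with(records, code)
    if any_of:
        matched &= set().union(*(subjects_with(records, c) for c in any_of))
    for code in none_of:
        matched -= subjects_with(records, code)
    return len(matched)


# Toy records: subject ID -> set of diagnosis/procedure codes (made up).
records = {
    "p1": {"250.00", "93000"},
    "p2": {"250.00"},
    "p3": {"250.00", "93000", "427.31"},
    "p4": {"401.9"},
}
# Subjects with diagnosis 250.00 AND procedure 93000, NOT diagnosis 427.31.
print(count_subjects(records, all_of=["250.00", "93000"], none_of=["427.31"]))
```

A feasibility check would compare such counts across sites before committing effort to a full phenotype algorithm.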
14 Medication extraction and curation
- Developed and distributed MedEx (eMERGE 1) and MedEx-UIMA (eMERGE 2)
- MedEx-UIMA maps to RxNorm codes for specific drug and generic ingredient
- Have used these and other tools to clean SPHINX medication data
- Data dictionaries for collection and validation of central data for eMERGE GWAS and eMERGE-PGx sets
15 Expand genomic data integration
16 Evaluation of quality control for merging genomic datasets
Dataset (IMPUTE2 imputed data) | # Genotyped Samples
Merged eMERGE-I 1M | 2,634
Merged eMERGE-I | ,029
Geisinger | 3,111
Group Health/U Wash | 731
Marshfield/Essentia/PSU | 616
Mayo Clinic | 3,121
Mount Sinai | 6,290
Northwestern | 2,951
Vanderbilt | 7,616
CCHMC/BCH | 5,346
CHOP | 6,850
Total, all IMPUTE2 imputed samples | 55,292
- Developed eMERGE imputation pipeline (Round 1: BEAGLE; Round 2: IMPUTE2)
- Merged imputed dataset released to dbGaP
- Identified subtle batch effects that need to be corrected for by site, platform, sex, and principal components (PCs)
18 Extraction of additional information from merged GWAS datasets
- CNV calling in eMERGE-I data
  - All intensity files available for eMERGE-I
  - CNV calling done by CC and by CHOP
  - QC'ed CNV dataset available from CC
- CNV calling requested in eMERGE-II data
  - CC solicited intensity files for eMERGE-II
  - Limited sets have been submitted thus far
19 Evaluation of genetic models for clinical relevance
- Null variant PheWAS project led by CC
- Selected 25 null variants
- Adult and pediatric PheWAS conducted
- Manuscript in progress
20 eMERGE-PGx
- CC developed variant calling pipeline for eMERGE-PGx in collaboration with genomics workgroup
- Performed variant calling for all eMERGE-PGx samples
- Concordance analyses across sites and platforms (manuscript in progress)
- Developed SPHINX for disseminating variant information to the community (manuscript submitted)
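A concordance analysis across sites or platforms compares the genotype calls made for the same sample at the same variant. The slides do not describe how it was computed, so the following is a generic sketch under assumed conventions: calls keyed by (sample, variant), genotypes as two-letter strings, and `None` for a missing call.

```python
def concordance(calls_a, calls_b):
    """Fraction of shared, non-missing genotype calls that agree."""
    compared = agree = 0
    for key in calls_a.keys() & calls_b.keys():
        genotype_a, genotype_b = calls_a[key], calls_b[key]
        if genotype_a is None or genotype_b is None:
            continue  # skip missing calls rather than counting them as discordant
        compared += 1
        # Genotypes are unordered allele pairs, so "AG" matches "GA".
        if sorted(genotype_a) == sorted(genotype_b):
            agree += 1
    return agree / compared if compared else float("nan")


# Toy calls from two hypothetical platforms.
platform_a = {
    ("s1", "rs1"): "AG",
    ("s1", "rs2"): "CC",
    ("s2", "rs1"): None,
    ("s2", "rs2"): "TT",
}
platform_b = {
    ("s1", "rs1"): "GA",
    ("s1", "rs2"): "CT",
    ("s2", "rs1"): "AA",
    ("s2", "rs2"): "TT",
    ("s3", "rs1"): "AA",
}
print(concordance(platform_a, platform_b))
```

Whether missing calls should be excluded or counted as disagreements is a design choice; excluding them, as here, measures agreement only among calls both platforms actually made.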
21 Average of 200 unique users per month; 2,334 unique users and 7,447 page views since release
22 Impact & Leadership: Led a special issue of Frontiers in Genetics
23 Enhance methods to determine re-identification & privacy risks
24 eMERGE Data Privacy Advancements
Reviews of the Field
- Biobanking: Malin. Identifiability in biobanks: models, measures, and mitigation strategies. Human Genetics
- Attacks & Protections: Naveed. Privacy in the genomic era. ACM Computing Surveys. In Press.
Evaluations
- Risk-Utility Frontiers Across eMERGE States: W. Xia. R-U policy frontiers for health data de-identification
- PheWAS Anonymization Across eMERGE Sites: R. Heatherly. Multi-site anonymization. JAMIA 2nd round. R. Heatherly. Size matters. J Biomedical Informatics
Technology Enhancements
- Natural Language De-identification: Carrell. Hiding in plain sight: use of realistic surrogates to reduce exposure of protected health information in clinical text. JAMIA. Li. De-identification of clinical narratives through writing complexity measures. IJMI
- Risk-Utility Frontiers for De-identification: W. Xia. Efficient discovery of de-identification policies. ACM CODASPY
- Secure Multiparty Computation: W. Xie, et al. SecureMA: Protecting participant privacy in genetic association meta-analysis. Bioinformatics
Policy Assessments
- Diagnosis Re-identification Adversarial Knowledge: R. Heatherly. Enabling genomic-phenomic association discovery without sacrificing anonymity. PLoS One
- Game Theory for Demographic Re-identification: Z. Wan. A Game theoretic framework for analyzing re-identification risks. PLoS One. W. Xia. Process-based data privacy. In review.
25 Policy: Stackelberg Games (Z. Wan, et al. PLoS One. 2015)
[Diagram: the publisher chooses a sharing strategy, each with its own utility and risk; the recipient then picks its best strategy among attack strategies A, B, and C, each with its own utility and risk.]
- Benefit per record = $1200
- Cost per violation = $300
- ~35,000 Census records
- Attributes: 5-digit ZIP, age (years), race, sex
- No-attack scenarios exist!
26 Policy: The Homer Attack
[Diagram: an adversary holds private, named genotypes for SNP 1 through SNP n (e.g., Alice: AA, CG, TT; Bob: AT, GG, GT) and compares them against per-SNP genotype probability tables (AA/AT/TT, CC/CG/GG, GG/GT/TT) for three groups: cases (e.g., diabetes) and controls (e.g., no diabetes) from association studies (e.g., Sphinx), and the general population (e.g., HapMap).]
27 Policy: Sphinx Genomic Inference Game
- Privacy risk: likelihood statistic*
  - Are you in the Sphinx pool or the population?
  - The more genomic regions we disclose, the greater the risk
- Utility: proportional to the number of regions you disclose
  - Can tune to account for associations with phenotype
*Sankararaman S, Obozinski G, Jordan MI, Halperin E. Genomic privacy and limits of individual detection in a pool. Nature Genetics Sep;41(9):965-7.
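The likelihood statistic asks whether a target's alleles look more like the pool's published frequencies or the reference population's. The slide gives no formula, so the sketch below uses one common per-SNP log-likelihood-ratio form as an assumption; the exact statistic in the cited paper should be checked before relying on it. Allele coding (0/1 per SNP), the `eps` clamp, and all frequencies are illustrative.

```python
import math


def log_likelihood_ratio(alleles, pool_freqs, ref_freqs, eps=1e-6):
    """Sum of per-SNP log ratios: pool likelihood over reference likelihood.

    alleles: 0/1 allele of the target at each disclosed SNP.
    pool_freqs / ref_freqs: frequency of allele 1 in the pool and the reference.
    Positive values favor "the target is in the pool".
    """
    total = 0.0
    for allele, p_pool, p_ref in zip(alleles, pool_freqs, ref_freqs):
        # Clamp frequencies away from 0 and 1 so the logs stay finite.
        p_pool = min(max(p_pool, eps), 1 - eps)
        p_ref = min(max(p_ref, eps), 1 - eps)
        if allele == 1:
            total += math.log(p_pool / p_ref)
        else:
            total += math.log((1 - p_pool) / (1 - p_ref))
    return total


alleles = [1, 0, 1]
pool = [0.60, 0.20, 0.55]
reference = [0.50, 0.30, 0.50]
# Disclosing more SNPs gives the statistic more terms to accumulate evidence.
for k in range(1, len(alleles) + 1):
    print(k, round(log_likelihood_ratio(alleles[:k], pool[:k], reference[:k]), 4))
```

Each disclosed region contributes one more term, which is the mechanism behind the slide's point that risk grows with the number of regions released.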
28 Policy: Sphinx Analysis (Z. Wan, et al. In Development)
- Utility for eMERGE & adversary:
  - Each record worth $99 (based on 23andMe)
  - Each SNP has worth proportional to # instances in NHGRI GWAS catalog
- Probability of detection set to 1
- Expected penalty = probability of detection × fine
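The expected-penalty definition above is simple enough to write out directly. The attack rule that follows it is an assumption added for illustration (a rational adversary attacks only when the expected gain exceeds the expected penalty); it is not taken from the Wan et al. model, and the success probabilities and fines below are invented.

```python
def expected_penalty(p_detect, fine):
    """Expected penalty = probability of detection * fine."""
    return p_detect * fine


def adversary_attacks(benefit, p_success, p_detect, fine):
    """Assumed decision rule: attack only if expected gain beats expected penalty."""
    return p_success * benefit > expected_penalty(p_detect, fine)


RECORD_VALUE = 99.0  # per-record worth quoted on the slide
P_DETECT = 1.0       # probability of detection, set to 1 on the slide

# Hypothetical fines, scanned to see where attacking stops paying off.
for fine in (25.0, 50.0, 99.0):
    attacks = adversary_attacks(RECORD_VALUE, p_success=0.9, p_detect=P_DETECT, fine=fine)
    print(fine, expected_penalty(P_DETECT, fine), attacks)
```

Under this rule, once the expected penalty reaches the full value of a record, no attack is worthwhile even with certain success, since the success probability cannot exceed 1.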
29 Experiment: Sensitivity to Penalty Configuration
- All SNPs sorted according to MAF (ascending)
- Utility for each SNP is uniformly distributed
- Pre-processing keeps 322 SNPs
- The target set consists of 500 random samples each from the pool and the reference
- The maximal benefit the publisher can gain from publishing all 322 SNPs is $17,720
30 Optimal # of Released SNPs
31 Major Finding
- The larger the expected penalty, the more SNPs are publishable
- When the expected penalty equals the benefit of a successful inference attack, the publisher can publish all the SNPs
32 It's a Much Larger Analysis (if only we had time)
- Larger publisher's utility: more SNPs publishable
- Alternative: all SNPs sorted in descending MAF order. Allows more SNPs to be published!
- The utility distribution for all 218 independent SNPs
33 Provide logistical support & project facilitation
34 Providing a National Home
35 Tools for project facilitation
36 Supporting and Streamlining Publications Policy, Process & Reporting Assess, Build, Report Meeting Support, Communication, and Project Management
38 During eMERGE Phase II, the CC has provided support for:
- 700+ workgroup calls
- 16 Steering Committee/ESP Meetings
- 8 ESP Packets
39 eMERGE Coordinating Center Team Paul Harris Marylyn Ritchie Melissa Basford Bradley Malin Josh Denny Sarah Stallings Jacqueline Kirby Mollie Bodin Claar Adam Hardebeck Brianne Derveloy Peter Speltz Shefali Verma Anurag Verma John Wallace Yuki Bradford Sarah Pendergrass Alex Frase Scott Dudek Anna Okula Anastasia Lucas Jason Tan Lixen Chen Yi Jiang Lisa Bastarache Martha Shrubsole Raymond Heatherly Weiyi Xia Zhiyu Wan Jonathan Schildcrout Nathaniel Mercaldo Kyle Brothers
ATHENA: A TOOL FOR META-DIMENSIONAL ANALYSIS APPLIED TO GENOTYPES AND GENE EXPRESSION DATA TO PREDICT HDL CHOLESTEROL LEVELS EMILY R. HOLZINGER Center for Human Genetics Research, Vanderbilt University
More informationUsing the Trio Workflow in Partek Genomics Suite v6.6
Using the Trio Workflow in Partek Genomics Suite v6.6 This user guide will illustrate the use of the Trio/Duo workflow in Partek Genomics Suite (PGS) and discuss the basic functions available within the
More informationClinical trial information leaflet and consent
Informed consent 1(7) Clinical trial information leaflet and consent General You must provide sufficient information on the rights of clinical trial subjects, the purpose and nature of the trial, the methodologies
More informationNext Generation Sequencing. Target Enrichment
Next Generation Sequencing Target Enrichment Next Generation Sequencing Your Partner in Every Step from Sample to Data NGS: Revolutionizing Genetic Analysis with Single-Molecule Resolution Next generation
More informationBBMRI.NL A story of Sharing Eline Slagboom Molecular Epidemiology, Leiden University Medical Center Brussel 20 Juni 2017
BBMRI.NL A story of Sharing Eline Slagboom Molecular Epidemiology, Leiden University Medical Center Brussel 20 Juni 2017 BBMRI-NL TOWARDS A NATIONAL BIOBANKING INFRASTRUCTURE Founded in 2009 as the Dutch
More informationPROTOCOL FOR A RESEARCH SAMPLE REPOSITORY FOR ALLOGENEIC HEMATOPOIETIC STEM CELL TRANSPLANTATION, OTHER CELLULAR THERAPIES AND MARROW TOXIC INJURIES
Center for International Blood and Marrow Transplant Research PROTOCOL FOR A RESEARCH SAMPLE REPOSITORY FOR ALLOGENEIC HEMATOPOIETIC STEM CELL TRANSPLANTATION, OTHER CELLULAR THERAPIES AND MARROW TOXIC
More informationThe Sentieon Genomic Tools Improved Best Practices Pipelines for Analysis of Germline and Tumor-Normal Samples
The Sentieon Genomic Tools Improved Best Practices Pipelines for Analysis of Germline and Tumor-Normal Samples Andreas Scherer, Ph.D. President and CEO Dr. Donald Freed, Bioinformatics Scientist, Sentieon
More informationGenetics and Inflammatory Bowel Disease
Genetics and Inflammatory Bowel Disease Louis Ghanem M.D, Ph.D. Attending Physician, The Children s Hospital of Philadelphia Division of Gastroenterology, Hepatology and Nutrition Etiology of IBD Genetic
More informationThe Views of Participants in DNA Biobanks
The Views of Participants in DNA Biobanks Kelly E. Ormond, I Maureen E. Smith, II & Wendy A. Wolf III Abstract Biobanks are generally created with the long-term goal of establishing genotype-phenotype
More informationTaqPath ProAmp Master Mixes
PRODUCT BULLETIN es es Applied Biosystems TaqPath ProAmp Master Mixes are versatile master mixes developed for high-throughput genotyping and copy number variation (CNV) analysis protocols that require
More informationRuns of Homozygosity Analysis Tutorial
Runs of Homozygosity Analysis Tutorial Release 8.7.0 Golden Helix, Inc. March 22, 2017 Contents 1. Overview of the Project 2 2. Identify Runs of Homozygosity 6 Illustrative Example...............................................
More informationA GENOTYPE CALLING ALGORITHM FOR AFFYMETRIX SNP ARRAYS
Bioinformatics Advance Access published November 2, 2005 The Author (2005). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org
More informationQuantitative Genomics and Genetics BTRY 4830/6830; PBSB
Quantitative Genomics and Genetics BTRY 4830/6830; PBSB.5201.01 Jason Mezey Biological Statistics and Computational Biology (BSCB) Department of Genetic Medicine Institute for Computational Biomedicine
More informationEuropean Induced Pluripotent Stem Cell Bank
European Induced Pluripotent Stem Cell Bank Background Researchers are looking for ways to validate pathways using human data. ips cells are one approach to this. ips research and banking is fragmented
More informationCreate a Planned Run. Using the Ion AmpliSeq Pharmacogenomics Research Panel Plugin USER BULLETIN. Publication Number MAN Revision A.
USER BULLETIN Create a Planned Run Using the Ion AmpliSeq Pharmacogenomics Research Panel Plugin Publication Number MAN0013730 Revision A.0 For Research Use Only. Not for use in diagnostic procedures.
More informationARTICLE Sherlock: Detecting Gene-Disease Associations by Matching Patterns of Expression QTL and GWAS
ARTICLE Sherlock: Detecting Gene-Disease Associations by Matching Patterns of Expression QTL and GWAS Xin He, 1,2 Chris K. Fuller, 1 Yi Song, 1 Qingying Meng, 3 Bin Zhang, 4 Xia Yang, 3 and Hao Li 1, *
More informationOverall Architecture and Distributed Analysis Tools
Overall Architecture and Distributed Analysis Tools Daniella Meeker, PhD Department of Preventive Medicine University of Southern California Michael E. Matheny, MD, MS, MPH Tennessee Valley Healthcare
More information