The HL7 Clinical Genomics Work Group

Similar documents
IHIC 2011 Orlando, FL

Developing Data Models and Standards to Support Use Cases

What s New in HL7. W. Ed Hammond, PhD Director, Duke Center for Health Informatics FACMI, FAIMBE, FHL7, FIMIA Secretary, HL7; Chair-Emeritus, HL7

Comments on Use of Databases for Establishing the Clinical Relevance of Human Genetic Variants

Single Nucleotide Variant Analysis. H3ABioNet May 14, 2014

HITSP Construct HL7 Standard

The Future of Laboratory Implementation Guides

Informatics of Clinical Genomics

Introduction to Next Generation Sequencing (NGS) Andrew Parrish Exeter, 2 nd November 2017

ACCELERATING GENOMIC ANALYSIS ON THE CLOUD. Enabling the PanCancer Analysis of Whole Genomes (PCAWG) consortia to analyze thousands of genomes

Complete automation for NGS interpretation and reporting with evidence-based clinical decision support

From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow

An Automated Pipeline for NGS Testing and Reporting in a Commercial Molecular Pathology Lab: The Genoptix Case

Genome Sequencing: Applications and Newborn Screening. June 2, 2014

Project Scope Statement for: EHR- S FM + PHR- S FM + RM- ES FP on FHIR

Whole genome sequencing in drug discovery research: a one fits all solution?

CS273B: Deep Learning in Genomics and Biomedicine. Recitation 1 30/9/2016

Clinician s Guide to Actionable Genes and Genome Interpretation

PHYSICIAN RESOURCES MAPPED TO GENOMICS COMPETENCIES AND GAPS IDENTIFIED WITH CURRENT EDUCATIONAL RESOURCES AVAILABLE 06/04/14

SNOMED CT in the Age of Precision Medicine

Genomic Research: Issues to Consider. IRB Brown Bag August 28, 2014 Sharon Aufox, MS, LGC

FHIR. FHIR : Fast Healthcare Interoperability Resources (Pronounced Fire ) The hottest thing interoperability this year

Péter Antal Ádám Arany Bence Bolgár András Gézsi Gergely Hajós Gábor Hullám Péter Marx András Millinghoffer László Poppe Péter Sárközy BIOINFORMATICS

SENIOR BIOLOGY. Blueprint of life and Genetics: the Code Broken? INTRODUCTORY NOTES NAME SCHOOL / ORGANISATION DATE. Bay 12, 1417.

SNPs - GWAS - eqtls. Sebastian Schmeier

In This Issue September 9, 2010

Biomarker Data Management in Clinical Trials: Addressing the Challenges of the New Regulatory Landscape

Corporate Medical Policy

VARIANT TERMINOLOGY AND EXON NUMBERING

Axiom mydesign Custom Array design guide for human genotyping applications

Ontologies - Useful tools in Life Sciences and Forensics

Read Mapping and Variant Calling. Johannes Starlinger

Introductie en Toepassingen van Next-Generation Sequencing in de Klinische Virologie. Sander van Boheemen Medical Microbiology

Linking Genetic Variation to Important Phenotypes

"Stratification biomarkers in personalised medicine"

Fundamentals of Clinical Genomics

BCHM 6280 Tutorial: Gene specific information using NCBI, Ensembl and genome viewers

ICH Topic E16 Genomic Biomarkers Related to Drug Response: Context, Structure and Format of Qualification Submissions. Step 3

Target Enrichment Strategies for Next Generation Sequencing

Supreme Court of the United States ASSOCIATION FOR MOLECULAR PATHOLOGY et al., Petitioners v. MYRIAD GENETICS, INC., et al. Decided June 13, 2013.

Classification and Learning Using Genetic Algorithms

Handbook. Precautions/warnings: For professional use only. To support clinical diagnosis. Not yet offered in the US. Page 1

HL7 and Service-oriented Architecture (SOA) Ambassador Briefing

Next Generation Sequencing. Target Enrichment

Bridging the Genomics-Health IT Gap for Precision Medicine

Join us in Booth #4325 for live presentations on HL7 and Meaningful Use, Mobile Health, Consolidated CDA and other HL7 standards.

American Board of Medical Genetics and Genomics

Sanger vs Next-Gen Sequencing

BBMRI.NL A story of Sharing Eline Slagboom Molecular Epidemiology, Leiden University Medical Center Brussel 20 Juni 2017

Mutation entries in SMA databases Guidelines for national curators

Introduction to Bioinformatics

Preanalytical Processing: The Biospecimen Quality Imperative

ICH guideline E18 on genomic sampling and management of genomic data

Predictive Genomics for Prevention, Interception and Cure Health Datapalooza

SNP calling and VCF format

Types of Databases - By Scope

Lecture 2: Biology Basics Continued

MOLECULAR GENETICS REPORT: Glycogen Storage Disease NextGen Sequencing Panel

Terminology for personalized medicine

ICH Topic E15 Definitions for genomic biomarkers, pharmacogenomics, pharmacogenetics, genomic data and sample coding categories.

Bioinformatics for Proteomics. Ann Loraine

ELE4120 Bioinformatics. Tutorial 5

Bridging the Gap in Precision Medicine An Informatics Solution Connecting Research, Pathology, and Clinical Care

UNDERSTANDING GENETIC RESEARCH AND ALTERNATING HEMIPLEGIA OF CHILDHOOD. IT ALL BEGINS WITH THE AHC PATIENT And The Scientific Process

SUPPLEMENTARY INFORMATION

First Annual Biomarker Symposium Quest Diagnostics Clinical Trials

TOTAL CANCER CARE: CREATING PARTNERSHIPS TO ADDRESS PATIENT NEEDS

Complementary Technologies for Precision Genetic Analysis

Semantic Enrichment and the Information Manager

UVA Biorepository & Tissue Research Facility (BTRF) Coded-Specimens Guidance

Personalized Medicine


Molecular Diagnostics

Towards detection of minimal residual disease in multiple myeloma through circulating tumour DNA sequence analysis

Diagnostics in Oncology Mark Kockx MD, PhD

Symphony Genomics Workflow Management System Managing, Integrating, and Delivering Your NGS Data and Results

Sequence Variations. Baxevanis and Ouellette, Chapter 7 - Sequence Polymorphisms. NCBI SNP Primer:

Welcome to the NGS webinar series

Developing an Accurate and Precise Companion Diagnostic Assay for Targeted Therapies in DLBCL

Introduction to Bioinformatics

Genomic Data Workgroup. Issues and Trends in Electronic Genomic Data Exchange

Cancer Genetics Solutions

SYMPOSIUM March 22-23, 2018

PCORI Methodology Standards: Academic Curriculum Patient-Centered Outcomes Research Institute. All Rights Reserved.

Forthcoming Calls for Proposals (10/11) Barcelona 28 June 2013

5/18/2017. Genotypic, phenotypic or allelic frequencies each sum to 1. Changes in allele frequencies determine gene pool composition over generations

Konica Minolta to Acquire Invicro (US)

Big Data & Clinical Informatics

Leveraging of CDISC Standards to Support Dissemination, Integration and Analysis of Raw Clinical Trials Data to Advance Open Science

TRANSFORMING GLOBAL GENETIC DATA INTO MEDICAL DECISIONS

The 100,000 Genomes Project Genomics Collaboration Event Finnish Residence

The 150+ Tomato Genome (re-)sequence Project; Lessons Learned and Potential

QIAGEN s NGS Solutions for Biomarkers NGS & Bioinformatics team QIAGEN (Suzhou) Translational Medicine Co.,Ltd

Course Presentation. Ignacio Medina Presentation

TTT: 7 WT: Text book by N.C.E.R.T. 2. Reference book by Dinesh Publications.

Course Overview: Mutation Detection Using Massively Parallel Sequencing

Ensembl Tools. EBI is an Outstation of the European Molecular Biology Laboratory.

COMPUTER RESOURCES II:

Atlas of Genetics and Cytogenetics in Oncology and Haematology. IMMUNOGLOBULIN GENES: CONCEPT OF DNA REARRANGEMENT * Introduction

Transcription:

HL7 Clinical Genomics Domain Information Model The HL7 Clinical Genomics Work Group Prepared by Amnon Shabo (Shvo), PhD HL7 Clinical Genomics WG Co-chair and Modeling Facilitator Pedigree (family health history) Co-editor GTR Implementation Guide Primary Editor HL7 Structured Documents WG CDA R2 Co-editor CCD Implementation Guide Co-editor

Overview of Activities Main efforts: (chronologically ordered) v3: v2: CDA: Family History (Pedigree) Topic Deprecated: Genetic Locus / Loci Genetic Variations Topic Gene Expression Topic v2 Implementation Guides * The IG Genetic Test Result Reporting to EHR is modeled after the HL7 Version 2.5.1 Implementation Guide: Orders And Observations; Interoperable Laboratory Result Reporting To EHR (US Realm), Release 1 A CDA Implementation Guide for Genetic Testing Reports (GTR) Under Development: FHIR: Genomic resources Genetic family history profile Common: Domain Analysis Models Domain Information Model (DIM) describing the common semantics DIM is standards-independent; Semantic alignment among the various specs Normative DSTU Informative 2

CG Domain Information Models (DIMs) Represent agreed-upon and common CG semantics Are aligned with the CG DAM requirements and models A focal point to align the various CG specifications Balloted in 9/2014 at informative level Consisted of two models: Clinical Genomics Statement (CGS) A specialization of the HL7 Clinical Statement model An extension of the GTR Clinical Genomics Statement Family Health History (utilizing the CGS) Passed, as follows: Aff. : 29, Neg.: 12, Abst.: 69, NV: 24 All comments regarding CGS - reconciled (need to reconcile Pedigree) 3

Clinical Genomic Statement - Core Principles Explicit representation of a 'genotype-phenotype' association The association should not be pre-coordinated, that is, a phenotype is not represented as part of some genetic variation representation, e.g., a genetic biomarker is an explicit association of two distinct observations and there is no pre-coordination of the interpretation into a single biomarker code The association is a many-to-many relationship, e.g., a genetic variation can be associated with several phenotypes and a phenotype could be associated with multiple genotypes It is possible to populate merely the genotype part, thus serving genetic variation representation in general, even if no phenotypic data is available 4

Genetic Observations Capture various types of data, from known mutations to sequencingbased variations (e.g., somatic mutations in tumor tissues) to structural variations (e.g., large deletions, cytogenetics) to gene expression and so forth The core class GeneticObservation contains data that is not the result of downstream analysis, i.e., that cannot be derived analytically from another observation available to performer Link to GeneticObservationComponent class (zero to many) enables the post-coordination of any core data item (e.g., HGVS string) to its unambiguous basic primitives Enabling encapsulation of key raw omics data that is the basis of creating this clinical genomics statement The encapsulated data can have links to the full-blown omics data (e.g., NGS sequencing data) 5

Phenotypes Phenotype class is abstract and must be instantiated At instantiation time, distinguish between interpretive phenotype and observed Interpretation is represented as a distinct observation with its own attributes such as time, method, performer, etc. Links to knowledge sources / methods / services used for the interpretation 6

CG DIM Overview The genetic observation(s) The backbone: Gen-Phen association The Phenotype(s) Encapsulated raw data Interpretive phenotype Observed phenotype 7

Class: GeneticObservation Representation of any observation that cannot be analytically derived from other observations made by the performer, including genetic loci in the size of a gene, e.g., genes, biomarkers, point DNA mutations, etc., as well as structural variations, e.g., large DNA deletions or duplications, etc. Note that omics data derived from this observations, e.g., RNA, proteins, expression, etc., should be represented using the linked DerivedGeneticData class. 8

Class: Phenotype (abstract) Description: Representation of any data considered phenotype of the source genetic observation. For each genetic observation (e.g., variation), there could be multiple phenotypes, however, each phenotype object should represent a single phenotype. Defining a set of properties to be a 'single' phenotype might be determined by the intended utility of this phenotype, e.g. in patient care or in clinical trials or earlier on in different phases of research. Attributes: id: a globally-unique id of this observation type: the type of phenotypic observation, e.g., drug responsiveness value: the actual observation (should be drawn from dedicated ontologies, e.g., Human Phenotype Ontology, http://www.humanphenotype-ontology.org/) 9

Association Class: Gen-Phen This association class adds characteristics that cannot be represented through either side of the core association, e.g., type of relation like causality (pathogenic if phenotype is disease) or non-causative (benign) status of this relation, e.g., established, provisional, presumed, preliminary, etc." 10

Post-coordination of Gen. and Phen. Obs. Phenotye components Genetic observation components 11

Class: GeneticObservationComponent Description: Representation of components of the core genetic observation in a post-coordinated manner, i.e., breaking it down to its primitives in order to disambiguate data held by the GeneticObservation class, e.g., an HGVS string. Attributes: type: the type of genetic observation component, e.g., variation location, variation length, etc. value: the actual component observation, e.g., if type is variation base length then value is the actual length 12

Derived Genetic Data DerivedGeneticData: the results of downstream analysis of the source genetic observation 13

Class DerivedGeneticData Description: Representing any downstream data derived from the core genetic observation (including omics data), e.g., RNA, proteins, expression, etc. Attributes: id: a globally-unique id of this observation type: the type of observation, e.g., characteristics of the resulting protein value: the actual observation performer: e.g., unique id of the genetic lab that generated this data 14

Molecular Specimen and Indication Indication of performing the genetic observation Specimen used for the genetic observation 15

Encapsulated Key Raw Data Sets Key raw data sets are encapsulated in its native format, with references to the full-blown raw data 16

Class: KeyRawOmicsData Description: Representation of key data extracted from mass raw genomic data, e.g., an exon's sequence that is considered key to this clinical genomics statement, is extracted from some larger sequencing (up to whole exome squening) and encapsulated within this statement structure. Another example: certain lines of a VCF file (preferably its 'clinical-grade' version). The encapsulated data should contain references to the full-blown raw omics data. Attributes: encapsulateddata: a placeholder for the key raw omics data, e.g., DNA sequences, variants calls, expression data, etc. Could be any type of data encoding: binary, text, XML, etc. reference: a pointer to the repository containing the full-blown data sets out of which these key data items were `extracted from type: designating the notation used in the encapsulateddata attribute, e.g., VCF, FASTA, etc. schemareference: a pointer to the schema governing the representation of the data in the encapsulateddata attribute 17

Interpretive versus Observed Phenotype The Phenotype class is abstract and must be instantiated Additional information pertinent to the core phenotype observation Interpretive Phenotype an distinct observation Observed Phenotype, that is, observed in the individual and associated with the genetic observation 18

Class ObservedPhenotype (ext. Phenotype) Description: This is a sub-class of Phenotype, representing a phenotype observed in the subject, e.g., if the somatic mutation is known to be the reason of positive responsiveness to some drug, then that responsiveness is an observed phenotype. Attributes: clinicaltime: the effective time for the subject of this observation ageofonset: use this attribute in cases where dates are not available (e.g., in clinical genomics statements pertaining to a relative of a patient in a family health history) referencetohealthrecord: a pointer to an information system (e.g., EHR system) holding the broader contextual data of this observation (this could be implemented by guidance provided by health information exchange systems such as the specifications set out by the NwHIN in the US) 19

Class InterpretivePhenotype (ext. Phenotype) Description: This is a sub-class of Phenotype, representing a phenotype that is likely to be manifested in the subject, e.g, the somatic mutation is likely to cause resistance to some drug Attributes: method: the method by this interpretation was made (e.g., an identified algorithm, a rule engine, a clinical decision support application/service, etc.) interpretation Time: the time this interpretation was made available for this Clinical Genomics statement instance) performer: unique id of the entity making this interpretation (e.g., clinical decision support application/service) 20

Anatomic Pathology Specimen & Findings Anatomic pathology findings with optional reference to the full AP report Anatomic pathology specimen (prior to DNA/RNA extraction) 21

The End Thank you for your attention Questions? Contact Amnon at amnon.shvo@gmail.com Comments of general interest should be posted to the CG mailing list at clingenomics@lists.hl7.org 22