USER MANUAL for the use of the human Genome Clinical Annotation Tool (h-gcat) uthors: Klaas J. Wierenga, MD & Zhijie Jiang, P PhD

Size: px
Start display at page:

Download "USER MANUAL for the use of the human Genome Clinical Annotation Tool (h-gcat) uthors: Klaas J. Wierenga, MD & Zhijie Jiang, P PhD"

Transcription

1 USER MANUAL for the use of the human Genome Clinical Annotation Tool (h-gcat)) Authors: Klaas J. Wierenga, MD & Zhijie Jiang, PhD First edition, May

2 Introduction The Human Genome Clinical Annotation Tool, or h-gcat, is the product of a collaboration between Dr. Zhijie Jiang, PhD (ZJiang@med.miami.edu) at the Center for Computational Science (CCS) of the University of Miami, and Dr. Klaas Wierenga, MD (klaaswierenga@ouhsc.edu) at the Division of Genetics, Department of Pediatrics at the Oklahoma University Health Sciences Center. h-gcat is a server-based tool designed for the analysis of whole exome/genome sequencing data obtained from families affected by genetic disorder(s). The objective of h-gcat is to provide the user with a relatively simple and intuitive interface to proceed through the analysis. However, behind this deceptively simple front-end is a powerful analysis integrated with several clinical relevant databases. The user will be taken in a few steps through this analysis process. On the first page (Login Page), the user can register and, after registration, access the site. On the second page (Upload Page), a data file called variant call format (VCF) file can be uploaded onto the tool. On the third page (Quality Control Page), initial filtering parameters are set to remove the sites (or calls) that do not require further analysis. On the fourth page (Sample Page), the user can provide clinical, genetic and pedigree information, and select the type(s) of genetic locations and types of mutations that are deemed of highest interest. In case the user thinks that the disorder may already have a clinical annotation in the Human Gene Mutation Database (HGMD), Disease Ontology (DO) and/or Online Mendelian Inheritance in Man (OMIM), the user can use this clinical information to refine such a search. Within OMIM, the user can also search for OMIM-derived clinical keywords, or, alternatively, use Human Phenotype Ontology codes. Once this is accomplished, the user has the option of entering coordinates of regions of homozygosity (ROH) to focus the analysis on these regions, if homozygous recessive conditions within these ROH are deemed likely. Lastly, the user has an option of analyzing a limited number of genes (eg. a favorite gene list, based on genomic location, or on association with disorders with genetic heterogeneity). On the fifth page (Result Page), the user can review the sites (calls) that 'survived' after these iterative filtering activities. The result page provides the user with relevant information on the location of the sites (calls), the dbsnp status, the gene-based cdna location of the sites (calls) for the various isoforms, as well as the effect of the mutation on the protein. Link Outs to the National Center for Biotechnology Information (NCBI), UCSC Genome Browser and OMIM entries are provided for review. To the very right of the page, the user can review the data for 1

3 the various family members studied, and review the quality of the data underlying genotype status (ref/ref, ref/alt, alt/alt) by mousing over the various genotypes. On the last 2 pages (Sample Page and Result Page) the user can review the filter parameters selected by clicking Review your filtering selection. At the end of the analysis, the entire activity can be downloaded to the desktop in Excel spreadsheet format for record keeping. Our tool is still being developed. We will focus shortly on annotating sites for splice site mutation potential, and on building a missense mutation taster to provide an informed prediction as to whether a missense mutation is damaging or not. 2

4 Login Page The user needs to register for the site using full name, address and affiliation (this should be the institute or company the user is associated with). Once registered, a password is sent to this address. The password can be changed by the user any time. After registration has been completed successfully, and on any occasion thereafter, the user enters the address and password to get access too the Upload Page. 3

5 Upload page Currently, the only file format that can be uploaded to thee h-gcat tool is a variant call format (VCF) file. The file should be located on a hard drive, memory stick, or cloud storage. Using the Browse button, the file to be analyzed can be identified, and selected for upload to the tool. The file to be uploaded should be visible on the box beside the Browse button. Keep the radiobutton at the default selection (VCF), and click the Upload button. Upload may take a few minutes, depending on the size of the file, and the upload speed of your internet access. We suggest thatt the size of the VCF filee should not exceed 1000 Mbyte. Once completed, you will be taken to the Quality Control page. 4

6 Quality Control page Typically, in a variant call format (VCF), there are 40, ,000 sites (calls). This means that there are 40,000-80,000 sites that are different from thee reference sequence. However, many of these 40,000-80,,000 sites do not need to be considered as the cause of the Mendelian disorder being studied because some of them have poor sequencing quality, genotype calling quality, while others are major (common) alleles in somee populations. In the Quality Control page, we can filter quickly, by removing: 1. calls for which the data quality is poor 2. common SNPs, since a common SNP is not likely thee cause of a rare disorder 3. (calls on) chromosomes that we are not interested in ad 1. The sequencing quality of the site is to a large extent measured by the sequencing depth (30X, 50X, etc) ), and the genotype calling quality is measured by the transformed probability. Not every call however is of high quality. In preparing a VCFF file, not all sites with poor quality are removed, as there is no point in spending a lot of time analyzing poor quality data. Typically, the default values of the tool can be used. They can be changed as needed, based on information available to you concerning the platform used to obtain the VCF file.. ad 2. There are 2 standardd allele frequency datasets, derived from multiple populations, the 1,0000 genome dataset, or 1kG, and the 5600 NHLBI dataset, or ESP. The assumption is that it is unlikely that these common SNPs/variants are the causee of a rare Mendelian disorder. The user can decide which dataset(s) to use to filter out these common SNPs, given the known ethnicity of the patient/family being studied, and then decide on the minor allele frequency for filtering (ie. 5

7 any SNP/variant with a minor allele frequency greater than the user-defined cut-off will be removed). ad 3. Typically, the user will have informed assumptions of the type of inheritance that is likely underlying the disorder in the family studied. Typically, if the condition is assumed to be X-linked, the user can remove the sites (calls) from all other chromosomes, and just enter X. In autosomal recessive or autosomal dominant conditions, the user may want to discard sites on chromosomes X and Y, and enter In all other cases, the user should use the default all. There is one more option, to be used selectively. This provides the user with information on the genes that were poorly covered by exome sequencing. This option may be useful at a later time, if data analysis fails to come up with reasonable hits. If used, be prepared to wait a while, as this feature takes time to complete. Once the selections are made, the Apply button can be clicked, and the h-gcat tool will filter out those variants/calls using the parameters selected in this page. The program then removes those sites lying outside of the set parameters, and retains a more limited dataset for further scrutiny. 6

8 Sample page On the Sample Page the user can click on the header Review your filtering selections to quickly review the filtering criteria and the outcome of thee filtering process, i.e., the number of sites removed, and the number of sites retained for further scrutiny. In the cartoon above, one can see that the VCF file contained 75,917 sites, and that 4,962 were of poor quality, and hence removed. One can also see that after removing common variants using 1kG for Global and ESP for All Populations, and after removing sites on chromosomes X and Y, of the 64,104 sites 'only' 8,586 remain. The Sample Page is the heart of the program. First, it allows the user to specify the inheritance pattern. Commonly assumed inheritance patterns are: 1. de novo dominant - the proband has a dominant disorder, but both parents do not have this condition. 2. dominant inheritance - the proband has a dominant disorder, and one of the parents, and possibly other relatives, have the same condition. 3. recessive inheritance, homozygous - the proband, and possibly some siblings as well, have a recessive disorder due to homozygous mutations, while parents are carriers of this mutation - there may be (remote) consanguinity or inbreeding. 4. recessive inheritance, compound heterozygous - the proband, and possibly some siblings as well, have a recessive disorder caused by the presence of 2, in trans, mutations. Each parent is a carrier for one of the mutations. 5. X-linked recessive - the (male) patient typically inherits the mutation from his mother. 6. de novo X-linked recessivee - similar to de novo dominant, the parents (here: the mother) does not carry the mutation causing the disorder in her son. 7

9 In the next few pages you will see the Sample Page filled out by inheritance pattern. Of course, it is not always known what the inheritance pattern is. The analysis can be done more than once. Eg., a single affected male in a family could have an X-linked de novo, X-linked inherited, a homozygous recessive, a compound heterozygous, or even a de novo dominant disorder, implying that the analysis may have to be done using 5 different templates. 8

10 Template for de novo dominant disorders All unaffected individuals are (ref/ref, ref = the referencee allele), while the single affected person would be (ref/alt) - the alt being a minor allele, in this case due to a spontaneous mutation. 9

11 Template for dominantly inherited disorders All unaffected individuals are (ref/ref), while the affected individuals are (ref/alt), in this case passed on from one affected individual to other(s). 10

12 Template for homozygous recessive inheritance In a setting where homozygosityy risk is increased, it is useful to identify any homozygous mutations in affected individuals (alt/alt). In such case, the parents should both be carrier (ref/alt), and the healthy siblings can be (ref/ref) or (ref/alt). In some cases, a SNP array may have been done, and in that case the regions of homozygosity are known (and short ROH can be obtained from the laboratory where the SNP array was done). The various ROH can be copied and pasted for further filtering, selecting only sites that map to the various ROH. For now, the coordinates of the various ROH need to be in the hg19 version of the human genome assembly. 11

13 Template for compound heterozygous recessive inheritance In cases to be evaluated for compound heterozygosity, the situation is complicated. Typically, 2 mutations in one gene can be in cis or in trans, also called phase. Phase cannot be assessed from the patient s dataa alone, and parental information is therefore required. Therefore, in order to properly evaluate for compound heterozygosity, the data of at least one affected individual and the 2 unaffected parents should be available. Filling out the expected genotypes in non-intuitive (and not needed), instead we have provided the user with a compound heterozygosity option, see black arrow. 12

14 Template for X-linked recessive inherited In X-linked disorder, the affected male is hemizygous (alt), but in the site annotation this appears as (alt/alt). In many instances the mother is a carrier (ref/alt), while the genotype of the father is irrelevant, and can be left blank. 13

15 Template for de novo X-linked recessive e In X-linked disorder, the affected male is hemizygous (alt), but in the site annotation this appears as (alt/alt). In this scenario, the mother is not a carrier, but wild type (ref/ref), while the genotypee of the father is irrelevant, and can be left blank. 14

16 Filters by Mutation(s) The next step on the Sample Page is the filter by mutation type. The user will have to decide whether to search for all sites, or to select only annotated sites, or alternatively to select only non-annotated (novel) sites. Next, the user will have to select where the sites to be considered may be located: within the gene, or outside the gene. If within the gene, the user can select exons or introns, and if exons are selected, decide to evaluate only untranslated regions (UTR) or only coding sequences (CDS). Then, if CDS is selected, the user can decide to select all mutations, only synonymous sites, or only non-synonymousynonymous sites. Currently, we cannot evaluate SNPs for splice mutation potential, but expect this to be available in the next few sites. On first run, it may be useful to select novel SNPs, CDS and non- weeks. 15

17 Filter by Annotations This option is useful if the user would like to know whether the patient may have a known (annotated), but unrecognized genetic condition. Humann Genome Mutation Database (HGMD) can be search for mutations in genes annotated as causing a disorder, similar to Disease Ontology (DO). Alternatively, OMIM keywords or HPO search terms can be used. OMIM search HPO search 16

18 Filter by Preferred Genes Lastly, the user has an option of checking on sites withinn a favorite gene list, for example a list of genes involved in a certain pathway, or found in conditions with genetic heterogeneity, eg. as seen in retinitis pigmentosa, channelopathies etc. 17

19 Result Page The result page reports the sitess that survived the various filtering activities. The sites are organized by chromosome, and ordered from pter to qter. The position is clickable, and Links Out to UCSC genome browser. If this position is a known SNP, a clickable entry can be found in the SNP ID column, and Links Out to dbsnp. Next listed are a reference site(s) (by isoforms) and alternative site( (s). The gene within which the sites are located is listed in the Gene column, which Links Out to NCBI gene.. Next listed are the various isoforms found for the gene, with the corresponding NM_xxxxxx identifier, which is also clickable. The SNP function provides information on the type of mutation, followed by the residue (amino acid) change in case of missense mutation. Then, the user can review the Linked Out OMIM entries (gene / associated disorders), as well as HGMD and DO entries. Lastly, the user can review the genotype data by individual, and by mousing over the genotype, the user can review the reads, to see if the genotype was called correctly. In the result page below, as an example, the user selected for compound heterozygosity, with the affected child being compound heterozygous, and each parents a carrier of one of the sites. In the result page below, as an example, a family with consanguineous parents and 3 children, 1 of whom is affected. The search was done looking for homozygous mutations in the affected child, with parents assumed to be carriers, and healthy siblings carriers or wild-type. 18

20 Download / Saving to desktop Lastly, the user can save the conducted search and results to the Desktop in Excel spreadsheet format, for later review and comparison. 19

21 Under development: We aim to develop two additions to the tool: 1. Checking sites for potential for causing splice mutations 2. Tasting missense mutations, using various available tools (SIFT, Provean, Polyphen2, etc) 20

Novel Variant Discovery Tutorial

Novel Variant Discovery Tutorial Novel Variant Discovery Tutorial Release 8.4.0 Golden Helix, Inc. August 12, 2015 Contents Requirements 2 Download Annotation Data Sources...................................... 2 1. Overview...................................................

More information

Prioritization: from vcf to finding the causative gene

Prioritization: from vcf to finding the causative gene Prioritization: from vcf to finding the causative gene vcf file making sense A vcf file from an exome sequencing project may easily contain 40-50 thousand variants. In order to optimize the search for

More information

Niemann-Pick Type C Disease Gene Variation Database ( )

Niemann-Pick Type C Disease Gene Variation Database (   ) NPC-db (vs. 1.1) User Manual An introduction to the Niemann-Pick Type C Disease Gene Variation Database ( http://npc.fzk.de ) curated 2007/2008 by Dirk Dolle and Heiko Runz, Institute of Human Genetics,

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION doi:10.1038/nature13127 Factors to consider in assessing candidate pathogenic mutations in presumed monogenic conditions The questions itemized below expand upon the definitions in Table 1 and are provided

More information

Variant prioritization in NGS studies: Annotation and Filtering "

Variant prioritization in NGS studies: Annotation and Filtering Variant prioritization in NGS studies: Annotation and Filtering Colleen J. Saunders (PhD) DST/NRF Innovation Postdoctoral Research Fellow, South African National Bioinformatics Institute/MRC Unit for Bioinformatics

More information

Single Nucleotide Variant Analysis. H3ABioNet May 14, 2014

Single Nucleotide Variant Analysis. H3ABioNet May 14, 2014 Single Nucleotide Variant Analysis H3ABioNet May 14, 2014 Outline What are SNPs and SNVs? How do we identify them? How do we call them? SAMTools GATK VCF File Format Let s call variants! Single Nucleotide

More information

Introduction to Next Generation Sequencing (NGS) Andrew Parrish Exeter, 2 nd November 2017

Introduction to Next Generation Sequencing (NGS) Andrew Parrish Exeter, 2 nd November 2017 Introduction to Next Generation Sequencing (NGS) Andrew Parrish Exeter, 2 nd November 2017 Topics to cover today What is Next Generation Sequencing (NGS)? Why do we need NGS? Common approaches to NGS NGS

More information

MedSavant: An open source platform for personal genome interpretation

MedSavant: An open source platform for personal genome interpretation MedSavant: An open source platform for personal genome interpretation Marc Fiume 1, James Vlasblom 2, Ron Ammar 3, Orion Buske 1, Eric Smith 1, Andrew Brook 1, Sergiu Dumitriu 2, Christian R. Marshall

More information

In silico variant analysis: Challenges and Pitfalls

In silico variant analysis: Challenges and Pitfalls In silico variant analysis: Challenges and Pitfalls Fiona Cunningham Variation annotation coordinator EMBL-EBI www.ensembl.org Sequencing -> Variants -> Interpretation Structural variants SNP? In-dels

More information

RareVariantVis 2: R suite for analysis of rare variants in whole genome sequencing data.

RareVariantVis 2: R suite for analysis of rare variants in whole genome sequencing data. RareVariantVis 2: R suite for analysis of rare variants in whole genome sequencing data. Adam Gudyś and Tomasz Stokowy October 30, 2017 Introduction The search for causative genetic variants in rare diseases

More information

Exploring genomic databases: Practical session "

Exploring genomic databases: Practical session Exploring genomic databases: Practical session Work through the following practical exercises on your own. The objective of these exercises is to become familiar with the information available in each

More information

Using VarSeq to Improve Variant Analysis Research

Using VarSeq to Improve Variant Analysis Research Using VarSeq to Improve Variant Analysis Research June 10, 2015 G Bryce Christensen Director of Services Questions during the presentation Use the Questions pane in your GoToWebinar window Agenda 1 Variant

More information

Handbook. Precautions/warnings: For professional use only. To support clinical diagnosis. Not yet offered in the US. Page 1

Handbook. Precautions/warnings: For professional use only. To support clinical diagnosis. Not yet offered in the US. Page 1 Handbook Precautions/warnings: For professional use only. To support clinical diagnosis. Not yet offered in the US. Page 1 CentoMDx TM 1.1_Handbook_V1.1_March2018 Contents Introduction... 3 Intended use...

More information

Gen e e n t e i t c c V a V ri r abi b li l ty Biolo l gy g Lec e tur u e e 9 : 9 Gen e et e ic I n I her e itan a ce

Gen e e n t e i t c c V a V ri r abi b li l ty Biolo l gy g Lec e tur u e e 9 : 9 Gen e et e ic I n I her e itan a ce Genetic Variability Biology 102 Lecture 9: Genetic Inheritance Asexual reproduction = daughter cells genetically identical to parent (clones) Sexual reproduction = offspring are genetic hybrids Tendency

More information

Answers to additional linkage problems.

Answers to additional linkage problems. Spring 2013 Biology 321 Answers to Assignment Set 8 Chapter 4 http://fire.biol.wwu.edu/trent/trent/iga_10e_sm_chapter_04.pdf Answers to additional linkage problems. Problem -1 In this cell, there two copies

More information

ANNOVAR Variant Annotation and Interpretation

ANNOVAR Variant Annotation and Interpretation 1 ANNOVAR Variant Annotation and Interpretation Copyrighted 2018 Isabelle Schrauwen and Suzanne M. Leal This exercise touches on several functionalities of the program ANNOVAR to annotate and interpret

More information

Applied Bioinformatics

Applied Bioinformatics Applied Bioinformatics In silico and In clinico characterization of genetic variations Assistant Professor Department of Biomedical Informatics Center for Human Genetics Research ATCAAAATTATGGAAGAA ATCAAAATCATGGAAGAA

More information

Investigating Inherited Diseases

Investigating Inherited Diseases Investigating Inherited Diseases The purpose of these exercises is to introduce bioinformatics databases and tools. We investigate an important human gene and see how mutations give rise to inherited diseases.

More information

BICD100 Midterm (10/27/10) KEY

BICD100 Midterm (10/27/10) KEY BICD100 Midterm (10/27/10) KEY 1. Variation in tail length is characteristic of some dog breeds, such as Pembroke Welsh Corgis, which sometimes show a bob tail (short tail) phenotype (see illustration

More information

Identification of the Photoreceptor Transcriptional Co-Repressor SAMD11 as Novel Cause of. Autosomal Recessive Retinitis Pigmentosa

Identification of the Photoreceptor Transcriptional Co-Repressor SAMD11 as Novel Cause of. Autosomal Recessive Retinitis Pigmentosa Identification of the Photoreceptor Transcriptional Co-Repressor SAMD11 as Novel Cause of Autosomal Recessive Retinitis Pigmentosa Corton M 1,2 *, Avila-Fernández A 1,2, Campello L 3, Sánchez M 1,2, Benavides

More information

Briefly, this exercise can be summarised by the follow flowchart:

Briefly, this exercise can be summarised by the follow flowchart: Workshop exercise Data integration and analysis In this exercise, we would like to work out which GWAS (genome-wide association study) SNP associated with schizophrenia is most likely to be functional.

More information

OncoMD User Manual Version 2.6. OncoMD: Cancer Analytics Platform

OncoMD User Manual Version 2.6. OncoMD: Cancer Analytics Platform OncoMD: Cancer Analytics Platform 1 Table of Contents 1. INTRODUCTION... 3 2. OVERVIEW OF ONCOMD... 3 3. ORGANIZATION OF INFORMATION IN ONCOMD... 3 4. GETTING STARTED... 6 4.1 USER AUTHENTICATION... 6

More information

From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow

From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow Technical Overview Import VCF Introduction Next-generation sequencing (NGS) studies have created unanticipated challenges with

More information

Produced by the Centre for Genetics Education. Internet: 5

Produced by the Centre for Genetics Education. Internet:   5 Important points Genes are made of DNA The information in the DNA is in the form of a chemical code made up of four letters (A, T, C and G). Each word in the information is made up of three of these four

More information

THE HEALTH AND RETIREMENT STUDY: GENETIC DATA UPDATE

THE HEALTH AND RETIREMENT STUDY: GENETIC DATA UPDATE : GENETIC DATA UPDATE April 30, 2014 Biomarker Network Meeting PAA Jessica Faul, Ph.D., M.P.H. Health and Retirement Study Survey Research Center Institute for Social Research University of Michigan HRS

More information

Annotating your variants: Ensembl Variant Effect Predictor (VEP) Helen Sparrow Ensembl EMBL-EBI 2nd November 2016

Annotating your variants: Ensembl Variant Effect Predictor (VEP) Helen Sparrow Ensembl EMBL-EBI 2nd November 2016 Training materials Ensembl training materials are protected by a CC BY license http://creativecommons.org/licenses/by/4.0/ If you wish to re-use these materials, please credit Ensembl for their creation

More information

Selecting TILLING mutants

Selecting TILLING mutants Selecting TILLING mutants The following document will explain how to select TILLING mutants for your gene(s) of interest. To begin, you will need the IWGSC gene model identifier for your gene(s), the IWGSC

More information

The human gene encoding Glucose-6-phosphate dehydrogenase (G6PD) is located on chromosome X in cytogenetic band q28.

The human gene encoding Glucose-6-phosphate dehydrogenase (G6PD) is located on chromosome X in cytogenetic band q28. Data mining in Ensembl with BioMart Worked Example The human gene encoding Glucose-6-phosphate dehydrogenase (G6PD) is located on chromosome X in cytogenetic band q28. Which other genes related to human

More information

user s guide Question 3

user s guide Question 3 Question 3 During a positional cloning project aimed at finding a human disease gene, linkage data have been obtained suggesting that the gene of interest lies between two sequence-tagged site markers.

More information

Understanding the science and technology of whole genome sequencing

Understanding the science and technology of whole genome sequencing Understanding the science and technology of whole genome sequencing Dag Undlien Department of Medical Genetics Oslo University Hospital University of Oslo and The Norwegian Sequencing Centre d.e.undlien@medisin.uio.no

More information

Hands-On Four Investigating Inherited Diseases

Hands-On Four Investigating Inherited Diseases Hands-On Four Investigating Inherited Diseases The purpose of these exercises is to introduce bioinformatics databases and tools. We investigate an important human gene and see how mutations give rise

More information

Fast, Accurate and Sensitive DNA Variant Detection from Sanger Sequencing:

Fast, Accurate and Sensitive DNA Variant Detection from Sanger Sequencing: Fast, Accurate and Sensitive DNA Variant Detection from Sanger Sequencing: Patented, Anti-Correlation Technology Provides 99.5% Accuracy & Sensitivity to 5% Variant Knowledge Base and External Annotation

More information

user s guide Question 3

user s guide Question 3 Question 3 During a positional cloning project aimed at finding a human disease gene, linkage data have been obtained suggesting that the gene of interest lies between two sequence-tagged site markers.

More information

Week 1 BCHM 6280 Tutorial: Gene specific information using NCBI, Ensembl and genome viewers

Week 1 BCHM 6280 Tutorial: Gene specific information using NCBI, Ensembl and genome viewers Week 1 BCHM 6280 Tutorial: Gene specific information using NCBI, Ensembl and genome viewers Web resources: NCBI database: http://www.ncbi.nlm.nih.gov/ Ensembl database: http://useast.ensembl.org/index.html

More information

Alissa Interpret The next evolution of Cartagenia Bench

Alissa Interpret The next evolution of Cartagenia Bench Alissa Interpret The next evolution of Cartagenia Bench Case Study: An Efficient Clinical Pipeline for Microcephaly, RASopathy and Leukodystrophy Gene Panels Using Alissa Interpret s Flexible Classification

More information

Exome Sequencing Exome sequencing is a technique that is used to examine all of the protein-coding regions of the genome.

Exome Sequencing Exome sequencing is a technique that is used to examine all of the protein-coding regions of the genome. Glossary of Terms Genetics is a term that refers to the study of genes and their role in inheritance the way certain traits are passed down from one generation to another. Genomics is the study of all

More information

INTRODUCTION NEW GENETIC TECHNIQUES IN METABOLIC DISEASES 26/01/2016 FROM ONE GENERATION TO THE NEXT. Image challenge of the week D.

INTRODUCTION NEW GENETIC TECHNIQUES IN METABOLIC DISEASES 26/01/2016 FROM ONE GENERATION TO THE NEXT. Image challenge of the week D. NEW GENETIC TECHNIQUES IN METABOLIC DISEASES D. RYMEN, MD, PHD Pentalfa session Leuven, January 21 st 2016 INTRODUCTION Image challenge of the week 1 INTRODUCTION But what about Hypotonia Facial dysmorphism

More information

Exam 1 Answers Biology 210 Sept. 20, 2006

Exam 1 Answers Biology 210 Sept. 20, 2006 Exam Answers Biology 20 Sept. 20, 2006 Name: Section:. (5 points) Circle the answer that gives the maximum number of different alleles that might exist for any one locus in a normal mammalian cell. A.

More information

REPRODUCTIVE HEALTH. Phosphorus. Diagnostics

REPRODUCTIVE HEALTH. Phosphorus. Diagnostics 14 32 6 32 7 REPRODUCTIVE HEALTH Phosphorus Diagnostics Genomics, Illuminated We focus on your peace of mind. Our expert-guided assay designs, powerful data analytics, and rigorous variant interpretation

More information

BCHM 6280 Tutorial: Gene specific information using NCBI, Ensembl and genome viewers

BCHM 6280 Tutorial: Gene specific information using NCBI, Ensembl and genome viewers BCHM 6280 Tutorial: Gene specific information using NCBI, Ensembl and genome viewers Web resources: NCBI database: http://www.ncbi.nlm.nih.gov/ Ensembl database: http://useast.ensembl.org/index.html UCSC

More information

Oncomine cfdna Assays Part III: Variant Analysis

Oncomine cfdna Assays Part III: Variant Analysis Oncomine cfdna Assays Part III: Variant Analysis USER GUIDE for use with: Oncomine Lung cfdna Assay Oncomine Colon cfdna Assay Oncomine Breast cfdna Assay Catalog Numbers A31149, A31182, A31183 Publication

More information

Gene-centered databases and Genome Browsers

Gene-centered databases and Genome Browsers COURSE OF BIOINFORMATICS a.a. 2015-2016 Gene-centered databases and Genome Browsers We searched Accession Number: M60495 AT NCBI Nucleotide Gene has been implemented at NCBI to organize information about

More information

Gene-centered databases and Genome Browsers

Gene-centered databases and Genome Browsers COURSE OF BIOINFORMATICS a.a. 2016-2017 Gene-centered databases and Genome Browsers We searched Accession Number: M60495 AT NCBI Nucleotide Gene has been implemented at NCBI to organize information about

More information

Molecular Genetics of Disease and the Human Genome Project

Molecular Genetics of Disease and the Human Genome Project 9 Molecular Genetics of Disease and the Human Genome Project Fig. 1. The 23 chromosomes in the human genome. There are 22 autosomes (chromosomes 1 to 22) and two sex chromosomes (X and Y). Females inherit

More information

Assignment 9: Genetic Variation

Assignment 9: Genetic Variation Assignment 9: Genetic Variation Due Date: Friday, March 30 th, 2018, 10 am In this assignment, you will profile genome variation information and attempt to answer biologically relevant questions. The variant

More information

Genomics: Human variation

Genomics: Human variation Genomics: Human variation Lecture 1 Introduction to Human Variation Dr Colleen J. Saunders, PhD South African National Bioinformatics Institute/MRC Unit for Bioinformatics Capacity Development, University

More information

RFLP Method - Restriction Fragment Length Polymorphism

RFLP Method - Restriction Fragment Length Polymorphism RFLP Method - Restriction Fragment Length Polymorphism RFLP (often pronounced "rif lip", as if it were a word) is a method used by molecular biologists to follow a particular sequence of DNA as it is passed

More information

Identification of Single Nucleotide Polymorphisms and associated Disease Genes using NCBI resources

Identification of Single Nucleotide Polymorphisms and associated Disease Genes using NCBI resources Identification of Single Nucleotide Polymorphisms and associated Disease Genes using NCBI resources Navreet Kaur M.Tech Student Department of Computer Engineering. University College of Engineering, Punjabi

More information

Genetics and Genomics in Clinical Research

Genetics and Genomics in Clinical Research Genetics and Genomics in Clinical Research An Immersion Course for Clinical Investigators at UAB Introduction and Overview Bruce R. Korf, MD, PhD Goals Describe approaches to study of the genomic contributions

More information

Bioinformatics for Proteomics. Ann Loraine

Bioinformatics for Proteomics. Ann Loraine Bioinformatics for Proteomics Ann Loraine aloraine@uab.edu What is bioinformatics? The science of collecting, processing, organizing, storing, analyzing, and mining biological information, especially data

More information

Mutations in CKAP2L, the Human Homolog of the Mouse Radmis Gene, Cause Filippi Syndrome

Mutations in CKAP2L, the Human Homolog of the Mouse Radmis Gene, Cause Filippi Syndrome The American Journal of Human Genetics, Volume 95 Supplemental Data Mutations in CKAP2L, the Human Homolog of the Mouse Radmis Gene, Cause Filippi Syndrome Muhammad Sajid Hussain, Agatino Battaglia, Sandra

More information

Targeted resequencing

Targeted resequencing Targeted resequencing Sarah Calvo, Ph.D. Computational Biologist Vamsi Mootha laboratory Snapshots of Genome Wide Analysis in Human Disease (MPG), 4/20/2010 Vamsi Mootha, PI How can I assess a small genomic

More information

Training materials.

Training materials. Training materials - Ensembl training materials are protected by a CC BY license - http://creativecommons.org/licenses/by/4.0/ - If you wish to re-use these materials, please credit Ensembl for their creation

More information

DESIGNER GENES SAMPLE TOURNAMENT

DESIGNER GENES SAMPLE TOURNAMENT DESIGNER GENES SAMPLE TOURNAMENT PART ONE- GENETICS PROBLEMS In dogs, the inheritance of hair color involves a gene (B) for black hair and a gene (b) for brown hair. A dominant (C) is also involved. It

More information

SNPTracker: A swift tool for comprehensive tracking and unifying dbsnp rsids and genomic coordinates of massive sequence variants

SNPTracker: A swift tool for comprehensive tracking and unifying dbsnp rsids and genomic coordinates of massive sequence variants G3: Genes Genomes Genetics Early Online, published on November 19, 2015 as doi:10.1534/g3.115.021832 SNPTracker: A swift tool for comprehensive tracking and unifying dbsnp rsids and genomic coordinates

More information

CS273B: Deep Learning in Genomics and Biomedicine. Recitation 1 30/9/2016

CS273B: Deep Learning in Genomics and Biomedicine. Recitation 1 30/9/2016 CS273B: Deep Learning in Genomics and Biomedicine. Recitation 1 30/9/2016 Topics Genetic variation Population structure Linkage disequilibrium Natural disease variants Genome Wide Association Studies Gene

More information

HST.161 Molecular Biology and Genetics in Modern Medicine Fall 2007

HST.161 Molecular Biology and Genetics in Modern Medicine Fall 2007 MIT OpenCourseWare http://ocw.mit.edu HST.161 Molecular Biology and Genetics in Modern Medicine Fall 2007 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.

More information

SeattleSNPs Interactive Tutorial: Database Inteface Entrez, dbsnp, HapMap, Perlegen

SeattleSNPs Interactive Tutorial: Database Inteface Entrez, dbsnp, HapMap, Perlegen SeattleSNPs Interactive Tutorial: Database Inteface Entrez, dbsnp, HapMap, Perlegen The tutorial is designed to take you through the steps necessary to access SNP data from the primary database resources:

More information

Part I: Predicting Genetic Outcomes

Part I: Predicting Genetic Outcomes Part I: Predicting Genetic Outcomes Deoxyribonucleic acid (DNA) is found in every cell of living organisms, and all of the cells in each organism contain the exact same copy of that organism s DNA. Because

More information

Custom Panels via Clinical Exomes

Custom Panels via Clinical Exomes Custom Panels via Clinical Exomes Andrew Wallace Feb 2015 Genomic Diagnostics Laboratory St. Mary s Hospital, Manchester Custom Panel via Exome Approach Pros Reduced validation overhead Single validation

More information

HGVS Mutation Nomenclature Queries

HGVS Mutation Nomenclature Queries Molecular Genetics Participants Meeting 2014 HGVS Mutation Nomenclature Queries UK NATIONAL EXTERNAL QUALITY ASSESSMENT SERVICES General Remark HGVS nomenclature aim: To provide unambiguous descriptions

More information

Runs of Homozygosity Analysis Tutorial

Runs of Homozygosity Analysis Tutorial Runs of Homozygosity Analysis Tutorial Release 8.7.0 Golden Helix, Inc. March 22, 2017 Contents 1. Overview of the Project 2 2. Identify Runs of Homozygosity 6 Illustrative Example...............................................

More information

AS91159 Demonstrate understanding of gene expression

AS91159 Demonstrate understanding of gene expression AS91159 Demonstrate understanding of gene expression Mutations and Metabolic Pathways (2015,2) In 1941 biologists George Beadle and Edward Tatum exposed the bread mould Neurospora crassa to radiation.

More information

Maternal mosaicism for IDUA deletion clarifies recurrence risk in MPS I.

Maternal mosaicism for IDUA deletion clarifies recurrence risk in MPS I. Maternal mosaicism for IDUA deletion clarifies recurrence risk in MPS I. Catherine Breen 1, Jean Mercer 1, Simon A Jones 1, Amir Jahic 2, Lesley Heptinstall 1, Karen Tylee 1, William G Newman 1,* Christian

More information

Structure, Measurement & Analysis of Genetic Variation

Structure, Measurement & Analysis of Genetic Variation Structure, Measurement & Analysis of Genetic Variation Sven Cichon, PhD Professor of Medical Genetics, Director, Division of Medcial Genetics, University of Basel Institute of Neuroscience and Medicine

More information

CITATION FILE CONTENT / FORMAT

CITATION FILE CONTENT / FORMAT CITATION 1) For any resultant publications using single samples please cite: Matthew A. Field, Vicky Cho, T. Daniel Andrews, and Chris C. Goodnow (2015). "Reliably detecting clinically important variants

More information

i. This type of DNA damage will result in (circle all correct answers) b. a transversion mutation

i. This type of DNA damage will result in (circle all correct answers) b. a transversion mutation Biol 321 Winter 2010 Quiz 5 40 points NAME ANSWERS IN RED COMMENTS IN BLUE 1. (2 pts.) Recall the article entitled: Human Genetic Variation By each statement circle True/False/Not addressed in the paper.

More information

Guided tour to Ensembl

Guided tour to Ensembl Guided tour to Ensembl Introduction Introduction to the Ensembl project Walk-through of the browser Variations and Functional Genomics Comparative Genomics BioMart Ensembl Genome browser http://www.ensembl.org

More information

Analytics Behind Genomic Testing

Analytics Behind Genomic Testing A Quick Guide to the Analytics Behind Genomic Testing Elaine Gee, PhD Director, Bioinformatics ARUP Laboratories 1 Learning Objectives Catalogue various types of bioinformatics analyses that support clinical

More information

Autozygosity by difference a method for locating autosomal recessive mutations. Geoff Pollott

Autozygosity by difference a method for locating autosomal recessive mutations. Geoff Pollott Autozygosity by difference a method for locating autosomal recessive mutations Geoff Pollott Background Mutations occur regularly in all species Autosomal recessive conditions arise in most breeds from

More information

3. A form of a gene that is only expressed in the absence of a dominant alternative is:

3. A form of a gene that is only expressed in the absence of a dominant alternative is: Student Name: Teacher: Date: District: Robeson Assessment: 9_12 Agriculture AU71 - Biotech and Agrisci Rsch I Test 3 Description: Obj 12 - Simple Mendelian Genetics Form: 501 1. The genotype of an organism

More information

NGS in Pathology Webinar

NGS in Pathology Webinar NGS in Pathology Webinar NGS Data Analysis March 10 2016 1 Topics for today s presentation 2 Introduction Next Generation Sequencing (NGS) is becoming a common and versatile tool for biological and medical

More information

SENIOR BIOLOGY. Blueprint of life and Genetics: the Code Broken? INTRODUCTORY NOTES NAME SCHOOL / ORGANISATION DATE. Bay 12, 1417.

SENIOR BIOLOGY. Blueprint of life and Genetics: the Code Broken? INTRODUCTORY NOTES NAME SCHOOL / ORGANISATION DATE. Bay 12, 1417. SENIOR BIOLOGY Blueprint of life and Genetics: the Code Broken? NAME SCHOOL / ORGANISATION DATE Bay 12, 1417 Bay number Specimen number INTRODUCTORY NOTES Blueprint of Life In this part of the workshop

More information

Basics in Genetics. Teruyoshi Hishiki

Basics in Genetics. Teruyoshi Hishiki Basics in Genetics Teruyoshi Hishiki Advanced Bioinformatics 10/Apr/2017 1 Contents 1. Human Genetics: an application A case study of Familial Mediterranean Fever (FMF) patients 2. Introduction to human

More information

SENIOR BIOLOGY. Blueprint of life and Genetics: the Code Broken? INTRODUCTORY NOTES NAME SCHOOL / ORGANISATION DATE. Bay 12, 1417.

SENIOR BIOLOGY. Blueprint of life and Genetics: the Code Broken? INTRODUCTORY NOTES NAME SCHOOL / ORGANISATION DATE. Bay 12, 1417. SENIOR BIOLOGY Blueprint of life and Genetics: the Code Broken? NAME SCHOOL / ORGANISATION DATE Bay 12, 1417 Bay number Specimen number INTRODUCTORY NOTES Blueprint of Life In this part of the workshop

More information

Complex Inheritance and Human Heredity

Complex Inheritance and Human Heredity Complex Inheritance and Human Heredity Before You Read Use the What I Know column to list the things you know about human heredity and genetics. Then list the questions you have about these topics in the

More information

Genetics and Heredity Power Point Questions

Genetics and Heredity Power Point Questions Name period date assigned date due date returned Genetics and Heredity Power Point Questions 1. Heredity is the process in which pass from parent to offspring. 2. is the study of heredity. 3. A trait is

More information

Jumping Into Your Gene Pool: Understanding Genetic Test Results

Jumping Into Your Gene Pool: Understanding Genetic Test Results Jumping Into Your Gene Pool: Understanding Genetic Test Results Susan A. Berry, MD Professor and Director Division of Genetics and Metabolism Department of Pediatrics University of Minnesota Acknowledgment

More information

Ensembl workshop. Thomas Randall, PhD bioinformatics.unc.edu. handouts, papers, datasets

Ensembl workshop. Thomas Randall, PhD bioinformatics.unc.edu.   handouts, papers, datasets Ensembl workshop Thomas Randall, PhD tarandal@email.unc.edu bioinformatics.unc.edu www.unc.edu/~tarandal/ensembl handouts, papers, datasets Ensembl is a joint project between EMBL - EBI and the Sanger

More information

Activity 3.3.1: How is DNA Passed through the Generations?

Activity 3.3.1: How is DNA Passed through the Generations? Activity 3.3.1: How is DNA Passed through the Generations? Introduction In the previous activities, you learned that Anna Garcia lived with a life altering disease called sickle cell anemia. Unlike the

More information

Variant calling in NGS experiments

Variant calling in NGS experiments Variant calling in NGS experiments Jorge Jiménez jjimeneza@cipf.es BIER CIBERER Genomics Department Centro de Investigacion Principe Felipe (CIPF) (Valencia, Spain) 1 Index 1. NGS workflow 2. Variant calling

More information

user s guide Question 1

user s guide Question 1 Question 1 How does one find a gene of interest and determine that gene s structure? Once the gene has been located on the map, how does one easily examine other genes in that same region? doi:10.1038/ng966

More information

Dr. Mallery Biology Workshop Fall Semester CELL REPRODUCTION and MENDELIAN GENETICS

Dr. Mallery Biology Workshop Fall Semester CELL REPRODUCTION and MENDELIAN GENETICS Dr. Mallery Biology 150 - Workshop Fall Semester CELL REPRODUCTION and MENDELIAN GENETICS CELL REPRODUCTION The goal of today's exercise is for you to look at mitosis and meiosis and to develop the ability

More information

Biology 105: Introduction to Genetics Page 1 Midterm Fall 2004

Biology 105: Introduction to Genetics Page 1 Midterm Fall 2004 Biology 105: Introduction to Genetics Page 1 Midterm Fall 2004 KEY Part I. Definitions Hemizygous: The genotype for genes present only in one copy in an otherwise diploid organism; e.g. X-linked genes

More information

Analysis of neo-antigens to identify T-cell neo-epitopes in human Head & Neck cancer. Project XX1001. Customer Detail

Analysis of neo-antigens to identify T-cell neo-epitopes in human Head & Neck cancer. Project XX1001. Customer Detail Analysis of neo-antigens to identify T-cell neo-epitopes in human Head & Neck cancer Project XX Customer Detail Table of Contents. Bioinformatics analysis pipeline...3.. Read quality check. 3.2. Read alignment...3.3.

More information

Terminology: chromosome; gene; allele; proteins; enzymes

Terminology: chromosome; gene; allele; proteins; enzymes Title Workshop on genetic disease and gene therapy Authors Danielle Higham (BSc Genetics), Dr. Maggy Fostier Contact Maggy.fostier@manchester.ac.uk Target level KS4 science, GCSE (or A-level) Publication

More information

Gene-Level Analysis of Exon Array Data using Partek Genomics Suite 6.6

Gene-Level Analysis of Exon Array Data using Partek Genomics Suite 6.6 Gene-Level Analysis of Exon Array Data using Partek Genomics Suite 6.6 Overview This tutorial will demonstrate how to: Summarize core exon-level data to produce gene-level data Perform exploratory analysis

More information

Ensembl Tools. EBI is an Outstation of the European Molecular Biology Laboratory.

Ensembl Tools. EBI is an Outstation of the European Molecular Biology Laboratory. Ensembl Tools EBI is an Outstation of the European Molecular Biology Laboratory. Questions? We ve muted all the mics Ask questions in the Chat box in the webinar interface I will check the Chat box periodically

More information

Human Chromosomes Section 14.1

Human Chromosomes Section 14.1 Human Chromosomes Section 14.1 In Today s class. We will look at Human chromosome and karyotypes Autosomal and Sex chromosomes How human traits are transmitted How traits can be traced through entire families

More information

BIOL 225 Genetics-Final Exam November 17, 2006 Dr. Sandra Davis

BIOL 225 Genetics-Final Exam November 17, 2006 Dr. Sandra Davis BIOL 225 Genetics-Final Exam November 17, 2006 Dr. Sandra Davis INSTRUCTIONS: 1. Read the questions carefully and write your answers in the space provided. If you need more space, clearly indicate WHERE

More information

Genetics, Fall 2005 TEST 2, 11/16/05 Page 1

Genetics, Fall 2005 TEST 2, 11/16/05 Page 1 Genetics, Fall 2005 TEST 2, 11/16/05 Page 1 STUDENT NAME: Give a brief definition of the following terms (5 points each; only nine definitions count for the grade): 1. phenotype 2. homozygous 3. codominance

More information

1. What is the wild-type phenotype? 2. What is the assay? 3. Who are the mutants? Fascinated, Garrod decided to study the disease more.

1. What is the wild-type phenotype? 2. What is the assay? 3. Who are the mutants? Fascinated, Garrod decided to study the disease more. B. Pedigrees In 1901 a physician in London named Archibald Garrod had some new patients with an unusual condition: when their urine came into contact with air it turned black. 1. What is the wild-type

More information

What is genetic variation?

What is genetic variation? enetic Variation Applied Computational enomics, Lecture 05 https://github.com/quinlan-lab/applied-computational-genomics Aaron Quinlan Departments of Human enetics and Biomedical Informatics USTAR Center

More information

Table of Contents. Chapter: Heredity. Section 1: Genetics. Section 2: Genetics Since Mendel. Section 3: Biotechnology

Table of Contents. Chapter: Heredity. Section 1: Genetics. Section 2: Genetics Since Mendel. Section 3: Biotechnology Table of Contents Chapter: Heredity Section 1: Genetics Section 2: Genetics Since Mendel Section 3: Biotechnology 1 Genetics Inheriting Traits Eye color, nose shape, and many other physical features are

More information

Exam 3 4/25/07. Total of 7 questions, 100 points.

Exam 3 4/25/07. Total of 7 questions, 100 points. Exam 3 4/25/07 BISC 4A P. Sengupta Total of 7 questions, 100 points. QUESTION 1. Circle the correct answer. Total of 40 points 4 points each. 1. Which of the following is typically attacked by the antibody-mediated

More information

Gathering of pathogenicity evidence for novel variants. By Lewis Pang

Gathering of pathogenicity evidence for novel variants. By Lewis Pang Gathering of pathogenicity evidence for novel variants By Lewis Pang Novel variants A newly discovered, distinct genetic alteration Within our lab a variant is deemed novel if we haven t seen it before.

More information

7.03 Exam 3 Review GENE REGULATION

7.03 Exam 3 Review GENE REGULATION GENE REGULATION Example Problem You are studying the regulation of a yeast enzyme Cal1, whose activity you can measure directly. Expression of the gene encoding Cal1 is inhibited by the presence of calcium

More information

Huether and McCance: Understanding Pathophysiology, 5 th Edition

Huether and McCance: Understanding Pathophysiology, 5 th Edition Huether and McCance: Understanding Pathophysiology, 5 th Edition Chapter 02: Genes and Genetic Diseases Test Bank MULTIPLE CHOICE 1. A nurse recalls the basic components of DNA are: a. Pentose sugars and

More information

Tecnologie NGS - Next Generation Sequencing e implicazioni diagnosticoterapeutiche

Tecnologie NGS - Next Generation Sequencing e implicazioni diagnosticoterapeutiche Tecnologie NGS - Next Generation Sequencing e implicazioni diagnosticoterapeutiche Maria Iascone Lab. Genetica Medica ASST-PG23, Bergamo Trento, 24 Settembre 2016 Genetic Discoveries Timeline $ 3 billion

More information

Mutation entries in SMA databases Guidelines for national curators

Mutation entries in SMA databases Guidelines for national curators 1 Mutation entries in SMA databases Guidelines for national curators GENERAL CONSIDERATIONS Role of the curator(s) of a national database Molecular data can be collected by many different ways. There are

More information