Sequence Variations. Baxevanis and Ouellette, Chapter 7 - Sequence Polymorphisms. NCBI SNP Primer:

Size: px
Start display at page:

Download "Sequence Variations. Baxevanis and Ouellette, Chapter 7 - Sequence Polymorphisms. NCBI SNP Primer:"

Transcription

1 Sequence Variations Baxevanis and Ouellette, Chapter 7 - Sequence Polymorphisms NCBI SNP Primer:

2 Overview Mutation and Alleles Linkage Genetic variation in populations SNPs as genetic markers Classical genetic diseases Multi-factorial diseases and risk factors Genome scans (genotyping)

3 A review of some basic genetics

4 Alleles An allele is a particular DNA sequence for a gene. Some gene alleles are responsible for ordinary phenotypes like blue/brown eyes. Others lead to classic genetic diseases like cystic fibrosis or Huntington s disease.

5 Changes occur in DNA sequences = mutations

6 Many Causes of Mutations Somatic vs. reproductive cells Radiation and/or chemical damage to DNA Random errors of the replication machinery Normal biological processes - methylation

7 Mutations Create Alleles Mutations occur randomly throughout DNA. Most have no phenotypic effect (non-coding regions, equivalent codons, similar AAs). Some damage the function of a protein or regulatory element. A very few provide an evolutionary advantage.

8 Population Genetics Chromosome pairs segregate and recombine in every generation. Every allele of every gene has its own independent evolutionary history (and future). Frequencies of various alleles differ in different subpopulations of people.

9 Human Alleles The OMIM (Online Mendelian Inheritance in Man) database at the NCBI tracks all human mutations with known pheontypes. It contains a total of about 2,000 genetic diseases [and another ~11,000 genetic loci with known phenotypes - but not necessarily known gene sequences] It is designed for use by physicians: can search by disease name contains summaries from clinical studies

10 OMIM Morbid Map: Cytogenetic map location of disease genes.

11 Variation Makes Life Interesting The Human Genome has been sequenced; what s next? Much of what makes us unique individuals is represented by the differences in our DNA sequence from other people. There are rare and common forms (alleles) of every gene. Probably only 3-4 alleles are present in 95% of the population for most genes, but lots of rare mutations.

12 SNPs are Mutations

13 SNPs A mutation that causes a single base change is known as a Single Nucleotide Polymorphism (SNP). Other kinds of mutations include insertions and deletions. Large breaks and rearrangement of chromosomes also occur (translocations)s GATTTAGATCGCGATAGAG GATTTAGATCTCGATAGAG ^

14 SNPs are Very Common SNPs are very common in the human population. Between any two people, there is an average of one SNP every ~1250 bases. Most of these have no phenotypic effect. Only <1% of all human SNPs impact protein function (non-coding regions). Selection against mis-sense mutations (think about what would happen to dominant lethal mutations?). Some are alleles of genes.

15 Genome Sequencing finds SNPs The Human Genome Project involves sequencing DNA cloned from a number of different people. [The Celera sequence comes from 5 people.] Even within one person s DNA, the homologous chromosomes have SNPs. This inevitably leads to the discovery of SNPs - any single base sequence difference These SNPs can be valuable as the basis for diagnostic tests

16

17 We describe a map of 1.42 million single nucleotide polymorphisms (SNPs) distributed throughout the human genome, providing an average density on available sequence of one SNP every 1.9 kilobases. These SNPs were primarily discovered by two projects: The SNP Consortium and the analysis of clone overlaps by the International Human Genome Sequencing Consortium. The map integrates all publicly available SNPs with described genes and other genomic features. We estimate that 60,000 SNPs fall within exon (coding and untranslated regions), and 85% of exons are within 5 kb of the nearest SNP. Nucleotide diversity varies greatly across the genome, in a manner broadly consistent with a standard population genetic model of human history. This high-density SNP map provides a public resource for defining haplotype variation across the genome, and should help to identify biomedically important genes for diagnosis and therapy.

18

19 SNP Discovery: dbsnp database

20

21 Search dbsnp with BLAST As of June, 2008, dbsnp has 12.8 million SNPs in the human genome It is possible to search dbsnp by BLAST comparisons to a target sequence

22 >gnl dbsnp rs _allelepos=51 total len = 101 taxid = 9606 snpclass = 1 Length = 101 Score = 149 bits (75), Expect = 3e-33 Identities = 79/81 (97%) Strand = Plus / Plus If a matching SNP is found, then it can be directly located on the Genome map Query: 1489 ccctcttccctgacctcccaactctaaagccaagcactttatatttttctcttagatatt 1548 Sbjct: 1 ccctcttccctgacctcccaactctaaagccaagcactttatattttcctyttagatatt 60 Query: 1549 cactaaggacttaaaataaaa 1569 Sbjct: 61 cactaaggacttaaaataaaa 81

23 Uses for SNPs Diagnostic tests for disease alleles Markers to aid in cloning of interesting genes (disease genes) Pharmacogenomics - genetics of reponse to drugs (effectiveness and side effects)

24 DNA Diagnostic Testing Hereditary diseases - potential parents, prenatal, late onset diseases. Genes that predispose to disease (risk factors). Genotyping of infectious agents (bacterial & viral). Forensics - using DNA testing to establish identity.

25 Clinical Manifestations of Genetic Variation (All disease has a genetic component) Susceptibility vs. resistance Variations in disease severity or symptoms Reaction to drugs (pharmacogenetics) Variable disease course and prognosis SNPs can be found that are linked to all of these traits.

26 Finding Disease Genes Virtually all diseases have a genetic component. Start with DNA samples from families that show inheritance of the disease. Use STS markers to map the gene or genes involved (linkage analysis). Find SNPs in the genetic region(s) that are likely candidates for involvement in that disease. Get the gene from genomic sub-clone.

27 Some Diseases Involve Many Genes There are a number of classic genetic diseases caused by mutations of a single gene. Huntington s, Cystic Fibrosis, Tay-Sachs, PKU, etc. There are also many diseases that are the result of the interactions of many genes: asthma, heart disease, cancer Each of these genes may be considered to be a risk factor for the disease. Groups of genetic markers (SNPs) may be associated with a disease without determining a mechanism.

28 Multiple Causes Some diseases may actually be caused by any of a group of different genes (multiple causes), but all show the same symptoms. SNP linkage analysis can identify these sub-populations more efficiently than classical molecular genetic approaches. Machine learning, genetic algorithms, SVMs

29 The study of the distribution of genetic variants, including SNPs, lies within the domain of population genetics, and the study of the relationship between SNPs and phenotypic variation lies in the domain of quantitative genetics. Gibson&Muse

30 A B c a B C a B C A B c a B C a B c A b c A b c A b c a b C a b C A b c A b c a B C A B c a b C a B c A b c Quantitative Trait Locus Mapping A B C a b c F 1 A B C a b c F 1 X a b c a b c A B C A B C Parent 3 Parent 4 X HEIGHT GENOTYPE BB Bb bb B b Bb Bb Bb BB BB BB bb bb bb a b c a b c A B C A B C Parent 1 Parent 2 X Knott et al. (1997) TAG 84:

31 Association Mapping ancestral chromosomes G T * recombination through evolutionary history present-day chromosomes in natural population G A C C G A T C * G A T T * *

32 SNP Discovery Methods Pairwise Sequence Comparison from databases, esnp Deep Resequencing

33 SNP Analysis Agenda Sequence-Based SNP Identification Common Bioinformatic Solutions Phred, Phrap, Consed, Polyphred, and Polybayes High-Throughput SNP Identification Solution

34

35 Overlapping PCR Amplicons across entire gene Make no assumptions about sequence function Sequence diversity and genetic structure for each gene is different Proper association studies can only be designed in this context Complete resequencing facilitates population genetics methods

36 Sequence-based SNP Identification Amplify DNA 5 3 Sequence Phred Phrap Sequence each end of the fragment. Base-calling Quality determination Contig assembly Final quality determination PolyPhred/Polybayes Polymorphism detection ATAGACG ATAGACG ATACACG ATACACG ATAGACG ATACACG Consed Sequence viewing Polymorphism tagging Analysis Homozygotes Heterozygote Polymorphism reporting Individual genotyping Phylogenetic analysis

37 Phred, Phrap, Consed, Polyphred, Polybayes phred: Base calling and quality assignments phrap: Contig formation and new quality assignments consed: Visual X-Windows graphic interface, to view and edit alignments and contigs, and to view the original traces polyphred: find polymorphisms in phrap contigs, quality calls, add data to phrap files to permit consed finding and visualization of polymorphisms. polybayes: Fully probabilistic SNP detection algorithm that calculates the probability (SNP score) that discrepancies at a given location of a multiple alignment represent true sequence variations as opposed to sequencing errors.

38 Nature Genetics 23, (1999) A general approach to single-nucleotide polymorphism discovery Gabor T. Marth, Ian Korf, Mark D. Yandell, Raymond T. Yeh, Zhijie Gu, Hamideh Zakeri, Nathan O. Stitziel, LaDeana Hillier, Pui-Yan Kwok & Warren R. Gish Figure 1. Application of the POLYBAYES procedure to EST data. a, Regions of known human repeats in a genomic sequence are masked. b, Matching human ESTs are retrieved from dbest and traces are re-called. c, Paralogous ESTs are identified and discarded. d, Alignments of native EST reads are screened for candidate variable sites. e, An STS is designed for the verification of a candidate SNP. f, The uniqueness of the genomic location is determined by sequencing the STS in CHM1 (homozygous DNA). g, The presence of a SNP is analysed by sequencing the STS from pooled DNA samples.

39 PolyPhred: automating the detection and genotyping of single nucleotide substitutions using fluorescence-based resequencing Deborah A. Nickerson*, Vincent O. Tobe and Scott L. Taylor Nucleic Acids Research; :2745 SNP calling Correct call False positive False positivefalse positive

40 Trace File High quality region no ambiguities

41 Trace File Medium quality region some ambiguities

42 Trace File Poor quality region low confidence

43 Using PolyPhred to Visualize SNPs Compares sequences across traces obtained from different individuals to identify sites for SNPs. Will occasionally miscall genotypes - frequency of such mistakes depends on the sequencing chemistry used to generate the trace. To reduce the number of miscalled sites, ignores regions of poor quality & ends

44 Polyphred Reads the ACE file to obtain the consensus sequence and the names of the trace (chromat) files used in the assembly. Reads the PHD files associated with each trace. During the SNP search phase, PolyPhred combines information from all of the sequence traces to derive a genotype and a score for each sequence The score indicates how well the trace at the site matches the expected pattern for a SNP. Updates the ACE and PHD files by adding tags that mark the positions of the sites. The tagged sites can then be examined using Consed.

45 Polybayes Bayesian statistical model takes into account: - depth of coverge - base quality values of the sequences Polybayes calculations are aided with information on major/minor allele frequencies as well as polymorphism rates within the species under investigation **Can also integrate into the poly files for viewing with Consed

46 Alignment and SNP Calling Pipeline Challenges in High-Throughput SNP Identification Alignment Critical in the automation of base calls Commonly used Phrap (from PhredPhrap) is an assembler and is NOT ideal for alignments Many commonly used aligners work best with protein sequences or with a reference sequence Preservation of quality scores for input into SNP identification programs Speed for high-throughput programs Automated SNP Calls - Reference Sequence Required - Traditional approaches without reference sequence include esnps (human, maize, and pine) -Very little redundancy outside of abundant genes -Overall high number of false positives (single pass reads) - Not specific to frequencies observed in different organisms - High number of false positives in currently accepted methods (Polybayes & Polyphred)

47 5 UTR exon Intron 3 UTR

48 4-Coumarate CoA Ligase (4CL) F4 R4 F3 R3 F2 R1A F5 R3 F6 R bound_moiety="amp" proposed active site A C T A C T G A A T A C T A C T G A A T A C T A C T G A A T A C T A C T G A A T A C T A C T G G A T A C T A C T G G A T A C T A C T G G A T A C T A C T G G A T A C T A C C G G A T A C T A C C G G A T A C T A C C G G A T A C T A C C G G A T A C T A C C G G A C A C T A C C G G A C A C T A C C G G A C A C T A C C G G A C A C T A C C A G A C A C T A C C A G A C A C T A C C A G A C A C T A C C A G A C A C T A C C A G A C A C T A C C A G A C A C T A C C A G A C A C T G T C G G G C A C T G T C G G G C G C A G C C G G G C

CS273B: Deep Learning in Genomics and Biomedicine. Recitation 1 30/9/2016

CS273B: Deep Learning in Genomics and Biomedicine. Recitation 1 30/9/2016 CS273B: Deep Learning in Genomics and Biomedicine. Recitation 1 30/9/2016 Topics Genetic variation Population structure Linkage disequilibrium Natural disease variants Genome Wide Association Studies Gene

More information

Mutations during meiosis and germ line division lead to genetic variation between individuals

Mutations during meiosis and germ line division lead to genetic variation between individuals Mutations during meiosis and germ line division lead to genetic variation between individuals Types of mutations: point mutations indels (insertion/deletion) copy number variation structural rearrangements

More information

An introduction to genetics and molecular biology

An introduction to genetics and molecular biology An introduction to genetics and molecular biology Cavan Reilly September 5, 2017 Table of contents Introduction to biology Some molecular biology Gene expression Mendelian genetics Some more molecular

More information

BENG 183 Trey Ideker. Genome Assembly and Physical Mapping

BENG 183 Trey Ideker. Genome Assembly and Physical Mapping BENG 183 Trey Ideker Genome Assembly and Physical Mapping Reasons for sequencing Complete genome sequencing!!! Resequencing (Confirmatory) E.g., short regions containing single nucleotide polymorphisms

More information

SNP calling and VCF format

SNP calling and VCF format SNP calling and VCF format Laurent Falquet, Oct 12 SNP? What is this? A type of genetic variation, among others: Family of Single Nucleotide Aberrations Single Nucleotide Polymorphisms (SNPs) Single Nucleotide

More information

Association Mapping in Plants PLSC 731 Plant Molecular Genetics Phil McClean April, 2010

Association Mapping in Plants PLSC 731 Plant Molecular Genetics Phil McClean April, 2010 Association Mapping in Plants PLSC 731 Plant Molecular Genetics Phil McClean April, 2010 Traditional QTL approach Uses standard bi-parental mapping populations o F2 or RI These have a limited number of

More information

Concepts: What are RFLPs and how do they act like genetic marker loci?

Concepts: What are RFLPs and how do they act like genetic marker loci? Restriction Fragment Length Polymorphisms (RFLPs) -1 Readings: Griffiths et al: 7th Edition: Ch. 12 pp. 384-386; Ch.13 pp404-407 8th Edition: pp. 364-366 Assigned Problems: 8th Ch. 11: 32, 34, 38-39 7th

More information

Midterm 1 Results. Midterm 1 Akey/ Fields Median Number of Students. Exam Score

Midterm 1 Results. Midterm 1 Akey/ Fields Median Number of Students. Exam Score Midterm 1 Results 10 Midterm 1 Akey/ Fields Median - 69 8 Number of Students 6 4 2 0 21 26 31 36 41 46 51 56 61 66 71 76 81 86 91 96 101 Exam Score Quick review of where we left off Parental type: the

More information

FORENSIC GENETICS. DNA in the cell FORENSIC GENETICS PERSONAL IDENTIFICATION KINSHIP ANALYSIS FORENSIC GENETICS. Sources of biological evidence

FORENSIC GENETICS. DNA in the cell FORENSIC GENETICS PERSONAL IDENTIFICATION KINSHIP ANALYSIS FORENSIC GENETICS. Sources of biological evidence FORENSIC GENETICS FORENSIC GENETICS PERSONAL IDENTIFICATION KINSHIP ANALYSIS FORENSIC GENETICS Establishing human corpse identity Crime cases matching suspect with evidence Paternity testing, even after

More information

Sept 2. Structure and Organization of Genomes. Today: Genetic and Physical Mapping. Sept 9. Forward and Reverse Genetics. Genetic and Physical Mapping

Sept 2. Structure and Organization of Genomes. Today: Genetic and Physical Mapping. Sept 9. Forward and Reverse Genetics. Genetic and Physical Mapping Sept 2. Structure and Organization of Genomes Today: Genetic and Physical Mapping Assignments: Gibson & Muse, pp.4-10 Brown, pp. 126-160 Olson et al., Science 245: 1434 New homework:due, before class,

More information

Worksheet for Bioinformatics

Worksheet for Bioinformatics Worksheet for Bioinformatics ACTIVITY: Learn to use biological databases and sequence analysis tools Exercise 1 Biological Databases Objective: To use public biological databases to search for latest research

More information

Computational Workflows for Genome-Wide Association Study: I

Computational Workflows for Genome-Wide Association Study: I Computational Workflows for Genome-Wide Association Study: I Department of Computer Science Brown University, Providence sorin@cs.brown.edu October 16, 2014 Outline 1 Outline 2 3 Monogenic Mendelian Diseases

More information

Conifer Translational Genomics Network Coordinated Agricultural Project

Conifer Translational Genomics Network Coordinated Agricultural Project Conifer Translational Genomics Network Coordinated Agricultural Project Genomics in Tree Breeding and Forest Ecosystem Management ----- Module 2 Genes, Genomes, and Mendel Nicholas Wheeler & David Harry

More information

Theory and Application of Multiple Sequence Alignments

Theory and Application of Multiple Sequence Alignments Theory and Application of Multiple Sequence Alignments a.k.a What is a Multiple Sequence Alignment, How to Make One, and What to Do With It Brett Pickett, PhD History Structure of DNA discovered (1953)

More information

Gap Filling for a Human MHC Haplotype Sequence

Gap Filling for a Human MHC Haplotype Sequence American Journal of Life Sciences 2016; 4(6): 146-151 http://www.sciencepublishinggroup.com/j/ajls doi: 10.11648/j.ajls.20160406.12 ISSN: 2328-5702 (Print); ISSN: 2328-5737 (Online) Gap Filling for a Human

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION SUPPLEMENTARY INFORMATION doi:10.1038/nature09937 a Name Position Primersets 1a 1b 2 3 4 b2 Phenotype Genotype b Primerset 1a D T C R I E 10000 8000 6000 5000 4000 3000 2500 2000 1500 1000 800 Donor (D)

More information

Measurement of Molecular Genetic Variation. Forces Creating Genetic Variation. Mutation: Nucleotide Substitutions

Measurement of Molecular Genetic Variation. Forces Creating Genetic Variation. Mutation: Nucleotide Substitutions Measurement of Molecular Genetic Variation Genetic Variation Is The Necessary Prerequisite For All Evolution And For Studying All The Major Problem Areas In Molecular Evolution. How We Score And Measure

More information

Finishing Fosmid DMAC-27a of the Drosophila mojavensis third chromosome

Finishing Fosmid DMAC-27a of the Drosophila mojavensis third chromosome Finishing Fosmid DMAC-27a of the Drosophila mojavensis third chromosome Ruth Howe Bio 434W 27 February 2010 Abstract The fourth or dot chromosome of Drosophila species is composed primarily of highly condensed,

More information

4.1. Genetics as a Tool in Anthropology

4.1. Genetics as a Tool in Anthropology 4.1. Genetics as a Tool in Anthropology Each biological system and every human being is defined by its genetic material. The genetic material is stored in the cells of the body, mainly in the nucleus of

More information

Structural variation. Marta Puig Institut de Biotecnologia i Biomedicina Universitat Autònoma de Barcelona

Structural variation. Marta Puig Institut de Biotecnologia i Biomedicina Universitat Autònoma de Barcelona Structural variation Marta Puig Institut de Biotecnologia i Biomedicina Universitat Autònoma de Barcelona Genetic variation How much genetic variation is there between individuals? What type of variants

More information

Identification of Single Nucleotide Polymorphisms and associated Disease Genes using NCBI resources

Identification of Single Nucleotide Polymorphisms and associated Disease Genes using NCBI resources Identification of Single Nucleotide Polymorphisms and associated Disease Genes using NCBI resources Navreet Kaur M.Tech Student Department of Computer Engineering. University College of Engineering, Punjabi

More information

The Polymerase Chain Reaction. Chapter 6: Background

The Polymerase Chain Reaction. Chapter 6: Background The Polymerase Chain Reaction Chapter 6: Background Invention of PCR Kary Mullis Mile marker 46.58 in April of 1983 Pulled off the road and outlined a way to conduct DNA replication in a tube Worked for

More information

Basic Concepts of Human Genetics

Basic Concepts of Human Genetics Basic Concepts of Human Genetics The genetic information of an individual is contained in 23 pairs of chromosomes. Every human cell contains the 23 pair of chromosomes. One pair is called sex chromosomes

More information

7-1. Read this exercise before you come to the laboratory. Review the lecture notes from October 15 (Hardy-Weinberg Equilibrium)

7-1. Read this exercise before you come to the laboratory. Review the lecture notes from October 15 (Hardy-Weinberg Equilibrium) 7-1 Biology 1001 Lab 7: POPULATION GENETICS PREPARTION Read this exercise before you come to the laboratory. Review the lecture notes from October 15 (Hardy-Weinberg Equilibrium) OBECTIVES At the end of

More information

Multiple choice questions (numbers in brackets indicate the number of correct answers)

Multiple choice questions (numbers in brackets indicate the number of correct answers) 1 Multiple choice questions (numbers in brackets indicate the number of correct answers) February 1, 2013 1. Ribose is found in Nucleic acids Proteins Lipids RNA DNA (2) 2. Most RNA in cells is transfer

More information

Mapping and Mapping Populations

Mapping and Mapping Populations Mapping and Mapping Populations Types of mapping populations F 2 o Two F 1 individuals are intermated Backcross o Cross of a recurrent parent to a F 1 Recombinant Inbred Lines (RILs; F 2 -derived lines)

More information

AGRO/ANSC/BIO/GENE/HORT 305 Fall, 2016 Overview of Genetics Lecture outline (Chpt 1, Genetics by Brooker) #1

AGRO/ANSC/BIO/GENE/HORT 305 Fall, 2016 Overview of Genetics Lecture outline (Chpt 1, Genetics by Brooker) #1 AGRO/ANSC/BIO/GENE/HORT 305 Fall, 2016 Overview of Genetics Lecture outline (Chpt 1, Genetics by Brooker) #1 - Genetics: Progress from Mendel to DNA: Gregor Mendel, in the mid 19 th century provided the

More information

Why learn sequence database searching? Searching Molecular Databases with BLAST

Why learn sequence database searching? Searching Molecular Databases with BLAST Why learn sequence database searching? Searching Molecular Databases with BLAST What have I cloned? Is this really!my gene"? Basic Local Alignment Search Tool How BLAST works Interpreting search results

More information

Lecture Four. Molecular Approaches I: Nucleic Acids

Lecture Four. Molecular Approaches I: Nucleic Acids Lecture Four. Molecular Approaches I: Nucleic Acids I. Recombinant DNA and Gene Cloning Recombinant DNA is DNA that has been created artificially. DNA from two or more sources is incorporated into a single

More information

3. human genomics clone genes associated with genetic disorders. 4. many projects generate ordered clones that cover genome

3. human genomics clone genes associated with genetic disorders. 4. many projects generate ordered clones that cover genome Lectures 30 and 31 Genome analysis I. Genome analysis A. two general areas 1. structural 2. functional B. genome projects a status report 1. 1 st sequenced: several viral genomes 2. mitochondria and chloroplasts

More information

Lecture #1. Introduction to microarray technology

Lecture #1. Introduction to microarray technology Lecture #1 Introduction to microarray technology Outline General purpose Microarray assay concept Basic microarray experimental process cdna/two channel arrays Oligonucleotide arrays Exon arrays Comparing

More information

Lecture 8: Sequencing and SNP. Sept 15, 2006

Lecture 8: Sequencing and SNP. Sept 15, 2006 Lecture 8: Sequencing and SNP Sept 15, 2006 Announcements Random questioning during literature discussion sessions starts next week for real! Schedule changes Moved QTL lecture up Removed landscape genetics

More information

Gene-centered resources at NCBI

Gene-centered resources at NCBI COURSE OF BIOINFORMATICS a.a. 2014-2015 Gene-centered resources at NCBI We searched Accession Number: M60495 AT NCBI Nucleotide Gene has been implemented at NCBI to organize information about genes, serving

More information

Answers to additional linkage problems.

Answers to additional linkage problems. Spring 2013 Biology 321 Answers to Assignment Set 8 Chapter 4 http://fire.biol.wwu.edu/trent/trent/iga_10e_sm_chapter_04.pdf Answers to additional linkage problems. Problem -1 In this cell, there two copies

More information

Péter Antal Ádám Arany Bence Bolgár András Gézsi Gergely Hajós Gábor Hullám Péter Marx András Millinghoffer László Poppe Péter Sárközy BIOINFORMATICS

Péter Antal Ádám Arany Bence Bolgár András Gézsi Gergely Hajós Gábor Hullám Péter Marx András Millinghoffer László Poppe Péter Sárközy BIOINFORMATICS Péter Antal Ádám Arany Bence Bolgár András Gézsi Gergely Hajós Gábor Hullám Péter Marx András Millinghoffer László Poppe Péter Sárközy BIOINFORMATICS The Bioinformatics book covers new topics in the rapidly

More information

Sequence Assembly and Alignment. Jim Noonan Department of Genetics

Sequence Assembly and Alignment. Jim Noonan Department of Genetics Sequence Assembly and Alignment Jim Noonan Department of Genetics james.noonan@yale.edu www.yale.edu/noonanlab The assembly problem >>10 9 sequencing reads 36 bp - 1 kb 3 Gb Outline Basic concepts in genome

More information

The Polymerase Chain Reaction. Chapter 6: Background

The Polymerase Chain Reaction. Chapter 6: Background The Polymerase Chain Reaction Chapter 6: Background PCR Amplify= Polymerase Chain Reaction (PCR) Invented in 1984 Applications Invention of PCR Kary Mullis Mile marker 46.58 in April of 1983 Pulled off

More information

Chapter 15 Gene Technologies and Human Applications

Chapter 15 Gene Technologies and Human Applications Chapter Outline Chapter 15 Gene Technologies and Human Applications Section 1: The Human Genome KEY IDEAS > Why is the Human Genome Project so important? > How do genomics and gene technologies affect

More information

Bio 311 Learning Objectives

Bio 311 Learning Objectives Bio 311 Learning Objectives This document outlines the learning objectives for Biol 311 (Principles of Genetics). Biol 311 is part of the BioCore within the Department of Biological Sciences; therefore,

More information

Introduction to Bioinformatics

Introduction to Bioinformatics Introduction to Bioinformatics Richard Corbett Canada s Michael Smith Genome Sciences Centre Vancouver, British Columbia June 28, 2017 Our mandate is to advance knowledge about cancer and other diseases

More information

Molecular Markers CRITFC Genetics Workshop December 9, 2014

Molecular Markers CRITFC Genetics Workshop December 9, 2014 Molecular Markers CRITFC Genetics Workshop December 9, 2014 Molecular Markers Tools that allow us to collect information about an individual, a population, or a species Application in fisheries mating

More information

PV92 PCR Bio Informatics

PV92 PCR Bio Informatics Purpose of PCR Chromosome 16 PV92 PV92 PCR Bio Informatics Alu insert, PV92 locus, chromosome 16 Introduce the polymerase chain reaction (PCR) technique Apply PCR to population genetics Directly measure

More information

Read Mapping and Variant Calling. Johannes Starlinger

Read Mapping and Variant Calling. Johannes Starlinger Read Mapping and Variant Calling Johannes Starlinger Application Scenario: Personalized Cancer Therapy Different mutations require different therapy Collins, Meredith A., and Marina Pasca di Magliano.

More information

Linking Genetic Variation to Important Phenotypes

Linking Genetic Variation to Important Phenotypes Linking Genetic Variation to Important Phenotypes BMI/CS 776 www.biostat.wisc.edu/bmi776/ Spring 2018 Anthony Gitter gitter@biostat.wisc.edu These slides, excluding third-party material, are licensed under

More information

By the end of this lecture you should be able to explain: Some of the principles underlying the statistical analysis of QTLs

By the end of this lecture you should be able to explain: Some of the principles underlying the statistical analysis of QTLs (3) QTL and GWAS methods By the end of this lecture you should be able to explain: Some of the principles underlying the statistical analysis of QTLs Under what conditions particular methods are suitable

More information

Mutation entries in SMA databases Guidelines for national curators

Mutation entries in SMA databases Guidelines for national curators 1 Mutation entries in SMA databases Guidelines for national curators GENERAL CONSIDERATIONS Role of the curator(s) of a national database Molecular data can be collected by many different ways. There are

More information

Mendel & Inheritance. SC.912.L.16.1 Use Mendel s laws of segregation and independent assortment to analyze patterns of inheritance.

Mendel & Inheritance. SC.912.L.16.1 Use Mendel s laws of segregation and independent assortment to analyze patterns of inheritance. Mendel & Inheritance SC.912.L.16.1 Use Mendel s laws of segregation and independent assortment Mendel s Law of Segregation: gene pairs separate when gametes (sex cells) are formed; each gamete as only

More information

MICROSATELLITE MARKER AND ITS UTILITY

MICROSATELLITE MARKER AND ITS UTILITY Your full article ( between 500 to 5000 words) - - Do check for grammatical errors or spelling mistakes MICROSATELLITE MARKER AND ITS UTILITY 1 Prasenjit, D., 2 Anirudha, S. K. and 3 Mallar, N.K. 1,2 M.Sc.(Agri.),

More information

Association mapping of Sclerotinia stalk rot resistance in domesticated sunflower plant introductions

Association mapping of Sclerotinia stalk rot resistance in domesticated sunflower plant introductions Association mapping of Sclerotinia stalk rot resistance in domesticated sunflower plant introductions Zahirul Talukder 1, Brent Hulke 2, Lili Qi 2, and Thomas Gulya 2 1 Department of Plant Sciences, NDSU

More information

Why can GBS be complicated? Tools for filtering, error correction and imputation.

Why can GBS be complicated? Tools for filtering, error correction and imputation. Why can GBS be complicated? Tools for filtering, error correction and imputation. Edward Buckler USDA-ARS Cornell University http://www.maizegenetics.net Many Organisms Are Diverse Humans are at the lower

More information

Genetic Equilibrium: Human Diversity Student Version

Genetic Equilibrium: Human Diversity Student Version Genetic Equilibrium: Human Diversity Student Version Key Concepts: A population is a group of organisms of the same species that live and breed in the same area. Alleles are alternate forms of genes. In

More information

Hands-On Four Investigating Inherited Diseases

Hands-On Four Investigating Inherited Diseases Hands-On Four Investigating Inherited Diseases The purpose of these exercises is to introduce bioinformatics databases and tools. We investigate an important human gene and see how mutations give rise

More information

LATE-PCR. Linear-After-The-Exponential

LATE-PCR. Linear-After-The-Exponential LATE-PCR Linear-After-The-Exponential A Patented Invention of the Laboratory of Human Genetics and Reproductive Biology Lab. Director: Lawrence J. Wangh, Ph.D. Department of Biology, Brandeis University,

More information

Finishing Drosophila Ananassae Fosmid 2728G16

Finishing Drosophila Ananassae Fosmid 2728G16 Finishing Drosophila Ananassae Fosmid 2728G16 Kyle Jung March 8, 2013 Bio434W Professor Elgin Page 1 Abstract For my finishing project, I chose to finish fosmid 2728G16. This fosmid carries a segment of

More information

7 Gene Isolation and Analysis of Multiple

7 Gene Isolation and Analysis of Multiple Genetic Techniques for Biological Research Corinne A. Michels Copyright q 2002 John Wiley & Sons, Ltd ISBNs: 0-471-89921-6 (Hardback); 0-470-84662-3 (Electronic) 7 Gene Isolation and Analysis of Multiple

More information

Bioinformatics for Proteomics. Ann Loraine

Bioinformatics for Proteomics. Ann Loraine Bioinformatics for Proteomics Ann Loraine aloraine@uab.edu What is bioinformatics? The science of collecting, processing, organizing, storing, analyzing, and mining biological information, especially data

More information

Gen e e n t e i t c c V a V ri r abi b li l ty Biolo l gy g Lec e tur u e e 9 : 9 Gen e et e ic I n I her e itan a ce

Gen e e n t e i t c c V a V ri r abi b li l ty Biolo l gy g Lec e tur u e e 9 : 9 Gen e et e ic I n I her e itan a ce Genetic Variability Biology 102 Lecture 9: Genetic Inheritance Asexual reproduction = daughter cells genetically identical to parent (clones) Sexual reproduction = offspring are genetic hybrids Tendency

More information

PHYSICIAN RESOURCES MAPPED TO GENOMICS COMPETENCIES AND GAPS IDENTIFIED WITH CURRENT EDUCATIONAL RESOURCES AVAILABLE 06/04/14

PHYSICIAN RESOURCES MAPPED TO GENOMICS COMPETENCIES AND GAPS IDENTIFIED WITH CURRENT EDUCATIONAL RESOURCES AVAILABLE 06/04/14 PHYSICIAN RESOURCES MAPPED TO GENOMICS COMPETENCIES AND GAPS IDENTIFIED WITH CURRENT EDUCATIONAL RESOURCES AVAILABLE 06/04/14 Submitted resources from physician organization representatives were mapped

More information

Genomic Research: Issues to Consider. IRB Brown Bag August 28, 2014 Sharon Aufox, MS, LGC

Genomic Research: Issues to Consider. IRB Brown Bag August 28, 2014 Sharon Aufox, MS, LGC Genomic Research: Issues to Consider IRB Brown Bag August 28, 2014 Sharon Aufox, MS, LGC Outline Key genomic terms and concepts Issues in genomic research Consent models Types of findings Returning results

More information

Polymerase Chain Reaction (PCR) and Its Applications

Polymerase Chain Reaction (PCR) and Its Applications Polymerase Chain Reaction (PCR) and Its Applications What is PCR? PCR is an exponentially progressing synthesis of the defined target DNA sequences in vitro. It was invented in 1983 by Dr. Kary Mullis,

More information

Personal Genomics Platform White Paper Last Updated November 15, Executive Summary

Personal Genomics Platform White Paper Last Updated November 15, Executive Summary Executive Summary Helix is a personal genomics platform company with a simple but powerful mission: to empower every person to improve their life through DNA. Our platform includes saliva sample collection,

More information

Chp 10 Patterns of Inheritance

Chp 10 Patterns of Inheritance Chp 10 Patterns of Inheritance Dogs, one of human s longest genetic experiments Over 1,000 s of years, humans have chosen and mated dogs with specific traits. A process called -artificial selection The

More information

MAS refers to the use of DNA markers that are tightly-linked to target loci as a substitute for or to assist phenotypic screening.

MAS refers to the use of DNA markers that are tightly-linked to target loci as a substitute for or to assist phenotypic screening. Marker assisted selection in rice Introduction The development of DNA (or molecular) markers has irreversibly changed the disciplines of plant genetics and plant breeding. While there are several applications

More information

Book chapter appears in:

Book chapter appears in: Mass casualty identification through DNA analysis: overview, problems and pitfalls Mark W. Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA 29 August 2007 2007 Cybergenetics Book chapter appears in:

More information

Next Generation Sequencing. Target Enrichment

Next Generation Sequencing. Target Enrichment Next Generation Sequencing Target Enrichment Next Generation Sequencing Your Partner in Every Step from Sample to Data NGS: Revolutionizing Genetic Analysis with Single-Molecule Resolution Next generation

More information

Principles of Population Genetics

Principles of Population Genetics Principles of Population Genetics Leo P ten Kate, MD, PhD Em. Prof. of Clinical Genetics VU University Medical Center Amsterdam, the Netherlands Training Course in Sexual and Reproductive Health Research

More information

BA, BSc, and MSc Degree Examinations

BA, BSc, and MSc Degree Examinations Examination Candidate Number: Desk Number: BA, BSc, and MSc Degree Examinations 2017-8 Department : BIOLOGY Title of Exam: Genetics Time Allowed: 1 hour and 30 minutes Marking Scheme: Total marks available

More information

Observing Patterns In Inherited Traits

Observing Patterns In Inherited Traits Observing Patterns In Inherited Traits Ø Where Modern Genetics Started/ Gregor Mendel Ø Law of Segregation Ø Law of Independent Assortment Ø Non-Mendelian Inheritance Ø Complex Variations in Traits Genetics:

More information

B I O I N F O R M A T I C S

B I O I N F O R M A T I C S B I O I N F O R M A T I C S Kristel Van Steen, PhD 2 Montefiore Institute - Systems and Modeling GIGA - Bioinformatics ULg kristel.vansteen@ulg.ac.be SUPPLEMENTARY CHAPTER: DATA BASES AND MINING 1 What

More information

Genome Sequence Assembly

Genome Sequence Assembly Genome Sequence Assembly Learning Goals: Introduce the field of bioinformatics Familiarize the student with performing sequence alignments Understand the assembly process in genome sequencing Introduction:

More information

Introduction to the UCSC genome browser

Introduction to the UCSC genome browser Introduction to the UCSC genome browser Dominik Beck NHMRC Peter Doherty and CINSW ECR Fellow, Senior Lecturer Lowy Cancer Research Centre, UNSW and Centre for Health Technology, UTS SYDNEY NSW AUSTRALIA

More information

Terminology: chromosome; gene; allele; proteins; enzymes

Terminology: chromosome; gene; allele; proteins; enzymes Title Workshop on genetic disease and gene therapy Authors Danielle Higham (BSc Genetics), Dr. Maggy Fostier Contact Maggy.fostier@manchester.ac.uk Target level KS4 science, GCSE (or A-level) Publication

More information

Introduction to BIOINFORMATICS

Introduction to BIOINFORMATICS Introduction to BIOINFORMATICS Antonella Lisa CABGen Centro di Analisi Bioinformatica per la Genomica Tel. 0382-546361 E-mail: lisa@igm.cnr.it http://www.igm.cnr.it/pagine-personali/lisa-antonella/ What

More information

Introduction to Pharmacogenetics Competency

Introduction to Pharmacogenetics Competency Introduction to Pharmacogenetics Competency Updated on 6/2015 Pre-test Question # 1 Pharmacogenetics is the study of how genetic variations affect drug response a) True b) False Pre-test Question # 2 Pharmacogenetic

More information

Basic Concepts of Human Genetics

Basic Concepts of Human Genetics Basic oncepts of Human enetics The genetic information of an individual is contained in 23 pairs of chromosomes. Every human cell contains the 23 pair of chromosomes. ne pair is called sex chromosomes

More information

Lawrence Berkeley National Laboratory Lawrence Berkeley National Laboratory

Lawrence Berkeley National Laboratory Lawrence Berkeley National Laboratory Lawrence Berkeley National Laboratory Lawrence Berkeley National Laboratory Title SNP-VISTA: An Interactive SNPs Visualization Tool Permalink https://escholarship.org/uc/item/74z965bg Authors Shah, Nameeta

More information

Population and Community Dynamics. The Hardy-Weinberg Principle

Population and Community Dynamics. The Hardy-Weinberg Principle Population and Community Dynamics The Hardy-Weinberg Principle Key Terms Population: same species, same place, same time Gene: unit of heredity. Controls the expression of a trait. Can be passed to offspring.

More information

Molecular Biology: DNA sequencing

Molecular Biology: DNA sequencing Molecular Biology: DNA sequencing Author: Prof Marinda Oosthuizen Licensed under a Creative Commons Attribution license. SEQUENCING OF LARGE TEMPLATES As we have seen, we can obtain up to 800 nucleotides

More information

CHAPTER 21 LECTURE SLIDES

CHAPTER 21 LECTURE SLIDES CHAPTER 21 LECTURE SLIDES Prepared by Brenda Leady University of Toledo To run the animations you must be in Slideshow View. Use the buttons on the animation to play, pause, and turn audio/text on or off.

More information

3I03 - Eukaryotic Genetics Repetitive DNA

3I03 - Eukaryotic Genetics Repetitive DNA Repetitive DNA Satellite DNA Minisatellite DNA Microsatellite DNA Transposable elements LINES, SINES and other retrosequences High copy number genes (e.g. ribosomal genes, histone genes) Multifamily member

More information

Genome research in eukaryotes

Genome research in eukaryotes Functional Genomics Genome and EST sequencing can tell us how many POTENTIAL genes are present in the genome Proteomics can tell us about proteins and their interactions The goal of functional genomics

More information

can be found from OMIM (Online Mendelian Inheritance in Man),

can be found from OMIM (Online Mendelian Inheritance in Man), Lectures 4 & 5 Wednesday, October 5, 2011 & Friday, October 7, 2011 Forces causing gene frequency change Mutation Random mating does not cause allele frequencies to change, but other forces do. Mutation

More information

Genomic resources and gene/qtl discovery in cereals

Genomic resources and gene/qtl discovery in cereals Genomic resources and gene/qtl discovery in cereals Roberto Tuberosa Dept. of Agroenvironmental Sciences & Technology University of Bologna, Italy The ABDC Congress 1-4 March 2010 Gudalajara, Mexico Outline

More information

Outline. Evolution. Adaptive convergence. Common similarity problems. Chapter 7: Similarity searches on sequence databases

Outline. Evolution. Adaptive convergence. Common similarity problems. Chapter 7: Similarity searches on sequence databases Chapter 7: Similarity searches on sequence databases All science is either physics or stamp collection. Ernest Rutherford Outline Why is similarity important BLAST Protein and DNA Interpreting BLAST Individualizing

More information

Table of Contents. Chapter: Heredity. Section 1: Genetics. Section 2: Genetics Since Mendel. Section 3: Biotechnology

Table of Contents. Chapter: Heredity. Section 1: Genetics. Section 2: Genetics Since Mendel. Section 3: Biotechnology Table of Contents Chapter: Heredity Section 1: Genetics Section 2: Genetics Since Mendel Section 3: Biotechnology 1 Genetics Inheriting Traits Eye color, nose shape, and many other physical features are

More information

Variant calling in NGS experiments

Variant calling in NGS experiments Variant calling in NGS experiments Jorge Jiménez jjimeneza@cipf.es BIER CIBERER Genomics Department Centro de Investigacion Principe Felipe (CIPF) (Valencia, Spain) 1 Index 1. NGS workflow 2. Variant calling

More information

Genetics Lecture 21 Recombinant DNA

Genetics Lecture 21 Recombinant DNA Genetics Lecture 21 Recombinant DNA Recombinant DNA In 1971, a paper published by Kathleen Danna and Daniel Nathans marked the beginning of the recombinant DNA era. The paper described the isolation of

More information

Exploring the Genetic Basis of Congenital Heart Defects

Exploring the Genetic Basis of Congenital Heart Defects Exploring the Genetic Basis of Congenital Heart Defects Sanjay Siddhanti Jordan Hannel Vineeth Gangaram szsiddh@stanford.edu jfhannel@stanford.edu vineethg@stanford.edu 1 Introduction The Human Genome

More information

HLA and Next Generation Sequencing it s all about the Data

HLA and Next Generation Sequencing it s all about the Data HLA and Next Generation Sequencing it s all about the Data John Ord, NHSBT Colindale and University of Cambridge BSHI Annual Conference Manchester September 2014 Introduction In 2003 the first full public

More information

Expressed Sequence Tags: Clustering and Applications

Expressed Sequence Tags: Clustering and Applications 12 Expressed Sequence Tags: Clustering and Applications Anantharaman Kalyanaraman Iowa State University Srinivas Aluru Iowa State University 12.1 Introduction... 12-1 12.2 Sequencing ESTs... 12-2 12.3

More information

Guided tour to Ensembl

Guided tour to Ensembl Guided tour to Ensembl Introduction Introduction to the Ensembl project Walk-through of the browser Variations and Functional Genomics Comparative Genomics BioMart Ensembl Genome browser http://www.ensembl.org

More information

Chapter 14: Genes in Action

Chapter 14: Genes in Action Chapter 14: Genes in Action Section 1: Mutation and Genetic Change Mutation: Nondisjuction: a failure of homologous chromosomes to separate during meiosis I or the failure of sister chromatids to separate

More information

Higher Human Biology Unit 1: Human Cells Pupils Learning Outcomes

Higher Human Biology Unit 1: Human Cells Pupils Learning Outcomes Higher Human Biology Unit 1: Human Cells Pupils Learning Outcomes 1.1 Division and Differentiation in Human Cells I can state that cellular differentiation is the process by which a cell develops more

More information

Targeted resequencing

Targeted resequencing Targeted resequencing Sarah Calvo, Ph.D. Computational Biologist Vamsi Mootha laboratory Snapshots of Genome Wide Analysis in Human Disease (MPG), 4/20/2010 Vamsi Mootha, PI How can I assess a small genomic

More information

Observing Patterns in Inherited Traits. Chapter 11

Observing Patterns in Inherited Traits. Chapter 11 Observing Patterns in Inherited Traits Chapter 11 Impacts, Issues: The Color of Skin Like most human traits, skin color has a genetic basis; more than 100 gene products affect the synthesis and deposition

More information

GENETICS. I. Review of DNA/RNA A. Basic Structure DNA 3 parts that make up a nucleotide chains wrap around each other to form a

GENETICS. I. Review of DNA/RNA A. Basic Structure DNA 3 parts that make up a nucleotide chains wrap around each other to form a GENETICS I. Review of DNA/RNA A. Basic Structure DNA 3 parts that make up a nucleotide 1. 2. 3. chains wrap around each other to form a Chains run in opposite direction known as Type of bond between the

More information

Uniparental disomy (UPD) analysis of chromosome 15

Uniparental disomy (UPD) analysis of chromosome 15 YOUR INNOVATIVE RESEARCH Analysis of UPD by STR-PCR Uniparental disomy (UPD) analysis of chromosome 15 Applied Biosystems 3500xL Genetic Analyzer Introduction Researchers at the Laboratory of Medical Genetics

More information

Application for Automating Database Storage of EST to Blast Results. Vikas Sharma Shrividya Shivkumar Nathan Helmick

Application for Automating Database Storage of EST to Blast Results. Vikas Sharma Shrividya Shivkumar Nathan Helmick Application for Automating Database Storage of EST to Blast Results Vikas Sharma Shrividya Shivkumar Nathan Helmick Outline Biology Primer Vikas Sharma System Overview Nathan Helmick Creating ESTs Nathan

More information