Mutation Rates and Sequence Changes

Similar documents
Reading for today. Adaptive Molecular Evolution. Predictions of neutral theory. The neutral theory of molecular evolution

Neutral theory: The neutral theory does not say that all evolution is neutral and everything is only due to to genetic drift.

Adaptive Molecular Evolution. Reading for today. Neutral theory. Predictions of neutral theory. The neutral theory of molecular evolution

Midterm exam BIOSCI 113/244 WINTER QUARTER,

What is molecular evolution? BIOL2007 Molecular Evolution. Modes of molecular evolution. Modes of molecular evolution

MATH 5610, Computational Biology

The neutral theory of molecular evolution

Neutrality Test. Neutrality tests allow us to: Challenges in neutrality tests. differences. data. - Identify causes of species-specific phenotype

BIOINFORMATICS 1 INTRODUCTION TO MOLECULAR EVOLUTION EVOLUTION BY DESCENT WITH MODIFICATION DUALITY OF MOLECULAR EVOLUTION DNA AS A GENETIC MATERIAL

Lecture 19: Hitchhiking and selective sweeps. Bruce Walsh lecture notes Synbreed course version 8 July 2013

Dynamic Programming Algorithms

Identifying genes that evolved under the influence of positive natural

You are genetically unique

2. The rate of gene evolution (substitution) is inversely related to the level of functional constraint (purifying selection) acting on the gene.

Molecular Evolution. Lectures Papers Lab. Dr. Walter Salzburger. Structure of the course: Structure i

Disease and selection in the human genome 3

Important points from last time

Evolutionary Genetics: Part 1 Polymorphism in DNA

Molecular Evolution. COMP Fall 2010 Luay Nakhleh, Rice University

Simulation Study of the Reliability and Robustness of the Statistical Methods for Detecting Positive Selection at Single Amino Acid Sites

Mutagenesis. Classification of mutation. Spontaneous Base Substitution. Molecular Mutagenesis. Limits to DNA Pol Fidelity.

Chapter 17. Gene Mutations and DNA Repair (Part 1) What does this tell us?

MSc in Genetics. Population Genomics of model species. Antonio Barbadilla. Course

HW2 mean (among turned-in hws): If I have a fitness of 0.9 and you have a fitness of 1.0, are you 10% better?

A Simple Method for Estimating the Intensity of Purifying Selection in Protein-Coding Genes

Amino Acid Sequences and Evolutionary Relationships

Basic concepts and terminology

Simple Methods for Estimating the Numbers of Synonymous and Nonsynonymous Nucleotide Substitutions

1 (1) 4 (2) (3) (4) 10


Selection against deleterious mutations leads to a mutation

Molecular Evolution. Wen-Hsiung Li

University of York Department of Biology B. Sc Stage 2 Degree Examinations

Untangling the Effects of Codon Mutation and Amino Acid Exchangeability. L.Y. Yampolsky and A. Stoltzfus

New Alleles? Mendel reasoned thus: Point Mutation

CS313 Exercise 1 Cover Page Fall 2017

Lecture 10 Molecular evolution. Jim Watson, Francis Crick, and DNA

Single Nucleotide Variant Analysis. H3ABioNet May 14, 2014

con t. Chapter 32 The Genetic Code Nature of the Genetic Code BCH 4054 Spring 2001 Chapter 32 Lecture Notes Slide 1 Slide 2

Expected Relationship Between the Silent Substitution Rate and the GC Content: Implications for the Evolution of Isochores

Figure S4 A-H : Initiation site properties and evolutionary changes

Minor Introns vs Major Introns

Chapter 10: Gene mutation 2002 by W. H. Freeman and Company Chapter 10: Gene mutation 2002 by W. H. Freeman and Company

Synonymous and Nonsynonymous Substitutions in Mammalian Genes: Intragenic Correlations

Positive and Negative Selection on Noncoding DNA in Drosophila simulans

Biological Sciences 50 Practice Exam 2

Amino Acid Sequences and Evolutionary Relationships. How do similarities in amino acid sequences of various species provide evidence for evolution?

Basic concepts of molecular biology

Unit 6 DNA ppt 3 Gene Expression and Mutations Chapter 8.6 & 8.7 pg

Package sscu. November 19, 2017

Outline. Pseudogenes. Pseudo-genes. The genetic code (DNA version) What is a gene? What is a gene? Dead genes Vitamin C Urate oxidase. Alan R.

Any questions? d N /d S ratio (ω) estimated across all sites is inefficient at detecting positive selection

CODON-BASED DETECTION OF POSITIVE SELECTION CAN BE BIASED BY HETEROGENEOUS DISTRIBUTION OF POLAR AMINO ACIDS ALONG PROTEIN SEQUENCES

Estimating Synonymous and Nonsynonymous Substitution Rates Under Realistic Evolutionary Models

Degenerate site - twofold degenerate site - fourfold degenerate site

2. Write an essay on reinforcement and its relationship to sympatric speciation.

Deleterious mutations

Intro to population genetics

466 Asn (N) to Ala (A) Generate beta dimer Interface

Polymorphism [Greek: poly=many, morph=form]

Overview of using molecular markers to detect selection

Human Genetic Variation. Ricardo Lebrón Dpto. Genética UGR

Codon usage and secondary structure of MS2 pfaage RNA. Michael Bulmer. Department of Statistics, 1 South Parks Road, Oxford OX1 3TG, UK

Genetic drift 10/13/2014. Random factors in evolution. Sampling error. Genetic drift. Random walk. Genetic drift

Introduction to Population Genetics. Spezielle Statistik in der Biomedizin WS 2014/15

Bi Lecture 3 Loss-of-function (Ch. 4A) Monday, April 8, 13

THE GENETIC CODE Figure 1: The genetic code showing the codons and their respective amino acids

Bio 312, Exam 3 ( 1 ) Name:

Biology Evolution Dr. Kilburn, page 1 Mutation and genetic variation

Lecture 21: Association Studies and Signatures of Selection. November 6, 2006

What determines if a mutation is deleterious, neutral, or beneficial?

b. (3 points) The expected frequencies of each blood type in the deme if mating is random with respect to variation at this locus.

Gene mutation and DNA polymorphism

Basic Concepts of Human Genetics

Mutation. ! Mutation occurs when a DNA gene is damaged or changed in such a way as to alter the genetic message carried by that gene

1/4/18 NUCLEIC ACIDS. Nucleic Acids. Nucleic Acids. ECS129 Instructor: Patrice Koehl

NUCLEIC ACIDS. ECS129 Instructor: Patrice Koehl

Park /12. Yudin /19. Li /26. Song /9

RNA ID missing Word ID missing Word DNA ID missing Word

CHAPTER 21 GENOMES AND THEIR EVOLUTION

Detecting amino acid sites under positive. selection and purifying selection

1) (15 points) Next to each term in the left-hand column place the number from the right-hand column that best corresponds:

CS 262 Lecture 14 Notes Human Genome Diversity, Coalescence and Haplotypes

Biology From gene to protein

Selective constraints on noncoding DNA of mammals. Peter Keightley Institute of Evolutionary Biology University of Edinburgh

Lac Operon contains three structural genes and is controlled by the lac repressor: (1) LacY protein transports lactose into the cell.

Outline. Detecting Selective Sweeps. Are we still evolving? Finding sweeping alleles

Charles Darwin. Charles Darwin. Charles Darwin. Darwin s Observations. Darwin s Observations 1/4/2018. Microevolution: Unique Gene Pools

Balancing and disruptive selection The HKA test

Comparative Multivariate Analysis of Codon and Amino Acid Usage in Three Leishmania Genomes

An Introduction to Population Genetics

Detecting selection on nucleotide polymorphisms

ENZYMES AND METABOLIC PATHWAYS

Patterns of Selection on Synonymous and Non-Synonymous

(a) Which enzyme(s) make 5' - 3' phosphodiester bonds? (c) Which enzyme(s) make single-strand breaks in DNA backbones?

Chapter 10: Gene Expression and Regulation

Proteomics Informatics (BMSC-GA 4437)

BS 50 Genetics and Genomics Week of Oct 24

Lecture #8 2/4/02 Dr. Kopeny

ECS 129: Structural Bioinformatics March 15, 2016

Transcription:

s and Sequence Changes part of Fortgeschrittene Methoden in der Bioinformatik Computational EvoDevo University Leipzig Leipzig, WS 2011/12

From Molecular to Population Genetics molecular level substitution rate population level mutation rate fixation rate retention positive selection drift nucleotide change mutation new, minor allele fixed allele substitution repair negative selection, drift discarded mutation discarded allele evolution rate functionally relevant substitution functionally neutral neutral substitution

Nucleotide Exchanges transition: exchange purine for purine (C T) or pyrimidine for pyrimidine (A G) transversion: exchange purine for pyrimidine or pyrimidine for purine (C T A G) synonymous substitution: nucleotide changes that are functionally neutral nonsynonymous substitution: nucleotide changes that change the function

Estimating s take two species that diverged a time T ago (i.e. had a common ancestor a time T ago) select regions that are 1:1 orthologs of each other (i.e. have a common ancestral sequence in the common ancestor and were not duplicated since) evolved neutrally (i.e. were not under positive or negative selection since their divergence from the common ancestor) can be aligned without errors count the number of substitutions correct for reversion and multiple mutations at the same site and biases devide the number of nucleotide exchanges (mutations) by T

Purifying versus Positive Selection I Selection can only occure at nonsynonymous sites. Mutations fixed by purifying selection: the rate of fixation of synonymous changes is greater than the rate of fixation of nonsynonymous changes (ω S < 1). Mutations fixed by positive selection: the rate of fixation of nonsynonymous changes is greater than the rate of fixation of synonymous changes (ω S > 1). ω S = d N d S (1) ω S... selection ratio d s... synonymous divergence per synonymous site d N... nonsynonymous divergence per nonsynonymous site

Purifying versus Positive Selection II The following would be more accurate: ω = d N/2T µ N (2) The selection ratio ω is the ratio of the rate of nonsynonymous substitutions per site d N to the rate of nonsynonymous mutations per site µ N. How can we estimate µ N?

4-fold Degenerate Sites?-fold degenerate site:? = the number of different nucleotides that can occure at the site without changing the protein sequence CU* Leu GU* Val UC* Ser CC* Pro AC* Thr GC* Ala CG* Arg GG* Gly Assumption: 4-fold degenerate sites are synonymous sites.

Nucleotide Occurence at Codon Positions in Drosophila melanogaster 1300 1st codon position 1300 3rd codon position 1200 1100 1000 A C G U 1200 1100 1000 A C G U 900 900 800 800 700 700 600 600 500 500 400 400 300 300 200 200 100 100 1300 0 0 0.1 0.2 0.3 0.4 0.5 0.6 2nd codon position 0-fold degenerate 0 0 0.1 0.2 0.3 0.4 0.5 0.6 part of 3rd codon position 4-fold degenerate 1300 1200 1100 1000 A C G U 1200 1100 1000 A C G U 900 900 800 800 700 700 600 600 500 500 400 400 300 300 200 200 100 100 0 0 0.1 0.2 0.3 0.4 0.5 0.6 0 0 0.1 0.2 0.3 0.4 0.5 0.6

Why are nucleotide frequences different for different codon positions? Potential Causes codon usage bias base composition bias selective constraints on other levels than the coding sequence

Estimating the Codon Usage Bias I Relative Synonymous Codon Usage (RSCU) j E(X ij ) = X ij (3) n ij RSCU ij = X ij E(X ij ) = X ij/(1/n i n X ij ) (4) i... index running over the 20 amino acids j i... index running over the codons for amino acid i n ij... the number of different codons for amino acid i X ij... observed number of codon j for amino acid i RSCU ij = 1 usage of codon j is neither prefered nor avoided RSCU ij > 1 codon j is used preferentially RSCU ij < 1 codon j is avoided j=1

Estimating the Base Composition Bias Base Composition Skew (BCS) BCS = n i {ACGT } (n i E(n i )) 2 (5) Sum of the squared deviation of the observed nucleotide frequency from the expected nucleotide frequency E(n A ) = E(n T ) = E(n C ) = E(n G ) = 0.25.

Genomic Mutation Distances d Sg = (1 f g )d µg (6) d Sg... synonymous distance for gene g acording to the Tamura-Nei model f g... fraction of mutations underestimated due to biases d µg... mutation distance for gene g f g = ηbcs g (7) BCS g... base compostion skew for gene g η... obtaind by divinding the absolute value of the slope of the linear regression of BSC on d S by the y-intercept of the regression line

References [Filipski, 2008] Alan Filipski, Sonja J. Prohaska and Sudhir Kumar. Molecular Signatures of Adaptive Evolution. in Evolutionary Genomics and Proteomics edited by Mark Pagel; Sinauer Associates, Inc. Sunderland 2008. Chapter 11, p241-254. [Tamura, 2004] Koichiro Tamura, Sankar Subramanian and Sudhir Kumar. Temporal Patterns of Fruit Fly (Drosophila) Evolution Revealed by Mutation Clock. Mol. Biol. Evol. 2004. 21(1), p36-44.