An Introduction to Population Genetics

Similar documents
Park /12. Yudin /19. Li /26. Song /9

A Primer of Ecological Genetics

Lecture 10: Introduction to Genetic Drift. September 28, 2012

Why do we need statistics to study genetics and evolution?

Introduction to Population Genetics. Spezielle Statistik in der Biomedizin WS 2014/15

Genetic drift. 1. The Nature of Genetic Drift

(c) Suppose we add to our analysis another locus with j alleles. How many haplotypes are possible between the two sites?

Conifer Translational Genomics Network Coordinated Agricultural Project

Questions we are addressing. Hardy-Weinberg Theorem

Papers for 11 September

5/18/2017. Genotypic, phenotypic or allelic frequencies each sum to 1. Changes in allele frequencies determine gene pool composition over generations

Variation Chapter 9 10/6/2014. Some terms. Variation in phenotype can be due to genes AND environment: Is variation genetic, environmental, or both?

b. (3 points) The expected frequencies of each blood type in the deme if mating is random with respect to variation at this locus.

Molecular Evolution. COMP Fall 2010 Luay Nakhleh, Rice University

Chapter 25 Population Genetics

A brief introduction to population genetics

Conifer Translational Genomics Network Coordinated Agricultural Project

Introduction to Quantitative Genetics

Overview of using molecular markers to detect selection

Population Genetics. Ben Hecht CRITFC Genetics Training December 11, 2013

POPULATION GENETICS studies the genetic. It includes the study of forces that induce evolution (the

Constancy of allele frequencies: -HARDY WEINBERG EQUILIBRIUM. Changes in allele frequencies: - NATURAL SELECTION

Lecture 19: Hitchhiking and selective sweeps. Bruce Walsh lecture notes Synbreed course version 8 July 2013

TEST FORM A. 2. Based on current estimates of mutation rate, how many mutations in protein encoding genes are typical for each human?

The evolutionary significance of structure. Detecting and describing structure. Implications for genetic variability

Neutrality Test. Neutrality tests allow us to: Challenges in neutrality tests. differences. data. - Identify causes of species-specific phenotype

Molecular Signatures of Natural Selection

Lecture WS Evolutionary Genetics Part I - Jochen B. W. Wolf 1

SYLLABUS AND SAMPLE QUESTIONS FOR JRF IN BIOLOGICAL ANTHROPOLGY 2011

Explaining the evolution of sex and recombination

BST227 Introduction to Statistical Genetics. Lecture 3: Introduction to population genetics

COMPUTER SIMULATIONS AND PROBLEMS

University of York Department of Biology B. Sc Stage 2 Degree Examinations

Distinguishing Among Sources of Phenotypic Variation in Populations

What is genetic variation?

Genotype AA Aa aa Total N ind We assume that the order of alleles in Aa does not play a role. The genotypic frequencies follow as

BST227 Introduction to Statistical Genetics. Lecture 3: Introduction to population genetics

PopGen1: Introduction to population genetics

HWE Tutorial (October 2007) Mary Jo Zurbey PharmD Candidate 2008

Population Structure and Gene Flow. COMP Fall 2010 Luay Nakhleh, Rice University

Random Allelic Variation

POPULATION GENETICS. Evolution Lectures 1

Population genetics. Population genetics provides a foundation for studying evolution How/Why?

Population Genetics. If we closely examine the individuals of a population, there is almost always PHENOTYPIC

Linkage Disequilibrium. Adele Crane & Angela Taravella

POPULATION GENETICS. Evolution Lectures 4

Quantitative Genetics, Genetical Genomics, and Plant Improvement

Methods for the analysis of nuclear DNA data

LAB ACTIVITY ONE POPULATION GENETICS AND EVOLUTION 2017

Mathematical Population Genetics

Edexcel (B) Biology A-level

Balancing and disruptive selection The HKA test

Lecture 23: Causes and Consequences of Linkage Disequilibrium. November 16, 2012

Introduction to population genetics. CRITFC Genetics Training December 13-14, 2016

Two-locus models. Two-locus models. Two-locus models. Two-locus models. Consider two loci, A and B, each with two alleles:

Model based inference of mutation rates and selection strengths in humans and influenza. Daniel Wegmann University of Fribourg

LABORATORY 8. POPULATION GENETICS AND EVOLUTION

Population Genetics Matthew B. Hamilton

arxiv: v1 [q-bio.pe] 16 Jul 2013

EXERCISE 1. Testing Hardy-Weinberg Equilibrium. 1a. Fill in Table 1. Calculate the initial genotype and allele frequencies.

Algorithms for Genetics: Introduction, and sources of variation

Exam 1, Fall 2012 Grade Summary. Points: Mean 95.3 Median 93 Std. Dev 8.7 Max 116 Min 83 Percentage: Average Grade Distribution:

Genetic drift 10/13/2014. Random factors in evolution. Sampling error. Genetic drift. Random walk. Genetic drift

Signatures of a population bottleneck can be localised along a recombining chromosome

Introductory Models, Effective Population Size

On the Power to Detect SNP/Phenotype Association in Candidate Quantitative Trait Loci Genomic Regions: A Simulation Study

Lecture #3 1/23/02 Dr. Kopeny Model of polygenic inheritance based on three genes

HISTORICAL LINGUISTICS AND MOLECULAR ANTHROPOLOGY

1) (15 points) Next to each term in the left-hand column place the number from the right-hand column that best corresponds:

Hardy Weinberg Equilibrium

MECHANISMS FOR EVOLUTION CHAPTER 20

Summary. Introduction

7-1. Read this exercise before you come to the laboratory. Review the lecture notes from October 15 (Hardy-Weinberg Equilibrium)

GENETICS - CLUTCH CH.21 POPULATION GENETICS.

AP BIOLOGY Population Genetics and Evolution Lab

Natural selection and the distribution of Identity By. Descent in the human genome

Evolutionary Mechanisms

GREG GIBSON SPENCER V. MUSE

Quantitative Genetics for Using Genetic Diversity

Population Genetics II. Bio

The Evolution of Populations

Bio 312, Exam 3 ( 1 ) Name:

1. BASICS OF POPULATION GENETICS.

EVOLUTION/HERDEDITY UNIT Unit 1 Part 8A Chapter 23 Activity Lab #11 A POPULATION GENETICS AND EVOLUTION

Understanding genetic association studies. Peter Kamerman

Lecture 5: Inbreeding and Allozymes. Sept 1, 2006

Let s call the recessive allele r and the dominant allele R. The allele and genotype frequencies in the next generation are:

Neutral mutations in ideal populations. The null model in population genetics The Fisher-Wright population model

The Evolution of Populations

SLiM: Simulating Evolution with Selection and Linkage

Hardy-Weinberg Principle

Genetics - Fall 2004 Massachusetts Institute of Technology Professor Chris Kaiser Professor Gerry Fink Professor Leona Samson

Measurement of Molecular Genetic Variation. Forces Creating Genetic Variation. Mutation: Nucleotide Substitutions

AP Biology Laboratory 8 Population Genetics Virtual Student Guide

GENE FLOW AND POPULATION STRUCTURE

Statistical Methods for Quantitative Trait Loci (QTL) Mapping

POPULATION GENETICS Winter 2005 Lecture 18 Quantitative genetics and QTL mapping

Lab 2: Mathematical Modeling: Hardy-Weinberg 1. Overview. In this lab you will:

Basics in Population Genetics. Teruyoshi Hishiki

Lecture 12: Effective Population Size and Gene Flow. October 5, 2012

Transcription:

An Introduction to Population Genetics THEORY AND APPLICATIONS f 2 A (1 ) E 1 D [ ] = + 2M ES [ ] fa fa = 1 sf a Rasmus Nielsen Montgomery Slatkin Sinauer Associates, Inc. Publishers Sunderland, Massachusetts U.S.A.

Brief Contents Chapter 1 Allele Frequencies, Genotype Frequencies, and Hardy Weinberg Equilibrium 5 Chapter 2 Genetic Drift and Mutation 21 Chapter 3 Coalescence Theory: Relating Theory to Data 35 Chapter 4 Population Subdivision 59 Chapter 5 Inferring Population History and Demography 77 Chapter 6 Linkage Disequilibrium and Gene Mapping 107 Chapter 7 Selection I 129 Chapter 8 Selection in a Finite Population 153 Chapter 9 The Neutral Theory and Tests of Neutrality 179 Chapter 10 Selection II: Interactions and Conflict 195 Chapter 11 Quantitative Genetics 215 Appendix A Basic Probability Theory 233 Appendix B The Exponential Distribution and Coalescence Times 245 Appendix C Maximum Likelihood and Bayesian Estimation 249 Appendix D Critical Values of the Chi-Square Distribution with d Degrees of Freedom 255

Contents Preface xi CHAPTER 1 Introduction 1 Types of Genetic Data 1 Detecting Differences in Genotype 2 Allele Frequencies, Genotype Frequencies, and Hardy Weinberg Equilibrium 5 Allele Frequencies 6 Genotype Frequencies 6 K-Allelic Loci 7 Example: The MC1R Gene 7 Hardy Weinberg Equilibrium 8 The MC1R Gene Revisited 9 BOX 1.1 Probability and Independence 10 BOX 1.2 Derivation of HWE Genotype Frequencies 11 Tay Sachs Disease 11 Extensions and Generalizations of HWE 12 Deviations from HWE 1: Assortative Mating 12 Deviations from HWE 2: Inbreeding 13 Deviations from HWE 3: Population Structure 13 Deviations from HWE 4: Selection 14 The Inbreeding Coefficient 15 Testing for Deviations from HWE 16 BOX 1.3 The Chi-Square Test 17 Using Allele Frequencies to Identify Individuals 18

Contents vii CHAPTER 2 Genetic Drift and Mutation 21 The Wright Fisher Model 22 Genetic Drift and Expected Allele Frequencies 23 BOX 2.1 Expectation 24 Patterns of Genetic Drift in the Wright Fisher Model 24 Effect of Population Size in the Wright Fisher Model 25 Mutation 27 Effects of Mutation on Allele Frequency 28 Probability of Fixation 29 Species Divergence and the Rate of Substitution 30 The Molecular Clock 30 Dating the Human Chimpanzee Divergence Time 31 CHAPTER 3 Coalescence Theory: Relating Theory to Data 35 Coalescence in a Sample of Two Chromosomes (n = 2) 36 Coalescence in Large Populations 38 Mutation, Genetic Variability, and Population Size 40 Infinite Sites Model 41 The Tajima s Estimator 42 The Concept of Effective Population Size 43 Interpreting Estimates of q 46 The Infinite Alleles Model and Expected Heterozygosity 47 The Coalescence Process in a Sample of n Individuals 49 The Coalescence Tree and the tmrca 50 Total Tree Length and the Number of Segregating Sites 51 The Site Frequency Spectrum (SFS) 53 Tree Shape as a Function of Population Size 55 CHAPTER 4 Population Subdivision 59 The Wahlund Effect 59 F ST : Quantifying Population Subdivision 60 The Wright Fisher Model with Migration 63 The Coalescence Process with Migration 64 Expected Coalescence Times for n = 2 66 F ST and Migration Rates 68 Divergence Models 70 Expected Coalescence Times, Pairwise Difference and F ST in Divergence Models 71 Isolation by Distance 72

viii Contents CHAPTER 5 Inferring Population History and Demography 77 Inferring Demography Using Summary Statistics 77 Coalescence Simulations and Confidence Intervals 79 BOX 5.1 Simulating Coalescence Trees 80 Estimating Evolutionary Trees 81 BOX 5.2 The UPGMA Method for Estimating Trees 83 Gene Trees vs. Species Trees 84 Interpreting Estimated Trees from Population Genetic Data 88 Likelihood and the Felsenstein Equation 92 MCMC and Bayesian Methods 94 The Effect of Recombination 97 Population Assignment, Clustering, and Admixture 99 CHAPTER 6 Linkage Disequilibrium and Gene Mapping 107 Linkage Disequilibrium 108 BOX 6.1 Coefficients of Linkage Disequilibrium 109 BOX 6.2 LD Coefficients for Two Diallelic Loci 110 BOX 6.3 r 2 as a Correlation Coefficient 112 Evolution of D 112 BOX 6.4 r 2 and c 2 113 BOX 6.5 Change in D Due to Random Mating 114 BOX 6.6 Recurrent Mutation Reduces D 116 Two-Locus Wahlund Effect 116 BOX 6.7 Two-Locus Wahlund Effect 117 Genealogical Interpretation of LD 118 Recombination 118 Association Mapping 121 BOX 6.8 Example of a Case Control Test 123 CHAPTER 7 Selection I 129 Selection in Haploids 129 Selection in Diploids 132 BOX 7.1 Haploid Selection 133 BOX 7.2 One Generation of Viability Selection 135 BOX 7.3 Algebraic Calculation of Allele Frequency Changes 136 BOX 7.4 Special Cases of Selection 137 BOX 7.5 Genic Selection 138 BOX 7.6 Heterozygote Advantage 142 BOX 7.7 Estimates of Selection Coefficients for the S Allele in a West African Population 143 Mutation Selection Balance 144

Contents ix CHAPTER 8 Selection in a Finite Population 153 Fixation Probabilities of New Mutations 153 BOX 8.1 Simulating Trajectories 154 Rates of Substitution of Selected Alleles 161 BOX 8.2 Accounting for Multiple Substitutions 162 BOX 8.3 Computing Synonymous and Nonsynonymous Rates 164 Genetic Hitchhiking 166 Selective Sweeps 166 BOX 8.4 Hitchhiking in a Haploid Population 168 Partial Sweeps 170 Associative Overdominance 171 BOX 8.5 Estimating the Age of a Mutation 172 CHAPTER 9 The Neutral Theory and Tests of Neutrality 179 The HKA Test 182 The MacDonald Kreitman (MK) Test 183 The Site Frequency Spectrum (SFS) 184 Tajima s D Test 186 Tests Based on Genetic Differentiation among Populations 188 Tests Using LD and Haplotype Structure 190 CHAPTER 10 Selection II: Interactions and Conflict 195 Selection on Sex Ratio 195 Resolving Conflicts 198 BOX 10.1 The Prisoner s Dilemma 200 Kin Selection 202 Selfish Genes 205 Meiotic Drive 205 Transposons 207 Species Formation 208 CHAPTER 11 Quantitative Genetics 215 Biometrical Analysis 216 BOX 11.1 Normal Distribution 217 BOX 11.2 Variance of the Mid-parental Value 220 Breeding Value 222 Quantitative Trait Loci 224 Multiple Quantitative Trait Loci 227

x Contents Genotype-Environment Interactions 228 Mapping Quantitative Trait Loci 229 BOX 11.3 Mapping Alleles When Starting with Homozygous Populations 230 APPENDIX A Basic Probability Theory 233 The Binomial RV 234 APPENDIX B PMF: Bernoulli 235 PMF: Binomial 235 Expectation 237 Variance 239 The Poisson RV 240 PMF: Poisson 240 The Geometric RV 241 PMF: Geometric 241 The Exponential Distribution and Coalescence Times 245 APPENDIX C Maximum Likelihood and Bayesian Estimation 249 Bayesian Estimation 252 APPENDIX D Critical Values of the Chi-Square Distribution with d Degrees of Freedom 255 Solutions to Odd-Numbered Exercises 257 Glossary 271 Credits 279 Index 281