Introduction to Genetics. Bruce Walsh lecture notes Uppsala EQG course version 28 Jan 2012

Similar documents
Lecture 1 Basic Concepts in Mendelian, Population and Quantitative Genetics

Genetics & The Work of Mendel

Topic 11. Genetics. I. Patterns of Inheritance: One Trait Considered

MENDELIAN GENETICS This presentation contains copyrighted material under the educational fair use exemption to the U.S. copyright law.

Linkage & Genetic Mapping in Eukaryotes. Ch. 6

Genetics II: Linkage and the Chromosomal Theory

GENETICS AND MENDEL 2/4/2018. Mendel s Experiment. Genetic Terms. Genetic Terms. Mendel: Experiment 1 HISTORY OF DISCOVERERY OF HEREDITY

Human linkage analysis. fundamental concepts

LINKAGE AND CHROMOSOME MAPPING IN EUKARYOTES

Ch. 14 Mendel and the Gene Idea

1/21/ Exploring Mendelian Genetics. What is the principle of independent assortment? Independent Assortment. Biology.

Inheritance Biology. Unit Map. Unit

Chapter 14: Mendel and the Gene Idea

Midterm#1 comments#2. Overview- chapter 6. Crossing-over

Would expect variation to disappear Variation in traits persists (Example: freckles show up in unfreckled parents offspring!)

Mendel and the Gene Idea

Genetics Essentials 9/10/13. Concepts and Connections. Mendel and His Study of Heredity. The Case of the Red Hair. Before we Continue

Observing Patterns In Inherited Traits

Chapter 02 Mendel s Principles of Heredity

Concepts: What are RFLPs and how do they act like genetic marker loci?

Exploring Mendelian Genetics. Dihybrid crosses. Dihybrid crosses

Non Mendelian Genetics

Gregor Mendel. Austrian Monk Worked with pea plants

Biol 1408 : Chapter 9 Patterns of Inheritance

Observing Patterns in Inherited Traits. Chapter 11

Genetics Test. Multiple Choice Identify the choice that best completes the statement or answers the question.

GENETICS AND MENDEL 2/20/2013. Mendel s Experiment. Genetic Terms. How is each group the same? How is each group different?

Linkage & Crossing over

Classical (Mendelian) Genetics. Gregor Mendel

Mendel & Inheritance. SC.912.L.16.1 Use Mendel s laws of segregation and independent assortment to analyze patterns of inheritance.

Basic Concepts of Human Genetics

Gene Linkage and Genetic. Mapping. Key Concepts. Key Terms. Concepts in Action

Midterm 1 Results. Midterm 1 Akey/ Fields Median Number of Students. Exam Score

Genetics. Genetics- is the study of all manifestation of inheritance from the distributions of traits to the molecules of the gene itself

LECTURE 5: LINKAGE AND GENETIC MAPPING

Introduction to Genetics. DANILO V. ROGAYAN JR. Faculty, Department of Natural Sciences

Overview of Human Genetics

Basic Concepts of Human Genetics

Introduction. Thomas Hunt Morgan. Chromosomes and Inheritance. Drosophila melanogaster

Chapter 14. Mendel and the Gene Idea

Chp 10 Patterns of Inheritance

Sept 2. Structure and Organization of Genomes. Today: Genetic and Physical Mapping. Sept 9. Forward and Reverse Genetics. Genetic and Physical Mapping

Gen e e n t e i t c c V a V ri r abi b li l ty Biolo l gy g Lec e tur u e e 9 : 9 Gen e et e ic I n I her e itan a ce

Answers to additional linkage problems.

Estimation of Genetic Recombination Frequency with the Help of Logarithm Of Odds (LOD) Method

The information in this document is meant to cover topic 4 and topic 10 of the IB syllabus. Details of meiosis are found in Notes for Cells.

4.1. Genetics as a Tool in Anthropology

-Genes on the same chromosome are called linked. Human -23 pairs of chromosomes, ~35,000 different genes expressed.

Why Pea Plants? Mendel chose to study garden peas, because: 1. They reproduce & have a short life cycle 1

Mendel and The Gene Idea

Exploring Mendelian Genetics 11-3

#3: Random Fertilization. If DNA replication and cell division are both so precise, and so accurate, why are we all so unique??

7-1. Read this exercise before you come to the laboratory. Review the lecture notes from October 15 (Hardy-Weinberg Equilibrium)

B.6.F predict possible outcomes of various genetic combinations such as monohybrid crosses, dihybrid crosses and non Mendelian inheritance

Inheritance (IGCSE Biology Syllabus )

FINDING THE PAIN GENE How do geneticists connect a specific gene with a specific phenotype?

Genetics Sperm Meiotic cell division Egg Chromosome Segments of DNA Code DNA for traits Code for a trait Gene

Genetics of dairy production

Active Learning Exercise 8 Mendelian Genetics & the Chromosomal Basis of Inheritance

Biology Genetics Practice Quiz

Genetics. What DNA is telling us!

Genetics: the scientific study of heredity

What DNA is telling us!

DO NOT OPEN UNTIL TOLD TO START

Conifer Translational Genomics Network Coordinated Agricultural Project

Chapter 11 Reading Guide: Mendel and the Gene Idea

Genetics is the study of heredity

Genetics - Problem Drill 05: Genetic Mapping: Linkage and Recombination

Quiz will begin at 10:00 am. Please Sign In

Scrambling information

Answers DNA and Genes

Name: Bio AP Mendelian Genetics & Chromosomal Inheritance

Why peas? Pea S by Mendel. F 1 generation

Unit 10: Genetics. Chapter 9: Read P

TEST FORM A. 2. Based on current estimates of mutation rate, how many mutations in protein encoding genes are typical for each human?

Chapter 6 Linkage and Chromosome Mapping in Eukaryotes

four chromosomes ` four chromosomes correct markers (sister chromatids identical!)

Central Dogma of genetics: DNA -> Transcription -> RNA -> Translation > Protein

HARDY-WEINBERG EQUILIBRIUM

Genetics & The Work of Mendel

Q.2: Write whether the statement is true or false. Correct the statement if it is false.

Fundamentals of Genetics. 4. Name the 7 characteristics, giving both dominant and recessive forms of the pea plants, in Mendel s experiments.

The Modern Synthesis. Terms and Concepts. Evolutionary Processes. I. Introduction: Where do we go from here? What do these things have in common?

Experimental Design and Sample Size Requirement for QTL Mapping

12 The Chromosomal Basis of Inheritance

Lecture 3 Monohybrid and Dihybrid Crosses

POPULATION GENETICS: The study of the rules governing the maintenance and transmission of genetic variation in natural populations.

Kosambi and the Genetic Mapping Function

We can use a Punnett Square to determine how the gametes will recombine in the next, or F2 generation.

Read each question, and write your answer in the space provided. 2. How did Mendel s scientific work differ from the work of T. A. Knight?

DNA segment: T A C T G T G G C A A A

Genotype AA Aa aa Total N ind We assume that the order of alleles in Aa does not play a role. The genotypic frequencies follow as

Exam 1 Answers Biology 210 Sept. 20, 2006

By the end of this lecture you should be able to explain: Some of the principles underlying the statistical analysis of QTLs

Mendelian Genetics. What is Gregor Mendel known for and what organism did he use? When did Mendel conduct most of his work?

POPULATION GENETICS Winter 2005 Lecture 18 Quantitative genetics and QTL mapping

Mendel & Nonmendelian Problems

Chapter 14: Mendel and the Gene Idea

Figure 1: Testing the CIT: T.H. Morgan s Fruit Fly Mating Experiments

Transcription:

Introduction to Genetics Bruce Walsh lecture notes Uppsala EQG course version 28 Jan 2012

Darwin and Mendel Mendel genetics Topics Mendel's experiments Mendel's laws Genes and chromosomes Linkage Prior probability of linkage Genes and DNA Basics of DNA structure Types of Genetic markers

Darwin & Mendel Darwin (1859) Origin of Species Instant Classic, major immediate impact Problem: Model of Inheritance Darwin assumed Blending inheritance Offspring = average of both parents z o = (z m + z f )/2 Fleming Jenkin (1867) pointed out problem Var(z o ) = Var[(z m + z f )/2] = (1/2) Var(parents) Hence, under blending inheritance, half the variation is removed each generation and this must somehow be replenished by mutation.

Mendel

Mendel Mendel (1865), Experiments in Plant Hybridization No impact, paper essentially ignored Ironically, Darwin had an apparently unread copy in his library Why ignored? Perhaps too mathematical for 19th century biologists The rediscovery in 1900 (by three independent groups) Mendel s key idea: Genes are discrete particles passed on intact from parent to offspring

Mendel s experiments with the Garden Pea 7 traits examined

Mendel crossed a pure-breeding yellow pea line with a pure-breeding green line. Let P1 denote the pure-breeding yellow (parental line 1) P2 the pure-breed green (parental line 2) The F1, or first filial, generation is the cross of P1 x P2 (yellow x green). All resulting F1 were yellow The F2, or second filial, generation is a cross of two F1 s In F2, 1/4 are green, 3/4 are yellow This outbreak of variation blows the theory of blending inheritance right out of the water.

Mendel also observed that the P1, F1 and F2 Yellow lines behaved differently when crossed to pure green P1 yellow x P2 (pure green) --> all yellow F1 yellow x P2 (pure green) --> 1/2 yellow, 1/2 green F2 yellow x P2 (pure green) --> 2/3 yellow, 1/3 green

Mendel s explanation Genes are discrete particles, with each parent passing one copy to its offspring. Let an allele be a particular copy of a gene. In Diploids, each parent carries two alleles for every gene Pure Yellow parents have two Y (or yellow) alleles We can thus write their genotype as YY Likewise, pure green parents have two g (or green) alleles Their genotype is thus gg Since there are lots of genes, we refer to a particular gene by given names, say the pea-color gene (or locus)

Each parent contributes one of its two alleles (at random) to its offspring Hence, a YY parent always contributes a Y, while a gg parent always contributes a g An individual carrying only one type of an allele (e.g. yy or gg) is said to be a homozygote In the F1, YY x gg --> all individuals are Yg An individual carrying two types of alleles is said to be a heterozygote.

The phenotype of an individual is the trait value we observe For this particular gene, the map from genotype to phenotype is as follows: YY --> yellow Yg --> yellow gg --> green Since the Yg heterozygote has the same phenotypic value as the YY homozygote, we say (equivalently) Y is dominant to g, or g is recessive to Y

Explaining the crosses F1 x F1 -> Yg x Yg Prob(YY) = yellow(dad)*yellow(mom) = (1/2)*(1/2) Prob(gg) = green(dad)*green(mom) = (1/2)*(1/2) Prob(Yg) = 1-Pr(YY) - Pr(gg) = 1/2 Prob(Yg) = yellow(dad)*green(mom) + green(dad)*yellow(mom) Hence, Prob(Yellow phenotype) = Pr(YY) + Pr(Yg) = 3/4 Prob(green phenotype) = Pr(gg) = 1/4

Review of terms (so far) Gene Locus Allele Homozygote Heterozygote Dominant Recessive Genotype Phenotype

In class problem (5 minutes) Explain why F2 yellow x P2 (pure green) - -> 2/3 yellow, 1/3 green F2 yellows are a mix, being either Yg or YY Prob(F2 yellow is Yg) = Pr(yellow Yg)*Pr(Yg in F2) Pr(Yellow) = (1* 1/2)/(3/4) = 2/3 2/3 of crosses are Yg x gg -> 1/2 Yg (yellow), 1/2 gg (green) 1/3 of crosses are YY x gg -> all Yg (yellow) Pr(yellow) = (2/3)*(1/2) + (1/3) = 2/3

Dealing with two (or more) genes For his 7 traits, Mendel observed Independent Assortment The genotype at one locus is independent of the second RR, Rr - round seeds, rr - wrinkled seeds Pure round, green (RRgg) x pure wrinkled yellow (rryy) F1 --> RrYg = round, yellow What about the F2?

Let R- denote RR and Rr. R- are round. Note in F2, Pr(R-) = 1/2 + 1/4 = 3/4 Likewise, Y- are YY or Yg, and are yellow Phenotype Yellow, round Yellow, wrinkled Green, round Green, wrinkled Genotype Y-R- Y-rr ggrggrr Frequency (3/4)*(3/4) = 9/16 (3/4)*(1/4) = 3/16 (1/4)*(3/4) = 3/16 (1/4)*(1/4) = 1/16 Or a 9:3:3:1 ratio

Probabilities for more complex genotypes Cross AaBBCcDD X aabbccdd What is Pr(aaBBCCDD)? Under independent assortment, = Pr(aa)*Pr(BB)*Pr(CC)*Pr(DD) = (1/2*1)*(1*1/2)*(1/2*1/2)*(1*1/2) = 1/2 5 What is Pr(AaBbCc)? = Pr(Aa)*Pr(Bb)*Pr(Cc) = (1/2)*(1/2)*(1/2) = 1/8

Mendel was wrong: Linkage Bateson and Punnet looked at flower color: P (purple) dominant over p (red ) pollen shape: L (long) dominant over l (round) Phenotype Genotype Observed Expected Purple long P-L- 284 215 Purple round P-ll 21 71 Red long ppl- 21 71 Red round ppll 55 24 Excess of PL, pl gametes over Pl, pl Departure from independent assortment

Interlude: Chromosomal theory of inheritance Early light microscope work on dividing cells revealed small (usually) rod-shaped structures that appear to pair during cell division. These are chromosomes. It was soon postulated that Genes are carried on chromosomes, because chromosomes behaved in a fashion that would generate Mendel s laws. We now know that each chromosome consists of a single double-stranded DNA molecule (covered with proteins), and it is this DNA that codes for the genes.

Humans have 23 pairs of chromosomes (for a total of 46) 22 pairs of autosomes (chromosomes 1 to 22) 1 pair of sex chromosomes -- XX in females, XY in males Humans also have another type of DNA molecule, namely the mitochondrial DNA genome that exists in tens to thousands of copies in the mitochondria present in all our cells mtdna is usual in that it is strictly maternally inherited. Offspring get only their mother s mtdna.

Linkage If genes are located on different chromosomes they (with very few exceptions) show independent assortment. Indeed, peas have only 7 chromosomes, so was Mendel lucky in choosing seven traits at random that happen to all be on different chromosomes? Exercise: compute this probability. However, genes on the same chromosome, especially if they are close to each other, tend to be passed onto their offspring in the same configuration as on the parental chromosomes.

Consider the Bateson-Punnet pea data Let PL / pl denote that in the parent, one chromosome carries the P and L alleles (at the flower color and pollen shape loci, respectively), while the other chromosome carries the p and l alleles. Unless there is a recombination event, one of the two parental chromosome types (PL or pl) are passed onto the offspring. These are called the parental gametes. However, if a recombination event occurs, a PL/pl parent can generate Pl and pl recombinant chromosomes to pass onto its offspring.

Let c denote the recombination frequency --- the probability that a randomly-chosen gamete from the parent is of the recombinant type (i.e., it is not a parental gamete). For a PL/pl parent, the gamete frequencies are Gamete type PL pl pl Pl Frequency (1-c)/2 (1-c)/2 c/2 c/2 Expectation under independent assortment 1/4 1/4 1/4 1/4

Parental gametes in excess, as (1-c)/2 > 1/4 for c < 1/2 Gamete type PL pl pl Pl Frequency (1-c)/2 (1-c)/2 c/2 c/2 Expectation under independent assortment 1/4 1/4 1/4 1/4 Recombinant gametes in deficiency, as c/2 < 1/4 for c < 1/2

Expected genotype frequencies under linkage Suppose we cross PL/pl X PL/pl parents What are the expected frequencies in their offspring? Pr(PPLL) = Pr(PL father)*pr(pl mother) = [(1-c)/2]*[(1-c)/2] = (1-c) 2 /4 Likewise, Pr(ppll) = (1-c) 2 /4 Recall from previous data that freq(ppll) = 55/381 =0.144 Hence, (1-c) 2 /4 = 0.144, or c = 0.24

A (slightly) more complicated case Again, assume the parents are both PL/pl. Compute Pr(PpLl) Two situations, as PpLl could be PL/pl or Pl/pL Pr(PL/pl) = Pr(PL dad)*pr(pl mom) + Pr(PL mom)*pr(pl dad) = [(1-c)/2]*[(1-c)/2] + [(1-c)/2]*[(1-c)/2] Pr(Pl/pL) = Pr(Pl dad)*pr(pl mom) + Pr(Pl mom)*pr(pl dad) = (c/2)*(c/2) + (c/2)*(c/2) Thus, Pr(PpLl) = (1-c) 2 /2 + c 2 /2

Generally, to compute the expected genotype probabilities, need to consider the frequencies of gametes produced by both parents. Suppose dad = Pl/pL, mom = PL/pl Pr(PPLL) = Pr(PL dad)*pr(pl mom) = [c/2]*[(1-c)/2] Notation: when PL/pl, we say that alleles P and L are in coupling When parent is Pl/pL, we say that P and L are in repulsion

Genetic Maps and Mapping Functions The unit of genetic distance between two markers is the recombination frequency, c (also called!) If the phase of a parent is AB/ab, then 1-c is the frequency of parental gametes (e.g., AB and ab), while c is the frequency of nonparental gametes (e.g.. Ab and ab). A parental gamete results from an EVEN number of crossovers, e.g., 0, 2, 4, etc. For a nonparental (also called a recombinant) gamete, need an ODD number of crossovers between A & b e.g., 1, 3, 5, etc.

Hence, simply using the frequency of recombinant (i.e. nonparental) gametes UNDERESTIMATES the m number of crossovers, with E[m] > c In particular, c = Prob(odd number of crossovers) Mapping functions attempt to estimate the expected number of crossovers m from observed recombination frequencies c When considering two linked loci, the phenomena of interference must be taken into account The presence of a crossover in one interval typically decreases the likelihood of a nearby crossover

Suppose the order of the genes is A-B-C. If there is no interference (i.e., crossovers occur independently of each other) then Probability(odd number of crossovers btw A and C) Even number of crossovers btw A & B, Odd number between B & C c AC = c AB (1 c BC ) + (1 c AB ) c BC = c AB + c BC 2c AB c BC odd number in A-B, even number in B-C

We need to assume independence of crossovers in order to multiply these two probabilities When interference is present, we can write this as c AC = c AB + c BC 2(1 δ)c AB c BC Interference parameter " = 1 --> complete interference: The presence of a crossover eliminates nearby crossovers " = 0 --> No interference. Crossovers occur independently of each other

Mapping functions. Moving from c to m Haldane s mapping function (gives Haldane map distances) Assume the number k of crossovers in a region follows a Poisson distribution with parameter m This makes the assumption of NO INTERFERENCE Pr(Poisson = k) = # k Exp[-#]/k! # = expected number of successes c = p(m, 2k + 1) = e m k=0 k=0 m 2 k+1 (2k + 1)! = 1 e 2m 2

Prob(Odd number of crossovers) c = p(m, 2k + 1) = e m k=0 k=0 m 2 k+1 (2k + 1)! = 1 e 2m 2 Odd number Relates recombination fraction c to expected number of crossovers m This gives the estimated Haldane distance as m = ln(1 2c) 2 Usually reported in units of Morgans or Centimorgans (Cm) One morgan --> m = 1.0. One cm --> m = 0.01

The Prior Probability of Linkage Morton (1955), in the context of linkage analysis in humans, introduced the concept of a Posterior Error Rate, or PER PER = probability that a test declared significant is a false positive, PER = Pr(false positive significant test) The screening paradox: type I error control may not lead to a suitably low PER With PER, conditioning on the test being significant, As opposed to!conditioning on the hypothesis being a null, as occurs with type I error control ($)

Let $ be the Type 1 error, % the type 2 error (1- % = power) And & be the fraction of null hypothesis, then from Bayes theorem PER = Pr(false positive significant) PER = Pr(false positive null True )* Pr(null) Pr(significant test) Since there are!23 pairs of human chromosomes, Morton argued that two randomly-chosen genes had a 1/23 (roughly 5%) prior probability of linkage, i.e. & = 0.95

Assuming $ type I error of a = 0.05 and 80% power (% = 0.2), the expected PER is 0.05*0.95 0.05*0.95 + 0.8*0.05 = 0.54 Hence, even with a 5% type-i error control, a random significant test has a 54% chance of being a false-positive. This is because most of the hypotheses are expected to null. If we draw 1000 random pairs of loci, 950!are expected to be unlinked, and we expect!950 * 0.05 = 47.5 of these to show a false-positive. Conversely, only 50 are expected to be linked, and we would declare 50 * 0.80 = 40 of these to be significant, so that 47.5/87.5 of the significant results are due to false-positives.

Molecular Markers You and your neighbor differ at roughly 22,000,000 nucleotides (base pairs) out of the roughly 3 billion bp that comprises the human genome Hence, LOTS of molecular variation to exploit SNP -- single nucleotide polymorphism. A particular position on the DNA (say base 123,321 on chromosome 1) that has two different nucleotides (say G or A) segregating STR -- simple tandem arrays. An STR locus consists of a number of short repeats, with alleles defined by the number of repeats. For example, you might have 6 and 4 copies of the repeat on your two chromosome 7s

SNPs vs STRs SNPs STRs Cons: Less polymorphic (at most 2 alleles) Pros: Low mutation rates, alleles very stable Excellent for looking at historical long-term associations (association mapping) Cons: High mutation rate Pros: Very highly polymorphic Excellent for linkage studies within an extended Pedigree (QTL mapping in families or pedigrees)