Fossils From Vindija Cave, Croatia (38 44 kya) Admixture between Archaic and Modern Humans

Similar documents
Detecting ancient admixture using DNA sequence data

Themes. Homo erectus. Jin and Su, Nature Reviews Genetics (2000)

The genetic cost of Neanderthal introgression

Learning about human population history from ancient and modern genomes

LETTER. An early modern human from Romania with a recent Neanderthal ancestor

Zebrafish and Skin Color Reference Data

Genetic Variation. Genetic Variation within Populations. Population Genetics. Darwin s Observations

Linkage Disequilibrium. Adele Crane & Angela Taravella

IN recent years, prodigious technological advances have

TEST FORM A. 2. Based on current estimates of mutation rate, how many mutations in protein encoding genes are typical for each human?

Park /12. Yudin /19. Li /26. Song /9

Supplementary Figures

Random Allelic Variation

Petar Pajic 1 *, Yen Lung Lin 1 *, Duo Xu 1, Omer Gokcumen 1 Department of Biological Sciences, University at Buffalo, Buffalo, NY.

The Red Queen Model of Recombination Hotspots Evolution in the Light of Archaic and Modern Human Genomes

CS273B: Deep Learning in Genomics and Biomedicine. Recitation 1 30/9/2016

MUTATION RATES. Genetica per Scienze Naturali a.a prof S. Presciuttini

can be found from OMIM (Online Mendelian Inheritance in Man),

Overview One of the promises of studies of human genetic variation is to learn about human history and also to learn about natural selection.

Gen e e n t e i t c c V a V ri r abi b li l ty Biolo l gy g Lec e tur u e e 9 : 9 Gen e et e ic I n I her e itan a ce

Conifer Translational Genomics Network Coordinated Agricultural Project

File S1 Technical details of a SNP array optimized for population genetics

An introduction to genetics and molecular biology

Quiz will begin at 10:00 am. Please Sign In

How about the genes? Biology or Genes? DNA Structure. DNA Structure DNA. Proteins. Life functions are regulated by proteins:

% Viability. isw2 ino isw2 ino isw2 ino isw2 ino mM HU 4-NQO CPT

11.1 Genetic Variation Within Population. KEY CONCEPT A population shares a common gene pool.

The landscape of Neandertal ancestry in present-day humans

Computational Workflows for Genome-Wide Association Study: I

5/18/2017. Genotypic, phenotypic or allelic frequencies each sum to 1. Changes in allele frequencies determine gene pool composition over generations

Chapter 14: Genes in Action

Lecture 10: Introduction to Genetic Drift. September 28, 2012

Introduction to population genetics. CRITFC Genetics Training December 13-14, 2016

3I03 - Eukaryotic Genetics Repetitive DNA

Lecture 10 Molecular evolution. Jim Watson, Francis Crick, and DNA

Ohio s State Tests ITEM RELEASE SPRING 2017 BIOLOGY

homology - implies common ancestry. If you go back far enough, get to one single copy from which all current copies descend (premise 1).

DNA - The Double Helix

Concepts: What are RFLPs and how do they act like genetic marker loci?

Lecture 21: Association Studies and Signatures of Selection. November 6, 2006

Review. Molecular Evolution and the Neutral Theory. Genetic drift. Evolutionary force that removes genetic variation

DNA - The Double Helix

Section KEY CONCEPT A population shares a common gene pool.

Name: Date: Pd: Nucleic acids

DNA - The Double Helix

Runs of Homozygosity Analysis Tutorial

Review of Old Information: What is the monomer and polymer of: Macromolecule Monomer Polymer Carbohydrate Lipid Protein

Stalking the Genetic Basis of a Trait

CRISPR/Cas9 Genome Editing: Transfection Methods

Measurement of Molecular Genetic Variation. Forces Creating Genetic Variation. Mutation: Nucleotide Substitutions

Lecture 2: Biology Basics Continued

Keystone Biology Remediation B2: Genetics

Genetic Variation and Genome- Wide Association Studies. Keyan Salari, MD/PhD Candidate Department of Genetics

Midterm 1 Results. Midterm 1 Akey/ Fields Median Number of Students. Exam Score

Inconsistencies in Neanderthal Genomic DNA Sequences

What is DNA??? DNA = Deoxyribonucleic acid IT is a molecule that contains the code for an organism s growth and function

By the end of this lecture you should be able to explain: Some of the principles underlying the statistical analysis of QTLs

COMPUTER SIMULATIONS AND PROBLEMS

DNA - The Double Helix

Multiple choice questions (numbers in brackets indicate the number of correct answers)

Nucleic acids. What important polymer is located in the nucleus? is the instructions for making a cell's.

Mapping and Mapping Populations

1) Genetic Drift. Genetic Drift - population with stable size ~ 10

Bio 102 Practice Problems Genetic Code and Mutation

Exam 2 3/19/07 P. Sengupta BISC 4A

Genetics 101. Prepared by: James J. Messina, Ph.D., CCMHC, NCC, DCMHS Assistant Professor, Troy University, Tampa Bay Site

ISAG FAO Genetic Diversity

The neutral theory of molecular evolution

QUESTIONS 16 THROUGH 30 FROM EXAM 3 OF FALL, 2010

Fundamentals of Genetics. 4. Name the 7 characteristics, giving both dominant and recessive forms of the pea plants, in Mendel s experiments.

Genetic variation, genetic drift (summary of topics)

Answer: Sequence overlap is required to align the sequenced segments relative to each other.

8/21/2014. From Gene to Protein

Section 10.3 Outline 10.3 How Is the Base Sequence of a Messenger RNA Molecule Translated into Protein?

4.1. Genetics as a Tool in Anthropology

Pre-Lab Questions. 1. Use the following data to construct a cladogram of the major plant groups.

Review of Protein (one or more polypeptide) A polypeptide is a long chain of..

Genome Patterns of Selection and Introgression of Haplotypes in Natural Populations of the House Mouse (Mus musculus)

Fig State the precise name of each of the parts of the DNA molecule labelled X, Y and Z. X... Y... Z... [3]

The Modern Synthesis. Terms and Concepts. Evolutionary Processes. I. Introduction: Where do we go from here? What do these things have in common?

DNA Structure & the Genome. Bio160 General Biology

Data Retrieval from GenBank

Basic Concepts and History of Genetic Engineering. Mitesh Shrestha

Genetic load. For the organism as a whole (its genome, and the species), what is the fitness cost of deleterious mutations?

Protein Synthesis

POPULATION GENETICS. Evolution Lectures 4

The human noncoding genome defined by genetic diversity

LACTASE PERSISTENCE: EVIDENCE FOR SELECTION

Neutral theory: The neutral theory does not say that all evolution is neutral and everything is only due to to genetic drift.

Investigating Common Descent: Formulating Explanations and Model

Practice Test #3. Multiple Choice Identify the choice that best completes the statement or answers the question.

M I C R O B I O L O G Y WITH DISEASES BY TAXONOMY, THIRD EDITION

DNA. Introduction. The Structure of DNA. From DNA to Gene. Making Copies of DNA

Nature Biotechnology: doi: /nbt.3943

SNPs - GWAS - eqtls. Sebastian Schmeier

The Theory of Evolution

Human SNP haplotypes. Statistics 246, Spring 2002 Week 15, Lecture 1

5 FINGERS OF EVOLUTION

They are similar to one another but different from other species: They are capable of breeding: Artificial classification: Natural classification:

Transcription:

Fossils From Vindija Cave, Croatia (38 44 kya) Admixture between Archaic and Modern Humans Alan R Rogers February 12, 2018 1 / 63 2 / 63 Hominin tooth from Denisova Cave, Altai Mtns, southern Siberia (41 kya) Hominin finger from Denisova Cave 3 / 63 4 / 63 Sites yielding Archaic hominin DNA Outline Estimating admixture from shared derived alleles Deep separation plus extensive LD Selection against archaic DNA Excess Neanderthal in Asia Archaic adaptations in modern humans 5 / 63 6 / 63

Nucleotide site patterns Nucleotide Site Pattern ea en an Eur 1 1 0 Afr 1 0 1 Nea 0 1 1 Chmp 0 0 0 # 303,340 103,612 95,347 Ancestral allele (0) is shared with chimp Derived allele (1) shared by two human populations Pattern ea: most common; reflects history of population splits Patterns en & an: how do they arise? Why does en exceed an? Population tree A E N C A, Africa; E, Europe; N, Neanderthal; C, chimpanzee 7 / 63 8 / 63 Embedded gene genealogy with mutation Incomplete lineage sorting A E N C 1 1 0 0 Genealogy of 4 genes shown in color Bullet ( ) marks mutation from allele 0 to allele 1 Descendants of mutant have allele 1; others have 0 Gene genealogy matches phylogeny Mutant allele shared by closest relatives, A and E Pattern an Pattern en A E N C A E N C 1 0 1 0 0 1 1 0 These two should be equally common 9 / 63 10 / 63 Nucleotide site patterns again Nucleotide Site Pattern ea en an European 1 1 0 African 1 0 1 Neanderthal 0 1 1 Chimpanzee 0 0 0 # sites 303,340 103,612 95,347 Common pattern (ea) reflects history of population splits Absent admixture, the other two should be equally common Why does en exceed an? Neanderthal admixture inflates en site pattern Admixed locus m A E N C 0 1 1 0 Not admixed m A E N C 0 1 1 0 Key: A, Africa; E, Europe; N, Neanderthal; C, chimpanzee 11 / 63 12 / 63

Estimate from Neandertal DNA Eurasians share some derived alleles with archaics DNA of modern Eurasians is 15 21% Neandertal (Prüfer et al 2014) Same is true for modern people of east Asia and Papua New Guinea, but not Africa Green et al (2010) Admixture must have occurred after moderns left Africa but before they expanded throughout the world Eurasian introgressed segments most similar to Neanderthal from Caucasus (Mezmaiskaya) (Prüfer et al 2014) Neanderthal matches French 46% more often than Yoruban (African) Denisova matches French 18% more often than Yoruban (African) Archaic component of Eurasian genome more Neanderthal than Denisovan Green et al 2010; Reich et al 2010 13 / 63 14 / 63 So do Asians and Papuans Gradual decline in Neanderthal admixture Asians and Papuans carry as many Neanderthal alleles as Europeans do: 15 21% 15 / 63 16 / 63 Denisovan DNA most common in Australia, NG, and Oceania Outline Estimating admixture from shared derived alleles Deep separation plus extensive LD Selection against archaic DNA Excess Neanderthal in Asia Archaic adaptations in modern humans 17 / 63 18 / 63

Another approach: look for deep separation plus extensive LD OAS1 innate immunity locus Examine variation in modern human DNA Look for long segments of chromosome with many nucleotide differences Many nucleotide differences deep separation Long segment extensive LD Extensive LD short residence in modern population two forms of gene in Melanesia: one shared with rest of world one only in Melanesia 19 / 63 20 / 63 Worldwide frequency of Melanesian OAS1 allele Melanesian OAS1 allele w/i Melanesia 21 / 63 22 / 63 Melanesian OAS1 allele is old yet young The 2 alleles differ at many nucleotide sites separation time 34 my Long (90 kb) LD block they ve been together only 25 ky Melanesian allele matches that in Denisovan hominin skeleton archaic admixture into Melanesia Introgressed segments in Eurasian genomes Blue, Europe; red, Asia Inserts are dense in some regions, scarce in others Vernot and Akey (2014) 23 / 63 24 / 63

Outline Selection against hybrids Estimating admixture from shared derived alleles Deep separation plus extensive LD Selection against archaic DNA Excess Neanderthal in Asia Archaic adaptations in modern humans Vernot and Akey (2014) Introgressed segments rare where modern-neandertal difference large 25 / 63 26 / 63 Outline Neanderthal inserts rare where selection is strong Sankararaman et al (2014) Estimating admixture from shared derived alleles Deep separation plus extensive LD Selection against archaic DNA Excess Neanderthal in Asia Archaic adaptations in modern humans Decreasing selection 27 / 63 28 / 63 Asians have more Neandertal than Europeans Did Asians receive a 2nd dose on Neanderthal DNA? Vernot and Akey (2014) Vernot and Akey (2014) 29 / 63 30 / 63

Why do Asians have more Neanderthal DNA than Europeans? 2-pulse model fits; 1-pulse model doesn t Vernot and Akey (2014) Hypotheses 1 2nd pulse of Neanderthal admixture into Asia 2 European dilution 2nd pulse of non-neanderthal admixture into Europe 3 Purifying selection Selection against Neanderthal alleles more effective in Europe, because of larger population size Two recent papers have tested hypothesis 3 31 / 63 32 / 63 Has selection removed Neanderthal DNA in Europe Measuring these effects Selection removes variation Other things equal, variation should be low where selection has been strong Hypothesis: such regions should contain fewer Neanderthal inserts in Europe than in Asia Vernot and Akey (2015) B measures the strength of purifying selection B = 0 means strong selection; B = 1 means no selection R ind > 1 means the average Asian has more Neanderthal DNA than the average European R pop > 1 means the Asian population includes more of the Neanderthal genome than the European one 33 / 63 34 / 63 Illustration of R ind and R pop (not real data) Selection doesn t explain Asian excess Selective constraint decreases from left to right R ind = 12 Average Asian has 20% more Neanderthal DNA than average European R pop = 1 Asia and Europe include same fraction of Neanderthal genome Decreasing selection No effect on R ind Selection not responsible for Asian excess (Vernot et al 2015) 35 / 63 36 / 63

Decreasing selection Selective constraint decreases from left to right R pop declines where selection is weak Vernot and Akey attribute this to greater drift in Asia (But why does this affect only neutral loci?) 2nd source of admixture λ κ ζ m N δ α m D X Y N D X, Africa; Y, Europe; N, Neanderthal; D, Denisova 37 / 63 38 / 63 Primary archaic admixture Secondary (ghost) archaic admixture λ ζ δ α m N m D X Y N D 0 1 1 κ Red: derived Blue: ancestral λ ζ δ α m N m D X Y N D 0 1 1 κ 39 / 63 40 / 63 Ghost admixture causes bias E[R N ] 03 02 01 m N m D 00 01 03 05 bias Neanderthal-Denisovan separation time (units of N XYND generations) Ghost admixture causes bias Admixture: 5% Neanderthal & 25% Denisovan Red line and circles: Expected value of estimator of m N Horizontal dashed line: true parameter value The difference is bias If East Asians received genes from Denisovans as well as Neanderthals, our estimates of Neanderthal admixture would be inflated May explain Asian excess of Neanderthal admixture 41 / 63 42 / 63

Outline Estimating admixture from shared derived alleles Deep separation plus extensive LD Selection against archaic DNA Excess Neanderthal in Asia Archaic adaptations in modern humans Neanderthal adaptations in modern humans BCN2: affects skin color POU2F3: affects skin cells (keratin) Both affect regulatory regions, not protein itself Vernot and Akey (2014, Fig 4) 43 / 63 44 / 63 Gene categories in Neanderthal inserts Outline Neanderthal Pathway insert Europe East Asia Keratin filament *** *** Nucleic acid binding * * RNA processing ** * Ribonucleoprotein complex *** * Organelle part *** * Intracellular organelle part *** * mrna metabolic process *** * Nuclear lumen * * Nuclear part ** * Key: *, FWER < 005; **, FWER < 001; ***, FWER < 0001;, enriched;, depleted Sankararaman et al (2014) Estimating admixture from shared derived alleles Deep separation plus extensive LD Selection against archaic DNA Excess Neanderthal in Asia Archaic adaptations in modern humans 45 / 63 46 / 63 Consequences of small population size Estimated times to most recent common ancestor (TMRCA) Prüfer et al (2014) Low heterozygosity Drift strong, so deleterious mutations accumulate Extinction? Long stretches of low TMRCA recent close inbreeding (Prüfer et al 2014) 47 / 63 48 / 63

Altai Neanderthal was very closely inbred Long history of small population size All of these pedigrees are plausible (Prüfer et al 2014) Even after removing the effects of inbreeding during the last few generations, the Altai Neandertal is still highly inbred 49 / 63 50 / 63 Other archaics were less inbred Coding sequence variation in archaics Altai Neanderthal had many runs of homozygosite longer than 10 cm A centimorgan (cm) is about a million nucleotides Castellano et al (2014) study 17,367 protein-coding genes in several Neanderthals, the Denisovan, and several moderns Vindija Neanderthal had fewer such runs Denisovan had none All had runs in range 25 10 cm 51 / 63 52 / 63 Neanderthals had low heterozygosity Archaics had low heterozygosity Heterozygosity Species Population 1000 Neanderthal El Sidrón 0143 Vindija 0127 Altai 0113 Modern African 0507 European 0387 Asian 0358 Low heterozygosity small population Long runs of homozygosity recent close inbreeding Castellano et al (2014) 53 / 63 54 / 63

Tree based on mutations (left) and drift (right) Selection less effective in Neanderthals Neighbor-joining tree F ST tree Long archaic branches on F ST tree small population size Castellano et al (2014) 55 / 63 56 / 63 Selection less effective in Neanderthals Many Neanderthal alleles have large effect on protein structure and are probably deleterious (Castellano et al 2014) How many deleterious mutations would you expect? How large would these deterious effects tend to be? Simulation designed to answer these questions (Harris and Nielsen 2015) Simulated fitnesses of Neanderthals and moderns 57 / 63 A few alleles of large effect, or many small effects? 58 / 63 59 / 63 60 / 63

Most of the action is where N A s 1 Neanderthal effects strongest on brain Largest effect on fitness: 00001 < s < 0001 Or: 01 < N A s < 1, where N A is size of archaic population Genetic disease dominated by alleles with small effect (Simonti et al 2016) 61 / 63 62 / 63 Summary We can detect archaic admixture in (at least) two ways: 1 Archaic and modern populations share derived alleles more often that expected by chance 2 Deep separation plus extensive LD Neanderthals 15 21% of DNA in Eurasia; not Africa Denisovan DNA most common in Melanesia and Oceania Archaic DNA uneven along chromosome: rare where selection is strong or modern-archaic difference large Neanderthal in Asia than Europe, probably because Asians received a 2nd pulse of admixture Some archaic alleles have been favored by selection: skin color and skin texture Small archaic populations many deleterious alleles Genetic disease dominated by genes with small effect 63 / 63