Association genetics in barley

Similar documents
Association Mapping in Wheat: Issues and Trends

Association Mapping in Plants PLSC 731 Plant Molecular Genetics Phil McClean April, 2010

Curriculum Vitae. El-Sayed Mansour El-Sayed Telephone: Name:

Linkage Disequilibrium

Improving barley and wheat germplasm for changing environments

Adaptation to new environments

ABSTRACT : 162 IQUIRA E & BELZILE F*

Public-Private Partnership in Pre-Breeding

Genome-wide association studies for agronomical traits in a world wide spring barley collection

GBS Usage Cases: Examples from Maize

Genetic dissection of complex traits, crop improvement through markerassisted selection, and genomic selection

SNP calling and Genome Wide Association Study (GWAS) Trushar Shah

Brassica carinata crop improvement & molecular tools for improving crop performance

Application of Genotyping-By-Sequencing and Genome-Wide Association Analysis in Tetraploid Potato

Genome-wide association mapping in barley

MAS refers to the use of DNA markers that are tightly-linked to target loci as a substitute for or to assist phenotypic screening.

Lecture 1 Introduction to Modern Plant Breeding. Bruce Walsh lecture notes Tucson Winter Institute 7-9 Jan 2013

The Interaction of Nitrogen with Yield in Wheat

Mapping and Mapping Populations

Genomic Selection in Breeding Programs BIOL 509 November 26, 2013

Identifying and exploiting natural variation

Diversity and Agronomic Potential of Barley (Hordeum vulgare L.) Landraces in Variable Production System, Ethiopia

Development of a cost-effective pyrosequencing approach for SNP genotyping in

Lecture 2: Height in Plants, Animals, and Humans. Michael Gore lecture notes Tucson Winter Institute version 18 Jan 2013

By the end of this lecture you should be able to explain: Some of the principles underlying the statistical analysis of QTLs

Fusarium head blight or scab, caused primarily by F. graminearum

Fusarium head blight or scab, caused primarily by F. graminearum

Genome Wide Association Study for Binomially Distributed Traits: A Case Study for Stalk Lodging in Maize

Prostate Cancer Genetics: Today and tomorrow

Usage Cases of GBS. Jeff Glaubitz Senior Research Associate, Buckler Lab, Cornell University Panzea Project Manager

Towards whole genome association genetic scans in barley

Supplementary Figure 1 Genotyping by Sequencing (GBS) pipeline used in this study to genotype maize inbred lines. The 14,129 maize inbred lines were

Module 1 Principles of plant breeding

Association Mapping of Agronomic QTLs in US Spring Barley Breeding Germplasm

USDA-ARS/ U.S. Wheat and Barley Scab Initiative FY16 Final Performance Report Due date: July 28, USWBSI Individual Project(s) 7/26/2017

A brief introduction to Marker-Assisted Breeding. a BASF Plant Science Company

Genomic Selection: A Step Change in Plant Breeding. Mark E. Sorrells

General aspects of genome-wide association studies

HCS806 Summer 2010 Methods in Plant Biology: Breeding with Molecular Markers

Introduction to Quantitative Genomics / Genetics

Construction of high-density linkage maps for mapping quantitative trait loci for multiple traits in field pea (Pisum sativum L.)

D3.1. Whealbi. Wheat and barley Legacy for Breeding Improvement. Grant agreement number: FP Collaborative Project SEVENTH FRAMEWORK PROGRAMME

Understanding genetic association studies. Peter Kamerman

Why can GBS be complicated? Tools for filtering & error correction. Edward Buckler USDA-ARS Cornell University

Funded by the Overseas Development Administration (ODA)

A genome-wide association study of malting quality across eight US barley breeding programs

USWBSI Barley CP Milestone Matrix Updated

AP 2010 Biotechnologie-Bioressources

INVESTIGATORS: Chris Mundt, Botany and Plant Pathology, OSU C. James Peterson, Crop and Soil Science, OSU

Gene Mapping in Natural Plant Populations Guilt by Association

Potential of human genome sequencing. Paul Pharoah Reader in Cancer Epidemiology University of Cambridge

Advanced Plant Technology Program Vocabulary

Marker types. Potato Association of America Frederiction August 9, Allen Van Deynze

Quantitative Genetics, Genetical Genomics, and Plant Improvement

Genomic resources and gene/qtl discovery in cereals

SNPs - GWAS - eqtls. Sebastian Schmeier

Systems for making NIAS Core Collections, single-seedderived germplasm, and plant photo images available to the research community

Research Article A High-Density SNP and SSR Consensus Map Reveals Segregation Distortion Regions in Wheat

A major gene for leaf cadmium. (Zea mays L.)

EPIB 668 Genetic association studies. Aurélie LABBE - Winter 2011

Genomic Selection with Linear Models and Rank Aggregation

A high density GBS map of bread wheat and its application for genetic improvement of the crop

GENETICS - CLUTCH CH.20 QUANTITATIVE GENETICS.

Optimizing Traditional and Marker Assisted Evaluation in Beef Cattle

Multi-SNP Models for Fine-Mapping Studies: Application to an. Kallikrein Region and Prostate Cancer

Dissecting the genetic basis of grain size in sorghum. Yongfu Tao DO NOT COPY. Postdoctoral Research Fellow

Improving barley and wheat germplasm for changing environments

GBS Usage Cases: Non-model Organisms. Katie E. Hyma, PhD Bioinformatics Core Institute for Genomic Diversity Cornell University

Comparative assessment of wheat landraces from AWCC, ICARDA and VIR germplasm collections based on the analysis of SSR markers

USDA-ARS/ U.S. Wheat and Barley Scab Initiative FY16 Final Performance Report Due date: July 28, USWBSI Individual Project(s)

I.1 The Principle: Identification and Application of Molecular Markers

Genetic drift in perennial ryegrass populations: Analysis of phenotypic data, Illumina genotyping assays and candidate genes

latestdevelopments relevant for the Ag sector André Eggen Agriculture Segment Manager, Europe

Identifying Genes Underlying QTLs

Efficiency of selective genotyping for genetic analysis of complex traits and potential applications in crop improvement

GENETIC VARIATION FOR SEED PROTEIN IN BARLEY GERMPLASM

BTRY 7210: Topics in Quantitative Genomics and Genetics

REFORMA Resilient, water- and energy-efficient forage and feed crops for Mediterranean agricultural systems

Quantitative Genetics for Using Genetic Diversity

Supplementary Note: Detecting population structure in rare variant data

Genomics and Developing World Agriculture. John Witcombe CAZS Natural Resources University Wales, Bangor

Association mapping of Sclerotinia stalk rot resistance in domesticated sunflower plant introductions

Traits and technologies to design crop breeding systems for climate change

Using molecular marker technology in studies on plant genetic diversity Final considerations

High-density SNP Genotyping Analysis of Broiler Breeding Lines

Zahirul Talukder 1, Yunming Long 1, Thomas Gulya 2, Charles Block 3, Gerald Seiler 2, Lili Qi 2. Department of Plant Sciences, NDSU

Agricultural Applications for Genome Sequencing

Cloning drought-related QTLs. WUEMED training course June 5-10, 2006

PLANT BREEDING FOR YIELD IMPROVEMENT

METODOLOGIE INTEGRATE PER LA SELEZIONE GENOMICA DI PIANTE ORTIVE SELEZIONE GENOMICA

Wilds. Cultivars. Landraces. Number of Haplotypes, h 8 Haplotype (gene) diversity, Hd HORVU5Hr1G

Feature Selection in Pharmacogenetics

Inflorescence QTL, Canalization, and Selectable Cryptic Variation. Patrick J. Brown Department of Crop Sciences UIUC

Association Mapping Identifies QTLs Underlying Cassava Brown Streak Disease Resistance

Anchoring and ordering NGS contig assemblies by population sequencing (POPSEQ)

Molecular and Applied Genetics

QTL Mapping, MAS, and Genomic Selection

Genome-wide association studies (GWAS) Part 1

Cowpea Breeding. Ainong Shi. University of Arkansas

A Fresh Look at Field Pea Breeding. Dr Garry Rosewarne Senior Research Scientist

Transcription:

Association genetics in barley Ildikó Karsai, Hungarian Academy of Sciences, Martonvásár, Hungary Frank Ordon, Dragan Perovic, Julius Kühn Institute, Quedlinburg, Germany Günther Schweizer, Bianca Büttner, Bayerische Landesanstalt für Landwirtschaft, Freising, Germany Jutta Förster, Saaten Union Biotech, Germany José Luis Molina, IRTA Lleida, Spain Marian Moralejo, Universitat de Lleida, Spain Francisco Ciudad, ITACyL Valladolid, Spain Ana Casas, Samia Yahiaoui (INRA Algerie), Alfonso Cuesta-Marcos (Oregon State Univ, Corvallis), Cristina Casao (MPIZ, Kölln), Cristina Silvar (Univ A Coruña), Pilar Gracia, José Manuel Lasa, E Igartua

Association in the Spanish Barley Core Collection (SBCC) Choice of germplasm. Emphasis on application for breeding http://www.eead.csic.es/eead/barley/

Association in the Spanish Barley Core Collection (SBCC) Choice of germplasm. Emphasis on application for breeding WHAT IS A CORE COLLECTION? Frankel (1984) introduced the concept and defined it as 'a limited set of accessions which represents, with a minimum of repetitiveness, the genetic diversity of a crop species and its wild relatives. Core collections are increasingly used as diversity panels to mine for useful traits and genes, using genome wide association. They have the advantage of being a permanent resource, available to the scientific community, therefore fostering widespread use. Excellent potential for high throughput phenotyping Frankel, O.H., 1984. Genetic perspectives of germplasm conservation. p. 161-170. In W. Arber, K. Llimensee, W.J. Peacock and P. Starlinger (eds.), Genetic Manipulation: Impact on Man and Society, Cambridge University Press, Cambridge.

Association in the SBCC. Materials and Methods Plant material: 185 genotypes, including 159 landrace-derived inbred lines (148 6-row, 11 2-row) from the Spanish Barley Core Collection, and 26 cultivars (13 6-row, 13 2-row) Markers: Barley OPA1 (1536 SNPs), Infinium iselect (9000 SNPs ), functional markers for flowering time genes, approx. 4000 polymorphic markers with map position. Marker selection: <10% missing, MAF>5% Phenotypes: 50 traits, agronomic (recorded at 10 field trials in Spain), morphological, and disease resistance (field evaluations in Spain and controlled inoculations in Germany, JKI and LfL) Association: TASSEL, model K (for Kinship). K matrix from TASSEL, done with 500 best quality SNPs, equally distributed.

SBCC: Diversity Lasa et al. 2002, Hereditas; Silvar et al. 2010, Plant Breeding

SBCC: Population structure Previously unknown underlying populations revealed by cluster analysis (STRUCTURE) Spanish 6-row landraces are constituted by two main populations, close to other Mediterranean cultivars, but different from cultivars from more Northern countries. Spanish 2-row landraces are not much different from cultivars from other origins Reference barleys 6-row Reference barleys 2-row Spanish barleys 6-row Yahiaoui et al. 2008, Theor Appl Genet Spanish barleys 2-row

Association mapping issues Genome wide association mapping (GWAS) procedures must be fine tuned to each experiment: One size does NOT fit all Choice of genotypic panel: phenotypic diversity for target traits must be sufficient Population structure induces high rates of false associations (false positives). Avoid disconnected genotypic groups known a priori Density of markers must be enough to provide a good genome cover. Even then, density can be highly variable Ascertainment bias. Selection of SNPs in relevant groups of germplasm SNP selection: polymorphisms targeted may not be functional, or even representative of the loci Not for every trait (more later)

Association mapping issues: density Average LD (r 2 ) 0.5 Promedio de R^2 0.45 0.4 0.35 0.3 0.25 0.2 Significant LD up to 3 cm 0.15 0.1 0.05 0 0 2 4 6 8 10 12 14 16 18 20 22 24 26 28 30 32 34 36 38 40 42 44 46 48 50 cm Clase 1cM

Association mapping issues: landscape of genomic diversity. Ascertainment bias in SNPs? 0.5 0.4 0.3 0.2 0.1 0 0 200 400 600 800 1000 Igartua et al. 2012, Genet Res Crop Evol

Association mapping issues: LD landscape Level of interchromosomal LD: number of loci in linkage disequilibrium (LD, r 2 ) with loci from other chromosomes. Ideally, the profile should be flat to avoid distortion in association patterns

Methods to control population structure Nothing. OK if structure is weak Cluster analysis (STRUCTURE) = Q. Usually regarded as too liberal. Genomic control. Not widely used. Principal component analysis (Eigenstrat). Widely used. Mixed models (Yu.Buckler 2005, Nature Genet) Q K. Widely used Q+K. Widely used Comparison of methods: Pasam et al 2012, BMC Plant Biology; Wang et al 2012, Theor Appl Genet

Methods to control for multiple testing Nothing. Not recommended, but still used in some cases Estimated proportion of significant tests (Pasam et al, 2012, BMC plant Biology). Liberal, but acceptable if used with strict control of population structure Multiple regression, with sequential introduction of significant markers. Seldom used. Threshold set by random variables analyzed with mixed models (Comadran et al, 2011, Theor Appl Genet) False Discovery Rate, FDR. Very popular, implemented in standard statistical packages (SAS, Genstat, R?) Bonferroni correction, full (too strict) or for an estimated effective number of independent tests (Varshney et al 2012, Field Crops Res)

Population structure. Performance of models Comparison of models to control for population structure for row type. 224 spring barleys, worldwide. Expected versus observed P values for the association tests. Pasam et al. 2012. BMC Plant Biology

Population structure. Performance of models Comparison of 2 models to control for population structure for 25 traits of the SBCC. Expected versus observed P values for the association tests.

Population structure. Performance of models Linear correlation coefficients between P of association tests and several genetic features Interchromosomal LD Intrachromosomal LD Naïve Q model K model

Association results. K model, FDR 0.1 Omited from the original presentation: Genome-wide graphs, log10probability vs cm for: Powdery mildew tolerance, average Powdery mildew tolerance, 7 isolates Leaf rust tolerance Scald Grain yield under low productivity Heading time in autumn sowings Row type Vernalization effect

Candidate genes for adaptation associated with geographical features and function Days to heading VrnH1 160 140 120 100 80 60 40 20 0 0 days 15 days 30 days 45 days Typical winter VrnH1-6 VrnH1-4 Typical spring VrnH1 allele Casao et al. 2011, Molec Breeding

Validation, QTL search, fine mapping: biparental populations Population Type Objective SBCC097 x Plaisant RIL Powdery mildew, flowering SBCC145 x Beatrix DH Scald, powdery mildew, flowering, height SBCC154 x Beatrix DH Scald, flowering, spike morphology SBCC042 x Cierzo BC2 RIL Yield SBCC073 x Cierzo BC2 RIL Yield SBCC016 x Esterel RIL Vernalization SBCC157 x Hispanic BC3 Vernalization, cold tolerance SBCC058 x Plaisant BC3 Vernalization, cold tolerance

Candidate genes for adaptation associated with geographical features and function VrnH3 Casas et al. 2011, Theor Appl Genet

Candidate genes for adaptation associated with geographical features and function VrnH3 Adapted from Casas et al. 2011, Theor Appl Genet

Validation, QTL search, fine mapping: biparental populations SBCC145 x Beatrix SBCC154 x Beatrix VrnH3, promoter VrnH3, 1st intron VrnH3. Three alleles confronted in two populations with one common parent. Effect of the polimorphisms at the promoter and first intron apparently confirmed with markers developed within the gene. Ponce-Molina et al., 2012, Crop Sci

Validation, QTL search, fine mapping: biparental populations SBCC097 x Plaisant SBCC145 x Beatrix Powdery mildew. Three alleles found in two populations with two different accessions as donors, apparently new for cultivated barley. One, on 6HL extreme, confirms association results Silvar et al. 2010, Molec Breeding; Silvar et al., 2011, Theor Appl Genet

Candidate genes for adaptation associated with geographical features and function PpdH2, winter barleys N N N Casao, Karsai et al. 2011, BMC Plant Biology

Messages GWAS in plants feasible, but not for every trait. Very difficult for traits directly related with history of selection, or correlated with population structure. The choice of germplasm is critical. Core collections are a good starting point, but further elaboration may be needed to achieve an optimum panel. Association with geo-referenced accessions offers further layers of information to extract biological meaning. Validation in biparental populations provides further proof for trait discovery. Ascertainment bias can be an issue for SNP platforms. GBS will be unbiased.

Thanks for your attention