Association mapping of Sclerotinia stalk rot resistance in domesticated sunflower plant introductions

Similar documents
Zahirul Talukder 1, Yunming Long 1, Thomas Gulya 2, Charles Block 3, Gerald Seiler 2, Lili Qi 2. Department of Plant Sciences, NDSU

QTL mapping of Sclerotinia basal stalk rot (BSR) resistance in sunflower using genotyping-bysequencing

Development of SNP markers linked to the Downy Mildew Resistance Gene Pl 8 in Sunflower

Part 1:Progress on Rust Resistance Breeding of Confection Sunflowers and Part 2: Sunflower SNP Genetic Mapping

SEARCHING FOR DNA MARKERS ASSOCIATED WITH SCLEROTINIA TOLERANCE IN CULTIVATED SUNFLOWER

GREG GIBSON SPENCER V. MUSE

SNP calling and Genome Wide Association Study (GWAS) Trushar Shah

Linkage Disequilibrium

Characterization of Sclerotinia phenotypic variation relevant to diseases of sunflower

Association Mapping in Plants PLSC 731 Plant Molecular Genetics Phil McClean April, 2010

Mapping and Mapping Populations

New sunflower rust projects in the USDA Sunflower Research Unit

Genetic dissection of complex traits, crop improvement through markerassisted selection, and genomic selection

Pharmacogenetics: A SNPshot of the Future. Ani Khondkaryan Genomics, Bioinformatics, and Medicine Spring 2001

GBS Usage Cases: Examples from Maize

Gene Mapping in Natural Plant Populations Guilt by Association

Why do we need statistics to study genetics and evolution?

Genomic Selection in Breeding Programs BIOL 509 November 26, 2013

Understanding genetic association studies. Peter Kamerman

I.1 The Principle: Identification and Application of Molecular Markers

Map-Based Cloning of Qualitative Plant Genes

Pathway approach for candidate gene identification and introduction to metabolic pathway databases.

Lecture 12. Genomics. Mapping. Definition Species sequencing ESTs. Why? Types of mapping Markers p & Types

b. (3 points) The expected frequencies of each blood type in the deme if mating is random with respect to variation at this locus.

By the end of this lecture you should be able to explain: Some of the principles underlying the statistical analysis of QTLs

Crash-course in genomics

Identifying the functional bases of trait variation in Brassica napus using Associative Transcriptomics

Supplementary Figure 1 Genotyping by Sequencing (GBS) pipeline used in this study to genotype maize inbred lines. The 14,129 maize inbred lines were

GENETICS - CLUTCH CH.20 QUANTITATIVE GENETICS.

Marker types. Potato Association of America Frederiction August 9, Allen Van Deynze

Development of Early Maturing GEM lines with Value Added Traits: Moving U.S. Corn Belt GEM Germplasm Northward

Identifying Genes Underlying QTLs

Genome Wide Association Study for Binomially Distributed Traits: A Case Study for Stalk Lodging in Maize

A brief introduction to Marker-Assisted Breeding. a BASF Plant Science Company

BIOLOGY - CLUTCH CH.20 - BIOTECHNOLOGY.

Association studies (Linkage disequilibrium)

Introduction to Add Health GWAS Data Part I. Christy Avery Department of Epidemiology University of North Carolina at Chapel Hill

Yield Gains in Sunflower. Brent S. Hulke USDA-ARS-Northern Crop Science Laboratory Larry W. Kleingartner National Sunflower Association

GBS Usage Cases: Non-model Organisms. Katie E. Hyma, PhD Bioinformatics Core Institute for Genomic Diversity Cornell University

CS273B: Deep Learning in Genomics and Biomedicine. Recitation 1 30/9/2016

EPIB 668 Genetic association studies. Aurélie LABBE - Winter 2011

Brassica carinata crop improvement & molecular tools for improving crop performance

LD Mapping and the Coalescent

Statistical Methods for Quantitative Trait Loci (QTL) Mapping

Ultra-dense SNP genetic map construction and identification of SiDt gene controlling the

Journal Club Kairi Raime

Analysis of genome-wide genotype data

Usage Cases of GBS. Jeff Glaubitz Senior Research Associate, Buckler Lab, Cornell University Panzea Project Manager

SolCAP. Executive Commitee : David Douches Walter De Jong Robin Buell David Francis Alexandra Stone Lukas Mueller AllenVan Deynze

DETECTING THE PRESENCE OF DOWNY MILDEW AND GENOTYPING SUNFLOWER HOST PLANTS IN THE SAME PCR REACTION

Sequence Variations. Baxevanis and Ouellette, Chapter 7 - Sequence Polymorphisms. NCBI SNP Primer:

QTL Mapping, MAS, and Genomic Selection

Lecture 6: Association Mapping. Michael Gore lecture notes Tucson Winter Institute version 18 Jan 2013

Determination of vegetative compatibility groups using molecular markers, and their aggressiveness of Verticillium dahliae occurring on sunflower

Summary of research needs and challenges for sunflower genetics. Brent Hulke Research Geneticist USDA-ARS

Massive Analysis of cdna Ends for simultaneous Genotyping and Transcription Profiling in High Throughput

FFGWAS. Fast Functional Genome Wide Association AnalysiS of Surface-based Imaging Genetic Data

Computational Workflows for Genome-Wide Association Study: I

MAS refers to the use of DNA markers that are tightly-linked to target loci as a substitute for or to assist phenotypic screening.

QTL mapping in domesticated and natural fish populations

BENG 183 Trey Ideker. Genome Assembly and Physical Mapping

Genetics Effective Use of New and Existing Methods

CMSC423: Bioinformatic Algorithms, Databases and Tools. Some Genetics

Liu et al. Genetics 2013 (in press)

Lecture 1 Introduction to Modern Plant Breeding. Bruce Walsh lecture notes Tucson Winter Institute 7-9 Jan 2013

Linking Genetic Variation to Important Phenotypes: SNPs, CNVs, GWAS, and eqtls

Quantitative Genetics, Genetical Genomics, and Plant Improvement

Quantitative Genetics for Using Genetic Diversity

Traditional Genetic Improvement. Genetic variation is due to differences in DNA sequence. Adding DNA sequence data to traditional breeding.

Detecting gene-gene interactions in high-throughput genotype data through a Bayesian clustering procedure

Erhard et al. (2013). Plant Cell /tpc

Questions we are addressing. Hardy-Weinberg Theorem

Linking Genetic Variation to Important Phenotypes

Molecular and Applied Genetics

Bioinformatic Analysis of SNP Data for Genetic Association Studies EPI573

Why can GBS be complicated? Tools for filtering, error correction and imputation.

GDMS Templates Documentation GDMS Templates Release 1.0

Feature Selection in Pharmacogenetics

Authors: Vivek Sharma and Ram Kunwar

This is a closed book, closed note exam. No calculators, phones or any electronic device are allowed.

Why can GBS be complicated? Tools for filtering & error correction. Edward Buckler USDA-ARS Cornell University

HCS806 Summer 2010 Methods in Plant Biology: Breeding with Molecular Markers

Lecture 23: Causes and Consequences of Linkage Disequilibrium. November 16, 2012

ABSTRACT : 162 IQUIRA E & BELZILE F*

3I03 - Eukaryotic Genetics Repetitive DNA

Question. In the last 100 years. What is Feed Efficiency? Genetics of Feed Efficiency and Applications for the Dairy Industry

POPULATION GENETICS Winter 2005 Lecture 18 Quantitative genetics and QTL mapping

Module 1 Principles of plant breeding

Human Genetic Variation. Ricardo Lebrón Dpto. Genética UGR

Frumkin, 2e Part 1: Methods and Paradigms. Chapter 6: Genetics and Environmental Health

SNP genotyping and linkage map construction of the C417 mapping population

Algorithms for Genetics: Introduction, and sources of variation

MSc in Genetics. Population Genomics of model species. Antonio Barbadilla. Course

Trudy F C Mackay, Department of Genetics, North Carolina State University, Raleigh NC , USA.

Chapter 15 Gene Technologies and Human Applications

GENETICS - CLUTCH CH.15 GENOMES AND GENOMICS.

The Human Genome Project has always been something of a misnomer, implying the existence of a single human genome

QTL Mapping Using Multiple Markers Simultaneously

Introduction to Quantitative Genomics / Genetics

Genome Sequence Assembly

Transcription:

Association mapping of Sclerotinia stalk rot resistance in domesticated sunflower plant introductions Zahirul Talukder 1, Brent Hulke 2, Lili Qi 2, and Thomas Gulya 2 1 Department of Plant Sciences, NDSU 2 USDA-ARS ARS Northern Crop Science Laboratory Fargo, North Dakota

Background Sclerotinia sclerotiorum causes two serious diseases in Sunflower Stalk rot and Head rot, genetics of resistance is different Resistance is polygenic, no major resistant gene is known Resistance breeding relies on incorporating genetic factors from various partially tolerant tolerant breeding lines

Quantitative disease resistance Two approaches mapping Linkage analysis based QTL mapping, traditional tool need to develop bi parental mapping population by controlled crosses with known ancestry few opportunities for recombination, resulting in low mapping resolution QTLs that differ between two parental lines can only be detected in each mapping population

Quantitative disease resistance mapping Linkage disequilibrium (LD) based Association Mapping, new tool no need to develop mapping population exploits historical and evolutionary recombination events at the population level mapping resolution is higher than bi parental mapping population of the same size suffer from risk of incurring false positives due to population structure and kinship among individuals

Two approaches Association Mapping Genome Wide Association Mapping whole genome may be scanned to identify markers that are associated with a particular phenotype Candidate Gene Association Mapping alleles at a few selected functional candidate genes thought to be involved in controlling the trait of interest may be tested for association Candidate Gene Association Mapping study is more hypothesis driven than a Genome Wide study

Candidate-Gene AM of Sclerotinia stalk rot resistance in sunflower Methods AM population = 260 sunflower lines 249 domesticated plant introductions (PIs) 11 elite USDA inbred lines 2 hybrid checks, susceptible (Car 270) & resistant (Croplan 305) examined for stalk rot resistance in 2008 and 2009 in multi location replicated trials 52 genotypes with best resistant response, and 52 most susceptible genotypes were sequenced for selected candidate genes

Six Candidate genes (Arabidopsis thaliana defense genes) : ABI1 (ABA Insensitive 1), and ABI2 (ABA Insensitive 2) involved in abscisic acid (ABA) signal transduction EIN2 (Ethylene Insensitive 2) central regulator of ethylene signaling LACS2 (Long chain Acyl CoA Synthetase 2) involved in cutin biosynthesis pathway DET3 (De Etiolated 3) involved in oxalic acid signaling, and COI1 (Coronatine Insensitive 1) jasmonate receptor Primer design: Nucleotide sequences of the candidate genes were used to BLAST search against the NCBI EST database for sunflower EST sequences Sunflower EST sequences with high score and e value were then selected for each gene, and searched for contig assembly sequences in the Compositae Genome Project database (http://cgpdb.ucdavis.edu/) The contig sequences were reverse BLAST against GenBank to confirm the gene identity Multiple overlapping primer pairs were designed from contig sequences using the Primer3 software

DNA extraction, PCR amplification & sequencing: DNA extracted from 260 individuals of AM population PCR conditions optimized for each pair of primers Cleaned PCR amplicons sent for sequencing to Genomics and Bioinformatics Research Unit at USDA ARS, Stoneville, MS Sequence analysis and SNP survey DnaSP v.5.1 software and web based tool SNiPlay were used for sequence analysis and SNP survey within the gene sequences Population structure and Kinship analysis SNP markers develop by NSA were used for both population structure and kinship analysis. Structure analysis was done with 750 randomly chosen SNP markers using Structure v.2.3.3 software and, kinship analysis was performed with 5244 SNPs using SPAGeDi v1.3a software Linkage disequilibrium and Association mapping Haploview v.4.2 and TASSEL v.3.0 software were used for LD and association mapping analysis, respectively

Results Summary table of sequence analysis for eight candidate gene sequences ences Sequence name Length (bp) LG Map position (cm) SNP InDel Coding region Non-coding region Length SNP Freq. Length SNP Freq. ABI1N1 279 - - 12 4 171 1 (0) 1/171 108 11 1/10 ABI2N2 189 10 44.9 5 0 189 5 (4) 1/38 0 - - COI24 668 14 50.9 23 1 668 23 (5) 1/29 0 - - COI57 660 14 7.0 12 0 660 12 (1) 1/55 0 - - DET1N1 808 16 68.3 61 24 192 7 (3) 1/27 616 54 1/11 EIN1 483 14 30.1 15 3 403 13 (2) 1/31 80 2 1/40 EIN2N1 239 - - 3 0 239 3 (0) 1/80 0 - - LAC1N2 465 12 47.1 18 6 174 2 (0) 1/87 291 16 1/18 Total 3791 149 38 2696 66 66 (15) 1/41 number of SNP that led to changes in amino acid codons are shown in the parenthesis 1/41 1095 83 1/13

Estimation of number of sub-populations in the CG-AM population The log probability of data, Ln P(D) for each value of K averaged over 3 independent runs of structure with 200,000 burn in steps and 200,000 simulation steps. Estimation of the number of populations for K ranging from 1 to 10 by calculating delta K values.

Bar plot of the STRUCTURE analysis Cluster 1 Mixed Cluster 2 K = 2 K = 3 K = 4 K = 10 Each of the 104 genotypes is represented by a vertical bar, which h is partitioned into K colored segments that represent the individual s s estimated membership to the K clusters. The separation of cluster 1 and cluster 2 was done by the threshold for membership coefficient, Q 0.75.

Patterns of linkage disequilibrium (LD) for eight candidate gene sequences ABI1N1 ABI2N2 LAC1N2 DET1N1

COI24 COI57 EIN1 EIN2N1

Linkage disequilibrium (LD) decay of eight candidate genes LD decay of eight candidate genes was measured by plotting squired correlations of allele frequencies (r( 2 ) against genetic distance (bp) between pairs of polymorphic sites. Inner fitted trend line is a nonlinear logarithmic regression sion curve.

Significant association between candidate gene SNP markers and Stalk S rot resistance in sunflower AM population using four statistical models Gene Significant SNPs Tag SNP GLM 3 Environment mean Davenport COI1 COI57_459, COI57_462, COI57_115, COI57_117 & COI57_408 COI57_408 *** ** ** ** *** ** * * COI57_24 and COI57_72 COI57_72 *** ns ns ns *** ns ns ns COI24_236, COI24_251, COI24_308, COI24_356, COI24_410, COI24_536i, COI24_170 & COI24_38 COI24_251 *** ns * ns *** ns ns ns COI24_631 COI24_631 *** ns ns ns ** ns ns ns COI24_606 COI24_606 ns ns * ns ns ns ns ns COI24_312 & COI24_314 COI24_312 ** ns ns ns ** ns ns ns ABI1 ABI2N2_32 ABI2N2_32 *** ns ns ns ** ns * ns and ABI2N2_163 ABI2N2_163 ** ns ns ns ns ns ns ns ABI2 ABI2N2_155 ABI2N2_155 ** ns ns ns ns ns ns ns ABI1N1_141, ABI1N1_146, ABI1N1_148i, ABI1N1_157, ABI1N1_158, ABI1N1_158, ABI1N1_158, ABI1N1_159, ABI1N1_146 ns ns ns ns * ns ns ns ABI1N1_163, ABI1N1_165 & ABI1N1_172i EIN2 EIN1_246, EIN1_250, EIN1_222i & EIN1_206 EIN1_250 ** ** ** ns ** ns ns ns EIN1_127 EIN1_127 ns ns * ns ns ** * * DET3 DET1N1_614i DET1N1_614i ** ns ns ns ** ns ns ns DET1N1_535,DET1N1_537, DET1N1_217, DET1N1_109, DET1N1_661i,DET1N1_103,DET1N1_428i, DET1N1_181, DET1N1_217 DET1N1_540i,DET1N1_660, DET1N1_677i, DET1N1_538 ns ns * ns ns ns ns ns & DET1N1_202 DET1N1_394i DET1N1_394i ns ns * ns ns ns ns ns DET1N1_25 DET1N1_25 ns ns ns ns *** ns * ns DET1N1_34, DET1N1_37, DET1N1_107i & DET1N1_105i DET1N1_37 ns ns ns ns *** ns ns ns GLM +Q MLM +K MLM +K+Q GLM GLM +Q MLM +K MLM +K+Q

Progress in Genome-Wide AM of stalk rot and head rot resistance Population structure and Kinship analysis of AM population is complete Mapping of NSA SNPs in the sunflower linkage map are in progress Phenotypic evaluation of AM population Stalk rot is complete (2 yrs x 2 locations) Head rot is in progress (1 yr x 2 locations is done)

Future Plans Genotyping of the remaining lines of the AM population with significant CG SNPs, and complete CG AM work Complete GW AM of Stalk rot resistance Continue working on association mapping of Head rot resistance and Phomopsis resistance

Acknowledgement National Sclerotinia Initiative for funding National Sunflower Association for SNP marker development The authors thank following individuals for their technical assistance Angelia Hogness Leanne Matthiesen Dana Weiskopf Megan Ramsett Nikolay Balbyshev Arun Jani