Supplementary Text. eqtl mapping in the Bay x Sha recombinant population.

Similar documents
Mapping and Mapping Populations

Association Mapping in Plants PLSC 731 Plant Molecular Genetics Phil McClean April, 2010

SNPs - GWAS - eqtls. Sebastian Schmeier

By the end of this lecture you should be able to explain: Some of the principles underlying the statistical analysis of QTLs

1 why study multiple traits together?

Multiple Traits & Microarrays

Axiom mydesign Custom Array design guide for human genotyping applications

Experimental Design and Sample Size Requirement for QTL Mapping

Chapter 5: Overview. Overview. Introduction. Genetic linkage and. Genes located on the same chromosome. linkage. recombinant progeny with genotypes

The principles of QTL analysis (a minimal mathematics approach)

UPIC: Perl scripts to determine the number of SSR markers to run

eqtl Networks Reveal Complex Genetic Architecture in the Immature Soybean Seed

Nature Genetics: doi: /ng Supplementary Figure 1. QQ plots of P values from the SMR tests under a range of simulation scenarios.

Guilherme J. M. Rosa, Natalia de Leon and Artur J. M. Rosa

Supplementary Figure 1 qrt-pcr expression analysis of NLP8 with and without KNO 3 during germination.

Human linkage analysis. fundamental concepts

Design and Construction of Recombinant Inbred Lines

BTRY 7210: Topics in Quantitative Genomics and Genetics

CHAPTER 4 STURTEVANT: THE FIRST GENETIC MAP: DROSOPHILA X CHROMOSOME LINKED GENES MAY BE MAPPED BY THREE-FACTOR TEST CROSSES STURTEVANT S EXPERIMENT

Linkage & Crossing over

MENDELIAN GENETICS This presentation contains copyrighted material under the educational fair use exemption to the U.S. copyright law.

Genetics of extreme body size evolution in mice from Gough Island

CS273B: Deep Learning in Genomics and Biomedicine. Recitation 1 30/9/2016

Conifer Translational Genomics Network Coordinated Agricultural Project

MAPPING OF QUANTITATIVE TRAIT LOCI (QTL)

Mapping Genes Controlling Quantitative Traits Using MAPMAKER/QTL Version 1.1: A Tutorial and Reference Manual

Using semantic web technology to accelerate plant breeding.

DESIGNS FOR QTL DETECTION IN LIVESTOCK AND THEIR IMPLICATIONS FOR MAS

QTLs for days to silking in a recombinant inbred line maize population subjected to high and low nitrogen regimes

Analysis of Microarray Data

Introduction to Bioinformatics and Gene Expression Technologies

Pathway approach for candidate gene identification and introduction to metabolic pathway databases.

-Genes on the same chromosome are called linked. Human -23 pairs of chromosomes, ~35,000 different genes expressed.

Gregor Mendel. Austrian Monk Worked with pea plants

MAS refers to the use of DNA markers that are tightly-linked to target loci as a substitute for or to assist phenotypic screening.

Quantitative Genetics

Reciprocal backcross mice confirm major loci linked to hyperoxic acute lung injury survival time

Topic 11. Genetics. I. Patterns of Inheritance: One Trait Considered

MulCom: a Multiple Comparison statistical test for microarray data in Bioconductor.

POPULATION GENETICS Winter 2005 Lecture 18 Quantitative genetics and QTL mapping

Genome-wide association studies (GWAS) Part 1

Observing Patterns In Inherited Traits

Question. In the last 100 years. What is Feed Efficiency? Genetics of Feed Efficiency and Applications for the Dairy Industry

5 Results. 5.1 AB-QTL analysis. Results Phenotypic traits

Gene Discovery of Nitrogen Utilization in Maize Yuhe Liu, Devin Nichols, Stephen Moose Department of Crop Sciences, University of Illinois

Linking Genetic Variation to Important Phenotypes

Genetic loci mapping associated with maize kernel number per ear based on a recombinant inbred line population grown under different nitrogen regimes

Human SNP haplotypes. Statistics 246, Spring 2002 Week 15, Lecture 1

Using mutants to clone genes

/ Computational Genomics. Time series analysis

Research Article A High-Density SNP and SSR Consensus Map Reveals Segregation Distortion Regions in Wheat

Genomic analysis of natural variation for seed and plant size in maize Dr. Shawn Kaepler University of Wisconsin

The tomato genome re-seq project

Compendium of Immune Signatures Identifies Conserved and Species-Specific Biology in Response to Inflammation

Next-Generation Sequencing Gene Expression Analysis Using Agilent GeneSpring GX

Biotools: an R function to predict spatial gene diversity via an individual-based approach

H3A - Genome-Wide Association testing SOP

Heredity and DNA Assignment 1

Advanced breeding of solanaceous crops using BreeDB

Gene Linkage and Genetic. Mapping. Key Concepts. Key Terms. Concepts in Action

Fine-scale mapping of meiotic recombination in Asians

Supplementary Note: Detecting population structure in rare variant data

Introduction to Genetics. DANILO V. ROGAYAN JR. Faculty, Department of Natural Sciences

Chapter 14: Mendel and the Gene Idea

Gene Signal Estimates from Exon Arrays

Ning Yang 1., Yanli Lu 1,2., Xiaohong Yang 3, Juan Huang 1, Yang Zhou 1, Farhan Ali 1, Weiwei Wen 1, Jie Liu 1, Jiansheng Li 3, Jianbing Yan 1 *

GENE MAPPING. Genetica per Scienze Naturali a.a prof S. Presciuttini

Linkage Disequilibrium. Adele Crane & Angela Taravella

TTT: 7 WT: Text book by N.C.E.R.T. 2. Reference book by Dinesh Publications.

Genetics - Problem Drill 19: Dissection of Gene Function: Mutational Analysis of Model Organisms

Heritable Epigenetic Variation among Maize Inbreds

Advanced Plant Technology Program Vocabulary

Genetics. Genetics- is the study of all manifestation of inheritance from the distributions of traits to the molecules of the gene itself

Conifer Translational Genomics Network Coordinated Agricultural Project

Chapter 6. Linkage Analysis and Mapping. Three point crosses mapping strategy examples. ! Mapping human genes

Recombination. The kinetochore ("spindle attachment ) always separates reductionally at anaphase I and equationally at anaphase II.

Joint Genetic Analysis of Gene Expression Data with Inferred Cellular Phenotypes

Using the Association Workflow in Partek Genomics Suite

LINKAGE AND CHROMOSOME MAPPING IN EUKARYOTES

Association mapping of Sclerotinia stalk rot resistance in domesticated sunflower plant introductions

Association Mapping in Wheat: Issues and Trends

Mendel and the Gene Idea

Genetics & The Work of Mendel

Genetics of dairy production

Observing Patterns in Inherited Traits. Chapter 11

Molecular Marker-Facilitated Investigations of Quantitative Trait Loci in Maize. II. Factors Influencing Yield and Its Component Traits

Quantitative trait mapping in a diallel cross of recombinant inbred lines

Targeted Recombinant Progeny: a design for ultra-high resolution mapping of Quantitative Trait Loci in crosses between inbred or pure lines

Introduction Executive Summary

Package ABC.RAP. October 20, 2016

Introduction. Thomas Hunt Morgan. Chromosomes and Inheritance. Drosophila melanogaster

LINKAGE DISEQUILIBRIUM MAPPING USING SINGLE NUCLEOTIDE POLYMORPHISMS -WHICH POPULATION?

ChIP-Seq Data Analysis. J Fass UCD Genome Center Bioinformatics Core Wednesday 15 June 2015

Sept 2. Structure and Organization of Genomes. Today: Genetic and Physical Mapping. Sept 9. Forward and Reverse Genetics. Genetic and Physical Mapping

A/A;b/b x a/a;b/b. The doubly heterozygous F1 progeny generally show a single phenotype, determined by the dominant alleles of the two genes.

Combinatorial activities of SHORT VEGETATIVE PHASE and FLOWERING LOCUS C define distinct modes of flowering regulation in Arabidopsis

Analysis of Gene Expression in Resynthesized Brassica napus Allopolyploids Using Arabidopsis 70mer Oligo Microarrays

GENETIC ALGORITHMS. Narra Priyanka. K.Naga Sowjanya. Vasavi College of Engineering. Ibrahimbahg,Hyderabad.

Transcription:

Supplementary Text eqtl mapping in the Bay x Sha recombinant population. Expression levels for 24,576 traits (Gene-specific Sequence Tags: GSTs, CATMA array version 2) was measured in RNA extracted from developing seeds (harvested 10 days after pollination) of 157 RILs from the Bay-0 x Sha population (hereafter BaySha)[1]. Display, sample treatment, microarray analysis and data normalisation were performed as indicated in Methods within the main text. We used this dataset to develop and test the efficiency of the novel R/eqtl package as a tool for identifying expression quantitative trait loci (eqtls) in recombinant populations. Linkage analyses on expression traits allowed us to identify 19,021 significant eqtls (FDR = 5%, LOD > 2.47 after 1,000 permutation tests on 100 randomly chosen expression traits; Figure S6a; Table S2c). For each of them, the variance explained, the supporting interval, the additive effect and the classification as potentially cis- or trans-acting eqtl (see main text for details) were determined. The accurate results provided by R/eqtl in the BaySha population allowed us to perform further mapping and crosses comparisons with the microarray data generated in the CviCol and BurCol recombinant populations. The number of eqtls identified in BaySha exceeds those ones found for the CviCol and BurCol recombinant populations by 4.4 and 2.9 fold, respectively. The majority of the eqtls were mapped as distant (91.7%), and only a minority (8.3%) was found to act locally (Figure S6a-b). The significant over-representation of distant association and the total lower number of local-eqtls detected in the BaySha population explains the greater difference between this set and the two crosses involving the Col-0 accession. Nevertheless, in agreement with CviCol and BurCol findings, we observed a higher average variance explained by local eqtls compared to distant-eqtls (13.2% vs. 11.3% respectively, P < 2.2e-16, Figure S6b, Figure S7) with more local-eqtls having larger effects and significance (Figure S6b). Most of the traits were associated with a unique genomic region (Figure S8), in agreement with CviCol and BurCol findings. The distribution of the distant-eqtls along the genome was not uniform and most of them gathered in specific regions of the genome (Figure S9). In order to map intervals significantly over-represented with distant-eqtls (hotspots), we performed a permutation test after dividing the genome into 1Mb intervals (see Methods in main text). Only three significant hotspots were detected localised in the left arm of chromosome 4 and 5. Interestingly, two well-characterised pleiotropic QTLs known to segregate in this cross were found within these regions. Below the strongest hotspot (gathering ~12,000 eqtls on chromosome 4) lies the flowering time gene FRIGIDA (FRI) as an obvious effector. Similarly, the second major hotspot, linked to the expression of over 2,000 genes, localised on chromosome 5 in an

interval neighbouring the FLC locus. Variation at FRI and FLC is known to control most of the variation for flowering time in this cross [1]. Additionally, these hotpots are not independent, with most of the genes controlled by a distant-eqtl at FLC also controlled by a distant-eqtl at FRI, in most instances with opposite allelic effects. In other words, this means that hundreds of transcripts have a genetic architecture (= detected QTLs and their parameters) which is very close to the one obtained for the flowering precocity trait itself. Although FLC has been hypothesised to directly regulate many genes outside of the flowering time pathway [2], the observed pattern fits better with an indirect effect of flowering time on the timing of gene expression in the developing seeds after pollination. Previously, linkage mapping of rosette transcripts for this line detected 36,871 eqtls, which is almost twice as much as detected in this study. Contrary to our results, many of the eqtls gathered in a distant-hotspot localised on chromosome 2 and no evidence for a large number of transcripts regulated by either FRI or FLC was reported [3]. Discrepancies between studies could be explained by the different stages at which very different tissues were sampled, where different trans-factors could lead the expression of thousands of transcripts. The BaySha developping seeds eqtls results are stored in the QTLStore portal. References 1. Loudet O, Chaillou S, Camilleri C, Bouchez D, Daniel-Vedele F: Bay-0 x Shahdara recombinant inbred line population: a powerful tool for the genetic dissection of complex traits in Arabidopsis. Theor Appl Genet 2002, 104:1173-1184. 2. Deng W, Ying H, Helliwell CA, Taylor JM, Peacock WJ, Dennis ES: FLOWERING LOCUS C (FLC) regulates development pathways throughout the life cycle of Arabidopsis. Proc Natl Acad Sci USA 2011, 108(16):6680-6685. 3. West MA, Kim K, Kliebenstein DJ, van Leeuwen H, Michelmore RW, Doerge RW, St Clair DA: Global eqtl mapping reveals the complex genetic architecture of transcript-level variation in Arabidopsis. Genetics 2007, 175(3):1441-1450.

Supporting Figures Figure S1. Venn diagram depicting the overlap between genes with differential expression in parental accessions pairs (Cvi vs. Col and Bur vs. Col). Figure S2. Histograms of the explained phenotypic variance (R 2 ; %) for the eqtls in the a.cvicol and b. BurCol populations. Figure S3. Number of eqtls per trait. The percentage and number of traits explained by 1 to 5 eqtls are indicated along the y-axis and on top of each bar, respectively. a. CviCol b. BurCol at a FDR of 5%. Figure S4. Venn diagram depicting the overlap between probes with local eqtls in the CviCol and BurCol populations. Figure S5. Histogram of the number of probes with a significant eqtl for different numbers of hidden factors tested with VBQTL in CviCol. Figure S6. Genetic landscape for transcript accumulation variation in BaySha. a. eqtl heatmap for BaySha population significant at a 5% FDR. Each horizontal bar represents an eqtl mapped on the x-axis and controlling the accumulation of a transcript expressed from the locus indicated on the y-axis. The colour of the bar indicates the direction and strength of the eqtl additive effect, and its length along the x axis encompasses the eqtl support interval. Local eqtls form the diagonal, while distant eqtls fall elsewhere in the map. b. Bar plot indicating the proportion of local and distant eqtls for increasing LOD value intervals. Figure S7. Histogram of the explained phenotypic variance (R 2 ) for the eqtls in the BaySha population Figure S8. Number of eqtls per trait in BaySha. The percentage and number of traits explained by 1 to 5 eqtls are indicated along the y-axis and on top of each bar, respectively. Figure S9. Distribution of distant-eqtls along the genome in BaySha. The number of eqtls (y-axis) is plotted against the physical position of the 1Mb-window where they peak (x-axis). Intervals with an excess of eqtls relative to the threshold estimated by permutation (red dashed line) were classified as hotspots.

Cvi-Col Bur-Col 523 560 1186 Figure S1

a Frequency 0 500 1000 1500 2000 2500 CviCol 0 20 40 60 80 Explained Phenotypic Variance (%) b Frequency 0 500 1000 1500 2000 2500 BurCol 0 20 40 60 80 Explained Phenotypic Variance (%) Figure S2

a 100 CviCol 3464 75 Percentage 50 25 342 0 42 4 0 1 2 3 4 5 eqtls/trait b 100 4905 BurCol 75 Percentage 50 25 708 0 61 5 2 1 2 3 4 5 eqtls/trait Figure S3

CviCol BurCol 1581 564 1099 Figure S4

8000 Probes with eqtls 6400 4800 3200 1600 0 0 5 10 20 30 Number of Hidden Factors Figure S5

a Bay Additive Effect Sha Physical e trait location (Chromosome) I II III IV V I II III IV V Physical eqtl location (Chromosome) b 100 Local Distant 75 Percentage 50 25 0 2.48-3 3-4 4-5 5-10 10-20 20-30 30-50 > 50 LOD Figure S6

Frequency 0 1000 2000 3000 4000 5000 6000 0 20 40 60 80 Explained Phenotypic Variance (%) Figure S7

100 13774 75 50 25 2388 131 17 2 0 1 2 3 4 5 eqtls/trait Figure S8

14000 Number of eqtls 10000 6000 BaySha 2000 I II III IV V Chromosome Figure S9