Functional genomics to improve wheat disease resistance. Dina Raats Postdoctoral Scientist, Krasileva Group

Similar documents
The Genome Analysis Centre. Building Excellence in Genomics and Computational Bioscience

Selecting TILLING mutants

Wheat Genome Structural Annotation Using a Modular and Evidence-combined Annotation Pipeline

Genomic resources. for non-model systems

A mutation in TaGW2-A increases thousand grain weight in wheat. James Simmonds

Supplementary Data 1.

Picture Andre Schönhofen. Jorge Dubcovsky Seed Central, October

The Genome Analysis Centre. Building Excellence in Genomics and Computa5onal Bioscience

Next-Generation Sequencing. Technologies

Analysis of neo-antigens to identify T-cell neo-epitopes in human Head & Neck cancer. Project XX1001. Customer Detail

Next Generation Genetics: Using deep sequencing to connect phenotype to genotype

Anchoring and ordering NGS contig assemblies by population sequencing (POPSEQ)

Prioritization: from vcf to finding the causative gene

The Diploid Genome Sequence of an Individual Human

The genome of Fraxinus excelsior (European Ash)

De novo assembly in RNA-seq analysis.

A barley root mutant collection for NGS-based fast-forward genetics

Comparison and Evaluation of Cotton SNPs Developed by Transcriptome, Genome Reduction on Restriction Site Conservation and RAD-based Sequencing

RNA-SEQUENCING ANALYSIS

HIGH-QUALITY ASSEMBLY OF THE DURUM WHEAT GENOME CV. SVEVO

Transcriptomics analysis with RNA seq: an overview Frederik Coppens

Analytics Behind Genomic Testing

Next Generation Sequencing. Tobias Österlund

Welcome to the NGS webinar series

A brief introduction to Marker-Assisted Breeding. a BASF Plant Science Company


Human Genome Sequencing Over the Decades The capacity to sequence all 3.2 billion bases of the human genome (at 30X coverage) has increased

Bioinformatics Advice on Experimental Design

Transcriptome Assembly, Functional Annotation (and a few other related thoughts)

Fast, Accurate and Sensitive DNA Variant Detection from Sanger Sequencing:

Single Cell Transcriptomics scrnaseq

Applicazioni biotecnologiche

Lecture 7. Next-generation sequencing technologies

Application of Genotyping-By-Sequencing and Genome-Wide Association Analysis in Tetraploid Potato

RADSeq Data Analysis. Through STACKS on Galaxy. Yvan Le Bras Anthony Bretaudeau Cyril Monjeaud Gildas Le Corguillé

Mapping and Mapping Populations

Gap Filling for a Human MHC Haplotype Sequence

Genome Assembly of the Obligate Crassulacean Acid Metabolism (CAM) Species Kalanchoë laxiflora

Map-Based Cloning of Qualitative Plant Genes

DNBseq TM SERVICE OVERVIEW Plant and Animal Whole Genome Re-Sequencing

Genome 373: Mapping Short Sequence Reads II. Doug Fowler

High Throughput Sequencing the Multi-Tool of Life Sciences. Lutz Froenicke DNA Technologies and Expression Analysis Cores UCD Genome Center

Plant Breeding and Agri Genomics. Team Genotypic 24 November 2012

Course Presentation. Ignacio Medina Presentation

GENOTYPING-BY-SEQUENCING USING CUSTOM ION AMPLISEQ TECHNOLOGY AS A TOOL FOR GENOMIC SELECTION IN ATLANTIC SALMON

SNP calling and VCF format

SNP calling. Jose Blanca COMAV institute bioinf.comav.upv.es

RNA-Sequencing analysis

Genomics and Transcriptomics of Spirodela polyrhiza

Genomic Technologies. Michael Schatz. Feb 1, 2018 Lecture 2: Applied Comparative Genomics

How much sequencing do I need? Emily Crisovan Genomics Core

Introduction to RNA-Seq in GeneSpring NGS Software

Marker types. Potato Association of America Frederiction August 9, Allen Van Deynze

How much sequencing do I need? Emily Crisovan Genomics Core September 26, 2018

DE NOVO WHOLE GENOME ASSEMBLY AND SEQUENCING OF THE SUPERB FAIRYWREN. (Malurus cyaneus) JOSHUA PEÑALBA LEO JOSEPH CRAIG MORITZ ANDREW COCKBURN

Biology 163 Laboratory in Genetics Midterm 2, Nov. 14, Honor Pledge: I have neither given nor received any unauthorized help on this exam:

Illumina s Suite of Targeted Resequencing Solutions

Course Overview: Mutation Detection Using Massively Parallel Sequencing

Structure, Measurement & Analysis of Genetic Variation

Services Presentation Genomics Experts

Sequence assembly. Jose Blanca COMAV institute bioinf.comav.upv.es

Crash-course in genomics

Deep Sequencing technologies

Linking Genetic Variation to Important Phenotypes: SNPs, CNVs, GWAS, and eqtls

Multiplex Assay Design

Midterm 1 Results. Midterm 1 Akey/ Fields Median Number of Students. Exam Score

G E N OM I C S S E RV I C ES

SolCAP. Executive Commitee : David Douches Walter De Jong Robin Buell David Francis Alexandra Stone Lukas Mueller AllenVan Deynze

Single Nucleotide Variant Analysis. H3ABioNet May 14, 2014

Next Gen Sequencing. Expansion of sequencing technology. Contents

ChIP-seq and RNA-seq

Next-generation sequencing technologies

DNA. bioinformatics. genomics. personalized. variation NGS. trio. custom. assembly gene. tumor-normal. de novo. structural variation indel.

A draft sequence of bread wheat chromosome 7B based on individual MTP BAC sequencing using pair end and mate pair libraries.

Why are we here? Introduction

DNA concentration and purity were initially measured by NanoDrop 2000 and verified on Qubit 2.0 Fluorometer.

Introduction to the MiSeq

Read Mapping and Variant Calling. Johannes Starlinger

Genome Projects. Part III. Assembly and sequencing of human genomes

ChIP-seq and RNA-seq. Farhat Habib

The Expanded Illumina Sequencing Portfolio New Sample Prep Solutions and Workflow

Maximizing your NGS sequencing with IDT. Adam Chernick, PhD Field Applications Manager, Functional Genomics

Surely Better Target Enrichment from Sample to Sequencer

De novo whole genome assembly

Introduction to Next Generation Sequencing (NGS) Andrew Parrish Exeter, 2 nd November 2017

UHT Sequencing Course Large-scale genotyping. Christian Iseli January 2009

Add 2016 GBS Poster As Slide One

Data Basics. Josef K Vogt Slides by: Simon Rasmussen Next Generation Sequencing Analysis

resequencing storage SNP ncrna metagenomics private trio de novo exome ncrna RNA DNA bioinformatics RNA-seq comparative genomics

Molecular Markers CRITFC Genetics Workshop December 9, 2014

Before starting, write your name on the top of each page Make sure you have all pages

DEVELOPMENT OF OPTIMIZED FIBER FEEDSTOCKS THROUGH APPLIED GENOMICS MICHAEL DEYHOLOS UNIVERSITY OF BRITISH COLUMBIA (OKANAGAN CAMPUS)

Introduction to Bioinformatics

Introducing combined CGH and SNP arrays for cancer characterisation and a unique next-generation sequencing service. Dr. Ruth Burton Product Manager

INTRODUCTION TO MOLECULAR GENETICS. Andrew McQuillin Molecular Psychiatry Laboratory UCL Division of Psychiatry 22 Sept 2017

Concepts of Genetics Ninth Edition Klug, Cummings, Spencer, Palladino

Whole Genome Sequencing. Biostatistics 666

THE HEALTH AND RETIREMENT STUDY: GENETIC DATA UPDATE

About Strand NGS. Strand Genomics, Inc All rights reserved.

Transcription:

Functional genomics to improve wheat disease resistance Dina Raats Postdoctoral Scientist, Krasileva Group

Talk plan Goal: to contribute to the crop improvement by isolating YR resistance genes from cultivated wheat Forward screens for Yellow rust resistance in large Kronos TILLING population APR phenotyping in US and UK field trails Seedling phenotyping in CER Speed breeding for development of bi-parental mapping populations Mapping-by-sequencing with newly developed EI resources and HPC for in wheat: CS42 TGACv1 genome assembly and annotation Kronos genome assembly DRAGEN BioIT processor

Yellow rust disease of wheat Susceptible Resistant

Mapping-by-sequencing strategy R- bulk S- bulk Mapping to reference genome Resistant F1/F2 Susceptible Wild type selfing Exome capture Illumina sequencing ATGT TACT CGTG ATGC TGTT AGTG F3 Phenotyping Mutation identification ATGT TACT CGTG ATGC TGTT AGTG Allelic frequency scoring, mapping region of interest Resistant F2 bulk Susceptible F2 bulk

Kronos TILLING population EMS treatment G to A and C to T base changes www.dubcovskylab.ucdavis.edu www.wheat-tilling.com M1 M2 Krasileva et al, PNAS, 2017 Genomic DNA Over 1,500 lines and seed

Yellow rust resistance field trails Mutant lines with APR in 2 environments: California, US (UCD) and Norwich, Norfolk (EI) Mutant APR Mutant Increased APR Susceptible

Seedling resistance Resistant Mutant Susceptible Kronos WT Mutant with heterozygous resistant phenotype

Mapping populations development Resistant Mutant X Kronos WT Speed-breeding (Riaz et al. Plant Methods (2016)) Cross -> F1 -> F2 -> F3 (~9 month) 7 July 2016 7 August 2016 phenotyping field growth chambers

Chinese Spring 42 assembly and annotation Open access in Genome Research http://genome.cshlp.org/content/early/2017/04/04/gr.217117.116.abstract The reference sequence and annotation: http://opendata.earlham.ac.uk/triticum_aestivum/tgac/v1/annotation/ http://plants.ensembl.org/triticum_aestivum/info/index M. Clark The RNA sequencing reads can be downloaded from ENA (project PRJEB15048): http://www.ebi.ac.uk/ena/data/view/prjeb15048

W2RAP w2rap PCR-free PE Nextera LMP B. Clavijo From raw Illumina data to Scaffolds LMP Long (improved mate-pair method lib. by EI) PCR-free 2x250bp (~700bp) w2rap-contigger Contigs (includes pe-gaps) Nextera LMP pre-processing Accuracy and contiguity Metrics and traceability all along Available on github. SOAPdenovo scaffolding N-stretches re-mapping Scaffolds Do-it-yourself! https://github.com/bioinfologics/w2rap Clavijo et al - available on github ( https://github.com/bioinfologics/w2rap-contigger ) - on preparation / soon on Biorxiv

Improving genome annotation 217,907 loci 104,091 high-confidence protein coding genes. High quality: RNA-seq with PacBio strand specific Illumina data D. Swarbreck Tools developed to improve transcript reconstruction: Genomic assembly PacBio Isoseq Illumina strand specific Cross species proteins ALIGNMENT Splice junction quality filtering with Portcullis Splice junctions filtering https://github.com/maplesond/portcullis REPEAT IDENTIFICATION TRANSCRIPT ASSEMBLY AND SELECTION multiple assembly methods Selection by Mikado GENE PREDICTOR TRAINING GENE PREDICTION GENE MODEL REFINEMENT + ADDITION OF SPLICE VARIANTS GENE CONFIDENCE CLASSIFICATION https://github.com/lucventurini/mikado Scores transcripts qualities Robustly integrate multiple RNA-Seq assemblies Detects and resolves chimeric transcripts FUNCTIONAL ANNOTATION

APR mapping in selected mutant selfing F1 selfing F2 selfing F3 phenotyping of 30 progenies J Hegarty Resistant mutant Susceptible Kronos wt F2 homozygous resistant F2 homozygous susceptible

APR mapping in selected mutant 48 individually barcoded libraries 16-plex pools 82.4 Mb exome-capture design (Krasileva et al, Genome Biology 2013) 3 Illumina HiSeq 2500 lanes mapping to CS42 TGAC and Kronos (bwa and Dragen bwa) variant calling (GATC, Dragen GATC and Freebayes) Identification of chromosomal region VEP analysis -> causative mutation

Mapping and variant calling with CS CS reference Signal EMS mutations Kronos wild type mutant 1 mutant 2 Noise Paralog/homoeolog SNPs Varietal SNPs PCR errors Off target repeats mutant 3

Assembling multiple Wheat genomes

Mapping and variant calling with Kronos ref Kronos reference Signal EMS mutations Kronos wild type mutant 1 mutant 2 Noise Paralog/homoeolog SNPs PCR errors Off target repeats mutant 3

2.3billion reads 2x126bp Pipeline Quality control Timescale HPC hours / sample C Schudoma Mapping to reference 5 min / sample days / sample Mutation Detection 5 min / sample 2-3 days Filtering and Allele Frequency analysis <1h Putative region Dynamic Read Analysis for Genomics The DRAGEN Processor uses a field-programmable gate array (FPGA) to provide hardware-accelerated implementations of genome pipeline algorithms

Chromosomal region and Putative causative mutations 45 F2 samples, 3 Kronos wt 180 SNPs on 42 scaffolds 2 genetic bins (TGACv1 assembly mapped to POPSEQ Chapman et al. Genome Biology (2015)) 2.5 cm 120 SNPs in High Confidence genes TGACv1 CS annotation VEP analysis -> Putative causative mutations: 18 genes - missense variant Cysteine-rich-receptor-like-protein -kinase Protein-kinase-family-protein CLV1-receptor-kinase-like-protein

Validation Refining genetic map by screening of additional phenotyped F2 with KASPs markers Backcrosses Kronos wt other susceptible tetraploid cultivars

Acknowledgments Ksenia Krasileva Christian Schudoma Andrew Deatker Amelie Heckmann Anthony Hall Matt Clark Bernardo Clavijo David Swarbreck Luca Venturini Federica di Palma Rob Davey Leah Clissold Jorge Dubcovsky Joshua Hegarty

Kronos reference Kronos wild type mutant 1 mutant 2 mutant 3 Filtering IncredibleBulk.py : Only EMS (GA/CT) mut polymorphic between Kronos wt and Kronos Mut R bulk mut allele keep S bulk mut allele out 30x on site coverage for each line in a bulk HomMut ratio 0.9 (2 reads)