INTRODUCTION TO 'NEXT GENERATION SEQUENCING' TECHNOLOGIES


Bioinformatics for Biomedical Research (Bioinformàtica per a la Recerca Biomèdica). Ricardo Gonzalo Sanz, ricardo.gonzalo@vhir.org, 14/12/2016

1. Introduction to NGS
2. First Generation Sequencing
3. Second Generation Sequencing
4. Third Generation Sequencing
5. Sequencing generation face to face
6. Applications of NGS techniques
7. A (very) brief introduction to DoE

1. Introduction to NGS Diagnosis/Prognosis, Treatment, Prevention. What happens? Why does it happen?

1. Introduction to NGS Diagnosis
- Signs
- Symptoms
- Medical examination
- Medical tests
Many medicines are not as effective as expected in all patients.
Some patients may suffer serious adverse reactions.

1. Introduction to NGS Personalized medicine era
- The right therapeutic strategy for the right person at the right time
- Predisposition to disease
- Early and targeted prevention
https://iipm.medicinedept.iu.edu/
Biomarker identification:
- Diagnostic
- Susceptibility/risk (prevention)
- Prognostic (indolent vs. aggressive)
- Predictive (response)

1. Introduction to NGS
Genomics is a branch of genetics that enables the study of the genomes of whole organisms. It differs from classical genetics in that it considers the hereditary material of an organism globally rather than one gene or one gene product at a time.
[Figure: Gregor Mendel]

1. Introduction to NGS Deoxyribonucleic acid (DNA)

1. Introduction to NGS Genome sequencing
Genome sequencing is figuring out the order of DNA nucleotides, or bases, in a genome: the order of As, Cs, Gs, and Ts that make up an organism's DNA.

1. Introduction to NGS Why is genome sequencing so important?
- Sequencing the genome is an important step towards understanding it.
- The genome sequence will help scientists find genes much more easily and quickly.
- Studying the entire genome sequence will help them understand how the genome as a whole works: how genes work together to direct the growth, development and maintenance of an entire organism.
- Genes account for less than 25 percent of the DNA in the genome, so knowing the entire genome sequence will help scientists study the parts of the genome outside the genes.

1. Introduction to NGS Human Genome Project
The Human Genome Project (HGP) was the international, collaborative research program whose goal was the complete mapping and understanding of all the genes of human beings.
- Began in 1990
- First draft in February 2001
- Full sequence in April 2003
- It catalyzed the development and improvement of sequencing techniques

1. Introduction to NGS Consortia-based projects

1. Introduction to NGS
Human genome: 3,234.83 Mb (haploid). Cost of the Human Genome Project: $2.7 billion.
[Figure: timeline of 1st, 2nd and 3rd generation sequencing]
http://www.ipc.nxgenomics.org/newsletter/no11.htm

1. Introduction to NGS Cost of sequencing
The cost depends on:
- Size of the genome
- Whole genome, exome or de novo sequencing
- Data quality
Moore's Law: the number of transistors in an integrated circuit doubles approximately every two years (1965).

1. Introduction to NGS Cost of sequencing
Cost of a human genome sequence:
- HGP: $0.5-1 x 10^9
- 2006: $14 x 10^6
- 2016: $1,000-4,000
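
To put the cost figures above next to Moore's Law, the comparison can be sketched as a small calculation. This is only an illustration: it assumes that a "Moore's Law pace" for cost means halving every two years, which is an interpretation and not something stated on the slide. The function names are made up for this example.

```python
# Cost figures quoted on the slide (US dollars per human genome).
COST_2006 = 14e6
COST_2016_LOW, COST_2016_HIGH = 1_000, 4_000


def moore_fold_change(years: float, doubling_time: float = 2.0) -> float:
    """Fold improvement expected if performance doubles every `doubling_time` years."""
    return 2 ** (years / doubling_time)


def observed_fold_change(cost_start: float, cost_end: float) -> float:
    """Fold reduction in cost between two time points."""
    return cost_start / cost_end


if __name__ == "__main__":
    years = 2016 - 2006
    print(f"Moore's Law pace over {years} years: {moore_fold_change(years):.0f}-fold")
    low = observed_fold_change(COST_2006, COST_2016_HIGH)
    high = observed_fold_change(COST_2006, COST_2016_LOW)
    print(f"Observed cost reduction: {low:.0f}- to {high:.0f}-fold")
```

Under that assumption, ten years at a Moore's Law pace gives a 32-fold improvement, whereas the slide's figures correspond to a 3,500- to 14,000-fold drop in cost.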

2. First Generation Sequencing Sanger sequencing
Sanger sequencing is a method of DNA sequencing based on the selective incorporation of chain-terminating dideoxynucleotides by DNA polymerase during in vitro DNA replication.
Developed by Frederick Sanger and colleagues in 1977, it was the most widely used sequencing method for approximately 25 years.
Frederick Sanger received two Nobel prizes (in the same category), for his work on protein sequencing and on DNA sequencing.
http://www.yourgenome.org/stories/third-generation-sequencing

2. First Generation Sequencing Sanger sequencing. How does it work?
- A DNA polymerase (DNAp) is used to replicate a single-stranded DNA (ssDNA) template.
- The reaction mix contains both normal and modified nucleotides.
- Random incorporation of a modified nucleotide stops the synthesis reaction.
- Each generated fragment has a different length, which can be distinguished by gel electrophoresis.
- Read length: <1 kb
https://www.youtube.com/watch?v=vk-hlmaitne
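
The chain-termination principle described above can be illustrated with a toy simulation. This is a minimal sketch, not a model of real chemistry: the termination probability (`dd_fraction`), the number of molecules and the function names are all assumptions chosen for the example, and the template is read in the direction in which the polymerase travels.

```python
import random

COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def sanger_fragments(template, dd_fraction=0.1, n_molecules=5000, seed=0):
    """Simulate chain termination; return a set of (length, terminal_base) pairs."""
    rng = random.Random(seed)
    fragments = set()
    for _ in range(n_molecules):
        for position, base in enumerate(template, start=1):
            new_base = COMPLEMENT[base]
            # With probability dd_fraction a ddNTP is incorporated and synthesis stops.
            if rng.random() < dd_fraction:
                fragments.add((position, new_base))
                break
    return fragments


def read_gel(fragments):
    """Order fragments by length (as electrophoresis does) and read the terminal bases."""
    return "".join(base for _, base in sorted(fragments))


if __name__ == "__main__":
    print(read_gel(sanger_fragments("GATACGAGC")))
```

Because every fragment length is sampled many times, sorting by length and reading the labelled terminal base recovers the synthesized strand; for the template GATACGAGC that strand is CTATGCTCG.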

3. Second Generation Sequencing Why second generation sequencing?
One of the main problems of Sanger sequencing was dealing with larger sequence output:
- the use of gels or polymers as separation media
- the limited number of samples that could be handled in parallel
- difficulties with complete automation of sample preparation
These limitations triggered efforts to develop new techniques.

3. Second Generation Sequencing Main characteristics of NGS:
- High speed and throughput
- Shorter reads
- More accuracy
- Much higher degree of sequence coverage
- Easier assemblies of the data generated
- Huge data storage demands

3. Second Generation Sequencing Second generation instruments (high throughput and benchtop; "x" as marked on the slide)
- Roche: GS FLX+ 454 (x), GS Junior 454 (x)
- Illumina: NextSeq 500, HiSeq, HiSeq X Ten, MiniSeq
- Life Technologies: SOLiD 5500xl (x), Ion S5 and S5 XL, Ion PGM

3. Second Generation Sequencing Basic NGS workflow
1. Library preparation: the library is prepared by random fragmentation of the DNA or cDNA sample, followed by 5' and 3' adapter ligation. Adapter-ligated fragments are then PCR amplified and gel purified.
2. Clonal amplification: each DNA fragment is amplified millions of times.
3. Sequencing: the incorporated nucleotides are read by the detector.

3. Second Generation Sequencing Library preparation:
i. Fragmenting/selecting the target sequences:
   - Physical (acoustic shearing, sonication, e.g. Covaris)
   - Enzymatic (e.g. Nextera)
   - Chemical (less frequent)
ii. Converting the target to dsDNA (if necessary)
iii. End repair and adaptor ligation (with barcodes for multiplexing)
iv. Quantification and QC of the final library product

3. Second Generation Sequencing Library preparation:
- Barcodes are useful for multiplexing samples.
- Depending on the final application and the sequencing method, different kits are available.
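
How barcodes allow several samples to share one run can be sketched as follows. This is a simplified illustration: the barcode sequences and sample names are hypothetical (they do not come from any real kit), the barcode is assumed to sit at the start of the read, and no mismatches are tolerated.

```python
from collections import defaultdict

# Hypothetical barcodes, one per sample (not taken from any real kit).
BARCODES = {"ACGT": "sample_1", "TGCA": "sample_2"}


def demultiplex(reads, barcodes=BARCODES):
    """Assign each read to a sample by its leading barcode and trim the barcode."""
    length = len(next(iter(barcodes)))  # assumes all barcodes share one length
    by_sample = defaultdict(list)
    for read in reads:
        sample = barcodes.get(read[:length], "undetermined")
        # Keep unassigned reads untrimmed so they can be inspected later.
        by_sample[sample].append(read[length:] if sample != "undetermined" else read)
    return dict(by_sample)


if __name__ == "__main__":
    print(demultiplex(["ACGTGGGA", "TGCATTTC", "NNNNAAAA"]))
```

Production demultiplexers usually also allow a small number of barcode mismatches; that is left out here to keep the idea visible.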

3. Second Generation Sequencing Clonal amplification: emPCR-based systems (Ion PGM, 454)
- High-speed shaker
- 1 starting effective fragment per microreactor
- ~10^6 microreactors per ml
- All processed in parallel (clonal amplification)

3. Second Generation Sequencing Clonal amplification:
Aim:
- No empty beads
- No beads containing more than one amplified fragment
1) Titration: a constant number of beads vs. different starting quantities of DNA
2) Optimal enrichment: one single fragment amplified millions of times on one single bead
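
The reason for the titration step can be made concrete with a small calculation. The sketch below assumes that fragments are distributed over beads at random, so that the number of fragments per bead follows a Poisson distribution; this is a common modelling assumption and is not stated on the slide.

```python
import math


def bead_loading(mean_fragments_per_bead):
    """Fractions of empty, single-fragment and multi-fragment beads (Poisson model)."""
    lam = mean_fragments_per_bead
    empty = math.exp(-lam)
    single = lam * math.exp(-lam)
    multiple = 1.0 - empty - single
    return {"empty": empty, "single": single, "multiple": multiple}


if __name__ == "__main__":
    for lam in (0.1, 0.5, 1.0, 2.0):
        f = bead_loading(lam)
        print(f"mean={lam}: empty={f['empty']:.2f} "
              f"single={f['single']:.2f} multiple={f['multiple']:.2f}")
```

Under this model, too little DNA leaves most beads empty and too much DNA produces mixed (non-clonal) beads, which is why the DNA input is titrated against a fixed number of beads.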

3. Second Generation Sequencing Clonal amplification: Bridge PCR (Illumina)

3. Second Generation Sequencing Sequencing: Roche 454 (pyrosequencing, sequencing by synthesis)
- Metal-coated PicoTiterPlate (PTP) reduces crosstalk
- 29 μm well diameter (20/bead)
- 3,400,000 wells per PTP

3. Second Generation Sequencing Sequencing: Roche 454 (pyrosequencing, sequencing by synthesis)
A CCD camera records a flowgram (signal intensity is proportional to the number of nucleotides incorporated in the sequence).
- Throughput limited by the number of wells in the PTP
- Errors in homopolymers
- Long sequences (up to 1000 bp)
- Low throughput, very expensive reagents
- Required for some specific applications, advisable for others (de novo sequencing)
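
The relationship between flowgram signal and sequence, and the reason homopolymers are error-prone, can be sketched as below. The flow order `TACG`, the example signal values and the simple rounding rule are assumptions made for illustration; real base callers use more elaborate signal models.

```python
def decode_flowgram(signals, flow_order="TACG"):
    """Turn per-flow signal intensities into a base sequence.

    Each flow adds one nucleotide type; the signal is roughly proportional
    to how many copies of that base were incorporated in a row.
    """
    bases = []
    for i, signal in enumerate(signals):
        base = flow_order[i % len(flow_order)]
        count = max(0, int(signal + 0.5))  # round to the nearest homopolymer length
        bases.append(base * count)
    return "".join(bases)


if __name__ == "__main__":
    flows = [1.0, 0.1, 2.1, 0.0, 0.1, 0.9, 0.1, 3.4]
    print(decode_flowgram(flows))
```

With these example values the last flow (3.4) is rounded to three Gs; a slightly noisier signal of 3.6 would be called as four, which is how homopolymer length errors arise.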

3. Second Generation Sequencing Sequencing: Illumina (dye-terminator nucleotides, sequencing by synthesis)
- Limited by the fragment length that can effectively bridge
- Labelled nucleotides are not incorporated as efficiently as native ones
- Short sequences (improving, especially on the MiSeq: PE 2x300 bp)
- Strand-specific errors, substitutions towards the end of the read, base substitution errors (systematic error GGT > GGG)
- Scalable set of machines suitable for nearly all applications
- High throughput, expensive machines, acceptable cost per Mb

3. Second Generation Sequencing Illumina workflow. https://www.youtube.com/watch?annotation_id=annotation_1533942809&feature=iv&src_vid=hmycqwhw B8E&v=fCd6B5HRaZ8

3. Second Generation Sequencing Sequencing: Ion S5/PGM
- Beads containing the clonally amplified library are loaded onto the chip.
- The chip is run in the machine.

3. Second Generation Sequencing Sequencing: Ion Torrent https://www.youtube.com/watch?v=wybzbxifuks

3. Second Generation Sequencing Small NGS platforms
- GS Junior Plus (Roche): read length 700 bp; output 70 Mb; running time 18 hours
- Ion Torrent PGM (Life Technologies): read length 35 to 400 bp; output 30 Mb to 2 Gb; running time 2.3-4.4 hours
- MiSeq (Illumina): read length 2x150 bp; output 0.6-7.5 Gb; running time 4-24 hours
Applications: amplicon sequencing, targeted sequencing, metagenomics (16S), small genome sequencing (e.g. bacteria, fungi), RNA profiling
Source: supplier webpages

3. Second Generation Sequencing High throughput NGS platforms
- GS FLX+: read length 700 bp; output 1 Mb; running time 23 hours
- Ion S5 XL: read length up to 200 bp; output 10-15 Gb; run time 2.5 hours
- HiSeq 4000: read length 2x150 bp; output 125-1500 Gb; run time <1-3.5 days
Applications: amplicon sequencing, targeted sequencing, metagenomics (16S), genomes, exomes, transcriptomes
Source: supplier webpages

4. Third Generation Sequencing TGS: single molecule sequencing
Advantages:
- Less sample preparation (no PCR)
- No amplification
- No PCR errors
- Fewer contamination issues
- No GC bias
- Analyze every sample (unPCRable, unclonable)
- Analyze low quality DNA (forensic and archaeological samples)
- Absolute quantification
- Sequence RNA directly

4. Third Generation Sequencing Helicos Genetic Analysis System
- Workflow similar to Illumina, but without bridge amplification
- Relatively slow and expensive
- Short reads
- Bankruptcy in 2012

4. Third Generation Sequencing SMRT (Pacific Biosciences)
- Quick library construction (6 hours)
- Run time: from 30 min to 6 hours
- Sequencing by synthesis
- No library amplification
- Long reads: 5 kb to 20 kb
- High error rate (but improving!)
Source: supplier webpage

4. Third Generation Sequencing SMRT (Pacific Biosciences) https://www.youtube.com/watch?v=nhcj8ptycfc

4. Third Generation Sequencing Oxford Nanopore
- Alpha-hemolysin: a heptameric protein with a pore of 1 nm inner diameter
- The pore diameter is on the same scale as DNA
- Protein nanopores can be adapted; the company has optimised their large-scale production
- DNA, RNA, miRNA and protein analysis
- Changes in the ionic current through the pore are measured

4. Third Generation Sequencing Oxford Nanopore
- Devices: SmidgION, MinION, PromethION
- Simple 10-minute sample prep available
- Read length: hundreds of kb
- Very fast, but high error rates
- Easy to use in the field (Ebola viruses sequenced in Guinea 2 days after sample collection; Quick J, 2016)

4. Third Generation Sequencing Oxford Nanopore https://www.youtube.com/watch?v=3uw22hbpak

4. Third Generation Sequencing Third generation instruments (high throughput and benchtop)
- Pacific Biosciences: Sequel System
- Oxford Nanopore Technologies: PromethION, GridION, MinION

5. Sequencing Generation face to face Speed variation by technology http://www.yourgenome.org/stories/third-generation-sequencing

5. Sequencing Generation face to face
First generation sequencing (FGS):
1. DNA fragmentation
2. Vector cloning, bacterial transformation and growth, DNA isolation and purification
3. Sequencing with chain-terminating ddNTPs (ssDNA, DNA primer, polymerase, dNTPs, fluorescent ddNTPs)
4. Capillary electrophoresis (1 sequence per capillary; up to 384 DNA samples in a single run) and image processing, e.g. CTATGCTCG
Second generation sequencing (SGS):
1. DNA fragmentation
2. In vitro adaptor ligation + clonal amplification
3. Massively parallel sequencing
4. Image processing and data analysis
Third generation sequencing (TGS):
1. DNA fragmentation
2 and 3. In vitro adaptor ligation, NO AMPLIFICATION, massively parallel sequencing
4. Image processing and data analysis

5. Sequencing Generation face to face [Figure: release dates vs. machine outputs per run]

6. Applications of NGS techniques Whole Genome sequencing De novo sequencing Resequencing

6. Applications of NGS techniques Whole genome sequencing
Complete characterization of the entire genome. Beijing Genomics Institute ("Does it look cute? We'll sequence it").
- Until now, genome-wide association studies (microarray-based) were used to identify disease associations across the whole genome (4 million markers). Now it is better to sequence the whole genome (3.2 billion bases).
- It can be applied not only to human genomes but also to plant and microbial genomes.
- The rapid drop in sequencing costs allows researchers to sequence a genome quickly.
PROS:
- Global picture of the genome, no systematic loss of information
- Useful for diseases that involve multiple genetic phenomena
CONS:
- Can miss variants in exonic regions due to lower coverage
- With 2nd generation NGS, some regions cannot be sequenced or assembled (e.g. repetitive regions, GC-rich regions)
- More expensive and time-consuming

6. Applications of NGS techniques Expression analysis
- Until now, performed with microarray technology
- The sensitivity of sequence-based studies is limited by the depth of sequencing
- Different approaches can be used: targeted sequencing, exome sequencing, RNA-seq

6. Applications of NGS techniques Targeted sequencing
- Only a subset of genes or regions of the genome is isolated and sequenced.
- It focuses time, expense and data analysis on specific areas of interest.
- It enables sequencing at much higher coverage levels.
- Targeted sequencing panels can be purchased with preselected content or can be custom designed.

6. Applications of NGS techniques Exome sequencing
PROS:
- Cheaper and faster
- Simplified bioinformatics analysis
- (Often) useful for Mendelian disorders
- Increases coverage in coding regions: variants are more easily detected
CONS:
- Misses non-coding variation and structural variation
- Biases due to the library methodology used
- Sometimes the clinical data cannot be interpreted successfully

6. Applications of NGS techniques RNA sequencing (RNA-seq)
RNA-seq works by sequencing every RNA molecule and profiling the expression of a particular gene by counting the number of times its transcripts have been sequenced.
RNA-seq applications:
- Differential gene expression analysis (DGE)
- SNPs, splice variants (resolution at base level)
- Detection of novel transcripts and isoforms
- Transcriptomes of organisms not previously studied
- Mapping of exon/intron boundaries and splice junctions
- Detection of allele-specific expression patterns
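
The counting idea behind RNA-seq can be sketched in a few lines. This is a toy illustration: it assumes each read has already been assigned to a gene (or to `None` if unassigned), and it uses counts per million (CPM) as a simple library-size normalisation, which is a choice made for this example rather than something prescribed in the slides. The gene names are hypothetical.

```python
from collections import Counter


def count_reads(read_gene_assignments):
    """Count how many reads were assigned to each gene; None means unassigned."""
    return Counter(gene for gene in read_gene_assignments if gene is not None)


def counts_per_million(counts):
    """Scale raw counts to counts per million assigned reads (CPM)."""
    total = sum(counts.values())
    return {gene: n * 1e6 / total for gene, n in counts.items()}


if __name__ == "__main__":
    # Hypothetical gene assignments for five reads.
    assignments = ["geneA", "geneA", "geneB", None, "geneA"]
    counts = count_reads(assignments)
    print(counts)
    print(counts_per_million(counts))
```

Normalising by the total number of assigned reads makes samples sequenced to different depths comparable; differential expression tools apply more robust normalisations on top of this idea.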

6. Applications of NGS techniques RNA-seq: very high dynamic range (10^5 to 10^7)

6. Applications of NGS techniques Epigenomics
- Epigenome: the different chromatin states at the whole genome level
- Epigenetic modifications: histone variants, post-translational modifications of amino acids on the N-terminal tail of histones, covalent modifications of the DNA bases
Some approaches:
- ChIP-seq: transcription factor binding and histone modifications
- Bisulfite conversion: unmethylated C is converted to U. Single-CpG resolution. Gold standard for DNA methylation analysis.
- MeDIP: enrichment for the methylated/unmethylated fractions of the genome using 5-methylcytosine antibodies
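
The logic of bisulfite conversion can be illustrated with a toy example. The sketch assumes complete conversion, a single strand, and that the converted U is read as T after amplification and sequencing; positions are 0-based and the function names are invented for this example.

```python
def bisulfite_convert(sequence, methylated_positions):
    """Unmethylated C is converted (read as T after PCR); methylated C stays C."""
    methylated = set(methylated_positions)
    return "".join(
        "T" if base == "C" and i not in methylated else base
        for i, base in enumerate(sequence)
    )


def call_methylation(reference, converted_read):
    """For every C in the reference, report whether the read still shows a C."""
    return {
        i: read_base == "C"
        for i, (ref_base, read_base) in enumerate(zip(reference, converted_read))
        if ref_base == "C"
    }


if __name__ == "__main__":
    reference = "ACGTCGCA"
    read = bisulfite_convert(reference, methylated_positions=[1])
    print(read)
    print(call_methylation(reference, read))
```

Comparing the converted read with the reference shows, base by base, which cytosines were protected by methylation, which is what gives the method its single-CpG resolution.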

6. Applications of NGS techniques The promise of single cell sequencing...
- Ability to detect mutations in single cells in a population
- It could now be possible with TGS
- Tumour genomes are highly heterogeneous
- Haplotype determination (HLA) for bone marrow transplants
- Mosaicism in disease (Alzheimer's)
- Preimplantation genetic diagnosis
- Cell line heterogeneity
Nature Methods Vol. 11(1) 2014

6. Applications of NGS techniques Metagenomics
Metagenomics is a way to make an inventory of what (DNA) is present in a sample.
Two approaches:
- Sequence it all
- Focus on specific conserved sequences (ribosomal genes)
The technology facilitates the study of the consequences of environmental changes and of the causes of those changes.

7. A (very) brief introduction to DoE
The (statistical) design of experiments is an efficient procedure for planning experiments so that the data obtained can be analyzed to yield valid and objective conclusions.
Why are many life scientists so averse to thinking about design? It is common to think that time spent designing experiments would be better spent actually doing experiments.

7. A (very) brief introduction to DoE Types of variability at play in an experiment:
- Planned systematic variability: the differences in response between the treatments applied.
- Noise variability: random noise, i.e. the differences between two consecutive measurements. It cannot be avoided.
- Unplanned systematic variability: produces a systematic variation in the results whose cause is not known a priori. It can be avoided with randomization and local control.

7. A (very) brief introduction to DoE Important steps to define before beginning the experiment:
- Establish the main objectives of the experiment. Avoid collateral problems.
- Identify all the noise sources: treatment, experimental errors, etc.
- Allocate each experimental unit to a treatment.
- Clarify the type of response expected in each treatment.
- Determine the number of individuals in each group.
- Run a pilot study.
- Decide how the data will be statistically analysed.
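
The allocation step can be sketched as a randomization within blocks, which is one common way to combine randomization with local control. The block labels, unit identifiers and function name below are hypothetical, and the sketch assumes the number of units in each block is a multiple of the number of treatments.

```python
import random


def randomize_within_blocks(units_by_block, treatments, seed=None):
    """Randomly allocate units to treatments separately inside each block."""
    rng = random.Random(seed)
    allocation = {}
    for block, units in units_by_block.items():
        shuffled = list(units)
        rng.shuffle(shuffled)  # random order within the block
        for i, unit in enumerate(shuffled):
            # Cycling through the treatments keeps them balanced inside the block.
            allocation[unit] = treatments[i % len(treatments)]
    return allocation


if __name__ == "__main__":
    blocks = {"male": [1, 2, 3, 4], "female": [5, 6, 7, 8]}
    print(randomize_within_blocks(blocks, ["A", "B"], seed=1))
```

Each block ends up with the same number of units per treatment, so the blocking factor (here sex) cannot be confused with the treatment effect.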

7. A (very) brief introduction to DoE
Design 1: treatment is confounded with sex and batch
Sample  Treatment  Sex     Batch
1       A          Male    1
2       A          Male    1
3       A          Male    1
4       A          Male    1
5       B          Female  2
6       B          Female  2
7       B          Female  2
8       B          Female  2
Design 2: treatment is well balanced
Sample  Treatment  Sex     Batch
1       A          Male    1
2       A          Female  2
3       A          Male    2
4       A          Female  1
5       B          Male    1
6       B          Female  2
7       B          Male    2
8       B          Female  1
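
Whether a design is confounded can be checked by cross-tabulating treatment against each covariate. The sketch below encodes the two eight-sample designs from this slide as (treatment, sex, batch) tuples; the helper names and the strict definition of "balanced" (every treatment sees every level equally often) are assumptions made for this example.

```python
from collections import Counter

# The two designs from the slide, as (treatment, sex, batch) per sample.
CONFOUNDED = [("A", "Male", 1)] * 4 + [("B", "Female", 2)] * 4
BALANCED = [
    ("A", "Male", 1), ("A", "Female", 2), ("A", "Male", 2), ("A", "Female", 1),
    ("B", "Male", 1), ("B", "Female", 2), ("B", "Male", 2), ("B", "Female", 1),
]


def cross_tab(design, column):
    """Count samples per (treatment, level) pair; column 1 = sex, 2 = batch."""
    return Counter((row[0], row[column]) for row in design)


def is_balanced(design, column):
    """True if every treatment sees every level of the covariate equally often."""
    table = cross_tab(design, column)
    treatments = {row[0] for row in design}
    levels = {row[column] for row in design}
    counts = {table[(t, level)] for t in treatments for level in levels}
    return len(counts) == 1 and 0 not in counts


if __name__ == "__main__":
    for name, design in (("confounded", CONFOUNDED), ("balanced", BALANCED)):
        print(name, "sex:", is_balanced(design, 1), "batch:", is_balanced(design, 2))
```

In the first design every treatment A sample is male and in batch 1, so any difference between A and B could equally be due to sex or batch; in the second design each combination occurs twice.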

7. A (very) brief introduction to DoE What do you need to ask before starting an NGS experiment?
- What do I want to sequence? Whole genome, exome, metagenome, epigenome, RNA-seq...
- How many samples?
- What read length is required?
- What are the quality and quantity of the starting material?
- What is the size of the nucleic acids to sequence?
- How much sequence is needed (coverage)?
(Depth of) coverage: how many times a particular base is sequenced. 30x = each base has been read by 30 sequences (on average).
(Breadth of) coverage: the fraction of the target sequence that has been covered (at a given depth).
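
The two coverage definitions above can be made concrete with a short calculation. This is a minimal sketch: the expected depth uses the usual approximation (number of reads times read length divided by genome size), alignments are given as hypothetical 0-based half-open (start, end) intervals, and the genome size of 3.2 billion bases is the rounded figure used earlier in the slides.

```python
def expected_depth(n_reads, read_length, genome_size):
    """Average depth of coverage expected from a sequencing run."""
    return n_reads * read_length / genome_size


def depth_and_breadth(alignments, target_length, min_depth=1):
    """Mean depth and breadth of coverage over a target.

    alignments: list of (start, end) intervals, 0-based, end exclusive.
    Breadth is the fraction of positions covered at least min_depth times.
    """
    depth = [0] * target_length
    for start, end in alignments:
        for pos in range(max(start, 0), min(end, target_length)):
            depth[pos] += 1
    mean_depth = sum(depth) / target_length
    breadth = sum(d >= min_depth for d in depth) / target_length
    return mean_depth, breadth


if __name__ == "__main__":
    # 640 million reads of 150 bp on a 3.2 Gb genome
    print(expected_depth(640_000_000, 150, 3_200_000_000))
    # Three hypothetical reads aligned to a 20 bp target
    print(depth_and_breadth([(0, 6), (4, 10), (4, 8)], target_length=20))
```

The toy target shows why both numbers matter: a mean depth close to 1x can still leave half of the target without a single read.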