FGCZ NEWSLETTER FALL Next Generation Sequencing at the Functional Genomics Center Zurich

Size: px
Start display at page:

Download "FGCZ NEWSLETTER FALL Next Generation Sequencing at the Functional Genomics Center Zurich"

Transcription

1 FGCZ NEWSLETTER FALL 2011 newsletter Technologies, Applications, and Access to Support Next Generation Sequencing at the Functional Genomics Center Zurich OVERVIEW 1 NGS AT THE FGCZ Technologies and organization 2 NGS APPLICATIONS AND TECHNOLOGIES Applications in sequencing 3 PACIFIC BIOSCIENCES RS Single molecule sequencing 4 ION TORRENT PGM Fast turnover sequencing 5 ILLUMINA HISEQ2000 High throughput short reads 6 LIFE TECHNOLOGIES SOLID 5500XL Flexible short read sequencing 7 ROCHE/454 GS FLX+ Reliable long read sequencing 8 BIOINFORMATICS OF NGS Analysis workflows and support Next Generation Sequencing (NGS) has become one of the major technologies at the FGCZ. The impressive throughput and read lengths of the current high-end systems enable research groups to de novo sequence genomes of small to complex organisms. They allow for the analysis of genome alterations, gene expression and DNA modifications, as well as a constantly growing number of other applications. To account for the vast diversity of life science research at the ETH Zurich and the University of Zurich, the FGCZ has continuously extended its portfolio of NGS technologies and applications. The emphasis of this expansion has not only been on capacities but also on capabilities, leading to the establishment of a complete range of NGS technologies: Short read technologies (Life Technologies SOLiD and Illumina HiSeq) for the generation of very large numbers of short to medium length reads, long read technologies (Roche/454 GS FLX+) for the generation of medium to large numbers of long reads, plus the latest technologies focusing on the very rapid generation of sequencing data (Ion Torrent PGM) and very long reads and novel applications at the single molecule level (Pacific Biosciences RS) $$$2009 $ $2010 $ $ $2011 $ $ $ $ $ $2012 $$ The on site availability of these systems allows us to flexibly combine the different technologies within or between projects and opens up combinations of possibilities that ensure suitable analytical strategies tailored to the research project s needs. As a consequence of this flexibility and the number of technologies available, NGS projects need careful planning and consideration at multiple levels, including sample availability and quality, robustness of methods and protocols, processing and analysis of data, as well as financial and time constraints. This complexity emphasizes the need for a close collaboration of ETH and UZH research groups with analytical and bioinformatics experts of the FGCZ, which is why access to the FGCZ NGS platform is provided through the User Lab. In parallel to the technology expansion, significant efforts have been undertaken in increasing the User Lab staff working in the technologies and bioinformatics sections. As a result, optimized analytical protocols and data analysis workflows have been established that lead to significantly shortened turnover times from sample generation to data interpretation. Technologies and support modes at the FGCZ In the following, we briefly describe the available technologies (from most recent to most established) including supported applications. More information on the setup of NGS and access via the FGCZ User Lab and User Lab Services can be found on the FGCZ website at For further or more specific questions, please contact us at sequencing@fgcz.ethz.ch NEXT GENERATION SEQUENCING AT THE FGCZ 1

2 FGCZ NEWSLETTER FALL 2011 Applications and Technologies De novo sequencing of genomes and metagenomes De novo sequencing of transcriptomes Genome-wide SNP discovery and variant detection Transcriptome analysis Small RNAs ChIP-seq Amplicon sequencing DNA Methylation analysis Emphasis on longer reads, medium to high throughput * * * Roche GS FLX+ * * * Pacific Biosciences RS * * Illumina HiSeq (+++ in combination with PacBio or GS FLX+) * * Ion Torrent PGM (for smaller genomes) Emphasis on medium to longer reads, high to very high throughput * * * Illumina HiSeq * * * Roche GS FLX+ * * Ion Torrent PGM Emphasis on high to ultra-high throughput, short to medium read lengths * * * Illumina HiSeq (max. throughput) * * * LifeTech SOLiD (max. flexibility) * * Ion Torrent PGM (max. speed) Emphasis on high to ultra-high throughput, short to medium read lengths * * * Illumina HiSeq (max. throughput) * * * LifeTech SOLiD (max. flexibility) * * Ion Torrent PGM (max. speed) Emphasis on high to ultra-high throughput, short read lengths * * * Illumina HiSeq (max. throughput) * * * LifeTech SOLiD (max. flexibility) * * Ion Torrent PGM (max. speed) Emphasis on high to ultra-high throughput, short to medium read lengths * * * Illumina HiSeq (max. throughput) * * * LifeTech SOLiD (max. flexibility) * * Ion Torrent PGM (max. speed) Emphasis on longer reads, medium to high throughput * * * Roche GS FLX+ * * * Pacific Biosciences RS * * Illumina HiSeq (+++ in combination with PacBio or GS FLX+) * * LifeTech SOLiD (+++ in combination with PacBio or GS FLX+) * * Ion Torrent PGM Emphasis on flexibility and protocols * * * Pacific Biosciences RS * * * LifeTech SOLiD * * * Illumina HiSeq The list of applications and suitable technologies is not exhaustivee nor does it exclude the use of a specific analytical technology for applications mentioned.the purpose of the list is to provide an initial overview that is always refined based on individual needs during the setup phase of an FGCZ project. Technologies of equal suitability are listed in alphabetical order. NGS@FGCZ Technology Specifications Platforms PACIFIC BIOSCIENCES RS ION TORRENT PGM Library type * Library preparation Frag PCR free Single Molecule Real Time Sequencing Frag empcr Ion semiconductor sequencing Chemistry Run time Read length (bp) 24 hrs (90 min per SMRT cell) Throughput (Gb per run) (16 SMRT cells) Amount of input materials 1-10 µg DNA 2 hrs µg DNA µg poly(a)rna or rrna - depleted total RNA ILLUMINA HISEQ2000 LIFE TECHNOLOGIES SOLID 5500XL Frag, MP, PE Frag, MP, PE Solid phase empcr Reversible Terminator Sequencing by Ligation 10 days 2x µg DNA, µg total RNA 1 10 µg for small RNA 10 µg Single ChIP enriched DNA 10 days (Frag) (PE) 2x60 (MP) µg DNA µg poly(a)rna or rrna - depleted total RNA ROCHE/454 GS FLX+ Frag, MP empcr Pyrosequencing 23 hrs µg DNA 1-2 µg total RNA *! MP: mate pair; PE: paired-end The actual amount depends on the library type (and the number of needed SMRT cells in the case of PacBio). For accurate information please refer to each technology section. NEXT GENERATION SEQUENCING AT THE FGCZ 2

3 FGCZ NEWSLETTER - NGS TECHNOLOGIES FALL 2011 TECHNOLOGY Third generation sequencer using Single Molecule Real Time Sequencing Technology (SMRT). Very long reads up to 10 kb, average read length 1.5 kb. Sample preparation and sequencing are PCR free: sequence native DNA. APPLICATIONS AND MAIN USE Backbone sequencing De novo sequencing Methylation studies Small genomes de-novo sequencing and potential detection of DNA methylation at base resolution. Single Molecule Sequencing Pacific Biosciences RS The Pacific Biosciences RS system is a third generation sequencer that provides single molecule and real time sequencing technology (SMRT) based on an uninterrupted template-directed DNApolymerase synthesis. The RS gives the longest read length achievable to date: the average read length per run is currently around 1.5 kb, with instances of over 10,000 base pairs, which facilitates mapping and assembly. SMRT technology bears the potential to retrieve kinetic sequencing information of individual molecules, a method that is foreseen to allow for the direct identification of chemical modifications, such as methylation, at a single base resolution. The Pac Bio RS, in combination with massive high throughput sequencing (i.e. HiSeq 2000 or SOLiD 5500xl) provides an ideal solution to cover a wide range of applications. Technology The SMRT DNA sequencing technology is built upon three key innovations. The SMRT Cell, provides single molecule, real-time observation of individual fluorophores against a dense background of labeled nucleotides while maintaining a high signal-to-noise ratio. In contrast to using base-linked nucleotides, SMRT Sequencing uses phospholinkednucleotides (the fluorescent dye is attached to the poly-phosphate chain of the nucleotide). Through such attachment, incorporation of a phospholinked nucleotide by the DNA polymerase results in separation of the dye molecule from the nucleotide when the enzyme forms the nucleotides. A Real Time Detection platform provides single molecule, realtime detection as well as flexibility in run configurations and applications. DNA sequencing is performed on SMRT Cells, each containing thousands of zero-mode waveguides (ZMWs). A ZMW is a hole, tens of nanometers in diameter, fabricated in a 100 nm metal film deposited on a silicon dioxide substrate. Each ZMW becomes a nanophotonic visualization chamber providing a detection volume of just 2*10-20 liters. At this volume, the activity of a single molecule can be detected amongst a background of thousands of labeled nucleotides. Within each chamber, a single DNA polymerase molecule is attached to the bottom surface such that it permanently resides within the detection volume. Phospholinked nucleotides, each type labeled with a different colored fluorophore, are then introduced into the reaction solution. As the DNA polymerase incorporates complementary nucleotides, each base is held within the detection volume for tens of milliseconds. During this time, the engaged fluorophore emits fluorescent light whose color corresponds to the base identity. Then, as part of the natural incorporation cycle, the polymerase cleaves the bond holding the incorporation, the signal immediately returns to baseline and the process repeats. The DNA polymerase continues incorporating an average of 1-2 bases per second producing a long chain of DNA in minutes. Applications SMRT technology is ideal for applications such as de novo genome and transcriptome sequencing (especially in combination with short read sequencers). PacBio s long reads make the assembly of the genomic structure much easier, enabling a comprehensive view of the genome. SMRT technology is a PCR free approach: standard PacBio library preparation and sequencing methods do not include any amplification step. Native DNA is directly sequenced without PCR bias and potentially allows the identification of chemical base modifications, such as methylation. Performance The PacBio RS can process 16 SMRT cells per run in approximately 24 hours (90 minutes per SMRT cell). Currently every cell produces up to reads, with average read length of 1.5 kb with few reads hitting the 10 kb. With the next chemistry release (Q1, 2012) the average read length will be doubled to 3 kb. Input Requirements * 500 bp library: 1 µg high quality gdna * 2000 bp library: 3 µg high quality gdna * 8000 bp library: 8 µg high quality gdna PacBio RS at FGCZ The PacBio RS system is accessible through the FGCZ User Lab. For the time being, no User Lab Services are offered but projects including all analytical steps fluorophore in place and the dye diffuses are carried out in close collaboration phospho-diester bond between two out of the detection volume. Following between FGCZ experts and User Group researchers. More information: PacBioRS NEXT GENERATION SEQUENCING AT THE FGCZ 3

4 FGCZ NEWSLETTER - NGS TECHNOLOGIES FALL 2011 TECHNOLOGY Output depending on chip > 10 Megabases > 100 Megabases Read length up to 200 bases APPLICATIONS AND MAIN USE Small genomes sequencing Targeted re-sequencing (amplicon) Low complexity transcriptomes and small RNA sequencing Run Time < 2 Hours Personal Genome Machine PGM by Ion Torrent The Personal Genome Machine (PGM) Sequencer is a benchtop PostLight semiconductor-based platform that performs a sequencing run in approximately two hours. Technology The PGM advances Next-Generation sequencing to PostLight sequencing: the translation of chemical sequence information directly into digital form. The sequencing technology underlying the PGM exploits a well-characterized biochemical process: When a nucleotide is incorporated into a strand of DNA by a polymerase, a hydrogen ion (H+) is released as a byproduct. This hydrogen ion carries a charge which the PGM System s ion sensor essentially the world's smallest solid-state ph meter can detect. As the sequencer floods the chip with one nucleotide after another, any nucleotide added to a DNA template will be detected as a voltage change, and the PGM System will call the base. If a nucleotide is not a match for a particular template, no voltage change will be detected and no base will be called for that template. By eliminating the need for the optical system, the PGM provides sequencing that is simpler, faster, more cost effective, and more scalable than any other technology available. A principal component of the PGM is the sequencing chip. This microprocessor chip incorporates an extremely dense array of >1 million micro-machined wells married to our proprietary ion sensor. Each well contains a different DNA template, allowing massively parallel sequencing. Chips can scale in density for any application, from small, targeted experiments to large genomes. Applications The PGM system enables many possible sequencing applications. Performing multiplex amplicon sequencing is possible in a very short time. The 200bp reads make sequencing long amplicons possible, resulting in more simple and affordable projects. It's perfect platform to obtain optimal results for microbial sequencing projects with extraordinary uniformity of coverage and long read lengths. The four to eight million reads produced by the 316 and 318 chips are ideal for applications such as small RNA sequencing, transcriptome sequencing, and ChIP-Seq. Performance Performance of the system directly depends on the sequencing chip used. Outlook Ion Torrent recently launched a long read kit for the PGM sequencer with modal high-quality read lengths of 225 bases. Read lengths greater than 500 bases are feasible, as demonstrated by the generation of a perfect 525 bases read. A novel paired-end sequencing (PES) method was developed for the PGM sequencer that produces paired reads, each 100 bases in length (2 x 100). We anticipate that applying recent improvements in read length to the Ion PES protocol should produce paired reads of 200 bases in length (2 x 200), with the possibility of paired reads up to 400 bases in length (2 x 400). Input Requirements * DNA Seq - From 100 ng to 5 µg highquality RNA-free genomic DNA * RNA Seq - From 200 ng to 1 µ g poly(a)rna or rrna-depleted total RNA PGM at FGCZ The PGM system is accessible through the FGCZ User Lab Services. More information: PGM Ion Semiconductor Output Read Length Total Sequencing Time Sequencing Chip 314 > 10 Megabases < 2 Hours! 316 > 100 Megabases 318 > 1 Gigabase Accuracy Year 2011 >200 bp Year 2012 > 400 bp > 99.99% consensus accuracy and >99.5% raw accuracy NEXT GENERATION SEQUENCING AT THE FGCZ 4

5 FGCZ NEWSLETTER - NGS TECHNOLOGIES FALL 2011 TECHNOLOGY Sequence up to 16 independent lanes on 2 flow cells in one run 24 unique barcodes provided by Illumina for multiplexing for RNA and DNA libraries Up to 384 independent samples per run ~ 1.5 billion high-quality reads per flow cell (~180 million/lane) up to 600 Gb output per run single-read (50 or 100 bp) or paired-end sequencing (2 x 50 bp; 2 x 100bp) APPLICATIONS AND MAIN USE DNA Sequencing Gene Regulation Analysis Sequencing-Based Transcriptome Analysis SNP Discovery and Structural Variation Analysis Cytogenetic Analysis, DNA- Protein Interaction (ChIP-Seq) Sequencing-Based Methylation Analysis Small RNA Discovery and Analysis High Throughput Short Read Sequencing Illumina HiSeq 2000 The HiSeq2000 from Illumina is a high throughput sequencer that uses a massively parallel sequencing-bysynthesis approach to generate billions of bases of high quality sequence data. To date, it delivers the industry s highest throughput of quality filtered data at up to 55 Gb per day or 600 Gb per run with read lengths of 2X100 bp. The high throughput and read length of the system provides the coverage necessary for various next generation sequencing applications.. Technology Illumina is different from the SOLiD and 454 sequencers because it does not use beads for clonal amplification of f r a g m e n t s, i n s t e a d a f t e r l i b r a r y preparation, cluster or bridge PCR is performed directly on the flow cell. For sequencing, Illumina uses the s e q u e n c i n g b y s y n t h e s i s ( S B S ) technology. It is a proprietary reversible terminator-based method that enables detection of single bases as they are incorporated into growing DNA strands. A fluorescently-labeled terminator is imaged as each dntp is added and then cleaved to allow incorporation of the next base. Since all four reversible terminator-bound dntps are present during each sequencing cycle, natural competition minimizes incorporation bias. Applications SBS technology supports both single read and paired-end libraries. A wide array of available sample preparation methods serve to enable diverse applications, including: - DNA Sequencing - Transcriptome Analysis (RNA-Seq) - SNP Discovery and Structural Variation Analysis - Cytogenetic Analysis (copy number variation analysis, CNV) - DNA-Protein Interaction Analysis by - Chromatin Immunoprecipitation (ChIP- Seq) - Methylation Analysis - Small RNA Discovery and Analysis Performance The HiSeq 2000 can process 2 flow cells in parallel. Each flow cell is made of 8 lanes that can be run independently. The HiSeq 2000 can generate a maximum of 600 Gigabases of quality-filtered sequence data per run with read lengths of up to 2 x 100 base pairs, and provide up to three billion single-end reads and up to six billion paired-end reads. A run takes about ten days. Input Requirements * DNA Seq - From 1 µg high-quality RNAfree genomic DNA * RNA Seq - From 100 ng to 4 µg column purified, genomic DNA free total RNA * Small RNA Seq - From 1 to 10 µg of Trizol purified total RNA * Chip Seq -From 10 ng Single ChIP enriched DNA or input DNA HiSeq2000 at FGCZ The HiSeq2000 system is accessible through the FGCZ User Lab Services. More information: HiSeq2000 No. of Lanes No. of reads (Millions) Throughput (Gigabases) 2 X 100 sequencing 1 lane lanes (1 flow cell) lanes (2 flow cells) ! NEXT GENERATION SEQUENCING AT THE FGCZ 5

6 FGCZ NEWSLETTER - NGS TECHNOLOGIES FALL 2011 TECHNOLOGY Sequencing by ligation, two bases at a time Sequence up to 12 independent lanes on 2 flow chips in one run 96 unique barcodes Up to 200 Gb output per run APPLICATIONS AND MAIN USE DNA Sequencing (Fragment, Paired End, Mate-Paired) Whole Transcriptome Analysis Small RNA Expression SAGE - genome-wide expression Flexible read lengths Paired-end sequencing (75 x 35 bp) is available High Accuracy and Referencefree analysis with Exact Call Chemistry Module Flexible Short Read Sequencing Life Technologies SOLiD 5500xl The SOLiD 5500xl is a highly accurate, massively parallel nextgeneration sequencing platform that supports a wide range of applications. Sequencing lanes in the flow chip can be run independently with pay-per-use reagents. Multiplexing capability (96 barcodes) allows sequencing of multiple library types in a single run. Technology The major difference between SOLiD sequencing and other high-throughput sequencing platforms is its unique sequencing by ligation instead of sequencing by synthesis chemistry. The ligation is performed using specific, flourescently labeled octamer probes made up of 2 known bases at the 3 prime end followed by three degenerate bases then three universal bases. These probes are present simultaneously and compete for incorporation. After each ligation, the fluorescence signal is measured and then cleaved before another round of ligation takes place. A reset phase allows a reduction in noise - a capping step that prevents dephasing. Sequential ligations of fluorescently labeled probes detect every combination of two adjacent bases. Multiple cycles of ligation, detection and cleavage are performed, with the number of cycles determining the eventual read length. Following a series of ligation cycles, the extension product is removed and the t e m p l a t e i s r e s e t w i t h a p r i m e r complementary to the n-1 position for a second round of ligation cycles. Five rounds of primer reset are completed for each sequence tag. Through the primer reset process, virtually every base is interrogated in two independent ligation reactions by two different primers. Up to % accuracy is achieved with the Exact Call Chemistry Module by sequencing with an additional primer using a multi-base encoding scheme. Applications The sequencing by ligation technology supports both single read and paired-end libraries. A wide array of sample preparation methods serve to enable diverse applications. - Fragment Sequencing (from 35 bp to 75 bp) also with Barcodes (currently 96) - Paired End (50-75)/35 bp (96 barcodes) - Mate-Paired 2 x 60 bp - Whole Transcriptome Analysis (96 barcodes) - Small RNA Expression (96 barcodes) - SAGE - genome-wide expression Performance The SOLiD 5500xl can process 2 flow chips in parallel. Each flow cell is made of six lanes that can be run independently. SOLiD 5500xl can to generate up to 198 gigabases of raw sequence data per run with read lengths of up to 75 X 35 base pairs. A run takes about ten days. Input Requirements * DNA Fragment library ng of high quality column purified genomic DNA * Mate Pair library µg of high quality column purified genomic DNA * Targeted Enrichment (Sure Select) - 3 µg of high quality column purified genomic DNA * Chip-Seq - 20 ng -1 µg DNA * RNA seq ng of poly(a) RNA or 200 ng of rrna-depleted RNA * small RNA analysis µg of integral high quality total RNA SOLiD 5500xl at FGCZ The SOLiD 5500xl system is accessible through the FGCZ User Lab Services. More information: SOLiD5500xl No. of Lanes No. of reads (Millions) Throughput (Gigabases) 75 X 35 sequencing 1 lane lanes (1 flow chip) lanes (2 flow chips) ! NEXT GENERATION SEQUENCING AT THE FGCZ 6

7 FGCZ NEWSLETTER - NGS TECHNOLOGIES FALL 2011 TECHNOLOGY Over 1 Mio reads per run Consensus accuracy greater than % For shotgun sequencing: Average read length of 700 bp Up to 1000 bp reads APPLICATIONS AND MAIN ADVANTAGES De novo sequencing Re-sequencing Targeted re-sequencing (Amplicon) RNA analysis Metagenomics High Reliability Long Read Sequencing Roche/454 GS FLX+ 454 Sequencing uses a large- scale parallel pyrosequencing system, which is a bioluminescence method that relies on a single addition of a dntp by a DNA polymerase. Technology The system depends on fixing nebulised and adapter-ligated DNA fragments to small DNA-capture beads in a water-in-oil emulsion. The DNA fixed to these beads, ideally one DNA molecule per bead, is then singly clonally amplified by PCR. Each DNA-bound bead is placed into a ~29 µm well on a PicoTiterPlate (PTP), a titanium coated fiber optic slide. A m i x o f enzymes such as DNA polymerase, ATP sulfurylase, a n d luciferase are also packed into the well, the Emulsion two latter PCR facilitate for signal light amplification production. detection of the light generated from each PTP well undergoing the pyrosequencing reaction. Pyrosequencing, basically, measures the release of inorganic p y r o p h o s p h a t e b y p r o p o r t i o n a l l y converting it into light using a series of enzymatic reactions. The light signal generated by the enzymatic cascade is recorded as a series of peaks called a flowgram. Applications 454 sequencing enables: - De novo whole genome sequencing - RNA analysis - Re-sequencing of whole genomes and target DNA regions - Metagenomics The GS FLX+ system allows to perform straightforward de novo assembly to decode previously uncharacterized genomes or transcriptomes, or resequence organisms with an available reference. Whole genome sequencing projects may use shotgun reads alone or in combination with mate paired reads to generate accurate draft assemblies of any organism. The clonal nature of the 454 Sequencing System renders this platform particularly suitable for amplicon sequencing. This application allows unambiguous allele resolution of variation in complex regions of the genome along with quantitative detection of variants present in less than 1 % of a mixture. Performance The GS FLX+ System features the unique combination of long reads, exceptional accuracy and high-throughput, making the system well suited for varied genomic projects. Latest improvements in sequencing chemistry, instrumentation and software have lead to an increased read length of up to 1000 bp. Thus, this technology allows a comprehensive genome coverage and the exploration of the full range of genetic variation using long, highquality reads with a very high consensus accuracy (99.995% to %). The GS FLX+ System is designed to use with both, the new long-read Sequencing Kit XL+ and existing Sequencing Kit XLR70, each with their various features. Input Requirements For shotgun fragment libraries: * 1 µg for average read length of 700 bp * 500 ng for a shotgun fragment library of 400 bp read-length * For mate pair libraries, with insert sizes of 3 kb, 8 kb or 20 kb, we need a input of gdna of 5 µg, 15 µg or 30 µg. * For RNA analysis 1 to 2 µg of total RNA. GS FLX+ at FGCZ The GS FLX+ system is accessible through the FGCZ User Lab Services. More information: FLX+ The PTP is then mounted in a flow chamber, and individual dntps are flushed across the wells in a predetermined sequential order. The lower surface of the PTP is directly attached to a Sequencing Kit New GS FLX Titanium XL+ GS FLX Titanium XLR70 Read Length Up to 1'000 bp Up to 600 bp Mode Read Length 700 bp 450 bp Typical Throughput 700 Megabases 450 Megabases Reads per Run ~1 Million Shotgun ~1 Million Shotgun ~700'000 Amplicon Consensus Accuracy % % Run Time 23 Hours 10 Hours high-resolution CCD camera, which allows Multiplexing Multiplex Identifiers (MIDs): 132 Gaskets: 2, 4, 8, 16 Regions! NEXT GENERATION SEQUENCING AT THE FGCZ 7

8 FGCZ NEWSLETTER - NGS BIOINFORMATICS FALL 2011 DATA ANALYSIS WORKFLOWS De novo assembly Mapping to the reference genome SNP and INDEL detection Digital expression ChIP-seq region detection Turning Sequencing Reads into Knowledge NGS Bioinformatics NGS technology and the resulting data are uniquely suited for answering a wide range of biological questions by overcoming limitations that so far existed using classical sequencing or alternative microarray approaches. Due to the massive amount of data involved, the management and analysis of the data requires dedicated software and highperformance and capacity computing resources. The FGCZ has developed and productively implemented data processing pipelines for standard data analysis workflows, which can be readily applied to analyze the sequencing data generated at the center. In addition to standard tools and workflows, the bioinformatics team provides access to its expertise and the optional development of customized solutions via the FGCZ User Lab. As for the experimental analysis part, the analysis and interpretation of the resulting data relies on the close interaction of FGCZ staff with the users that is essential to generate scientifically relevant results. Standard Data Analysis Support As an integral part of every sequencing project, the FGCZ provides experimental design consultation during the setup phase of a project in conjunction with the discussion for selecting the most suitable analytical platform. The actual data analysis support following data production in a given project then significantly depends on the type of the study: For projects with a reference genome, the standard support includes: - files of raw reads - read alignment to the reference(s) - initial secondary analysis (e.g. SNP calling, expression quantitation, peak finding, and so on) - QC Report For de novo sequencing projects, the standard support includes: - files of raw reads - assembly of the raw reads with vendor assembler using default parameters The currently available standard data analysis workflows cover popular NGS applications, such as: - Re-sequencing, SNP discovery and variant detection - Transcriptome analysis and expression quantitation - R e g u l o m e a n a l y s i s ( s m a l l R N A, transcription factor binding, histone modification, methylation) Standard data analysis and support efforts in the form of a limited number of consulting hours are part of the User Lab Services and are free of charge. Analysis pipelines and consulting for additional applications can be added, based on demand and available resources. Customized Data Analysis Service A significant number of research projects using the large flexibility and many options of NGS show a distinct need for customized support or even the development of new data analysis procedures or tools. Depending on the availability of resources and subject to project-specific agreements, the FGCZ collaborates on non-standard data analysis, like for example, the discovery and in silico verification of new mirrnas. Training and Support In addition to project-specific data analysis, the FGCZ provides training and education at the conceptual and concrete software usage levels. Courses and tutorials are announced on the FGCZ Genomics Bioninformatics website. NEXT GENERATION SEQUENCING AT THE FGCZ 8

9 FGCZ NEWSLETTER - NGS BIOINFORMATICS FALL 2011 Example Workflows From Raw Data to Knowledge De novo assembly and annotation of genomes and transcriptomes De novo assembly and annotation of genomes have high demands on customization of the analysis workflows. While the assembly using default parameters will frequently yield a satisfactory result, an iterative approach with manual inspection and adaptation of assembly parameters considerably improves the assembly outcome. The annotation process has two components: structural annotation (identification of genomic elements) and functional annotation (linking biological information to the genomic elements). Both steps highly depend on the species sequenced and require close collaboration of the researchers with the FGCZ Bioinformaticians. Within such collaborations we provide sequence similarity and sequence domain searches to provide in silico predictions and functional annotations of genes. Additionally, we also bring our expertise into further analyses, such as in-depth analysis of gene families (e.g. secreted proteins, membrane proteins). RNA-seq analysis For RNA-seq samples, we map the reads to both the transcriptome and genome. For example, for human samples this will be the latest version of ENSEMBL transcripts and the hg19 assembly. Subsequently we use R/ Bioconductor to load the reads from the BAM alignment files and generate expression counts for genes. With the R language DESeq package, we compute the significantly differentially expressed genes. As an example for a recent project, RNA from four samples of human tissues were sequenced as pairedreads on a single slide of the SOLiD 5500xl. Following sequencing, the reads were mapped to the reference human genome (hg19), creating four sorted and indexed BAM files. Using the R/ Bioconductor library Rsamtools, and Ensembl definitions of gene regions, the mapped reads were transformed into counts for each gene. The counts table was analyzed with DESeq test to reveal significantly differentially expressed genes. The results were uploaded into a MetaCore workspace and functionally analyzed for the enrichment of pathways, gene networks and Gene Ontology terms. As an additional element, we searched the intergenic spaces for regions of previously unknown transcription and did in-depth analysis of splicing variants for the list of genes pre-selected by the users. ChIP-seq analysis A frequent goal of ChIP-seq is to identify active transcription factor binding sites or chromatin modifications. Here the standard workflow starts with mapping the reads of the IP and control samples to the corresponding reference genome using the default mapper (e.g. for SOLiD reads included in the LifeScope software package). Depending on whether the study searches for short IP-enriched regions (e.g. transcription factor binding sites) or extended enriched regions (e.g. promoter acetylation), we select different peak-finding strategies. For short peaks, we use the MACS software and for extended regions we use SICER. Additionally, we provide a third fully customizable analysis approach using the chipseq package in R/ Bioconductor. In all cases, we provide coverage plots for visual verification of identified peaks as well as interactive browsing using the Integrative Genomics Viewer (IGV). NEXT GENERATION SEQUENCING AT THE FGCZ 9

10 FGCZ NEWSLETTER - NGS BIOINFORMATICS FALL 2011 SOFTWARE TOOLS AND ACCESS TO RESOURCES Software available: NGS reads mapping and assembly Tag counting, quantification, profile pattern discovery Sequence analysis and annotation Databases available: Primary sequence database Protein sequence database Genome database Specialized Hardware available: Dedicated high memory machine (24 processor, 146 Gb RAM) with direct user access SGE cluster of 120 cores with restricted access Access to Computing and Software Resources for Users Access to and organization of data All NGS data is automatically stored in the FGCZ B-Fabric system and accessible via web interface. Sample meta-information and project information are available through the same interface to all members of the respective User Lab Services project. Access to Computing Resources While the high performance computing infrastructure for processing and basic analysis of NGS data is used exclusively within the established pipelines and by FGCZ bioinformatics staff, the center provides users access to a dedicated highperformance linux computer at the FGCZ for NGS data analysis. This way, users have access to a wide range of NGS data analysis software tools and databases. Computationally-intensive data analysis tasks can be scheduled by the FGCZ team on the FGCZ computing cluster. Bioinformatics Tools and Databases More than one hundred bioinformatics software packages and many standard life science databases are implemented and maintained at the FGCZ. Additional applications or databases can be hosted on request. Frequently used packages are: - Open source software: Abyss, amos, BEDTools, BLAST, BLAT, Bowtie, bwa, consed, FASTQC, GATK, InterProScan, JIGSAW, MACS, MAQ, mira, Mosaik, mothur, phrap, phred, picard, PyroNoise2, Qiime, R/Bioconductor, Samtools, SHRiMP, SICER, SOAPdenovo, Tophat, velvet - Commercial software: CASAVA, CLCBio Genomics Workbench, CLCBio NGS Cell, LifeScope, 454 Software Package, tmap Frequently used databases: - Standard sequence databases: NCBI nr, nt, est, cdd, SwissProt, PFAM, KEGG - Specialized databases: mirbase, 16S rdna databases (RDP, GreenGenes). FURTHER INFORMATION SEQUENCING CONTACT Via to sequencing@fgcz.ethz.ch or sequencing@fgcz.uzh.ch. FGCZ staff will initiate a meeting to discuss options and workflows, issue quotes, and advice on all aspects from study design to analysis to data interpretation. BIOINFORMATICS SUPPORT Bioinformatics consulting and support is an integral part of User Lab projects and User Lab Services. As resources are limited, discussion about the level of support and training offered to users is part of the project setup process. USER LAB AND SERVICES More information on the general setup of the FGCZ and its access modes can be found at Sources: Pictures of instruments, consumables and workflows have been retrieved from vendor sources. Text is in part based on informa>on materials of the vendors. Disclaimer: The technologies men>oned may be well used for applica>ons men>oned or non- men>oned also in the absence of an FGCZ recommenda>on: due to the large number of protocols and applica>ons possible, the FGCZ can only support a limited number of protocols and methods per plagorm and therefore limits the number of standard recommenda>ons. Before excluding op>ons, please consult with us. 10

Overview of Next Generation Sequencing technologies. Céline Keime

Overview of Next Generation Sequencing technologies. Céline Keime Overview of Next Generation Sequencing technologies Céline Keime keime@igbmc.fr Next Generation Sequencing < Second generation sequencing < General principle < Sequencing by synthesis - Illumina < Sequencing

More information

Matthew Tinning Australian Genome Research Facility. July 2012

Matthew Tinning Australian Genome Research Facility. July 2012 Next-Generation Sequencing: an overview of technologies and applications Matthew Tinning Australian Genome Research Facility July 2012 History of Sequencing Where have we been? 1869 Discovery of DNA 1909

More information

Deep Sequencing technologies

Deep Sequencing technologies Deep Sequencing technologies Gabriela Salinas 30 October 2017 Transcriptome and Genome Analysis Laboratory http://www.uni-bc.gwdg.de/index.php?id=709 Microarray and Deep-Sequencing Core Facility University

More information

Aaron Liston, Oregon State University Botany 2012 Intro to Next Generation Sequencing Workshop

Aaron Liston, Oregon State University Botany 2012 Intro to Next Generation Sequencing Workshop Output (bp) Aaron Liston, Oregon State University Growth in Next-Gen Sequencing Capacity 3.5E+11 2002 2004 2006 2008 2010 3.0E+11 2.5E+11 2.0E+11 1.5E+11 1.0E+11 Adapted from Mardis, 2011, Nature 5.0E+10

More information

High Throughput Sequencing Technologies. J Fass UCD Genome Center Bioinformatics Core Monday June 16, 2014

High Throughput Sequencing Technologies. J Fass UCD Genome Center Bioinformatics Core Monday June 16, 2014 High Throughput Sequencing Technologies J Fass UCD Genome Center Bioinformatics Core Monday June 16, 2014 Sequencing Explosion www.genome.gov/sequencingcosts http://t.co/ka5cvghdqo Sequencing Explosion

More information

High Throughput Sequencing Technologies. J Fass UCD Genome Center Bioinformatics Core Monday September 15, 2014

High Throughput Sequencing Technologies. J Fass UCD Genome Center Bioinformatics Core Monday September 15, 2014 High Throughput Sequencing Technologies J Fass UCD Genome Center Bioinformatics Core Monday September 15, 2014 Sequencing Explosion www.genome.gov/sequencingcosts http://t.co/ka5cvghdqo Sequencing Explosion

More information

Contact us for more information and a quotation

Contact us for more information and a quotation GenePool Information Sheet #1 Installed Sequencing Technologies in the GenePool The GenePool offers sequencing service on three platforms: Sanger (dideoxy) sequencing on ABI 3730 instruments Illumina SOLEXA

More information

Chapter 7. DNA Microarrays

Chapter 7. DNA Microarrays Bioinformatics III Structural Bioinformatics and Genome Analysis Chapter 7. DNA Microarrays 7.9 Next Generation Sequencing 454 Sequencing Solexa Illumina Solid TM System Sequencing Process of determining

More information

DNA-Sequencing. Technologies & Devices. Matthias Platzer. Genome Analysis Leibniz Institute on Aging - Fritz Lipmann Institute (FLI)

DNA-Sequencing. Technologies & Devices. Matthias Platzer. Genome Analysis Leibniz Institute on Aging - Fritz Lipmann Institute (FLI) DNA-Sequencing Technologies & Devices Matthias Platzer Genome Analysis Leibniz Institute on Aging - Fritz Lipmann Institute (FLI) Genome analysis DNA sequencing platforms ABI 3730xl 4/2004 & 6/2006 1 Mb/day,

More information

Welcome to the NGS webinar series

Welcome to the NGS webinar series Welcome to the NGS webinar series Webinar 1 NGS: Introduction to technology, and applications NGS Technology Webinar 2 Targeted NGS for Cancer Research NGS in cancer Webinar 3 NGS: Data analysis for genetic

More information

DNA-Sequencing. Technologies & Devices. Matthias Platzer. Genome Analysis Leibniz Institute on Aging - Fritz Lipmann Institute (FLI)

DNA-Sequencing. Technologies & Devices. Matthias Platzer. Genome Analysis Leibniz Institute on Aging - Fritz Lipmann Institute (FLI) DNA-Sequencing Technologies & Devices Matthias Platzer Genome Analysis Leibniz Institute on Aging - Fritz Lipmann Institute (FLI) Genome analysis DNA sequencing platforms ABI 3730xl 4/2004 & 6/2006 1 Mb/day,

More information

High Throughput Sequencing the Multi-Tool of Life Sciences. Lutz Froenicke DNA Technologies and Expression Analysis Cores UCD Genome Center

High Throughput Sequencing the Multi-Tool of Life Sciences. Lutz Froenicke DNA Technologies and Expression Analysis Cores UCD Genome Center High Throughput Sequencing the Multi-Tool of Life Sciences Lutz Froenicke DNA Technologies and Expression Analysis Cores UCD Genome Center Complementary Approaches Illumina Still-imaging of clusters (~1000

More information

Next Generation Sequencing Lecture Saarbrücken, 19. March Sequencing Platforms

Next Generation Sequencing Lecture Saarbrücken, 19. March Sequencing Platforms Next Generation Sequencing Lecture Saarbrücken, 19. March 2012 Sequencing Platforms Contents Introduction Sequencing Workflow Platforms Roche 454 ABI SOLiD Illumina Genome Anlayzer / HiSeq Problems Quality

More information

Next-generation sequencing Technology Overview

Next-generation sequencing Technology Overview Next-generation sequencing Technology Overview UQ Winter School 2018 Christopher Noune, PhD AGRF Melbourne christopher.noune@agrf.org.au What is NGS? Ion Torrent PGM (Thermo-Fisher) MiSeq (Illumina) High-Throughput

More information

Human genome sequence

Human genome sequence NGS: the basics Human genome sequence June 26th 2000: official announcement of the completion of the draft of the human genome sequence (truly finished in 2004) Francis Collins Craig Venter HGP: 3 billion

More information

RNA Sequencing. Next gen insight into transcriptomes , Elio Schijlen

RNA Sequencing. Next gen insight into transcriptomes , Elio Schijlen RNA Sequencing Next gen insight into transcriptomes 05-06-2013, Elio Schijlen Transcriptome complete set of transcripts in a cell, and their quantity, for a specific developmental stage or physiological

More information

Next Generation Sequencing (NGS)

Next Generation Sequencing (NGS) Next Generation Sequencing (NGS) Fernando Alvarez Sección Biomatemática, Facultad de Ciencias, UdelaR 1 Uruguay Montevide o 3 TANGO World Champ 1930 1950 (Maraca 4 Next Generation Sequencing module Next

More information

DNA-Sequencing. Technologies & Devices

DNA-Sequencing. Technologies & Devices DNA-Sequencing Technologies & Devices Genome analysis DNA sequencing platforms ABI 3730xl 4/2004 & 6/2006 1 Mb/day, 850 nt reads 2 Mb/day, 550 nt reads Roche/454 GS FLX 12/2006 800 Mb/23h, 800 nt reads

More information

Next Generation Sequencing. Tobias Österlund

Next Generation Sequencing. Tobias Österlund Next Generation Sequencing Tobias Österlund tobiaso@chalmers.se NGS part of the course Week 4 Friday 13/2 15.15-17.00 NGS lecture 1: Introduction to NGS, alignment, assembly Week 6 Thursday 26/2 08.00-09.45

More information

The Expanded Illumina Sequencing Portfolio New Sample Prep Solutions and Workflow

The Expanded Illumina Sequencing Portfolio New Sample Prep Solutions and Workflow The Expanded Illumina Sequencing Portfolio New Sample Prep Solutions and Workflow Marcus Hausch, Ph.D. 2010 Illumina, Inc. All rights reserved. Illumina, illuminadx, Solexa, Making Sense Out of Life, Oligator,

More information

Wet-lab Considerations for Illumina data analysis

Wet-lab Considerations for Illumina data analysis Wet-lab Considerations for Illumina data analysis Based on a presentation by Henriette O Geen Lutz Froenicke DNA Technologies and Expression Analysis Cores UCD Genome Center Complementary Approaches Illumina

More information

DNA-Sequencing. Technologies & Devices

DNA-Sequencing. Technologies & Devices DNA-Sequencing Technologies & Devices Genome analysis DNA sequencing platforms ABI 3730xl 4/2004 & 6/2006 1 Mb/day, 850 nt reads 2 Mb/day, 550 nt reads Roche/454 GS FLX 12/2006 800 Mb/23h, 800 nt reads

More information

Research school methods seminar Genomics and Transcriptomics

Research school methods seminar Genomics and Transcriptomics Research school methods seminar Genomics and Transcriptomics Stephan Klee 19.11.2014 2 3 4 5 Genetics, Genomics what are we talking about? Genetics and Genomics Study of genes Role of genes in inheritence

More information

02 Agenda Item 03 Agenda Item

02 Agenda Item 03 Agenda Item 01 Agenda Item 02 Agenda Item 03 Agenda Item SOLiD 3 System: Applications Overview April 12th, 2010 Jennifer Stover Field Application Specialist - SOLiD Applications Workflow for SOLiD Application Application

More information

Next-Generation Sequencing. Technologies

Next-Generation Sequencing. Technologies Next-Generation Next-Generation Sequencing Technologies Sequencing Technologies Nicholas E. Navin, Ph.D. MD Anderson Cancer Center Dept. Genetics Dept. Bioinformatics Introduction to Bioinformatics GS011062

More information

Next Generation Sequencing. Jeroen Van Houdt - Leuven 13/10/2017

Next Generation Sequencing. Jeroen Van Houdt - Leuven 13/10/2017 Next Generation Sequencing Jeroen Van Houdt - Leuven 13/10/2017 Landmarks in DNA sequencing 1953 Discovery of DNA double helix structure 1977 A Maxam and W Gilbert "DNA seq by chemical degradation" F Sanger"DNA

More information

High Throughput Sequencing the Multi-Tool of Life Sciences. Lutz Froenicke DNA Technologies and Expression Analysis Cores UCD Genome Center

High Throughput Sequencing the Multi-Tool of Life Sciences. Lutz Froenicke DNA Technologies and Expression Analysis Cores UCD Genome Center High Throughput Sequencing the Multi-Tool of Life Sciences Lutz Froenicke DNA Technologies and Expression Analysis Cores UCD Genome Center DNA Technologies & Expression Analysis Cores HT Sequencing (Illumina

More information

Phenotype analysis: biological-biochemical analysis. Genotype analysis: molecular and physical analysis

Phenotype analysis: biological-biochemical analysis. Genotype analysis: molecular and physical analysis 1 Genetic Analysis Phenotype analysis: biological-biochemical analysis Behaviour under specific environmental conditions Behaviour of specific genetic configurations Behaviour of progeny in crosses - Genotype

More information

Phenotype analysis: biological-biochemical analysis. Genotype analysis: molecular and physical analysis

Phenotype analysis: biological-biochemical analysis. Genotype analysis: molecular and physical analysis 1 Genetic Analysis Phenotype analysis: biological-biochemical analysis Behaviour under specific environmental conditions Behaviour of specific genetic configurations Behaviour of progeny in crosses - Genotype

More information

Introduction to Next Generation Sequencing (NGS)

Introduction to Next Generation Sequencing (NGS) Introduction to Next eneration Sequencing (NS) Simon Rasmussen Assistant Professor enter for Biological Sequence analysis Technical University of Denmark 2012 Today 9.00-9.45: Introduction to NS, How it

More information

HiSeqTM 2000 Sequencing System

HiSeqTM 2000 Sequencing System IET International Equipment Trading Ltd. www.ietltd.com Proudly serving laboratories worldwide since 1979 CALL +847.913.0777 for Refurbished & Certified Lab Equipment HiSeqTM 2000 Sequencing System Performance

More information

Third Generation Sequencing

Third Generation Sequencing Third Generation Sequencing By Mohammad Hasan Samiee Aref Medical Genetics Laboratory of Dr. Zeinali History of DNA sequencing 1953 : Discovery of DNA structure by Watson and Crick 1973 : First sequence

More information

Wheat CAP Gene Expression with RNA-Seq

Wheat CAP Gene Expression with RNA-Seq Wheat CAP Gene Expression with RNA-Seq July 9 th -13 th, 2018 Overview of the workshop, Alina Akhunova http://www.ksre.k-state.edu/igenomics/workshops/ RNA-Seq Workshop Activities Lectures Laboratory Molecular

More information

solid S Y S T E M s e q u e n c i n g See the Difference Discover the Quality Genome

solid S Y S T E M s e q u e n c i n g See the Difference Discover the Quality Genome solid S Y S T E M s e q u e n c i n g See the Difference Discover the Quality Genome See the Difference With a commitment to your peace of mind, Life Technologies provides a portfolio of robust and scalable

More information

Transcriptomics analysis with RNA seq: an overview Frederik Coppens

Transcriptomics analysis with RNA seq: an overview Frederik Coppens Transcriptomics analysis with RNA seq: an overview Frederik Coppens Platforms Applications Analysis Quantification RNA content Platforms Platforms Short (few hundred bases) Long reads (multiple kilobases)

More information

Next Gen Sequencing. Expansion of sequencing technology. Contents

Next Gen Sequencing. Expansion of sequencing technology. Contents Next Gen Sequencing Contents 1 Expansion of sequencing technology 2 The Next Generation of Sequencing: High-Throughput Technologies 3 High Throughput Sequencing Applied to Genome Sequencing (TEDed CC BY-NC-ND

More information

The Journey of DNA Sequencing. Chromosomes. What is a genome? Genome size. H. Sunny Sun

The Journey of DNA Sequencing. Chromosomes. What is a genome? Genome size. H. Sunny Sun The Journey of DNA Sequencing H. Sunny Sun What is a genome? Genome is the total genetic complement of a living organism. The nuclear genome comprises approximately 3.2 * 10 9 nucleotides of DNA, divided

More information

High throughput sequencing technologies

High throughput sequencing technologies High throughput sequencing technologies and NGS applications Mei-yeh Lu 呂美曄 High Throughput Sequencing Core Manager g g p q g g Academia Sinica 6/30/2011 Outlines Evolution of sequencing technologies Sanger

More information

Next-generation sequencing technologies

Next-generation sequencing technologies Next-generation sequencing technologies NGS applications Illumina sequencing workflow Overview Sequencing by ligation Short-read NGS Sequencing by synthesis Illumina NGS Single-molecule approach Long-read

More information

High throughput DNA Sequencing. An Equal Opportunity University!

High throughput DNA Sequencing. An Equal Opportunity University! High throughput DNA Sequencing An Equal Opportunity University! irst Generation DNA sequencing utilize chain terminator technologies (adaptation of Sanger sequencing) Adapt fluorescence chemistry, high-resolution

More information

Introductory Next Gen Workshop

Introductory Next Gen Workshop Introductory Next Gen Workshop http://www.illumina.ucr.edu/ http://www.genomics.ucr.edu/ Workshop Objectives Workshop aimed at those who are new to Illumina sequencing and will provide: - a basic overview

More information

Genome 373: High- Throughput DNA Sequencing. Doug Fowler

Genome 373: High- Throughput DNA Sequencing. Doug Fowler Genome 373: High- Throughput DNA Sequencing Doug Fowler Tasks give ML unity We learned about three tasks that are commonly encountered in ML Models/Algorithms Give ML Diversity Classification Regression

More information

Genome Resequencing. Rearrangements. SNPs, Indels CNVs. De novo genome Sequencing. Metagenomics. Exome Sequencing. RNA-seq Gene Expression

Genome Resequencing. Rearrangements. SNPs, Indels CNVs. De novo genome Sequencing. Metagenomics. Exome Sequencing. RNA-seq Gene Expression Genome Resequencing De novo genome Sequencing SNPs, Indels CNVs Rearrangements Metagenomics RNA-seq Gene Expression Splice Isoform Abundance High Throughput Short Read Sequencing: Illumina Exome Sequencing

More information

SMRT Analysis Barcoding Overview (v6.0.0)

SMRT Analysis Barcoding Overview (v6.0.0) SMRT Analysis Barcoding Overview (v6.0.0) Introduction This document applies to PacBio RS II and Sequel Systems using SMRT Link v6.0.0. Note: For information on earlier versions of SMRT Link, see the document

More information

Lecture 7. Next-generation sequencing technologies

Lecture 7. Next-generation sequencing technologies Lecture 7 Next-generation sequencing technologies Next-generation sequencing technologies General principles of short-read NGS Construct a library of fragments Generate clonal template populations Massively

More information

Revolutionize Genomics with SMRT Sequencing. Single Molecule, Real-Time Technology

Revolutionize Genomics with SMRT Sequencing. Single Molecule, Real-Time Technology Revolutionize Genomics with SMRT Sequencing Single Molecule, Real-Time Technology Resolve to Master Complexity Despite large investments in population studies, the heritability of the majority of Mendelian

More information

DNA-Sequenzierung. Technologien & Geräte

DNA-Sequenzierung. Technologien & Geräte DNA-Sequenzierung Technologien & Geräte Genome analysis DNA sequencing platforms ABI 3730xl 4/2004 & 6/2006 1 Mb/day, 850 nt reads 2 Mb/day, 550 nt reads Roche/454 GS FLX 12/2006 400 Mb/7h, 350 nt reads

More information

Human Genome Sequencing Over the Decades The capacity to sequence all 3.2 billion bases of the human genome (at 30X coverage) has increased

Human Genome Sequencing Over the Decades The capacity to sequence all 3.2 billion bases of the human genome (at 30X coverage) has increased Human Genome Sequencing Over the Decades The capacity to sequence all 3.2 billion bases of the human genome (at 30X coverage) has increased exponentially since the 1990s. In 2005, with the introduction

More information

Sequencing technologies. Jose Blanca COMAV institute bioinf.comav.upv.es

Sequencing technologies. Jose Blanca COMAV institute bioinf.comav.upv.es Sequencing technologies Jose Blanca COMAV institute bioinf.comav.upv.es Outline Sequencing technologies: Sanger 2nd generation sequencing: 3er generation sequencing: 454 Illumina SOLiD Ion Torrent PacBio

More information

Integrated NGS Sample Preparation Solutions for Limiting Amounts of RNA and DNA. March 2, Steven R. Kain, Ph.D. ABRF 2013

Integrated NGS Sample Preparation Solutions for Limiting Amounts of RNA and DNA. March 2, Steven R. Kain, Ph.D. ABRF 2013 Integrated NGS Sample Preparation Solutions for Limiting Amounts of RNA and DNA March 2, 2013 Steven R. Kain, Ph.D. ABRF 2013 NuGEN s Core Technologies Selective Sequence Priming Nucleic Acid Amplification

More information

How is genome sequencing done?

How is genome sequencing done? Click here to view Roche 454 Sequencing Genome Sequence FLX available at www.ssllc.com>> How is genome sequencing done? Using 454 Sequencing on the Genome Sequencer FLX System, DNA from a genome is converted

More information

Sequencing technologies

Sequencing technologies Sequencing technologies part of High-Throughput Analyzes of Genome Sequenzes Computational EvoDevo University of Leipzig Leipzig, WS 2014/15 Sanger Sequencing (Chain Termination Method) Sequencing of one

More information

DNA Sequencing by Ion Torrent. Marc Lavergne CHEM 4590

DNA Sequencing by Ion Torrent. Marc Lavergne CHEM 4590 DNA Sequencing by Ion Torrent Marc Lavergne CHEM 4590 OVERVIEW History DNA Synthesis and First-Gen Sequencing Technology Sequencing Signal Detection Advantages/Disadvantages Applications Current Research

More information

The New Genome Analyzer IIx Delivering more data, faster, and easier than ever before. Jeremy Preston, PhD Marketing Manager, Sequencing

The New Genome Analyzer IIx Delivering more data, faster, and easier than ever before. Jeremy Preston, PhD Marketing Manager, Sequencing The New Genome Analyzer IIx Delivering more data, faster, and easier than ever before Jeremy Preston, PhD Marketing Manager, Sequencing Illumina Genome Analyzer: a Paradigm Shift 2000x gain in efficiency

More information

Functional Genomics Research Stream. Research Meetings: November 2 & 3, 2009 Next Generation Sequencing

Functional Genomics Research Stream. Research Meetings: November 2 & 3, 2009 Next Generation Sequencing Functional Genomics Research Stream Research Meetings: November 2 & 3, 2009 Next Generation Sequencing Current Issues Research Meetings: Meet with me this Thursday or Friday. (bring laboratory notebook

More information

About Strand NGS. Strand Genomics, Inc All rights reserved.

About Strand NGS. Strand Genomics, Inc All rights reserved. About Strand NGS Strand NGS-formerly known as Avadis NGS, is an integrated platform that provides analysis, management and visualization tools for next-generation sequencing data. It supports extensive

More information

Gene Expression Technology

Gene Expression Technology Gene Expression Technology Bing Zhang Department of Biomedical Informatics Vanderbilt University bing.zhang@vanderbilt.edu Gene expression Gene expression is the process by which information from a gene

More information

High Throughput Sequencing Technologies. UCD Genome Center Bioinformatics Core Monday 15 June 2015

High Throughput Sequencing Technologies. UCD Genome Center Bioinformatics Core Monday 15 June 2015 High Throughput Sequencing Technologies UCD Genome Center Bioinformatics Core Monday 15 June 2015 Sequencing Explosion www.genome.gov/sequencingcosts http://t.co/ka5cvghdqo Sequencing Explosion 2011 PacBio

More information

RNA-Seq data analysis course September 7-9, 2015

RNA-Seq data analysis course September 7-9, 2015 RNA-Seq data analysis course September 7-9, 2015 Peter-Bram t Hoen (LUMC) Jan Oosting (LUMC) Celia van Gelder, Jacintha Valk (BioSB) Anita Remmelzwaal (LUMC) Expression profiling DNA mrna protein Comprehensive

More information

G E N OM I C S S E RV I C ES

G E N OM I C S S E RV I C ES GENOMICS SERVICES ABOUT T H E N E W YOR K G E NOM E C E N T E R NYGC is an independent non-profit implementing advanced genomic research to improve diagnosis and treatment of serious diseases. Through

More information

2 nd Genera-on ( NextGen ) Sequencing Technologies

2 nd Genera-on ( NextGen ) Sequencing Technologies 2 nd Genera-on ( NextGen ) Sequencing Technologies Jay Shendure Read Length is Not As Important For Resequencing % of Paired K-mers with Uniquely Assignable Location 100% 90% 80% 70% 60% 50% 40% 30% 20%

More information

Unbiased Quantitative RNA & DNA Specialty Sequencing Solutions

Unbiased Quantitative RNA & DNA Specialty Sequencing Solutions Unbiased Quantitative RNA & DNA Specialty Sequencing Solutions The Heliscope single molecule sequencer is the first genetic analyzer to harness the power of single molecule sequencing. True single molecule

More information

Get to Know Your DNA. Every Single Fragment.

Get to Know Your DNA. Every Single Fragment. HaloPlex HS NGS Target Enrichment System Get to Know Your DNA. Every Single Fragment. High sensitivity detection of rare variants using molecular barcodes How Does Molecular Barcoding Work? HaloPlex HS

More information

High Throughput Sequencing Technologies. J Fass UCD Genome Center Bioinformatics Core Tuesday December 16, 2014

High Throughput Sequencing Technologies. J Fass UCD Genome Center Bioinformatics Core Tuesday December 16, 2014 High Throughput Sequencing Technologies J Fass UCD Genome Center Bioinformatics Core Tuesday December 16, 2014 Sequencing Explosion www.genome.gov/sequencingcosts http://t.co/ka5cvghdqo Sequencing Explosion

More information

CBC Data Therapy. Metagenomics Discussion

CBC Data Therapy. Metagenomics Discussion CBC Data Therapy Metagenomics Discussion General Workflow Microbial sample Generate Metaomic data Process data (QC, etc.) Analysis Marker Genes Extract DNA Amplify with targeted primers Filter errors,

More information

Sequencing technologies. Jose Blanca COMAV institute bioinf.comav.upv.es

Sequencing technologies. Jose Blanca COMAV institute bioinf.comav.upv.es Sequencing technologies Jose Blanca COMAV institute bioinf.comav.upv.es Outline Sequencing technologies: Sanger 2nd generation sequencing: 3er generation sequencing: 454 Illumina SOLiD Ion Torrent PacBio

More information

GENOMICS WORKFLOW SOLUTIONS THAT GO WHERE THE SCIENCE LEADS. Genomics Solutions Portfolio

GENOMICS WORKFLOW SOLUTIONS THAT GO WHERE THE SCIENCE LEADS. Genomics Solutions Portfolio GENOMICS WORKFLOW SOLUTIONS THAT GO WHERE THE SCIENCE LEADS Genomics Solutions Portfolio WORKFLOW SOLUTIONS FROM EXTRACTION TO ANALYSIS Application-based answers for every step of your workflow Scientists

More information

Sequencing technologies. Jose Blanca COMAV institute bioinf.comav.upv.es

Sequencing technologies. Jose Blanca COMAV institute bioinf.comav.upv.es Sequencing technologies Jose Blanca COMAV institute bioinf.comav.upv.es Outline Sequencing technologies: Sanger 2nd generation sequencing: 3er generation sequencing: 454 Illumina SOLiD Ion Torrent PacBio

More information

NextGen Sequencing Technologies Sequencing overview

NextGen Sequencing Technologies Sequencing overview Outline Conventional NextGen High-throughput sequencing (Next-Gen sequencing) technologies. Illumina sequencing in detail. Quality control. Sequence coverage. Multiplexing. FASTQ files. Shendure and Ji

More information

Next generation sequencing techniques" Toma Tebaldi Centre for Integrative Biology University of Trento

Next generation sequencing techniques Toma Tebaldi Centre for Integrative Biology University of Trento Next generation sequencing techniques" Toma Tebaldi Centre for Integrative Biology University of Trento Mattarello September 28, 2009 Sequencing Fundamental task in modern biology read the information

More information

Ion S5 and Ion S5 XL Systems

Ion S5 and Ion S5 XL Systems Ion S5 and Ion S5 XL Systems Targeted sequencing has never been simpler Explore the Ion S5 and Ion S5 XL Systems Adopting next-generation sequencing (NGS) in your lab is now simpler than ever The Ion S5

More information

Bioinformatics Advice on Experimental Design

Bioinformatics Advice on Experimental Design Bioinformatics Advice on Experimental Design Where do I start? Please refer to the following guide to better plan your experiments for good statistical analysis, best suited for your research needs. Statistics

More information

Concepts and methods in sequencing and genome assembly

Concepts and methods in sequencing and genome assembly BCM-2002 Concepts and methods in sequencing and genome assembly http://megasun.bch.umontreal.ca/papers/bcm-2002/sequencing-bcm2002-nov2015.pdf B. Franz LANG, Département de Biochimie Bureau: H307-15 Courrier

More information

Sequence Assembly and Next Generation Sequencing Informatics CBPS7711

Sequence Assembly and Next Generation Sequencing Informatics CBPS7711 Sequence Assembly and Next Generation Sequencing Informatics CBPS7711 Oct 5, 2010 Sonia Leach, PhD Assistant Professor National Jewish Health sonia.leach@gmail.com Next Generation Sequencing Increasing

More information

Pacific Biosciences Technology Backgrounder 1 11/24/2008

Pacific Biosciences Technology Backgrounder 1 11/24/2008 Pacific Biosciences Technology Backgrounder 1 Pacific Biosciences Develops Transformative DNA Sequencing Technology Single Molecule Real Time (SMRT ) DNA Sequencing Long Reads, Short Run Time, and High

More information

Ion S5 and Ion S5 XL Systems

Ion S5 and Ion S5 XL Systems Ion S5 and Ion S5 XL Systems Targeted sequencing has never been simpler Explore the Ion S5 and Ion S5 XL Systems Adopting next-generation sequencing (NGS) in your lab is now simpler than ever The Ion S5

More information

resequencing storage SNP ncrna metagenomics private trio de novo exome ncrna RNA DNA bioinformatics RNA-seq comparative genomics

resequencing storage SNP ncrna metagenomics private trio de novo exome ncrna RNA DNA bioinformatics RNA-seq comparative genomics RNA Sequencing T TM variation genetics validation SNP ncrna metagenomics private trio de novo exome mendelian ChIP-seq RNA DNA bioinformatics custom target high-throughput resequencing storage ncrna comparative

More information

GENOMICS WORKFLOW SOLUTIONS THAT GO WHERE THE SCIENCE LEADS. Genomics Solutions Portfolio

GENOMICS WORKFLOW SOLUTIONS THAT GO WHERE THE SCIENCE LEADS. Genomics Solutions Portfolio GENOMICS WORKFLOW SOLUTIONS THAT GO WHERE THE SCIENCE LEADS Genomics Solutions Portfolio WORKFLOW SOLUTIONS FROM EXTRACTION TO ANALYSIS Application-based answers for every step of your workflow Scientists

More information

NEXT-GENERATION SEQUENCING AND BIOINFORMATICS

NEXT-GENERATION SEQUENCING AND BIOINFORMATICS NEXT-GENERATION SEQUENCING AND BIOINFORMATICS Moore's law: the number of transistors in a dense integrated circuit doubles every two years Moore's law calculates and predicts the pace of improvement of

More information

Basics of RNA-Seq. (With a Focus on Application to Single Cell RNA-Seq) Michael Kelly, PhD Team Lead, NCI Single Cell Analysis Facility

Basics of RNA-Seq. (With a Focus on Application to Single Cell RNA-Seq) Michael Kelly, PhD Team Lead, NCI Single Cell Analysis Facility 2018 ABRF Meeting Satellite Workshop 4 Bridging the Gap: Isolation to Translation (Single Cell RNA-Seq) Sunday, April 22 Basics of RNA-Seq (With a Focus on Application to Single Cell RNA-Seq) Michael Kelly,

More information

Molecular Biology and Functional Genomic Core Facility

Molecular Biology and Functional Genomic Core Facility Molecular Biology and Functional Genomic Core Facility General Presentation Dr Odile Neyret Core Manager Myriam Rondeau Research Assistant Agnès Dumont Research Assistant Institut de recherche clinique

More information

A Crash Course in NGS for GI Pathologists. Sandra O Toole

A Crash Course in NGS for GI Pathologists. Sandra O Toole A Crash Course in NGS for GI Pathologists Sandra O Toole The Sanger Technique First generation sequencing Uses dideoxynucleotides (dideoxyadenine, dideoxyguanine, etc) These are molecules that resemble

More information

Mate-pair library data improves genome assembly

Mate-pair library data improves genome assembly De Novo Sequencing on the Ion Torrent PGM APPLICATION NOTE Mate-pair library data improves genome assembly Highly accurate PGM data allows for de Novo Sequencing and Assembly For a draft assembly, generate

More information

Joint RuminOmics/Rumen Microbial Genomics Network Workshop

Joint RuminOmics/Rumen Microbial Genomics Network Workshop Joint RuminOmics/Rumen Microbial Genomics Network Workshop Microbiome analysis - Amplicon sequencing Dr. Sinéad Waters Animal and Bioscience Research Department, Teagasc Grange, Ireland Prof. Leluo Guan

More information

APPLICATION NOTE. Abstract. Introduction

APPLICATION NOTE. Abstract. Introduction From minuscule amounts to magnificent results: reliable ChIP-seq data from 1, cells with the True MicroChIP and the MicroPlex Library Preparation kits Abstract Diagenode has developed groundbreaking solutions

More information

Illumina s Suite of Targeted Resequencing Solutions

Illumina s Suite of Targeted Resequencing Solutions Illumina s Suite of Targeted Resequencing Solutions Colin Baron Sr. Product Manager Sequencing Applications 2011 Illumina, Inc. All rights reserved. Illumina, illuminadx, Solexa, Making Sense Out of Life,

More information

SMRT Analysis Barcoding Overview

SMRT Analysis Barcoding Overview SMRT Analysis Barcoding Overview Introduction This document is for users with Sequel Systems using SMRT Link v5.0.0 or v5.0.1. This document covers: Barcoding designs, strategies and modes for preparing

More information

Using New ThiNGS on Small Things. Shane Byrne

Using New ThiNGS on Small Things. Shane Byrne Using New ThiNGS on Small Things Shane Byrne Next Generation Sequencing New Things Small Things NGS Next Generation Sequencing = 2 nd generation of sequencing 454 GS FLX, SOLiD, GAIIx, HiSeq, MiSeq, Ion

More information

TREE CODE PRODUCT BROCHURE

TREE CODE PRODUCT BROCHURE TREE CODE PRODUCT BROCHURE Single Molecule, Real-Time (SMRT) Sequencing technology offers: Long read sequencing ~10 Gb with 20 kb average read lengths for WGS ~20 Gb with 40 kb average read length for

More information

Computational & Quantitative Biology Lecture 6 RNA Sequencing

Computational & Quantitative Biology Lecture 6 RNA Sequencing Peter A. Sims Dept. of Systems Biology Dept. of Biochemistry & Molecular Biophysics Sulzberger Columbia Genome Center October 27, 2014 Computational & Quantitative Biology Lecture 6 RNA Sequencing We Have

More information

NEXT GENERATION SEQUENCING Whole Gene Sequencing

NEXT GENERATION SEQUENCING Whole Gene Sequencing NEXT GENERATION SEQUENCING Whole Gene Sequencing Ingrid Faé Educational Session 3: Next generation sequencing Stockholm, Friday, June 27 th 2014 Department for Blood Group Serology and Transfusion Medicine

More information

IMGM Laboratories GmbH. Sales Manager

IMGM Laboratories GmbH. Sales Manager IMGM Laboratories GmbH Dr. Jennifer K. Kuhn Sales Manager About IMGM Laboratories IMGM Laboratories was founded in 2001 IMGM operates as professional provider of advanced genomic services from research

More information

Applications of short-read

Applications of short-read Applications of short-read sequencing: RNA-Seq and ChIP-Seq BaRC Hot Topics March 2013 George Bell, Ph.D. http://jura.wi.mit.edu/bio/education/hot_topics/ Sequencing applications RNA-Seq includes experiments

More information

Next Generation Sequencing. Simon Rasmussen Assistant Professor Center for Biological Sequence analysis Technical University of Denmark

Next Generation Sequencing. Simon Rasmussen Assistant Professor Center for Biological Sequence analysis Technical University of Denmark Next eneration Sequencing Simon Rasmussen Assistant Professor enter for Biological Sequence analysis Technical University of Denmark DNA Sequencing DNA sequencing Reading the order of bases in DNA fragments

More information

Introductie en Toepassingen van Next-Generation Sequencing in de Klinische Virologie. Sander van Boheemen Medical Microbiology

Introductie en Toepassingen van Next-Generation Sequencing in de Klinische Virologie. Sander van Boheemen Medical Microbiology Introductie en Toepassingen van Next-Generation Sequencing in de Klinische Virologie Sander van Boheemen Medical Microbiology Next-generation sequencing Next-generation sequencing (NGS), also known as

More information

Incorporating Molecular ID Technology. Accel-NGS 2S MID Indexing Kits

Incorporating Molecular ID Technology. Accel-NGS 2S MID Indexing Kits Incorporating Molecular ID Technology Accel-NGS 2S MID Indexing Kits Molecular Identifiers (MIDs) MIDs are indices used to label unique library molecules MIDs can assess duplicate molecules in sequencing

More information

High Throughput Sequencing the Multi-Tool of Life Sciences. Lutz Froenicke DNA Technologies and Expression Analysis Cores UCD Genome Center

High Throughput Sequencing the Multi-Tool of Life Sciences. Lutz Froenicke DNA Technologies and Expression Analysis Cores UCD Genome Center High Throughput Sequencing the Multi-Tool of Life Sciences Lutz Froenicke DNA Technologies and Expression Analysis Cores UCD Genome Center DNA Technologies & Expression Analysis Cores HT Sequencing (Illumina

More information

Introduction Bioo Scientific

Introduction Bioo Scientific Next Generation Sequencing Catalog 2014-2015 Introduction Bioo Scientific Bioo Scientific is a global life science company headquartered in Austin, TX, committed to providing innovative products and superior

More information

an innovation in high throughput single cell profiling

an innovation in high throughput single cell profiling an innovation in high throughput single cell profiling www.dolomite-bio.com Why use high throughput single cell profiling? Techniques such as high throughput scrna-seq (single cell RNA sequencing) offer

More information

Introduction to RNA-Seq. David Wood Winter School in Mathematics and Computational Biology July 1, 2013

Introduction to RNA-Seq. David Wood Winter School in Mathematics and Computational Biology July 1, 2013 Introduction to RNA-Seq David Wood Winter School in Mathematics and Computational Biology July 1, 2013 Abundance RNA is... Diverse Dynamic Central DNA rrna Epigenetics trna RNA mrna Time Protein Abundance

More information