GENES & GENOME DATABASES

Similar documents
Introduction and Public Sequence Databases. BME 110/BIOL 181 CompBio Tools

Genome and DNA Sequence Databases. BME 110: CompBio Tools Todd Lowe April 5, 2007

AP Biology

O C. 5 th C. 3 rd C. the national health museum

Bioinformatics for Genomics

A Lot More Advanced Biotechnology Tools. DNA Sequencing. DNA Sequencing. Sequencing and more. Sanger method

Concepts of Genetics, 10e (Klug/Cummings/Spencer/Palladino) Chapter 1 Introduction to Genetics

2017 Amplyus, all rights reserved

Matthew Tinning Australian Genome Research Facility. July 2012

Computational gene finding

NCBI & Other Genome Databases. BME 110/BIOL 181 CompBio Tools

Studying the Human Genome. Lesson Overview. Lesson Overview Studying the Human Genome

BENG 183 Trey Ideker. Genome Assembly and Physical Mapping

Interview with Dr. David Haussler

3. human genomics clone genes associated with genetic disorders. 4. many projects generate ordered clones that cover genome

Lesson Overview. Studying the Human Genome. Lesson Overview Studying the Human Genome

The Human Genome Project

Conditional Random Fields, DNA Sequencing. Armin Pourshafeie. February 10, 2015

Genome Projects. Part III. Assembly and sequencing of human genomes

Genomes contain all of the information needed for an organism to grow and survive.

The Structure of Proteins and DNA

Advanced Techniques. Electrophoresis, PCR, Southern Blot, Sequencing, RFLPs, Microarrays. AP Biology

DNA Sequencing and Assembly

Genomes: What we know and what we don t know

Mate-pair library data improves genome assembly

Cracking the Code of Life -Video Questions

Complete draft sequence 2001

How does the human genome stack up? Genomic Size. Genome Size. Number of Genes. Eukaryotic genomes are generally larger.

Genome Sequence Assembly

Genome Sequencing Technologies. Jutta Marzillier, Ph.D. Lehigh University Department of Biological Sciences Iacocca Hall

وراثة األحياء الدقيقة Microbial Genetics

Computational gene finding. Devika Subramanian Comp 470

DNA The Genetic Material

Biosc10 schedule reminders

Genetics Lecture 21 Recombinant DNA

Genetics and Biotechnology. Section 1. Applied Genetics

Computational Genomics

Introduction to Bioinformatics

Introduction to the course

Leonardo Mariño-Ramírez, PhD NCBI / NLM / NIH. BIOL 7210 A Computational Genomics 2/18/2015

Computational gene finding

Genetics and Genomics in Medicine Chapter 3. Questions & Answers

A Crash Course in NGS for GI Pathologists. Sandra O Toole

Genetics and Biotechnology 13.2 DNA Technology

Human genome sequence February, 2001

THE CAMPAIGN FOR UC SANTA CRUZ THE GENOMICS INSTITUTE

CHAPTER 20 DNA TECHNOLOGY AND GENOMICS. Section A: DNA Cloning

3) This diagram represents: (Indicate all correct answers)

Genetics One Gene, One Phenotype

Human genome sequence

BIOTECHNOLOGY. Biotechnology is the process by which living organisms are used to create new products THE ORGANISMS

Introduction to Bioinformatics

Biochemistry 412. Overview of Genomics & Proteomics. 20 January 2009

Introduction to 'Omics and Bioinformatics

NB536: Bioinformatics

DNA and genome sequencing. Matthew Hudson Dept of Crop Sciences University of Illinois

What is Bioinformatics?

Biology Warm Up. 1. Complete the entrance ticket you received at the door.

NOTES - CH 15 (and 14.3): DNA Technology ( Biotech )

4. Analysing genes II Isolate mutants*


Alignment and Assembly

Class 35: Decoding DNA

Synthetic Biology. IICA First Seminar on SynBio for Biotechnology Decision Makers March 16-17, Fan-Li Chou. Foreign Agricultural Service

Advanced Technology in Phytoplasma Research

Next generation sequencing techniques" Toma Tebaldi Centre for Integrative Biology University of Trento

Bioinformatics. Joshua Gilkerson Albert Kalim Ka-him Leung David Owen

Human Genomics. Higher Human Biology

The Journey of DNA Sequencing. Chromosomes. What is a genome? Genome size. H. Sunny Sun

Introduction to NGS. Josef K Vogt Slides by: Simon Rasmussen Next Generation Sequencing Analysis

Page 70 Monday December 8, 2014

CHAPTERS 16 & 17: DNA Technology

CSE182-L16. LW statistics/assembly

7.03, 2005, Lecture 20 EUKARYOTIC GENES AND GENOMES I

ab initio and Evidence-Based Gene Finding

The Diploid Genome Sequence of an Individual Human

CHAPTER 21 LECTURE SLIDES

BIOTECHNOLOGY. Sticky & blunt ends. Restriction endonucleases. Gene cloning an overview. DNA isolation & restriction

Introduction to NGS. Simon Rasmussen Associate Professor DTU Bioinformatics Technical University of Denmark 2018

Molecular Biology: DNA sequencing

DNA SEQUENCING BY SANGER METHOD

Chemical treatments cause cells and nuclei to burst The DNA is inherently sticky, and can be pulled out of the mixture This is called spooling DNA

Manipulating genes and cells (Kap. 10)

Introduction to Computational Genomics

Nuts and bolts of phage genome sequencing. the 5 5 and 5 8 perspective. Allison Johnson & Anneke Padolina

Data Basics. Josef K Vogt Slides by: Simon Rasmussen Next Generation Sequencing Analysis

NGS developments in tomato genome sequencing

Bioinformatics Support of Genome Sequencing Projects. Seminar in biology

Chapter 5. Structural Genomics

Contact us for more information and a quotation

PCR in the Classroom. UC Davis - PCR Workshop Friday, September 26, 2003

The Structure of Proteins The Structure of Proteins. How Proteins are Made: Genetic Transcription, Translation, and Regulation

Chapter 15 Gene Technologies and Human Applications

Chapter 20: The human genome

HAS 4320 Pharmaceutical Industry. Dr Burton

Polymers. DNA is a polymer of nucleotides. What polymers (macromolecules) have we met so far? Proteins are polymers of amino acids

Just the Facts: A Basic Introduction to the Science Underlying NCBI Resources

Motivation From Protein to Gene

Reading for lecture 2

NextGen Sequencing and Target Enrichment

Transcription:

GENES & GENOME DATABASES BME 110/BIOL 181 Computational Biology Tools Prof. Todd Lowe April 5, 2012

ADMIN Discuss Fun Quiz Readings: Dummies Chapters 1, 2 (pp. 29-56), Ch 3; NYTimes piece on Jim Kent Assigned Review: "The impact of next-generation sequencing technology on genetics" by E. Mardis Recommended Review: "Application of next-generation sequencing technologies to microbial genetics" by MacLean et al.

STUDY SECTIONS Study Sections: Tues 6-7:15pm (Physical Sciences Building Rm 305) To be decided (look like it will be Thursday 6-7:15pm)

GENES & GENOMES

THREE DOMAINS OF LIFE Credit: http://scienceblogs.com/clock/2007/01/ current_biological_diversity.php

WHAT IS A GENOME? A complete set of instructions for life encoded in DNA Organized in chromosomes prokaryotes generally have one main circular chromosome eukaryotes have multiple linear chromosomes Instructions are generally in the form of genes, and are the unit of heredity

WHY SEQUENCE A GENOME? We wish to understand how the entire cell / organism works thousands of complex gene interactions complete parts list is first step to understanding how parts work together as a whole Economy of scale - faster, cheaper to sequence all genes at once, than one at a time by many different researchers

First fully sequenced Organism: H. Influenza The Institute for Genome Research (TIGR) - 1995

HUMAN GENOME In 1980 s, initial discussion to sequence human genome (first key meeting here at UCSC!) Began: 1990 Planned finish: 2005 Original Estimates: ~100,000 genes 3 billion nucleotides, projected cost $300 Million less than 5% of genome codes for genes Early opposition to sequencing 95% junk, taking money away from basic research Final Results: Draft completed June 2000, Finished April 2003 Final cost ~ $3 billion ~25,000-30,000 genes now 5% of genome is conserved, likely important

PREPARING FOR THE HUMAN SEQUENCING Mapping the human genome Practice: sequencing model organisms Model Species Genome Size (Mb) Completed Baker s Yeast (S. cerevisiae) 12.5 1996 E. coli 4.5 1997 Roundworm (C. elegans) 100 1998 Fruit fly (D. melanogaster) 160 2000 Mustard weed (A. thaliana) 120 2000

Sequencing Genomes: Strategy #1 Original Top- Down Strategy Deliberate, small chance for major errors

Sequencing Genomes: Strategy #2 WGS (Whole Genome Sequencing) Bottom Up Strategy No ordering of a clone library - straight to sequencing to build scaffolds Current method used for genome sequencing

RACE TO THE FINISH JUNE A new private company, Celera, announces it would use strategy #2 to beat the public, international effort to sequence the human genome, and planned to patent as many human genes as possible Public team effort changes strategy from #1 to mixture of #1 and #2 Change in strategy meant ordered clone libraries would not be finished, and an alternate method was needed for putting the pieces together (quickly) UC Santa Cruz to the rescue! David Haussler & Jim Kent, in just a four weeks, created a clever program (GigAssembler) to run on a network of 100 Dell desktop computers to assemble the genome by Celera s target date in June The whole story: http://www.nytimes.com/2001/02/13/health/13hero.html

SEQUENCING TECHNOLOGIES Sanger Sequencing Capillary sequencing: ABI 3700 and MegaBASE machines allow first rapid automated sequencing in tiny capillary tubes 600-800 bases at a time x 384 wells = ~ 268,000 bases decoded / run Bulk of orignal human genome decoded on these NextGen Technologies (Massively parallel, shorter reads) 454 / Roche Sequencers 100-500 bases, 400-600 Million bases / run Illumina Genetic Analyzer: ~2x100 bases, 150-200 Billion bases/ run ABI SOLiD Sequencing: ~2x50 base each, 80-100 Billion bases / run Chemical decoding is no longer the major cost (10 Million bp / cent) reassembling all the pieces with computer programs and interpretation (bioinformatics) is now the major cost / bottleneck

GENOME CHALLENGES $1,000 Genome Prize goal set in 2003 by J. Craig Venter Science Foundation to make personalized medicine available to all ($500,000 prize) X Prize build a machine that can sequence 100 people in 10 days or less, for $10,000/genome or less ($10 Million prize): http:// genomics.xprize.org/

PLUMMETING COST OF HUMAN GENOME SEQUENCING Source: Wetterstrand KA http://www.genome.gov/