Molecular Biology: DNA sequencing

Similar documents
Genome Sequence Assembly

BENG 183 Trey Ideker. Genome Assembly and Physical Mapping

Chapter 8: Recombinant DNA. Ways this technology touches us. Overview. Genetic Engineering

Reading Lecture 8: Lecture 9: Lecture 8. DNA Libraries. Definition Types Construction

Molecular Genetics Techniques. BIT 220 Chapter 20

DNA Technology. Asilomar Singer, Zinder, Brenner, Berg

GENETICS EXAM 3 FALL a) is a technique that allows you to separate nucleic acids (DNA or RNA) by size.

Molecular Cell Biology - Problem Drill 11: Recombinant DNA

Computational Biology I LSM5191

B. Incorrect! Ligation is also a necessary step for cloning.

We begin with a high-level overview of sequencing. There are three stages in this process.

13-2 Manipulating DNA Slide 1 of 32

Texas A&M University-Corpus Christi CHEM4402 Biochemistry II Laboratory Laboratory 4 - Polymerase Chain Reaction (PCR)

Lecture Four. Molecular Approaches I: Nucleic Acids

1. A brief overview of sequencing biochemistry

Factors affecting PCR

Chapter 20 Recombinant DNA Technology. Copyright 2009 Pearson Education, Inc.

DNA sequencing. Course Info

Genetics and Genomics in Medicine Chapter 3. Questions & Answers

7.1 Techniques for Producing and Analyzing DNA. SBI4U Ms. Ho-Lau

Restriction Site Mapping:

Chapter 5. Structural Genomics

Genetics Lecture 21 Recombinant DNA

CONSTRUCTION OF GENOMIC LIBRARY

Molecular Biology: Gene cloning

Genome Projects. Part III. Assembly and sequencing of human genomes

BSCI410-Liu/Spring 06 Exam #1 Feb. 23, 06

A Guide to Consed Michelle Itano, Carolyn Cain, Tien Chusak, Justin Richner, and SCR Elgin.

Genome Sequencing-- Strategies

DNA Sequencing and Assembly

PLNT2530 (2018) Unit 6b Sequence Libraries

Polymerase Chain Reaction PCR

Tutorial. In Silico Cloning. Sample to Insight. March 31, 2016

Computational Biology 2. Pawan Dhar BII

BIOLOGY - CLUTCH CH.20 - BIOTECHNOLOGY.

Manipulation of Purified DNA

Multiple choice questions (numbers in brackets indicate the number of correct answers)

M Keramatipour 2. M Keramatipour 1. M Keramatipour 4. M Keramatipour 3. M Keramatipour 5. M Keramatipour

Sequence Assembly and Alignment. Jim Noonan Department of Genetics

Bioinformatics Support of Genome Sequencing Projects. Seminar in biology

Contact us for more information and a quotation

Introduction to Plant Genomics and Online Resources. Manish Raizada University of Guelph

Course summary. Today. PCR Polymerase chain reaction. Obtaining molecular data. Sequencing. DNA sequencing. Genome Projects.

Diagnosis Sanger. Interpreting and Troubleshooting Chromatograms. Volume 1: Help! No Data! GENEWIZ Technical Support

Bioinformatics for Genomics

Restriction Enzymes (endonucleases)

Molecular Cloning. Genomic DNA Library: Contains DNA fragments that represent an entire genome. cdna Library:

2 Gene Technologies in Our Lives

PV92 PCR Bio Informatics

Chapter 6 - Molecular Genetic Techniques

Selected Techniques Part I

Molecular Biology (2)

Manipulating DNA. Nucleic acids are chemically different from other macromolecules such as proteins and carbohydrates.

Molecular Biology: DNA sequencing

Recombinant DNA recombinant DNA DNA cloning gene cloning

Introduction to some aspects of molecular genetics

CHAPTERS 16 & 17: DNA Technology

SEQUENCING DNA. Jos. J. Schall Biology Department University of Vermont

Map-Based Cloning of Qualitative Plant Genes

1

BIOTECHNOLOGY : PRINCIPLES AND PROCESSES

Researchers use genetic engineering to manipulate DNA.

UNIT 3: GENETICS Chapter 9: Frontiers of Biotechnology

2054, Chap. 14, page 1

MCB 102 University of California, Berkeley August 11 13, Problem Set 8

Functional Genomics Research Stream. Research Meetings: November 2 & 3, 2009 Next Generation Sequencing

DNA REPLICATION & BIOTECHNOLOGY Biology Study Review

Genetics and Biotechnology. Section 1. Applied Genetics

Problem Set 8. Answer Key

Dr. Gary Mumaugh. DNA Technology

Recitation CHAPTER 9 DNA Technologies

Chapter 15 Gene Technologies and Human Applications

CHAPTER 20 DNA TECHNOLOGY AND GENOMICS. Section A: DNA Cloning

Chapter 10 Genetic Engineering: A Revolution in Molecular Biology

Bootcamp: Molecular Biology Techniques and Interpretation

BIOTECHNOLOGY. Sticky & blunt ends. Restriction endonucleases. Gene cloning an overview. DNA isolation & restriction

Corynebacterium pseudotuberculosis genome sequencing: Final Report

Appendix A DNA and PCR in detail DNA: A Detailed Look

Fun with DNA polymerase

Introduction to Molecular Biology

Biology Chapter 9 & Honors Biology Chapter 13. Frontiers Of Biotechnology

Antisense RNA Targeting the First Periplasmic Domain of YidC in Escherichia coli Appears to Induce Filamentation but Does Not Affect Cell Viability

Chapter 20: Biotechnology

BS 50 Genetics and Genomics Week of Nov 29

Recombinant DNA Technology

Biology Warm Up. 1. Complete the entrance ticket you received at the door.

3 Designing Primers for Site-Directed Mutagenesis

BIOTECHNOLOGY. Biotechnology is the process by which living organisms are used to create new products THE ORGANISMS

Plasmids & Transposable Elements. Dr. Wiaam Ahmed Al-Amili

12/31/16. I. Manipulating DNA (9.1) A. Scientists use several techniques to manipulate DNA. 1. DNA is a very large molecule

dbcamplicons pipeline Amplicons

Genome Analysis. Bacterial genome projects

Chapter 15 The Human Genome Project and Genomics. Chapter 15 Human Heredity by Michael Cummings 2006 Brooks/Cole-Thomson Learning

Finishing Fosmid DMAC-27a of the Drosophila mojavensis third chromosome

Biotechnology. Biotechnology is difficult to define but in general it s the use of biological systems to solve problems.

Growing Needs for Practical Molecular Diagnostics: Indonesia s Preparedness for Current Trend

Page 1 of 10 MIDTERM EXAM OF BIO

dbcamplicons pipeline Amplicons

Chapter 10 (Part II) Gene Isolation and Manipulation

Transcription:

Molecular Biology: DNA sequencing Author: Prof Marinda Oosthuizen Licensed under a Creative Commons Attribution license. SEQUENCING OF LARGE TEMPLATES As we have seen, we can obtain up to 800 nucleotides from a single template. But how do we obtain the sequence of a template that is longer than 800 bp? The following will be discussed: Primer walking - for sequencing a relatively small piece of DNA (>1 4 kb) Subcloning when you need to sequence larger templates (> 4 kb) Genome sequencing Primer walking Sequencing by primer walking is an effective strategy for obtaining the sequence of DNA templates ranging in size from 1-4 kb. Primer walking is illustrated in Figure 5. For a cloned template, the initial sequence data is obtained using a standard primer which hybridizes to the vector sequence upstream of the insert DNA. For a PCR product, one of the amplification primers can be used to obtain the initial sequence data. A sequencing primer is then designed towards the 3 end of this initial sequencing data. This oligonucleotide is used to prime a second sequencing reaction on the same template DNA. The data obtained from the second reaction will overlap with the initial data and extend the sequence further downstream. By repeated cycles of oligonucleotide design and DNA sequencing, the cloned insert is sequenced completely in one direction. The same strategy is used to sequence the other strand beginning with a second standard primer which primes in the other direction. The complete DNA sequence can then be compiled from all the smaller sequences. 1 P age

Figure 5: Primer walking to obtain the full length sequence of a template that is too large to sequence in a single sequencing reaction. Subcloning It is not efficient to sequence larger templates (> 4 kb) by primer walking. Such larger DNA templates must first be broken up into smaller overlapping fragments which can be cloned into a vector for sequencing. These subclones should overlap, so that the individual DNA sequences will themselves overlap. The overlaps can be then located, either by eye or using a computer, and the master sequence gradually built up. There are several ways of producing the subclones. The DNA could be cleaved with two different restriction endonucleases, producing one set of fragments with say Sau3A and another with AluI. However, this method suffers from the drawback that the restriction enzyme sites may be inconveniently placed and individual fragments may be too long to be completely 2 P age

sequenced. Often four or five different enzymes will have to be used to clear up all the gaps in the master sequence. (For more details on cloning see the Cloning techniques course notes). An alternative method is to use shotgun cloning (Figure 6), where the DNA template is randomly sheared and cloned into a vector, producing a shotgun library. Clones from this library are sequenced and the individual sequences assembled, until the final finished sequence of the original template is obtained. One drawback of this technique is that some regions of the DNA template may be easier to clone than others, resulting in areas in the final sequence assembly which are over-represented while other regions are under-represented or represented by reads on only one strand. In fact, there may be regions where there are no subclones at all, and this would result in gaps in the master sequence. Such regions would be targeted for further sequencing. Numerous strategies, including the use of PCR, have been developed for the closure of gaps in the sequence. Figure 6: Shotgun sequencing strategy. Genome sequencing New automated sequencing technologies and improved computer assisted methods for sequence assembly have greatly increased the speed of sequencing, and have made sequencing of whole genomes a reality. By sequencing the entire genome of an organism, all of its genes can be identified. 3 Page

Scientists hope that by deciphering the genome sequence of an organism they will gain understanding of how the organism functions, and how its genes work together to direct its growth and development. In the case of disease-causing organisms, genome sequences may reveal genes responsible for virulence and pathogenesis and it may be possible to identify novel vaccine candidate genes or new drug targets. It is hoped that the human genome sequence may shed light on genetically-related diseases. Most genomes are far too big to be inserted into a vector and must be broken into smaller bits in order to be cloned. For example, bacteria have genomes that range from 1 Mb up to 10 Mb, and some are even larger. How do we sequence such a huge piece of DNA? There are two strategies for sequencing whole genomes: the hierarchical approach and the whole genome shotgun approach. See Figure 7 for a schematic comparison. In HIERARCHICAL SEQUENCING, the genomic DNA is first broken down into manageable sized pieces (approximately 100 kb), and these pieces are cloned separately into a cloning vector that will accept large inserts of 100 kb to 3 Mb (e.g. BAC, P1 or YAC). It is then possible to physically order these large clones, resulting in several clones which, between them, span the entire length of the genome. This is known as a tiling path (a minimal subset of overlapping clones that span the whole genome). Ideally the fewest possible clones are identified to make up the tiling path (there are numerous ways of physically ordering the clones, which are beyond the scope of this course. You can read about these methods elsewhere if you are interested). Although the genomic DNA has been fragmented, the size of the piece of DNA in each clone is still too large to sequence directly (remember it is only possible to sequence around 800 bp at a time). So each individual large clone is then sequenced separately, usually by breaking it up into little pieces (approximately 2 kb) and cloning these small fragments of DNA into a sequencing vector. This process is known as sub-cloning (cloning a cloned piece of DNA!). The sequences of many small 2 kb subclones are obtained and assembled to give the sequence of the original large (approximately 100 kb) clone. Since the order of the large clones in the tiling path is known, the complete genome sequence is obtained by assembling the overlapping sequences of these clones. Hierarchical sequencing was the basis of the publicly funded Human Genome Project. The WHOLE GENOME SHOTGUN APPROACH does not make use of ordered subclones. Instead, the entire genome is fragmented into small clones which are sequenced and assembled. Shotgun sequencing was the basis of the privately funded Human Genome Project, and today most small genomes (e.g. bacterial genomes) are sequenced using this strategy. 4 P age

Figure 7: Strategies for sequencing whole genomes: hierarchical sequencing versus the whole genome shotgun approach. 5 P age