Biol 478/595 Intro to Bioinformatics

Size: px
Start display at page:

Download "Biol 478/595 Intro to Bioinformatics"

Transcription

1 Biol 478/595 Intro to Bioinformatics September M 1 Labor Day 4 W 3 MG Database Searching Ch. 6 5 F 5 MG Database Searching Hw1 6 M 8 MG Scoring Matrices Ch 3 and Ch 4 7 W 10 MG Pairwise Alignment 8 F 12 MG Pairwise Alignment Hw2 9 M 15 MG Pairwise Alignment Ch W 17 MG Genome Sequencing 1 F 19 MG Gene Finding/Annotation ti Hw3 1 1 M 22 MG Gene Finding/Annotation Ch W 24 MG Sequence Motifs Ch. 7 1 F 26 MG Sequence Motifs Hw4 4 X M 29 Bot h Exam Note modification to reading: Ch 11 genome analysis 1 19-Sep-08

2 Assembly Protocols (Algorithms) Greedy Original method, not used much now Assemble best overlaps first Uses only local information no mate-pair or other map information 2 19-Sep-08

3 Overlap-Layout-Consensus Primary method used today Find all overlaps Layout as graph, removing redundant information Hamiltonian path - visit each vertex once (NP complete) Align and calculate consensus sequence Problems Repeats cause bad overlaps Multiple haplotypes Duplications 3 19-Sep-08

4 Additional Assembly Protocols Comparative assembly - Align to existing very similar to genome Several times faster, at least 3-4X Problems Insertions/deletions Rearrangements Suitable for new HT (short read technologies) Align by physical map Very time and labor intensive 4 19-Sep-08

5 New technologies 454 sequencing DNA is sheared, oligonucleotide adaptors are attached, attached to bead and PCR amplified Nucleotide incorporation releases of inorganic pyrophosphate (PPi) which is detected by luciferase (i.e., light is produced) Each base is added separately. Runs of bases generate more light. Current reads ~ 500 bp SOLiD technology Similar to 454, beads are attached to glass slides Sequence is determined by sequential hybridization and ligation of partially random oligonucleotides with a central determined base (or pair of bases) that is identified by a specific fluorophore. Record color I, cleave and remove ligated oligonucleotide, repeat The reads that are generated are currently 25 bp, but will probably increase to more than 50 bp in the future. Too short for WGS! Good enough for counting RNAs SOLEXA Amplify DNA on a solid surface using fold-back PCR and anchored primers - creates clusters of 1,000 copies of single-stranded DNA molecules. Sequencing using primers, DNA polymerase and four fluorophore-labelled, reversibly terminating nucleotides. Incorporate nucleotide, read color only one base is incorporated Remove terminators, repeat Average read length is currently 40 bp, should increase Too short for WGS! Good enough for counting RNAs 5 19-Sep-08

6 New Sequencing Technologies 454 SOLiD SOLEXA 6 19-Sep-08

7 EST Assembly Assembling RNA Similar to DNA except Contigs do not join into one sequence Sequences are often primarily from one end, e.g., from 3' (poly- A) end Special Artifacts Post-transcriptional modification Alternative splicing Trans-splicing SNPs/haplotypes Chimeras 7 19-Sep-08

8 Populus Science Sep 15;313(5793): Mb (cytogenetic estimate 550 Mb) 2447 major scaffolds containing 410 Mb 19 known chromosomes Remaining 75 Mb mostly repeated, consistent with estimate of 30% heterchromatin 95% of known genes found 45,500 "genes" 155 scaffolds (335 Mb of sequence) mapped to microsattelite markers Evidence for two whole genome duplications 8 19-Sep-08

9 Populus Clone and sequence statistics, note Proportion of small, medium large clones Amount of sequence trimmed Insert Size Kb Vector Number Reads x10-6 Number Reads Number Bases Used Qual > 20 x10-6 Mb Number Bases After Trimming % Bases Used % of Total Mb plasmid plasmid fosmid 0, Total

10 Populus assembly quality Look for chimeric reads Functional definition: include chloroplast, one end nuclear Average 410 reads/position ~5% chimeric Sep-08

11 Populus Unique (no hits) assembles better than repeats Non-redundant (BLAST hits, probably not poplar) assembles poorly Chloroplast and mitochondrial sequence assembles as separate scaffolds Sep-08

12 Fingerprint i clone and contig layout on assembly of LGII 1. (black): clone coverage - each concentric circle represents 5X clone depth. 2. (red): coverage of clones not assigned to contigs (singletons). 3. anchored contigs, alternating ti color scheme. 4. position of clones in contigs, colored by contig assignment. 5. position of clones with no contig assignment. 6. coverage provided by singletons Sep-08 alternating black and white strips = 1Mb

13 Populus Mapping BACs to chromosomes using FISH Also mapped using microsatellite markers Sep-08

14 Populus Transposable Elements LINES (long interspersed repetitive elements) typically retrotransposons, flanked by long terminal (direct) repeats or LTRs. ~20% of mammalian DNA, Copia, Gypsy, etc Contain genes for transposition Disrupt genes causing mutations Transposition RNA polymerase II transcribes the L1 DNA into RNA. The RNA is translated by ribosomes in the cytoplasm into the proteins. The proteins and RNA join together and reenter the nucleus. The endonuclease cuts a strand of "target" DNA, often in the intron of a gene. The reverse transcriptase copies the L1 RNA into L1 DNA which is inserted into the target DNA forming a new L1 element there. SINES (short interspersed repetitive elements), 10-15% of mammalian DNA, non LTR retrotransposons reverse-transcribed RNA molecules originally transcribed b Do not include transposition enzymes Insertion known to cause certain mutations/genetic diseases: Hemophilia A and B (Factor VIII and IX genes), X-linked severe combined immunodeficiency (SCID) (IL-2 receptor), porphyria, predisposition to colon polyps and cancer (APC gene), Duchenne muscular dystrophy (dystrophin gene) MITES (miniature Inverted-repeatrepeat transposable elements) 5' GGCCAGTCACAATGG..~400 nt..ccattgtgactggcc 3' 3' CCGGTCAGTGTTACC..~400 nt..ggtaacactgaccgg 5' ~ bp terminal inverted repeats related to transposons prominent in plants but found in most organisms known to cause mutations Sep-08

15 Populus Whole genome duplication 4DTV distance four-fold degenerate (synonomous) third-codon position transversion in 4 and 6 codon families, the third does not change the corresponding amino acid in WGD all genes are duplicated at same time and therefore tend to have similar 4DTV Sep-08

Reading Lecture 8: Lecture 9: Lecture 8. DNA Libraries. Definition Types Construction

Reading Lecture 8: Lecture 9: Lecture 8. DNA Libraries. Definition Types Construction Lecture 8 Reading Lecture 8: 96-110 Lecture 9: 111-120 DNA Libraries Definition Types Construction 142 DNA Libraries A DNA library is a collection of clones of genomic fragments or cdnas from a certain

More information

Multiple choice questions (numbers in brackets indicate the number of correct answers)

Multiple choice questions (numbers in brackets indicate the number of correct answers) 1 Multiple choice questions (numbers in brackets indicate the number of correct answers) February 1, 2013 1. Ribose is found in Nucleic acids Proteins Lipids RNA DNA (2) 2. Most RNA in cells is transfer

More information

GENETICS - CLUTCH CH.15 GENOMES AND GENOMICS.

GENETICS - CLUTCH CH.15 GENOMES AND GENOMICS. !! www.clutchprep.com CONCEPT: OVERVIEW OF GENOMICS Genomics is the study of genomes in their entirety Bioinformatics is the analysis of the information content of genomes - Genes, regulatory sequences,

More information

GENETICS EXAM 3 FALL a) is a technique that allows you to separate nucleic acids (DNA or RNA) by size.

GENETICS EXAM 3 FALL a) is a technique that allows you to separate nucleic acids (DNA or RNA) by size. Student Name: All questions are worth 5 pts. each. GENETICS EXAM 3 FALL 2004 1. a) is a technique that allows you to separate nucleic acids (DNA or RNA) by size. b) Name one of the materials (of the two

More information

A. Incorrect! This statement is true. Transposable elements can cause chromosome rearrangements.

A. Incorrect! This statement is true. Transposable elements can cause chromosome rearrangements. Genetics - Problem Drill 17: Transposable Genetic Elements No. 1 of 10 1. Which of the following statements is NOT true? (A) Transposable elements can cause chromosome rearrangements. (B) Transposons can

More information

Recitation CHAPTER 9 DNA Technologies

Recitation CHAPTER 9 DNA Technologies Recitation CHAPTER 9 DNA Technologies DNA Cloning: General Scheme A cloning vector and eukaryotic chromosomes are separately cleaved with the same restriction endonuclease. (A single chromosome is shown

More information

Mate-pair library data improves genome assembly

Mate-pair library data improves genome assembly De Novo Sequencing on the Ion Torrent PGM APPLICATION NOTE Mate-pair library data improves genome assembly Highly accurate PGM data allows for de Novo Sequencing and Assembly For a draft assembly, generate

More information

10/20/2009 Comp 590/Comp Fall

10/20/2009 Comp 590/Comp Fall Lecture 14: DNA Sequencing Study Chapter 8.9 10/20/2009 Comp 590/Comp 790-90 Fall 2009 1 DNA Sequencing Shear DNA into millions of small fragments Read 500 700 nucleotides at a time from the small fragments

More information

CSCI2950-C DNA Sequencing and Fragment Assembly

CSCI2950-C DNA Sequencing and Fragment Assembly CSCI2950-C DNA Sequencing and Fragment Assembly Lecture 2: Sept. 7, 2010 http://cs.brown.edu/courses/csci2950-c/ DNA sequencing How we obtain the sequence of nucleotides of a species 5 3 ACGTGACTGAGGACCGTG

More information

Genome Projects. Part III. Assembly and sequencing of human genomes

Genome Projects. Part III. Assembly and sequencing of human genomes Genome Projects Part III Assembly and sequencing of human genomes All current genome sequencing strategies are clone-based. 1. ordered clone sequencing e.g., C. elegans well suited for repetitive sequences

More information

BIOLOGY - CLUTCH CH.20 - BIOTECHNOLOGY.

BIOLOGY - CLUTCH CH.20 - BIOTECHNOLOGY. !! www.clutchprep.com CONCEPT: DNA CLONING DNA cloning is a technique that inserts a foreign gene into a living host to replicate the gene and produce gene products. Transformation the process by which

More information

Computational Biology 2. Pawan Dhar BII

Computational Biology 2. Pawan Dhar BII Computational Biology 2 Pawan Dhar BII Lecture 1 Introduction to terms, techniques and concepts in molecular biology Molecular biology - a primer Human body has 100 trillion cells each containing 3 billion

More information

Course summary. Today. PCR Polymerase chain reaction. Obtaining molecular data. Sequencing. DNA sequencing. Genome Projects.

Course summary. Today. PCR Polymerase chain reaction. Obtaining molecular data. Sequencing. DNA sequencing. Genome Projects. Goals Organization Labs Project Reading Course summary DNA sequencing. Genome Projects. Today New DNA sequencing technologies. Obtaining molecular data PCR Typically used in empirical molecular evolution

More information

"PhD Course Fall 2017"

PhD Course Fall 2017 Prof. Fahd M. Nasr Faculty of Sciences Lebanese University Beirut, Lebanon Winding your way through the genome, transcriptome, proteome to human diseases "PhD Course Fall 2017" Lecture 3 1 Genome structure,

More information

Lecture 14: DNA Sequencing

Lecture 14: DNA Sequencing Lecture 14: DNA Sequencing Study Chapter 8.9 10/17/2013 COMP 465 Fall 2013 1 Shear DNA into millions of small fragments Read 500 700 nucleotides at a time from the small fragments (Sanger method) DNA Sequencing

More information

DNA and Biotechnology Form of DNA Form of DNA Form of DNA Form of DNA Replication of DNA Replication of DNA

DNA and Biotechnology Form of DNA Form of DNA Form of DNA Form of DNA Replication of DNA Replication of DNA 21 DNA and Biotechnology DNA and Biotechnology OUTLINE: Replication of DNA Gene Expression Mutations Regulating Gene Activity Genetic Engineering Genomics DNA (deoxyribonucleic acid) Double-stranded molecule

More information

Introduction to metagenome assembly. Bas E. Dutilh Metagenomic Methods for Microbial Ecologists, NIOO September 18 th 2014

Introduction to metagenome assembly. Bas E. Dutilh Metagenomic Methods for Microbial Ecologists, NIOO September 18 th 2014 Introduction to metagenome assembly Bas E. Dutilh Metagenomic Methods for Microbial Ecologists, NIOO September 18 th 2014 Sequencing specs* Method Read length Accuracy Million reads Time Cost per M 454

More information

Chapter 8: Recombinant DNA. Ways this technology touches us. Overview. Genetic Engineering

Chapter 8: Recombinant DNA. Ways this technology touches us. Overview. Genetic Engineering Chapter 8 Recombinant DNA and Genetic Engineering Genetic manipulation Ways this technology touches us Criminal justice The Justice Project, started by law students to advocate for DNA testing of Death

More information

Human genetic variation

Human genetic variation Human genetic variation CHEW Fook Tim Human Genetic Variation Variants contribute to rare and common diseases Variants can be used to trace human origins Human Genetic Variation What types of variants

More information

d. reading a DNA strand and making a complementary messenger RNA

d. reading a DNA strand and making a complementary messenger RNA Biol/ MBios 301 (General Genetics) Spring 2003 Second Midterm Examination A (100 points possible) Key April 1, 2003 10 Multiple Choice Questions-4 pts. each (Choose the best answer) 1. Transcription involves:

More information

Bio 101 Sample questions: Chapter 10

Bio 101 Sample questions: Chapter 10 Bio 101 Sample questions: Chapter 10 1. Which of the following is NOT needed for DNA replication? A. nucleotides B. ribosomes C. Enzymes (like polymerases) D. DNA E. all of the above are needed 2 The information

More information

Lecture 3 Mutagens and Mutagenesis. 1. Mutagens A. Physical and Chemical mutagens B. Transposons and retrotransposons C. T-DNA

Lecture 3 Mutagens and Mutagenesis. 1. Mutagens A. Physical and Chemical mutagens B. Transposons and retrotransposons C. T-DNA Lecture 3 Mutagens and Mutagenesis 1. Mutagens A. Physical and Chemical mutagens B. Transposons and retrotransposons C. T-DNA 2. Mutagenesis A. Screen B. Selection C. Lethal mutations Read: 508-514 Figs:

More information

Section 14.1 Structure of ribonucleic acid

Section 14.1 Structure of ribonucleic acid Section 14.1 Structure of ribonucleic acid The genetic code Sections of DNA are transcribed onto a single stranded molecule called RNA There are two types of RNA One type copies the genetic code and transfers

More information

Degenerate site - twofold degenerate site - fourfold degenerate site

Degenerate site - twofold degenerate site - fourfold degenerate site Genetic code Codon: triple base pairs defining each amino acid. Why genetic code is triple? double code represents 4 2 = 16 different information triple code: 4 3 = 64 (two much to represent 20 amino acids)

More information

CHAPTERS , 17: Eukaryotic Genetics

CHAPTERS , 17: Eukaryotic Genetics CHAPTERS 14.1 14.6, 17: Eukaryotic Genetics 1. Review the levels of DNA packing within the eukaryote nucleus. Label each level. (A similar diagram is on pg 188 of your textbook.) 2. How do the coding regions

More information

Enzyme that uses RNA as a template to synthesize a complementary DNA

Enzyme that uses RNA as a template to synthesize a complementary DNA Biology 105: Introduction to Genetics PRACTICE FINAL EXAM 2006 Part I: Definitions Homology: Comparison of two or more protein or DNA sequence to ascertain similarities in sequences. If two genes have

More information

2054, Chap. 14, page 1

2054, Chap. 14, page 1 2054, Chap. 14, page 1 I. Recombinant DNA technology (Chapter 14) A. recombinant DNA technology = collection of methods used to perform genetic engineering 1. genetic engineering = deliberate modification

More information

CHAPTER 21 GENOMES AND THEIR EVOLUTION

CHAPTER 21 GENOMES AND THEIR EVOLUTION GENETICS DATE CHAPTER 21 GENOMES AND THEIR EVOLUTION COURSE 213 AP BIOLOGY 1 Comparisons of genomes provide information about the evolutionary history of genes and taxonomic groups Genomics - study of

More information

Chapter 5. Structural Genomics

Chapter 5. Structural Genomics Chapter 5. Structural Genomics Contents 5. Structural Genomics 5.1. DNA Sequencing Strategies 5.1.1. Map-based Strategies 5.1.2. Whole Genome Shotgun Sequencing 5.2. Genome Annotation 5.2.1. Using Bioinformatic

More information

AP Biology. The BIG Questions. Chapter 19. Prokaryote vs. eukaryote genome. Prokaryote vs. eukaryote genome. Why turn genes on & off?

AP Biology. The BIG Questions. Chapter 19. Prokaryote vs. eukaryote genome. Prokaryote vs. eukaryote genome. Why turn genes on & off? The BIG Questions Chapter 19. Control of Eukaryotic Genome How are genes turned on & off in eukaryotes? How do cells with the same genes differentiate to perform completely different, specialized functions?

More information

DNA & Protein Synthesis. The source and the process!

DNA & Protein Synthesis. The source and the process! DNA & Protein Synthesis The source and the process! Agenda I. DNA and Genes II. Protein Synthesis III. The Genetic Code I. DNA & Genes: The beauty of DNA Remember: DNA is a macromolecule that stores information

More information

Genome annotation & EST

Genome annotation & EST Genome annotation & EST What is genome annotation? The process of taking the raw DNA sequence produced by the genome sequence projects and adding the layers of analysis and interpretation necessary

More information

Sequence assembly. Jose Blanca COMAV institute bioinf.comav.upv.es

Sequence assembly. Jose Blanca COMAV institute bioinf.comav.upv.es Sequence assembly Jose Blanca COMAV institute bioinf.comav.upv.es Sequencing project Unknown sequence { experimental evidence result read 1 read 4 read 2 read 5 read 3 read 6 read 7 Computational requirements

More information

The study of the structure, function, and interaction of cellular proteins is called. A) bioinformatics B) haplotypics C) genomics D) proteomics

The study of the structure, function, and interaction of cellular proteins is called. A) bioinformatics B) haplotypics C) genomics D) proteomics Human Biology, 12e (Mader / Windelspecht) Chapter 21 DNA Which of the following is not a component of a DNA molecule? A) a nitrogen-containing base B) deoxyribose sugar C) phosphate D) phospholipid Messenger

More information

Multiple choice questions (numbers in brackets indicate the number of correct answers)

Multiple choice questions (numbers in brackets indicate the number of correct answers) 1 February 15, 2013 Multiple choice questions (numbers in brackets indicate the number of correct answers) 1. Which of the following statements are not true Transcriptomes consist of mrnas Proteomes consist

More information

Review of Protein (one or more polypeptide) A polypeptide is a long chain of..

Review of Protein (one or more polypeptide) A polypeptide is a long chain of.. Gene expression Review of Protein (one or more polypeptide) A polypeptide is a long chain of.. In a protein, the sequence of amino acid determines its which determines the protein s A protein with an enzymatic

More information

Alignment and Assembly

Alignment and Assembly Alignment and Assembly Genome assembly refers to the process of taking a large number of short DNA sequences and putting them back together to create a representation of the original chromosomes from which

More information

Genome Sequence Assembly

Genome Sequence Assembly Genome Sequence Assembly Learning Goals: Introduce the field of bioinformatics Familiarize the student with performing sequence alignments Understand the assembly process in genome sequencing Introduction:

More information

Chapter 13. From DNA to Protein

Chapter 13. From DNA to Protein Chapter 13 From DNA to Protein Proteins All proteins consist of polypeptide chains A linear sequence of amino acids Each chain corresponds to the nucleotide base sequenceof a gene The Path From Genes to

More information

Marker types. Potato Association of America Frederiction August 9, Allen Van Deynze

Marker types. Potato Association of America Frederiction August 9, Allen Van Deynze Marker types Potato Association of America Frederiction August 9, 2009 Allen Van Deynze Use of DNA Markers in Breeding Germplasm Analysis Fingerprinting of germplasm Arrangement of diversity (clustering,

More information

Chapter 19. Control of Eukaryotic Genome. AP Biology

Chapter 19. Control of Eukaryotic Genome. AP Biology Chapter 19. Control of Eukaryotic Genome The BIG Questions How are genes turned on & off in eukaryotes? How do cells with the same genes differentiate to perform completely different, specialized functions?

More information

BENG 183 Trey Ideker. Genome Assembly and Physical Mapping

BENG 183 Trey Ideker. Genome Assembly and Physical Mapping BENG 183 Trey Ideker Genome Assembly and Physical Mapping Reasons for sequencing Complete genome sequencing!!! Resequencing (Confirmatory) E.g., short regions containing single nucleotide polymorphisms

More information

BIOLOGY - CLUTCH CH.17 - GENE EXPRESSION.

BIOLOGY - CLUTCH CH.17 - GENE EXPRESSION. !! www.clutchprep.com CONCEPT: GENES Beadle and Tatum develop the one gene one enzyme hypothesis through their work with Neurospora (bread mold). This idea was later revised as the one gene one polypeptide

More information

Learning Genetic Transposition with an Interactive Computer Program

Learning Genetic Transposition with an Interactive Computer Program International Journal of Computer Engineering and Information Technology VOL. 8, NO. 8, August 2016, 146 156 Available online at: www.ijceit.org E-ISSN 2412-8856 (Online) Learning Genetic Transposition

More information

Bi 8 Lecture 5. Ellen Rothenberg 19 January 2016

Bi 8 Lecture 5. Ellen Rothenberg 19 January 2016 Bi 8 Lecture 5 MORE ON HOW WE KNOW WHAT WE KNOW and intro to the protein code Ellen Rothenberg 19 January 2016 SIZE AND PURIFICATION BY SYNTHESIS: BASIS OF EARLY SEQUENCING complex mixture of aborted DNA

More information

MBioS 503: Section 1 Chromosome, Gene, Translation, & Transcription. Gene Organization. Genome. Objectives: Gene Organization

MBioS 503: Section 1 Chromosome, Gene, Translation, & Transcription. Gene Organization. Genome. Objectives: Gene Organization Overview & Recap of Molecular Biology before the last two sections MBioS 503: Section 1 Chromosome, Gene, Translation, & Transcription Gene Organization Joy Winuthayanon, PhD School of Molecular Biosciences

More information

Unit 1: DNA and the Genome. Sub-Topic (1.3) Gene Expression

Unit 1: DNA and the Genome. Sub-Topic (1.3) Gene Expression Unit 1: DNA and the Genome Sub-Topic (1.3) Gene Expression Unit 1: DNA and the Genome Sub-Topic (1.3) Gene Expression On completion of this subtopic I will be able to State the meanings of the terms genotype,

More information

Chapter 20. Gene creatures, partii: jumping genes and junk DNA. Prepared by Woojoo Choi

Chapter 20. Gene creatures, partii: jumping genes and junk DNA. Prepared by Woojoo Choi Chapter 20. Gene creatures, partii: jumping genes and junk DNA Prepared by Woojoo Choi 1) Transposable elements (transposons): segment of DNA that can move as a unit from one location to another, but which

More information

Transcription and Translation

Transcription and Translation Transcription and Translation Central Dogma of Molecular The flow of information in the cell starts at DNA, which replicates to form more DNA. Information is then transcribed into RNA, and then it is translated

More information

CONSTRUCTION OF GENOMIC LIBRARY

CONSTRUCTION OF GENOMIC LIBRARY MODULE 4-LECTURE 4 CONSTRUCTION OF GENOMIC LIBRARY 4-4.1. Introduction A genomic library is an organism specific collection of DNA covering the entire genome of an organism. It contains all DNA sequences

More information

Contact us for more information and a quotation

Contact us for more information and a quotation GenePool Information Sheet #1 Installed Sequencing Technologies in the GenePool The GenePool offers sequencing service on three platforms: Sanger (dideoxy) sequencing on ABI 3730 instruments Illumina SOLEXA

More information

Chapter 10 Genetic Engineering: A Revolution in Molecular Biology

Chapter 10 Genetic Engineering: A Revolution in Molecular Biology Chapter 10 Genetic Engineering: A Revolution in Molecular Biology Genetic Engineering Direct, deliberate modification of an organism s genome bioengineering Biotechnology use of an organism s biochemical

More information

BIOLOGY 205 Midterm II - 19 February Each of the following statements are correct regarding Eukaryotic genes and genomes EXCEPT?

BIOLOGY 205 Midterm II - 19 February Each of the following statements are correct regarding Eukaryotic genes and genomes EXCEPT? BIOLOGY 205 Midterm II - 19 February 1999 Name Multiple choice questions 4 points each (Best 12 out of 13). 1. Each of the following statements are correct regarding Eukaryotic genes and genomes EXCEPT?

More information

Sequence Assembly and Alignment. Jim Noonan Department of Genetics

Sequence Assembly and Alignment. Jim Noonan Department of Genetics Sequence Assembly and Alignment Jim Noonan Department of Genetics james.noonan@yale.edu www.yale.edu/noonanlab The assembly problem >>10 9 sequencing reads 36 bp - 1 kb 3 Gb Outline Basic concepts in genome

More information

Direct insertional mutagenesis by L1 resulted in diseases including muscular dystrophy, hemophilia, and breast cancer

Direct insertional mutagenesis by L1 resulted in diseases including muscular dystrophy, hemophilia, and breast cancer Direct insertional mutagenesis by L1 resulted in diseases including muscular dystrophy, hemophilia, and breast cancer Haemophilia A resulting from de novo insertion of L1 sequences represents a novel mechanism

More information

Problem Set 8. Answer Key

Problem Set 8. Answer Key MCB 102 University of California, Berkeley August 11, 2009 Isabelle Philipp Online Document Problem Set 8 Answer Key 1. The Genetic Code (a) Are all amino acids encoded by the same number of codons? no

More information

Unit 6: Molecular Genetics & DNA Technology Guided Reading Questions (100 pts total)

Unit 6: Molecular Genetics & DNA Technology Guided Reading Questions (100 pts total) Name: AP Biology Biology, Campbell and Reece, 7th Edition Adapted from chapter reading guides originally created by Lynn Miriello Chapter 16 The Molecular Basis of Inheritance Unit 6: Molecular Genetics

More information

Molecular Genetics of Disease and the Human Genome Project

Molecular Genetics of Disease and the Human Genome Project 9 Molecular Genetics of Disease and the Human Genome Project Fig. 1. The 23 chromosomes in the human genome. There are 22 autosomes (chromosomes 1 to 22) and two sex chromosomes (X and Y). Females inherit

More information

I. Gene Expression Figure 1: Central Dogma of Molecular Biology

I. Gene Expression Figure 1: Central Dogma of Molecular Biology I. Gene Expression Figure 1: Central Dogma of Molecular Biology Central Dogma: Gene Expression: RNA Structure RNA nucleotides contain the pentose sugar Ribose instead of deoxyribose. Contain the bases

More information

DNA Evolution of knowledge about gene. Contains information about RNAs and proteins. Polynucleotide chains; Double stranded molecule;

DNA Evolution of knowledge about gene. Contains information about RNAs and proteins. Polynucleotide chains; Double stranded molecule; Evolution of knowledge about gene G. Mendel Hereditary factors W.Johannsen, 1909 G.W.Beadle, E.L.Tatum, 1945 Ingram, 1957 Actual concepts The gene hereditary unit located in chromosomes Hypotheses One

More information

Chapter 17. From Gene to Protein. Slide 1. Slide 2. Slide 3. Gene Expression. Which of the following is the best example of gene expression? Why?

Chapter 17. From Gene to Protein. Slide 1. Slide 2. Slide 3. Gene Expression. Which of the following is the best example of gene expression? Why? Slide 1 Chapter 17 From Gene to Protein PowerPoint Lecture Presentations for Biology Eighth Edition Neil Campbell and Jane Reece Lectures by Chris Romero, updated by Erin Barley with contributions from

More information

LECTURE 20. Repeated DNA Sequences. Prokaryotes:

LECTURE 20. Repeated DNA Sequences. Prokaryotes: LECTURE 20 Repeated DNA Sequences Prokaryotes: 1) Most DNA is in the form of unique sequences. Exceptions are the genes encoding ribosomal RNA (rdna, 10-20 copies) and various recognition sequences (e.g.,

More information

DNA Function: Information Transmission

DNA Function: Information Transmission DNA Function: Information Transmission DNA is called the code of life. What does it code for? *the information ( code ) to make proteins! Why are proteins so important? Nearly every function of a living

More information

CHAPTER 9 DNA Technologies

CHAPTER 9 DNA Technologies CHAPTER 9 DNA Technologies Recombinant DNA Artificially created DNA that combines sequences that do not occur together in the nature Basis of much of the modern molecular biology Molecular cloning of genes

More information

There are four major types of introns. Group I introns, found in some rrna genes, are self-splicing: they can catalyze their own removal.

There are four major types of introns. Group I introns, found in some rrna genes, are self-splicing: they can catalyze their own removal. 1 2 Continuous genes - Intron: Many eukaryotic genes contain coding regions called exons and noncoding regions called intervening sequences or introns. The average human gene contains from eight to nine

More information

Chapter 20 Recombinant DNA Technology. Copyright 2009 Pearson Education, Inc.

Chapter 20 Recombinant DNA Technology. Copyright 2009 Pearson Education, Inc. Chapter 20 Recombinant DNA Technology Copyright 2009 Pearson Education, Inc. 20.1 Recombinant DNA Technology Began with Two Key Tools: Restriction Enzymes and DNA Cloning Vectors Recombinant DNA refers

More information

Student name ID # Second Mid Term Exam, Biology 2020, Spring 2002 Scores Total

Student name ID # Second Mid Term Exam, Biology 2020, Spring 2002 Scores Total Second Mid Term Exam, Biology 2020, Spring 2002 Scores 1. 2. 3. 4. 5. 6. 7. 8. 9. 10. 11. 12. 13. 14. 15. 16. 17. 18. 19. 20. 21. Total 1 1. Matching (7 pts). Each answer is used exactly once Helicase

More information

Gene Expression Technology

Gene Expression Technology Gene Expression Technology Bing Zhang Department of Biomedical Informatics Vanderbilt University bing.zhang@vanderbilt.edu Gene expression Gene expression is the process by which information from a gene

More information

6.C: Students will explain the purpose and process of transcription and translation using models of DNA and RNA

6.C: Students will explain the purpose and process of transcription and translation using models of DNA and RNA 6.C: Students will explain the purpose and process of transcription and translation using models of DNA and RNA DNA mrna Protein DNA is found in the nucleus, but making a protein occurs at the ribosome

More information

Molecular Genetics Techniques. BIT 220 Chapter 20

Molecular Genetics Techniques. BIT 220 Chapter 20 Molecular Genetics Techniques BIT 220 Chapter 20 What is Cloning? Recombinant DNA technologies 1. Producing Recombinant DNA molecule Incorporate gene of interest into plasmid (cloning vector) 2. Recombinant

More information

Biology 105: Introduction to Genetics PRACTICE FINAL EXAM Part I: Definitions. Homology: Reverse transcriptase. Allostery: cdna library

Biology 105: Introduction to Genetics PRACTICE FINAL EXAM Part I: Definitions. Homology: Reverse transcriptase. Allostery: cdna library Biology 105: Introduction to Genetics PRACTICE FINAL EXAM 2006 Part I: Definitions Homology: Reverse transcriptase Allostery: cdna library Transformation Part II Short Answer 1. Describe the reasons for

More information

Genomes summary. Bacterial genome sizes

Genomes summary. Bacterial genome sizes Genomes summary 1. >930 bacterial genomes sequenced. 2. Circular. Genes densely packed. 3. 2-10 Mbases, 470-7,000 genes 4. Genomes of >200 eukaryotes (45 higher ) sequenced. 5. Linear chromosomes 6. On

More information

PBG 430/530 Exam

PBG 430/530 Exam 1 PBG 430/530 Exam 2 2013 1. In a deoxyribonucleotide, 5 and 3 refer to the a. start site for transcription. b. start site for translation. c. carbons where (respectively) the phosphate and hydroxyl groups

More information

We begin with a high-level overview of sequencing. There are three stages in this process.

We begin with a high-level overview of sequencing. There are three stages in this process. Lecture 11 Sequence Assembly February 10, 1998 Lecturer: Phil Green Notes: Kavita Garg 11.1. Introduction This is the first of two lectures by Phil Green on Sequence Assembly. Yeast and some of the bacterial

More information

Genome Sequencing-- Strategies

Genome Sequencing-- Strategies Genome Sequencing-- Strategies Bio 4342 Spring 04 What is a genome? A genome can be defined as the entire DNA content of each nucleated cell in an organism Each organism has one or more chromosomes that

More information

Themes: RNA and RNA Processing. Messenger RNA (mrna) What is a gene? RNA is very versatile! RNA-RNA interactions are very important!

Themes: RNA and RNA Processing. Messenger RNA (mrna) What is a gene? RNA is very versatile! RNA-RNA interactions are very important! Themes: RNA is very versatile! RNA and RNA Processing Chapter 14 RNA-RNA interactions are very important! Prokaryotes and Eukaryotes have many important differences. Messenger RNA (mrna) Carries genetic

More information

Structural variation. Marta Puig Institut de Biotecnologia i Biomedicina Universitat Autònoma de Barcelona

Structural variation. Marta Puig Institut de Biotecnologia i Biomedicina Universitat Autònoma de Barcelona Structural variation Marta Puig Institut de Biotecnologia i Biomedicina Universitat Autònoma de Barcelona Genetic variation How much genetic variation is there between individuals? What type of variants

More information

Draft 3 Annotation of DGA06H06, Contig 1 Jeannette Wong Bio4342W 27 April 2009

Draft 3 Annotation of DGA06H06, Contig 1 Jeannette Wong Bio4342W 27 April 2009 Page 1 Draft 3 Annotation of DGA06H06, Contig 1 Jeannette Wong Bio4342W 27 April 2009 Page 2 Introduction: Annotation is the process of analyzing the genomic sequence of an organism. Besides identifying

More information

Learning Objectives :

Learning Objectives : Learning Objectives : Understand the basic differences between genomic and cdna libraries Understand how genomic libraries are constructed Understand the purpose for having overlapping DNA fragments in

More information

From DNA to Protein: Genotype to Phenotype

From DNA to Protein: Genotype to Phenotype 12 From DNA to Protein: Genotype to Phenotype 12.1 What Is the Evidence that Genes Code for Proteins? The gene-enzyme relationship is one-gene, one-polypeptide relationship. Example: In hemoglobin, each

More information

Transcription in Eukaryotes

Transcription in Eukaryotes Transcription in Eukaryotes Biology I Hayder A Giha Transcription Transcription is a DNA-directed synthesis of RNA, which is the first step in gene expression. Gene expression, is transformation of the

More information

Chapter 12 Packet DNA 1. What did Griffith conclude from his experiment? 2. Describe the process of transformation.

Chapter 12 Packet DNA 1. What did Griffith conclude from his experiment? 2. Describe the process of transformation. Chapter 12 Packet DNA and RNA Name Period California State Standards covered by this chapter: Cell Biology 1. The fundamental life processes of plants and animals depend on a variety of chemical reactions

More information

CHAPTER 21 LECTURE SLIDES

CHAPTER 21 LECTURE SLIDES CHAPTER 21 LECTURE SLIDES Prepared by Brenda Leady University of Toledo To run the animations you must be in Slideshow View. Use the buttons on the animation to play, pause, and turn audio/text on or off.

More information

PLNT2530 (2018) Unit 6b Sequence Libraries

PLNT2530 (2018) Unit 6b Sequence Libraries PLNT2530 (2018) Unit 6b Sequence Libraries Molecular Biotechnology (Ch 4) Analysis of Genes and Genomes (Ch 5) Unless otherwise cited or referenced, all content of this presenataion is licensed under the

More information

Introduction to Plant Genomics and Online Resources. Manish Raizada University of Guelph

Introduction to Plant Genomics and Online Resources. Manish Raizada University of Guelph Introduction to Plant Genomics and Online Resources Manish Raizada University of Guelph Genomics Glossary http://www.genomenewsnetwork.org/articles/06_00/sequence_primer.shtml Annotation Adding pertinent

More information

XXII DNA cloning and sequencing. Outline

XXII DNA cloning and sequencing. Outline XXII DNA cloning and sequencing 1) Deriving DNA for cloning Outline 2) Vectors; forming recombinant DNA; cloning DNA; and screening for clones containing recombinant DNA [replica plating and autoradiography;

More information

Biology 201 (Genetics) Exam #3 120 points 20 November Read the question carefully before answering. Think before you write.

Biology 201 (Genetics) Exam #3 120 points 20 November Read the question carefully before answering. Think before you write. Name KEY Section Biology 201 (Genetics) Exam #3 120 points 20 November 2006 Read the question carefully before answering. Think before you write. You will have up to 50 minutes to take this exam. After

More information

Chapter 14: From DNA to Protein

Chapter 14: From DNA to Protein Chapter 14: From DNA to Protein Steps from DNA to Proteins Same two steps produce all proteins: 1) DNA is transcribed to form RNA Occurs in the nucleus RNA moves into cytoplasm 2) RNA is translated in

More information

Chapters 31-32: Ribonucleic Acid (RNA)

Chapters 31-32: Ribonucleic Acid (RNA) Chapters 31-32: Ribonucleic Acid (RNA) Short segments from the transcription, processing and translation sections of each chapter Slide 1 RNA In comparison with DNA RNA utilizes uracil in place of thymine

More information

Hello! Outline. Cell Biology: RNA and Protein synthesis. In all living cells, DNA molecules are the storehouses of information. 6.

Hello! Outline. Cell Biology: RNA and Protein synthesis. In all living cells, DNA molecules are the storehouses of information. 6. Cell Biology: RNA and Protein synthesis In all living cells, DNA molecules are the storehouses of information Hello! Outline u 1. Key concepts u 2. Central Dogma u 3. RNA Types u 4. RNA (Ribonucleic Acid)

More information

Chapter 20 DNA Technology & Genomics. If we can, should we?

Chapter 20 DNA Technology & Genomics. If we can, should we? Chapter 20 DNA Technology & Genomics If we can, should we? Biotechnology Genetic manipulation of organisms or their components to make useful products Humans have been doing this for 1,000s of years plant

More information

Key Area 1.3: Gene Expression

Key Area 1.3: Gene Expression Key Area 1.3: Gene Expression RNA There is a second type of nucleic acid in the cell, called RNA. RNA plays a vital role in the production of protein from the code in the DNA. What is gene expression?

More information

Genetics and Biotechnology. Section 1. Applied Genetics

Genetics and Biotechnology. Section 1. Applied Genetics Section 1 Applied Genetics Selective Breeding! The process by which desired traits of certain plants and animals are selected and passed on to their future generations is called selective breeding. Section

More information

Lecture for Wednesday. Dr. Prince BIOL 1408

Lecture for Wednesday. Dr. Prince BIOL 1408 Lecture for Wednesday Dr. Prince BIOL 1408 THE FLOW OF GENETIC INFORMATION FROM DNA TO RNA TO PROTEIN Copyright 2009 Pearson Education, Inc. Genes are expressed as proteins A gene is a segment of DNA that

More information

Course Syllabus for FISH/CMBL 7660 Fall 2008

Course Syllabus for FISH/CMBL 7660 Fall 2008 Course Syllabus for FISH/CMBL 7660 Fall 2008 Course title: Molecular Genetics and Biotechnology Course number: FISH 7660/CMBL7660 Instructor: Dr. John Liu Room: 303 Swingle Hall Lecture: 8:00-9:15 a.m.

More information

Lecture 1 Introduction. Human genetics, its goals and role in modern medicine. Nuclear and mitochondrial genomes.

Lecture 1 Introduction. Human genetics, its goals and role in modern medicine. Nuclear and mitochondrial genomes. Lecture 1 Introduction. Human genetics, its goals and role in modern medicine. Nuclear and mitochondrial genomes. 1. Introduction into human genetics. 2. Methods of studying in human genetics. 3. Levels

More information

Chapter 7. DNA Microarrays

Chapter 7. DNA Microarrays Bioinformatics III Structural Bioinformatics and Genome Analysis Chapter 7. DNA Microarrays 7.9 Next Generation Sequencing 454 Sequencing Solexa Illumina Solid TM System Sequencing Process of determining

More information

Big Idea 3C Basic Review

Big Idea 3C Basic Review Big Idea 3C Basic Review 1. A gene is a. A sequence of DNA that codes for a protein. b. A sequence of amino acids that codes for a protein. c. A sequence of codons that code for nucleic acids. d. The end

More information

Transcription is the first stage of gene expression

Transcription is the first stage of gene expression Transcription is the first stage of gene expression RNA synthesis is catalyzed by RNA polymerase, which pries the DNA strands apart and hooks together the RNA nucleotides The RNA is complementary to the

More information

SolCAP. Executive Commitee : David Douches Walter De Jong Robin Buell David Francis Alexandra Stone Lukas Mueller AllenVan Deynze

SolCAP. Executive Commitee : David Douches Walter De Jong Robin Buell David Francis Alexandra Stone Lukas Mueller AllenVan Deynze SolCAP Solanaceae Coordinated Agricultural Project Supported by the National Research Initiative Plant Genome Program of USDA CSREES for the Improvement of Potato and Tomato Executive Commitee : David

More information