What is Bioinformatics? Bioinformatics is the application of computational techniques to the discovery of knowledge from biological databases.

Similar documents
Algorithms in Bioinformatics

Data Mining for Biological Data Analysis

Following text taken from Suresh Kumar. Bioinformatics Web - Comprehensive educational resource on Bioinformatics. 6th May.2005

COMP 555 Bioalgorithms. Fall Lecture 1: Course Introduction

Course Information. Introduction to Algorithms in Computational Biology Lecture 1. Relations to Some Other Courses

CHAPTER 11 DNA NOTES PT. 4: PROTEIN SYNTHESIS TRANSCRIPTION & TRANSLATION

What Are the Chemical Structures and Functions of Nucleic Acids?

Introduction to Algorithms in Computational Biology Lecture 1

Chapter 6. Genes and DNA. Table of Contents. Section 1 What Does DNA Look Like? Section 2 How DNA Works

Introduction to Bioinformatics

Introduction to Bioinformatics

Introduction. CS482/682 Computational Techniques in Biological Sequence Analysis

Computers in Biology and Bioinformatics

Introducing Bioinformatics Concepts in CS1

MATH 5610, Computational Biology

DNA and Biotechnology Form of DNA Form of DNA Form of DNA Form of DNA Replication of DNA Replication of DNA

Molecular Biology. IMBB 2017 RAB, Kigali - Rwanda May 02 13, Francesca Stomeo

Chapter 13 - Concept Mapping

Introduction to Cellular Biology and Bioinformatics. Farzaneh Salari

Two Mark question and Answers

CSCI 2570 Introduction to Nanocomputing

Basic concepts of molecular biology

DNA is normally found in pairs, held together by hydrogen bonds between the bases

ESSENTIAL BIOINFORMATICS

M. Phil. (Computer Science) Programme < >

(Very) Basic Molecular Biology

ALGORITHMS IN BIO INFORMATICS. Chapman & Hall/CRC Mathematical and Computational Biology Series A PRACTICAL INTRODUCTION. CRC Press WING-KIN SUNG

What happens after DNA Replication??? Transcription, translation, gene expression/protein synthesis!!!!

Basic concepts of molecular biology

CSC 121 Computers and Scientific Thinking

Basic Bioinformatics: Homology, Sequence Alignment,

6.047 / Computational Biology: Genomes, Networks, Evolution Fall 2008

Computational Genomics. Irit Gat-Viks & Ron Shamir & Haim Wolfson Fall

BIO 101 : The genetic code and the central dogma

BIOINFORMATICS AND SYSTEM BIOLOGY (INTERNATIONAL PROGRAM)

Molecular Basis of Inheritance

PROTEINS & NUCLEIC ACIDS

Which diagram represents a DNA nucleotide? A) B) C) D)

Bioinformatics. Lecturer: Antinisca Di Marco Tutor: Francesco Gallo

SOURCES AND RESOURCES:

History of Topology. Semester I, Graham Ellis NUI Galway, Ireland

ENGR 213 Bioengineering Fundamentals April 25, A very coarse introduction to bioinformatics

DNA, RNA and protein synthesis

DNA and the Production of Proteins Course Notes. Cell Biology. Sub-Topic 1.3 DNA and the Production of Proteins

Algorithms in Bioinformatics ONE Transcription Translation

BIOLOGY 111. CHAPTER 6: DNA: The Molecule of Life

Introduction to Bioinformatics

Challenging algorithms in bioinformatics

Genome Sequence Assembly

Analysis of Biological Sequences SPH

Computational Genomics ( )

Just the Facts: A Basic Introduction to the Science Underlying NCBI Resources

Engineering Genetic Circuits

THE STRUCTURE AND FUNCTION OF DNA

Chapter 16: The Molecular Basis of Inheritance

O C. 5 th C. 3 rd C. the national health museum

bioinformatica 6EF2F181AA1830ABC10ABAC56EA5E191 Bioinformatica 1 / 5

UNIT MOLECULAR GENETICS AND BIOTECHNOLOGY

10. BIOTECHNOLOGY (Code No. 045)

Bioinformatics: Sequence Analysis. COMP 571 Luay Nakhleh, Rice University

WHAT IS BIOCHEMISTRY

LESSON TITLE Decoding DNA. Guiding Question: Why should we continue to explore? Ignite Curiosity

AGENDA for 10/11/13 AGENDA: HOMEWORK: Due end of the period OBJECTIVES:

Class XII Chapter 6 Molecular Basis of Inheritance Biology

Image adapted from: National Human Genome Research Institute

Outline. Computational Genomics and Molecular Biology. Genes Encode Proteins. DNA forms a double stranded helix. DNA replication T C A G

translation The building blocks of proteins are? amino acids nitrogen containing bases like A, G, T, C, and U Complementary base pairing links

Extending Knowledge Beyond The Curriculum: The Biology Grid Computing Project

RNA & PROTEIN SYNTHESIS

This place covers: Methods or systems for genetic or protein-related data processing in computational molecular biology.

Genes and Gene Technology

DNA. translation. base pairing rules for DNA Replication. thymine. cytosine. amino acids. The building blocks of proteins are?

Chemistry 106: Drugs in Society Lecture 17: Where Do Macromolecular Targets Come From? 5/07/18

DNA & THE GENETIC CODE DON T PANIC! THIS SECTION OF SLIDES IS AVAILABLE AT CLASS WEBSITE

Gene Identification in silico

2.3. From DNA to Proteins. DNA Structure

Lecture 1 - Introduction to Structural Bioinformatics

AGRO/ANSC/BIO/GENE/HORT 305 Fall, 2016 Overview of Genetics Lecture outline (Chpt 1, Genetics by Brooker) #1

Chapter 12. DNA TRANSCRIPTION and TRANSLATION

AGENDA for 10/10/13 AGENDA: HOMEWORK: Due end of the period OBJECTIVES: Due Fri, 10-11

Introduction to Bioinformatics

LABS 9 AND 10 DNA STRUCTURE AND REPLICATION; RNA AND PROTEIN SYNTHESIS

Genome Reassembly From Fragments. 28 March 2013 OSU CSE 1

Lecture Three: Genes and Inheritance

Lecture 2: Biology Basics Continued. Fall 2018 August 23, 2018

Transcription and Translation

Optimization of Process Parameters of Global Sequence Alignment Based Dynamic Program - an Approach to Enhance the Sensitivity.

Chapter 4 Genetics: The Science of Heredity

Basic Local Alignment Search Tool

Nucleic acids. AP Biology

Nucleic Acids: DNA and RNA

Name: Date: Period:

From Gene to Protein

NUCLEIC ACIDS. DNA (Deoxyribonucleic Acid) and RNA (Ribonucleic Acid): information storage molecules made up of nucleotides.

Opening Activity. DNA is often compared to a ladder or a spiral staircase. Look at the picture above and answer the following questions.

DNA. Grade Level: 5-6

Unit 2: Biological basis of life, heredity, and genetics

BIOINFORMATICS Introduction

Biotechnology. Biotechnology. Early Biotechnology

Biotechnology. Biotechnology. Early Biotechnology

Transcription:

What is Bioinformatics? Bioinformatics is the application of computational techniques to the discovery of knowledge from biological databases. Bioinformatics is the marriage of molecular biology with computer science. It's an exciting, relatively young and emerging science, in which computer information technology greatly helps to integrate, manage, analyze, and visualize genetic and other biological information. Sophisticated laboratory technology allow the collection of biological data faster than Biologists can interpret it. Vast volumes of DNA sequence data is growing at an exponential rate. Why should we care? The potential of bioinformatics for the good of society is enormous. Among other things, scientists apply this technology to Study biological processes in organisms Determine how these processes go wrong in diseases Discover and develop drugs to treat, cure and prevent human diseases Trace the evolutionary tree based on DNA sequences Lecture 1-1

CS Department Web-site: (http://www.cs.uni.edu/overview_bioinformatics.php) Bioinformatics is an interdisciplinary curriculum combining computer science, mathematics, and biology that seeks to explore and elucidate life processes through modern genomic techniques and tools. The [Bioinformatics BS] program seeks to prepare students to understand and manipulate vast genomic data bases through an understanding of the languages, tools, and techniques of computer science, mathematics, and biology. The program seeks to be truely interdisciplinary in nature and produce students who are conversant across all three disciplines. Consider... "... Career opportunities in bioinformatics are very, very good," said John M. Greene, senior staff scientist, bioinformatics research, at Gene Logic Inc., Gaithersburg, Maryland. "It seems that every time you turn around a company has decided to set up a bioinformatics group, or expand an existing group. Many scientists are turning their careers in this direction..." -- Science Magazine: Focus on Careers: Bioinformatics Science "...Experts have already dubbed bioinformatics - a hybrid profession pairing biology and computer science - the career choice of the decade. " -- Industry Outlook: Biotechnology "There is a crying need for experts in bioinformatics and this is not something that will just fade away," said Dr. Leena Peltonen, chairwoman of the Department of Human Genetics at UCLA. " -- Industry Outlook: Biotechnology "...Frost & Sullivan, a San Jose consulting firm, has predicted a 10 percent annual growth rate in the bioinformatics market while the National Science Foundation has estimated 20,000 jobs will be created in the field by 2005." -- Industry Outlook: Biotechnology "Opportunities abound in bioinformatics, but qualified candidates are hard to find" "The field of bioinformatics is surrounded with so much hype that thinking it's a brand-new field is forgivable. The word itself was coined in the early 1990s, but people have been using databases to manage biological sequence data -- which is a part of what bioinformatics encompasses -- since the 1960s. Although the field may not be new, it's moving in new directions and creating plenty of job opportunities." -- Chemical & Engineering News Lecture 1-2

Biological Data DNA (deoxyribonucleic acid) sequence - contains 1-dimensional storage of genetic information self-replicating genetic material of inheritance encodes information used to create proteins a very long string consisting of a four-letter alphabet (ACGT) A (adenine), T (thymine), C (cytosine) and G (guanine) RNA (ribonucleic acid) - an intermediary used in transferring a small part of DNA s information for the construction of a protein RNA consists of a long string consisting of a four-letter alphabet (ACGU) A (adenine), U (uracil), C (cytosine) and G (guanine) Proteins - perform most functions in living organisms (their 3-D structure aids in the role of chemical reactions) Protien functions: tissue building blocks, called structure proteins biological catalysts for chemical reactions in cells, called enzymes oxygen transport antibody defense Proteins are chains of amino acid residues. There are 20 different amino acids. However, a protein s 3-D structure aids in the role of chemical reactions Question: How many bases of an RNA sequence are needed to encode one amino acid of a protein? (Hint: RNA s alphabet is of size 4 and protein s alphabet is of size 20) Bioinformatics Questions to be Answered: How do we figure out which parts of the DNA control the various chemical processes of life? We know the function and structure of some proteins, but how do we determine the function of other proteins? How do we predict what a protein will look like 3-dimensionally, based on knowledge of its sequence? How do we find new meaningful DNA subsequences for new proteins in a DNA sequence and add them to the DNA-protein dictionary? Lecture 1-3

Course Description: Intermediate programming with emphasis on bioinformatics. Includes file handling, memory management, multi-threading, B-trees, introduction to dynamic programming including Wunsch-Neddleman and Smith-Waterman algorithms for optimal alignments, exploration of BLAST, FASTA and gapped alignment, substitution matrices. Comments about the course: Being a new field it needs new computational tools to manage, mine, and analyze ever growing amount of genomic data Efficient algorithms are needed to process large amounts of data Our text does a great job of teaching algorithm development using bioinformatics examples, but we ll need to supplyment it with discussions of file handling and B-trees Textbook is written for senior/graduate level so problems at the end of the chapters are very hard We ll split the text approximately in halve between: Chapter 1: Introduction Chapters: 1-6 will be covered in Computing for Bioinformatics I Chapter 2: Algorithms and Complexity - some of this is written for non-cs majors Chapter 3: Molecular Biology Primer - most of this is written of non-biology majors Chapter 4: Exhaustive Search - enumerate through all possible solutions looking for the best Chapter 5: Greedy Algorithms - make decisions based on the what appears to be the best choice at the current time and live with the consequences (probably the most nature problem solving technique, but it can also get you in trouble) Chapter 6: Dynamic Programming Algorithms - growing the answer to a problem by first solving smaller problems, remembering the answer to the smaller problems, and using the answer to the smaller problems to extend the answer to the larger problem Lecture 1-4

Chapters: 7-11 will be covered in Computing for Bioinformatics II Chapter 7: Divide-and-Conquer Algorithms - split the problem into smaller problems, solve the smaller problems, and use the answer to the smaller problems to solve the original larger problem Chapter 8: Graph Algorithms - uses many problem solving techniques Chapter 9: Combinatorial Pattern Matching - hash tables, suffix trees, heuristics Chapter 10: Clustering and Trees - uses many problem solving techniques Chapter 11: Hidden Markov Models - popular machine learning approach in bioinformatics. Machine learning algorithms are presented with training data so the algorithm can derive important insights about often hidden/unknown parameters. After training, the algorithm uses the information learned in the training phase and applied it to target data of interest to answer the question. Chapter 12: Randomized Algorithms - make random decisions throughout their operation. Useful to provide approximate answers for hard problems were no known optimal algorithm runs in polynomial-time. Lecture 1-5