BGGN 213: Foundations of Bioinformatics (Fall 2017)

Size: px
Start display at page:

Download "BGGN 213: Foundations of Bioinformatics (Fall 2017)"

Transcription

1 BGGN 213: Foundations of Bioinformatics (Fall 2017) Course Instructor: Dr. Barry J. Grant ( bjgrant@ucsd.edu ) Course Website: DRAFT: (15:02:30 PDT on Thu, Aug 10) Overview: Bioinformatics is driving the collection and analysis of big data in the biosciences. This course is designed for bioscience graduate students and provides a hands-on introduction to the computer-based analysis of genomic and biomolecular data. Major topics include: Genomic and biomolecular bioinformatic resources, Advances in sequencing technologies; Genome informatics, Structural informatics, and Transcriptomics. Computational tools, techniques and best practices that foster reproducible bioinformatics research will also be introduced. A guest lecture from a genomic scientist at Illumina Inc., Synthetic Genomics Inc., Human Longevity Inc., or the La Jolla Institute for Allergy and Immunology will feature subject to student voting preferences. A comprehensive website containing all reading materials, screencasts and course notes will be maintained throughout the term. Students completing this course will be able to evaluate new genomic and biomolecular information using existing software and gain experience in combining bioinformatic approaches to answer specific biological questions. Audience: Bioscience graduate students and others familiar with basic molecular biology concepts. No formal programming training or high level mathematical skills are required. Requirements: Participants must bring a laptop with specific software installed. Schedule: Lectures are on Tuesday and Thursday at 9:00-12:00 pm in Warren Lecture Hall 2015 (WLH 2015, UCSD Map Bldg #625). These lectures will include hands-on sessions requiring both individual and small group based computational work. A detailed lecture schedule with topic outlines is provided below. Class announcements: All announcements regarding the course will be by to your UCSD address. Office hours & location: TBD For now me for a time and we will make it happen. Textbook: There is no textbook for the course. Lecture notes, homework assignments, grading criteria, pre-class screen casts and required reading material will be available from our public facing course website. Page 1 of 6

2 Lecture Schedule: Fall 2017 BGGN 213: Foundations of Bioinformatics Lectures (TuTh) 9-12 pm Th, 09/28 Tu, 10/03 Th, 10/05 Tu, 10/10 Th, 10/12 Tu, 10/17 Th, 10/19 Tu, 10/24 Th, 10/26 Tu, 10/31 Th, 11/02 Welcome to Foundations of Bioinformatics (Course introduction, Leaning goals & expectations, Biology is an information science, History of Bioinformatics, Types of data, Application areas and introduction to upcoming course segments, Student computer setup) Bioinformatics databases and key online resources (NCBI & EBI resources for the molecular domain of bioinformatics, Focus on GenBank, UniProt, Entrez and Gene Ontology. Hands on with BLAST, GenBank, OMIM, GENE, UniProt, Muscle, PFAM and PDB bioinformatics tools and databases) Sequence alignment fundamentals, algorithms and applications (Homology, Sequence similarity, Local and global alignment, classic Needleman- Wunsch, Smith-Waterman and BLAST heuristic approaches) Advanced database searching (Database searching beyond BLAST, PSI-BLAST, Profiles and HMMs, Protein structure comparisons) Introduction to UNIX for bioinformatics (Why do we use UNIX for bioinformatics? UNIX philosophy, 21 Key commands, Understanding processes, File system structure, Connecting to remote servers) Working with Unix (Bioinformatics on the command line, Redirection, streams and pipes, Workflows for batch processing, Shell scripting, Organizing computational projects) Bioinformatics data analysis with R (R language basics and the RStudio IDE, Major R data structures and functions, Using R scripts from the command line) Data exploration and visualization in R (Import data in various formats (both local and from online sources), The exploratory data analysis mindset, Data visualization best practices, Simple base graphics (scatterplots, histograms, bar graphs and boxplots), Building more complex charts with ggplot) Working with R packages for bioinformatics (Extending functionality and utility with R packages, Obtaining R packages from CRAN and bioconductor, Working with Bio3D for molecular data, Managing genome-scale data with bioconductor) Structural Bioinformatics (Protein structure function relationships, Protein structure and visualization resources, Modeling energy as a function of structure, Homology modeling, Predicting functional dynamics, Inferring protein function from structure) Bioinformatics in drug discovery and design (Target identification, Lead identification, Small molecule docking methods, Protein motion and conformational variants, Molecular simulation and drug optimization) Page 2 of 6

3 Tu, 11/07 Th, 11/09 Tu, 11/14 Th, 11/16 Tu, 11/21 Th, 11/23 Tu, 11/28 Th, 11/30 Tu, 12/05 Th, 12/07 Mid Term: Find a gene project assignment (Principles of database searching and sequence analysis) Genome informatics and high throughput sequencing (Searching genes and gene functions, Genome databases, Variation in the genome, Sequencing technologies past, present and future (Sanger, Shotgun, PacBio, Illumina, toward the $500 human genome), Biological applications of sequencing, Bioinformatics analysis methods) Major bioinformatics resources for genomics. (Databases, tools and visualization resources from NCBI, EBI & UCSC, The Galaxy platform for quality control and analysis; FASTQ, SAM and BAM file formats; Sample workflows with FASTQC and bowtie2) Transcriptomics and the analysis of RNA-Seq data (RNA-Seq aligners, Differential expression tests, RNA-Seq statistics, Counts and FPKMs and avoiding P-value misuse, Hands-on analysis of RNA-Seq data with R) Genome annotation and the interpretation of gene lists (Gene finding and functional annotation, Functional databases KEGG, InterPro, GO ontologies and functional enrichment) Happy Thanksgiving! (No class) N.B. Find a gene assignment due on Monday 11/27! Systems and network modeling (Analysis of protein-protein interactions, Pathways and networks, Computational methods of network modeling, Hands on with Cytoscape) Continuing genomic advances and bioinformatics challenges (From genome to phenotypes, Integration of heterogenous high throughput genome-wide data sets into their functional context, Data mining and hypothesis generation in the era of big data, deep learning and artificial intelligence) Guest lecture (Student selected guest presentation with possible topics including: Metagenomics / Pharmacogenomics / Epigenomics / Personal genomics / Genome evolution / Genome editing and synthetic genomics / Social impacts and ethical implications of continuing genomic advances) Course summary (Summary of learning goals, Student course evaluation time and exam preparation) Th, 12/12 Date-TBD Final exam! Page 3 of 6

4 Course Objectives: At the end of this course students will: Understand the increasing necessity for computation in modern life sciences research. Be able to use and evaluate online bioinformatics resources including major biomolecular and genomic databases, search and analysis tools, genome browsers, structure viewers, and select quality control and analysis tools to solve problems in the biological sciences. Be able to use the UNIX command line and the R environment to analyze bioinformatics data at scale. Understand the process by which genomes are currently sequenced and the bioinformatics processing and analysis required for their interpretation. Be familiar with the research objectives of the bioinformatics related sub-disciplines of Genome informatics, Transcriptomics and Structural informatics. In short, students will develop a solid foundational knowledge of bioinformatics and be able to evaluate new biomolecular and genomic information using existing bioinformatic tools and resources. Specific Learning Goals Teaching toward the specific learning goals below is expected to occupy 60%-70% of class time. The remaining course content is at the discretion of the instructor with student body input. This includes student selected topics for peer presentation as well one student selected guest lecture from an industry based genomic scientist. All students who receive a passing grade should be able to: Lecture(s): 1 Appreciate and describe in general terms the role of computation in hypothesis-driven discovery processes within the life sciences. 2 Be able to query, search, compare and contrast the data contained in major bioinformatics databases and describe how these databases intersect (GenBank, GENE, UniProt, PFAM, OMIM, PDB, UCSC, ENSEMBLE). 3 Describe how nucleotide and protein sequence and structure data are represented (FASTA, FASTQ, GenBank, UniProt, PDB). 4 Be able to describe how dynamic programming works for pairwise sequence alignment and appreciate the differences between global and local alignment along with their major application areas. 1, 2, 20 2, 12, 13 3, 10 4, 5 Page 4 of 6

5 5 Calculate the alignment score between two nucleotide or protein sequences using a provided scoring matrix and be able to perform BLAST, PSI-BLAST, HMMER and protein structure based database searches and interpret the results in terms of the biological significance of an e-value. 6 Use UNIX command-line tools for file system navigation and text file manipulation. 7 Use existing programs at the UNIX command line to analyze bioinformatics data. 8 Use R to read and parse comma-separted (.csv) formatted files ready for subsequent analysis. 9 Perform elementary statistical analysis on boimolecular and omics datasets with R and produce informative graphical displays and data summaries. 5, 10 6, 7, 10, 11, 24, 15 7, 10, 11, 13, 14, 15, 16 8, 9, 10, 11, 13, 15, 16 9, 10, 11, 13, 15, View and interpret the structural models in the PDB. 10, Explain the outputs from structure prediction algorithms and small molecule docking approaches. 12 Appreciate and describe in general terms the rapid advances in sequencing technologies and the new areas of investigation that these advances have made accessible. 13 Understand the process by which genomes are currently sequenced and the bioinformatics processing and analysis required for their interpretation. 14 For a genomic region of interest (e.g. the neighborhood of a particular gene), use a genome browser to view nearby genes, transcription factor binding regions, epigenetic information, etc. 15 Given an RNA-Seq data file, find the set of significantly differentially expressed genes and use online tools to interpret gene lists and annotate potential gene functions. 16 Perforn a GO analysis to identify the pathways relevant to a set of genes (e.g. identified by transcriptomic study or a proteomic experiment) , 14, , Use the KEGG pathway database to look up interaction pathways Use graph theory to represent biological data networks. 17, 18 Page 5 of 6

6 19 Understand the challenges in integrating and interpreting large heterogenous high throughput data sets into their functional context. 20 Have an appreciation for the social impacts and ethical implications of how genomic sequence information is used in our society Homework assignments, mid-term project and final exam: Weekly homework will consist of online knowledge assessment quizzes and application assignments together with pre-class reading and video screen-casts. Specific grading criteria (assessment rubrics) for each homework will be given at the time of assignment. Weekly grades will be posted online. Each student is responsible for checking to ensure that a grade has been entered for their submissions. Documents submitted by do not always arrive at their intended destination and late submissions will not be accepted after one week past the original due date. Collectively homework performance will account for 35% of the course grade. A total of 20% of the course grade will be assigned based on the mid-term find-a-gene project assignment. The purpose of this mid-term assignment is for you to grasp the principles of database searching, sequence analysis and functional annotation that we cover in the course (see additional details online). Further details will be given in class. There will be one final exam that accounts for 45% of the final grade for the course. Ethics Code. You are encouraged to collaborate with your fellow students. However, all material submitted to the instructor must be your own work. Academic Integrity is expected of everyone at UC San Diego. This means that you must be honest, fair, responsible, respectful, and trustworthy in all of your actions. Lying, cheating or any other forms of dishonesty will not be tolerated because they undermine learning and the University s ability to certify students knowledge and abilities. Thus, any attempt to get, or help another get, a grade by cheating, lying or dishonesty will be reported to the Academic Integrity Office and will result sanctions. Sanctions can include an F in this class and suspension or dismissal from the University. So, think carefully before you act. Before you act, ask yourself the following questions: a) is my action honest, fair, respectful, responsible & trustworthy and, b) is my action authorized by the instructor? If you are unsure, don t ask a friend ask your instructor, instructional assistant, or the Academic Integrity Office. You can learn more about academic integrity at academicintegrity.ucsd.edu (Source: UCSD Academic Integrity Office, 2017) Page 6 of 6

BIMM 143: Introduction to Bioinformatics (Winter 2018)

BIMM 143: Introduction to Bioinformatics (Winter 2018) BIMM 143: Introduction to Bioinformatics (Winter 2018) Course Instructor: Dr. Barry J. Grant ( bjgrant@ucsd.edu ) Course Website: https://bioboot.github.io/bimm143_w18/ DRAFT: 2017-12-02 (20:48:10 PST

More information

Advanced Bioinformatics Biostatistics & Medical Informatics 776 Computer Sciences 776 Spring 2018

Advanced Bioinformatics Biostatistics & Medical Informatics 776 Computer Sciences 776 Spring 2018 Advanced Bioinformatics Biostatistics & Medical Informatics 776 Computer Sciences 776 Spring 2018 Anthony Gitter gitter@biostat.wisc.edu www.biostat.wisc.edu/bmi776/ These slides, excluding third-party

More information

Grundlagen der Bioinformatik Summer Lecturer: Prof. Daniel Huson

Grundlagen der Bioinformatik Summer Lecturer: Prof. Daniel Huson Grundlagen der Bioinformatik, SoSe 11, D. Huson, April 11, 2011 1 1 Introduction Grundlagen der Bioinformatik Summer 2011 Lecturer: Prof. Daniel Huson Office hours: Thursdays 17-18h (Sand 14, C310a) 1.1

More information

Genetics and Bioinformatics

Genetics and Bioinformatics Genetics and Bioinformatics Kristel Van Steen, PhD 2 Montefiore Institute - Systems and Modeling GIGA - Bioinformatics ULg kristel.vansteen@ulg.ac.be Lecture 1: Setting the pace 1 Bioinformatics what s

More information

ST 591: Introduction to Quantitative Genomics Syllabus

ST 591: Introduction to Quantitative Genomics Syllabus General Information Instructor: Thomas Sharpton Email: thomas.sharpton@oregonstate.edu Office: 530 Nash Hall Phone: (541) 737-8623 Office Hours: TBD Teaching Assistand: TBD Course credits: 3 Class meetings:

More information

Following text taken from Suresh Kumar. Bioinformatics Web - Comprehensive educational resource on Bioinformatics. 6th May.2005

Following text taken from Suresh Kumar. Bioinformatics Web - Comprehensive educational resource on Bioinformatics. 6th May.2005 Bioinformatics is the recording, annotation, storage, analysis, and searching/retrieval of nucleic acid sequence (genes and RNAs), protein sequence and structural information. This includes databases of

More information

NCBI web resources I: databases and Entrez

NCBI web resources I: databases and Entrez NCBI web resources I: databases and Entrez Yanbin Yin Most materials are downloaded from ftp://ftp.ncbi.nih.gov/pub/education/ 1 Homework assignment 1 Two parts: Extract the gene IDs reported in table

More information

ZOO 4926 Special Topics: Genomics and Biotechnology

ZOO 4926 Special Topics: Genomics and Biotechnology ZOO 4926 Special Topics: Genomics and Biotechnology Description Big data and genomics are prominent in the medical and agricultural life-sciences. Students will be introduced to modern next-generation

More information

Bioinformatics for Cell Biologists

Bioinformatics for Cell Biologists Bioinformatics for Cell Biologists 15 19 March 2010 Developmental Biology and Regnerative Medicine (DBRM) Schedule Monday, March 15 09.00 11.00 Introduction to course and Bioinformatics (L1) D224 Helena

More information

Sequence Based Function Annotation. Qi Sun Bioinformatics Facility Biotechnology Resource Center Cornell University

Sequence Based Function Annotation. Qi Sun Bioinformatics Facility Biotechnology Resource Center Cornell University Sequence Based Function Annotation Qi Sun Bioinformatics Facility Biotechnology Resource Center Cornell University Usage scenarios for sequence based function annotation Function prediction of newly cloned

More information

Analysis of Biological Sequences SPH

Analysis of Biological Sequences SPH Analysis of Biological Sequences SPH 140.638 swheelan@jhmi.edu nuts and bolts meet Tuesdays & Thursdays, 3:30-4:50 no exam; grade derived from 3-4 homework assignments plus a final project (open book,

More information

PBIO4_5280_Laboratory in Genomics Techniques (Spring 2017)

PBIO4_5280_Laboratory in Genomics Techniques (Spring 2017) PBIO4_5280_Laboratory in Genomics Techniques (Spring 2017) Instructor: Dr. Zhihua Hua Office: 500 Porter Hall Email: hua@ohio.edu Lecture 100 (Class Number 12469_12471) Labs 101 (Class # 12470_12472) Lab:

More information

Applied Bioinformatics

Applied Bioinformatics Applied Bioinformatics Bing Zhang Department of Biomedical Informatics Vanderbilt University bing.zhang@vanderbilt.edu Course overview What is bioinformatics Data driven science: the creation and advancement

More information

Sequence Based Function Annotation

Sequence Based Function Annotation Sequence Based Function Annotation Qi Sun Bioinformatics Facility Biotechnology Resource Center Cornell University Sequence Based Function Annotation 1. Given a sequence, how to predict its biological

More information

What is Bioinformatics?

What is Bioinformatics? What is Bioinformatics? Bioinformatics is the field of science in which biology, computer science, and information technology merge to form a single discipline. - NCBI The ultimate goal of the field is

More information

Introduction to Bioinformatics

Introduction to Bioinformatics Introduction to Bioinformatics Dortmund, 16.-20.07.2007 Lectures: Sven Rahmann Exercises: Udo Feldkamp, Michael Wurst 1 Goals of this course Learn about Software tools Databases Methods (Algorithms) in

More information

BIOINFORMATICS AND SYSTEM BIOLOGY (INTERNATIONAL PROGRAM)

BIOINFORMATICS AND SYSTEM BIOLOGY (INTERNATIONAL PROGRAM) BIOINFORMATICS AND SYSTEM BIOLOGY (INTERNATIONAL PROGRAM) PROGRAM TITLE DEGREE TITLE Master of Science Program in Bioinformatics and System Biology (International Program) Master of Science (Bioinformatics

More information

This practical aims to walk you through the process of text searching DNA and protein databases for sequence entries.

This practical aims to walk you through the process of text searching DNA and protein databases for sequence entries. PRACTICAL 1: BLAST and Sequence Alignment The EBI and NCBI websites, two of the most widely used life science web portals are introduced along with some of the principal databases: the NCBI Protein database,

More information

ELE4120 Bioinformatics. Tutorial 5

ELE4120 Bioinformatics. Tutorial 5 ELE4120 Bioinformatics Tutorial 5 1 1. Database Content GenBank RefSeq TPA UniProt 2. Database Searches 2 Databases A common situation for alignment is to search through a database to retrieve the similar

More information

BIORISE D6.1 SYLLABUS OF THE CSMM BIOINFORMATICS COURSE

BIORISE D6.1 SYLLABUS OF THE CSMM BIOINFORMATICS COURSE Ref. Ares(2016)1538926-31/03/2016 BIORISE D6.1 SYLLABUS OF THE CSMM BIOINFORMATICS COURSE The attached syllabus has been prepared by the CING Bioinformatics ERA Chair, Dr George Spyrou. The proposed syllabus

More information

Challenging algorithms in bioinformatics

Challenging algorithms in bioinformatics Challenging algorithms in bioinformatics 11 October 2018 Torbjørn Rognes Department of Informatics, UiO torognes@ifi.uio.no What is bioinformatics? Definition: Bioinformatics is the development and use

More information

BIOL 274 Introduction to Bioinformatics Fall 2016

BIOL 274 Introduction to Bioinformatics Fall 2016 Instructor: Eric S. Ho (hoe@lafayette.edu) Office: Kunkel 13 Office hours: TTh 2-4 pm Lecture: MWF 11:00-11:50 am, Venue: Kunkel 117 Lab: M/W 1:10-4:00 pm, Venue: Kunkel 313B TAs: Amy Boles (bolesa@lafayette.edu),

More information

Functional analysis using EBI Metagenomics

Functional analysis using EBI Metagenomics Functional analysis using EBI Metagenomics Contents Tutorial information... 2 Tutorial learning objectives... 2 An introduction to functional analysis using EMG... 3 What are protein signatures?... 3 Assigning

More information

BSC 4445C Genomics Lab: Methods in Data Collection and Analysis

BSC 4445C Genomics Lab: Methods in Data Collection and Analysis 1 BSC 4445C: Special Topics Genomics Lab: Fall 2017 (Forsman) 4 credits Course Description The field of genomics focuses on understanding the collective function of all components encoded in an organism

More information

Introduction to Bioinformatics CPSC 265. What is bioinformatics? Textbooks

Introduction to Bioinformatics CPSC 265. What is bioinformatics? Textbooks Introduction to Bioinformatics CPSC 265 Thanks to Jonathan Pevsner, Ph.D. Textbooks Johnathan Pevsner, who I stole most of these slides from (thanks!) has written a textbook, Bioinformatics and Functional

More information

High peformance computing infrastructure for bioinformatics

High peformance computing infrastructure for bioinformatics High peformance computing infrastructure for bioinformatics Scott Hazelhurst University of the Witwatersrand December 2009 What we need Skills, time What we need Skills, time Fast network Lots of storage

More information

Bioinformatics Practical Course. 80 Practical Hours

Bioinformatics Practical Course. 80 Practical Hours Bioinformatics Practical Course 80 Practical Hours Course Description: This course presents major ideas and techniques for auxiliary bioinformatics and the advanced applications. Points included incorporate

More information

Introduction to NGS analyses

Introduction to NGS analyses Introduction to NGS analyses Giorgio L Papadopoulos Institute of Molecular Biology and Biotechnology Bioinformatics Support Group 04/12/2015 Papadopoulos GL (IMBB, FORTH) IMBB NGS Seminar 04/12/2015 1

More information

From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow

From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow Technical Overview Import VCF Introduction Next-generation sequencing (NGS) studies have created unanticipated challenges with

More information

NGS part 2: applications. Tobias Österlund

NGS part 2: applications. Tobias Österlund NGS part 2: applications Tobias Österlund tobiaso@chalmers.se NGS part of the course Week 4 Friday 13/2 15.15-17.00 NGS lecture 1: Introduction to NGS, alignment, assembly Week 6 Thursday 26/2 08.00-09.45

More information

Leonardo Mariño-Ramírez, PhD NCBI / NLM / NIH. BIOL 7210 A Computational Genomics 2/18/2015

Leonardo Mariño-Ramírez, PhD NCBI / NLM / NIH. BIOL 7210 A Computational Genomics 2/18/2015 Leonardo Mariño-Ramírez, PhD NCBI / NLM / NIH BIOL 7210 A Computational Genomics 2/18/2015 The $1,000 genome is here! http://www.illumina.com/systems/hiseq-x-sequencing-system.ilmn Bioinformatics bottleneck

More information

Just the Facts: A Basic Introduction to the Science Underlying NCBI Resources

Just the Facts: A Basic Introduction to the Science Underlying NCBI Resources National Center for Biotechnology Information About NCBI NCBI at a Glance A Science Primer Human Genome Resources Model Organisms Guide Outreach and Education Databases and Tools News About NCBI Site Map

More information

GNET 744 Biological Sequence Analysis, Protein Structure and Genome-Wide Data

GNET 744 Biological Sequence Analysis, Protein Structure and Genome-Wide Data GNET 744 Biological Sequence Analysis, Protein Structure and Genome-Wide Data This course provides an introduction to basic protein structure/function analyses, sequencing informatics and macromolecular

More information

Annotation and the analysis of annotation terms. Brian J. Knaus USDA Forest Service Pacific Northwest Research Station

Annotation and the analysis of annotation terms. Brian J. Knaus USDA Forest Service Pacific Northwest Research Station Annotation and the analysis of annotation terms. Brian J. Knaus USDA Forest Service Pacific Northwest Research Station 1 Library preparation Sequencing Hypothesis testing Bioinformatics 2 Why annotate?

More information

Genome 373: Genomic Informatics. Elhanan Borenstein

Genome 373: Genomic Informatics. Elhanan Borenstein Genome 373: Genomic Informatics Elhanan Borenstein Genome 373 This course is intended to introduce students to the breadth of problems and methods in computational analysis of genomes and biological systems,

More information

G4120: Introduction to Computational Biology

G4120: Introduction to Computational Biology G4120: Introduction to Computational Biology Oliver Jovanovic, Ph.D. Columbia University Department of Microbiology Lecture 3 February 13, 2003 Copyright 2003 Oliver Jovanovic, All Rights Reserved. Bioinformatics

More information

GREG GIBSON SPENCER V. MUSE

GREG GIBSON SPENCER V. MUSE A Primer of Genome Science ience THIRD EDITION TAGCACCTAGAATCATGGAGAGATAATTCGGTGAGAATTAAATGGAGAGTTGCATAGAGAACTGCGAACTG GREG GIBSON SPENCER V. MUSE North Carolina State University Sinauer Associates, Inc.

More information

Analysis of Microarray Data

Analysis of Microarray Data Analysis of Microarray Data Lecture 3: Visualization and Functional Analysis George Bell, Ph.D. Senior Bioinformatics Scientist Bioinformatics and Research Computing Whitehead Institute Outline Review

More information

Analysis of Microarray Data

Analysis of Microarray Data Analysis of Microarray Data Lecture 3: Visualization and Functional Analysis George Bell, Ph.D. Bioinformatics Scientist Bioinformatics and Research Computing Whitehead Institute Outline Review Visualizing

More information

Bioinformatics for Proteomics. Ann Loraine

Bioinformatics for Proteomics. Ann Loraine Bioinformatics for Proteomics Ann Loraine aloraine@uab.edu What is bioinformatics? The science of collecting, processing, organizing, storing, analyzing, and mining biological information, especially data

More information

CSE/Beng/BIMM 182: Biological Data Analysis. Instructor: Vineet Bafna TA: Nitin Udpa

CSE/Beng/BIMM 182: Biological Data Analysis. Instructor: Vineet Bafna TA: Nitin Udpa CSE/Beng/BIMM 182: Biological Data Analysis Instructor: Vineet Bafna TA: Nitin Udpa Today We will explore the syllabus through a series of questions? Please ASK All logistical information will be given

More information

Product Applications for the Sequence Analysis Collection

Product Applications for the Sequence Analysis Collection Product Applications for the Sequence Analysis Collection Pipeline Pilot Contents Introduction... 1 Pipeline Pilot and Bioinformatics... 2 Sequence Searching with Profile HMM...2 Integrating Data in a

More information

A WEB-BASED TOOL FOR GENOMIC FUNCTIONAL ANNOTATION, STATISTICAL ANALYSIS AND DATA MINING

A WEB-BASED TOOL FOR GENOMIC FUNCTIONAL ANNOTATION, STATISTICAL ANALYSIS AND DATA MINING A WEB-BASED TOOL FOR GENOMIC FUNCTIONAL ANNOTATION, STATISTICAL ANALYSIS AND DATA MINING D. Martucci a, F. Pinciroli a,b, M. Masseroli a a Dipartimento di Bioingegneria, Politecnico di Milano, Milano,

More information

Applications of short-read

Applications of short-read Applications of short-read sequencing: RNA-Seq and ChIP-Seq BaRC Hot Topics March 2013 George Bell, Ph.D. http://jura.wi.mit.edu/bio/education/hot_topics/ Sequencing applications RNA-Seq includes experiments

More information

CHEM 436 / 630. Molecular modelling of proteins. Winter 2018 Term. Instructor: Guillaume Lamoureux Concordia University, Montréal, Canada

CHEM 436 / 630. Molecular modelling of proteins. Winter 2018 Term. Instructor: Guillaume Lamoureux Concordia University, Montréal, Canada CHEM 436 / 630 Molecular modelling of proteins Winter 2018 Term Instructor: Guillaume Lamoureux Concordia University, Montréal, Canada Syllabus: http://faculty.concordia.ca/glamoure/pdfs/chem436_630_syllabus_2018.pdf

More information

Introduction to Bioinformatics

Introduction to Bioinformatics Introduction to Bioinformatics If the 19 th century was the century of chemistry and 20 th century was the century of physic, the 21 st century promises to be the century of biology...professor Dr. Satoru

More information

Practical Bioinformatics for Biologists (BIOS 441/641)

Practical Bioinformatics for Biologists (BIOS 441/641) Practical Bioinformatics for Biologists (BIOS 441/641) - Course overview Yanbin Yin MO444 1 Room and computer access Room entry code: 2159 Computer access: user poduser 2 Compared to BIOS 443/643 and 646

More information

Bioinformatics 2. Lecture 1

Bioinformatics 2. Lecture 1 Bioinformatics 2 Introduction Lecture 1 Course Overview & Assessment Introduction to Bioinformatics Research Careers and PhD options Core topics in Bioinformatics the central dogma of molecular biology

More information

Lecture 1. Bioinformatics 2. About me... The class (2009) Course Outcomes. What do I think you know?

Lecture 1. Bioinformatics 2. About me... The class (2009) Course Outcomes. What do I think you know? Lecture 1 Bioinformatics 2 Introduction Course Overview & Assessment Introduction to Bioinformatics Research Careers and PhD options Core topics in Bioinformatics the central dogma of molecular biology

More information

Our website:

Our website: Biomedical Informatics Summer Internship Program (BMI SIP) The Department of Biomedical Informatics hosts an annual internship program each summer which provides high school, undergraduate, and graduate

More information

Genomics: Genome Browsing & Annota3on

Genomics: Genome Browsing & Annota3on Genomics: Genome Browsing & Annota3on Lecture 4 of 4 Introduc/on to BioMart Dr Colleen J. Saunders, PhD South African National Bioinformatics Institute/MRC Unit for Bioinformatics Capacity Development,

More information

Introduction to Genome Science - BISC 434 Syllabus Spring Semester

Introduction to Genome Science - BISC 434 Syllabus Spring Semester Introduction to Genome Science - BISC 434 Syllabus - 2018 Spring Semester Basic Information Course: Introduction to Genome Science, BISC 434, 4 credits Textbook: Arthur M. Lesk. Introduction to Genomics

More information

Sequencing applications. Today's outline. Hands-on exercises. Applications of short-read sequencing: RNA-Seq and ChIP-Seq

Sequencing applications. Today's outline. Hands-on exercises. Applications of short-read sequencing: RNA-Seq and ChIP-Seq Sequencing applications Applications of short-read sequencing: RNA-Seq and ChIP-Seq BaRC Hot Topics March 2013 George Bell, Ph.D. http://jura.wi.mit.edu/bio/education/hot_topics/ RNA-Seq includes experiments

More information

Knowledge-Guided Analysis with KnowEnG Lab

Knowledge-Guided Analysis with KnowEnG Lab Han Sinha Song Weinshilboum Knowledge-Guided Analysis with KnowEnG Lab KnowEnG Center Powerpoint by Charles Blatti Knowledge-Guided Analysis KnowEnG Center 2017 1 Exercise In this exercise we will be doing

More information

Sanger vs Next-Gen Sequencing

Sanger vs Next-Gen Sequencing Tools and Algorithms in Bioinformatics GCBA815/MCGB815/BMI815, Fall 2017 Week-8: Next-Gen Sequencing RNA-seq Data Analysis Babu Guda, Ph.D. Professor, Genetics, Cell Biology & Anatomy Director, Bioinformatics

More information

RESEARCH METHODOLOGY, BIOSTATISTICS AND IPR

RESEARCH METHODOLOGY, BIOSTATISTICS AND IPR MB 401: RESEARCH METHODOLOGY, BIOSTATISTICS AND IPR Objectives: The overall aim of the course is to deepen knowledge regarding basic concepts of Biostatistics, the research process in occupational therapy

More information

Genomics and Bioinformatics GMS6231 (3 credits)

Genomics and Bioinformatics GMS6231 (3 credits) Genomics and Bioinformatics GMS6231 (3 credits) COURSE DESCRIPTION: Principles of genomic characterization and bioinformatic analysis of eukaryotes, including an overview of analytical platforms, computational

More information

EECS 730 Introduction to Bioinformatics Sequence Alignment. Luke Huan Electrical Engineering and Computer Science

EECS 730 Introduction to Bioinformatics Sequence Alignment. Luke Huan Electrical Engineering and Computer Science EECS 730 Introduction to Bioinformatics Sequence Alignment Luke Huan Electrical Engineering and Computer Science http://people.eecs.ku.edu/~jhuan/ Database What is database An organized set of data Can

More information

Protein Sequence Analysis. BME 110: CompBio Tools Todd Lowe April 19, 2007 (Slide Presentation: Carol Rohl)

Protein Sequence Analysis. BME 110: CompBio Tools Todd Lowe April 19, 2007 (Slide Presentation: Carol Rohl) Protein Sequence Analysis BME 110: CompBio Tools Todd Lowe April 19, 2007 (Slide Presentation: Carol Rohl) Linear Sequence Analysis What can you learn from a (single) protein sequence? Calculate it s physical

More information

What is Bioinformatics? Bioinformatics is the application of computational techniques to the discovery of knowledge from biological databases.

What is Bioinformatics? Bioinformatics is the application of computational techniques to the discovery of knowledge from biological databases. What is Bioinformatics? Bioinformatics is the application of computational techniques to the discovery of knowledge from biological databases. Bioinformatics is the marriage of molecular biology with computer

More information

Data Mining for Biological Data Analysis

Data Mining for Biological Data Analysis Data Mining for Biological Data Analysis Data Mining and Text Mining (UIC 583 @ Politecnico di Milano) References Data Mining Course by Gregory-Platesky Shapiro available at www.kdnuggets.com Jiawei Han

More information

Gene-centered resources at NCBI

Gene-centered resources at NCBI COURSE OF BIOINFORMATICS a.a. 2014-2015 Gene-centered resources at NCBI We searched Accession Number: M60495 AT NCBI Nucleotide Gene has been implemented at NCBI to organize information about genes, serving

More information

ONLINE BIOINFORMATICS RESOURCES

ONLINE BIOINFORMATICS RESOURCES Dedan Githae Email: d.githae@cgiar.org BecA-ILRI Hub; Nairobi, Kenya 16 May, 2014 ONLINE BIOINFORMATICS RESOURCES Introduction to Molecular Biology and Bioinformatics (IMBB) 2014 The larger picture.. Lower

More information

Experimental Design. Sequencing. Data Quality Control. Read mapping. Differential Expression analysis

Experimental Design. Sequencing. Data Quality Control. Read mapping. Differential Expression analysis -Seq Analysis Quality Control checks Reproducibility Reliability -seq vs Microarray Higher sensitivity and dynamic range Lower technical variation Available for all species Novel transcript identification

More information

Bioinformatics for Cell Biologists

Bioinformatics for Cell Biologists Bioinformatics for Cell Biologists Rickard Sandberg Karolinska Institutet 13-17 May 2013 OUTLINE INTRODUCTION Introduce yourselves HISTORY MODERN What is bioinformatics today? COURSE ONLINE LEARNING OPPORTUNITIES

More information

CMSE 520 BIOMOLECULAR STRUCTURE, FUNCTION AND DYNAMICS

CMSE 520 BIOMOLECULAR STRUCTURE, FUNCTION AND DYNAMICS CMSE 520 BIOMOLECULAR STRUCTURE, FUNCTION AND DYNAMICS (Computational Structural Biology) OUTLINE Review: Molecular biology Proteins: structure, conformation and function(5 lectures) Generalized coordinates,

More information

Introduction to Bioinformatics. What are the goals of the course? Who is taking this course? Different user needs, different approaches

Introduction to Bioinformatics. What are the goals of the course? Who is taking this course? Different user needs, different approaches Introduction to Bioinformatics Who is taking this course? Monday, November 19, 2012 Jonathan Pevsner pevsner@kennedykrieger.org Bioinformatics M.E:800.707 People with very diverse backgrounds in biology

More information

Textbook Reading Guidelines

Textbook Reading Guidelines Understanding Bioinformatics by Marketa Zvelebil and Jeremy Baum Last updated: January 16, 2013 Textbook Reading Guidelines Preface: Read the whole preface, and especially: For the students with Life Science

More information

Introduction to Bioinformatics and Gene Expression Technologies

Introduction to Bioinformatics and Gene Expression Technologies Introduction to Bioinformatics and Gene Expression Technologies Utah State University Fall 2017 Statistical Bioinformatics (Biomedical Big Data) Notes 1 1 Vocabulary Gene: hereditary DNA sequence at a

More information

Introduction to Bioinformatics and Gene Expression Technologies

Introduction to Bioinformatics and Gene Expression Technologies Vocabulary Introduction to Bioinformatics and Gene Expression Technologies Utah State University Fall 2017 Statistical Bioinformatics (Biomedical Big Data) Notes 1 Gene: Genetics: Genome: Genomics: hereditary

More information

Textbook Reading Guidelines

Textbook Reading Guidelines Understanding Bioinformatics by Marketa Zvelebil and Jeremy Baum Last updated: May 1, 2009 Textbook Reading Guidelines Preface: Read the whole preface, and especially: For the students with Life Science

More information

Introduction to DNA-Sequencing

Introduction to DNA-Sequencing informatics.sydney.edu.au sih.info@sydney.edu.au The Sydney Informatics Hub provides support, training, and advice on research data, analyses and computing. Talk to us about your computing infrastructure,

More information

Course Presentation. Ignacio Medina Presentation

Course Presentation. Ignacio Medina Presentation Course Index Introduction Agenda Analysis pipeline Some considerations Introduction Who we are Teachers: Marta Bleda: Computational Biologist and Data Analyst at Department of Medicine, Addenbrooke's Hospital

More information

Chapter 2: Access to Information

Chapter 2: Access to Information Chapter 2: Access to Information Outline Introduction to biological databases Centralized databases store DNA sequences Contents of DNA, RNA, and protein databases Central bioinformatics resources: NCBI

More information

Introduction to EMBL-EBI.

Introduction to EMBL-EBI. Introduction to EMBL-EBI www.ebi.ac.uk What is EMBL-EBI? Part of EMBL Austria, Belgium, Croatia, Denmark, Finland, France, Germany, Greece, Iceland, Ireland, Israel, Italy, Luxembourg, the Netherlands,

More information

Protein Bioinformatics Part I: Access to information

Protein Bioinformatics Part I: Access to information Protein Bioinformatics Part I: Access to information 260.655 April 6, 2006 Jonathan Pevsner, Ph.D. pevsner@kennedykrieger.org Outline [1] Proteins at NCBI RefSeq accession numbers Cn3D to visualize structures

More information

Introduction to BIOINFORMATICS

Introduction to BIOINFORMATICS Introduction to BIOINFORMATICS Antonella Lisa CABGen Centro di Analisi Bioinformatica per la Genomica Tel. 0382-546361 E-mail: lisa@igm.cnr.it http://www.igm.cnr.it/pagine-personali/lisa-antonella/ What

More information

Transcriptome Assembly, Functional Annotation (and a few other related thoughts)

Transcriptome Assembly, Functional Annotation (and a few other related thoughts) Transcriptome Assembly, Functional Annotation (and a few other related thoughts) Monica Britton, Ph.D. Sr. Bioinformatics Analyst June 23, 2017 Differential Gene Expression Generalized Workflow File Types

More information

CHEM/ENCH 323: Biological Chemistry Winter term 2018

CHEM/ENCH 323: Biological Chemistry Winter term 2018 CHEM/ENCH 323: Biological Chemistry Winter term 2018 Course instructor Dr Avena Ross CHE407, Chernoff Hall phone: 613-533-2618 email:avena.ross@chem.queensu.ca Office hours No pre-set office hours. Please

More information

MKTG PRINCIPLES OF MARKETING. Leroy Robinson, Jr., Ph.D. Associate Professor of Marketing

MKTG PRINCIPLES OF MARKETING. Leroy Robinson, Jr., Ph.D. Associate Professor of Marketing MKTG 3031 01 PRINCIPLES OF MARKETING FALL 2013 Leroy, Jr., Ph.D. Associate Professor of Marketing UHCL Hotline: 281-283-2221 (For information on bad weather closings of the University) Email: robinsonjr@uhcl.edu

More information

Two Mark question and Answers

Two Mark question and Answers 1. Define Bioinformatics Two Mark question and Answers Bioinformatics is the field of science in which biology, computer science, and information technology merge into a single discipline. There are three

More information

Office hours: Wednesday from 11:30 to 12:30 and Friday from 11:30 to 12:30. Lectures: Wednesday and Friday 10:15 AM -11:30 AM, Room HB-130

Office hours: Wednesday from 11:30 to 12:30 and Friday from 11:30 to 12:30. Lectures: Wednesday and Friday 10:15 AM -11:30 AM, Room HB-130 BIOL 367/01 2016 Molecular Biology Fall Semester 2016 Biology Department Concordia University, Montreal, Canada Instructor: Dr. Aida Abu-Baker Office: SP- 375.35 Office phone: (514) 848-2424, x 3395 Office

More information

Week 1 BCHM 6280 Tutorial: Gene specific information using NCBI, Ensembl and genome viewers

Week 1 BCHM 6280 Tutorial: Gene specific information using NCBI, Ensembl and genome viewers Week 1 BCHM 6280 Tutorial: Gene specific information using NCBI, Ensembl and genome viewers Web resources: NCBI database: http://www.ncbi.nlm.nih.gov/ Ensembl database: http://useast.ensembl.org/index.html

More information

The University of California, Santa Cruz (UCSC) Genome Browser

The University of California, Santa Cruz (UCSC) Genome Browser The University of California, Santa Cruz (UCSC) Genome Browser There are hundreds of available userselected tracks in categories such as mapping and sequencing, phenotype and disease associations, genes,

More information

Transcriptome analysis

Transcriptome analysis Statistical Bioinformatics: Transcriptome analysis Stefan Seemann seemann@rth.dk University of Copenhagen April 11th 2018 Outline: a) How to assess the quality of sequencing reads? b) How to normalize

More information

Imaging informatics computer assisted mammogram reading Clinical aka medical informatics CDSS combining bioinformatics for diagnosis, personalized

Imaging informatics computer assisted mammogram reading Clinical aka medical informatics CDSS combining bioinformatics for diagnosis, personalized 1 2 3 Imaging informatics computer assisted mammogram reading Clinical aka medical informatics CDSS combining bioinformatics for diagnosis, personalized medicine, risk assessment etc Public Health Bio

More information

B I O I N F O R M A T I C S

B I O I N F O R M A T I C S B I O I N F O R M A T I C S Kristel Van Steen, PhD 2 Montefiore Institute - Systems and Modeling GIGA - Bioinformatics ULg kristel.vansteen@ulg.ac.be SUPPLEMENTARY CHAPTER: DATA BASES AND MINING 1 What

More information

Introduc)on to Databases and Resources Biological Databases and Resources

Introduc)on to Databases and Resources Biological Databases and Resources Introduc)on to Bioinforma)cs Online Course : IBT Introduc)on to Databases and Resources Biological Databases and Resources Learning Objec)ves Introduc)on to Databases and Resources - Understand how bioinforma)cs

More information

M252 M262 Course Syllabus M252A-B & M262A-B. Molecular Mechanisms of Human Diseases. Course Director: Professor Yibin Wang

M252 M262 Course Syllabus M252A-B & M262A-B. Molecular Mechanisms of Human Diseases. Course Director: Professor Yibin Wang M252A-B & M262A-B Molecular Mechanisms of Human Diseases Course Director: Professor Yibin Wang (yibinwang@mednet.ucla.edu) Course Coordinator: Yesenia Rayos yrayos@lifesci.ucla.edu Proposal Coordinator:

More information

INTRODUCTION TO PLANT MOLECULAR BIOLOGY

INTRODUCTION TO PLANT MOLECULAR BIOLOGY INTRODUCTION TO PLANT MOLECULAR BIOLOGY SYLLABUS I. Course and Instructor Information. Course: HOS 3305 Section: 3831 Credit Hours: 3 Period 2-3: T 8:30-9:20 am & 9:35-10:25 am R 8:30-9:20 pm Room: 2318

More information

Basics of RNA-Seq. (With a Focus on Application to Single Cell RNA-Seq) Michael Kelly, PhD Team Lead, NCI Single Cell Analysis Facility

Basics of RNA-Seq. (With a Focus on Application to Single Cell RNA-Seq) Michael Kelly, PhD Team Lead, NCI Single Cell Analysis Facility 2018 ABRF Meeting Satellite Workshop 4 Bridging the Gap: Isolation to Translation (Single Cell RNA-Seq) Sunday, April 22 Basics of RNA-Seq (With a Focus on Application to Single Cell RNA-Seq) Michael Kelly,

More information

LEARNING OBJECTIVES At the conclusion of this course, students will be able to:

LEARNING OBJECTIVES At the conclusion of this course, students will be able to: FLEX LEARNING FORMAT This course is delivered in a flexible learning format. While most summer session courses meet on a regular schedule two times each week throughout the term (12 class meetings over

More information

Introduction to human genomics and genome informatics

Introduction to human genomics and genome informatics Introduction to human genomics and genome informatics Session 1 Prince of Wales Clinical School Dr Jason Wong ARC Future Fellow Head, Bioinformatics & Integrative Genomics Adult Cancer Program, Lowy Cancer

More information

Bioinformatics. Ingo Ruczinski. Some selected examples... and a bit of an overview

Bioinformatics. Ingo Ruczinski. Some selected examples... and a bit of an overview Bioinformatics Some selected examples... and a bit of an overview Department of Biostatistics Johns Hopkins Bloomberg School of Public Health July 19, 2007 @ EnviroHealth Connections Bioinformatics and

More information

AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE

AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE ACCELERATING PROGRESS IS IN OUR GENES AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE GENESPRING GENE EXPRESSION (GX) MASS PROFILER PROFESSIONAL (MPP) PATHWAY ARCHITECT (PA) See Deeper. Reach Further. BIOINFORMATICS

More information

Introduction. CS482/682 Computational Techniques in Biological Sequence Analysis

Introduction. CS482/682 Computational Techniques in Biological Sequence Analysis Introduction CS482/682 Computational Techniques in Biological Sequence Analysis Outline Course logistics A few example problems Course staff Instructor: Bin Ma (DC 3345, http://www.cs.uwaterloo.ca/~binma)

More information

Analysis of Biological Sequences SPH

Analysis of Biological Sequences SPH Analysis of Biological Sequences SPH 140.638 swheelan@jhmi.edu nuts and bolts meet Tuesdays & Thursdays, 3:30-4:50 no exam; grade derived from 3-4 homework assignments plus a final project (open book,

More information

Introduction to RNA-Seq in GeneSpring NGS Software

Introduction to RNA-Seq in GeneSpring NGS Software Introduction to RNA-Seq in GeneSpring NGS Software Dipa Roy Choudhury, Ph.D. Strand Scientific Intelligence and Agilent Technologies Learn more at www.genespring.com Introduction to RNA-Seq In a few years,

More information

INTRODUCTION TO BIOINFORMATICS. SAINTS GENETICS Ian Bosdet

INTRODUCTION TO BIOINFORMATICS. SAINTS GENETICS Ian Bosdet INTRODUCTION TO BIOINFORMATICS SAINTS GENETICS 12-120522 - Ian Bosdet (ibosdet@bccancer.bc.ca) Bioinformatics bioinformatics is: the application of computational techniques to the fields of biology and

More information

Gene-centered databases and Genome Browsers

Gene-centered databases and Genome Browsers COURSE OF BIOINFORMATICS a.a. 2015-2016 Gene-centered databases and Genome Browsers We searched Accession Number: M60495 AT NCBI Nucleotide Gene has been implemented at NCBI to organize information about

More information