Proteomics Informatics (BMSC-GA 4437)

Instructor: David Fenyö
Contact information: David@FenyoLab.org
http://fenyolab.org/presentations/proteomics_informatics_2013/

Learning Objectives: Be able to analyze a proteomics data set and understand the limitations of the results.

Overview of Proteomics (Week 1): Why proteomics? Bioinformatics. Overview of the course.

Motivating Example: Protein Regulation. Geiger et al., Proteomic changes resulting from gene copy number variations in cancer cells, PLoS Genet. 2010 Sep 2;6(9). pii: e1001090.

Motivating Example: Protein Complexes. Alber et al., Nature 2007

Motivating Example: Signaling. Choudhary & Mann, Nature Reviews Molecular Cell Biology 2010

Bioinformatics: Biological System, Experimental Design, Samples, Measurements, Raw Data, Data Analysis, Information

Mass Spectrometry Based Proteomics: Lysis, Fractionation, Digestion, Mass spectrometry (MS), Peak Finding, Charge determination, De-isotoping, Integrating Peaks, Searching, Identified and Quantified Proteins

Overview of Mass spectrometry (Week 2): Ion Source, Mass Analyzer, Detector; measured quantity: mass/charge
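Because the detector reports mass/charge rather than mass, it can help to see the conversion written out. The following is a minimal sketch, assuming positive ions formed by adding protons; the function names are illustrative and not taken from the slides.

```python
PROTON_MASS = 1.007276  # mass of a proton in Da


def mz_from_mass(neutral_mass: float, charge: int) -> float:
    """m/z of an ion formed by adding `charge` protons to a neutral molecule."""
    if charge <= 0:
        raise ValueError("charge must be a positive integer")
    return (neutral_mass + charge * PROTON_MASS) / charge


def mass_from_mz(mz: float, charge: int) -> float:
    """Neutral mass recovered from an observed m/z and an assumed charge."""
    if charge <= 0:
        raise ValueError("charge must be a positive integer")
    return charge * (mz - PROTON_MASS)


if __name__ == "__main__":
    # The same 1000 Da molecule appears at different m/z for each charge state.
    for z in (1, 2, 3):
        print(z, round(mz_from_mass(1000.0, z), 4))
```

The same neutral mass shows up at a different m/z for each charge state, which is why charge determination appears as its own step in the data-analysis workflow.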

Overview of Mass spectrometry (Week 2): Ion Source, Mass Analyzer 1, Fragmentation, Mass Analyzer 2, Detector; fragment ions: b, y
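The b and y labels refer to N-terminal and C-terminal peptide fragments. As a rough sketch of how their singly charged m/z values could be computed, the code below uses the monoisotopic residue masses from the amino acid mass table in these slides; the constants for water and the proton, and the function name, are assumptions added for illustration.

```python
# Monoisotopic residue masses (Da), as listed in the amino acid mass table.
MONO = {
    "A": 71.0371, "R": 156.101, "N": 114.043, "D": 115.027, "C": 103.009,
    "E": 129.043, "Q": 128.059, "G": 57.0215, "H": 137.059, "I": 113.084,
    "L": 113.084, "K": 128.095, "M": 131.04, "F": 147.068, "P": 97.0528,
    "S": 87.032, "T": 101.048, "W": 186.079, "Y": 163.063, "V": 99.0684,
}
WATER = 18.010565   # monoisotopic mass of H2O (assumed constant)
PROTON = 1.007276   # mass of a proton (assumed constant)


def fragment_ions(peptide: str):
    """Return (b, y): singly charged b_1..b_(n-1) and y_1..y_(n-1) m/z values."""
    masses = [MONO[aa] for aa in peptide]
    n = len(masses)
    # b_i holds the first i residues plus one proton.
    b = [sum(masses[:i]) + PROTON for i in range(1, n)]
    # y_i holds the last i residues plus water plus one proton.
    y = [sum(masses[-i:]) + WATER + PROTON for i in range(1, n)]
    return b, y


if __name__ == "__main__":
    b_ions, y_ions = fragment_ions("PEPTIDE")
    print([round(m, 3) for m in b_ions])
    print([round(m, 3) for m in y_ions])
```

A useful sanity check is that each complementary pair b_i and y_(n-i) sums to the neutral peptide mass plus two protons.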

Overview of Mass spectrometry (Week 2): LC, Ion Source, Mass Analyzer 1, Fragmentation, Mass Analyzer 2, Detector; a series of mass/charge spectra acquired over Time

Analysis of mass spectra: signal processing, peak finding, and isotope clusters (Week 3). Plot axes: Intensity vs. m/z
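To make peak finding and isotope clusters concrete, here is a deliberately simple sketch: peaks are taken as local maxima above a threshold, and the charge is guessed from the spacing between the first two peaks of an isotope cluster, which is roughly 1.00335/z. The function names, the threshold, and the tolerance are illustrative assumptions; real signal processing also involves smoothing, baseline correction, and centroiding.

```python
C13_SPACING = 1.00335  # approximate mass difference between 13C and 12C, in Da


def find_peaks(mz, intensity, min_intensity=0.0):
    """Return (m/z, intensity) pairs at local maxima above min_intensity."""
    peaks = []
    for i in range(1, len(intensity) - 1):
        rising = intensity[i] > intensity[i - 1]
        not_falling_behind = intensity[i] >= intensity[i + 1]
        if rising and not_falling_behind and intensity[i] >= min_intensity:
            peaks.append((mz[i], intensity[i]))
    return peaks


def charge_from_spacing(cluster_mzs, max_charge=6, tol=0.01):
    """Guess the charge from the spacing of the first two isotope peaks.

    Returns None if no charge up to max_charge fits within tol.
    """
    spacing = cluster_mzs[1] - cluster_mzs[0]
    for z in range(1, max_charge + 1):
        if abs(spacing - C13_SPACING / z) <= tol:
            return z
    return None


if __name__ == "__main__":
    print(find_peaks([1, 2, 3, 4, 5], [0, 5, 1, 8, 0]))
    print(charge_from_spacing([500.0, 500.5017]))
```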

Protein identification I: searching protein sequence collections and significance testing (Week 4). Experiment: Lysis, Fractionation, Digestion, LC-MS, MS/MS. Search: from the Sequence DB, Pick Protein, Pick Peptide, compute All Fragment Masses, then Compare, Score, Test Significance against the MS/MS spectrum; Repeat for all peptides; Repeat for all proteins.
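The search loop on this slide (pick protein, pick peptide, compute fragment masses, compare, score, repeat) can be sketched as follows. This is a toy illustration under several assumptions that are not from the slides: trypsin cleaves after K or R unless followed by P, only singly charged b and y ions are considered, the score is a plain count of matched peaks, and the significance test and precursor-mass filter are omitted. All function names are hypothetical.

```python
# Monoisotopic residue masses (Da), as listed in the amino acid mass table.
MONO = {
    "A": 71.0371, "R": 156.101, "N": 114.043, "D": 115.027, "C": 103.009,
    "E": 129.043, "Q": 128.059, "G": 57.0215, "H": 137.059, "I": 113.084,
    "L": 113.084, "K": 128.095, "M": 131.04, "F": 147.068, "P": 97.0528,
    "S": 87.032, "T": 101.048, "W": 186.079, "Y": 163.063, "V": 99.0684,
}
WATER, PROTON = 18.010565, 1.007276  # assumed constants


def tryptic_digest(protein):
    """Cleave after K or R, except when the next residue is P."""
    peptides, start = [], 0
    for i in range(len(protein) - 1):
        if protein[i] in "KR" and protein[i + 1] != "P":
            peptides.append(protein[start:i + 1])
            start = i + 1
    peptides.append(protein[start:])
    return peptides


def fragment_mzs(peptide):
    """Singly charged b and y fragment m/z values of a peptide."""
    m = [MONO[aa] for aa in peptide]
    b = [sum(m[:i]) + PROTON for i in range(1, len(m))]
    y = [sum(m[-i:]) + WATER + PROTON for i in range(1, len(m))]
    return b + y


def score(peptide, observed, tol=0.02):
    """Number of observed peaks explained by a theoretical fragment."""
    theo = fragment_mzs(peptide)
    return sum(any(abs(p - t) <= tol for t in theo) for p in observed)


def search(protein_db, observed):
    """Return (protein name, peptide, score) of the best-scoring peptide."""
    best = None
    for name, sequence in protein_db.items():      # repeat for all proteins
        for pep in tryptic_digest(sequence):       # repeat for all peptides
            s = score(pep, observed)
            if best is None or s > best[2]:
                best = (name, pep, s)
    return best
```

For example, searching the fragments of `GGRPLLK` against a small made-up database such as `{"protA": "AAKGGRPLLKV", "protB": "MMKFFR"}` should select that peptide from `protA`. A realistic search engine would replace the peak count with a statistical score and estimate significance, as the slide title indicates.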

Protein identification II: search engines and protein sequence databases (Week 5)

Protein identification III: de novo sequencing (Week 6)

Amino acid masses

1-letter  3-letter  Chemical    Monoisotopic  Average
code      code      formula     mass          mass
A         Ala       C3H5ON      71.0371       71.0788
R         Arg       C6H12ON4    156.101       156.188
N         Asn       C4H6O2N2    114.043       114.104
D         Asp       C4H5O3N     115.027       115.089
C         Cys       C3H5ONS     103.009       103.139
E         Glu       C5H7O3N     129.043       129.116
Q         Gln       C5H8O2N2    128.059       128.131
G         Gly       C2H3ON      57.0215       57.0519
H         His       C6H7ON3     137.059       137.141
I         Ile       C6H11ON     113.084       113.159
L         Leu       C6H11ON     113.084       113.159
K         Lys       C6H12ON2    128.095       128.174
M         Met       C5H9ONS     131.04        131.193
F         Phe       C9H9ON      147.068       147.177
P         Pro       C5H7ON      97.0528       97.1167
S         Ser       C3H5O2N     87.032        87.0782
T         Thr       C4H7O2N     101.048       101.105
W         Trp       C11H10ON2   186.079       186.213
Y         Tyr       C9H9O2N     163.063       163.176
V         Val       C5H9ON      99.0684       99.1326

[Figure: MS/MS spectrum, % Relative Abundance (0 to 100) vs. m/z (250 to 1000). Labeled peaks: 260, 292, 389, 405, 504, 534, 633, 663, 762, 778, 875, 907, 1020, 1022, 1080, plus the [M+2H]2+ precursor. Panels: Mass Differences; Sequences consistent with spectrum.]
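The idea behind de novo sequencing is that the mass difference between neighboring peaks of one ion series corresponds to a single amino acid residue. The sketch below applies this to some of the peak labels from the spectrum on this slide (292, 405, 534, 633, 762, 875, 1022). Treating these seven peaks as one consecutive ion series is an assumption made here for illustration; the slide does not state which peaks belong together. The tolerance of 0.5 Da reflects that the labels are nominal masses.

```python
# Monoisotopic residue masses (Da), as listed in the amino acid mass table.
MONO = {
    "A": 71.0371, "R": 156.101, "N": 114.043, "D": 115.027, "C": 103.009,
    "E": 129.043, "Q": 128.059, "G": 57.0215, "H": 137.059, "I": 113.084,
    "L": 113.084, "K": 128.095, "M": 131.04, "F": 147.068, "P": 97.0528,
    "S": 87.032, "T": 101.048, "W": 186.079, "Y": 163.063, "V": 99.0684,
}


def residues_for_difference(delta, tol=0.5):
    """All residues whose mass is within tol of the given mass difference."""
    return sorted(aa for aa, m in MONO.items() if abs(m - delta) <= tol)


def read_ladder(peaks, tol=0.5):
    """Map each consecutive peak-to-peak difference to candidate residues."""
    ordered = sorted(peaks)
    return [
        residues_for_difference(hi - lo, tol)
        for lo, hi in zip(ordered, ordered[1:])
    ]


if __name__ == "__main__":
    # Assumed single ion series taken from the peak labels on the slide.
    series = [292, 405, 534, 633, 762, 875, 1022]
    print(read_ladder(series))
    # → [['I', 'L'], ['E'], ['V'], ['E'], ['I', 'L'], ['F']]
```

Isoleucine and leucine have identical masses, so the ladder cannot distinguish them; this is one reason several sequences can be consistent with the same spectrum.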

Protein identification IV: spectrum library searching (Week 7). Experiment: Lysis, Fractionation, Digestion, LC-MS/MS. Search: from the Spectrum Library, Pick Spectrum, then Compare, Score, Test Significance against the MS/MS spectrum; Repeat for all spectra. Result: Identified Proteins.
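One common way to compare a measured spectrum with a library spectrum is a normalized dot product (cosine similarity) of binned intensities. The slide does not say which score is used, so the following is only a sketch of that one option; the bin width, the function names, and the example library are assumptions.

```python
import math


def binned(peaks, bin_width=1.0):
    """Sum intensities of (m/z, intensity) peaks into fixed-width m/z bins."""
    vec = {}
    for mz, inten in peaks:
        key = int(mz // bin_width)
        vec[key] = vec.get(key, 0.0) + inten
    return vec


def cosine(a, b):
    """Normalized dot product of two sparse binned spectra (0 to 1)."""
    dot = sum(v * b.get(k, 0.0) for k, v in a.items())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def best_library_match(query, library, bin_width=1.0):
    """Return (name, score) of the library spectrum most similar to query."""
    q = binned(query, bin_width)
    scored = (
        (name, cosine(q, binned(peaks, bin_width)))
        for name, peaks in library.items()   # repeat for all library spectra
    )
    return max(scored, key=lambda item: item[1])


if __name__ == "__main__":
    library = {
        "pepA": [(100.0, 10), (200.0, 50), (300.0, 20)],
        "pepB": [(150.0, 30), (250.0, 30)],
    }
    query = [(100.2, 12), (200.1, 45), (300.4, 18)]
    print(best_library_match(query, library))
```

Fixed-width binning is simple but sensitive to peaks that fall near a bin edge; the significance test named on the slide is not covered here.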

Protein quantitation I: metabolic labeling (SILAC), chemical labeling, label-free quantitation, spectrum counting (Week 8)

Workflow: Lysis, Fractionation, Digestion, LC-MS
Indices: Sample i, Protein j, Peptide k
Quantities: protein amount $C_{ij}$; factors for the successive steps $L_{ij}$, $Pr_{ij}$, $D_{ijk}$, $Pe_{ik}$, $LC_{ik}$, $MS_{ik}$; measured peptide intensity $I_{ik}$

$$I_{ik} = MS_{ik}\, LC_{ik}\, Pe_{ik} \sum_j D_{ijk}\, Pr_{ij}\, L_{ij}\, C_{ij}$$

Assumption: $L_{ij}$, $Pr_{ij}$, $D_{ijk}$, $Pe_{ik}$, $LC_{ik}$, $MS_{ik}$ constant for all samples, so for two samples m and n:

$$\frac{C_{mj}}{C_{nj}} = \frac{I_{mk}}{I_{nk}}$$
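Under the assumption that the sample-handling factors are constant across samples, the ratio of a protein's amount between two samples can be estimated from the intensity ratios of its peptides. The sketch below takes the median over peptides; using the median (rather than, say, a mean or a regression) is a choice made here for robustness, not something the slides prescribe, and the function name and example intensities are hypothetical.

```python
from statistics import median


def protein_ratio(intensities_a, intensities_b):
    """Estimate the protein amount ratio (sample A / sample B).

    Each argument maps a peptide identifier to its measured intensity.
    Only peptides seen in both samples with nonzero intensity in B are used.
    """
    ratios = [
        intensities_a[pep] / intensities_b[pep]
        for pep in intensities_a
        if pep in intensities_b and intensities_b[pep] > 0
    ]
    if not ratios:
        raise ValueError("no peptides shared between the two samples")
    return median(ratios)


if __name__ == "__main__":
    # Hypothetical intensities for three peptides of one protein.
    sample_a = {"pep1": 200.0, "pep2": 400.0, "pep3": 90.0}
    sample_b = {"pep1": 100.0, "pep2": 200.0, "pep3": 30.0}
    print(protein_ratio(sample_a, sample_b))  # → 2.0
```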

Protein quantitation II: software (Week 9): Skyline, MaxQuant

Protein characterization I: post-translational modifications (Week 10). Peptide with two possible modification sites; matching MS/MS spectrum (Intensity vs. m/z). Which assignment does the data support? 1, 1 or 2, or 1 and 2?
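One way to ask which site assignment the data support is to compute the theoretical fragments for each candidate site and count how many are found in the spectrum; only fragments that contain one candidate site but not the other can tell the assignments apart. The sketch below does this for a single phosphorylation on a made-up peptide. The peptide, the modification mass of 79.96633 Da for phosphorylation, and all function names are assumptions for illustration, and only the single-site hypotheses are compared, not the "1 and 2" case.

```python
# Subset of the monoisotopic residue masses (Da) from the amino acid table.
MONO = {"A": 71.0371, "S": 87.032, "G": 57.0215, "K": 128.095}
WATER, PROTON = 18.010565, 1.007276  # assumed constants
PHOSPHO = 79.96633                   # assumed mass shift of phosphorylation


def fragment_mzs(peptide, mod_index=None, mod_mass=0.0):
    """Singly charged b and y ions, with an optional modified residue.

    mod_index is the 0-based position of the modified residue.
    """
    m = [MONO[aa] for aa in peptide]
    if mod_index is not None:
        m[mod_index] += mod_mass
    b = [sum(m[:i]) + PROTON for i in range(1, len(m))]
    y = [sum(m[-i:]) + WATER + PROTON for i in range(1, len(m))]
    return b + y


def site_scores(peptide, candidate_sites, observed, mod_mass=PHOSPHO, tol=0.02):
    """For each candidate site, count theoretical fragments seen in the data."""
    scores = {}
    for site in candidate_sites:
        theo = fragment_mzs(peptide, site, mod_mass)
        scores[site] = sum(
            any(abs(t - p) <= tol for p in observed) for t in theo
        )
    return scores


if __name__ == "__main__":
    peptide = "ASGSK"          # hypothetical peptide with serines at 1 and 3
    # Simulate a spectrum in which the second serine carries the phosphate.
    observed = fragment_mzs(peptide, 3, PHOSPHO)
    print(site_scores(peptide, [1, 3], observed))  # → {1: 4, 3: 8}
```

In the simulated case, four fragments (b1, b4, y1, y4) are shared by both assignments, so only the remaining four discriminate between the sites. With real data, missing or noisy site-determining peaks can leave the answer ambiguous, which corresponds to the "1 or 2" outcome on the slide.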

Protein Characterization II: protein-protein interactions, cross-linking, top-down, non-covalent complexes (Week 11). [Figure: interaction diagram with proteins labeled A, B, C, D; caption: Protein identification]

Molecular Signatures (Week 12)

Presentations of projects (Week 13): Select a published data set that has been made public and reanalyze it. Highlighted data sets: http://www.thegpm.org/. 10 min presentations.
