Algorithm for Matching Additional Spectra

Similar documents
Case 7 A Storage Protein From Seeds of Brassica nigra is a Serine Protease Inhibitor

Modification Site Localization Scoring Integrated into a Search Engine

Quantitative Analysis on the Public Protein Prospector Web Site. Introduction

Case 7 A Storage Protein From Seeds of Brassica nigra is a Serine Protease Inhibitor Last modified 29 September 2005

Purification: Step 1. Lecture 11 Protein and Peptide Chemistry. Cells: Break them open! Crude Extract

Purification: Step 1. Protein and Peptide Chemistry. Lecture 11. Big Problem: Crude extract is not the natural environment. Cells: Break them open!

Evaluation of Cu(I) Binding to the E2 Domain of the Amyloid. Precursor Protein A Lesson in Quantification of Metal Binding to

Strategies for Quantitative Proteomics. Atelier "Protéomique Quantitative" La Grande Motte, France - June 26, 2007

Application Note. Authors. Abstract. Ravindra Gudihal Agilent Technologies India Pvt. Ltd. Bangalore, India

Supplementary information, Figure S1A ShHTL7 interacted with MAX2 but not another F-box protein COI1.

ProteinPilot Report for ProteinPilot Software

Supplementary Online Material. An Expanded Eukaryotic Genetic Code 2QH, UK. * To whom correspondence should be addressed.

Analysis of Monoclonal Antibodies and Their Fragments by Size Exclusion Chromatography Coupled with an Orbitrap Mass Spectrometer

Lecture 8: Affinity Chromatography-III

Application Note. Authors. Abstract. Ravindra Gudihal Agilent Technologies India Pvt. Ltd. Bangalore India

Objective. Introduction. IP assisted LC/MS/MS making study protein complexes easy. Jon Hao 1, Yi Liu 1, Xiaozhi Ren 2, and King-Wai Yau 2

Host Cell Protein Analysis Using Agilent AssayMAP Bravo and 6545XT AdvanceBio LC/Q-TOF

Confident Protein ID using Spectrum Mill Software

Proteomics. Proteomics is the study of all proteins within organism. Challenges

Ensure your Success with Agilent s Biopharma Workflows

Protein analysis. Dr. Mamoun Ahram Summer semester, Resources This lecture Campbell and Farrell s Biochemistry, Chapters 5

Basic protein and peptide science for proteomics. Henrik Johansson

MBios 478: Mass Spectrometry Applications [Dr. Wyrick] Slide #1. Lecture 25: Mass Spectrometry Applications

Protein preparation and digestion. P. aeruginosa strains were grown in in AB minimal medium

Kinetics Review. Tonight at 7 PM Phys 204 We will do two problems on the board (additional ones than in the problem sets)

Strategies in proteomics

Hongwei Xie, Martin Gilar, and John C. Gebler Waters Corporation, Milford, MA, U.S.A. INTRODUCTION EXPERIMENTAL

A Highly Accurate Mass Profiling Approach to Protein Biomarker Discovery Using HPLC-Chip/ MS-Enabled ESI-TOF MS

ProMass HR Applications!

Advanced QA/QC characterization MS in QC : Multi Attribute Method

Supplementary Results Supplementary Table 1. P1 and P2 enrichment scores for wild-type subtiligase.

Automated platform of µlc-ms/ms using SAX trap column for

Quantitative mass spec based proteomics

Purification of (recombinant) proteins. Pekka Lappalainen, Institute of Biotechnology, University of Helsinki

Proteomics Technology Note

Exam MOL3007 Functional Genomics

B05 - PEPTIDE MAPPING(Revision 1) INTRODUCTION

Spectral Counting Approaches and PEAKS

CHAPTER 5: TECHNIQUES IN PROTEIN BIOCHEMISTRY

Center for Mass Spectrometry and Proteomics Phone (612) (612)

The Peptide Mapping Games!

Application Note # ET-20 BioPharma Compass: A fully Automated Solution for Characterization and QC of Intact and Digested Proteins

Introduction to Protein Purification

Extracting Pure Proteins from Cells

Separation of Native Monoclonal Antibodies and Identification of Charge Variants:

Executing T-REX in mammalian cells

Supplementary Tables. Note: Open-pFind is embedded as the default open search workflow of the pfind tool. Nature Biotechnology: doi: /nbt.

Liver Mitochondria Proteomics Employing High-Resolution MS Technology

Highly Confident Peptide Mapping of Protein Digests Using Agilent LC/Q TOFs

Spectrum Mill MS Proteomics Workbench. Comprehensive tools for MS proteomics

SUPPLEMENTARY INFORMATION

Fast and Efficient Peptide Mapping of a Monoclonal Antibody (mab): UHPLC Performance with Superficially Porous Particles

N- The rank of the specified protein relative to all other proteins in the list of detected proteins.

5.36 Biochemistry Laboratory Spring 2009

Využití cílené proteomiky pro kontrolu falšování potravin: identifikace peptidových markerů v mase pomocí LC- Q Exactive MS/MS

Rapidly Characterize Antibody- Drug Conjugates and Derive Drug-to-Antibody Ratios Using LC/MS

Model Peptides Reveal Specificity of IV -Acetyltransferase from Saccharomyces cerevisiae*

Application Note TOF/MS

Targeted Phosphoproteomics Analysis of Immunoaffinity Enriched Tyrosine Phosphorylation in Mouse Tissue

Product. Ni-NTA His Bind Resin. Ni-NTA His Bind Superflow. His Bind Resin. His Bind Magnetic Agarose Beads. His Bind Column. His Bind Quick Resin

Rapid Peptide Mapping via Automated Integration of On-line Digestion, Separation and Mass Spectrometry for the Analysis of Therapeutic Proteins

Detecting Challenging Post Translational Modifications (PTMs) using CESI-MS

Protein Purification and Characterization Techniques. Nafith Abu Tarboush, DDS, MSc, PhD

Spirochaete flagella hook proteins self-catalyse a lysinoalanine covalent crosslink for motility

Stabilization of a virus-like particle and its application as a nanoreactor at physiological conditions

Rapid Peptide Catabolite ID using the SCIEX Routine Biotransform Solution

Nature Biotechnology: doi: /nbt Supplementary Figure 1

Solutions to 7.02 Quiz II 10/27/05

CHAPTER 4 Cloning, expression, purification and preparation of site-directed mutants of NDUFS3 and NDUFS7

Challenges in peptide mapping mass spectrometry of biopharmaceuticals

Proteins. Patrick Boyce Biopharmaceutical Marketing Manager Waters Corporation 1

Charge Variant Assessment of Nanobodies at the Intact Level by CESI-MS

Thesis of the PhD dissertation. Hajnalka Dürgő. Supervisor: Dr. Katalin F. Medzihradszky

Analysis of Monoclonal Antibodies and Their Fragments by Size-Exclusion Chromatography Coupled with an Orbitrap Mass Spectrometer

Proteomics and some of its Mass Spectrometric Applications

Proteomics. Manickam Sugumaran. Department of Biology University of Massachusetts Boston, MA 02125

TECHNICAL BULLETIN. Ni-CAM HC Resin High Capacity Nickel Chelate Affinity Matrix. Product No. N 3158 Storage Temperature 2 8 C

Isotopic Resolution of Chromatographically Separated IdeS Subunits Using the X500B QTOF System

A highly sensitive and robust 150 µm column to enable high-throughput proteomics

Utilizing novel technology for the analysis of therapeutic antibodies and host cell protein contamination

N.A. Lacher, Q. Wang, and C.W. Demarest. June 24th, Pfizer BioTherapeutics Pharmaceutical Sciences

基于质谱的蛋白质药物定性定量分析技术及应用

Investigation of a Mammalian Cellular Model for Differential Protein Expression Analysis Using 1D PAGE and Cleavable ICAT Reagents

Improving Productivity with Applied Biosystems GPS Explorer

Supporting Information I

N-terminal dual protein functionalization by strainpromoted alkyne nitrone cycloaddition

Protein isotopic enrichment for NMR studies

蛋白質體學. Proteomics Amino acids, Peptides and Proteins 陳威戎 & 21

Click Chemistry Capture Kit

Streamlined and Complete LC-MS Workflow for MAM

An Antibody-free Approach for the Global Analysis of. Protein Methylation

The world leader in serving science. Jonathan L. Josephs and Aaron O. Bailey. Life Sciences Mass Spectrometry

Modulating the Cascade architecture of a minimal Type I-F CRISPR-Cas system

ADVANCING ATTRIBUTE CONTROL OF ANTIBODIES AND ITS DERIVATIVES USING HIGH RESOLUTION ANALYTICS

columns P r o P a c H I C C o l u m n S o l u t i o n s f o r P r o t e i n A n a l y s i s

Advanced Characterization of Antibody Drug Conjugates (ADCs) by Liquid Chromatography and Mass Spectrometry (LC/MS) John Gebler, Ph.D.

Characterization of intact monoclonal antibody with microfluidic chip electrophoresis mass spectrometry

Structure Characterization and Differentiation of Biosimilar and Reference Products Using Unique Combination of Complementary Fragmentation Mechanisms

Click-&-Go TM Protein Enrichment Kit *for capture alkyne-modified proteins*

rapiflex Innovation with Integrity Designed for Molecules that Matter. MALDI TOF/TOF

Transcription:

Improved Methods for Comprehensive Sample Analysis Using Protein Prospector Peter R. Baker 1, Katalin F. Medzihradszky 1 and Alma L. Burlingame 1 1 Mass Spectrometry Facility, Dept. of Pharmaceutical Chemistry, University of California, San Francisco, USA Algorithm for Matching Additional Spectra An algorithm has been written that can consider a range of integer mass shifts on any selected amino acid. For example for the 3000 spectra in the test dataset it can consider any mass shift between -100 Da and +600 Da on 2 proteins in under 1 minute on a standard dual processor computer. This is equivalent to looking for 14000 different modifications at the same time. The mass defect can also be changed.

Experimental KatG Overexpression and Purification His-tagged Mycobacterium tuberculosis catalase/peroxidase mutants were expressed in Escherichia coli. Cells were lysed by sonication in the presence of protease inhibitors, at 4 o C. Cellular debris was pelleted at 30,000 g for 30 min, resulting in a viscous, red-brown crude extract, which was first purified by affinity chromatography employing Ni-NTA-agarose. Fractions containing KatG, identified by their red/brown color and the presence of an 85-kDa band on a 10-12% SDS-polyacrylamide gel, were pooled and concentrated using an Amicon Diaflow concentrator equipped with a 50,000 molecular weight cutoff membrane (BioMax-50). The concentrate was further purified by size-exclusion chromatography. Fractions containing KatG exhibiting no impurities by SDS-PAGE analysis were pooled and concentrated as above. [Ghiladi et al., J. Am. Chem. Soc. 127, 13428-13442 (2005).] Tryptic digestion and mass spectrometry the mutant recombinant proteins were ingel digested with side-chain protected porcine trypsin without reduction and alkylation. The digests were subjected to nano-lc/ms/ms experiments. Peptides were fractionated in 0.1% formic acid-containing mobile phase, on a 75 µm ID C18 column utilizing an Eksigent pump coupled with a Spark autosampler. Data were acquired on a QSTAR XL mass spectrometer in a data-dependent mode: 1 sec MS surveys were followed by 3 sec CID analyses on computer-selected multiply charged ions. Introduced Protein Modifications Mutations in the KatG enzyme Sample 1: Tyr-229 acetyl -Tyr Sample 2: Tyr-229 amino -Tyr Sample 3: Tyr-229 azido -Tyr Sample 4: Tyr-229 iodo -Tyr In each of these unusual amino acids the hydroxyl group of the tyrosine was replaced with the substituent indicated Sample 5: Met-255 Cys Also in all the above proteins a Gly-234 Ala mutation occurred.

Test Data Set The test data set is an LC-MS analysis of 5 Mycobacterium tuberculosis catalase/peroxidase mutants. Around 3000 Q-STAR spectra were collected in 5 LC/MS analyses. An initial database search identified the KatG protein as well as a contaminating E coli protein. A second search was then done restricting the search to these 2 proteins and looking for a standard set of modifications (acetylated N-terminus, pyroglutamic acid and oxidized methionine). The second search resulted in 184/606 (30.4%) of the spectra being matched in fraction 1 with expectation values less than 0.0001. A No Enzyme search was able to match a further 19 spectra (3.1%). Use of the new algorithm enabled an extra 292 (48.2%) of the spectra to be matched. 111 (18.3%) of the spectra remain uninterpreted. Some Modifications Found Oxidation and dioxidation of Trp (+16, +32) Trp oxidation to kynurenin (+4) Acrylamide modified Cys (+71) Deamidation of Asp (+1) KLSWADLIVFAGNC(71.0341)ALESMGFK +3 Acrylamide Modified Cysteine

Mysterious N-terminal Modification +26 - + C 2 H 2? = 26.0157 CH 2 =CH-NH~ or CH 3 -CH=N~ Other two options: O=C=N~ that would add 25.9793 N=N=N~ that would add 25.9906 however these are more reactive species, plus mass wise do not fit. The modified peptides elute later than their unmodified counterparts ~125 examples in these 5 runs! Example of +26 Modification M(26.0125)AWHAAGTYR +3 M* a 4 2+ b 4 2+

Variation On The Same Theme NO b 1 ion, BUT an abundant modified immonium ion NOT acetyl -maybe HO-CH 2 -CH=N~? 128.105 (cal: 128.107) L(42.0202)LWPVK +2 Adducts Adducts found by the algorithm include Ammonium (+17 Da), Na (+22 Da), K (+38 Da) and Fe (+53 Da). DIEEVMTTSQPWWPADYGHYGPLFIR +3 +17 Da +22 Da +38 Da +53 Da

Enzyme Catalyzed Rearrangement Reaction [1, 2] RMAMNDVETAALIVGGHTFGK +3 MAMNDVETAALIVGGHTFGK(156.0749) +4 Normally the Arg is adjacent to the Met as in the first spectrum. The second example however shows an additional Arg at the C-terminus. Disulfide Bridges The new algorithm can be used in conjunction with Protein Prospector program MS-Bridge to find peptides with disulfide bridges. Mass shifts between 0-2000 Da were considered. 3 disulphide combinations were found. An example is given below (1023.3276 +4 ). KLSWADLIVFAGNCALESMGFK +4 VSFADLVVLGGCAAIEK

Tyr(Acetyl) Mutant Peptides Sample 1 y 1 DLENPLAAVQMGLIY(Acetyl)VNPEGPNGNPDPMAAAVDIRETFR +4 y 20 2+ y 22 2+ 926.23 (4+) is DLENPLAAVQMGLIYVNPEGPNGNPDPMAAAVDIR + 40.02 1059.56(4+) is DLENPLAAVQMGLIYVNPEGPNGNPDPMAAAVDIRETFR + 40.02 That is acetyl -Tyr, ie. Y OH + CH 3 CO = +C 2 H 2 = +26.015 Da Gly Ala mutation + CH 2 = +14.015 Da +40.03 Da In this case y 20 +2 and y 22 +2 don t match as the algorithm is only currently considering a single modification per peptide. Mutant Peptides Sample 4 DLENPLAAVQMGLIY(Iodo)VNPEGPNGNPDPMAAAVDIR +4 947.24 (4+) is DLENPLAAVQMGLIYVNPEGPNGNPDPMAAAVDIR + 124.06 That is iodo -Tyr, ie. OH + I = +109.9017 Da Gly Ala mutation = +CH 2 = +14.015 Da +123.917 Da

Mutant Peptides Sample 5 C(Acrylamide)AMNDVETAALIVGGHTFGK +3 702.37 (3+) is MAMNDVETAALIVGGHTFGK + 43.021 Da Met Cys mutation: -C 2 H 4 Acrylamide addition: +C 3 H 5 NO +CHNO = +43.006 Da Incorrect Monoisotopic Mass Assignment The arrows show the monoisotopic precursor m/z value returned by the Q-STAR precursor centroiding algorithm. The last case is a mixture of a deamidated and non-deamidated peptide. The new algorithm was able to confidently match all the corresponding MSMS spectra.

Incorrect Charge Assignment If a large enough mass range is allowed the new algorithm can also match peptides where the precursor centroiding algorithm assigned an incorrect charge. For example it obtained matches for peptides with charges of 6 and 7 which the centroiding algorithm can t assign automatically. MPEQHPPITETTTGAASNGC(185.0888)PVVGHMKYPVEGGGNQDWWPNRLNLK +6 In this case only the y ions are matching so the exact hit needs to be worked out manually. Conclusions and Future Work The algorithm was able to increase the number of spectra matched from around 30% to around 80%. It can be used to identify novel modifications and experimental problems. Once the modifications have been catalogued they can be programmed into a more efficient algorithm for routine searches. We need to investigate how the mass tolerance used in the search affects the results returned. Currently the algorithms output is a peptide and a mass shift. We hope to use a multi-stage search to allow several modifications per peptide and to improve the modification site assignment.

Acknowledgements Funding: NIH NCRR grant 01614 and the Vincent Coates Foundation. Dr. Reza Ghiladi provided the KatG samples Refs.: 1) Lippincott, J. et al. (1997) Mapping of recombinant hemoglobin using immobilized trypsin cartridges. Anal. Biochem. 252, 314-325. 2) Fodor, S. and Zhang, Z. Protease-catalyzed rearrangement of terminal amino acid residues in peptides. 51st ASMS Conference, Montreal, Quebec, Canada (2003) http://www.inmerge.com/aspfolder/asmsabstracts.html. (Abstract: 406)