Current state of proteomics standardization and (C-)HPP data quality guidelines


1 9/6/ Department of Pharmacy, Analytical Biochemistry. Current state of proteomics standardization and (C-)HPP data quality guidelines. DTL focus meeting on data integration, standards and FAIR principles in proteomics. Péter Horvatovich

2 Organization of the Human Proteome Project

3 Organization of C-HPP I.

4 Organization of C-HPP II. Biobanks Integration of C-HPP and B/D-HPP Teams Slide from Mark Baker

5 First guideline of C-HPP: Paik YK, et al., Standard Guidelines for the Chromosome-Centric Human Proteome Project, PMID

6 The key to making real headway on the HPP is to agree on a common, shared, globally acceptable big data language Slide from Mark Baker

7 The Human Proteome Project Workflow [diagram labels: individual lab-based MS data; ProteomeXchange; PRIDE; MassIVE; PeptideAtlas; PASSEL; GPMdb; Human Protein Atlas; neXtProt PE1-5 classifications; HPP Guidelines; HPP Metrics; HPP Publications] Slide from Mark Baker

8 Slide from Lydie Lane

9 HPP/neXtProt protein existence data from
PE level: PE1 = evidence at protein level; PE2 = evidence at transcript level only; PE3 = inferred from homology; PE4 = predicted; PE5 = uncertain
Columns: neXtProt 18/09/2013 version, %; neXtProt 12/02/2016 version, % (15, , , %; TOTAL 20, , )
Label: the missing proteins
Slide from Mark Baker

10 Metrics Used by HPP Teams. The initial 2013 definition of "missing" was no protein-level data or insufficient documentation for identification (PE2+PE3+PE4+PE5). In 2014 this was revised to PE2+PE3+PE4, as PE5 proteins are considered dubious. Slide from Mark Baker
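The two definitions of "missing proteins" on this slide can be compared with a short Python sketch. The function name and the counts are placeholders invented for illustration; they are not neXtProt figures.

```python
def count_missing(pe_counts, include_pe5=False):
    """Sum the entries regarded as 'missing' under the HPP definitions.

    include_pe5=True  -> initial 2013 definition (PE2+PE3+PE4+PE5)
    include_pe5=False -> revised 2014 definition (PE2+PE3+PE4)
    """
    levels = ["PE2", "PE3", "PE4"]
    if include_pe5:
        levels.append("PE5")
    return sum(pe_counts[level] for level in levels)


# Placeholder counts, invented for illustration only.
example_counts = {"PE1": 100, "PE2": 20, "PE3": 5, "PE4": 3, "PE5": 7}

missing_2013 = count_missing(example_counts, include_pe5=True)
missing_2014 = count_missing(example_counts)
print(missing_2013, missing_2014)
```

Keeping the PE5 switch as a parameter makes it easy to report both definitions side by side when comparing neXtProt releases.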

11 A new protein existence viewer Slide from Lydie Lane

12 Nature papers on the draft of the human proteome PMID % PMID %

13 Testing 2014 claims of credible MS evidence for 108/200 ORs
1. Failure to distinguish discriminating (proteotypic) from nondiscriminating peptides
2. Inclusion of many low-quality MS spectra
3. Use of short peptides (< 7 aa)
4. Use of older database builds
Slide from Mark Baker
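Points 1 and 3 can be expressed as a simple filter. This is a minimal sketch, assuming a hypothetical mapping from each peptide sequence to the protein accessions it matches. The 7-residue minimum follows the slide; the function name and the example data are invented for illustration.

```python
def usable_peptides(peptide_to_proteins, min_length=7):
    """Keep peptides that are long enough and map to exactly one protein."""
    kept = {}
    for peptide, proteins in peptide_to_proteins.items():
        unique_proteins = set(proteins)
        if len(peptide) < min_length:
            continue  # shorter than the 7 aa threshold named on the slide
        if len(unique_proteins) != 1:
            continue  # nondiscriminating: shared between several proteins
        kept[peptide] = next(iter(unique_proteins))
    return kept


# Hypothetical input: accessions PROT_A, PROT_B, PROT_C are placeholders.
example = {
    "GYIVAAVVK": ["PROT_A"],              # 9 aa, one protein: kept
    "AVVK": ["PROT_A"],                   # 4 aa: dropped as too short
    "LSSPATLNSR": ["PROT_B", "PROT_C"],   # shared: dropped
}
result = usable_peptides(example)
print(result)
```

Points 2 and 4 (spectrum quality and database version) cannot be checked from sequences alone and are left out of the sketch.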

14 Human peptides in PeptideAtlas: million PSMs, 1 million distinct peptides, 14,000 canonical proteins [chart labels: Proteins (0% to 100%); PSM FDR; Peptide FDR 0.01; Protein FDR; Only peptides 7 AA] Slide from Eric Deutsch

15 Olfactory receptor evidence in PeptideAtlas Slide from Eric Deutsch

16 Olfactory receptors in PeptideAtlas: only 2 of neXtProt's 473 olfactory receptors are canonical in PeptideAtlas. Slide from Eric Deutsch

17 Which protein does the peptide implicate?
Spectrum originally identified as: GYIVAAVVK
But a better and exact match is: GYIAVAVVK
This latter sequence is not in our reference proteome, which is why it was not identified correctly.
Is it olfactory receptor OR5A2? (no other corroborating evidence)
GIVSVLVVLISYGYIVAAVVKISSATGRTKAFSTCASH
GYIAVAVVK
Or is it serotransferrin (0.5 million PSMs)?
SDNCEDTPEAGYFAIAVVKKSASDLTWDNLKGKKS
GYIAVAVVK
I→V (dbSNP:rs) is in our reference proteome from UniProt.
F→I is not in our reference proteome and not in neXtProt. But this protein has many SNPs, and this may be the explanation.
Slide from Eric Deutsch
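The comparison on this slide can be reproduced with a few lines of Python. The sketch below uses only the sequences shown on the slide; the function names are invented for illustration. It locates the closest window in each protein fragment and lists the residue differences from the better-matching peptide GYIAVAVVK.

```python
def substitutions(reference, observed):
    """List (position, reference_residue, observed_residue) per mismatch, 1-based."""
    if len(reference) != len(observed):
        raise ValueError("sequences must have equal length")
    return [
        (pos, ref, obs)
        for pos, (ref, obs) in enumerate(zip(reference, observed), start=1)
        if ref != obs
    ]


def closest_window(protein, peptide):
    """Return the protein window of peptide length with the fewest mismatches."""
    n = len(peptide)
    windows = [protein[i:i + n] for i in range(len(protein) - n + 1)]
    return min(windows, key=lambda w: len(substitutions(w, peptide)))


observed = "GYIAVAVVK"
or5a2 = "GIVSVLVVLISYGYIVAAVVKISSATGRTKAFSTCASH"
serotransferrin = "SDNCEDTPEAGYFAIAVVKKSASDLTWDNLKGKKS"

or_window = closest_window(or5a2, observed)
tf_window = closest_window(serotransferrin, observed)
or_diff = substitutions(or_window, observed)
tf_diff = substitutions(tf_window, observed)

# Same residues in a different order means the same peptide mass.
same_composition = sorted(or_window) == sorted(observed)

print(or_window, or_diff)   # GYIVAAVVK [(4, 'V', 'A'), (5, 'A', 'V')]
print(tf_window, tf_diff)   # GYFAIAVVK [(3, 'F', 'I'), (5, 'I', 'V')]
print(same_composition)     # True
```

The OR5A2 window differs from the observed peptide only by a swap of two adjacent residues, so both sequences have the same amino acid composition and precursor mass. The serotransferrin window needs the two substitutions named on the slide, F→I and I→V.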

18 Q9H255 = OR51E2. But GPMdb does have this one. This is the only OR that Ron Beavis thinks is legitimate, but it is only observed with a single peptide (many times), in one sample that PeptideAtlas doesn't have. Ron Beavis: "If you check a little closer, the older gene symbol for OR51E2 is PSGR, a prostate-specific G-coupled receptor protein (Cancer Res Dec 1;60(23): ). So, I'd actually suggest that this is a true identification and that interpreting the "OR" in the gene name as being literally true is the problem." Slide from Eric Deutsch

19 Growth of Human Proteome with Large Datasets from Note: Savitski/Kuster reanalysis of Wilhelm et al.: 14,741 proteins identified, MCP 2015. Slide from Gilbert S. Omenn

20 Latest HPP Guideline
HUPO: MIAPE, PSI
NIH-NCI: proteogenomics guideline
Journals:
- Journal of Proteome Research
- Molecular and Cellular Proteomics
- Proteomics Clinical Applications
HPP 1.0: data deposition at ProteomeXchange; FDR at PSM, peptide and protein levels
HPP 2.0: MS data interpretation, PMID

21

22 Manuscript detailing the process: Ternent et al., Proteomics, 2014
Example dataset: PXD
Title: Discovery of new CSF biomarkers for meningitis in children
- 12 runs: 4 controls and 8 infected samples
- Identification and quantification data
Juan A. Vizcaíno, juan@ebi.ac.uk, 13th HUPO World Congress, Madrid, 5 October 2014

23 PX data workflow for MS/MS data
1. Mass spectrometer output files: raw data (binary files) or peak list spectra in a standardized format (mzML, mzXML).
2. Result files:
a. Complete submissions: result files can be converted to PRIDE XML or the mzIdentML data standard.
b. Partial submissions: for workflows not yet supported by PRIDE, search engine output files will be stored and provided in their original form.
3. Metadata: sufficiently detailed description of sample origin, workflow, instrumentation, submitter.
4. Other files (optional):
a. QUANT: quantification-related results
b. PEAK: peak list files
c. GEL: gel images
d. OTHER: any other file type
e. FASTA
f. SP_LIBRARY
[diagram labels: Published; Raw Files; Other files]
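The distinction between complete and partial submissions in point 2 can be sketched as a small check. This is an illustration only: the function name and the exact rule are assumptions, not the logic of any ProteomeXchange or PRIDE tool. The `RESULT`, `PEAK` and `QUANT` labels appear on the slides, while `RAW` and `SEARCH` are labels assumed here for instrument output and for search engine output kept in its original form.

```python
def submission_type(file_types):
    """Classify a submission from the set of file-type labels it contains.

    Assumed rule (hypothetical): spectra must be present as RAW or PEAK files;
    a RESULT file (PRIDE XML or mzIdentML) makes the submission complete,
    original search engine output (SEARCH) makes it partial.
    """
    types = set(file_types)
    if not types & {"RAW", "PEAK"}:
        return "unclassified"  # no mass spectrometer output supplied
    if "RESULT" in types:
        return "complete"
    if "SEARCH" in types:
        return "partial"
    return "unclassified"


complete_example = submission_type(["RAW", "RESULT", "QUANT"])
partial_example = submission_type(["RAW", "SEARCH", "FASTA"])
print(complete_example, partial_example)
```

Optional file types such as QUANT, GEL or FASTA do not change the outcome in this sketch, which mirrors their optional status in the list above.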

24 Complete vs partial submissions: experimental metadata. General experimental metadata about the projects is similar. However, assay-level information in partial submissions is less detailed. [panels: Complete; Partial]

25 Complete vs partial submissions: processed results. For complete submissions, it is possible to connect the spectra with the processed identification results, and they can be visualized. [panels: Complete; Partial]

26 Complete submissions using mzIdentML
Search engine results + MS files. An increasing number of tools support export to mzIdentML:
- Mascot
- MSGF+
- MyriMatch and related tools from D. Tabb's lab
- OpenMS
- PEAKS
- ProCon (ProteomeDiscoverer, Sequest)
- Scaffold
- TPP via the idconvert tool (ProteoWizard)
- ProteinPilot (planned by the end of 2014)
- Others: library for X!Tandem conversion, lab internal pipelines
Referenced spectral files need to be submitted as well (all open formats are supported).
Updated list:

27 Now: native file export [diagram labels: Tools (Mascot, ProteinPilot, Scaffold, PEAKS, MSGF+, Others); Native file export; mzIdentML; RESULT file generation; Final RESULT file; Spectra files]

28 FDR accumulation when combining datasets
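The slide title refers to a known effect: when protein lists that were each filtered at a fixed FDR are merged, correct identifications tend to recur across datasets while false ones tend to differ, so the error rate of the union rises. The sketch below works this out under deliberately simple, assumed conditions: every dataset reports the same number of proteins, the true identifications are identical across datasets, and the false ones never overlap. The function name and numbers are illustrative, not taken from the talk.

```python
def combined_protein_fdr(per_dataset_fdr, n_datasets, proteins_per_dataset=1000):
    """Estimate the FDR of the union of protein lists under the stated assumptions."""
    false_per_dataset = per_dataset_fdr * proteins_per_dataset
    true_shared = proteins_per_dataset - false_per_dataset  # same set in every dataset
    false_total = false_per_dataset * n_datasets             # disjoint, so they add up
    return false_total / (true_shared + false_total)


fdr_1 = combined_protein_fdr(0.01, 1)
fdr_10 = combined_protein_fdr(0.01, 10)
print(f"1 dataset: {fdr_1:.3f}, 10 datasets: {fdr_10:.3f}")
```

With these assumptions, ten datasets at 1% each give a combined list with roughly 9% false entries. Real datasets overlap only partly, so the actual figure depends on the data, but the direction of the effect is the same.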

29 Manual Inspection of Extraordinary Claims Reviewers and readers (and authors) need to see this: Slide from Eric Deutsch

30 Manual Inspection of Extraordinary Claims Reviewers and readers should not see this: This is what false positives look like Slide from Eric Deutsch

31 Thank you for your attention! Acknowledgement of all collaborators and members of the (C-)HPP participating in C-HPP workshops and HUPO meetings. Questions?

to proteomics data in the PRIDE database

to proteomics data in the PRIDE database Interactive and computational access to proteomics data in the PRIDE database Daniel RIOS PRIDE software developer PRIDE team, Proteomics Services Group PANDA group European Bioinformatics Institute Hinxton,

More information

ABSTRACT: METRICS OF PROGRESS Table 1.

ABSTRACT: METRICS OF PROGRESS Table 1. Progress on the Draft Human Proteome 2018 Metrics of HUPO s Human Proteome Project Report to the HUPO Council from the HPP Executive Committee, August 2018 Gilbert S. Omenn (Chair), Mark S. Baker (Chair-elect),

More information

Protein Grouping, FDR Analysis and Databases.

Protein Grouping, FDR Analysis and Databases. Protein Grouping, FDR Analysis and Databases. March 15th 2012 Pratik Jagtap The Minnesota http://www.mass.msi.umn.edu/ Protein Grouping, FDR Analysis and Databases Overview. Protein Grouping : Concept

More information

PTM Identification and Localization from MS Proteomics Data

PTM Identification and Localization from MS Proteomics Data 05.10.17 PTM Identification and Localization from MS Proteomics Data Marc Vaudel Center for Medical Genetics and Molecular Medicine, Haukeland University Hospital, Bergen, Norway KG Jebsen Center for Diabetes

More information

High-throughput Proteomic Data Analysis. Suh-Yuen Liang ( 梁素雲 ) NRPGM Core Facilities for Proteomics and Glycomics Academia Sinica Dec.

High-throughput Proteomic Data Analysis. Suh-Yuen Liang ( 梁素雲 ) NRPGM Core Facilities for Proteomics and Glycomics Academia Sinica Dec. High-throughput Proteomic Data Analysis Suh-Yuen Liang ( 梁素雲 ) NRPGM Core Facilities for Proteomics and Glycomics Academia Sinica Dec. 9, 2009 High-throughput Proteomic Data Are Information Rich and Dependent

More information

PRIDE and ProteomeXchange

PRIDE and ProteomeXchange PRIDE and ProteomeXchange Henning Hermjakob Head of Molecular Systems European Bioinformatics Institute hhe@ebi.ac.uk Director of Bioinformatics National Center for Protein Sciences, Beijing Data resources

More information

ProteinPilot Report for ProteinPilot Software

ProteinPilot Report for ProteinPilot Software ProteinPilot Report for ProteinPilot Software Detailed Analysis of Protein Identification / Quantitation Results Automatically Sean L Seymour, Christie Hunter SCIEX, USA Powerful mass spectrometers like

More information

Mass Spectrometry Based Proteomics Data Analysis Using GalaxyP

Mass Spectrometry Based Proteomics Data Analysis Using GalaxyP Mass Spectrometry Based Proteomics Data Analysis Using GalaxyP GCC 2015 GalaxyP Workshop July 6th, 2015 Norwich, UK Presenters: Tim Griffin, Pratik Jagtap and James Johnson Documentation: Kevin Murray,

More information

Protein Reports CPTAC Common Data Analysis Pipeline (CDAP)

Protein Reports CPTAC Common Data Analysis Pipeline (CDAP) Protein Reports CPTAC Common Data Analysis Pipeline (CDAP) v. 4/13/2015 Summary The purpose of this document is to describe the protein reports generated as part of the CPTAC Common Data Analysis Pipeline

More information

How to view Results with. Proteomics Shared Resource

How to view Results with. Proteomics Shared Resource How to view Results with Scaffold 3.0 Proteomics Shared Resource An overview This document is intended to walk you through Scaffold version 3.0. This is an introductory guide that goes over the basics

More information

ProteinPilot Software for Protein Identification and Expression Analysis

ProteinPilot Software for Protein Identification and Expression Analysis ProteinPilot Software for Protein Identification and Expression Analysis Providing expert results for non-experts and experts alike ProteinPilot Software Overview New ProteinPilot Software transforms protein

More information

How to view Results with Scaffold. Proteomics Shared Resource

How to view Results with Scaffold. Proteomics Shared Resource How to view Results with Scaffold Proteomics Shared Resource Starting out Download Scaffold from http://www.proteomes oftware.com/proteom e_software_prod_sca ffold_download.html Follow installation instructions

More information

Proteomics software at MSI. Pratik Jagtap Minnesota Supercomputing institute

Proteomics software at MSI. Pratik Jagtap Minnesota Supercomputing institute Proteomics software at MSI. Pratik Jagtap Minnesota Supercomputing institute http://www.mass.msi.umn.edu/ Proteomics software at MSI. proteomics : emerging technology proteomics workflow search algorithms

More information

Original article PRIDE: Quality control in a proteomics data repository

Original article PRIDE: Quality control in a proteomics data repository Original article PRIDE: Quality control in a proteomics data repository Attila Csordas*, David Ovelleiro, Rui Wang, Joseph M. Foster, Daniel Ríos, Juan Antonio Vizcaíno and Henning Hermjakob EMBL Outstation,

More information

Spectral Counting Approaches and PEAKS

Spectral Counting Approaches and PEAKS Spectral Counting Approaches and PEAKS INBRE Proteomics Workshop, April 5, 2017 Boris Zybailov Department of Biochemistry and Molecular Biology University of Arkansas for Medical Sciences 1. Introduction

More information

Mass spectrometry Proteomics and MIAPE

Mass spectrometry Proteomics and MIAPE Mass spectrometry Proteomics and MIAPE Florian Breitwieser - Research Center of Molecular Medicine, Vienna, Austria Nov 18, 2010 Outline BioC-devel meeting Europe 17.-18. 11. 2010 MS-MS Proteomics Standard

More information

PRIDE Inspector: a tool to visualize and validate MS proteomics data

PRIDE Inspector: a tool to visualize and validate MS proteomics data Europe PMC Funders Group Author Manuscript Published in final edited form as: Nat Biotechnol. ; 30(2): 135 137. doi:10.1038/nbt.2112. PRIDE Inspector: a tool to visualize and validate MS proteomics data

More information

GNPS: Global Natural Products Social Molecular Networking Delivering data-enabled, community-driven research

GNPS: Global Natural Products Social Molecular Networking Delivering data-enabled, community-driven research GNPS: Global Natural Products Social Molecular Networking Delivering data-enabled, community-driven research Mingxun Wang 1,2,4, Jeremy Carver 1,4, Julie Wertz 1,4, Laurence Bernstein 1,4, Seungjin Na

More information

MRMPilot Software: Accelerating MRM Assay Development for Targeted Quantitative Proteomics

MRMPilot Software: Accelerating MRM Assay Development for Targeted Quantitative Proteomics Product Bulletin MRMPilot Software: Accelerating MRM Assay Development for Targeted Quantitative Proteomics With Unique QTRAP System Technology Overview Targeted peptide quantitation is a rapidly growing

More information

MIAPE: Mass Spectrometry Informatics

MIAPE: Mass Spectrometry Informatics MIAPE: Mass Spectrometry Informatics Pierre-Alain Binz[1,2]*, Robert Barkovich[3], Ronald C. Beavis[4], David Creasy[5], David M. Horn[6], Randall K. Julian Jr.[7], Sean L. Seymour[8], Chris F. Taylor[9],

More information

False Discovery Rate ProteoRed Multicentre Study 6 Terminology Decoy sequence construction

False Discovery Rate ProteoRed Multicentre Study 6 Terminology Decoy sequence construction False Discovery Rate ProteoRed Multicentre Study 6 Terminology FDR: False Discovery Rate PSM: Peptide- Spectrum Match (a.k.a., a hit) PIT: Percentage of Incorrect Targets In the past years, a number of

More information

ProteinPilot Software Overview

ProteinPilot Software Overview ProteinPilot Software Overview High Quality, In-Depth Protein Identification and Protein Expression Analysis Sean L. Seymour and Christie L. Hunter SCIEX, USA As mass spectrometers for quantitative proteomics

More information

Important Information for MCP Authors

Important Information for MCP Authors Guidelines to Authors for Publication of Manuscripts Describing Development and Application of Targeted Mass Spectrometry Measurements of Peptides and Proteins and Submission Checklist The following Guidelines

More information

Pushing the Leading Edge in Protein Quantitation: Integrated, Precise, and Reproducible Protein Quantitation Workflow Solutions

Pushing the Leading Edge in Protein Quantitation: Integrated, Precise, and Reproducible Protein Quantitation Workflow Solutions 2017 Metabolomics Seminars Pushing the Leading Edge in Protein Quantitation: Integrated, Precise, and Reproducible Protein Quantitation Workflow Solutions The world leader in serving science 2 3 Cancer

More information

Chromosomeomosome 13 Chromosomeomosome 17 Gene a AST b nssnps c Gene AST nssnps BRCA BRCA RB1 2 3 ERBB IRS2 1 3 TP

Chromosomeomosome 13 Chromosomeomosome 17 Gene a AST b nssnps c Gene AST nssnps BRCA BRCA RB1 2 3 ERBB IRS2 1 3 TP Table 1. Features of salient genes on chromosomes 13 and 17 with respect to the presence of alternatively spliced transcripts and non-synonymous single-nucleotide polymorphisms Chromosomeomosome 13 Chromosomeomosome

More information

Accepted Article. Faculty of Science, University of Zurich, CH-8049 Zurich, Switzerland

Accepted Article. Faculty of Science, University of Zurich, CH-8049 Zurich, Switzerland Technical Brief PASSEL: The PeptideAtlas SRM Experiment Library Terry Farrah 1, Eric W. Deutsch 1 *, Richard Kreisberg 1, Zhi Sun 1, David S. Campbell 1, Luis Mendoza 1, Ulrike Kusebauch 1, Mi-Youn Brusniak

More information

Center for Mass Spectrometry and Proteomics Phone (612) (612)

Center for Mass Spectrometry and Proteomics Phone (612) (612) Outline Database search types Peptide Mass Fingerprint (PMF) Precursor mass-based Sequence tag Results comparison across programs Manual inspection of results Terminology Mass tolerance MS/MS search FASTA

More information

Bioinformatic Tools. So you acquired data.. But you wanted knowledge. So Now What?

Bioinformatic Tools. So you acquired data.. But you wanted knowledge. So Now What? Bioinformatic Tools So you acquired data.. But you wanted knowledge So Now What? We have a series of questions What the Heck is That Ion? How come my MW does not match? How do I make a DB to search against?

More information

PROTEOINFORMATICS OVERVIEW

PROTEOINFORMATICS OVERVIEW PROTEOINFORMATICS OVERVIEW August 11th 2016 Pratik Jagtap Center for Mass Spectrometry and Proteomics http://www.cbs.umn.edu/msp Outline PROTEOMICS WORKFLOW PEAKLIST PROCESSING Search Databases Overview

More information

Enabling Systems Biology Driven Proteome Wide Quantitation of Mycobacterium Tuberculosis

Enabling Systems Biology Driven Proteome Wide Quantitation of Mycobacterium Tuberculosis Enabling Systems Biology Driven Proteome Wide Quantitation of Mycobacterium Tuberculosis SWATH Acquisition on the TripleTOF 5600+ System Samuel L. Bader, Robert L. Moritz Institute of Systems Biology,

More information

Supporting Information for Comprehensive HCP Profiling by Targeted and Untargeted Analysis of DIA Mass Spectrometry Data with PRM Verification

Supporting Information for Comprehensive HCP Profiling by Targeted and Untargeted Analysis of DIA Mass Spectrometry Data with PRM Verification Supporting Information for Comprehensive HCP Profiling by Targeted and Untargeted Analysis of DIA Mass Spectrometry Data with PRM Verification Simion Kreimer 1, Yuanwei Gao 1, Somak Ray 1, Mi Jin 2,3,

More information

A Bovine PeptideAtlas of milk and mammary gland proteomes

A Bovine PeptideAtlas of milk and mammary gland proteomes Proteomics 2012, 12, 2895 2899 2895 DOI 10.1002/pmic.201200057 DATASET BRIEF A Bovine PeptideAtlas of milk and mammary gland proteomes Stine L. Bislev 1, Eric W. Deutsch 2, Zhi Sun 2, Terry Farrah 2, Ruedi

More information

MIAPE: Mass Spectrometry Quantification

MIAPE: Mass Spectrometry Quantification MIAPE: Mass Spectrometry Quantification Salvador Martínez-Bartolomé[1,12], Eric W. Deutsch[2], Pierre-Alain Binz[3], Andrew R. Jones[4], Martin Eisenacher[5], Gerhard Mayer[5], Alex Campos[6,12], Francesc

More information

RockerBox. Filtering massive Mascot search results at the.dat level

RockerBox. Filtering massive Mascot search results at the.dat level RockerBox Filtering massive Mascot search results at the.dat level Challenges Big experiments High amount of data Large raw and.dat files (> 2GB) How to handle our results?? The 2.2 peptide summary could

More information

Spectronaut Pulsar X. Maximize proteome coverage and data completeness by utilizing the power of Hybrid Libraries

Spectronaut Pulsar X. Maximize proteome coverage and data completeness by utilizing the power of Hybrid Libraries Spectronaut Pulsar X Maximize proteome coverage and data completeness by utilizing the power of Hybrid Libraries More versatility in proteomics research Spectronaut has delivered highest performance in

More information

Quantitative Proteomics: From Technology to Cancer Biology

Quantitative Proteomics: From Technology to Cancer Biology Quantitative Proteomics: From Technology to Cancer Biology Beyond the Genetic Prescription Pad: Personalizing Cancer Medicine in 2014 February 10-11, 2014 Thomas Kislinger Molecular Biomarkers in Body

More information

The Open2Dprot Proteomics Project for n-dimensional Protein Expression Data Analysis. The Open2Dprot Project. Introduction

The Open2Dprot Proteomics Project for n-dimensional Protein Expression Data Analysis. The Open2Dprot Project. Introduction The Open2Dprot Proteomics Project for n-dimensional Protein Expression Data Analysis http://open2dprot.sourceforge.net/ Revised 2-05-2006 * (cf. 2D-LC) Introduction There is a need for integrated proteomics

More information

CCRD Proteomics facility RIH-Brown University

CCRD Proteomics facility RIH-Brown University CCRD Proteomics facility RIH-Brown University Experimental design TO Publication Nagib Ahsan, PhD Where is the CCRD Proteomics Facility COBRE CCRD Proteomics Core Facility 1 Hoppin St, Providence, 02903

More information

About OMICS Group Conferences

About OMICS Group Conferences About OMICS Group OMICS Group International is an amalgamation of Open Access publications and worldwide international science conferences and events. Established in the year 2007 with the sole aim of

More information

Workflows and Pipelines for NGS analysis: Lessons from proteomics

Workflows and Pipelines for NGS analysis: Lessons from proteomics Workflows and Pipelines for NGS analysis: Lessons from proteomics Conference on Applying NGS in Basic research Health care and Agriculture 11 th Sep 2014 Debasis Dash Where are the protein coding genes

More information

MCP Papers in Press. Published on August 12, 2015 as Manuscript O

MCP Papers in Press. Published on August 12, 2015 as Manuscript O MCP Papers in Press. Published on August 12, 2015 as Manuscript O115.048777 Galaxy Integrated Omics: Web-based standards-compliant workflows for proteomics informed by transcriptomics Jun Fan 1, Shyamasree

More information

MCP Papers in Press. Published on April 29, 2011 as Manuscript O The Human Proteome Project: Current State and Future Direction

MCP Papers in Press. Published on April 29, 2011 as Manuscript O The Human Proteome Project: Current State and Future Direction MCP Papers in Press. Published on April 29, 2011 as Manuscript O111.009993 The Human Proteome Project: Current State and Future Direction Pierre Legrain* 1, Ruedi Aebersold 2, Alexander Archakov 3, Amos

More information

HIGHLIGHTS FROM HPP WORKSHOP 21 SEPTEMBER 2017 in DUBLIN

HIGHLIGHTS FROM HPP WORKSHOP 21 SEPTEMBER 2017 in DUBLIN HIGHLIGHTS FROM HPP WORKSHOP 21 SEPTEMBER 2017 in DUBLIN About 70 HPP investigators and interested HUPO Congress attendees participated in the post-congress Workshop at University College Dublin on 21

More information

Chromosome 5. Kyoto HUPO Initiative Assembly. Peter Horvatovich 1, Karin Wolters 1, Pei-Jing Pai 2 Yingwei Hu 2, Henry Lam 2, Rainer Bischoff 1

Chromosome 5. Kyoto HUPO Initiative Assembly. Peter Horvatovich 1, Karin Wolters 1, Pei-Jing Pai 2 Yingwei Hu 2, Henry Lam 2, Rainer Bischoff 1 9/23/2013 1 Chromosome 5 Peter Horvatovich 1, Karin Wolters 1, Pei-Jing Pai 2 Yingwei Hu 2, Henry Lam 2, Rainer Bischoff 1 1 University of Groningen, 2 Hong Kong University of Science and Technology Kyoto

More information

Filter-based Protein Digestion (FPD): A Detergent-free and Scaffold-based Strategy for TMT workflows

Filter-based Protein Digestion (FPD): A Detergent-free and Scaffold-based Strategy for TMT workflows Supporting Information Filter-based Protein Digestion (FPD): A Detergent-free and Scaffold-based Strategy for TMT workflows Ekaterina Stepanova 1, Steven P. Gygi 1, *, Joao A. Paulo 1, * 1 Department of

More information

Institute for Advanced Studies, City University of Hong Kong Workshop on Genomics, Cells, & Mathematics 10 July 2018

Institute for Advanced Studies, City University of Hong Kong Workshop on Genomics, Cells, & Mathematics 10 July 2018 Proteogenomics: Computational and Bioinformatics Innovations for Facilitating Identification of Missing Proteins and Predicting Functions of Unannotated Proteins (and Genes) Gilbert S. Omenn, MD, PhD Harold

More information

IPRG 2015 (PROTEOME INFORMATICS RESEARCH GROUP) DIFFERENTIAL ABUNDANCE IN LABEL-FREE PROTEOMICS. Olga Vitek

IPRG 2015 (PROTEOME INFORMATICS RESEARCH GROUP) DIFFERENTIAL ABUNDANCE IN LABEL-FREE PROTEOMICS. Olga Vitek IPRG 21 (PROTEOME INFORMATICS RESEARCH GROUP) DIFFERENTIAL ABUNDANCE IN LABEL-FREE PROTEOMICS Olga Vitek IPRG 21 iprg committee Henry Lam - Hong Kong University of Science and Technology (Co-chair) Eugene

More information

Targeted Proteomics Environment

Targeted Proteomics Environment Targeted Proteomics Environment Signal processing for quantitative proteomics Brendan MacLean MacCoss Lab Spectrum-based Quantification Uses peptide spectrum matches already calculated Spectral counting

More information

New Approaches to Quantitative Proteomics Analysis

New Approaches to Quantitative Proteomics Analysis New Approaches to Quantitative Proteomics Analysis Chris Hodgkins, Market Development Manager, SCIEX ANZ 2 nd November, 2017 Who is SCIEX? Founded by Dr. Barry French & others: University of Toronto Introduced

More information

Agilent Software Tools for Mass Spectrometry Based Multi-omics Studies

Agilent Software Tools for Mass Spectrometry Based Multi-omics Studies Agilent Software Tools for Mass Spectrometry Based Multi-omics Studies Technical Overview Introduction The central dogma for biological information flow is expressed as a series of chemical conversions

More information

Strategies for Quantitative Proteomics. Atelier "Protéomique Quantitative" La Grande Motte, France - June 26, 2007

Strategies for Quantitative Proteomics. Atelier Protéomique Quantitative La Grande Motte, France - June 26, 2007 Strategies for Quantitative Proteomics Atelier "Protéomique Quantitative", France - June 26, 2007 Bruno Domon, Ph.D. Institut of Molecular Systems Biology ETH Zurich Zürich, Switzerland OUTLINE Introduction

More information

Proteomics: A Challenge for Technology and Information Science. What is proteomics?

Proteomics: A Challenge for Technology and Information Science. What is proteomics? Proteomics: A Challenge for Technology and Information Science CBCB Seminar, November 21, 2005 Tim Griffin Dept. Biochemistry, Molecular Biology and Biophysics tgriffin@umn.edu What is proteomics? Proteomics

More information

Improving Productivity with Applied Biosystems GPS Explorer

Improving Productivity with Applied Biosystems GPS Explorer Product Bulletin TOF MS Improving Productivity with Applied Biosystems GPS Explorer Software Purpose GPS Explorer Software is the application layer software for the Applied Biosystems 4700 Proteomics Discovery

More information

SpectroDive NEXT GENERATION TARGETED PROTEOMICS. Integration of Ready-Made Panels Improved Workflow for Custom Panels

SpectroDive NEXT GENERATION TARGETED PROTEOMICS. Integration of Ready-Made Panels Improved Workflow for Custom Panels SpectroDive NEXT GENERATION TARGETED PROTEOMICS Integration of Ready-Made Panels Improved Workflow for Custom Panels IMPROVING MULTIPLEXING FOR TARGETED PROTEOMICS Novel targeted proteomics workflows can

More information

Modification Site Localization Scoring Integrated into a Search Engine

Modification Site Localization Scoring Integrated into a Search Engine Modification Site Localization Scoring Integrated into a Search Engine Peter R. Baker 1, Jonathan C. Trinidad 1, Katalin F. Medzihradszky 1, Alma L. Burlingame 1 and Robert J. Chalkley 1 1 Mass Spectrometry

More information

Version 4.0 of PaxDb: Protein abundance data, integrated across model organisms, tissues, and cell-lines

Version 4.0 of PaxDb: Protein abundance data, integrated across model organisms, tissues, and cell-lines Proteomics 2015, 15, 3163 3168 3163 DOI 10.1002/pmic.201400441 TECHNICAL BRIEF Version 4.0 of PaxDb: Protein abundance data, integrated across model organisms, tissues, and cell-lines Mingcong Wang, Christina

More information

PEAKS 8 User Manual. PEAKS Team

PEAKS 8 User Manual. PEAKS Team PEAKS 8 User Manual PEAKS Team PEAKS 8 User Manual PEAKS Team Publication date 2016 Table of Contents 1. Overview... 1 1. How to Use This Manual... 1 2. What Is PEAKS?... 1 3. What Is New in PEAKS 8?...

More information

The twenty minute guide to mztab

The twenty minute guide to mztab The twenty minute guide to mztab Johannes Griss & Juan Antonio Vizcaíno, EBI, juan@ebi.ac.uk, December 2013 Introduction The purpose of this guide is to give a quick introduction on how to use mztab efficiently.

More information

Supplementary Information

Supplementary Information Identifying sources of tick blood meals using unidentified tandem mass spectral libraries Özlem Önder 1, Wenguang Shao 2, Brian Kemps 1, Henry Lam 2,3,*, Dustin Brisson 1,* 1 Department of Biology, University

More information

Peptide and protein identification in mass spectrometry based proteomics. Yafeng Zhu, PhD student Karolinska Institutet, Scilifelab

Peptide and protein identification in mass spectrometry based proteomics. Yafeng Zhu, PhD student Karolinska Institutet, Scilifelab Peptide and protein identification in mass spectrometry based proteomics Yafeng Zhu, PhD student Karolinska Institutet, Scilifelab 2017-10-12 Content How is the peptide sequence identified? What is the

More information

PeptideShaker enables reanalysis of mass spectrometryderived. proteomics datasets

PeptideShaker enables reanalysis of mass spectrometryderived. proteomics datasets PeptideShaker enables reanalysis of mass spectrometryderived proteomics datasets Marc Vaudel 1,2, Julia M. Burkhart 1, René P. Zahedi 1, Eystein Oveland 2,3,4, Frode S. Berven 2,4,5, Albert Sickmann 1,

More information

N- The rank of the specified protein relative to all other proteins in the list of detected proteins.

N- The rank of the specified protein relative to all other proteins in the list of detected proteins. PROTEIN SUMMARY file N- The rank of the specified protein relative to all other proteins in the list of detected proteins. Unused (ProtScore) - A measure of the protein confidence for a detected protein,

More information

A High-Confidence Human Plasma Proteome Reference Set with Estimated Concentrations in PeptideAtlas

A High-Confidence Human Plasma Proteome Reference Set with Estimated Concentrations in PeptideAtlas MCP Papers in Press. Published on June 1, 2011 as Manuscript M110.006353 Plasma proteome reference set in PeptideAtlas A High-Confidence Human Plasma Proteome Reference Set with Estimated Concentrations

More information

A comprehensive evaluation of popular proteomics software workflows for label-free proteome quantification and imputation

A comprehensive evaluation of popular proteomics software workflows for label-free proteome quantification and imputation Briefings in Bioinformatics, 2017, 1 12 doi: 10.1093/bib/bbx054 Paper A comprehensive evaluation of popular proteomics software workflows for label-free proteome quantification and imputation Tommi V alikangas,

More information

Targeted Proteomics Environment

Targeted Proteomics Environment Targeted Proteomics Environment Status of the Skyline open-source software project five years after its inception Brendan MacLean User Community After 4 Years 560 registered users 150 registered for this

More information

A toolkit for the mzidentml standard: the ProteoIDViewer, the mzidlibrary

A toolkit for the mzidentml standard: the ProteoIDViewer, the mzidlibrary MCP Papers in Press. Published on June 28, 2013 as Manuscript O113.029777 TUTORIAL A toolkit for the mzidentml standard: the ProteoIDViewer, the mzidlibrary and the mzidvalidator Fawaz Ghali 1, Ritesh

More information

Faster, More Sensitive Peptide ID by Sequence DB Compression. Nathan Edwards Center for Bioinformatics and Computational Biology

MS/MS search engines fail when peptides are missing from the sequence database

Public sharing of complex MS-based qualitative and quantitative proteomic data analysis workflows: adding value to big data repositories

ASMS annual conference, June 16, 2014. Tim Griffin, tgriffin@umn.edu

AGILENT'S BIOINFORMATICS ANALYSIS SOFTWARE

ACCELERATING PROGRESS IS IN OUR GENES. GENESPRING GENE EXPRESSION (GX), MASS PROFILER PROFESSIONAL (MPP), PATHWAY ARCHITECT (PA). See Deeper. Reach Further. BIOINFORMATICS

TEDDY Omics Data Availability

Dietary Biomarkers: Ascorbic Acid. Result file: Available, Final December 2015. 1) Samples that did not pass the lab's QC were excluded from the 2) These data are considered the TEDDY standard for Ascorbic

The Human Proteome Project: Current State and Future Direction

Pierre Legrain, Ruedi Aebersold, Alexander Archakov, Amos Bairoch, Kumar Bala, Laura Beretta, John Bergeron, Christoph H. Borchers, Garry L. Corthals, Catherine

Integrative analysis frameworks for improved peptide and protein identifications from tandem mass spectrometry data

by Avinash Kumar Shanmugam. A dissertation submitted in partial fulfilment of the requirements

Metabolomics: Techniques and Applications ABRF

Sacramento, CA, March 23, 2010. Overview. Metabolomics Definitions. Representative Project: Sarcosine, a prostatic cancer biomarker. Technical overview/ How we

Agilent's NEW MassHunter Profinder

The Most Advanced Batch Feature Extraction Software for Metabolomics. Theodore Sana, Ph.D., Metabolomics Marketing Manager. MassHunter Profinder: A Batch Feature Extraction

Introduction. CS482/682 Computational Techniques in Biological Sequence Analysis

Outline: Course logistics; A few example problems. Course staff: Instructor: Bin Ma (DC 3345, http://www.cs.uwaterloo.ca/~binma)

Dr. Robert L. Moritz, Director Proteomics Research, Institute for Systems Biology

Quantitative Targeted Proteomics of Mycobacterium Tuberculosis Disease Markers. Human SRMAtlas. Outline. Introduction: Goals

Towards unbiased biomarker discovery

High-throughput molecular profiling technologies are routinely applied for biomarker discovery to make the drug discovery process more efficient and enable personalised

Clustering and scoring molecular interactions

Clustering and scoring molecular interactions relying on community standards. Rafael C. Jimenez, rafael@ebi.ac.uk. EBI is an Outstation of the European Molecular Biology Laboratory. Sharing infrastructures

Center for Mass Spectrometry and Proteomics Phone (612) (612)

Welcome to the Center for Mass Spectrometry & Proteomics Workshop. Department of Biochemistry, Molecular Biology and Biophysics, College of Biological Sciences, University of Minnesota. http://cbs.umn.edu/cmsp/

File Formats Commonly Used in Mass Spectrometry Proteomics*

Author's Choice Review. 2012 by The American Society for Biochemistry and Molecular Biology, Inc. This paper is available on line at http://www.mcponline.org

Confident Protein ID using Spectrum Mill Software

Welcome to our E-Seminar: Confident Protein ID using Spectrum Mill Software. Spectrum Mill Informatics Software: Start with batches of raw MS data! Spectrum Mill. Biologist-friendly answers!

Nature Biotechnology: doi: /nbt Supplementary Figure 1. The workflow of Open-pFind.

The MS data are first preprocessed by pParse, and then the MS/MS data are searched by the open search module. Next, the MS/MS data are re-searched by

Combination of Isobaric Tagging Reagents and Cysteinyl Peptide Enrichment for In-Depth Quantification

Protein Expression Analysis using the TripleTOF 5600 System and iTRAQ Reagents. Vojtech Tambor, Christie

Mock Submissions to FDA/CDRH: History and Lessons Learned

Kyle J. Myers, PhD, Director, Division of Imaging, Diagnostics, and Software Reliability, Office of Science and Engineering Laboratories. MDIC CM&S

Complex Adaptive Systems Forum: Transformative CAS Initiatives in Biomedicine

January 18, 2013. Anna D. Barker, Ph.D., Director, Transformative Healthcare Networks; Co-Director, Complex Adaptive Systems Initiative

Research Powered by Agilent's GeneSpring

Agilent Technologies, Inc. Carolina Livi, Bioinformatics Segment Manager. Topics: GeneSpring (GS) platform; New features in GS 13; What

Mass Spectrometry at EuPathDB

Mass Spec data available on most of the websites, not yet available for MicrosporidiaDB or PiroplasmaDB. EuPathDB Workshop, June 2011. Mark Heiges. AmoebaDB

Next Generation Technology for Reproducible and Precise Proteome Profiling

7th Czech Mass Spectrometry Conference, April 11th, 2018. Lars Kristensen, Ph.D., Application and training specialist. The world

Highly Confident Peptide Mapping of Protein Digests Using Agilent LC/Q TOFs

Technical Overview. Authors: Stephen Madden, Crystal Cody, and Jungkap Park, Agilent Technologies, Inc., Santa Clara, California,

UNIFI: The user environment Ken Eglinton Nordic User Training, September 2013

2013 Waters Corporation. What is UNIFI? Single Platform for Chromatography, Mass Spectrometry, Data Management and Laboratory

Introduction. Benefits of the SWATH Acquisition Workflow for Metabolomics Applications

SWATH Acquisition Improves Metabolite Coverage over Traditional Data Dependent Techniques for Untargeted Metabolomics. A Data Independent Acquisition Technique Employed on the TripleTOF 6600 System. Zuzana

timsTOF Pro powered by PASEF and the Evosep One for high speed and sensitive shotgun proteomics

The Parallel Accumulation Serial Fragmentation (PASEF) method for trapped ion mobility spectrometry (TIMS)

Proteomics Background and clinical utility

H.H. Helgason MD, Antoni van Leeuwenhoek Hospital, The Netherlands Cancer Institute, Amsterdam. Introduction. Background. Definitions. Protein biomarkers. Technical aspects

PRIDE Inspector: a tool to visualize and validate MS proteomics data

Rui Wang, Antonio Fabregat, Daniel Ríos, David Ovelleiro, Joseph M. Foster, Richard G. Côté, Johannes Griss, Attila Csordas, Yasset

Proteogenomics. Kelly Ruggles, Ph.D. Proteomics Informatics Week 9

Proteogenomics: Intersection of proteomics and genomics. As the cost of high-throughput genome sequencing goes down whole genome, exome

Agilent Solutions for Metabolomics YOUR PATH TO SUCCESS

UNDERSTANDING METABOLOMICS. Agilent is the leading global metabolomics vendor, offering our customers a broad array of cutting-edge instrumentation

ENCODE DCC Antibody Validation Document

Date of Submission: 09/12/12. Name: Trupti Kawli. Email: trupti@stanford.edu. Lab: Snyder. Antibody Name: SREBP1 (sc-8984). Target: SREBP1. Company/Source: Santa Cruz Biotechnology

Assay Validation Services

Overview: PierianDx's assay validation services bring clinical genomic tests to market more rapidly through experimental design, sample requirements, analytical pipeline optimization, and criteria tuning.