GNPS: Global Natural Products Social Molecular Networking Delivering data-enabled, community-driven research
|
|
- Scot Willis Singleton
- 6 years ago
- Views:
Transcription
1 GNPS: Global Natural Products Social Molecular Networking Delivering data-enabled, community-driven research Mingxun Wang 1,2,4, Jeremy Carver 1,4, Julie Wertz 1,4, Laurence Bernstein 1,4, Seungjin Na 1,2,4, Nuno Bandeira Center for Computational Mass Spectrometry 2 Computer Science and Engineering 3 Skaggs School of Pharmacy and Pharmaceutical Sciences 4 University of California, San Diego Center for Computational Mass Spectrometry
2 Proteins run life How much is in the genes? Human: ~20,000-22,000 Mouse: ~20,000-22,000 Worm: ~19,000 (C.Elegans) Rice/Corn: ~32,000 45,000 Each gene may generate several proteins the functional workhorses of the cell Combinatorial splicing Sequence variation EMILG EFILG Post-Translational Modifications Hundreds of types and sites Quantification and turnover Protein Structure and interactions Endogenous and Immune peptides Microbiome: x more genes Center for Computational Mass Spectrometry
3 TRADITIONAL WORKFLOW Design sample MS Raw data De novo interpretation Reference lookup Mass spectrometry generates `fingerprints for biomolecules (10k-250k spectra, up to 100 GB per sample) Identified peptide spectrum determines protein presence and abundance
4 PARADIGM SHIFT NETWORKED RESEARCH Design sample MS Raw data What are the similar datasets? De novo interpretation What are the most important or common unidentified spectra? Which spectra were identified before? Who analyzed similar data? Reference lookup How good is this identification?
5 BIG SOLUTIONS? Big Data Big Algorithms [271.1] F (SK) S G T E C R A S M S E C D P A E H C T G Q S MassIVE Mass Spectrometry Interactive Virtual Environment Thousands of datasets, hundreds of terabytes b-ions in each spectrum Mass difference between b-ions Oxidized Methionine Designed to build on rather than just tolerate big data Big Compute Big Community Proteomics Scalable, Accessible and Flexible environment ProteoSAFe 50+ data analysis workflows scalable to thousands of cores Empower and enable community-wide sharing of knowledge
6 KNOWING THE PROTEOME To know the proteome we should be able to: Present the evidence for any identified peptide, protein or posttranslational modification Basic: tandem MS evidence for claimed observations Provenance: experimental conditions of data acquisition Structure the knowledge in a reusable format Spectral Libraries Curated collections of identified reference spectra Personal knowledge is not community knowledge unless it is reusable in some way Simplest form of reutilization is database that can be consulted useful but low throughput Searchable spectral libraries enable high-throughput automated reutilization PNNL microbial reference largest dataset to date (12 TB) includes 100+ searchable libraries How much do we know?
7 SPECTRAL LIBRARIES COVERAGE Spectral library: curated collection of identified reference spectra Current coverage of 19,874 Human genes 12,773 with 1+ unique peptides (2,250 with shared peptides only) 4,851 with no peptides NIST SWATH Atlas % Peptides 14.9% 12.0% # Peptides 207K 152K Average protein sequence coverage is ~15% Low coverage of active/functional sites
8 BIG DATA TO THE RESCUE? 70 terabytes of Human mass spectrometry data available in 976 public ProteomeXchange datasets (~1,000 papers) containing 88,049 LCMS runs How limited is the knowledge shared through ProteomeXchange? MassIVE Mass Spectrometry Interactive Virtual Environment 86 % - partial submissions, no identifications 6 % - data format lost provenance metadata (e.g., only peptide identifications) 8 % - have results files with provenance information ono statistical controls 6.7M IDs oreport FDR > 1% - 4.5M IDs oreport FDR 1% - 1.7M IDs + 14M IDs (CPTAC colorectal) Final tally: ~4.7% of all data minimally ready to be reused (<1% without CPTAC colorectal)
9 MASSIVE REANALYSIS Community knowledge requires reproducible, well-characterized results MS-GF+ standard database search Search workflow, source code and exact search parameters available at Everyone can reproduce the searches and test the search protocol Reanalyzed 14 TB of Human data with ~200M MS/MS spectra 95 million new statistically controlled identifications (~8-50x more) 4.2 million modified versions of 3.1 million unique peptide sequences Search results are attached to each dataset Interactive visualization, available for download in open formats
10 BROADER LOOK: CPTAC COLON CANCER 125 colorectal samples: 95 TCGA cases, 30 controls Imported CPTAC results 6.9M IDs MS-GF+ database search 8.9M IDs, 70k variants (169k total) Spectral library search (MSPLIT) 10M IDs, including 387K mixture spectra Proteogenomics searches of TCGA transcriptomics sequences (Enosi) 6.8M total IDs, 19,728 proteogenomic events Blind modification search (MODa) 7.8M IDs, 2.8M IDs for 221k variants (306k total) 203K new variants (unique modified peptides) Enosi: K.K(V A)LGAFSDGLAHLDNLK.G MODa: K.K(V,-28)LGAFSDGLAHLDNLK.G
11 TRANSLATING GENOMICS ALGORITHMS Sequence Alignment and Assembly Spectrum Alignment and Assembly Global alignment Local alignment [271.1] F (SK) S G T E C R A S M S E C D P A E H C T G Q S b-ions in each spectrum Mass difference between b-ions Oxidized Methionine Spectral Networks MassIVE KQGGTLDD LEE QAREL 2 KQGGTLDD LEE QARE 3 KQGGTLDD LEE QAR 4 KQGGTLDD LEE QA 5 KQGGTLDD LEE -18 QAR 6 KQGGTLDD LEE -18 Q 7 QGGTLDD LEE QAR 8 QGGTLDD -53 LEE QAR 8 6
12 SPECTRAL NETWORKS ANALYSIS New information from algorithms that were not available at time of publication K(V,-28)LGAFSDGLAHLDNLK VLGAFSDGLAHLDNLK (V,28)LGAFSDGLAHLDNLK Ac-VLGAFSDGLAHLDNLK
13 SPECTRAL NETWORKS OF CATARACTOUS LENS Tandem MS acquisition sensitivity is still very low for deep PTM analysis we typically don t see most peptides and most modifications for lysate experiments Cataractous Lens data Lens samples from 9 different patients Spans stages of development, age and disease Proteins do not turn over, accumulate modifications over time Also technically useful Sample complexity is low enough to allow for deep acquisition of low abundance peptide variants Lens dataset is probably the most analyzed for discovery of modifications (original publication in 2005) Searle et al, J Proteome Res. 4(2):546-54,
14 RAVEN ANALYSIS OF CATARACTOUS LENS Improved spectrum identification MS-GF MODa Spectral Networks MS-GF+ MODa Discovery of proteome diversity ~100 different variations, defined as (mass,site) pairs Every variation is supported by at least one spectral pair with corroborating fragmentation patterns VQDDFVEIHGK Identificatio ns 15,771 10,462 6,355 Gain +51% +148% VQ(D,14)DFVEIHGK z=1 RaVen VQ(D,14)DFVEIHGK z=3 Identifications also in multiple charge states
15 REVEALING PROTEOME DIVERSITY Proteome diversity is determined by variation within protein regions Spectral networks significantly improve the detection of variants Highly-variant regions with 70+ variants 78 regions in 14 proteins with 10+ variants 34 regions in 10 proteins with 20+ variants RaVen MS-GF MODa Alpha-crystallin A aa variants RYRLPSNVDQSALSCSLSADGMLTFCGPK RYRLPSNVDQSALSCSLSADGMLTFCGPK YRLPSNVDQSAL YRLPSNVDQSALS YRLPSNVDQSALSC YRLPSNVDQSALSCS YRLPSNVDQSALSCSL YRLPSNVDQSALSCSLS YRLPSNVDQSALSCSLSA YRLPSNVDQSALSCSLSAD YRLPSNVDQSALSCSLSADGML YRLPSNVDQSALSCSLSADGMLTF YRLPSNVDQSALSCSLSADGMLTF YRLPSNVDQSALSCSLSADGMLTF YRLPSNVDQSALSCSLSADGMLTFCGPK YRLPSNVDQSALSCSLSADGMLTFCGPK YRLPSNVDQSALSCSLSADGMLTFCGPK YRLPSNVDQSALSCSLSADGMLTFCGPK xyrlpsnvdqsalscslsadgmltfcgpk YRLPSNVDQSALSCSLSADGMLTFCGPK xyrlpsnvdqsalscslsadgmltfcgpk YRLPSNVDQSALSCSLSADGMLTFCGPK YRLPSNVDQSALSCSLSADGMLTFCGPK LPSNVDQSALSC LPSNVDQSALSC LPSNVDQSALSCSL LPSNVDQSALSCSLS LPSNVDQSALSCSLSA LPSNVDQSALSCSLSADGMLTF LPSNVDQSALSCSLSADGMLTF LPSNVDQSALSCSLSADGMLTF LPSNVDQSALSCSLSADGMLTF LPSNVDQSALSCSLSADGMLTFCGP LPSNVDQSALSCSLSADGMLTFCGP LPSNVDQSALSCSLSADGMLTFCGP LPSNVDQSALSCSLSADGMLTFCGP LPSNVDQSALSCSLSADGMLTFCGPK LPSNVDQSALSCSLSADGMLTFCGPK LPSNVDQSALSCSLSADGMLTFCGPK LPSNVDQSALSCSLSADGMLTFCGPK LPSNVDQSALSCSLSADGMLTFCGPK LPSNVDQSALSCSLSADGMLTFCGPK LPSNVDQSALSCSLSADGMLTFCGPK LPSNVDQSALSCSLSADGMLTFCGPK LPSNVDQSALSCSLSADGMLTFCGPK LPSNVDQSALSCSLSADGMLTFCGPK LPSNVDQSALSCSLSADGMLTFCGPK LPSNVDQSALSCSLSADGMLTFCGPK LPSNVDQSALSCSLSADGMLTFCGPK LPSNVDQSALSCSLSADGMLTFCGPK LPSNVDQSALSCSLSADGMLTFCGPK xlpsnvdqsalscslsadgmltfcgpk LPSNVDQSALSCSLSADGMLTFCGPK LPSNVDQSALSCSLSADGMLTFCGPKI SNVDQSALSCSLSADGMLTFCGPK SNVDQSALSCSLSADGMLTFCGPK SNVDQSALSCSLSADGMLTFCGPK NVDQSALSCSLSADGMLTFCGPK NVDQSALSCSLSADGMLTFCGPK VDQSALSCSLSADGMLTFCGPK QSALSCSLSADGMLTFCGPK QSALSCSLSADGMLTFCGPK QSALSCSLSADGMLTFCGPK QSALSCSLSADGMLTFCGPK SALSCSLSADGMLTFCGPK SCSLSADGMLTFCGPK SLSADGMLTFCGP SLSADGMLTFCGPK SLSADGMLTFCGPK SLSADGMLTFCGPK SLSADGMLTFCGPK xslsadgmltfcgpk SLSADGMLTFCGPK LSADGMLTFCGPK SADGMLTFCGPK SADGMLTFCGPK ADGMLTFCGPK GMLTFCGPK
16 ALPHA-CRYSTALLIN A New discoveries in decade-old data Cataract-specific modification (methylation) or polymorphism (D E, bovine homolog) on Alpha-crystallin A VQDDFVEIHGK VQ(D,14)DFVEIHGK Only present in cataracts group
17 NATURAL PRODUCTS DNA makes RNA makes PROTEIN makes PEPTIDE transcription translation non-ribosomal peptide synthesis Natural products account for most antibiotics in clinical use and for 75% of antibacterials introduced in : antibiotics (penicillin, vancomicine, etc.), immunosuppressants (cyclosporin), antiviral agents (luzopeptin A), antitumor agents (bleomycin), Mass spectrometry analysis of natural products differs from proteomics and ribosomal peptides in many respects: non-linear structures, e.g., cyclic and branch-cyclic peptides they often contain non-standard amino acids increasing the number of possible building blocks from 20 to 100+ they are often heavily modified with complex PTMs they often have a non-standard backbone Each of these complications (let alone all of them) invalidates traditional algorithms for MS-based peptide sequencing and identification.
18 subtilis ACN 0_1FA MS2_ # RT: AV: 5 NL: 1.50E2 F: ITMS + p NSI d Full ms @cid35.00 [ ] m/z subtilis ACN 0_1FA MS2_ # RT: AV: 2 NL: 4.78E2 F: ITMS + p NSI d Full ms @cid35.00 [ ] m/z Relative Abundance Relative Abundance subtilis ACN 0_1FA MS2_ /24/ :40:50 PM. MOLECULAR SPECTRAL NETWORKS Watrous et al., PNAS 2012
19 gnps.ucsd.edu First MassIVE Knowledge Base, open March 2014 Wang M et al, Nature Biotech, 2016 ProteoSAFe Co-analyze private+public data MassIVE Mass Spectrometry Interactive Virtual Environment Share data Crowdsourced curated libraries Explore unknown molecules
20 GNPS DATA SHARING Since March datasets 167,058 files 3.6 TB of data 100s of species
21 IDENTIFICATION: SPECTRAL LIBRARY SEARCH New public or private datasets GNPS Spectral Libraries ProteoSAFe Dereplication spectral library search Exact match: compound identification using spectral matching Analog match: find putative related molecules using approximate spectral matching (spectral alignment) One-step comparison against all accumulated knowledge of known molecules
22 GNPS SPECTRAL LIBRARIES Open spectral libraries Antibiotics and natural products FDA approved drugs 10,454 spectra for 6,810 compounds Over 500 library revisions by curators Previously in antibiotics Hardly any spectra publicly available No natural products spectral libraries
23 CREATING AND CURATING SPECTRAL LIBRARIES I know what this is! Quality levels: Gold: approved users, synthetic compounds Silver: published compounds Bronze: everything else
24 HOW DO GNPS LIBRARIES COMPARE?
25 HOW DO GNPS LIBRARIES COMPARE?
26 HOW DO GNPS LIBRARIES COMPARE?
27 . FINDING NEW ANALOGS
28 DATASET UPDATES AND COMMENTS Datasets are almost never complete at the time of initial submission Dataset MSV0001 Here is some new data Add/Request Missing Metadata Dataset MSV0002 Dataset MSV0003 This data seems great, but could you add this
29 DATASET SUBSCRIPTIONS Living data: promote and track evolution of public datasets Subscribe to this interesting dataset What s new with my data? Dataset MSV0001 As owner, you have metadata requests Dataset MSV0002 Dataset MSV0003 What is a subscription? Digest of comments Digest of new annotations in dataset All datasets are periodically re-searched against the whole knowledge base
30 SUBSCRIPTION YOUR DATA CALLS HOME
31 Datasets immediately available for reanalysis LIVE DATASET VIEW Subscriptions available for new IDs, comments Datasets continuously reanalyzed + New results sent to subscribers Public comments, metadata, etc
32 ENABLING LIVING DATA We have shown that living data is a workable concept Continuous reanalysis upgrades data to knowledge Automated reanalysis with new or updated algorithms Community-contributed reanalyses Community-wide knowledge aggregated in crowdsourced spectral libraries Most identifications come from community reference spectra people contribute what they see Hundreds of knowledge revisions improve reference metadata New discoveries in published data Already over 20-fold increases since publication New algorithms being contributed by 3 rd party computational groups Buy-in from the community Thousands of users from over 100 countries, most joined before publication Data deposited at acquisition, not publication Subscriptions keep scientists engaged in the life cycle of published data
33 FUTURE OUTLOOK What can the community do to help? Scientists Data sharing will increase your productivity avoid rediscovery, reuse knowledge Reutilization and reanalysis increases the impact of data that otherwise is lost Journals Editors: Publication guidelines should require proper deposition of supporting raw data Just raw data is not enough complete submissions require linking claims to raw data Reviewers: confirm key claims and findings by checking supporting data at repository Data citations the best datasets will support many publications whose citations may need to be indirectly accounted through the data (e.g., accession numbers only) Academic institutions and funding agencies Understand that `data dumps are useless Published results are only as strong as the supporting data Link claims and conclusions from manuscript to the raw data
34 ACKNOWLEDGEMENTS Mingxun Wang Jeremy Carver Julie Wertz Laurence Bernstein Seungjin Na GNPS, MassIVE, ProteoSAFe Spectral networks, large-scale workflows
Sangtae Kim. Dissertation: Generating Functions of Tandem Mass Spectra and Their Applications to Peptide Identifications Advisor: Pavel Pevzner
Sangtae Kim and Engineering 9500 Gilman Drive, La Jolla, CA sak008@ucsd.edu Tel: (858) 525-5284 http://cseweb.ucsd.edu/~sak008 Education Ph.D. in Computer Science Sep. 2006 May 2012 Dissertation: Generating
More informationCurrent state of proteomics standardization and (C-)HPP data quality guidelines
9/6/2016 3 Department of Pharmacy Analytical Biochemistry Current state of proteomics standardization and (C-)HPP data quality guidelines DTL focus meeting on data integration, standards and fair principles
More informationUnderstanding life WITH NEXT GENERATION PROTEOMICS SOLUTIONS
Innovative services and products for highly multiplexed protein discovery and quantification to help you understand the biological processes that shape life. Understanding life WITH NEXT GENERATION PROTEOMICS
More informationHighly Confident Peptide Mapping of Protein Digests Using Agilent LC/Q TOFs
Technical Overview Highly Confident Peptide Mapping of Protein Digests Using Agilent LC/Q TOFs Authors Stephen Madden, Crystal Cody, and Jungkap Park Agilent Technologies, Inc. Santa Clara, California,
More informationACCELERATING GENOMIC ANALYSIS ON THE CLOUD. Enabling the PanCancer Analysis of Whole Genomes (PCAWG) consortia to analyze thousands of genomes
ACCELERATING GENOMIC ANALYSIS ON THE CLOUD Enabling the PanCancer Analysis of Whole Genomes (PCAWG) consortia to analyze thousands of genomes Enabling the PanCancer Analysis of Whole Genomes (PCAWG) consortia
More informationEnabling Systems Biology Driven Proteome Wide Quantitation of Mycobacterium Tuberculosis
Enabling Systems Biology Driven Proteome Wide Quantitation of Mycobacterium Tuberculosis SWATH Acquisition on the TripleTOF 5600+ System Samuel L. Bader, Robert L. Moritz Institute of Systems Biology,
More informationAbout OMICS Group Conferences
About OMICS Group OMICS Group International is an amalgamation of Open Access publications and worldwide international science conferences and events. Established in the year 2007 with the sole aim of
More informationProteomics: A Challenge for Technology and Information Science. What is proteomics?
Proteomics: A Challenge for Technology and Information Science CBCB Seminar, November 21, 2005 Tim Griffin Dept. Biochemistry, Molecular Biology and Biophysics tgriffin@umn.edu What is proteomics? Proteomics
More informationTony Mire-Sluis Vice President, Corporate, Product and Device Quality Amgen Inc
The Regulatory Implications of the ever increasing power of Mass Spectrometry and its role in the Analysis of Biotechnology Products Where do we draw the line? Tony Mire-Sluis Vice President, Corporate,
More informationProteomics. Manickam Sugumaran. Department of Biology University of Massachusetts Boston, MA 02125
Proteomics Manickam Sugumaran Department of Biology University of Massachusetts Boston, MA 02125 Genomic studies produced more than 75,000 potential gene sequence targets. (The number may be even higher
More informationProteomics And Cancer Biomarker Discovery. Dr. Zahid Khan Institute of chemical Sciences (ICS) University of Peshawar. Overview. Cancer.
Proteomics And Cancer Biomarker Discovery Dr. Zahid Khan Institute of chemical Sciences (ICS) University of Peshawar Overview Proteomics Cancer Aims Tools Data Base search Challenges Summary 1 Overview
More informationSYMPOSIUM March 22-23, 2018
Bigger and Better Data Lessons from Frontlines of Precision Medicine Getting Your Transformation Right Frank Lee PhD IBM Global Industry Leader for Systems Group SYMPOSIUM March 22-23, 2018 5th Annual
More informationNew Approaches to Quantitative Proteomics Analysis
New Approaches to Quantitative Proteomics Analysis Chris Hodgkins, Market Development Manager, SCIEX ANZ 2 nd November, 2017 Who is SCIEX? Founded by Dr. Barry French & others: University of Toronto Introduced
More informationTowards unbiased biomarker discovery
Towards unbiased biomarker discovery High-throughput molecular profiling technologies are routinely applied for biomarker discovery to make the drug discovery process more efficient and enable personalised
More informationPeptide libraries: applications, design options and considerations. Laura Geuss, PhD May 5, 2015, 2:00-3:00 pm EST
Peptide libraries: applications, design options and considerations Laura Geuss, PhD May 5, 2015, 2:00-3:00 pm EST Overview 1 2 3 4 5 Introduction Peptide library basics Peptide library design considerations
More informationExpand Your Research with Metabolomics and Proteomics. Christine Miller Omics Market Manager ASMS 2017
Expand Your Research with Metabolomics and Proteomics Christine Miller Omics Market Manager ASMS 2017 New Additions to Agilent Omics Workflows Acquisition Ion Mobility Q-TOF IM All Ions MS/MS Find Features
More informationApplication Note # ET-20 BioPharma Compass: A fully Automated Solution for Characterization and QC of Intact and Digested Proteins
Application Note # ET-20 BioPharma Compass: A fully Automated Solution for Characterization and QC of Intact and Digested Proteins BioPharma Compass TM is a fully automated solution for the rapid characterization
More informationFourier-Transform MS for Top Down Proteomics
Fourier-Transform MS for Top Down Proteomics Proteomics Team @ Northwestern University Thermo User Meeting, December 9 th, 2015 Hosted by Thermo Fisher Scientific Odense, Denmark On the Use of Top Down
More informationThermo Scientific Compound Discoverer Software. Integrated solutions. for small molecule research
Thermo Scientific Compound Discoverer Software Integrated solutions for small molecule research Compound Discoverer Software The complete small molecule identification and characterization solution Based
More informationHow to view Results with Scaffold. Proteomics Shared Resource
How to view Results with Scaffold Proteomics Shared Resource Starting out Download Scaffold from http://www.proteomes oftware.com/proteom e_software_prod_sca ffold_download.html Follow installation instructions
More informationPTM Identification and Localization from MS Proteomics Data
05.10.17 PTM Identification and Localization from MS Proteomics Data Marc Vaudel Center for Medical Genetics and Molecular Medicine, Haukeland University Hospital, Bergen, Norway KG Jebsen Center for Diabetes
More informationMonoclonal Antibody Characterization on Q Exactive and Oribtrap Elite. Yi Zhang, Ph.D Senior Proteomic Marketing Specialist Oct.
Monoclonal Antibody Characterization on Q Exactive and Oribtrap Elite Yi Zhang, Ph.D Senior Proteomic Marketing Specialist Oct. 12, 211 Outline Orbitrap Mass Spectrometer in mab Characterization Intact
More informationfaster, confident more path to all the information in your samples. Agilent MassHunter Workstation Software Our measure is your success.
Agilent MassHunter Workstation Software faster, A confident more path to all the information in your samples. Our measure is your success. products applications soft ware services MassHunter Workstation
More informationProteomics and Cancer
Proteomics and Cancer Japan Society for the Promotion of Science (JSPS) Science Dialogue Program at Niitsu Senior High School Niitsu, Niigata September 4th 2006 Vladimir Valera, M.D, PhD JSPS Postdoctoral
More informationIntroduction to Bioinformatics
Introduction to Bioinformatics Alla L Lapidus, Ph.D. SPbSU St. Petersburg Term Bioinformatics Term Bioinformatics was invented by Paulien Hogeweg (Полина Хогевег) and Ben Hesper in 1970 as "the study of
More informationIon S5 and Ion S5 XL Systems
Ion S5 and Ion S5 XL Systems Targeted sequencing has never been simpler Explore the Ion S5 and Ion S5 XL Systems Adopting next-generation sequencing (NGS) in your lab is now simpler than ever The Ion S5
More informationHow to view Results with. Proteomics Shared Resource
How to view Results with Scaffold 3.0 Proteomics Shared Resource An overview This document is intended to walk you through Scaffold version 3.0. This is an introductory guide that goes over the basics
More informationWelcome to the NGS webinar series
Welcome to the NGS webinar series Webinar 1 NGS: Introduction to technology, and applications NGS Technology Webinar 2 Targeted NGS for Cancer Research NGS in cancer Webinar 3 NGS: Data analysis for genetic
More informationA highly sensitive and robust 150 µm column to enable high-throughput proteomics
APPLICATION NOTE 21744 Robust LC Separation Optimized MS Acquisition Comprehensive Data Informatics A highly sensitive and robust 15 µm column to enable high-throughput proteomics Authors Xin Zhang, 1
More informationSpectrum Mill MS Proteomics Workbench. Comprehensive tools for MS proteomics
Spectrum Mill MS Proteomics Workbench Comprehensive tools for MS proteomics Meeting the challenge of proteomics data analysis Mass spectrometry is a core technology for proteomics research, but large-scale
More informationHigh Resolution Accurate Mass Peptide Quantitation on Thermo Scientific Q Exactive Mass Spectrometers. The world leader in serving science
High Resolution Accurate Mass Peptide Quantitation on Thermo Scientific Q Exactive Mass Spectrometers The world leader in serving science Goals Explore the capabilities of High Resolution Accurate Mass
More informationCenter for Mass Spectrometry and Proteomics Phone (612) (612)
Outline Database search types Peptide Mass Fingerprint (PMF) Precursor mass-based Sequence tag Results comparison across programs Manual inspection of results Terminology Mass tolerance MS/MS search FASTA
More informationVyužití cílené proteomiky pro kontrolu falšování potravin: identifikace peptidových markerů v mase pomocí LC- Q Exactive MS/MS
Využití cílené proteomiky pro kontrolu falšování potravin: identifikace peptidových markerů v mase pomocí LC- Q Exactive MS/MS Michal Godula Ph.D. Thermo Fisher Scientific The world leader in serving science
More informationSkyline & Panorama: Key Tools for Establishing a Targeted LC/MS Workflow
Skyline & Panorama: Key Tools for Establishing a Targeted LC/MS Workflow Kristin Wildsmith Scientist, Biomarker Development LabKey User Conference October 6, 2016 Biomarkers enable drug development Drug
More informationThermo Scientific Peptide Mapping Workflows. Upgrade Your Maps. Fast, confident and more reliable peptide mapping.
Thermo Scientific Peptide Mapping Workflows Upgrade Your Maps Fast, confident and more reliable peptide mapping. Smarter Navigation... Peptide mapping is a core analytic in biotherapeutic development.
More informationN- The rank of the specified protein relative to all other proteins in the list of detected proteins.
PROTEIN SUMMARY file N- The rank of the specified protein relative to all other proteins in the list of detected proteins. Unused (ProtScore) - A measure of the protein confidence for a detected protein,
More informationProteomics software at MSI. Pratik Jagtap Minnesota Supercomputing institute
Proteomics software at MSI. Pratik Jagtap Minnesota Supercomputing institute http://www.mass.msi.umn.edu/ Proteomics software at MSI. proteomics : emerging technology proteomics workflow search algorithms
More informationBio-Reagent Services. Custom Gene Services. Gateway to Smooth Molecular Biology! Your Innovation Partner in Drug Discovery!
Bio-Reagent Services Custom Gene Services Gateway to Smooth Molecular Biology! Gene Synthesis Mutagenesis Mutant Libraries Plasmid Preparation sirna and mirna Services Large-scale DNA Sequencing GenPool
More informationPersonalized Medicine
Personalized Medicine Dr. Pablo Mentzinis, Director Government Relations, SAP SE Courtesy by Dominik Bertram, Marc von der Linden, Péter Adorján, SAP June 2016 1. Traditional medicine vs. personalized
More informationIntroduction to Bioinformatics
Introduction to Bioinformatics Richard Corbett Canada s Michael Smith Genome Sciences Centre Vancouver, British Columbia June 28, 2017 Our mandate is to advance knowledge about cancer and other diseases
More informationIon S5 and Ion S5 XL Systems
Ion S5 and Ion S5 XL Systems Targeted sequencing has never been simpler Introducing the Ion S5 and Ion S5 XL systems Now, adopting next-generation sequencing in your lab is simpler than ever. The Ion S5
More informationLTQ Orbitrap XL Hybrid FT Mass Spectrometer Unrivaled Performance and Flexibility
m a s s s p e c t r o m e t r y LTQ Orbitrap XL Hybrid FT Mass Spectrometer Unrivaled Performance and Flexibility Part of Thermo Fisher Scientific LTQ Orbitrap XL OFFERING OUTSTANDING MASS ACCURACY, RESOLVING
More informationless sensitive than RNA-seq but more robust analysis pipelines expensive but quantitiatve standard but typically not high throughput
Chapter 11: Gene Expression The availability of an annotated genome sequence enables massively parallel analysis of gene expression. The expression of all genes in an organism can be measured in one experiment.
More informationGenome Annotation Genome annotation What is the function of each part of the genome? Where are the genes? What is the mrna sequence (transcription, splicing) What is the protein sequence? What does
More informationCHAPTER 2A HOW DO YOU BEGIN TO CLONE A GENE? CHAPTER 2A STUDENT GUIDE 2013 Amgen Foundation. All rights reserved.
CHAPTER 2A HOW DO YOU BEGIN TO CLONE A GENE? 35 INTRODUCTION In the Program Introduction, you learned that the increase in diabetes in the United States has resulted in a great demand for its treatment,
More informationrapiflex Innovation with Integrity Designed for Molecules that Matter. MALDI TOF/TOF
rapiflex Designed for Molecules that Matter. Innovation with Integrity MALDI TOF/TOF rapiflex TM The first MALDI-TOF/TOF that adapts to your needs. The rapiflex is the most advanced MALDI TOF/TOF system
More informationFaster, easier, flexible proteomics solutions
Agilent HPLC-Chip LC/MS Faster, easier, flexible proteomics solutions Our measure is your success. products applications soft ware services Phospho- Anaysis Intact Glycan & Glycoprotein Agilent s HPLC-Chip
More informationAdvances in analytical biochemistry and systems biology: Proteomics
Advances in analytical biochemistry and systems biology: Proteomics Brett Boghigian Department of Chemical & Biological Engineering Tufts University July 29, 2005 Proteomics The basics History Current
More informationMate-pair library data improves genome assembly
De Novo Sequencing on the Ion Torrent PGM APPLICATION NOTE Mate-pair library data improves genome assembly Highly accurate PGM data allows for de Novo Sequencing and Assembly For a draft assembly, generate
More informationModification Site Localization Scoring Integrated into a Search Engine
Modification Site Localization Scoring Integrated into a Search Engine Peter R. Baker 1, Jonathan C. Trinidad 1, Katalin F. Medzihradszky 1, Alma L. Burlingame 1 and Robert J. Chalkley 1 1 Mass Spectrometry
More informationRapid Peptide Catabolite ID using the SCIEX Routine Biotransform Solution
Rapid Peptide Catabolite ID using the SCIEX Routine Biotransform Solution Rapidly Identify Major Catabolites with the SCIEX X500R QTOF System and MetabolitePilot TM 2.0 Software Ian Moore and Jinal Patel
More informationUC San Diego UC San Diego Electronic Theses and Dissertations
UC San Diego UC San Diego Electronic Theses and Dissertations Title Protein Identification via Assembly of Tandem Mass Spectra Permalink https://escholarship.org/uc/item/5p61q5st Author Guthals, Adrian
More informationMassHunter Profinder: Batch Processing Software for High Quality Feature Extraction of Mass Spectrometry Data
MassHunter Profinder: Batch Processing Software for High Quality Feature Extraction of Mass Spectrometry Data Technical Overview Introduction LC/MS metabolomics workflows typically involve the following
More informationMBios 478: Mass Spectrometry Applications [Dr. Wyrick] Slide #1. Lecture 25: Mass Spectrometry Applications
MBios 478: Mass Spectrometry Applications [Dr. Wyrick] Slide #1 Lecture 25: Mass Spectrometry Applications Measuring Protein Abundance o ICAT o DIGE Identifying Post-Translational Modifications Protein-protein
More informationProteomics. Proteomics is the study of all proteins within organism. Challenges
Proteomics Proteomics is the study of all proteins within organism. Challenges 1. The proteome is larger than the genome due to alternative splicing and protein modification. As we have said before we
More informationAdvanced QA/QC characterization MS in QC : Multi Attribute Method
Advanced QA/QC characterization MS in QC : Multi Attribute Method Global BioPharma Summit The world leader in serving science A Complex Problem: Drug Safety and Quality Safety Is the product safe to use?
More informationA Complete Workflow Solution for Intact Monoclonal Antibody Characterization Using a New High-Performance Benchtop Quadrupole- Orbitrap LC-MS/MS
A Complete Workflow Solution for Intact Monoclonal Antibody Characterization Using a New High-Performance Benchtop Quadrupole- Orbitrap LC-MS/MS Zhiqi Hao, 1 Yi Zhang, 1 David Horn, 1 Seema Sharma, 1 Shiaw-Lin
More informationExam MOL3007 Functional Genomics
Faculty of Medicine Department of Cancer Research and Molecular Medicine Exam MOL3007 Functional Genomics Tuesday May 29 th 9.00-13.00 ECTS credits: 7.5 Number of pages (included front-page): 5 Supporting
More informationGene Regulation Solutions. Microarrays and Next-Generation Sequencing
Gene Regulation Solutions Microarrays and Next-Generation Sequencing Gene Regulation Solutions The Microarrays Advantage Microarrays Lead the Industry in: Comprehensive Content SurePrint G3 Human Gene
More informationAccessible answers. Targeted sequencing: accelerating and amplifying answers for oncology research
Accessible answers Targeted sequencing: accelerating and amplifying answers for oncology research Help advance precision medicine Accelerate results with Ion Torrent NGS Life without cancer. This is our
More information[ VION IMS QTOF ] BEYOND RESOLUTION
[ VION IMS QTOF ] BEYOND RESOLUTION THE BENEFITS ARE CLEAR and routinely available, Complex samples give complex data with overlapping spectra and background interferences, making compound identification
More informationGenerating Automated and Efficient LC/MS Peptide Mapping Results with the Biopharmaceutical Platform Solution with UNIFI
with the Biopharmaceutical Platform Solution with UNIFI Vera B. Ivleva, Ying Qing Yu, Scott Berger, and Weibin Chen Waters Corporation, Milford, MA, USA A P P L I C AT ION B E N E F I T S The ability to
More informationThe Integrated Biomedical Sciences Graduate Program
The Integrated Biomedical Sciences Graduate Program at the university of notre dame Cutting-edge biomedical research and training that transcends traditional departmental and disciplinary boundaries to
More informationPractical Tips. : Practical Tips Matrix Science
Practical Tips : Practical Tips 2006 Matrix Science 1 Peak detection Especially critical for Peptide Mass Fingerprints A tryptic digest of an average protein (30 kda) should produce of the order of 50
More informationQuantitative mass spec based proteomics
Quantitative mass spec based proteomics Tuula Nyman Institute of Biotechnology tuula.nyman@helsinki.fi THE PROTEOME The complete protein complement expressed by a genome or by a cell or a tissue type (M.
More informationIntroduction to Bioinformatics and Gene Expression Technologies
Introduction to Bioinformatics and Gene Expression Technologies Utah State University Fall 2017 Statistical Bioinformatics (Biomedical Big Data) Notes 1 1 Vocabulary Gene: hereditary DNA sequence at a
More informationUtilizing novel technology for the analysis of therapeutic antibodies and host cell protein contamination
Utilizing novel technology for the analysis of therapeutic antibodies and host cell protein contamination Kelli Jonakin, Ph.D. Senior Field Application Scientist, Seattle, WA Eric Johansen, Ph.D. Senior
More informationMALDI-TOF Mass Spectrometry Hype or Revolution? Will Probert, Ph.D. Microbial Diseases Laboratory Branch California Department of Public Health
MALDI-TOF Mass Spectrometry Hype or Revolution? Will Probert, Ph.D. Microbial Diseases Laboratory Branch California Department of Public Health Matrix Assisted Laser Desorption Ionization (MALDI) Sample
More informationLeonardo Mariño-Ramírez, PhD NCBI / NLM / NIH. BIOL 7210 A Computational Genomics 2/18/2015
Leonardo Mariño-Ramírez, PhD NCBI / NLM / NIH BIOL 7210 A Computational Genomics 2/18/2015 The $1,000 genome is here! http://www.illumina.com/systems/hiseq-x-sequencing-system.ilmn Bioinformatics bottleneck
More informationRockerBox. Filtering massive Mascot search results at the.dat level
RockerBox Filtering massive Mascot search results at the.dat level Challenges Big experiments High amount of data Large raw and.dat files (> 2GB) How to handle our results?? The 2.2 peptide summary could
More informationProtein De novo Sequencing
Protein De novo Sequencing by Rong Wang A thesis presented to the University of Waterloo in fulfillment of the thesis requirement for the degree of Master of Mathematics in Computer Science Waterloo, Ontario,
More informationUNIFI: an innovative enabling platform Ken Eglinton Nordic User Training, September 2013
UNIFI: an innovative enabling platform Ken Eglinton Nordic User Training, September 2013 2013 Waters Corporation 1 Waters Laboratory Informatics Product Portfolio MassLynx Mass Spectrometry Software NuGenesis
More informationRNA-Sequencing analysis
RNA-Sequencing analysis Markus Kreuz 25. 04. 2012 Institut für Medizinische Informatik, Statistik und Epidemiologie Content: Biological background Overview transcriptomics RNA-Seq RNA-Seq technology Challenges
More informationFast and Efficient Peptide Mapping of a Monoclonal Antibody (mab): UHPLC Performance with Superficially Porous Particles
Fast and Efficient Peptide Mapping of a Monoclonal Antibody (mab): UHPLC Performance with Superficially Porous Particles Application Note Biotherapeutics and Biosimilars Authors James Martosella, Alex
More informationXevo G2-S QTof and TransOmics: A Multi-Omics System for the Differential LC/MS Analysis of Proteins, Metabolites, and Lipids
Xevo G2-S QTof and TransOmics: A Multi-Omics System for the Differential LC/MS Analysis of Proteins, Metabolites, and Lipids Ian Edwards, Jayne Kirk, and Joanne Williams Waters Corporation, Manchester,
More informationProgram overview. SciLifeLab - a short introduction. Advanced Light Microscopy. Affinity Proteomics. Bioinformatics.
Open House Program SciLifeLab Open House in Stockholm November 4, 2015 09:00-16:00 Contact: events@scilifelab.se Program overview 09:00-09:30-10:00-10:30-11:00-11:30-12:00-12:30-13:00-13:30-14:00-14:30-15:00-15:30-09:30
More informationProteins. Patrick Boyce Biopharmaceutical Marketing Manager Waters Corporation 1
Routine Characterization of mabs and Other Proteins Patrick Boyce Biopharmaceutical Marketing Manager Europe and India 2011 Waters Corporation 1 Agenda Why? What scientific challenges? Technology Example
More informationQIAGEN s NGS Solutions for Biomarkers NGS & Bioinformatics team QIAGEN (Suzhou) Translational Medicine Co.,Ltd
QIAGEN s NGS Solutions for Biomarkers NGS & Bioinformatics team QIAGEN (Suzhou) Translational Medicine Co.,Ltd 1 Our current NGS & Bioinformatics Platform 2 Our NGS workflow and applications 3 QIAGEN s
More informationThe Advent of Subsequent Entry Biologics: Biosimilars Canada. The Advent of Subsequent Entry Biologics. Biosimilars in Canada
The Advent of Subsequent Entry Biologics: Biosimilars Canada The Advent of Subsequent Entry Biologics Biosimilars in Canada Spring 2014 Author: Patrick J. Mott, MSc. Editors Allison Hillier BA, MA Heather
More informationSynthetic Biology. Sustainable Energy. Therapeutics Industrial Enzymes. Agriculture. Accelerating Discoveries, Expanding Possibilities. Design.
Synthetic Biology Accelerating Discoveries, Expanding Possibilities Sustainable Energy Therapeutics Industrial Enzymes Agriculture Design Build Generate Solutions to Advance Synthetic Biology Research
More informationTRANSFORMING GLOBAL GENETIC DATA INTO MEDICAL DECISIONS
TRANSFORMING GLOBAL GENETIC DATA INTO MEDICAL DECISIONS THE DOORS ARE OPEN: FEEL FREE TO COME IN CENTOGENE UNLOCKS THE POWER OF GENETIC INSIGHTS TO IMPROVE THE QUALITY OF LIFE OF PATIENTS WITH GENETIC
More informationAlgorithm for Matching Additional Spectra
Improved Methods for Comprehensive Sample Analysis Using Protein Prospector Peter R. Baker 1, Katalin F. Medzihradszky 1 and Alma L. Burlingame 1 1 Mass Spectrometry Facility, Dept. of Pharmaceutical Chemistry,
More informationNext Generation Sequencing. Target Enrichment
Next Generation Sequencing Target Enrichment Next Generation Sequencing Your Partner in Every Step from Sample to Data NGS: Revolutionizing Genetic Analysis with Single-Molecule Resolution Next generation
More informationDetecting Challenging Post Translational Modifications (PTMs) using CESI-MS
Detecting Challenging Post Translational Modifications (PTMs) using CESI-MS Sensitive workflow for the detection of PTMs in biological samples from small sample volumes Klaus Faserl, 1 Bettina Sarg, 1
More informationOutline and learning objectives. From Proteomics to Systems Biology. Integration of omics - information
From to Systems Biology Outline and learning objectives Omics science provides global analysis tools to study entire systems How to obtain omics - What can we learn Limitations Integration of omics - In-class
More informationFast Multi-blind Modification Search through Tandem Mass Spectrometry* S
Fast Multi-blind Modification Search through Tandem Mass Spectrometry* S Seungjin Na, Nuno Bandeira **, and Eunok Paek Technological Innovation and Resources 2012 by The American Society for Biochemistry
More informationChapter 10: Gene Expression and Regulation
Chapter 10: Gene Expression and Regulation Fact 1: DNA contains information but is unable to carry out actions Fact 2: Proteins are the workhorses but contain no information THUS Information in DNA must
More informationRapid Extraction of Therapeutic Oligonucleotides from Primary Tissues for LC/ MS Analysis Using Clarity OTX, an Oligonucleotide Extraction Cartridge
Rapid Extraction of Therapeutic Oligonucleotides from Primary Tissues for LC/ MS Analysis Using Clarity OTX, an Oligonucleotide Extraction Cartridge G. Scott*, H. Gaus #, B. Rivera*, and M. McGinley* *Phenomenex,
More informationPrecise Characterization of Intact Monoclonal Antibodies by the Agilent 6545XT AdvanceBio LC/Q-TOF
Precise Characterization of Intact Monoclonal Antibodies by the Agilent 6545XT AdvanceBio LC/Q-TOF Application Note Author David L. Wong Agilent Technologies, Inc. Santa Clara, CA, USA Introduction Monoclonal
More informationHigh-throughput scale. Desktop simplicity.
High-throughput scale. Desktop simplicity. NextSeq 500 System. Flexible power. Speed and simplicity for whole-genome, exome, and transcriptome sequencing. Harness the power of next-generation sequencing.
More informationI nternet Resources for Bioinformatics Data and Tools
~i;;;;;;;'s :.. ~,;;%.: ;!,;s163 ~. s :s163:: ~s ;'.:'. 3;3 ~,: S;I:;~.3;3'/////, IS~I'//. i: ~s '/, Z I;~;I; :;;; :;I~Z;I~,;'//.;;;;;I'/,;:, :;:;/,;'L;;;~;'~;~,::,:, Z'LZ:..;;',;';4...;,;',~/,~:...;/,;:'.::.
More informationInnovative Pharmaceutical Solutions for Discovery Chemistry, Biology and cgmp Manufacturing
Innovative Pharmaceutical Solutions for Discovery Chemistry, Biology and cgmp Manufacturing Overview Serving biotech and pharma community since 1998 Proven track record of advancing small molecules from
More informationDISCOVERY AND VALIDATION OF TARGETS AND BIOMARKERS BY MASS SPECTROMETRY-BASED PROTEOMICS. September, 2011
DISCOVERY AND VALIDATION OF TARGETS AND BIOMARKERS BY MASS SPECTROMETRY-BASED PROTEOMICS September, 2011 1 CAPRION PROTEOMICS Leading proteomics-based service provider - Biomarker and target discovery
More informationThe Five Key Elements of a Successful Metabolomics Study
The Five Key Elements of a Successful Metabolomics Study Metabolomics: Completing the Biological Picture Metabolomics is offering new insights into systems biology, empowering biomarker discovery, and
More informationSNP GENOTYPING WITH iplex REAGENTS AND THE MASSARRAY SYSTEM
SNP GENOTYPING Accurate, sensitive, flexible MassARRAY System SNP GENOTYPING WITH iplex REAGENTS AND THE MASSARRAY SYSTEM Biomarker validation Routine genetic testing Somatic mutation profiling Up to 400
More informationPLRP-S Polymeric Reversed-Phase Column for LC/MS Separation of mabs and ADC
PLRP-S Polymeric Reversed-Phase Column for LC/MS Separation of mabs and ADC Analysis of Intact and Fragmented mabs and ADC Application Note Biotherapeutics and Biologics Author Suresh Babu C.V. Agilent
More informationGene Identification in silico
Gene Identification in silico Nita Parekh, IIIT Hyderabad Presented at National Seminar on Bioinformatics and Functional Genomics, at Bioinformatics centre, Pondicherry University, Feb 15 17, 2006. Introduction
More informationThermo Scientific TSQ Quantum Access MAX
www.ietltd.com Proudly serving laboratories worldwide since 1979 CALL +847.913.0777 for Refurbished & Certified Lab Equipment Thermo Scientific TSQ Quantum Access MAX UNSURPASSED PRICE TO PERFORMANCE Superior
More informationAnalysis of Biological Sequences SPH
Analysis of Biological Sequences SPH 140.638 swheelan@jhmi.edu nuts and bolts meet Tuesdays & Thursdays, 3:30-4:50 no exam; grade derived from 3-4 homework assignments plus a final project (open book,
More information