PeptideShaker enables reanalysis of mass spectrometryderived. proteomics datasets

Size: px
Start display at page:

Download "PeptideShaker enables reanalysis of mass spectrometryderived. proteomics datasets"

Transcription

1 PeptideShaker enables reanalysis of mass spectrometryderived proteomics datasets Marc Vaudel 1,2, Julia M. Burkhart 1, René P. Zahedi 1, Eystein Oveland 2,3,4, Frode S. Berven 2,4,5, Albert Sickmann 1, Lennart Martens 6,7,* and Harald Barsnes 2,8 1 Leibniz-Institut für Analytische Wissenschaften ISAS e.v., Dortmund, Germany 2 Proteomics Unit, Department of Biomedicine, University of Bergen, Bergen, Norway 3 Department of Clinical Medicine, University of Bergen, Bergen, Norway 4 The KG Jebsen Centre for MS-research, Department of Clinical Medicine, University of Bergen, Bergen, Norway 5 The Norwegian Multiple Sclerosis Competence Centre, Department of Neurology, Haukeland University Hospital, Bergen, Norway 6 Department of Medical Protein Research, VIB, Ghent, Belgium 7 Department of Biochemistry, Ghent University, Ghent, Belgium 8 Computational Biology Unit, University of Bergen, Norway * Correspondence: Prof. Dr. Lennart Martens A. Baertsoenkaai 3 B-9000 Gent Phone: Fax: lennart.martens@ugent.be 1

2 Table of Contents Introduction Installation and Hardware Requirements Creating a New Project PRIDE Reanalysis File Import Results Navigation Validating Proteins, Peptides and PSMs PTM Scoring Reports, Follow Up Analyses and Submissions to PRIDE Command Line Use Documentation, Help, Support and Updates References

3 Introduction The interpretation of protein identification results from identification algorithms can be conducted in several software environments 1, freeware or commercial, some delivered with the instrument by the vendor. MaxQuant 2, the TransProteomic pipeline 3-4, OpenMS 5-7, and ID Picker 8 are examples of the efforts of the scientific community to provide free solutions for protein identification. Here, we present PeptideShaker, an interface to assemble and inspect results from tandem mass spectra identification algorithms. PeptideShaker allows intuitive interpretation of peptide and protein mass spectrometry based identification results. In combination with SearchGUI 9, a user-friendly graphical user interface to conduct proteomics searches, it provides a full identification solution for both locally generated datasets and publicly available data in PRIDE 10 via ProteomeXchange 11. For every identified protein, peptide and spectrum, PeptideShaker delivers useful information like identification confidence, modification site(s) and external information via resources like Ensembl 12 or PDB 13. All results can be exported in various reports, either for further follow up analyses or for submission to PRIDE. Notably, the use of PeptideShaker does not require extensive knowledge in bioinformatics. PeptideShaker is fully documented and comes with contextual help and extended tutorials. By detailing every step from data loading to results display, the present supplementary material details how the peptides and proteins are inferred from search engine results and how these are scored, displayed and connected with rich resources for protein identification. Unless stated otherwise, the data and illustration here were obtained on the PeptideShaker example dataset, a standard measurement of HeLa cell lysate as detailed elsewhere 14 and freely available in the ProteomeXchange 15 consortium via the PRIDE 10 partner repository under the accession PXD If you have any questions about PeptideShaker, please do not hesitate to contact the developers at the PeptideShaker Google Group: 3

4 1.0 Installation and Hardware Requirements PeptideShaker is an open source project developed in Java under the very permissive Apache2 open source license. The complete source code, cross-platform executables and additional information is available at The software does not require installation beyond unzipping and then double clicking the downloaded file, and works on Windows, Linux and Mac platforms. (For the first execution on newer Mac platforms, controlclick on the file icon and then select "Open." This will provide the option to run the file regardless of its (unidentified) source. Help on usage on Mac can be found on the PeptideShaker website.) The only prerequisite to run PeptideShaker is that Java is installed. However, due to the large size of modern proteomics datasets and databases, the software performance will depend on the amount of available memory, the more memory the better the performance. When creating a new project, it is recommended to provide at least 4 GB of memory for smaller projects (<100,000 spectra, <100,000 protein sequences), while bigger projects will be more memory demanding. Working with less than 4 GB of memory is supported; import time will however be substantially extended. Memory settings can be edited from the interface directly under Edit > Java Options or via the Welcome Dialog under Settings and Help. Note that Java 32-bit does not support high memory settings. It is therefore strongly recommended to work on 64-bit machines the standard for all recent computers. Java 32-bit is often installed by default on 64- bit machines, and it is then preferable to instead install Java 64-bit. Help with Java installation and usage can be found here: When not all information can be loaded into memory, the tool will interact with locally stored data. This process will be substantially sped up on SSD discs. In general, read/write operations are the main speed limiting steps; it is thus advised to operate on SSD discs. Although the creation of a PeptideShaker project is computationally demanding, its opening and viewing does not require equally powerful hardware capabilities. It is thus possible for mass spectrometry labs to create the project on a high performance machine, save it and share it to end users who can then inspect it on standard desktop computers. It is also possible to create projects automatically on servers and clusters via the command line options of the tool (see Command Line Use). 4

5 PeptideShaker is started by double clicking the jar file in the unzipped downloaded file. The PeptideShaker Welcome Dialog is then displayed, see Supplementary Figure 1. From this dialog it is possible to: (1) Create a New Project; (2) Open a Saved Project; (3) Start a Search using SearchGUI; (4) Reshake a PRIDE Project, i.e., reprocess a dataset in PRIDE; (5) Open an Example Project; and (6) See the Getting Started Mini Tutorial. Supplementary Figure 1: PeptideShaker Welcome Dialog. This dialog is displayed when starting the tool and can be used to start the processing of a dataset (see text for details). 5

6 2.0 - Creating a New Project After selecting New Project, the user sets up the new project as displayed Supplementary Figure 2. Supplementary Figure 2: New Project Dialog were the user defines the new PeptideShaker project. The setup includes: (i) annotation of the project very useful for later reuse and sharing; (ii) selection of the input files; and (iii) editing of processing parameters. The processing parameters include: (a) the settings used for the search; (b) the import filters; and (c) the import preferences. These parameters are important as they impact the protein and peptide result set and can thus not be modified after the project has been created. The search settings are the settings used for the search, and for SearchGUI results these are automatically loaded when selecting the search result files. Using import filters allow the removal of Peptide to Spectrum Matches (PSMs). Given that identification quality is known to depend on sequence length 16, it is possible to filter out short/long peptides. Filters on precursor mass deviation as suggested by Beausoleil et al. 17 is also supported. Finally, as some modifications are not recognized by all search engines, e.g., PTMs located on protein termini or PTMs targeting motifs of several amino acids. It is possible to use a comprehensive search targeting all termini or a single amino acid and refine the results a posteriori by using only the modifications of interest. This is achieved by filtering the PTMs not matching the PeptideShaker PTM definition and is activated by clicking Exclude Unknown PTMs. 6

7 The processing parameters include initial False Discovery Rate (FDR) validation thresholds which can be altered after the project has been created (see Validating Proteins, Peptides and PSMs) and PTM scoring options (see PTM Scoring). Clicking Load Data! starts the processing of the files and the creation of the PeptideShaker project. 7

8 3.0 - PRIDE Reanalysis The PRIDE Reshake option allows any scientist to easily reprocess datasets in PRIDE without requiring advanced bioinformatics skills. After clicking the PRIDE Reshake button in the Welcome Dialog, the user can choose to reanalyze either public or private datasets. (Private datasets require the input of username and password details.) In both cases the user will see the list of available projects with the associated assays and files as shown in Supplementary Figure 3. Supplementary Figure 3: PRIDE data selection. At the top the project to reanalyze is selected. The assays and associated data files are shown in the tables below. The user can search for specific projects using the advanced Find feature as shown in Supplementary Figure 4. 8

9 When the user has located the project to reanalyze, the "Reshake PRIDE Data" button is clicked. This will open the Reshake Settings Dialog shown in Supplementary Figure 5, where the user can customize the properties of the reanalysis. 9

10 After clicking the "Start the Reshaking!" button, PeptideShaker starts downloading the data file(s) and extract the spectra and search settings. (Missing information can be manually added later in SearchGUI.) PeptideShaker then starts SearchGUI where the user can edit the search parameters as displayed in Supplementary Figure 6. Supplementary Figure 4: It is possible to edit the inferred parameters to accurately reproduce the original search or completely change the context of the dataset. The search can now be started and the import in PeptideShaker is directly triggered. 10

11 4.0 - File Import Search engines assign peptide candidates derived from the protein database to every spectrum, providing a score illustrating the quality of the match: generally an e-value. PeptideShaker supports different types of import formats: X!Tandem 18 t.xml files implemented in the BioML format 19, OMSSA 20 omx files, MS Amanda 21 csv files, Mascot 22 dat files, and the PSI mzidentml 23 format. The latter, notably, allows importing search results from MS-GF+ 24, and from virtually any identification algorithm if the minimal information required for import in PeptideShaker is present in the file. Details on mzidentml and Mascot files requirements can be found on the PeptideShaker website. First, the search engine results are parsed using open source Java parsers that are all published and actively maintained The results are then loaded into an open source search engine independent structure 29, allowing PeptideShaker to manipulate, save and open this large amount of information. PeptideShaker takes advantage of the target/decoy strategy 16 to convert the scores of the search engines into Posterior Error Probability (PEP) values as commonly done in proteomics 30. Note that peptides mapping to both target and decoy proteins are excluded from the import. For every peptide candidate, the product of the search engine PEPs is given as score and the best, i.e., the lowest, scoring peptide is picked as the best candidate. Given that search engines are known to encounter difficulties at localizing PTMs, a peptide is here defined by its amino acid sequence and the number of PTMs without accounting for their location. Amino acids with a mass difference lower than the fragment ion tolerance are considered as undistinguishable. If two peptide candidates score equally, they are discriminated by: (i) the occurrence of their parent protein in the dataset; (ii) the number of search engines supporting the peptide; (iii) the number of fragment ions annotated in the spectrum; and (iv) the precursor mass error. In order to improve the identification rate, the list of PSMs resulting from the search engine combination is separated into groups according to the run (i.e. spectrum file) and identified charge. Groups are created for every spectrum file and for charges from the lowest to the highest only if the group size does not compromise statistical accuracy 31 : a group size is considered sufficient if more than 100 target hits are present before the first decoy hit and if the estimated PEP resolution is lower than 1%. A PEP is then estimated for every PSM based on the target and decoy hit distributions in the charge specific group. From these PSMs, a list of peptides is established. When two peptides differ only in the PTM localization, they are considered as separate peptide identification entities only if the PTMs are 11

12 confidently localized (see PTM Scoring). A peptide score, PSM PEPs, is attached to every peptide:, the product of the (1) Where is the estimated PEP of the i th PSM identifying the considered peptide. The peptides are then grouped according to their modification status, again, only when the size of the groups allows it. The size criterion is the same as for the PSM groups: more than 100 target hits before the first decoy hit and the estimated PEP resolution lower than 1%. A peptide level PEP is estimated based on the target and decoy hit distributions in the modification specific group. Using the FASTA file provided by the user, every peptide is mapped to the parent protein sequences. Here again, indistinguishable amino acids are considered as such based on the fragment ion accuracy. Moreover, when a protein sequence contains X's the mapping will be ignored if X's make up more than 25% of the peptide sequence. Subsequently, protein ambiguity groups are created based on peptide unicity as introduced by Nesvizhskii 32, such that peptides are unique to a group. PeptideShaker scores the protein groups using the product of the estimated peptide PEPs: (2) Where is the estimated PEP of the i th peptide identifying the considered protein group. When an ambiguity group presents a subset (for example group Proteins A or B or C is identified as well as group Protein A or B ), the complete group ( Proteins A or B or C in example) is considered as unlikely and ignored if: (i) the additional proteins in the complete group (Protein C in example) are only supported by non-enzymatic peptides (when searching with an enzyme); (ii) are uncharacterized proteins or proteins with lower evidence (UniProt accessions only); or (iii) the subset ( Proteins A or B in example) scores better and is hence more likely to be found. In these cases, the peptides are assigned to the subset. Finally, the PEP of every protein group is estimated by comparing the target and decoy distributions; a representative protein is selected for every group based on the peptide enzymaticity (when searching with an enzyme) and the protein evidence (UniProt accessions only) and description; and the peptides of complete groups ( Proteins A or B or C in example) are linked to all subsets ( Proteins A or B in example). During the processing of the data, the progress, including tips and warnings, is shown to the user as displayed in Supplementary Figure 7. 12

13 Supplementary Figure 5: The Waiting Dialog displays progress of the project creation process and also displays tips and warnings. 13

14 5.0 - Results Navigation When a project has been created, PeptideShaker s main interface displays the identification results in a clear and intuitive fashion, allowing the user to easily navigate even large datasets. The interface is divided into nine interconnected tabs. By default, the results are displayed in the Overview tab as shown in Supplementary Figure 8. Supplementary Figure 6: The interface of PeptideShaker consists of nine interconnected tabs corresponding to different use cases. The Overview tab displays extended information of the identification matches and allows the user to intuitively navigate the identification results. At the top of the Overview tab, a table displays detailed information about the protein ambiguity groups identified. The Protein Inference (PI) informs the user about the protein inference status of the protein ambiguity group using different colors: (i) green for single proteins; (ii) yellow for groups of related proteins; (iii) orange for groups of related and unrelated proteins; and (iv) red for groups of unrelated proteins only. Proteins are considered as unrelated if their associated genes (UniProt accessions only) or descriptions differ. When clicking the colored rectangle, the user can inspect the protein inference status of the protein group as displayed Supplementary Figure 9. 14

15 Supplementary Figure 7: The Protein Inference Dialog allows the user to inspect the protein ambiguity groups, here consisting of 53 proteins considered as related by PeptideShaker (Histocompatibility antigens). The first table displays the proteins matched with information about the gene, chromosome, protein evidence and peptide enzymaticity. The dialog also displays eventual unique hits which can be related to this group as here Q29960 which was found with a very low score, and other protein groups related to this group. Note that the user can alter both the protein group label and the representative protein. The three next columns in the Overview protein table provide information about the selected protein representative: protein accession, description and chromosome number. Chromosome number is available when the species is selected and is obtained from Ensembl 12. The species can be selected when creating the project or can be set later via the Edit menu. Clicking the chromosome number provides additional information about the gene associated by UniProt to the representative protein of the protein ambiguity group: Ensembl Gene ID, Gene Name, Chromosome, and GO annotation as displayed Supplementary Figure

16 Supplementary Figure 8: When clicking the chromosome number in the protein table, gene information about the selected protein is extracted from Ensembl and displayed to the user. Next, the protein coverage is displayed. The detected coverage (colored) is compared to the expected observable coverage (grey). The latter is estimated based on the size distribution of the identified peptides and the maximal size allowed for enzymatic peptides. The two following columns represent the number of identified and validated peptides and PSMs for the protein group (see Validating Proteins, Peptides and PSMs). Note how PeptideShaker takes full advantage of so-called sparklines ( wiki/sparklines) to make the coverage and the results of the statistical analysis easier to interpret. Sparklines are used throughout the PeptideShaker tables using our open source JSparklines library ( greatly enhancing the results inspection. Subsequently, a spectrum counting index is displayed. Although PeptideShaker is dedicated to identification, it does provide spectrum counting metrics that allow for a rough estimation of protein abundances directly from identification results 33. PeptideShaker comes with a version of the empai index 34 and an improved version of the NSAF index 35, chosen as the default option for its accuracy 36. NSAF is a simple and efficient method where the number of spectra for a given protein is normalized by the protein length : (3) 16

17 It should be noted that a major issue with most spectrum counting indexes is that they do not take into account protein inference issues and cleavage efficiency. PeptideShaker thus implements an improved version of the NSAF index, where the contribution of the i th validated PSM is weighted by a protein inference coefficient where is the number of protein ambiguity groups where the matched peptide is included. Note that if a peptide is redundant in the sequence of a representative protein, is increased accordingly. Moreover, the observable length of the protein, as used for the observable coverage,, is used, thus discarding all domains of the sequence which cannot generate detectable peptides: (4) Towards the right, the molecular weight (MW) of the representative protein is shown, and finally the confidence attached to the protein group and its validation status is displayed. When a protein group is selected, the peptides mapping to the selected group are displayed. As for the proteins, the protein inference status of the peptide is color coded and detailed information is accessible by clicking the colored rectangle. The peptide sequence, followed by the peptide's location in the representative protein of the protein group is shown next. Note how the graphical representation of the peptide localization in the protein sequence allows for intuitive interpretation and that PTMs are color coded in the sequence. The use of white font on a colored background indicates a confident PTM localization, while a colored font on a white background indicates a non-confident PTM localization. The PTM color coding can be edited in the search parameters. Finally, the results of the target/decoy statistical processing are displayed as in the protein table. Similarly, when a peptide is selected, the PSMs mapping to the peptide are displayed. The colored coded column (SE) shows the agreement of the search engines for the given spectrum, and clicking the colored rectangles will show the details for each search engine for the given spectrum. Subsequently, detailed information about the match is displayed: sequence with color coded PTMs, charge, precursor mass error, confidence, and validation status. The spectrum corresponding to the selected PSM is displayed at the bottom right with user customizable fragment ion annotation. The three spectrum sub plots above the spectrum make it easier to assess the quality of the match: (i) the intensity of every fragment ion is displayed relative to the peptide sequence for forward (blue, down) and rewind (red, up) ions; (ii) a histogram of the intensities of the annotated (green) and the non-annotated (grey) peaks; and (ii) the fragment ion m/z error is plotted against the peak m/z for forward (blue) and rewind (red) ions. The latter allows for straightforward detection of calibration issues. The user can 17

18 fully customize the spectrum annotation and display like color and appearance of the peaks, benefit from the advanced annotation as shown in Supplementary Figure 11, and export the plots in publication-grade quality. Supplementary Figure 9: A spectrum as displayed in the Overview tab. The intensity of every fragment ion is displayed relative to the peptide sequence at the top left for forward (blue, down) and rewind (red, up) ions. In the top middle, a histogram of the intensities of the annotated (green) and the non-annotated (grey) peaks is displayed. And on the top right the fragment ion m/z error is plotted against the peak m/z for forward (blue) and rewind (red) ions. The spectrum is displayed with fully customizable annotation as exemplified here by the overlay of automated de novo sequencing using the selected PSM. PeptideShaker also provides visualizations of multiple PSMs. When selecting different PSMs simultaneously, spectra are displayed in a so-called planetary system view where the x-axis represents the m/z of the spectrum, the y-axis the m/z error and the size of the data point represents the intensity of the peak. This display makes it easy to spot outliers or mass calibration issues as displayed in Supplementary Figure

19 Supplementary Figure 10: The planetary system view allows the comparison of multiple measurements (PSMs) of the same peptide within the same plot. For every fragment ion of every PSM matched in the respective spectrum, the x-axis represents the m/z of the annotated peak, the y-axis the m/z error and the size of the data point represents the intensity of the peak. This display makes it easy to inspect the spectrum reproducibility and detect outliers or mass deviation issues. Here the second PSM in blue seems to deviate from the others, showing a typical ppm error. The annotated peaks can also be visualized in a table as displayed in Supplementary Figure 13. Supplementary Figure 11: For a given PSM, the fragment ion matches can be plotted in a table conveniently showing the m/z values of the detected ions. 19

20 Selecting multiple PSMs shows the reproducibility of the spectrum acquisition displayed in intensities using the summed spectrum intensity for normalization as displayed in Supplementary Figure 14. Supplementary Figure 12: When selecting multiple PSMs, the intensities of the matched peaks can be displayed with error bars indicating the variability of spectrum acquisitions. At the bottom of the Overview panel, the sequence of the representative protein of the selected protein group is displayed, where colors represent the areas of the sequence covered by the experiment, and grey the coverable areas of the sequence according to the chosen enzyme and identified peptide sizes distribution. Green, yellow and red indicate the areas of the sequence covered by confident, doubtful and not validated peptide matches (see Validating Proteins, Peptides and PSMs). As displayed in Supplementary Figure 15, the PTMs are also localized and color coded, and clicking on an area of the sequence can be used to select the given peptide. Supplementary Figure 13: The protein coverage panel displays the representative protein of the selected protein group, as demonstrated here with protein of accession P49588 of the example dataset of PeptideShaker (968 amino acids). Here 17.56% of an expected 91.84% coverage is observed as displayed in color and grey, respectively. 12.6%, 3.2%, and 1.76% of the coverage was achieved using confident, doubtful, and not validated peptide matches (green, yellow and red areas), respectively. The currently selected peptide is displayed in blue and the modifications are color coded (here blue for oxidation of methionine). When clicking in the sequence, as here done between amino acids 880 and 899 the identified peptides can be selected. Here the peptide MHSPQTSAMLFTVDNEAGK was found with and without oxidation of methionine 9. According to the PTM localization scores (see PTM Scoring), the localization of the oxidation is confident as indicated by the colored background. 20

21 The other PeptideShaker tabs cover other use cases with the same focus on intuitive navigability of the identification results: (i) the Spectrum IDs tab (Supplementary Figure 16) allows the user to compare the results of the different search engines; (ii) the Fractions tab (Supplementary Figure 17) displays the contributions of different fractions to the final results; (iii) the Modifications tab (Supplementary Figure 18) makes it possible to browse peptides carrying certain variable modifications and inspect the results of the localization scoring algorithms; (iv) the 3D Structures tab (Supplementary Figure 19) maps the identification results onto 3D structures from PDB; (v) the Annotation tab (Supplementary Figure 20) connects the identification results to various external resources like Reactome 37 or STRING 38 ; (vi) the GO analysis tab (Supplementary Figure 21) displays Gene Ontology statistics of the dataset; (viii) the Validation tab (Supplementary Figure 22) allows the inspection of the target/decoy results and adapting the validation threshold; and (ix) QC Plots tab (Supplementary Figure 23) displays different quality controls metrics on the identified proteins, peptides and PSMs. For further details on the tabs see the figure legends below the figures on the following pages. 21

22 Supplementary Figure 14: The Spectrum IDs tab allows the user to compare the results of the different search engines. The table at the top shows the identification results retained by PeptideShaker for a given spectrum file with extended information on the PSM retained for every spectrum. The center-left panel shows the match retained by PeptideShaker followed by a table listing all matches from all identification algorithms including secondary hits. The confidence a secondary hit would have obtained if retained as first hit is displayed allowing hits comparison accross search engines. The center-left panel displays the spectrum of the selected PSM, it is thus possible to inspect the spectrum annotation of a secondary hit. At the bottom, several plots display identification algorithms performance with respect to the results displayed in PeptideShaker, alowing straightforward control of the performance of all algorithms. From the left to the right: (1) the number of PSMs a given algorithm would provide if used alone, (2) the number of PSMs of the merged dataset which can be ascribed to a single algorithm only, (3) the number of unassigned spectra a given algorithm would lead to if used alone, and (4) the identification yield a given algorithm would lead to if used alone. Note that these metrics are biased toward search engine agreement by the strategy used to select the best match of every spectrum. 22

23 Supplementary Figure 15: If loading mulitple spectrum files, the Fractions tab indicates in which spectrum files the different proteins would be validated if not found in the other files. Various plots are available to visualize the distrubution of each protein across the spectrum files, e.g., the number of peptides per spectrum file as shown here. It should be noted that the plots only give an indication of the protein distrubtion across the spectrum files, mainly for inspecting fractionated data, and should not be used as a comparison of distinct experiments where a more advanced processing is required

24 Supplementary Figure 16: The Modifications tab supports the browsing of the peptides carrying variable modifications. The modification of interest is selected in the top left and the modified peptides of the selected category are displayed in the top right table. When a modified peptide is selected, PeptideShaker looks for any related peptides where related peptides carry other modifications or present different cleavage sites. Here, a phosphorylated peptide is selected, and the same version with no modification and with an oxidation of methionine is found. The confidence in the PTM localization is color coded and the results of the localization scores are displayed on the sequence. More information on the localization is available when clicking in the PTM column or by selecting the desired score tab under the spectrum. 24

25 Supplementary Figure 17: The 3D Structures tab maps the identified peptides onto the 3D structure of the protein from the PDB 13 using Jmol, an open-source Java viewer for chemical structures in 3D ( Identified peptides are shown in green, the selected peptide in blue, and the PTMs are color coded and annotated on the structure. Supplementary Figure 18: The Annotation tab allows querying various external resources with results from PeptideShaker. Using the protein accession number, the external resources can be directly searched as illustrated here with a Reactome pathway and a STRING interation network. 25

26 Supplementary Figure 19: The GO Analysis tab provides Gene Ontology statistics on the validated proteins, highlighting the terms that are significantly more or less frequent in the given dataset compared to the annotation of the species in Ensembl using a hypergeometric test. 26

27 Supplementary Figure 20: The Validation tab allows the user to inspect the target/decoy results for proteins, peptides and PSMs as selected in the top left table. False positive and negative rates are displayed and can be optimized in an intuitive cost/benefit way. Supplementary Figure 21: the QC Plots tab allows the user to quality control several parameters of the protein peptide and PSM identification. Here for example, the distribution of the precursor mass error is displayed, and it is then easy to see the instrument resolution. 27

28 6.0 - Validating Proteins, Peptides and PSMs PeptideShaker takes full advantage of the target/decoy strategy to allow the user to extensively control the quality of the identifications. As already mentioned, the confidence in identification matches is displayed as the complement of the PEP: (5) As an example, 100 matches with a confidence of 90% contain 10 false positives. In proteomics, identification results are typically validated at a given False Discovery Rate (FDR). The FDR indicates the share of false positive matches in the result set: (6) Where represents the count of target false positives and N the number of retained target hits. As an example, a result set of 100 matches with an FDR of 1% will contain only one false positive. The number of false positives can be equivalently estimated via the number of decoy hits or using the expectation value of the PEP. PeptideShaker hence provides two estimations of the FDR: (7) (8) By default the classical estimator of equation (7) is used, the user can switch to the probabilistic FDR in the validation tab to estimate the FDR using the posterior error probability (8). Similarly, the number of false negatives,, can be estimated by integrating the confidence. Complementarily to the FDR, the False Negative Rate (FNR) of the identification process is hence also estimated: (9) Where is the estimated number of true positives in the imported dataset. As an example, when 100 hits are validated at an FNR of 20%, one can expect a total of 125 true positive hits among which 25 were rejected by the threshold: with an FNR threshold of 1%, one covers 99% of the possible true positive identifications loaded in PeptideShaker. 28

29 PeptideShaker allows the user to intuitively edit the validation thresholds in the Validation tab and whether a match is validated or not is shown throughout the display. The validation process, a balance between quality and quantity, between FDR and FNR, is crucial for all downstream analyses. The accuracy of the false positive rate estimations was benchmarked by searching a pyrococcus furiosus dataset against a database consisting of the concatenation of pyrococcus furiosus sequences with the eukaryota complement of the UniProt/SwissProt database 40, downloaded on the 21 st of October 2013, 181,026 (target) sequences, including the reversed version of every sequence as decoy proteins. In this setup, eukaryote sequences (excluding known contaminants) can be considered as false identifications while pyrococcus furiosus sequences can be considered as correct matches, hence allowing the verification of target/decoy derived error rates 41. Peak lists obtained in 41 were searched using OMSSA 20 version 2.1.9, X!Tandem 18 version Sledgehammer ( ), MS Amanda 21 version and MS-GF+ 24 version Beta (v10024) (5/9/2014). The search was conducted using SearchGUI 9 version The identification settings were as follows: Trypsin with a maximum of 2 missed cleavages; 10 ppm as MS1 and 0.5 Da as MS2 tolerances; fixed modifications: Carbamidomethylation of Cys ( Da), variable modifications: Oxidation of Met ( Da), Phosphorylation of Ser, Thr and ( Da). All algorithms specific settings were left to the default of SearchGUI. The mass spectrometry data along with the identification results have been deposited to the ProteomeXchange Consortium 15 via the PRIDE partner repository 10 with the dataset identifier PXD As demonstrated in Supplementary Figures 24 to 26, the error rate estimated by PeptideShaker accurately tracks the actual error rate for PSMs, peptides and proteins. As discussed in the literature 41-43, marginal underestimation of the error rate estimation can be ascribed to the second refinement procedure of X!Tandem. 29

30 Number of Decoy Proteins 2,000 1,800 1,600 1,400 1,200 1, ,000 1,500 2,000 Number of Eukaryota Proteins Proteins y=x Supplementary Figure 22: A pyrococcus furiosus dataset was searched against a concatenation of eukaryota and pyrococcus furiosus sequences. The number of retained decoy proteins is plotted against the number of identified eukaryote proteins at increasing protein score. In this setup, the number of eukaryota proteins indicates the number of false identifications. Number of Decoy Peptides Number of Eukaryota Peptides unmodified peptides modified peptides peptides y=x Supplementary Figure 23: Similarly as Supplementary Figure 24, the number of retained decoy peptides is plotted against the number of identified eukaryote peptides at increasing peptide score. In order to increase the identification rate, peptides were separated into two groups: modified and unmodified peptides. If a category of modified peptides is substantially enriched, a standalone group is automatically created by PeptideShaker (see main text for details). 30

31 3,000 2,500 Number of Decoy PSMs 2,000 1,500 1, PSMs 3+ PSMs 4+ and 5+ PSMs PSMs y=x ,000 1,500 2,000 2,500 3,000 Number of Eukaryota PSMs Supplementary Figure 24: Similarly as Supplementary Figure 25, the number of retained decoy PSMs is plotted against the number of identified eukaryote PSMs at increasing PSM score. In order to increase the identification rate, PSMs were separated into three groups: PSMs identified with a charge of 2+, 3+ and with charge >3+ (see main text for details). In addition to the statistical validation, one generally expects proteins to be identified with at least two confident peptides in large scale proteomic shotgun experiments (note that this does not apply for dataset enriched for specific species like modified peptides). Similarly as Peptizer for PSMs 44, PeptideShaker therefore inspects all the validated matches using quality filters and doubtful matches are flagged by a yellow warning icon throughout the display. Quality filters are applied to all PSMs, peptide and protein levels. Since confident peptides require confident PSMs, and in turn confident proteins require confident peptides, the quality of the identification propagates from PSM to peptide and protein levels providing the user with stringently quality controlled results. When the database or the dataset does not allow for reliable statistical estimation (e.g., when searching small databases or identifying a low number of proteins) the validation status is similarly marked as doubtful. As displayed in Supplementary Figure 27, clicking on the yellow warning icon opens a dialog with details on the validation status and allows the user to set the status of a validated match as confident or doubtful. 31

32 Supplementary Figure 25: In addition to the statistical validation, PeptideShaker inspects all identification matches using quality filters. Doubtful matches are marked with a yellow warning icon. Clicking on the icon opens a dialog allowing the inspection of the validation procedure for this match. Details concerning the database, the target/decoy scoring and results, and the inspection by quality filters. Here, the protein identification was supported by only one confident peptide and only one confident spectrum and was thus marked as doubtful. 32

33 7.0 - PTM Scoring In proteomics, there are two main paradigms for PTM localization scoring 45 : (i) using the identification algorithm score difference between assumptions carrying PTMs at different sites as a proxy to infer the quality of the localization; or (ii) using probabilistic scores to estimate the probability that a site is actually modified, these scores are inferred from the original spectra and are independent from the identification algorithm. PeptideShaker implements the widely used MD-score 46 and its multiple search engine equivalent, the D-score 47. Complementarily, two probabilistic scores are implemented, the A-score 17 and PhosphoRS 48. Although originally designed for phosphorylation only, these scores are estimated for every variable modification which can take different sites in a peptide. Notably, the peak annotation method used for these scores is the same as the one used to annotate the spectra in the interface, thus allowing visual inspection of site determining ions in spectra. During the scoring, neutral losses of the same mass as the PTM itself are not taken into account for spectrum annotation. In our experience, disabling all neutral losses (e.g. H 2O and NH 3) increases the discrimination power of the probabilistic scores. As mentioned in the project creation section, the selection of the probabilistic score is done in the processing parameters dialog. There, it is also possible to disable neutral losses annotation for the scores and set a score threshold. Finally, one can enable an automated threshold which will be calibrated to a 99% agreement with the D-score. Whenever a peptide presents more modification sites than detected modifications and the probabilistic score passes the threshold, the modification is marked as Confident. It is labeled as Doubtful if it does not pass the threshold and as Random if different modification sites score equally. If no probabilistic score is calculated, a D-score of 95% is used as the threshold. Note that the A-score was only defined for singly modified peptides. The A-score was also established for spectra searched with a tolerance of ±0.5 m/z thus using a subdivision in windows of 100 m/z: the window size equals 100 times the fragment mass resolution. In our implementation, we kept the window size and tolerance to this original ratio. As a result, for data searched with a tolerance of 0.02 m/z the spectrum will be subdivided into windows of 4 m/z and not 100 m/z as done in PhosphoRS 48. PhosphoRS was implemented according to its original publication

34 8.0 - Reports, Follow Up Analyses and Submissions to PRIDE All the generated data can be exported to text files which can subsequently be imported into external tools like Excel or Perseus ( Contextual export options are additionally available from most tables or displays. Identification details can also be exported under the Export > Identification Features menu, where the user can select information about protein, peptide, PSM or search engine specific result to be exported to a text file. A phosphorylation oriented summary is also available. Finally, fully customizable reports can export virtually any information from PeptideShaker. Five reports are available by default: (i) Certificate Analysis: all search and processing parameters with statistics on the dataset a crucial feature for service providers and publications; (ii) Default PSM Report: all identification information at the PSM level; (iii) Default Peptide Report: all identification information at the peptide level; (iv) Default Protein Report: all identification information at the protein level; and (v) Default Hierarchical Report: all identification results presented in a hierarchical manner: protein > peptide > PSM. Note that the user can create and customize his/her own reports as displayed in Supplementary Figure 28. Documentation with description of the report content can also be exported for every report. 34

35 Supplementary Figure 26: In the Report Dialog the user can select the identification features to export. These include Annotation Settings, Input Filters, Proteins, Peptides, PSMs, PTMs, Fragment Ion Information, Project Details, Search Parameters, Spectrum Counting Settings and Target/Decoy Validation Summary. From each of these category, the user can select the wanted elements as displayed here for proteins - ranging from high level information (accession, chromosome, sequence coverage, PTM mapping, etc.) to detailed mass spectrometry results, like detected fragment ion m/z error. PeptideShaker also offers various exports for post processing of the identifications, as displayed in Supplementary Figure 29. These are available under the Export > Follow Up Analysis menu. 35

36 Supplementary Figure 27: The Follow Up Analysis options in PeptideShaker include: (i) export of the spectra of non-validated matches or recalibrated spectra (at the MS1 and/or MS2 level) of the validated matches; (ii) export of accessions and sequences of validated or not validated proteins; (iii) export to the popular Progenesis LC-MS label free quantification software; (iv) export to graph databases like Cytoscape 49 ; (v) export of inclusion/exclusion lists for various instruments; and (vi) export of libraries in the SWATH format as specified by the manufacturer. Notably, the recalibration feature can be used when calibration issues are detected (as mentioned in the Results Navigation section) as for instance often encountered on Time Of Flight (TOF) instruments. Also note that the export to graph databases provides a unique intuitive view on complex datasets with a graphical approach to the protein inference problem as displayed in Supplementary Figure

37 Supplementary Figure 28: By supporting export to graph database formats like Cytoscape, PeptideShaker allows for an intuitive visalization of protein inference problems, here represented by two examples from the example dataset - proteins in red, peptides in blue. On the left, an ideal case where different peptides map uniquely to a single protein accession. On the right, individual proteins are supported both by unique and shared peptides. 37

38 PeptideShaker offers the possibility to export a summary of the methods used for identification under the Export > Methods Section menu. A text is automatically generated and can serve as a basis for inclusion in the methods section of manuscripts as illustrated in Supplementary Figure 31 (see section Validating Proteins, Peptides and PSMs for an example of application). Supplementary Figure 29: PeptideShaker generates automatically a draft of the methods used for protein identification. This text can help for manuscript writing. 38

39 The PeptideShaker project can be exported in various forms via the Export > PeptideShaker Project As menu: (i) Zip File: a zip file containing the saved project and all related files, (ii) mzidentml: the identification results in the PSI 50 mzidentml 23 format, and (iii) PRIDE XML: the peak lists and identification results in the standard PRIDE XML format. When exporting the entire project, the zip file can be shared and opened in the interface upon unzipping. Both mzidentml and PRIDE XML files allow direct submission to PRIDE via ProteomeXchange ( within a few clicks. When creating an mzidentml file, a dialog allows the annotation of the project as displayed in Supplementary Figure 32. Subsequently, a valid, fully annotated and very close to MIAPE 51 compliant mzidentml file is created. Supplementary Figure 30: The mzidentml Export Dialog makes is easy to annotate and export mzidentml files that can readily be submitted to PRIDE via ProteomeXchange. 39

40 Similarly, when creating a PRIDE XML file, the user is guided through the adding of the required meta data annotation by a user friendly interface as displayed in Supplementary Figure 33. The generated file is comprehensively annotated by information available in PeptideShaker and by user input using controlled vocabulary, facilitated by the Ontology Lookup Service 52. Supplementary Figure 31: The PRIDE Export Dialog makes is easy to create well-annotated PRIDE XML file that can readily be submitted to PRIDE via ProteomeXchange. 40

41 9.0 - Command Line Use PeptideShaker can also be used via the command line and hence run in automated batch mode. Different command line modes are available: (i) PeptideShakerCLI: process identification files and saves the project; (ii) ReportCLI: take a saved project as input and export the results as default reports or as custom reports; (iii) FollowUpCLI: take a saved project as input and export the previously described follow up features; (iv) MzidCLI: takes a saved project as input with the relevant annotation and exports an mzidentml file. Note that all command line options for ReportCLI, FollowUpCLI and MzidCLI can also be used in the PeptideShakerCLI mode. Detailed information about the parameters can be found on the PeptideShaker website ( Notably, the PeptideShaker command line version has already been included in Galaxy 53 by an independent third party ( 41

42 Documentation, Help, Support and Updates Contextual help is available everywhere in the interface in the form of question marks as displayed in Supplementary Figure 34. Supplementary Figure 32: Question marks are present everywhere in the interface triggering contextual help. 42

43 The help links to external resources, publications and additional information on the PeptideShaker website. Beginners can also have a look at our general protein identification tutorials 14 : All these resources are kept up-to-date with the development process of the software. For other questions there is also an active discussion group: peptide-shaker. Creating bug reports is easy via the Bug Report Dialog, Help > Bug Report, as displayed in Supplementary Figure 35. Please use the issue tracker at the PeptideShaker web page to report issues. Supplementary Figure 33: If encountering a problem the user is directed to online help directly from the interface, or a bug report with details can be sent to the developers for faster bug fixing. New versions are regularly released including bug fixes and new features. If an internet connection is available, the user is notified and an auto-update is proposed. Changes are documented for every version on the PeptideShaker website in the Release Note wiki ( and announced on the mailing list. 43

How to view Results with Scaffold. Proteomics Shared Resource

How to view Results with Scaffold. Proteomics Shared Resource How to view Results with Scaffold Proteomics Shared Resource Starting out Download Scaffold from http://www.proteomes oftware.com/proteom e_software_prod_sca ffold_download.html Follow installation instructions

More information

How to view Results with. Proteomics Shared Resource

How to view Results with. Proteomics Shared Resource How to view Results with Scaffold 3.0 Proteomics Shared Resource An overview This document is intended to walk you through Scaffold version 3.0. This is an introductory guide that goes over the basics

More information

Improving Productivity with Applied Biosystems GPS Explorer

Improving Productivity with Applied Biosystems GPS Explorer Product Bulletin TOF MS Improving Productivity with Applied Biosystems GPS Explorer Software Purpose GPS Explorer Software is the application layer software for the Applied Biosystems 4700 Proteomics Discovery

More information

ProteinPilot Report for ProteinPilot Software

ProteinPilot Report for ProteinPilot Software ProteinPilot Report for ProteinPilot Software Detailed Analysis of Protein Identification / Quantitation Results Automatically Sean L Seymour, Christie Hunter SCIEX, USA Powerful mass spectrometers like

More information

Mass Spectrometry Based Proteomics Data Analysis Using GalaxyP

Mass Spectrometry Based Proteomics Data Analysis Using GalaxyP Mass Spectrometry Based Proteomics Data Analysis Using GalaxyP GCC 2015 GalaxyP Workshop July 6th, 2015 Norwich, UK Presenters: Tim Griffin, Pratik Jagtap and James Johnson Documentation: Kevin Murray,

More information

PTM Identification and Localization from MS Proteomics Data

PTM Identification and Localization from MS Proteomics Data 05.10.17 PTM Identification and Localization from MS Proteomics Data Marc Vaudel Center for Medical Genetics and Molecular Medicine, Haukeland University Hospital, Bergen, Norway KG Jebsen Center for Diabetes

More information

ProteinPilot Software for Protein Identification and Expression Analysis

ProteinPilot Software for Protein Identification and Expression Analysis ProteinPilot Software for Protein Identification and Expression Analysis Providing expert results for non-experts and experts alike ProteinPilot Software Overview New ProteinPilot Software transforms protein

More information

PEAKS 8 User Manual. PEAKS Team

PEAKS 8 User Manual. PEAKS Team PEAKS 8 User Manual PEAKS Team PEAKS 8 User Manual PEAKS Team Publication date 2016 Table of Contents 1. Overview... 1 1. How to Use This Manual... 1 2. What Is PEAKS?... 1 3. What Is New in PEAKS 8?...

More information

Nature Biotechnology: doi: /nbt Supplementary Figure 1. The workflow of Open-pFind.

Nature Biotechnology: doi: /nbt Supplementary Figure 1. The workflow of Open-pFind. Supplementary Figure 1 The workflow of Open-pFind. The MS data are first preprocessed by pparse, and then the MS/MS data are searched by the open search module. Next, the MS/MS data are re-searched by

More information

False Discovery Rate ProteoRed Multicentre Study 6 Terminology Decoy sequence construction

False Discovery Rate ProteoRed Multicentre Study 6 Terminology Decoy sequence construction False Discovery Rate ProteoRed Multicentre Study 6 Terminology FDR: False Discovery Rate PSM: Peptide- Spectrum Match (a.k.a., a hit) PIT: Percentage of Incorrect Targets In the past years, a number of

More information

ProteinPilot Software Overview

ProteinPilot Software Overview ProteinPilot Software Overview High Quality, In-Depth Protein Identification and Protein Expression Analysis Sean L. Seymour and Christie L. Hunter SCIEX, USA As mass spectrometers for quantitative proteomics

More information

Spectral Counting Approaches and PEAKS

Spectral Counting Approaches and PEAKS Spectral Counting Approaches and PEAKS INBRE Proteomics Workshop, April 5, 2017 Boris Zybailov Department of Biochemistry and Molecular Biology University of Arkansas for Medical Sciences 1. Introduction

More information

Highly Confident Peptide Mapping of Protein Digests Using Agilent LC/Q TOFs

Highly Confident Peptide Mapping of Protein Digests Using Agilent LC/Q TOFs Technical Overview Highly Confident Peptide Mapping of Protein Digests Using Agilent LC/Q TOFs Authors Stephen Madden, Crystal Cody, and Jungkap Park Agilent Technologies, Inc. Santa Clara, California,

More information

Supporting Information for

Supporting Information for Supporting Information for CharmeRT: Boosting peptide identifications by chimeric spectra identification and retention time prediction Viktoria Dorfer*,,a, Sergey Maltsev,b, Stephan Winkler a, Karl Mechtler*,b,c.

More information

High-throughput Proteomic Data Analysis. Suh-Yuen Liang ( 梁素雲 ) NRPGM Core Facilities for Proteomics and Glycomics Academia Sinica Dec.

High-throughput Proteomic Data Analysis. Suh-Yuen Liang ( 梁素雲 ) NRPGM Core Facilities for Proteomics and Glycomics Academia Sinica Dec. High-throughput Proteomic Data Analysis Suh-Yuen Liang ( 梁素雲 ) NRPGM Core Facilities for Proteomics and Glycomics Academia Sinica Dec. 9, 2009 High-throughput Proteomic Data Are Information Rich and Dependent

More information

Protein Reports CPTAC Common Data Analysis Pipeline (CDAP)

Protein Reports CPTAC Common Data Analysis Pipeline (CDAP) Protein Reports CPTAC Common Data Analysis Pipeline (CDAP) v. 4/13/2015 Summary The purpose of this document is to describe the protein reports generated as part of the CPTAC Common Data Analysis Pipeline

More information

Important Information for MCP Authors

Important Information for MCP Authors Guidelines to Authors for Publication of Manuscripts Describing Development and Application of Targeted Mass Spectrometry Measurements of Peptides and Proteins and Submission Checklist The following Guidelines

More information

Supporting Information for Comprehensive HCP Profiling by Targeted and Untargeted Analysis of DIA Mass Spectrometry Data with PRM Verification

Supporting Information for Comprehensive HCP Profiling by Targeted and Untargeted Analysis of DIA Mass Spectrometry Data with PRM Verification Supporting Information for Comprehensive HCP Profiling by Targeted and Untargeted Analysis of DIA Mass Spectrometry Data with PRM Verification Simion Kreimer 1, Yuanwei Gao 1, Somak Ray 1, Mi Jin 2,3,

More information

Cell Signaling Technology

Cell Signaling Technology Cell Signaling Technology PTMScan Direct: Multipathway v2.0 Proteomics Service Group January 14, 2013 PhosphoScan Deliverables Project Overview Methods PTMScan Direct: Multipathway V2.0 (Tables 1,2) Qualitative

More information

Functional analysis using EBI Metagenomics

Functional analysis using EBI Metagenomics Functional analysis using EBI Metagenomics Contents Tutorial information... 2 Tutorial learning objectives... 2 An introduction to functional analysis using EMG... 3 What are protein signatures?... 3 Assigning

More information

ProteomicsBrowser User Guide

ProteomicsBrowser User Guide Vol.1 ProteomicsBrowser User Guide Copyrighted by Yale University Table of Contents PROTEOMICSBROWSER DOWNLOADING AND INSTALLATION...3 PROJECT MANAGEMENT...3 CREATE PROJECT... 3 SAVE PROJECT... 3 OPEN

More information

Current state of proteomics standardization and (C-)HPP data quality guidelines

Current state of proteomics standardization and (C-)HPP data quality guidelines 9/6/2016 3 Department of Pharmacy Analytical Biochemistry Current state of proteomics standardization and (C-)HPP data quality guidelines DTL focus meeting on data integration, standards and fair principles

More information

N- The rank of the specified protein relative to all other proteins in the list of detected proteins.

N- The rank of the specified protein relative to all other proteins in the list of detected proteins. PROTEIN SUMMARY file N- The rank of the specified protein relative to all other proteins in the list of detected proteins. Unused (ProtScore) - A measure of the protein confidence for a detected protein,

More information

Progenesis QI for proteomics HCP Spectral Library User Guide

Progenesis QI for proteomics HCP Spectral Library User Guide Progenesis QI for proteomics HCP Spectral Library User Guide Analysis workflow guidelines for MS E data Contents Introduction... 3 How to use this document... 3 How can I analyse my own runs using Progenesis

More information

PRIDE Inspector: a tool to visualize and validate MS proteomics data

PRIDE Inspector: a tool to visualize and validate MS proteomics data Europe PMC Funders Group Author Manuscript Published in final edited form as: Nat Biotechnol. ; 30(2): 135 137. doi:10.1038/nbt.2112. PRIDE Inspector: a tool to visualize and validate MS proteomics data

More information

Getting to grips with histone modifications.

Getting to grips with histone modifications. Getting to grips with histone modifications. 1 In the last couple of years we have seen an increase in the number of questions to our support email addresses about analyzing histone samples and their modifications.

More information

From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow

From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow Technical Overview Import VCF Introduction Next-generation sequencing (NGS) studies have created unanticipated challenges with

More information

Supporting Information. Scanning Quadrupole Data Independent Acquisition Part A Qualitative and Quantitative Characterization

Supporting Information. Scanning Quadrupole Data Independent Acquisition Part A Qualitative and Quantitative Characterization S Supporting Information Scanning Quadrupole Data Independent Acquisition Part A Qualitative and Quantitative Characterization M. Arthur Moseley, Christopher J. Hughes 2, Praveen R. Juvvadi 3, Erik J.

More information

Spectrum Mill MS Proteomics Workbench. Comprehensive tools for MS proteomics

Spectrum Mill MS Proteomics Workbench. Comprehensive tools for MS proteomics Spectrum Mill MS Proteomics Workbench Comprehensive tools for MS proteomics Meeting the challenge of proteomics data analysis Mass spectrometry is a core technology for proteomics research, but large-scale

More information

Agilent Software Tools for Mass Spectrometry Based Multi-omics Studies

Agilent Software Tools for Mass Spectrometry Based Multi-omics Studies Agilent Software Tools for Mass Spectrometry Based Multi-omics Studies Technical Overview Introduction The central dogma for biological information flow is expressed as a series of chemical conversions

More information

Peptide and protein identification in mass spectrometry based proteomics. Yafeng Zhu, PhD student Karolinska Institutet, Scilifelab

Peptide and protein identification in mass spectrometry based proteomics. Yafeng Zhu, PhD student Karolinska Institutet, Scilifelab Peptide and protein identification in mass spectrometry based proteomics Yafeng Zhu, PhD student Karolinska Institutet, Scilifelab 2017-10-12 Content How is the peptide sequence identified? What is the

More information

Bionano Access v1.1 Release Notes

Bionano Access v1.1 Release Notes Bionano Access v1.1 Release Notes Document Number: 30188 Document Revision: C For Research Use Only. Not for use in diagnostic procedures. Copyright 2017 Bionano Genomics, Inc. All Rights Reserved. Table

More information

DeNovoGUI: an open source graphical user interface for. de novo sequencing of tandem mass spectra.

DeNovoGUI: an open source graphical user interface for. de novo sequencing of tandem mass spectra. DeNovoGUI: an open source graphical user interface for de novo sequencing of tandem mass spectra. Thilo Muth 1,$, Lisa Weilnböck 2,$, Erdmann Rapp 1, Christian G. Huber 2, Lennart Martens 3,4, Marc Vaudel

More information

PRIDE Inspector: a tool to visualize and validate MS proteomics data

PRIDE Inspector: a tool to visualize and validate MS proteomics data ! PRIDE Inspector: a tool to visualize and validate MS proteomics data Rui Wang, Antonio Fabregat, Daniel Ríos, David Ovelleiro, Joseph M. Foster, Richard G. Côté, Johannes Griss, Attila Csordas, Yasset

More information

RockerBox. Filtering massive Mascot search results at the.dat level

RockerBox. Filtering massive Mascot search results at the.dat level RockerBox Filtering massive Mascot search results at the.dat level Challenges Big experiments High amount of data Large raw and.dat files (> 2GB) How to handle our results?? The 2.2 peptide summary could

More information

Application Performance Monitoring Dashboard 7.2

Application Performance Monitoring Dashboard 7.2 User Guide Focused Insights for SAP Solution Manager Document Version: 1.1 2017-07-31 ST-OST 200 SP 1 Typographic Conventions Type Style Example Example EXAMPLE Example Example EXAMPLE Description

More information

SMRT Analysis Barcoding Overview (v6.0.0)

SMRT Analysis Barcoding Overview (v6.0.0) SMRT Analysis Barcoding Overview (v6.0.0) Introduction This document applies to PacBio RS II and Sequel Systems using SMRT Link v6.0.0. Note: For information on earlier versions of SMRT Link, see the document

More information

Center for Mass Spectrometry and Proteomics Phone (612) (612)

Center for Mass Spectrometry and Proteomics Phone (612) (612) Outline Database search types Peptide Mass Fingerprint (PMF) Precursor mass-based Sequence tag Results comparison across programs Manual inspection of results Terminology Mass tolerance MS/MS search FASTA

More information

HID Real-Time PCR Analysis Software v1.3

HID Real-Time PCR Analysis Software v1.3 PRODUCT BULLETIN Human identification HID Real-Time PCR Analysis Software v1.3 Enhanced data analysis for QuantStudio 5 and 7500 Real-Time PCR Systems with Quantifiler kit Applied Biosystems Quantifiler

More information

Oracle Enterprise Manager

Oracle Enterprise Manager Oracle Enterprise Manager System Monitoring Plug-in for Oracle Enterprise Manager Ops Center Guide 12c Release 5 (12.1.0.5.0) E38529-08 April 2016 This document describes how to use the Infrastructure

More information

Spectronaut Pulsar X. Maximize proteome coverage and data completeness by utilizing the power of Hybrid Libraries

Spectronaut Pulsar X. Maximize proteome coverage and data completeness by utilizing the power of Hybrid Libraries Spectronaut Pulsar X Maximize proteome coverage and data completeness by utilizing the power of Hybrid Libraries More versatility in proteomics research Spectronaut has delivered highest performance in

More information

Application Note TOF/MS

Application Note TOF/MS Application Note TOF/MS New Level of Confidence for Protein Identification: Results Dependent Analysis and Peptide Mass Fingerprinting Using the 4700 Proteomics Discovery System Purpose The Applied Biosystems

More information

The Open2Dprot Proteomics Project for n-dimensional Protein Expression Data Analysis. The Open2Dprot Project. Introduction

The Open2Dprot Proteomics Project for n-dimensional Protein Expression Data Analysis. The Open2Dprot Project. Introduction The Open2Dprot Proteomics Project for n-dimensional Protein Expression Data Analysis http://open2dprot.sourceforge.net/ Revised 2-05-2006 * (cf. 2D-LC) Introduction There is a need for integrated proteomics

More information

MIAPE: Mass Spectrometry Informatics

MIAPE: Mass Spectrometry Informatics MIAPE: Mass Spectrometry Informatics Pierre-Alain Binz[1,2]*, Robert Barkovich[3], Ronald C. Beavis[4], David Creasy[5], David M. Horn[6], Randall K. Julian Jr.[7], Sean L. Seymour[8], Chris F. Taylor[9],

More information

What s New in Discovery Studio 2.5.5

What s New in Discovery Studio 2.5.5 What s New in Discovery Studio 2.5.5 Discovery Studio takes modeling and simulations to the next level. It brings together the power of validated science on a customizable platform for drug discovery research.

More information

Supplementary Tables. Note: Open-pFind is embedded as the default open search workflow of the pfind tool. Nature Biotechnology: doi: /nbt.

Supplementary Tables. Note: Open-pFind is embedded as the default open search workflow of the pfind tool. Nature Biotechnology: doi: /nbt. Supplementary Tables Supplementary Table 1. Detailed information for the six datasets used in this study Dataset Mass spectrometer # Raw files # MS2 scans Reference Dong-Ecoli-QE Q Exactive 5 202,452 /

More information

A Highly Accurate Mass Profiling Approach to Protein Biomarker Discovery Using HPLC-Chip/ MS-Enabled ESI-TOF MS

A Highly Accurate Mass Profiling Approach to Protein Biomarker Discovery Using HPLC-Chip/ MS-Enabled ESI-TOF MS Application Note PROTEOMICS METABOLOMICS GENOMICS INFORMATICS GLYILEVALCYSGLUGLNALASERLEUASPARG CYSVALLYSPROLYSPHETYRTHRLEUHISLYS A Highly Accurate Mass Profiling Approach to Protein Biomarker Discovery

More information

Protein Grouping, FDR Analysis and Databases.

Protein Grouping, FDR Analysis and Databases. Protein Grouping, FDR Analysis and Databases. March 15th 2012 Pratik Jagtap The Minnesota http://www.mass.msi.umn.edu/ Protein Grouping, FDR Analysis and Databases Overview. Protein Grouping : Concept

More information

timstof Innovation with Integrity Powered by PASEF TIMS-QTOF MS

timstof Innovation with Integrity Powered by PASEF TIMS-QTOF MS timstof Powered by PASEF Innovation with Integrity TIMS-QTOF MS timstof The new standard for high speed, high sensitivity shotgun proteomics The timstof Pro with PASEF technology delivers revolutionary

More information

COMPUTER RESOURCES II:

COMPUTER RESOURCES II: COMPUTER RESOURCES II: Using the computer to analyze data, using the internet, and accessing online databases Bio 210, Fall 2006 Linda S. Huang, Ph.D. University of Massachusetts Boston In the first computer

More information

Improve the performance of your Proteomics LC-MS facility with Quic

Improve the performance of your Proteomics LC-MS facility with Quic APPLICATION NOTE Improve the performance of your Proteomics LC-MS facility with Quic FEATURES An active queue of per-run QC analyses that checking supports real-time folder monitoring A multitude of readouts

More information

Streamlined and Complete LC-MS Workflow for MAM

Streamlined and Complete LC-MS Workflow for MAM Streamlined and Complete LC-MS Workflow for MAM Multiple Attribute Methodology (MAM) for Biotherapeutic Attribute Monitoring and Purity Testing BV MAM WORKFLOW Define Track Quantify Purity New Peaks Report

More information

Introduction. Benefits of the SWATH Acquisition Workflow for Metabolomics Applications

Introduction. Benefits of the SWATH Acquisition Workflow for Metabolomics Applications SWATH Acquisition Improves Metabolite Coverage over Traditional Data Dependent Techniques for Untargeted Metabolomics A Data Independent Acquisition Technique Employed on the TripleTOF 6600 System Zuzana

More information

systemsdock Operation Manual

systemsdock Operation Manual systemsdock Operation Manual Version 2.0 2016 April systemsdock is being developed by Okinawa Institute of Science and Technology http://www.oist.jp/ Integrated Open Systems Unit http://openbiology.unit.oist.jp/_new/

More information

KnetMiner USER TUTORIAL

KnetMiner USER TUTORIAL KnetMiner USER TUTORIAL Keywan Hassani-Pak ROTHAMSTED RESEARCH 10 NOVEMBER 2017 About KnetMiner KnetMiner, with a silent "K" and standing for Knowledge Network Miner, is a suite of open-source software

More information

Global Value 7. Productivity Suite for GammaVision. Optimizing Gamma Spectrometry Processes through Secure Data Management and Measurement Automation.

Global Value 7. Productivity Suite for GammaVision. Optimizing Gamma Spectrometry Processes through Secure Data Management and Measurement Automation. Productivity Suite for GammaVision Optimizing Gamma Spectrometry Processes through Secure Data Management and Measurement Automation. Global Value was designed to transform the standard GammaVision spectroscopy

More information

Gene List Enrichment Analysis

Gene List Enrichment Analysis Outline Gene List Enrichment Analysis George Bell, Ph.D. BaRC Hot Topics March 16, 2010 Why do enrichment analysis? Main types Selecting or ranking genes Annotation sources Statistics Remaining issues

More information

Modification Site Localization Scoring Integrated into a Search Engine

Modification Site Localization Scoring Integrated into a Search Engine Modification Site Localization Scoring Integrated into a Search Engine Peter R. Baker 1, Jonathan C. Trinidad 1, Katalin F. Medzihradszky 1, Alma L. Burlingame 1 and Robert J. Chalkley 1 1 Mass Spectrometry

More information

Shedding light on black boxes in protein identification

Shedding light on black boxes in protein identification Standardisation & Guidelines Shedding light on black boxes in protein identification Marc Vaudel 1,2, A. Saskia Venne 2, Frode S. Berven 1,3,4, René P. Zahedi 2, Lennart Martens 5,6 and Harald Barsnes

More information

Ensure your Success with Agilent s Biopharma Workflows

Ensure your Success with Agilent s Biopharma Workflows Ensure your Success with Agilent s Biopharma Workflows Steve Madden Software Product Manager Agilent Technologies, Inc. June 5, 2018 Agenda Agilent BioPharma Workflow Platform for LC/MS Intact Proteins

More information

Analyzing an individual sequence in the Sequence Editor

Analyzing an individual sequence in the Sequence Editor BioNumerics Tutorial: Analyzing an individual sequence in the Sequence Editor 1 Aim The Sequence editor window is a convenient tool implemented in BioNumerics to edit and analyze nucleotide and amino acid

More information

MRMPilot Software: Accelerating MRM Assay Development for Targeted Quantitative Proteomics

MRMPilot Software: Accelerating MRM Assay Development for Targeted Quantitative Proteomics Product Bulletin MRMPilot Software: Accelerating MRM Assay Development for Targeted Quantitative Proteomics With Unique QTRAP System Technology Overview Targeted peptide quantitation is a rapidly growing

More information

Fast Track SA 2.0 Drive Thru Timing System. Document: USR-FTSA, Rev. A Effective Date: September 6, 2017

Fast Track SA 2.0 Drive Thru Timing System. Document: USR-FTSA, Rev. A Effective Date: September 6, 2017 ENGAGE INFLUENCE OPTIMIZE User Guide Fast Track SA 2.0 Drive Thru Timing System Effective Date: September 6, 2017 Fast Track SA 2.0 User Guide Page 2 of 33 Table of Contents 1 INTRODUCTION... 4 2 FAST

More information

MassHunter Profinder: Batch Processing Software for High Quality Feature Extraction of Mass Spectrometry Data

MassHunter Profinder: Batch Processing Software for High Quality Feature Extraction of Mass Spectrometry Data MassHunter Profinder: Batch Processing Software for High Quality Feature Extraction of Mass Spectrometry Data Technical Overview Introduction LC/MS metabolomics workflows typically involve the following

More information

Agilent Genomic Workbench 7.0

Agilent Genomic Workbench 7.0 Agilent Genomic Workbench 7.0 Product Overview Guide Agilent Technologies Notices Agilent Technologies, Inc. 2012, 2015 No part of this manual may be reproduced in any form or by any means (including electronic

More information

Bioinformatic Tools. So you acquired data.. But you wanted knowledge. So Now What?

Bioinformatic Tools. So you acquired data.. But you wanted knowledge. So Now What? Bioinformatic Tools So you acquired data.. But you wanted knowledge So Now What? We have a series of questions What the Heck is That Ion? How come my MW does not match? How do I make a DB to search against?

More information

Supplemental Materials

Supplemental Materials Supplemental Materials MSGFDB Parameters For all MSGFDB searches, the following parameters were used: 30 ppm precursor mass tolerance, enable target-decoy search, enzyme specific scoring (for Arg-C, Asp-N,

More information

Quality Control Assessment in Genotyping Console

Quality Control Assessment in Genotyping Console Quality Control Assessment in Genotyping Console Introduction Prior to the release of Genotyping Console (GTC) 2.1, quality control (QC) assessment of the SNP Array 6.0 assay was performed using the Dynamic

More information

Bionano Access 1.0 Software User Guide

Bionano Access 1.0 Software User Guide Bionano Access 1.0 Software User Guide Document Number: 30142 Document Revision: A For Research Use Only. Not for use in diagnostic procedures. Copyright 2017 Bionano Genomics, Inc. All Rights Reserved.

More information

Sales Order Fulfillment

Sales Order Fulfillment Sales Order Fulfillment SIMMS Inventory Management Software 2012 July 28, 2011 Contents Sales Order Fulfillment................ 1 Filling Sales Orders.................. 1 List Sales Orders for Fulfillment...........

More information

FACTORS THAT AFFECT PROTEIN IDENTIFICATION BY MASS SPECTROMETRY HAOFEI TIFFANY WANG. (Under the Direction of Ron Orlando) ABSTRACT

FACTORS THAT AFFECT PROTEIN IDENTIFICATION BY MASS SPECTROMETRY HAOFEI TIFFANY WANG. (Under the Direction of Ron Orlando) ABSTRACT FACTORS THAT AFFECT PROTEIN IDENTIFICATION BY MASS SPECTROMETRY by HAOFEI TIFFANY WANG (Under the Direction of Ron Orlando) ABSTRACT Mass spectrometry combined with database search utilities is a valuable

More information

Sage ERP MAS 90 and Intelligence Release Notes

Sage ERP MAS 90 and Intelligence Release Notes Sage ERP MAS 90 and 200 4.50 Intelligence Release Notes Table of Contents 1 New Reporting Trees... 2 1.1 Reporting Tree Example... 2 1.2 Reporting Trees... 2 1.3 Linking Reporting Tree Units to Distribution

More information

OncoMD User Manual Version 2.6. OncoMD: Cancer Analytics Platform

OncoMD User Manual Version 2.6. OncoMD: Cancer Analytics Platform OncoMD: Cancer Analytics Platform 1 Table of Contents 1. INTRODUCTION... 3 2. OVERVIEW OF ONCOMD... 3 3. ORGANIZATION OF INFORMATION IN ONCOMD... 3 4. GETTING STARTED... 6 4.1 USER AUTHENTICATION... 6

More information

Confident Protein ID using Spectrum Mill Software

Confident Protein ID using Spectrum Mill Software Welcome to our E-Seminar: Confident Protein ID using Spectrum Mill Software Slide 1 Spectrum Mill Informatics Software Start with batches of raw MS data! Sp ec t ru m Mi ll Biologist-friendly answers!

More information

Workflows and Pipelines for NGS analysis: Lessons from proteomics

Workflows and Pipelines for NGS analysis: Lessons from proteomics Workflows and Pipelines for NGS analysis: Lessons from proteomics Conference on Applying NGS in Basic research Health care and Agriculture 11 th Sep 2014 Debasis Dash Where are the protein coding genes

More information

ProMass HR Applications!

ProMass HR Applications! ProMass HR Applications! ProMass HR Features Ø ProMass HR includes features for high resolution data processing. Ø ProMass HR includes the standard ProMass deconvolution algorithm as well as the full Positive

More information

Quantification of Isotope Encoded Proteins in 2D Gels

Quantification of Isotope Encoded Proteins in 2D Gels Quantification of Isotope Encoded Proteins in 2D Gels Using Surface Enhanced Resonance Raman Giselle M. Knudsen 1, Brandon M. Davis 2, Shirshendu K. Deb 1, Yvette Loethen 2, Ravindra Gudihal 1, Pradeep

More information

Bionano Access 1.1 Software User Guide

Bionano Access 1.1 Software User Guide Bionano Access 1.1 Software User Guide Document Number: 30142 Document Revision: B For Research Use Only. Not for use in diagnostic procedures. Copyright 2017 Bionano Genomics, Inc. All Rights Reserved.

More information

Practical Tips. : Practical Tips Matrix Science

Practical Tips. : Practical Tips Matrix Science Practical Tips : Practical Tips 2006 Matrix Science 1 Peak detection Especially critical for Peptide Mass Fingerprints A tryptic digest of an average protein (30 kda) should produce of the order of 50

More information

MetaGO: Predicting Gene Ontology of non-homologous proteins through low-resolution protein structure prediction and protein-protein network mapping

MetaGO: Predicting Gene Ontology of non-homologous proteins through low-resolution protein structure prediction and protein-protein network mapping MetaGO: Predicting Gene Ontology of non-homologous proteins through low-resolution protein structure prediction and protein-protein network mapping Chengxin Zhang, Wei Zheng, Peter L Freddolino, and Yang

More information

SCIEX OS Software One Touch Productivity

SCIEX OS Software One Touch Productivity SCIEX OS Software One Touch Productivity Accuracy, Simplicity and Efficiency What SCIEX OS Software is all About Mass spectrometry and the data challenge Mass spectrometry continues to provide more comprehensive

More information

SpectroDive NEXT GENERATION TARGETED PROTEOMICS. Integration of Ready-Made Panels Improved Workflow for Custom Panels

SpectroDive NEXT GENERATION TARGETED PROTEOMICS. Integration of Ready-Made Panels Improved Workflow for Custom Panels SpectroDive NEXT GENERATION TARGETED PROTEOMICS Integration of Ready-Made Panels Improved Workflow for Custom Panels IMPROVING MULTIPLEXING FOR TARGETED PROTEOMICS Novel targeted proteomics workflows can

More information

static MM_Index snap(mm_index corect, MM_Index ligct, int imatch0, int *moleatoms, i

static MM_Index snap(mm_index corect, MM_Index ligct, int imatch0, int *moleatoms, i BIOLUMINATE static MM_Index snap(mm_index corect, MM_Index ligct, int imatch0, int *moleatoms, int *refcoreatoms){int ncoreat = :vector mappings; PhpCoreMapping mapping; for COMMON(glidelig).

More information

SignalP Plugin USER MANUAL

SignalP Plugin USER MANUAL SignalP Plugin USER MANUAL User manual for SignalP 2.3 Windows, Mac OS X and Linux November 7, 2017 This software is for research purposes only. QIAGEN Aarhus Silkeborgvej 2 Prismet DK-8000 Aarhus C Denmark

More information

Biotherapeutic Non-Reduced Peptide Mapping

Biotherapeutic Non-Reduced Peptide Mapping Biotherapeutic Non-Reduced Peptide Mapping Routine non-reduced peptide mapping of biotherapeutics on the X500B QTOF System Method details for the routine non-reduced peptide mapping of a biotherapeutic

More information

Oracle Knowledge Analytics User Guide

Oracle Knowledge Analytics User Guide Oracle Knowledge Analytics User Guide Working with Oracle Knowledge Analytics Reports Oracle Knowledge Version 8.4.2.2 April, 2012 Oracle, Inc. COPYRIGHT INFORMATION Copyright 2002, 2011, Oracle and/or

More information

About OMICS Group Conferences

About OMICS Group Conferences About OMICS Group OMICS Group International is an amalgamation of Open Access publications and worldwide international science conferences and events. Established in the year 2007 with the sole aim of

More information

Introduction to PtMS IS - the Interactive Scheduler

Introduction to PtMS IS - the Interactive Scheduler Introduction to PtMS IS - the Interactive Scheduler Introduction to PtMS IS - the Interactive Scheduler The PTMS Interactive Scheduler is an automated schedule tool for PtMS for Windows that allows users

More information

Office 365 Timesheet Add-In (User manual)

Office 365 Timesheet Add-In (User manual) Office 365 Timesheet Add-In (User manual) By For further information, visit us at www.ignatiuz.com For support, contact us at support@ignatiuz.com Version:- V 2.5.0.0 Created: November 2nd, 2018 1 Table

More information

Nature Biotechnology: doi: /nbt Supplementary Figure 1

Nature Biotechnology: doi: /nbt Supplementary Figure 1 Supplementary Figure 1 The mass accuracy of fragment ions is important for peptide recovery in wide-tolerance searches. The same data as in Figure 1B was searched with varying fragment ion tolerances (FIT).

More information

TUTORIAL. Revised in Apr 2015

TUTORIAL. Revised in Apr 2015 TUTORIAL Revised in Apr 2015 Contents I. Overview II. Fly prioritizer Function prioritization III. Fly prioritizer Gene prioritization Gene Set Analysis IV. Human prioritizer Human disease prioritization

More information

Strategies for Quantitative Proteomics. Atelier "Protéomique Quantitative" La Grande Motte, France - June 26, 2007

Strategies for Quantitative Proteomics. Atelier Protéomique Quantitative La Grande Motte, France - June 26, 2007 Strategies for Quantitative Proteomics Atelier "Protéomique Quantitative", France - June 26, 2007 Bruno Domon, Ph.D. Institut of Molecular Systems Biology ETH Zurich Zürich, Switzerland OUTLINE Introduction

More information

Index. Sarah Critchley 2018 S. Critchley, Dynamics 365 CE Essentials,

Index. Sarah Critchley 2018 S. Critchley, Dynamics 365 CE Essentials, A Accounts and contacts activities activities tab, 37 advantage of, 32 business, 29 capabilities, 35 case resolution form, 33 configuration, 36 definition, 28 feature, 29 filter, 38 form component, 34

More information

timstof Pro powered by PASEF and the Evosep One for high speed and sensitive shotgun proteomics

timstof Pro powered by PASEF and the Evosep One for high speed and sensitive shotgun proteomics timstof Pro powered by PASEF and the Evosep One for high speed and sensitive shotgun proteomics The Parallel Accumulation Serial Fragmentation (PASEF) method for trapped ion mobility spectrometry (TIMS)

More information

TrackITSystem. Facility Manager Documentation Installation and User Guide

TrackITSystem. Facility Manager Documentation Installation and User Guide TrackITSystem Facility Manager Documentation Installation and User Guide Pages 1 to 7 apply to the install and configuration for admins. The user guide begins on page 8 In a browser, go to the installation

More information

Thermo Scientific Compound Discoverer Software. Integrated solutions. for small molecule research

Thermo Scientific Compound Discoverer Software. Integrated solutions. for small molecule research Thermo Scientific Compound Discoverer Software Integrated solutions for small molecule research Compound Discoverer Software The complete small molecule identification and characterization solution Based

More information

Introduction to Cognos Analytics and Report Navigation Training. IBM Cognos Analytics 11

Introduction to Cognos Analytics and Report Navigation Training. IBM Cognos Analytics 11 Introduction to Cognos Analytics and Report Navigation Training IBM Cognos Analytics 11 Applicable for former IBM Cognos 10 report users who access CBMS Cognos to run and view reports March 2018 This training

More information

UNIFI: The user environment Ken Eglinton Nordic User Training, September 2013

UNIFI: The user environment Ken Eglinton Nordic User Training, September 2013 UNIFI: The user environment Ken Eglinton Nordic User Training, September 2013 2013 Waters Corporation 1 What is UNIFI? Single Platform for Chromatography, Mass Spectrometry, Data Management and Laboratory

More information

Fast, Accurate and Sensitive DNA Variant Detection from Sanger Sequencing:

Fast, Accurate and Sensitive DNA Variant Detection from Sanger Sequencing: Fast, Accurate and Sensitive DNA Variant Detection from Sanger Sequencing: Patented, Anti-Correlation Technology Provides 99.5% Accuracy & Sensitivity to 5% Variant Knowledge Base and External Annotation

More information

AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE

AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE ACCELERATING PROGRESS IS IN OUR GENES AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE GENESPRING GENE EXPRESSION (GX) MASS PROFILER PROFESSIONAL (MPP) PATHWAY ARCHITECT (PA) See Deeper. Reach Further. BIOINFORMATICS

More information