Figure S6. A-J) Annotated UVPD mass spectra for top ten peptides found among the peptides identified by Byonic but not SEQUEST + Percolator.

Similar documents
Unsupervised Identification of Isotope-Labeled Peptides

Characterization of Disulfide Linkages in Proteins by 193 nm Ultraviolet Photodissociation (UVPD) Mass Spectrometry. Supporting Information

Supporting Information. Lysine Propionylation to Boost Proteome Sequence. Coverage and Enable a Silent SILAC Strategy for

Nature Biotechnology: doi: /nbt Supplementary Figure 1

Desorption Electrospray Ionization Coupled with Ultraviolet Photodissociation for Characterization of Phospholipid Isomers in Tissue Sections

Enhancing Sequence Coverage in Proteomics Studies by Using a Combination of Proteolytic Enzymes

Quantitative chromatin proteomics reveals a dynamic histone. post-translational modification landscape that defines asexual

PTM Discovery Method for Automated Identification and Sequencing of Phosphopeptides Using the Q TRAP LC/MS/MS System

Molecular Cell, Volume 46. Supplemental Information

Nature Methods: doi: /nmeth.3177

Automating Mass Spectrometry-Based Quantitative Glycomics using Tandem Mass Tag (TMT) Reagents with SimGlycan

NIH Public Access Author Manuscript J Proteome Res. Author manuscript; available in PMC 2014 July 05.

Protein Reports CPTAC Common Data Analysis Pipeline (CDAP)

SIEVE 2.1 Proteomics Example

Quantification with Proteome Discoverer. Bernard Delanghe

Don t miss a thing on your peptide mapping journey How to get full coverage peptide maps using high resolution accurate mass spectrometry

Supplementary Materials for

New Instruments and Services

Types of Modifications

REDOX PROTEOMICS. Roman Zubarev.

Application of a new capillary HPLC- ICP-MS interface to the identification of selenium-containing proteins in selenized yeast

The distribution of log 2 ratio (H/L) for quantified peptides. cleavage sites in each bin of log 2 ratio of quantified. peptides

Supplementary Figures and Notes for Digestion and depletion of abundant

SimGlycan. A high-throughput glycan and glycopeptide data analysis tool for LC-, MALDI-, ESI- Mass Spectrometry workflows.

Search settings MaxQuant

Phosphorylation of proteins Steve Barnes Feb 19th, 2002 in some cases, proteins are found in a stable, hyperphosphorylated state, e.g.

Supplemental Information. Comprehensive Quantification. of the Modified Proteome Reveals Oxidative. Heart Damage in Mitochondrial Heteroplasmy

A computational framework for discovery of glycoproteomic biomarkers

Proteomics of body liquids as a source for potential methods for medical diagnostics Prof. Dr. Evgeny Nikolaev

Lecture 3. Tandem MS & Protein Sequencing

Nature Structural & Molecular Biology: doi: /nsmb.3218

for the Identification of Phosphorylated Peptides

Supporting Information Parsimonious Charge Deconvolution for Native Mass Spectrometry

Biological Mass spectrometry in Protein Chemistry

Protein sequence mapping is commonly used to

Profiling of Histone Post-translational Modifications in Mouse Brain with High Resolution Top Down Mass Spectrometry

Supplemental Information. LipiDex: An Integrated Software Package. for High-Confidence Lipid Identification

Supplementary Figure 1 (previous page). EM analysis of full-length GCGR. (a) Exemplary tilt pair images of the GCGR mab23 complex acquired for Random

2. Ionization Sources 3. Mass Analyzers 4. Tandem Mass Spectrometry

Protein Identification and Phosphorylation Site Determination by de novo sequencing using PepFrag TM MALDI-Sequencing kit

APPLICATION NOTE. Highly reproducible and Comprehensive Proteome Profiling of Formalin-Fixed Paraffin-Embedded (FFPE) Tissues Slices

Introduction to Peptide Sequencing

Learning Objectives. Overview of topics to be discussed 10/25/2013 HIGH RESOLUTION MASS SPECTROMETRY (HRMS) IN DISCOVERY PROTEOMICS

Supplementary Materials for

Extended Mass Range Triple Quadrupole for Routine Analysis of High Mass-to-charge Peptide Ions

Multiplex Protein Quantitation using itraq Reagents in a Gel-Based Workflow

Evaluation of Parameters for Confident Phosphorylation Site Localization Using an Orbitrap Fusion Tribrid Mass Spectrometer

Digitizing the Proteomes From Big Tissue Biobanks

Introduction to Proteomics 1.0

[ APPLICATION NOTE ] High Sensitivity Intact Monoclonal Antibody (mab) HRMS Quantification APPLICATION BENEFITS INTRODUCTION WATERS SOLUTIONS KEYWORDS

New Developments in LC-IMS-MS Proteomic Measurements and Informatic Analyses

SUPPLEMENTARY INFORMATION

MSnID Package for Handling MS/MS Identifications

Universal sample preparation method for proteome analysis

User Guide. Protein Clpper. Statistical scoring of protease cleavage sites. 1. Introduction Protein Clpper Analysis Procedure...

SUNY UPSTATE MEDICAL UNIVERSITY PROTEOMICS CORE

Selective phosphatidylcholine double bond. fragmentation and localization using. Paternó-Büchi reactions and ultraviolet.

One Gene, Many Proteins. Applications of Mass Spectrometry to Proteomics. Why Proteomics? Raghothama Chaerkady, Ph.D.

Mass spectrometry based proteomics

Biomolecular Mass Spectrometry

MS/MS Scan Modes. Eötvös University, Budapest April 16, MS/MS Scan Modes. Árpád Somogyi. Product Ion Scan Select. Scan. Precursor Ion Scan Scan

Mass Spectrometry and Proteomics - Lecture 4 - Matthias Trost Newcastle University

Proteins: Proteomics & Protein-Protein Interactions Part I

Top 10 Tips for Successful Searching ASMS 2003

Edman Sample Preparation Part 2. Bill Henzel Genentech Inc.

Shotgun metaproteomics of the human distal gut microbiota. Present by Lei Chen

MSSimulator. Simulation of Mass Spectrometry Data. Chris Bielow, Stephan Aiche, Sandro Andreotti, Knut Reinert FU Berlin, Germany

4th Multidimensional Chromatography Workshop Toronto (January, 2013) Herman C. Lam, Ph.D. Calibration & Validation Group

Sequence Coverage (%) Profilin-1 P UD 2

Development of a Human Cell-Free Expression System to Generate Stable-Isotope-Labeled Protein Standards for Quantitative Mass Spectrometry

Christine Vogel 1, Edward M. Marcotte 1 *

Quantification by Mass Spectrometry

Double charge of 33kD peak A1 A2 B1 B2 M2+ M/z. ABRF Proteomics Research Group - Qualitative Proteomics Study Identifier Number 14146

M2Lite: an Open-Source, Light-Weight, Pluggable and Fast Proteome Discoverer MSF to mzidentml Tool

The peptide samples of C. elegans (15 mg) and S. cerevisiae (20 mg) were prepared as

Linear Growth of Glycomics

MS1 and MS2 crosstalk in label free quantitation of mass spectrometry data independent acquisitions

Data Independent MALDI Imaging HDMS E for Visualization and Identification of Lipids Directly from a Single Tissue Section

Advances in Hybrid Mass Spectrometry

Mass Spectrometry. Mass spectrometer MALDI-TOF ESI/MS/MS. Basic components. Ionization source Mass analyzer Detector

MS/MS Library Creation of Q-TOF LC/MS Data for MassHunter PCDL Manager

A comprehensive, open-source platform for mass spectrometry based glycoproteomics. data analysis. Sciences, Buffalo, NY, USA

Metabolite identification in metabolomics: Metlin Database and interpretation of MSMS spectra

LC-MS. Pre-processing (xcms) W4M Core Team. 29/05/2017 v 1.0.0

Go beyond. to realities unexplored. Comprehensive workflows, integrated solutions

Comparison of mass spectrometers performances

More structural information with MS n

Department of Chemistry, Université de Montréal, C.P. 6128, Succursale centre-ville, Montréal, Québec, H3C 3J7, Canada.

Target Analyses in Parallel Reaction Monitoring Mode (PRM)

Faster Mass Spec: Same-Day Sample Prep Now a Reality. Michael M. Rosenblatt, Ph.D.

Bruker Daltonics. autoflex III smartbeam. The Standard in MALDI-TOF Performance MALDI-TOF/TOF. think forward

Characterization of Protein Impurities by Peptide Mapping with UPLC/MS E

Metabolite identification in metabolomics: Database and interpretation of MSMS spectra

NON TARGETED SEARCHING FOR FOOD

Symposium on Protein Fractionation

Multi-omics data integration colon cancer using proteogenomics approach

Application Note # ET-17 / MT-99 Characterization of the N-glycosylation Pattern of Antibodies by ESI - and MALDI mass spectrometry

Conflict of Interest Statement

Supplementary Figure S1. Appearacne of new acetyl groups in acetylated lysines using 2,3-13 C 6 pyruvate as a tracer instead of labeled glucose.

Workflows in SIEVE and LipidSearch

Transcription:

Extending Proteome Coverage by Combining MS/MS Methods and a Modified Bioinformatics Platform adapted for Database Searching of Positive and Negative Polarity 193 nm Ultraviolet Photodissociation Mass Spectra Sylvester Greer, 1 Marshall Bern, 2 Christopher Becker, 2 Jennifer S. Brodbelt* 1 1 Department of Chemistry, University of Texas at Austin Austin, TX 78712 2 Protein Metrics, Inc., San Carlos, CA 94070 Supplemental Material Detailed description of database search parameters and re-analysis of benchmark NUVPD datasets. Figure S1. MS/MS spectra generated by A) HCD (35 NCE) and B) UVPD (2 pulses, 2.5 mj per pulse) for peptide HNGAPAIDGIDDTIISDDTAR (3+) from a tryptic digest of proteins extracted from human hepatocytes Figure S2. Negative polarity MS/MS spectra generated by A) HCD (35 NCE) and B) UVPD (2 pulses, 2.5 mj per pulse) for peptide ELEQVCNPIISGLYQGAGGPGPGGFGAQGPK (2-) from a tryptic digest of proteins extracted from human hepatocytes. Figure S3. Overlap of peptides from HCD, UVPD and NUVPD replicates from tryptic digests of proteins extracted from human hepatocytes. Figure S4. Overlap of UVPD peptides identified by Byonic and SEQUEST+Percolator from tryptic digests of proteins extracted from human hepatocytes Figure S5. A) Portions of peptides identified by NUVPD versus UVPD from tryptic digests of proteins extracted from human hepatocytes. The peptides are ordered based on isoelectric point. B) Distribution of isoelectric point of peptides identified by UVPD and NUVPD. Figure S6. A-J) Annotated UVPD mass spectra for top ten peptides found among the peptides identified by Byonic but not SEQUEST + Percolator. Figure S7. ROC plot for NUVPD data analyzed by MassMatrix (blue) and Byonic (orange) from tryptic digests of proteins extracted from human hepatocytes Figure S8. ROC plot comparing 1D and 2D FDR results for UVPD data from tryptic digests of proteins extracted from human hepatocytes. The graphs indicate that the majority of improvement is based on the search algorithm used, not inclusion of 2D FDR. Figure S9. ROC plot comparing 1D and 2D FDR results for NUVPD data from tryptic digests of proteins extracted from human hepatocytes.the graphs indicate that the majority of the improvement is based on the search algorithm used, not inclusion of 2D FDR. S1

Figure S10. Distribution of sequence coverages obtained for proteins by HCD reported by Byonic (blue) and SEQUEST (red) from tryptic digests of proteins extracted from human hepatocytes. Figure S11. Distribution of sequence coverages obtained for proteins by UVPD reported by Byonic (blue) and SEQUEST (red) from tryptic digests of proteins extracted from human hepatocytes. Figure S12. Distribution of sequence coverages obtained for proteins by NUVPD reported by Byonic (blue) and MassMatrix (green) from tryptic digests of proteins extracted from human hepatocytes. Figure S13. Distribution of sequence coverages obtained for HCD, UVPD, and NUVPD from tryptic digests of proteins extracted from human hepatocytes Figure S14. A) HCD, B) UVPD, and C) NUVPD of phosphorylated LKDLFDYSPPLHK from a tryptic digest of proteins extracted from human hepatocytes, showing varying degrees of phosphate retention. Supplemental tables are included in Excel format that summarize all of the results for each MS/MS method. Table S1. Byonic HCD. List of peptides and proteins identified by Byonic from triplicate HCD data Table S2. Sequest HCD. List of peptides and proteins identified by SEQUEST from triplicate HCD data Table S3. Sequest UVPD. List of peptides and proteins identified by SEQUEST from triplicate UVPD data Table S4. Byonic UVPD. List of peptides and proteins identified by Byonic from triplicate UVPDD data Table S5. Byonic NUVPD. List of peptides and proteins identified by Byonic from triplicate NUVPD data Table S6. MassMatrix NUVPD. List of peptides and proteins identified by MassMatrix from triplicate NUVPD data S2

Byonic Parameters The following parameters were used for searching the LC-MS RAW files in Byonic: add decoys was selected. Cleavage Sites were set to RK and cleavage side was C-terminal, Digestion Specificity was set to Fully Specific and missed cleavages were set to 2. A precursor mass tolerance of 7 ppm was used and fragmentation type: UVPD/HCD/NUVPD were selected where appropriate. A 15 ppm fragment ion tolerance was used. The following modifications were searched for: Carbamidomethy (+57.021464) fixed at cysteine, variable deamidation (+0.984016) of asparagine (common), variable pyroglutamic acid (-17.026549) of glutamine (rare), variable pyroglutamic acid of aspartic acid (-18.010565) (rare), variable acetylation (+42.010565) at protein N-termini (rare). The maximum number of precursors per scan was set to 2, and FDR was set to 1% FDR. Unlike SEQUEST and MassMatrix, Byonic uses a protein aware FDR. The above search parameters were modified to include variable phosphorylation (+79.966331) of serine and threonine (rare), when searching for phosphopeptides Proteome Discoverer (SEQUEST) Parameters The following parameters were used for searching LC-MS RAW files in SEQUEST. Proteome Discoverer version 1.4.1.14 was used. Database searching was performed using the SEQUEST HT. Tryptic enzyme specificity was selected allowing up to two missed cleavages. A maximum delta Cn was set to 0.05. A precursor mass tolerance of 7 ppm and fragment mass tolerance of 0.02 Da was used. The following dynamic modifications were used: acetylation of N-termini, deamidation (+0.984) of asparagine, pyroglutamic acid (-17.027 Da) of glutamine, pyroglutamic acid (-18.011 Da) of aspartic acid. Carbamidomethylation (+57.021 Da) of cysteine was treated as a static modification. Result filtering was performed using Percolator with the following parameters: Max Delta Cn: 0.05, target FDR: 0.01 based on q-value. The above search was modified to include the ptmrs node for improved phosphopeptide searching. Phosphorylation (+79.966 Da) was added as a variable modification on S and T. MassMatrix Parameters The following parameters were used for searching NUVPD in MassMatrix. Trypsin was selected as the digestion method, and the fragmentation mode was set to UVPD. The following dynamic modifications were selected: acetylation of N-termini, deamidation of asparagine, pyroglutamic acid at glutamic acid, pyroglutamic acid (-17.027 Da) of glutamine. Iodoacetamide derivatization (carbamidomethyl) of cysteine was set as a fixed modification. The maximum missed cleavage was set to 2, the precursor mass tolerance was set to 7 ppm, and the default fragment mass tolerance of 0.05 Da was used. Minimum score of output was set to 2, minimum pp value and pp2 value was set to 4.3. The minimum pp tag was set to 4.0, and the maximum number of PTMs was set to 4; score of output was set to 2, minimum pp value and pp2 value was set to 4.3. The minimum pp tag was set to 4.0, and the maximum number of PTMs was set to 4. Re-analysis of benchmark NUVPD data Byonic NUVPD is one of very few algorithms adapted for database searches of LC-MS scale negative polarity datasets (MassMatrix and OMSSA are the only others familiar to the authors). The greatest gains in protein and peptide identifications were made by Byonic for the NUVPD data. In an effort to normalize instrumental variability, the original NUVPD benchmark dataset (citation given below) was re-analyzed using the newly customized Byonic NUVPD algorithm. This original NUVPD results reported in the reference shown below were obtained using a S3

ThermoFisher Scientific Orbitrap Elite mass spectrometer where UVPD is performed in the HCD cell, and these results were re-processed using the Byonic algorithm. Madsen et al. identified 659 proteins and 2350 peptides using NUVPD of a tryptic digest of HeLa cell lysate in the 2013 study. Re-analysis of the triplicate data using the Byonic NUVPD algorithm resulted in identification of 1114 proteins and 4417 peptides respectively, a 69% improvement in protein identifications and an 88% improvement in peptide identifications, thus demonstrating the enhancement in performance derived from the improvement of the Byonic algorithm. Madsen, J. A.; Xu, H.; Robinson, M. R.; Horton, A. P.; Shaw, J. B.; Giles, D. K.; Kaoud, T. S.; Dalby, K. N.; Trent, M. S.; Brodbelt, J. S. Mol. Cell. Proteomics 2013, 12 (9), 2604 2614. S4

Figure S1. MS/MS spectra generated by A) HCD (35 NCE) and B) UVPD (2 pulses, 2.5 mj per pulse) for peptide HNGAPAIDGIDDTIISDDTAR (3+) from a tryptic digest of proteins extracted from human hepatocytes. S5

Figure S2. Negative polarity MS/MS spectra generated by A) HCD (35 NCE) and B) UVPD (2 pulses, 2.5 mj per pulse) for peptide ELEQVCNPIISGLYQGAGGPGPGGFGAQGPK (2-) from a tryptic digest of proteins extracted from human hepatocytes. S6

Figure S3. Overlap of peptides from HCD, UVPD and NUVPD replicates from tryptic digests of proteins extracted from human hepatocytes. S7

Figure S4. Overlap of UVPD peptides identified by Byonic and SEQUEST+Percolator from tryptic digests of proteins extracted from human hepatocytes. S8

Figure S5. A) Portions of peptides identified by NUVPD versus UVPD from tryptic digests of proteins extracted from human hepatocytes. The peptides are ordered based on isoelectric point. B) Distribution of isoelectric point of peptides identified by UVPD and NUVPD. S9

Figure S6. A-J) Annotated UVPD mass spectra for top ten peptides found among the peptides identified by Byonic but not SEQUEST + Percolator. A) UVPD (2+) B) UVPD (2+) S10

C) UVPD (3+) D) UVPD (3+) S11

E) UVPD (2+). F) UVPD (3+) S12

G) UVPD (2+) H) UVPD (3+) S13

I) UVPD (3+) J) UVPD (3+) S14

Figure S7. ROC plot for NUVPD data analyzed by MassMatrix (blue) and Byonic (orange) from tryptic digests of proteins extracted from human hepatocytes. S15

Figure S8. ROC plot comparing 1D and 2D FDR results for UVPD data from tryptic digests of proteins extracted from human hepatocytes. The graphs indicate that the majority of improvement is based on the search algorithm used, not inclusion of 2D FDR. S16

Figure S9. ROC plot comparing 1D and 2D FDR results for NUVPD data from tryptic digests of proteins extracted from human hepatocytes.the graphs indicate that the majority of the improvement is based on the search algorithm used, not inclusion of 2D FDR. S17

Fig S10. Distribution of sequence coverages obtained for proteins by HCD reported by Byonic (blue) and SEQUEST (red) from tryptic digests of proteins extracted from human hepatocytes.. S18

Fig S11. Distribution of sequence coverages obtained for proteins by UVPD reported by Byonic (blue) and SEQUEST (red) from tryptic digests of proteins extracted from human hepatocytes. S19

Fig S12. Distribution of sequence coverages obtained for proteins by NUVPD reported by Byonic (blue) and MassMatrix (green) from tryptic digests of proteins extracted from human hepatocytes. S20

Fig S13. Distribution of sequence coverages obtained for HCD, UVPD, and NUVPD from tryptic digests of proteins extracted from human hepatocytes. S21

Fig S14. A) HCD, B) UVPD, and C) NUVPD of phosphorylated LKDLFDYSPPLHK from a tryptic digest of proteins extracted from human hepatocytes, showing varying degrees of phosphate retention. S22